WO2000052135A2 - Engineering intracellular sialylation pathways - Google Patents

Engineering intracellular sialylation pathways Download PDF

Info

Publication number
WO2000052135A2
WO2000052135A2 PCT/US2000/005313 US0005313W WO0052135A2 WO 2000052135 A2 WO2000052135 A2 WO 2000052135A2 US 0005313 W US0005313 W US 0005313W WO 0052135 A2 WO0052135 A2 WO 0052135A2
Authority
WO
WIPO (PCT)
Prior art keywords
leu
lys
glu
gly
val
Prior art date
Application number
PCT/US2000/005313
Other languages
French (fr)
Other versions
WO2000052135A3 (en
WO2000052135A9 (en
Inventor
Michael J. Betenbaugh
Shawn Lawrence
Yuan C. Lee
Don Jarvis
Timothy A. Coleman
Original Assignee
Human Genome Sciences, Inc.
Johns Hopkins University
University Of Wyoming
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Human Genome Sciences, Inc., Johns Hopkins University, University Of Wyoming filed Critical Human Genome Sciences, Inc.
Priority to CA2363297A priority Critical patent/CA2363297C/en
Priority to EP00913684A priority patent/EP1399538A2/en
Priority to JP2000602747A priority patent/JP2003524395A/en
Priority to AU35083/00A priority patent/AU3508300A/en
Publication of WO2000052135A2 publication Critical patent/WO2000052135A2/en
Publication of WO2000052135A9 publication Critical patent/WO2000052135A9/en
Publication of WO2000052135A3 publication Critical patent/WO2000052135A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y302/00Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
    • C12Y302/01Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
    • C12Y302/01052Beta-N-acetylhexosaminidase (3.2.1.52)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • C12N9/1081Glycosyltransferases (2.4) transferring other glycosyl groups (2.4.99)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • C12P21/005Glycopeptides, glycoproteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2799/00Uses of viruses
    • C12N2799/02Uses of viruses as vector
    • C12N2799/021Uses of viruses as vector for the expression of a heterologous nucleic acid
    • C12N2799/026Uses of viruses as vector for the expression of a heterologous nucleic acid where the vector is derived from a baculovirus

Definitions

  • the invention relates to methods and compositions for expressing sialylated glycoproteins in heterologous expression systems, particularly insect cells.
  • heterologous proteins are generally identical at the amino acid level, their post-translationally attached carbohydrate moieties often differ from the carbohydrate moieties found on proteins expressed in their natural host species.
  • carbohydrate processing is specific and limiting in a wide variety of organisms including insect, yeast, mammalian, and plant cells.
  • the baculovirus expression vector has promoted the use of insect cells as hosts for the production of heterologous proteins (Luckow et al. (1993) Curr. Opin.
  • the carbohydrate composition of an attached oligosaccharide, especially sialic acid can affect a glycoprotein' s solubility, structural stability, resistance to protease degradation, biological activity, and in vivo circulation (Goochee et al. (1991) Bio/technology 9:1347-1355, Cumming et al. (1991) Glycobiology 1 :115-130, Opdenakker et al. (1993) FASEB J. 7:1330, Rademacher et ⁇ /. (1988) Ann. Rev. Biochem., Lis et al. (1993) Ewr. J. Biochem. 218:1-27).
  • the terminal residues of a carbohydrate are particularly important for therapeutic proteins since the final sugar moiety often controls its in vivo circulatory half-life (Cumming et al. (1991)
  • Glycobiology 1 :115-130 Glycobiology 1 :115-130. Glycoproteins with oligosaccharides terminating in sialic acid typically remain in circulation longer due to the presence of receptors in hepatocytes and macrophages that bind and rapidly remove structures terminating in mannose (Man), N-acetylglucosamine (GlcNAc), and galactose (Gal), from the bloodstream (Ashwell et al. (1974) Giochem. Soe. Symp. 40:117-124, Goochee et al. (1991) Bio/technology 9:1347-1355, Opdenakker et al. (1993) FASEB J. 7:1330).
  • Man mannose
  • GlcNAc N-acetylglucosamine
  • Gal galactose
  • sialic acid residues are the residues most commonly found on the termini of glycoproteins produced by insect cells.
  • the presence of sialic acid can also be important to the structure and function of a glycoprotein since sialic acid is one of the few sugars that is charged at physiological pH.
  • the sialic acid residue is often involved in biological recognition events such as protein targeting, viral infection, cell adhesion, tissue targeting, and tissue organization (Brandley et al. (1986) J. of Leukocyte bio. 40:97-111, Varki et al. (1997) FASEB 11 :248-255, Goochee et al. (1991) Bio/technology 9:1347-1355, Lopez et al. (1997) Glycobiology 7:635-651, Opdenakker et al. (1993) FASEB J. 7:1330).
  • composition of the attached oligosaccharide for a secreted or membrane- bound glycoprotein is dictated by the structure of the protein and by the post- translational processing events that occur in the endoplasmic reticulum and Golgi apparatus of the host cell. Since the secretory processing machinery in mammalian cells differs from that in insect cells, glycoproteins with very different carbohydrate structures are produced by these two host cells (Jarvis et al. (1995) Virology 212:500- 511, Maru et ⁇ /. (1996) J. Biol. Chem. 271 :16294-16299, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114).
  • N-glycosylation is highly significant to glycoprotein structure and function.
  • N-glycosylation begins in the endoplasmic reticulum (ER) with the addition of the oligosaccharide, Glc 3 Man 9 GlcNAc onto the asparagine (Asn) residue in the consensus sequence Asn-X-Ser/Thr (Moremen, et al. (1994) Glycobiology 4:113-125, Varki et al. (1993) Glycobiology 3(2):97-130, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114).
  • glycoprotein passes through the ER and Golgi apparatus, enzymes trim and add different sugars to this N-linked glycan.
  • carbohydrate modification steps can differ in mammalian and insect hosts.
  • the initial trimming steps are followed by the enzyme-catalyzed addition of sugars including N-acetylglucosamine (GlcNAc), galactose (Gal), and sialic acid (SA) by the steps shown in Figure 2, and as described in Goochee et al. (1991) Bio/technology 9:1347-1355.
  • GlcNAc N-acetylglucosamine
  • Gal galactose
  • SA sialic acid
  • N-linked glycans attached to heterologous and homologous glycoproteins comprise either high-mannose (Man 9 . 5 GlcNAc 2 ) or truncated
  • GlcNAcTl co-expression can increase the number of recombinant glycoproteins with oligosaccharides containing GlcNAc on the Man alphail, 3) branch (Jarvis et al. (1996) Nature Biotech. 14:1288-1292, Jarvis et al. (1995) Virology 212:500-511, Hollister et al. (1998) Glycobiology 5:473-480; Wagner et al. (1996) Glycobiology 6:165-175).
  • the production of complex carbohydrates comprising sialic acid has not been observed in these studies.
  • a similar lack or limitation in donor nucleotide substrates may be observed in other eukaryotes as well.
  • the co-expression of sialyltransferase and other transferases must be accompanied by the intracellular generation of the proper donor nucleotide substrates and the proper acceptor substrates in order for the production of sialylated and other complex glycoproteins in eukaryotes.
  • sialic acid and CMP-sialic acid are not permeable to cells so these substrates can not be provided directly to the medium of the cultures (Bennett et al. (1981) J. Cell. Biol. 88:1-15).
  • Glycoproteins containing sialylated oligosaccharides would have improved in vivo circulatory half-lives that could lead to their increased utilization as vaccines and therapeutics.
  • complex sialylated glycoproteins from insect cells would be more appropriate biological mimics of native mammalian glycoproteins in molecular recognition events in which sialic acid plays a role.
  • manipulating carbohydrate processing pathways in insect and other eukaryotic cells so that the cells produce complex sialylated glycoproteins is useful for enhancing the value of heterologous expression systems and increasing the application of heterologous cell expression products as vaccines, therapeutics, and diagnostic tools; for increasing the variety of glycosylated products to be generated in heterologous hosts; and for lowering biotechnology production costs, since particular expression systems can be selected based on efficiency of production rather than the capacity to produce particular product glycoforms.
  • compositions and methods for producing glycoproteins having sialylated oligosaccharides are provided.
  • the compositions of the invention comprise enzymes involved in carbohydrate processing and production of nucleotide sugars, nucleotide sequences encoding such enzymes, and cells transformed with these nucleotide sequences.
  • the compositions of the invention are useful in methods for producing complex sialylated glycoproteins in cells of interest including, but not limited to, mammalian cells and non-mammalian cells (e.g., insect cells).
  • the sialylation process involves the post-translational addition of a donor substrate, cytidine monophosphate-sialic acid (CMP-SA) onto a specific acceptor carbohydrate (GalGlcNAcMan-R) via an enzymatic reaction catalyzed by a sialyltransferase in the Golgi apparatus. Since one or more of these three reaction components (i.e., acceptor, donor substrate, and the enzyme sialyltransferase) is limiting or absent in certain cells of interest, methods are provided to enhance the production of the limiting components.
  • CMP-SA cytidine monophosphate-sialic acid
  • GalGlcNAcMan-R specific acceptor carbohydrate
  • Polynucleotide sequences encoding the enzymes used according to the methods of the invention are known or novel bacterial invertebrate, fungal, or mammalian sequences and/or fragments or variants thereof , that are optionally identified using bioinformatics searches.
  • completion of the sialylation reaction is achieved by expressing a sialyltransferase enzyme, or a fragment or variant thereof, in the presence of acceptor and/or donor substrates.
  • the invention also provides an assay for sialylation, wherein the structures and compositions of N-linked oligosaccharides attached to a model secreted glycoprotein, (e.g., transferrin), is elucidated using multidimensional chromatography.
  • Cells of interest that have been recombinantly engineered to produce new forms of sialylated glycoproteins, higher concentrations of sialylated glycoproteins, and/or elevated concentrations of donor substrates (.g., nucleotides sugars) required for sialylation, as well as kits for expression of sialylated glycoproteins are also provided.
  • donor substrates .g., nucleotides sugars
  • Figure 1 depicts the typical differences in insect and mammalian carbohydrate structures.
  • Figure 2 depicts the enzymatic generation of a complex sialylated carbohydrate in mammalian cells.
  • Figure 3 depicts a Paucimannosidic oligosaccharide.
  • Figure 4a depicts a hybrid glycan from Estigmena acrea (Ea-4) insect cells.
  • Figure 4b depicts a complex glycan from Estigmena acrea (Ea-4) insect cells.
  • Figure 5 depicts the nucleotide sugar production pathways in mammalian and E. coli cells leading to sialylation.
  • Figure 6 depicts a chromatogram of labeled oligosaccharides separated by reverse phase High Performance Liquid Chromatography (HPLC) on an ODS-silica column. Using this technique, oligosaccharides are fractionated according to their carbohydrate structures. Panel “L” represents cell lysate fractions and panel “S” represents cell supernatant fractions.
  • HPLC High Performance Liquid Chromatography
  • Figure 7 depicts the structure of Oligosaccharide G.
  • Figure 8 depicts the glycosylation pathway in Trichoplusia ni insect cells (High FiveTM cells; Invitrogen Corp., Carlsbad, CA, USA).
  • Figure 9 depicts the chromatogram of a Galactose-transferase assay following High Performance Anion Exchange Chromatography (HP AEC), as described in the Examples and references cited therein.
  • HP AEC High Performance Anion Exchange Chromatography
  • Figure 10 depicts the chromatogram of a 2,3-Sialyltransferase assay following Reverse Phase-High Performance Liquid Chromatography (RP-HPLC), as described in the Examples.
  • RP-HPLC Reverse Phase-High Performance Liquid Chromatography
  • Figure 11 depicts the results of a Galactose-transferase (Gal-T) assay of insect cell lysates performed using a Europium (Eu +3 )-labeled Ricinus cummunis lectin (RCA 120) probe; which specifically binds Gal or GalNAc oligosaccharide structures as described in the Examples.
  • Each column represents the Gal-T activity in a given sample;
  • Column (A) represents boiled T. ni cell lysates,
  • Column (B) represents normal T. ni cell lysates,
  • Column (C) represents activity in 0.5 mU of enzyme standard,
  • Column (D) represents lysate from T.
  • FIG. 12 depicts the product of reacting UDP-Gal-6-Naph with dans- AE-GlcNAc in the presence of GalT.
  • Figure 12 depicts the reaction products resulting from incubation of UDP-Gal- 6-Naph and Dans-AE-GlcNAc in the presence of Galactose-transferase, as described in the "Experimental” section below.
  • Figure 13 depicts the distinguishing emission spectra of GalT assay reactants and products, as described in the "Experimental” section below. Irradiation of the naphthyl group in UDP-Gal-6-Naph at 260-290 nm ("ex") results in an emission peak at 320-370 nm ("em” dotted line) while irradiation of the Galactose-transferase reaction products at these same low wavelengths results in energy transfer to the dansyl group and an emission peak at 500-560 nm ("em" solid line).
  • Figure 14 depicts the oxidation reaction of sialic acid.
  • Figure 15 schematically depicts a new GlcNAc Tl assay utilizing a synthetic 6-aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTPA (Diethylenetriaminepentaacetic acid) and complexed with Eu +3 (see “Experimental” section below).
  • This substrate is incubated with insect cell lysates or positive controls containing GlcNAc Tl and UDP-GlcNAc. Chemical inhibitors are added to minimize background N-acetylglucosaminidase activity. After the reaction, an excess of Crocus lectin CVL (Misaki et al. (1997) J. Biol. Chem.
  • Figure 16 depicts a chromatogram of sialic acid levels in SF9 insect cells and CHO (Chinese hamster ovary) cells.
  • Sf-9 Free Sialic Acid Levels the known sialic acid standard elutes just prior to 10 minutes, while no corresponding sialic acid peak can be detected (above background levels) in Sf-9 cells.
  • CHO sialic acid levels the sialic acid standard elutes at approximately 9 minutes, while bound and free (released by acid hydrolysis) sialic acid peaks are observed at similar elution positions.
  • Figure 17 depicts how selective inhibition of N-acetylglucosaminidase allows for production of complex oligosaccharide structures.
  • Figure 18 depicts ethidium bromide-stained agarose gels following electrophoresis of PCR amplification products from Sf9 genomic DNA or High FiveTM (Invitrogen Corp., Carlsbad, CA, USA) cell cDNA templates using degenerate primers corresponding to three different regions conserved within N- acetylglucosaminidases.
  • Figure 19 depicts two potential specific chemical inhibitors of N- acetylglucosaminidase.
  • Figure 20 schematically depicts that the overexpression of various glycosyltransferases leads to greater production of oligosaccharide acceptor substrates.
  • Figure 21 depicts three possible N-glycan acceptor structures which include the terminal Gal (G) acceptor residue required for subsequent sialylation.
  • FIG. 22 depicts a structure of CMP-sialic acid (CMP-SA).
  • Figure 23 depicts a metabolic pathway for ManNAc (N-acetylmannosamine) from glucosamine and N-acetylglucosamine (GlcNAc).
  • ManNAc N-acetylmannosamine
  • GlcNAc N-acetylglucosamine
  • Figure 24 depicts a ManNAc (N-acetylmannosamine) to sialic acid metabolic pathway.
  • Figure 25 depicts the formation of CMP-sialic acid (CMP-S A) catalyzed by CMP-SA synthetase.
  • CMP-S A CMP-sialic acid
  • Figure 26 depicts detection of purified (P) transferrin (hTf) or transferrin from unpurified insect cell lysates (M) following separation on an SDS-PAGE gel, as described the Examples.
  • Figure 27 depicts the nucleotide sequence of human aldolase.
  • Figure 28 depicts the amino acid sequence of human aldolase encoded by the sequence shown in Figure 27.
  • Figure 29 depicts the nucleotide sequence of human CMP-SA synthetase (cytidine monophosphate-sialic acid synthetase)
  • Figure 30 depicts the amino acid sequence of human CMP-SA synthetase encoded by the sequence shown in Figure 29.
  • Figure 31 depicts the nucleotide sequence of human sialic acid synthetase
  • Figure 32 depicts the amino acid sequence of human SA-synthetase (SAS) encoded by the sequence shown in Figure 31.
  • SAS SA-synthetase
  • Figure 33 depicts the types and quantities of oligosaccharide structures found on recombinant human transferrin in the presence and absence of Gal T overexpression.
  • Figure 34 depicts bacterial and mammalian sialic acid metabolic pathways.
  • Figure 35 depicts human sialic acid synthetase (SAS) genetic information: (A) depicts an alignment of the polypeptide encoded by the human SAS polynucleotide open-reading frame; (B) shows the amino acid sequence homology between human SAS (top) and bacterial sialic acid synthetase (NeuB) (bottom).
  • SAS sialic acid synthetase
  • Figure 36 (A) depicts an autoradiogram of human sialic acid synthetase gene products following gel electrophoresis.
  • the lanes labeled “In Vitro” represent in vitro transcription and translation products of SAS cDNA (amplified via polymerase chain reaction (PCR)).
  • Lane 1 (“pA2") depicts a negative control reaction in which pA2 plasmid (without the SAS cDNA) was PCR amplified, transcribed, translated, and radiolabled.
  • Lane 2 (“pA2-SAS ") depicts a sample reaction in which pA2-SAS plasmid (containing the human SAS cDNA) was PCR amplified, transcribed, translated, and radiolabeled.
  • Lane 3 depicts radiolabeled protein standards migrating at approximately 66, 46, 30, 21.5, and 14.3 kD.
  • the lanes labeled "Pulse Label” show radioactive 35 S pulse labeling of polypeptides from insect cells infected by virions not containing or containing the human SAS cDNA.
  • Lane 4 (“A35”) depicts a negative control reaction of radiolabled polypeptides from insect cells infected with virions not containing the SAS cDNA.
  • Lane 5 (“AcSAS”) depicts a sample reaction of radiolabeled polypeptides from insect cells infected with baculovirus containing the human SAS cDNA.
  • Figure 36 (B) depicts an RNA (Northern) blot of human tissues (spleen, thymus, prostate, testis, ovary, small intestine, peripheral blood lymphocytes (PBL), colon, heart, brain, placenta, lung, liver, skeletal muscle, kidney, and pancreas) probed for sialic acid synthetase RNA transcripts. Transcript sizes (in kilobases) are indicated by comparison to the scale on the left side.
  • Figure 37 depicts chromatograms indicating the in vivo sialic acid content of various cells as monitored following DMB derivitization and reverse phase HPLC separation.
  • Figure 37 (A) depicts the sialic acid content of lysed cell lines after filtration through a 10,000 MWCO membrane. The cell lines analyzed were Sf-9
  • FIG. 37 depicts a chromatogram of the sialic acid content of lysates from various Sf-9 cells.
  • "AcSAS Infected" cell lysates were from Sf-9 cells infected with baculovirus containing the human SAS cDNA.
  • the Neu5Ac and KDN "Standards" are shown at 1,000 fmol concentrations.
  • "A35 Infected” cell lysates are from Sf-9 infected by baculovirus not containing the SAS cDNA.
  • "Uninfected” cell lysates are from normal Sf-9 cells not infected by any baculovirus.
  • Original chromatogram values have been divided by protein concentration to normalize chromatograms.
  • Figure 37 (C) depicts a chromatogram of the sialic acid content from lysates of Sf-9 grown in media supplemented by 10 mM ManNAc; cells were infected or not infected with baculovirus as shown in Figure 37 (B).
  • Figure 38 depicts chromatograms of in vitro assays for sialic acid phosphorylation activity. Assays were performed with and without alkaline phosphatase (AP) treatment.
  • Figure 38 (A) depicts chromatogram results of a Neu5 Ac-9-phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS baculovirus (containing the human SAS cDNA). KDN and Neu5Ac standards are shown at 5000 fmol.
  • Figure 38 (B) depicts chromatogram results of a KDN-9-phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS baculovirus (containing the human SAS cDNA). KDN and Neu5Ac standards are shown at 5000 fmol.
  • Figure 39 depicts a chromatogram demonstrating production of sialylated nucleotides in SF-9 insect cells following infection with CMP-SA synthetase and SA synthetase containing baculoviruses.
  • Sf-9 cells were grown in six well plates and infected with baculovirus containing CMP-SA synthase and supplemented with 10 mM ManNAc ("CMP" line), with baculovirus containing CMP-SA synthase and SA synthase plus 10 mM ManNAc supplementation (“CMP+SA” line), or with no baculovirus and no ManNAc supplementation (“SF9” line).
  • CMP mM ManNAc
  • CMP+SA baculovirus containing CMP-SA synthase and SA synthase plus 10 mM ManNAc supplementation
  • SF9 no baculovirus and no ManNAc supplementation
  • compositions and methods for producing glycoproteins with sialylated oligosaccharides are provided.
  • carbohydrate processing pathways of cell lines of interest are manipulated to produce complex sialylated glycoproteins.
  • Such sialylated glycoproteins find use as pharmaceutical compositions, vaccines, diagnostics, therapeutics, and the like.
  • Cells of interest include, but are not limited to, mammalian cells and non- mammalian cells, such as, for example, CHO, plant, yeast, bacterial, insect, and the like.
  • the methods of the invention can be practiced with any cells of interest.
  • methods for the manipulation of insect cells are described fully herein.
  • the methods may be applied to other cells of interest to construct processing pathways in any cell of interest for generating sialylated glycoproteins.
  • Oligosaccharides on proteins are commonly attached to asparagine residues found within Asn-X-Ser/Thr consensus sequences; such asparagine-linked oligosaccharides are commonly referred to as "N-linked”.
  • the sialylation of N-linked glycans occurs in the Golgi apparatus by the following enzymatic mechanism: CMP- SA + GalGlcNAcMan-R sialyltransferase SAGalGlcNAcMan-R + CMP.
  • the successful execution of this sialylation reaction depends on the presence of three elements: 1) the correct carbohydrate acceptor substrate (designated GalGlcNAcMan- R in the above reaction; where the acceptor substrate is a branched glycan, GalGlcNAcMan is comprised by at least one branch of the glycan, the Gal is a terminal Gal, and R is an N-linked glycan); 2) the proper donor nucleotide sugar, cytidine monophosphate-sialic acid (CMP-SA); and 3) a sialyltransferase enzyme.
  • Each of these reaction components is limiting or missing in insect cells (Hooker et al.
  • any oligosaccharide or monosaccharide any compound containing an oligosaccharide or monosaccharide, any compatible aglycon (for example Gal- sphingosine), any asparagine (N)-linked glycan, any serine- or threonine-linked (O- linked) glycan, and any lipid containing a monosaccharide or oligosaccharide structure can be a proper acceptor substrate and can be sialylated within the cell of interest.
  • any compatible aglycon for example Gal- sphingosine
  • any asparagine (N)-linked glycan any serine- or threonine-linked (O- linked) glycan
  • any lipid containing a monosaccharide or oligosaccharide structure can be a proper acceptor substrate and can be sialylated within the cell of interest.
  • the methods of the invention may be applied to generate sialylated glycoproteins for which the acceptor substrate is not necessarily limited to the structure GalGlcNAcMan-R, although this structure is particularly recognized as an appropriate acceptor substrate structure for production of N-linked sialylated glycoproteins.
  • the acceptor substrate can be any glycan.
  • the acceptor substrate according to the methods of the invention is a branched glycan.
  • the acceptor substrate according to the methods of the invention is a branched glycan comprising a terminal Gal in at least one branch of the glycan.
  • the acceptor substrate according to the methoids of the invention has the structure GalGlcNAcMan in at least one branch of the glycan and the Gal is a terminal Gal.
  • engineering the sialylation process into cells of interest according to the methods of the present invention requires the successful manipulation and integration of multiple interacting metabolic pathways involved in carbohydrate processing. These pathways include participation of glycosyltransferases, glycosidases, the donor nucleotide sugar (CMP- SA) synthetases, and sialic acid transferases.
  • Carbohydrate processing enzymes are enzymes involved in any of the glycosyltransfer, glycosidase, CMP- SA synthesis, and sialic acid transfer pathways.
  • Known carbohydrate engineering efforts have generally focused on the expression of transferases (Lee et al. (1989) J. Biol Chem. 264:13848-13855, Wagner et al. (1996) J. Virology 70:4103-4109, Jarvis et al. (1996) N ⁇ twre Biotech. 14:1288-1292, Hollister et al. (1998) Glycobiology 5:473-480, Smith et al. (1990) J. Biol. Chem.
  • the methods of the present invention permit manipulation of glycoprotein production in cells of interest by enhancing the production of donor nucleotide sugar substrate (CMP-SA) and optionally, by introducing and expressing sialyltransferase and/or acceptor substrates.
  • CMP-SA donor nucleotide sugar substrate
  • cells of interest any cells in which the endogenous CMP-SA levels are not sufficient for the production of a desired level of sialylated glycoprotein in that cell.
  • the cell of interest can be any eukaryotic or prokaryotic cell.
  • Cells of interest include, for example, insect cells, fungal cells, yeast cells, bacterial cells, plant cells, mammalian cells, and the like.
  • Human cells and cell lines are also included in the cells of interest and may be utilized according to the methods of the present invention to, for example, manipulate sialylated glycoproteins in human cells and/or cell lines, such as, for example, kidney, liver, and the like.
  • desired level is intended that the quantity of a biochemical comprised by the cell of interest is altered subsequent to subjecting the cell to the methods of the invention.
  • the invention comprises manipulating levels of CMP-SA and/or sialylated glycoprotein in the cell of interest.
  • manipulating levels of CMP-SA and sialylated glycoprotein comprise increasing the levels to above endogenous levels. It is recognized that the increase can be from a non-detectable level to any detectable level; or the increase can be from a detected endogenous level to a higher level.
  • production of the acceptor substrate is achieved by optionally screening a variety of cell lines for desirable processing enzymes, suppressing unfavorable cleavage reactions that generate truncated carbohydrates, and/or by enhancing expression of desired glycosyltransferase enzymes such as galactose transferase.
  • desired glycosyltransferase enzymes such as galactose transferase.
  • SA may be achieved by adding key precursors such as N-acetylmannosamine (ManNAc), N-acetylglucosamine (GlcNAc) and glucosamine to cell growth media, by enhancing expression of limiting enzymes in CMP-SA production pathway in the cells, or any combination thereof.
  • ManNAc N-acetylmannosamine
  • GlcNAc N-acetylglucosamine
  • glucosamine glucosamine
  • enhancing expression is intended to mean that the translated product of a nucleic acid encoding a desired protein is higher than the endogenous level of that protein in the host cell in which the nucleic acid is expressed.
  • the biological activity of a desired carbohydrate processing enzyme is increased by enhancing expression of the enzyme.
  • the invention encompasses reducing the endogenous expression of the enzyme protein, for example, by using antisense and/or ribozyme nucleic acid sequences corresponding to the amino acid sequences of the enzyme; gene knock-out mutagenesis; and/or by inhibiting the activity of the enzyme protein, for example, by using chemical inhibitors.
  • endogenous is intended to mean the type and/or quantity of a biological function or a biochemical composition that is present in a naturally occurring or recombinant cell prior to manipulation of that cell according to the methods of the invention.
  • heterologous is intended to mean the type and/or quantity of a biological function or a biochemical composition that is not present in a naturally occurring or recombinant cell prior to manipulation of that cell by the methods of the invention.
  • a heterologous polypeptide or protein is meant as a polypeptide or protein expressed (i.e. synthesized) in a cell species of interest that is different from the cell species in which the polypeptide or protein is normally expressed (i.e. expressed in nature).
  • the cells of interest are manipulated (using techniques described herein or otherwise known in the art) to contain this substrate.
  • insect cells which principally produce truncated carbohydrates terminating in Man or GlcNAc, such cells may routinely be manipulated to produce a significant fraction of complex oligosaccharides terminating in Gal.
  • Three non limiting, non-exclusive approaches that may be routinely applied to produce a significant fraction of complex oligosaccharides terminating in Gal include: (1) developing screening assays to analyze a selection of insect cell lines for the presence of particular carbohydrate processing enzymes; (2) elevating production of Gal- terminated oligosaccharides by expressing specific enzymes relevant to carbohydrate processing pathways; and (3) suppressing carbohydrate processing pathways that produce truncated N-linked glycans which cannot serve as acceptors in downstream glycosyltransferase reactions.
  • cell lines of interest are initially, and optionally, screened to identify cell lines with the desired endogenous carbohydrate production for subsequent metabolic manipulations. More particularly, the screening process includes characterizing cell lines for glycosyl transferase activity using techniques described herein or otherwise known in the art. Furthermore, it is recognized that any screened cell line could generate some paucimannosidic carbohydrates. Accordingly, the screening process also includes using techniques described herein or otherwise known in the art to characterize cell lines for particular glycosidase activity leading to production of paucimannosidic structures.
  • the invention for the production of the acceptor substrates, encompasses utilizing methods described herein or otherwise known in the art to enhance the expression of one or more transferases.
  • Such methods include, but are not limited to, methods that enhance expression of Gal T, GlcNAc -Tl and -TII or any combination thereof; for example, as described in International patent application publication number WO 98/06835 and U.S. Patent No. 5,047,335.
  • concentrations of acceptor substrates are increased by using methods described herein or otherwise known in the art to suppress the activity of one or more endogenous glycosidases.
  • an endogenous glycosidase the activity of which may be suppressed accoreding to the methods of the invention includes, but is not limited to, the hexosaminidase, N- acetylglucosaminidase (an enzyme that degrades the substrate required for oligosaccharide elongation).
  • the invention encompasses enhancing metabolic pathways that produce the desired acceptor carbohydrates and/or suppressing those pathways that produce truncated acceptors.
  • the cell lines of interest produce different N-glycan structures. Thus, such cells can routinely be screened using techniques described herein or otherwise known in the art to determine the presence of carbohydrate processing enzymes of interest.
  • insect cells for example, different insect cell lines produce very different N-glycan structures (Jarvis et al. (1995) Virology 212:500-511, Hsu et al. (1997) J. Biol. Chem. 272:9062-9070, Nishimura et al. (1996) Bioorg. Med. Chem. 4:91-96).
  • only a few cell lines have been characterized, in part due to the lack of efficient screening assays.
  • the present invention provides methods implementing fluorescence energy transfer and Europium fluorescence assays to screen a selection of different cells of interest, such as, for example, insect cell lines for the presence of critical carbohydrate processing enzymes.
  • Analytical bioassays described herein or otherwise known in the art are also provided according to the methods of the present invention to detect the presence of favorable carbohydrate processing enzymes, including, but not limited to, galactosyl transferase (Gal T), GlcNAc transferase I (GlcNAc T I), and sialyltransferase; and to detect undesirable enzymes including, but not limited to, N-acetylglucosaminidase.
  • the cells of interest are insect cells
  • many of these lines can routinely be infected by the baculovirus, Autographa californica nuclear polyhedrosis virus (A ⁇ NPV), and used for the production of heterologous proteins.
  • a ⁇ NPV Autographa californica nuclear polyhedrosis virus
  • Only a few cell lines are routinely used for recombinant protein production using techniques described herein or otherwise known in the art.
  • These cell lines will be immediately apparent by one skilled in the art. It is recognized that any cell line can be screened for specific carbohydrate processing enzymes, and manipulated for the purposes of the present invention.
  • Such cell lines include, but are not limited to, insect cell lines, including but not limited to, Spodoptera frugiperda (e.g. Sf-9 or Sf-21 cells), Trichoplusia ni (T. ni), and Estigmene acrea (Ea4).
  • Spodoptera frugiperda lines Sf-9 or Sf-21 are the most widely used cell lines and a significant amount information is known about the oligosaccharide processing in these cells.
  • Trichoplusia ni e.g.
  • Drosophila Schneider S2 cell lines represent another insect cell line used for the production of heterologous proteins. Though these cells cannot be infected by the AcNPV expression vector, they are used for production of heterologous proteins via an alternative technology known in the art. These cell lines represent other insect cell line candidates whose glycosylation processing characteristics may be modified to include sialylation. In insect cells, paucimannosidic structures are produced by a membrane-bound N-acetylglucosaminidase, which removes terminal GlcNAc residues from the alpha(l,3) arm of the trimannosyl core (Altmann et al. (1995) J. Biol. Chem. 270:17344-17349).
  • trimannosyl core structure lacks the proper termini required for conversion of side chains to sialylated complex structures; therefore, suppression of the N-acetylglucosaminidase activity can reduce or eliminate the formation of these undesired oligosaccharide structures, as illustrated in Figure 17.
  • the invention provides vectors encoding N-acetylglucosaminidase or other glucosaminidase cDNAs in the antisense orientation and/or, vectors encoding ribozymes and/or, vectors containing sequences capable of "knocking out" the N- acetylglucosaminidase other glucosaminidase genes via homologous recombination.
  • Expression plasmids described herein or otherwise known in the art are constructed using techniques known in the art to produce stably-transformed insect cells that constitutively express the antisense construct and/or ribozyme construct to suppress translation of N-acetylglucosaminidase other glucosaminidases or alternatively, to use homologous recombination techniques known in the art are to "knock-out" the N- acetylglucosaminidase other glucosaminidase genes.
  • Particular sequences to be used in the antisense and/or ribozyme construction are described herein, for example, in Example 4. Techniques described herein or otherwise known in the art may be routinely applied to analyze N-linked oligosaccharide structures and to determine if N-glycan processing is altered and of the number of paucimannosidic structures in these cells is reduced.
  • Antisense technology can be used to control gene expression through antisense DNA or RNA or through triple-helix formation. Antisense techniques are discussed, for example, in Okano, j. Neurochem. 56: 560 (1991); "Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Antisense technology can be used to control gene expression through antisense DNA or RNA, or through triple-helix formation. Antisense techniques are discussed for example, in Okano, J., Neurochem. 56:560 (1991);
  • Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Triple helix formation is discussed in, for instance Lee et al., Nucleic Acids Research 6: 3073 (1979); Cooney et al., Science 241 : 456 (1988); and Dervan et al., Science 251 : 1360 (1991). The methods are based on binding of a polynucleotide to a complementary DNA or RNA.
  • the 5' coding portion of a polynucleotide that encodes the amino terminal portion of N- acetylglucosammidase and/or other glucosaminidases may be used to design antisense RNA oligonucleotides of from about 10 to 40 base pairs in length.
  • a DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription thereby preventing transcription and the production of N- acetylglucosaminidase and/or other glucosaminidases.
  • the antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into N-acetylglucosaminidase and/or other glucosaminidase polypeptides.
  • the oligonucleotides described above can also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of N- acetylglucosaminidase and/or other glucosaminidases.
  • the N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids of the invention are produced intracellulariy by transcription from an exogenous sequence.
  • a vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention.
  • RNA antisense nucleic acid
  • Such a vector would contain a sequence encoding a N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids.
  • Such a vector can remain episomal or become chromosomally integrated, as long as it can be transcribed to produce the desired antisense RNA.
  • Such vectors can be constructed by recombinant DNA technology methods standard in the art.
  • Vectors can be plasmid, viral, or others know in the art, used for replication and expression in insect, yeast, mammalian, and plant cells.
  • Expression of the sequences encoding N-acetylglucosaminidase and/or other glucosaminidases, or fragments thereof, can be by any promoter known in the art to act in insect, yeast, mammalian, and plant cells. Such promoters can be inducible or constitutive. Such promoters include, but are not limited to, the baculovirus polyhedrin promoter (Luckow et al. (1993) Curr. Opin. Biotech. 4:564-572, Luckow et al.
  • the SV40 early promoter region (Bernoist and Chambon, Nature 29:304-310 (1981), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., Cell 22:787-797 (1 80), the herpes thymidine promoter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445 (1981), the regulatory sequences of the metallothionein gene (Brinster, et al., Nature 296:39-42 (1982)), etc.
  • the antisense nucleic acids of the invention comprise sequences complementary to at least a portion of an RNA transcript of N-acetylglucosaminidase and/or other glucosaminidase genes.
  • absolute complementarity although preferred, is not required.
  • a sequence "complementary to at least a portion of an RNA,” referred to herein, means a sequence having sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case of double stranded N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids, a single strand of the duplex DNA may thus be tested, or triplex formation may be assayed.
  • the ability to hybridize will depend on both the degree of complementarity and the length of the antisense nucleic acid Generally, the larger the hybridizing nucleic acid, the more base mismatches with a N-acetylglucosaminidase and/or other glucosaminidase RNAs it may contain and still form a stable duplex (or triplex as the case may be).
  • One skilled in the art can ascertain a tolerable degree of mismatch by use of standard procedures to determine the melting point of the hybridized complex.
  • Oligonucleotides that are complementary to the 5' end of the message should work most efficiently at inhibiting translation.
  • sequences complementary to the 3' untranslated sequences of mRNAs have been shown to be effective at inhibiting translation of mRNAs as well. See generally, Wagner, R., 1994, Nature 372:333-335.
  • oligonucleotides complementary to either the 5'- or 3'- non- translated, non-coding regions of N-acetylglucosaminidase and/or other glucosaminidases could be used in an antisense approach to inhibit translation of endogenous N-acetylglucosaminidase and/or other glucosaminidase mRNAs.
  • Oligonucleotides complementary to the 5' untranslated region of the mRNA should include the complement of the AUG start codon.
  • Antisense oligonucleotides complementary to mRNA coding regions are less efficient inhibitors of translation but could be used in accordance with the invention.
  • antisense nucleic acids should be at least six nucleotides in length, and are preferably oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 nucleotides or at least 50 nucleotides.
  • the polynucleotides of the invention can be DNA or RNA or chimeric mixtures or derivatives or modified versions thereof, single-stranded or double- stranded.
  • the oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve stability of the molecule, hybridization, etc.
  • the oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 6556; Lemaitre et al., Proc. Natl. Acad. Sci. 84:648-652 (1987); PCT Publication No.
  • oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.
  • the antisense oligonucleotide may comprise at least one modified base moiety which is selected from the group including, but not limited to, 5-fluorouracil, 5- bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1 -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-man
  • the antisense oligonucleotide comprises at least one modified phosphate backbone selected from the group including, but not limited to, a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof.
  • the antisense oligonucleotide is an alpha-anomeric oligonucleotide.
  • An alpha -anomeric oligonucleotide forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual beta-units, the strands run parallel to each other (Gautier et al., Nucl. Acids Res. 15:6625-6641 (1987)).
  • the oligonucleotide is a 2-0-methylribonucleotide (Inoue et al., Nucl. Acids Res. 15:6131-6148 (1987)), or a chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215:327-330 (1997)).
  • Polynucleotides of the invention may be synthesized by standard methods known in the art, e.g. by use of an automated DNA synthesizer (such as are commercially available from Biosearch, Applied Biosystems, etc.).
  • an automated DNA synthesizer such as are commercially available from Biosearch, Applied Biosystems, etc.
  • phosphorothioate oligonucleotides may be synthesized by the method of Stein et al. (Nucl. Acids Res. 16:3209 (1988))
  • methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451 (1988)), etc.
  • antisense nucleotides complementary to the N-acetylglucosaminidase and/or other glucosaminidase coding region sequences could be used, those complementary to the transcribed untranslated region are most preferred.
  • N-acetylglucosaminidase or other glucosaminidase activity suppressors also include catalytic RNA, or a ribozyme (See, e.g., PCT International Publication WO 90/11364, published October 4, 1990; Sarver et al, Science 247:1222-1225 (1990). While ribozymes that cleave mRNA at site specific recognition sequences can be used to destroy N-acetylglucosaminidase and/or other glucosaminidase mRNAs, the use of hammerhead ribozymes is preferred.
  • Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target mRNA have the following sequence of two bases: 5'- UG-3'.
  • the construction and production of hammerhead ribozymes is well known in the art and is described more fully in Haseloff and Gerlach, Nature 334:585-591 (1988).
  • the ribozyme is engineered so that the cleavage recognition site is located near the 5' end of the N-acetylglucosaminidase and/or other glucosaminidase mRNAs; i.e., to increase efficiency and minimize the intracellular accumulation of non-functional mRNA transcripts.
  • the ribozymes of the invention can be composed of modified oligonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to cells which express N-acetylglucosaminidase and/or other glucosaminidases in vivo.
  • DNA constructs encoding the ribozyme may be introduced into the cell in the same manner as described above for the introduction of antisense encoding DNA.
  • a preferred method of delivery involves using a DNA construct "encoding" the ribozyme under the control of a strong constitutive promoter, such as, for example, pol III or pol II promoter, so that transfected cells will produce sufficient quantities of the ribozyme to destroy endogenous N-acetylglucosaminidase and/or other glucosaminidase messages and inhibit translation. Since ribozymes unlike antisense molecules, are catalytic, a lower intracellular concentration is required for efficiency.
  • Endogenous gene expression can also be reduced by inactivating or "knocking out" the N-acetylglucosaminidase and/or other glucosaminidase gene and/or its promoter using targeted homologous recombination.
  • endogenous gene expression can also be reduced by inactivating or "knocking out" the N-acetylglucosaminidase and/or other glucosaminidase gene and/or its promoter using targeted homologous recombination.
  • a mutant, non-functional polynucleotide of the invention or a completely unrelated DNA sequence (such as for example, a sialic acid synthetase) flanked by DNA homologous to the endogenous polynucleotide sequence (either the coding regions or regulatory regions of the gene) can be used, with or without a selectable marker and/or a negative selectable marker, to transfect cells that express polypeptides of the invention in vivo.
  • techniques known in the art are used to generate knockouts in cells that contain, but do not express the gene of interest. Insertion of the DNA construct, via targeted homologous recombination, results in inactivation of the targeted gene.
  • chemical inhibitors are also within the scope of the present invention, in addition to, or as an alternative to, the antisense approach, and/or the ribozyme approach, and/or the gene "knock-out” approach, as means for suppressing glucosaminidase activity in insect cell cultures.
  • Chemical inhibitors that may be used to suppress glucosaminidase activity include, but are not limited to, 2-acetamido- l,2,5-trideoxy-l,5 amino-D-glucitol can limit the N-acetylglucosaminidase activity in insect cells (Legler et al. (1991) Biochim. Biophys. Acta 1080:80-95, Wagner et al. (1996) J.
  • N- acetylglucosaminidase inhibitors may also be used according to the present invention, including, but not limited to, nagastatin (with a Ki value in the 10 "8 range) and GlcNAc-oxime (Ki in 0.45-22 mM) which are commercially, publicly, or otherwise available for the pu ⁇ oses of the present invention (Nishimura et al. (1996) Bioorg. Med. Chem. 4:91-96, Aoyagi et al. (1992) J. Antibiotics 45:1404-1408).
  • these inhibitors may be tested and compared in in vitro and/or in vivo trials using techniques described herein or otherwise known in the art. As above, these chemical inhibitors are then used in addition to, or as an alternative to, antisense suppression, ribozyme suppression, and/or gene knock-out mutagenesis, of glucosaminidase activity in insect cells.
  • Gal T activity in insect cells can be increased significantly by using techniques described described herein or otherwise known in the art to express a heterologous gene using a baculovirus construct containing nucleic acid sequences encoding Gal T or a fragment or variant thereof, or by stably transforming the cells with a gene coding for Gal T or a fragment or variant thereof.
  • N-glycan analysis indicates that lower than a desired level of the acceptor substrates are present even following glucosaminidase suppression, techniques described herein or otherwise known in the art may be applied to express glycosyltransferase enzymes as needed in insect cells to produce a larger fraction of the desired acceptor structures.
  • Figure 20 depicts that the overexpression of various glycosyltransferases leads to greater production of acceptor substrates.
  • glycosyltransferases will serve to limit generation of paucimannosidic structures by generating unacceptable glucosaminidase substrates terminating in Gal, or by competing against the glucosaminidase reaction (Wagner et al. , Glycobiology 6: 165-175 (1996)).
  • the invention comprises expression of glycosyltransferases combined with, or as an alternative to, suppression of N-acetylglucosaminidase activity in selected insect cell lines to produce desired quantities of carbohydrates containing the correct Gal (G) acceptor substrate for sialylation.
  • Figure 21 illustrates, without limitation, three examples of acceptor N-glycan structures that comprise the terminal Gal acceptor residue required for subsequent sialylation. Other desired carbohydrates structures with a branch terminating Gal are also possible and are encompassed by the invention.
  • Baculovirus expression vectors containing the coding sequence for GlcNAc - Tl and -Til, and Gal T or fragments or variants thereof, and stable transfectants overexpressing GlcNAc-TI and GlcNAc-TII, and Gal T, or fragments or variants thereof can be routinely generated using techniques known in the art, and are commercially, publicly, or otherwise available for the pu ⁇ oses of this invention. (See Jarvis et al. (1996) Nature Biotech. 14:1288-1292; Hollister et al. (1998) Glycobiology 8: 473-480; the contents of which are herein inco ⁇ orated by reference).
  • stable transfectants expressing GlcNAc-TI and GlcNAc-TII can be routinely generated using techniques known in the art, if overexpression proves desirable.
  • CMP-Sialic Acid For production of the donor substrate, CMP-SA, the invention provides methods and compositions comprising expression of limiting enzymes in the CMP- SA production pathway; in addition, or as an alternative to, the feeding of precursor substrates.
  • CMP-SA CMP-sialic acid
  • the structure of CMP-SA is shown in Figure 22.
  • CMP-SA can be enzymatically synthesized from glucose or other simple sugars, glutamine, and nucleotides in mammalian cells and E coli using the metabolic pathways shown in Figure 5, and as described in Ferwerda et al. (1983) Biochem. J. 216:87-92; Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400; Schachter et al. (1973) Metabolic Conjugation and Metabolic Hydrolysis (New York Academic Press) 2-135.
  • CMP-SA In some mammalian tissues and cell lines, the production and delivery of CMP-SA limits the sialylation capacity of these cells (Gu et al. (1997) Improvement of the inter feron-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine). This problem is likely to be amplified in insect cells since negligible sialic acid levels are detected in Trichoplusia ni insect cells as compared to levels in Chinese Hamster Ovary (CHO) mammalian cells ( Figure 16). Furthermore, negligible CMP-SA was observed in Sf-9 and ⁇ a-4 insect cells when compared to CHO cells (Hooker et al.
  • NeAc from the precursor substrate ManNAc can proceed through three alternative pathways shown in Figure 5.
  • the principal pathway for the production of SA in E coli and other bacteria utilizes the phosphoenylpyruvate (PEP) and ManNAc to produce sialic acids in the presence of sialic acid synthetase (Vann et al. (1997) Glycobiology 7:697-701).
  • a second pathway, observed in bacteria and mammals, involves the reversible conversion by aldolase (also named N-acetylneuraminate lyase) of ManNAc and pyruvate to sialic acid (Schachter et al.
  • the third pathway begins with the ATP driven phosphorylation of ManNAc, and is followed by the enzymatic conversion of phosphorylated ManNAc to a phosphorylated form of sialic acid, from which the phosphate is removed in a subsequent step (van Rinsum et al. (1983) Biochem. J. 210:21-28, Schachter et al. (1973) Metabolic Conjugation and metabolic Hydrolysis (New York Academic Press) 2-135).
  • feeding of alternative precursor substrates may be applied to eliminate or reduce the need to produce CMP-SA from simple sugars (see Example 6).
  • CMP-SA and its direct precursor, SA are not permeable to cell membranes (Bennetts et al. (1981) J Cell. Biol. 88:1-15), these substrates cannot be added to the culture medium for uptake by the cell.
  • other precursors including N-acetylmannosamine (ManNAc), glucosamine, and N- acetylglucosamine (GlcNAc) when added to the culture medium are absorbed into mammalian cells (see Example 6).
  • the substrates are then enzymatically converted to CMP-SA and inco ⁇ orated into homologous and heterologous glycoproteins (Gu et al. (1997) Improvement of the interferon-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine, Ferwerda et al. (1983) Biochem. J. 216:87-92, Kohn et al. (1962) J. Biol. Chem. 237:304-308, Bennetts et al. (1981) J. Cell Biol. 88:1-15).
  • CMP-SA CMP-sialic acid synthetase
  • This enzyme has been cloned and sequenced from E. coli and used for the in vitro production of CMP-SA, as described in Zapata et al. (1989) J. Biol. Chem. 264:14769-14774, Kittleman et al. (1995) Appl. Microbiol. Biotechnol. 44:59-67, Ichikawa et al. (1992) Anal. Biochem. 202:215-238, Shames et al. (1991) Glycobiology 1:187-191; the contents of which are herein inco ⁇ orated by reference).
  • CMP-SA the activated sugar nucleotide
  • the Golgi lumen for sialylation to proceed
  • Transport through the trans-Golgi membrane is facilitated by the CMP-SA transporter protein, which was identified by complementation cloning into sialylation deficient CHO cells ( ⁇ ckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572- 7576).
  • This mammalian gene has also been cloned and expressed in a functional form in the heterologous host, S. cerevisiae (Bernisone et al. (1997) J Biol. Chem. 272:12616-12619).
  • CMP-SA transporter genes are introduced and expressed using routine recombinant DNA techniques may also be employed according to the methods of the present invention. These techniques are optionally combined with ManNAc, GlcNAc, or glucosamine feeding strategies described above, to maximize CMP-SA production.
  • ManNAc can be produced chemically using sodium hydroxide (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400).
  • the enzymes that convert these substrates to ManNAc or fragments or variants of these enzymes can be expressed in insect cells using techniques described herein or otherwise known in the art.
  • the production of ManNAc from GlcNAc and glucosamine proceeds through the metabolic pathway shown in Figure 23.
  • approach (a) is achieved using the gene encoding a GlcNAc-2- epimerase isolated from pig kidney, or fragments or variants thereof, to directly convert GlcNAc to ManNAc (See Maru et al. (1996) J Biol. Chem. 271 :16294- 16299; the contents of which are herein inco ⁇ orated by reference).
  • sequence for a homologue of this enzyme can be routinely obtained from bioinformatics databases, and cloned into baculovirus vectors, or stably integrated into insect cells using techniques described herein or otherwise known in the art.
  • approach (b) requires insertion of the gene to convert UDP- GlcNAc to ManNAc.
  • Engineering the production of UDP-GlcNAc from glucosamine or GlcNAc is likely not required since most insect cells comprise metabolic pathways to synthesize UDP-GlcNAc; as indicated by the presence of GlcNAc-containing oligosaccharides.
  • the gene encoding a rat bifunctional enzyme coding for conversion of UDP-GlcNAc to ManNAc and ManNAc to ManNAc-6-P, or fragments or variants thereof is used to engineer the production of UDP-GlcNAc using techniques described herein or otherwise known in the art (Stasche et al. (1997) J. Biol. Chem. 272:24319-24324, the contents which are herein inco ⁇ orated by reference).
  • the segment of this enzyme responsible for conversion of UDP-GlNAc to ManNAc may be expressed independently in insect cells using techniques known in the art to produce ManNAc rather than ManNAc-6-P.
  • ManNAc Once ManNAc is generated, it is converted to SA according to the methods of the invention. There are three possible metabolic pathways for the conversion of ManNAc to SA in bacteria and mammals, as shown in Figure 24. Negligible SA levels have previously been observed in insect cells (in the absence of exogenous supplementation of ManNAc to the culture media).
  • the conversion of ManNAc and PEP to S A using sialic acid synthetase is the predominant pathway for SA production in E. coli (Vann et al. (1997) Glycobiology 7:697-701).
  • the E. coli sialic acid (SA) synthetase gene NeuB (SEQ ID NO:7 and 8) has been cloned and sequenced and is commercially, publicly, and/or otherwise available for the pu ⁇ oses of the present invention.
  • the human sialic acid synthetase gene has also been cloned (cDNA clone HA5AA37), sequenced, and deposited with the American Type Culture Collection ("ATCC”) on
  • the nucleic acid compositions encoding a SA synthetase such as, for example, an E.coli and/or human sialic acid synthetase and/or a fragment or variant thereof, may be inserted into a host expression vector or into the host genome using techniques described herein or otherwise known in the art.
  • the production of SA can also be achieved from ManNAc and pyruvate using an aldolase, such as, for example, bacterial aldolase (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400), or a human aldolase (as described herein) or fragment or variant thereof.
  • an aldolase such as, for example, bacterial aldolase (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400), or a human aldolase (as described herein) or fragment or variant thereof.
  • the human aldolase gene has been cloned (cDNA clone HDPAK85), sequenced, and deposited with the American Type Culture Collection ("ATCC”) on February 24, 2000 and was given the ATCC Deposit Number .
  • ATCC American Type Culture Collection
  • the aldolase enzyme is considered as an alternative for converting ManNAc to SA.
  • the aldolase sequences can be amplified directly from E coli and human DNA using primers and PCR amplification as described in Mahmoudian et al. (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400); the contents of which are herein inco ⁇ orated by reference) and herein, and using techniques described herein or otherwise known in the art to enhance expression of aldolase, or a fragment or variant thereof. Since the aldolase reaction is reversible, high levels of added ManNAc and pyruvate, may be used according to the methods of the invention to drive this reversible reaction in the direction of the product SA (Mahmoudian et al.
  • an exclusively eukaryotic pathway may also employed according to the methods of the invention to convert ManNAc to SA through the phosphate intermediates ManNAc-6-phosphate and SA-9-phosphate. It is recognized that the mammalian enzymes (synthetase and phosphatase) responsible for converting ManNAc to SA through phosphate intermediates can be utilized for engineering this eukaryotic pathway into insect cells.
  • the methods of the invention also encompass the use of CMP-SA synthetase to enzymatically converts SA to CMP-SA (see, e.g., the reaction shown in Figure 25).
  • insect cells such as, for example, Sf9 insect cells, have negligible endogenous CMP-SA synthetase activity.
  • Evidence of limited CMP-SA synthetase in insect cells is also demonstrated by increased SA levels found following substrate feeding and genetic manipulation without a concomitant increase in CMP-SA.
  • specific embodiments of the invention provide methods for enhancing the expression of CMP-SA synthetase, and/or fragments or variants thereof.
  • the methods of the present invention provide for enhancing expression of bacterial or human CMP-SA synthetase or fragments, or variants thereof, in cells of interest, such as, for example, in insect cells, using techniques described herein, or otherwise known in the art.
  • CMP-SA must be delivered into the Golgi apparatus in order for sialylation to occur, and this transport process depends on the presence of the CMP-SA transporter protein (Deutscher et al. (1984) Cell 39:295-299).
  • CMP-SA transporter protein Deutscher et al. (1984) Cell 39:295-299.
  • insect cell vesicles are prepared and transport of CMP-SA is measured as described in (Bernisone et al. (1997) J Biol. Chem. 272:12616-12619) and/or using techniques otherwise known in the art.
  • a transporter enzyme is cloned and expressed in insect cells using the known mammalian gene sequence (as described in Bernisone et al. (1997) J. Biol. Chem. 272:12616-12619, Eckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572- 7576; the contents of which are herein inco ⁇ orated by reference) and/or sequences otherwise known in the art. Corresponding sequences are available from bioinformatics databases for the pu ⁇ oses of this invention. Localization of the protein to the Golgi is evaluated using an antibody generated against the heterologous protein using techniques known in the art in concert with commercially available fluorescent probes that identify the Golgi apparatus.
  • transcripts for example, transcripts encoding CMP-SA pathway enzymes, glycosyl transferases, and ribozymes or anti-sense RNAs to suppress hexosaminidases
  • co-infection of cells with multiple viruses using techniques known in the art can also be used to simultaneously produce multiple recombinant transcripts.
  • plasmids that inco ⁇ orate multiple foreign genes including some under the control of the early promoter IE1 are commercially, publicly, or otherwise available for the pu ⁇ oses of the invention, and can be used to create baculovirus constructs.
  • the present invention encompasses using any of these techniques.
  • the invention also encompasses using the above mentioned types of vectors to enable expression of desired carbohydrate processing enzymes in baculovirus infected insect cells prior to production of a heterologous glycoprotein of interest under control of the very late polyhedrin promoter. In this manner, once the desired polypeptide is synthesized essential N-glycan processing enzymes can facilitate N-glycan processing once the glycoprotein of interest.
  • genes for some of the enzymes may be inco ⁇ orated directly into the insect cell genome using vectors known in the art, such as, for example, vectors similar to those described in (Jarvis et al. (1990) Bio/Technology 8:950-955, Jarvis et al. (1995) Baculovirus Expr. Protocols ed. 39:187-202).
  • Genomic integration eliminates the need to infect the cells with a large number of viral constructs.
  • constructs for genomic integration contain one or more early viral promoters, including Ac NPV IE1 and 39K, which provide constitutive expression in transfected insect cells (Jarvis et al. (1990) Bio/Technology 8:950-955).
  • a sequential transformation strategy may routinely be developed for producing stable transformants that constitutively express up to four different heterologous genes simultaneously.
  • These vectors and transformation techniques are provided for the pu ⁇ oses of this invention. In this manner, inco ⁇ oration of plasmids containing heterologous genes into the insect cell genome combined with baculovirus infection integrates the metabolic pathways leading to efficient acceptor and donor substrate production in insect cells.
  • the final step in the generation of sialylated glycoproteins or glycolipids in mammalian cells is the enzymatic transfer of sialic acid from the donor substrate, CMP-SA, onto an acceptor substrate in the Golgi apparatus; a reaction which is catalyzed by sialyltransferase.
  • the sialic acid (SA) residues occurring in N-linked glycoproteins are alpha-linked to the 3 or 6 position of the Gal GlcNAc sugars (Tsuji, S. (1996) J. Biochem. 120:1-13).
  • the SA ⁇ b ⁇ 2-3GalGlcNAc linkage is found in heterologous glycoproteins expressed by CHO and human cells and the SA alpha2- 6GalGlcNAc linkage is found in many human glycoproteins (Goochee et al. (1991) Bio/technology 9: 1347-1355).
  • the alpha2-3- and/or ⁇ /pb ⁇ 2-6-sialyltransferase genes along with a number of other sialyltransferase genes have been cloned, sequenced and expressed as active heterologous proteins as described in Lee et al. (1989) J Biol. Chem. 264:13848-13855, Ichikawa et ⁇ /. (1992) Anal.
  • the methods of the invention further comprise expression of a sialyltransferase or fragment or variant thereof, in the cells.
  • the completion of the sialylation reaction can be verified by elucidating the N-glycan structures attached to a desired glycoprotein using techniques described herein or otherwise known in the art. It is recognized that evaluation of N-glycans attachments may also suggest additional metabolic engineering strategies that can further enhance the level of sialylation in insect cells.
  • T. ni insect cell lysates failed to generate any sialylated compounds when incubated with the substrate, LacMU, and the nucleotide sugar, CMP-SA. Thus, it is concluded that these cells comprise negligible native sialyltransferase activity.
  • infection of insect cells with a baculovirus containing alpha2,3 sialyltransferase provided significant enzymatic conversion of LacMU and CMP-SA to sialylLacMU.
  • heterologous sialyltransferase can be expressed using techniques described herein or otherwise known in the art either by co-infection with a virus coding for sialyltransferase, or fragment, or variant thereof, or by using stable transfectants expressing the enzyme.
  • baculovirus vectors comprising sequences coding for alpha2,6 sialyltransferase and/or fragments or variants thereof as well as stably transformed insect cells stably expressing both gal T and sialyltransferase are commercially, or publicly available, and/or may routinely be generated using techniques described herein or otherwise known in the art.
  • sialyltransferase activity is determined using the FRET or HPLC assays described herein and/or using other assays known in the art. Localization of the sialyltransferase to the Golgi is accomplished using anti- sialyltransferase antibodies commercially, publicly, or otherwise available for the pu ⁇ ose of this invention in concert with Golgi specific marker proteins.
  • pu ⁇ oses of enhancing carbohydrate processing enzymes of the invention suppressing activity of endogenous N-acetylglucosaminidase, expressing heterologous proteins in the cells of the invention, and constructing vectors for the pu ⁇ oses of the invention; genetic engineering methods are known to those of ordinary skill in the art. For example, see Schneider, A. et al, (1998) Mol. Gen.
  • Zl 1234 and Zl 1235 for two human galactosyltransferases see also United States Patent Number 5,955,282; the contents of which are herein inco ⁇ orated by reference); and/or in Genbank accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E. coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E.
  • coli SA synthetase X17247 for human 2,6 sialyltransferase, L29553 for human 2,3 sialyltransferase, M13214 for bovine galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human GlcNAc T-II, D87969 for human CMP-SA transporter, and S95936 for human transferrin; and fragments or variants of the enzymes that display one or more of the biological activities of the enzymes (such biological activities may routinely be assayed using techniques described herein or otherwise known in the art).
  • the sequences described above are readily accessible using the provided accession number in the NCBI Entrez database, known to the person of ordinary skill in the art.
  • one aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding a biologically active fragment or variant of the polypeptide having the amino acid sequence described in GenSeq accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases; and/or in Genbank accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E.
  • coli SA synthetase XI 7247 for human 2,6 sialyltransferase, L29553 for human 2,3 sialyltransferase, Ml 3214 for bovine galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human GlcNAc T-II, D87969 for human CMP-SA transporter, and/or S95936 for human transferrin; (b) nucleotide sequences encoding an antigenic fragment of the polypeptide having the amino acid sequence described in GenSeq accession No.
  • Zl 1234 and Zl 1235 for two human galactosyltransferases see also United States Patent Number 5,955,282; the contents of which are herein inco ⁇ orated by reference); and/or in Genbank accession No. D83766 for GlcNAc -2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, X03345 for E.
  • Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
  • Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the above nucleotide sequences, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide that is complementary to any of the above nucleotide sequences.
  • This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues.
  • Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
  • the nucleic acid sequences (including fragments or variants) that may be used according to the methods of the present invention encode a polypeptide having a biological activity. Such biological activity may routinely be assayed using techniques described herein or otherwise known in the art.
  • nucleotide sequences and amino acid sequences disclosed in Figures 27-32, and fragments and variants of these sequences may also be used according to the methods of the invention.
  • specific enzyme polypeptides comprise the amino acid sequences shown in Figures 28, 30 and 32; or otherwise described herein.
  • the invention also encompasses sequence variants of the polypeptide sequences shown in Figures 28, 30 and 32.
  • one, two, three, four, five or more human polynucleotide sequences, or fragments, or variants thereof, and/or the polypeptides encoded thereby, are used according to the methods of the present invention to convert ManNAc to SA (see Example 6).
  • Such polynucleotide and polypeptide sequences include, but are not limited to, sequences corresponding to human aldolase (SEQ ID NO:l and SEQ ID NO:2), human CMP-SA synthetase (SEQ ID NO:3 and SEQ ID NO:4), and human SA synthetase (SEQ ID NO:5 and SEQ ID NO:6); see also Figures 27 - 32.
  • the methods of present invention include the use of one or more novel isolated nucleic acid molecules comprising polynucleotides encoding polypeptides important to intracellular carbohydrate processing in humans.
  • Such polynucleotide sequences include those disclosed in the figures and/or Sequence Listing and/or encoded by the human cDNA plasmids (Human CMP-Sialic Acid Synthetase, cDNA clone HWLLM34; Human Sialic Acid Synthetase, cDNA clone HA5AA37; and Human Aldolase cDNA clone HDPAK85) deposited with the American Type Culture Collection (ATCC) on February 24, 2000 and receiving accession numbers .
  • ATCC American Type Culture Collection
  • the present invention further includes the use of polypeptides encoded by these polynucleotides.
  • the present invention also provides for use of isolated nucleic acid molecules encoding fragments and variants of these polypeptides, and for the polypeptides encoded by these nucleic acids.
  • one aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding human aldolase having the amino acid sequences as shown in SEQ ID NO:2; (b) nucleotide sequences encoding a biologically active fragment of the human aldolase polypeptide having the amino acid sequence shown in SEQ ID NO:2; (c) nucleotide sequences encoding an antigenic fragment of the human aldolase polypeptide having the amino acid sequence shown in SEQ ID NO:2; (d) nucleotide sequences encoding the human aldolase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human aldolase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (
  • nucleic acids may also be used according to the methods of the present invention.
  • Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g), above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above.
  • This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues.
  • Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
  • nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding human CMP-SA synthetase having the amino acid sequences as shown in SEQ ID NO:4; (b) nucleotide sequences encoding a biologically active fragment of human CMP-SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:4; (c) nucleotide sequences encoding an antigenic fragment of the human CMP-SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:4; (d) nucleotide sequences encoding the human CMP-SA synthetase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human CMP-SA synthet
  • nucleic acids may also be used according to the methods of the present invention.
  • Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99%) identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above.
  • This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues.
  • Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
  • nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of: (a) nucleotide sequences encoding human SA synthetase having the amino acid sequences as shown in SEQ ID NO:6; (b) nucleotide sequences encoding a biologically active fragment of the human SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:6; (c) nucleotide sequences encoding an antigenic fragment of the human SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:6; (d) nucleotide sequences encoding the human SA synthetase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human SA synthetase polypeptide having the amino acid sequence encoded by the group consisting of: (a) nu
  • nucleic acids may also be used according to the methods of the present invention.
  • Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%), 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above.
  • This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues.
  • Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
  • nucleic acid having a nucleotide sequence at least, for example, 95% "identical" to a reference nucleotide sequence of the present invention it is intended that the nucleotide sequence of the nucleic acid is identical to the reference sequence except that the nucleotide sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the described polypeptide.
  • nucleic acid having a nucleotide sequence at least 95% identical to a reference nucleotide sequence up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence.
  • the query sequence may be an entire sequence, such as, for example, that shown of SEQ ID NO:l, the ORF (open reading frame), or any fragment as described herein.
  • nucleic acid molecule or polypeptide is at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention can be determined conventionally using known computer programs.
  • a preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245.) In a sequence alignment the query and subject sequences are both DNA sequences.
  • RNA sequence can be compared by converting U's to T's.
  • the result of said global sequence alignment is in percent identity.
  • the percent identity is corrected by calculating the number of bases of the query sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment.
  • This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score.
  • This corrected score is what is used for the pu ⁇ oses of the present invention. Only bases outside the 5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for the pu ⁇ oses of manually adjusting the percent identity score.
  • a 90 base subject sequence is aligned to a 100 base query sequence to determine percent identity.
  • the deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases at 5' end.
  • the 10 unpaired bases represent 10% of the sequence (number of bases at the 5' and 3' ends not matched/total number of bases in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 bases were perfectly matched the final percent identity would be 90%.
  • a 90 base subject sequence is compared with a 100 base query sequence.
  • deletions are internal deletions so that there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned with the query.
  • percent identity calculated by FASTDB is not manually corrected.
  • bases 5' and 3' of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are to made for the pu ⁇ oses of the present invention.
  • polypeptide having an amino acid sequence at least, for example, 95% "identical" to a query amino acid sequence of the present invention it is intended that the amino acid sequence of the subject polypeptide is identical to the query sequence except that the subject polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the query amino acid sequence.
  • up to 5% of the amino acid residues in the subject sequence may be inserted, deleted (indels) or substituted with another amino acid.
  • alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence.
  • whether any particular polypeptide is at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to, for example, the amino acid sequences of SEQ ID NO:2 or to the amino acid sequence encoded by the cDNA contained in a deposited clone can be determined conventionally using known computer programs.
  • a preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245(1990)).
  • the query and subject sequences are either both nucleotide sequences or both amino acid sequences.
  • the result of said global sequence alignment is in percent identity.
  • the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment.
  • This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score.
  • This final percent identity score is what is used for the pu ⁇ oses of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the pu ⁇ oses of manually adjusting the percent identity score. That is, only query residue positions outside the farthest N- and C-terminal residues of the subject sequence. For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query sequence to determine percent identity.
  • the deletion occurs at the N- terminus of the subject sequence and therefore, the FASTDB alignment does not show a matching/alignment of the first 10 residues at the N-terminus.
  • the 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched the final percent identity would be 90%).
  • a 90 residue subject sequence is compared with a 100 residue query sequence. This time the deletions are internal deletions so there are no residues at the N- or C-termini of the subject sequence which are not matched/aligned with the query.
  • the sequences are aligned for optimal comparison pu ⁇ oses (e.g., gaps can be introduced in the sequence of one protein or nucleic acid for optimal alignment with the other protein or nucleic acid).
  • the amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in one sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the other sequence, then the molecules are homologous at that position.
  • amino acid or nucleic acid "homology” is equivalent to amino acid or nucleic acid "identity”.
  • the percent homology between the two sequences is a function of the number of identical positions shared by the sequences (i.e., per cent homology equals the number of identical positions/total number of positions times 100).
  • Variants of above described sequences include a substantially homologous protein encoded by the same genetic locus in an organism, i.e., an allelic variant.
  • Variants also encompass proteins derived from other genetic loci in an organism, but having substantial homology to the proteins of Figures 27-32, or otherwise described WO 00/52135 g PCT/USOO/05313
  • Variants also include proteins substantially homologous to the protein but derived from another organism, i.e., an ortholog. Variants also include proteins that are substantially homologous to the proteins that are produced by chemical synthesis. Variants also include proteins that are substantially homologous to the proteins that are produced by recombinant methods. As used herein, two proteins (or a region of the proteins) are substantially homologous when the amino acid sequences are at least about 55-60%), typically at least about 70-75%), more typically at least about 80-85%), and most typically at least about 90-95% or more homologous.
  • a substantially homologous amino acid sequence will be encoded by a nucleic acid sequence hybridizing to the nucleic acid sequence, or portion thereof, of the sequence shown in Figures 27, 28, 31 or otherwise described herein under stringent conditions as more fully described below.
  • Orthologs, homologs, and allelic variants that are encompassed by the invention and that may be used according to the methods of the invention can be identified using methods well known in the art. These variants comprise a nucleotide sequence encoding a protein that is at least about 55%, typically at least about 70- 75%, more typically at least about 80-85%), and most typically at least about 90-95% or more homologous to the nucleotide sequence shown in Figures 27, 29, 31, or otherwise described herein, or a fragment of this sequence.
  • nucleic acid molecules can readily be identified as being able to hybridize under stringent conditions, to the nucleotide sequence shown in Figures 27, 29, 31, or complementary sequence thereto, or otherwise described herein, or a fragment of the sequence. It is understood that stringent hybridization does not indicate substantial homology where it is due to general homology, such as poly A sequences, or sequences common to all or most proteins in an organism or class of proteins.
  • the invention also encompasses polypeptides having a lower degree of identity but having sufficient similarity so as to perform one or more of the same functions performed by the enzyme polypeptides described herein. Similarity is determined by conserved amino acid substitution. Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics (see Table 1). Conservative substitutions are likely to be phenotypically silent.
  • conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr.
  • Guidance concerning which amino acid changes are likely to be phenotypically silent are found in Bowie et al, Science 247:1306-1310 (1990). TABLE 1. Conservative Amino Acid Substitutions.
  • a variant polypeptide can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these.
  • Variant polypeptides can be fully functional or can lack function in one or more activities.
  • variations can affect the function, for example, of one or more of the modules, domains, or functional subregions of the enzyme polypeptides of the invention.
  • polypeptide variants and fragments have the described activities routinely assayed via bioassays described herein or otherwise known in the art.
  • Fully functional variants typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional variants can also contain substitution of similar amino acids, which result in no change or an insignificant change in function. Alternatively, such substitutions may positively or negatively affect function to some degree.
  • Non- functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region.
  • variants can be naturally-occurring or can be made by recombinant means or chemical synthesis to provide useful and novel characteristics for the polypeptide.
  • Amino acids that are essential for function can be identified by methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al, Science 244:1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity. Sites that are critical can also be determined by structural analysis such as crystallization, nuclear magnetic resonance or photoaffmity labeling (Smith et al, J. Mol. Biol. 224:899-904 (1992); de Vos et al. Science 255:306-312 (1992)).
  • the invention further encompasses variant polynucleotides, and fragments thereof, that differ from the nucleotide sequence, such as, for example, those shown in Figures 27, 29, 31 or otherwise described herein, due to degeneracy of the genetic code and thus encode the same protein as that encoded by the nucleotide sequence shown in Figures 27, 29, 31 or otherwise described herein.
  • the invention also provides nucleic acid molecules encoding the variant polypeptides described herein.
  • polynucleotides may be naturally occurring, such as allelic variants (same locus), homologs (different locus), and orthologs (different organism), or may be constructed by recombinant DNA methods or by chemical synthesis.
  • non-naturally occurring variants may be made by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. Accordingly, as discussed above, the variants can contain nucleotide substitutions, deletions, inversions and insertions.
  • Variation can occur in either or both the coding and non-coding regions.
  • the variations can produce both conservative and non-conservative amino acid substitutions.
  • Polynucleotides” or “nucleic acids” that may be used according to the methods of the invention also include those polynucleotides capable of hybridizing, under stringent hybridization conditions, to sequences contained in SEQ ID NO:l, the complement thereof, or a cDNA within the deposited plasmids.
  • hybridizes under stringent conditions is intended to describe conditions for hybridization and washing under which nucleotide sequences encoding a receptor at least 55% homologous to each other typically remain hybridized to each other. The conditions can be such that sequences at least about 65%, at least about 70%, or at least about 75% or more homologous to each other typically remain hybridized to each other.
  • stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6.
  • One example of stringent hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45degrees C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50-65 degrees C.
  • SSC sodium chloride/sodium citrate
  • SDS sodium chloride/sodium citrate
  • nucleic acid molecules that hybridize to a polynucleotide disclosed herein under lower stringency hybridization conditions.
  • Changes in the stringency of hybridization and signal detection are primarily accomplished through the manipulation of formamide concentration (lower percentages of formamide result in lowered stringency); salt conditions, or temperature.
  • washes performed following stringent hybridization can be done at higher salt concentrations (e.g. 5X SSC).
  • blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary formulations.
  • the inclusion of specific blocking reagents may require modification of the hybridization conditions described above, due to problems with compatibility.
  • polynucleotide which hybridizes only to polyA+ sequences (such as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary stretch of T (or U) residues, would not be included in the definition of "polynucleotide,” since such a polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) stretch or the complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo-dT as a primer).
  • an isolated nucleic acid molecule that hybridizes under stringent conditions to a sequence disclosed herein, or the complement thereof, such as, for example, the sequence of Figures 27, 29, 31, corresponds to a naturally- occurring nucleic acid molecule.
  • a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein).
  • the present invention also encompasses recombinant vectors, which include the isolated nucleic acid molecules and polynucleotides that may be used according to the methods of the present invention, and to host cells containing the recombinant vectors and/or nucleic acid molecules, as well as to methods of making such vectors and host cells and for using them for production of glycosylation enzyme by recombinant techniques. Polypeptides produced by such methods are also provided.
  • the invention encompasses utilizing vectors for the maintenance (cloning vectors) or vectors for expression (expression vectors) of the desired polynucleotides encoding the carbohydrate processing of the invention, or those encoding proteins to be sialylated by the methods of the invention and/or by expression of the proteins the cells of the invention.
  • the vectors can function in prokaryotic or eukaryotic cells or in both (shuttle vectors).
  • one or more of the polynucleotide sequences used according to the methods of the invention are inserted into commercially, publicly, or otherwise available baculovirus expression vectors for enhanced expression of the corresponding enzyme.
  • one ore more of the polynucleotides used according to the methods of the invention are inserted into other viral vectors or for generation of stable insect cell lines. Techniques known in the art, such as, for example, HP AEC and HPLC techniques, may be routinely used to evaluate the enzymatic activity of these enzymes from both eukaryotic and bacterial sources to determine which source is best for generating SA in insect cells.
  • expression vectors contain cis-acting regulatory regions that are operably linked in the vector to the polynucleotide to be expressed, or other relevant polynucleotides such that transcription of the polynucleotides is allowed in a host cell.
  • the polynucleotides can be introduced into the host cell with a separate polynucleotide capable of affecting transcription.
  • the second polynucleotide may provide a trans-acting factor interacting with the cis-regulatory control region to allow transcription of the polynucleotides from the vector.
  • a transacting factor may be supplied by the host cell.
  • a trans-acting factor can be produced from the vector itself.
  • transcription of the polynucleotides can occur in a cell-free system.
  • the regulatory sequence to which the polynucleotides described herein can be operably linked include, for example, promoters for directing mRNA transcription. These promoters include, but are not limited to, baculovirus promoters including, but not limited to, 1E0, 1E1, 1E2, 39k, 35k, egt, ME53, ORF 142, PE38, p6.9, capsid, gp64 polyhedrin, plO, basic and core; and insect cell promoters including, but not limited to, Drosophila actin, metallothionine, and the like.
  • promoters include, but are not limited to, the left promoter from bacteriophage lambda, the lac, TRP, and TAC promoters from E. coli, promoters from Actinomycetes, including Nocardia, and Streptomyces. Promoters may be isolated, if they have not already been isolated, by standard promoter identification and trapping methods known in the art, see, for example, in Sambrook et al. , Molecular Cloning: A Laboratory Manual. 2nd. ed. , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989).
  • host cells can be used for simply amplifying, but not expressing, the nucleic acid.
  • host cells can also be used to produce desirable amounts of the desired polypeptide.
  • the host cell is simply used to express the protein , er se.
  • amounts of the protein could be produced that enable its purification and subsequent use, for example, in a cell free system.
  • the promoter is compatible with the host cell.
  • Host cells can be chosen from virtually any of the known host cells that are manipulated by the methods of the invention to produce the desired glycosylation patterns. These could include mammalian, bacterial, yeast, filamentous fungi, or plant cells.
  • expression vectors may also include regions that modulate transcription, such as repressor binding sites and enhancers.
  • expression vectors can also contain sequences necessary for transcription termination and, in the transcribed region a ribosome binding site for translation.
  • Other regulatory control elements for expression include initiation and termination codons as well as polyadenylation signals. The person of ordinary skill in the art would be aware of the numerous regulatory sequences that are useful in expression vectors. Such regulatory sequences are described, for example, in Sambrook et al, cited above.
  • vectors can be used to express the polynucleotide.
  • Such vectors include chromosomal, episomal, and particularly virus-derived vectors, for example, A ⁇ MNPV, OpMNPV, BmNPV, HzMNPV, and RoMNPV.
  • Vectors may also be derived from combinations of these sources such as those derived from plasmid and bacteriophage genetic elements, e.g. cosmids and phagemids. Appropriate cloning and expression vectors for prokaryotic and eukaryotic hosts are described in Sambrook et al. , Molecular Cloning: A
  • the regulatory sequence may provide constitutive expression in one or more host cells or may provide for inducible expression in one or more cell types such as by temperature, nutrient additive, or exogenous factor such as a hormone or other ligand.
  • a variety of vectors providing for constitutive and inducible expression in prokaryotic and eukaryotic hosts are well known to those of ordinary skill in the art.
  • the polynucleotides can be inserted into the vector nucleic acid using techniques known in the art. Generally, the DNA sequence that will ultimately be expressed is joined to an expression vector by cleaving the DNA sequence and the expression vector with one or more restriction enzymes and then ligating the fragments together. Procedures for restriction enzyme digestion and ligation are well known to those of ordinary skill in the art.
  • Any cell type or expression system can be used for the pu ⁇ oses of the invention including but not limited to, for example, baculovirus systems (O'Riley et al. (1992) Baculovirus Expression Vectors, W.H. Freeman and Company, New York 1992) and Drosophila-derived systems (Johansen et al. (1989) Genes Dev 3(6):882-889).
  • the invention also encompasses vectors in which the nucleic acid sequences described herein are cloned into the vector in reverse orientation, but operably linked to a regulatory sequence that permits transcription of antisense RNA.
  • an antisense transcript can be produced to all, or to a portion, of the polynucleotide sequences described herein, including both coding and non-coding regions. Expression of this antisense RNA is subject to each of the parameters described above in relation to expression of the sense RNA (regulatory sequences, constitutive or inducible expression, tissue-specific expression).
  • the recombinant host cells are prepared by introducing the vector constructs described herein into the cells by techniques readily available to the person of ordinary skill in the art. These include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, lipofection, and other techniques such as those found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). Where secretion of the polypeptide is desired, appropriate secretion signals known in the art are inco ⁇ orated into the vector using techniques known in the art. The signal sequence can be endogenous to the polypeptides or heterologous to these polypeptides.
  • the desired protein can be isolated from the host cell by techniques known in the art, such as, for example, standard disruption procedures, including freeze thaw, sonication, mechanical disruption, use of lysing agents and the like.
  • the polypeptide can then be recovered and purified by well-known purification methods including, but not limited to, ammonium sulfate precipitation, acid extraction, anion or cationic exchange chromatography, phosphocellulose chromatography, hydrophobic-interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography, and high performance liquid chromatography.
  • the invention encompasses utilizing the sequences deduced from the fragment identified in Figure 18, and described in Example 4. More particularly, in this aspect, the invention comprises utilization of the glucosaminidase nucleotide sequences which are produced by using primers, such as, for example, those primer combinations described in Example 4. These nucleotide sequences may be used in the construction and expression of anti-sense RNA, ribozymes, or homologous recombination (gene "knock-out") constructs, using methods readily available to those skilled in the art, to reduce or eliminate in vivo glucosaminidase activity.
  • Cell lines produced by the methods of the invention can be tested by expressing a model recombinant glycoprotein in such cell lines and assessing the N- glycans attached therein using techniques described herein or otherwise known in the art. The assessment can be done, for example, by 3 -dimensional HPLC techniques.
  • human transferrin is used as a model target glycoprotein, since this glycoprotein is sialylated in humans and extensive oligosaccharide structural information for the protein is available (Montreuil et al. (1997) Glycoproteins II Ed. 203-242). In this manner, cell lines with superior processing characteristics are identified.
  • Such a cell line can then be evaluated for its growth rate, product yields, and capacity to grow in suspension culture (Lindsay et al. (1992) Biotech, and Bioeng. 39:614-618, Reuveny et al. (1992) Ann. NY Acad. Sci. 665:320, Reuveny et al. (1993) Appl. Microbiol. Biotechnol. 38:619-623, Reuveny et al. (1993) Biotechnol. Bioeng. 42:235-239).
  • the invention encompasses expressing heterologous proteins in the cells of the invention and/or according to the methods of the invention for any pu ⁇ ose benefiting from such expression.
  • a pu ⁇ ose includes, but is not limited to, increasing the in vivo circulatory half life of a protein; producing a desired quantity of the protein; increasing the biological function of the protein including, but not limited to, enzyme activity, receptor activity, binding capacity, antigenicity, therapeutic property, capacity as a vaccine or a diagnostic tool, and the like.
  • Such proteins may be naturally occurring chemically synthesized or recombinant proteins.
  • proteins that benefit from the heterologous expression of the invention include, but are not limited to, transferrin, plasminogen, Na + , K + - ATPase , thyrotropin, tissue plasminogen activator, erythropoietin, interleukins, and interferons.
  • transferrin transferrin
  • plasminogen Na + , K + - ATPase
  • thyrotropin tissue plasminogen activator
  • erythropoietin interleukins
  • interferons include transferrin, transferrin, plasminogen, Na + , K + - ATPase , thyrotropin, tissue plasminogen activator, erythropoietin, interleukins, and interferons.
  • proteins that benefit from the heterologous expression of the invention are mammalian proteins.
  • mammals include but are not limited to, cats, dogs, rats, mice, cows, pigs,
  • heterologous expression of the invention not only encompasses proteins that are sialylated in their native source; but also those that are not sialylated as such, and benefit from the expression in the cells of and/or according to the methods of the invention.
  • proteins that are not sialylated in their native source can be altered by known genetic engineering methods so that the heterologous expression of the protein according to the invention will result in sialylation of the protein.
  • Such methods include, but are not limited to, the genetic engineering methods described herein.
  • altering the proteins could encompass engineering into the protein targeting signals to ensure targeting of the proteins to the ER and Golgi apparatus for sialylation, where such signals are needed.
  • the cells of the invention contain proteins, which are not sialylated prior to manipulation of the cells according to the methods of the invention, but are sialylated subsequent to the manipulation.
  • the invention also encompasses proteins that have amino acid sequences that are endogenous to the cells of the invention, but are sialylated as a result manipulation of the cells according to the methods of the invention. It is recognized that the analysis of the N-glycans produced according to the methods of the invention may suggest additional strategies to further enhance the sialylation of glycoproteins in insect cells. If the production of Gal containing carbohydrate acceptor structures is low relative to those containing GlcNAc, then the levels of Gal transferase expression are increased by integrating multiple copies of this gene into the insect cell genome or by expressing Gal T under a stronger promoter using techniques described herein or otherwise known in the art.
  • substrate feeding strategies are used to enhance the levels of UDP-Gal for this carbohydrate processing reaction.
  • sialyltransferase or CMP-SA production is enhanced.
  • Examination of sialyltransferase activity using techniques described herein or otherwise known in the art, such as, for example, FRET or HPLC and CMP-SA levels using HP AEC, is used to determine which step is the metabolic limiting step to sialylation.
  • Analytical bioassays are implemented to evaluate enzymatic activities in the N-glycosylation pathway of insect cells.
  • bioassays in which multiple samples can be analyzed simultaneously are advantageous. Consequently, bioassays based on fluorescence energy transfer (FRET) and time-resolved fluorometry of europium (Eu) are designed to screen native and recombinant insect cell lines for carbohydrate processing enzymes in a format that can handle multiple samples. Fluorescence assays are especially useful in detecting limiting steps in carbohydrate processing due to their sensitivity and specificity.
  • FRET fluorescence energy transfer
  • Eu time-resolved fluorometry of europium
  • FRET and Eu assays detect enzymatic activities at levels as low as 10 "14 M, which is greater than the sensitivity obtained with I25 I.
  • substrates modified with fluorophores enables the measurement of one specific enzyme activity in an insect cell lysate, and multiple samples can be analyzed simultaneously in a microtiter plate configuration used in an appropriate fluorometer. With these assays, insect cell lines are rapidly screened for the presence of processing enzymes including Gal, GlcNAc, and sialic acid transferases to identify limiting enzymes in N-glycosylation in native and recombinant cells.
  • Glycosyl transferase activity assays are based on the principle of fluorescence energy transfer (FRET), which has been used to study glycopeptide conformation (Rice et al (1991) Biochemistry 30:6646-6655) and to develop endo-type glycosidase assays (Lee et al (1995) Anal. Biochem. 230:31-36).
  • FRET fluorescence energy transfer
  • the fluorescent compound, UDP-Gal-6-Naph, synthesized by consecutive reactions of galactose oxidase (generating 6-oxo compound) and reductive amination with naphthylamine, is found to be effective as a substrate for Gal transferase.
  • UDP-Gal-6-Naph is reacted with an acceptor carrying a dansyl group (Dans-AE- GlcNAc) in the presence of Gal-T, a product is created that can transfer energy (Figure 12).
  • a sialyltransferase assay is designed using similar FRET technology described in the above example for Gal T.
  • the 3-carbon tail (exocyclic chain) of sialic acid in particular, its glycoside
  • This intermediate is reductively aminated to generate a fluorescently tagged sialic acid (after removal of its aglycon), which is then modified to form a fluorescently modified CMP-sialic acid (See also Lee et al. (1994) Anal. Biochem. 216:358-364, Brossamer et al. (1994) Methods Enzymol 247:153-177).
  • the acceptor substrate is modified as described above to include the dansyl group. Then the FRET approach is used to measure either alpha(2, 3) or alpha(2, 6) sialyltransferase activity since these enzymes should utilize the modified CMP-SA as donor substrate to generate a product with altered fluorescent emission characteristics.
  • the choice of the fluorescent donor and acceptor pair can be flexible. The above examples are given using naphthyl-dansyl pairs, but other fluorescent combinations may be even more sensitive (Wu et al. (1994) Anal. Biochem. 250:260- 262).
  • a new GlcNAc-TI assay illustrated in Figure 15, utilizes a synthetic 6- aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTPA (Diethylenetriaminepentaacetic acid) and complexed with Eu +3 .
  • This substrate is then incubated with insect cell lysates or positive controls containing GlcNAc Tl and UDP-GlcNAc. Addition of chemical inhibitors are used to minimize background N- acetylglucosaminidase activity. After the reaction, an excess of Crocus lectin CVL (Misaki et al. (1997) J Biol. Chem.
  • N-acetylglucosaminidase activity is developed using a different lectin, GS-II, which is specific for GlcNAc.
  • the substrate is prepared by modification of the same trimannosyl core glycoside described above using in vitro purified GlcNAc Tl, which results in addition of a GlcNAc_bet ⁇ (l-2) residue to the Man_alpha(l-3) residue.
  • enzymatic hydrolysis by N-acetylglucosaminidase removes GlcNAc from the substrate resulting in the tri-mannosyl core product. The product is not susceptible to lectin binding and thus escapes into the filtrate.
  • Evaluation of Eu +3 fluorescence in the filtrate provides a measure of the N-acetylglucosaminidase activity.
  • enhanced binding of the Eu-bound trimannosyl core to the Crocus lectin described above can be used as another assay for N-acetylglucosaminidase activity.
  • composition of these structures provided insights into the carbohydrate processing pathways present in insect cells and allowed a comparison of intracellular and secreted N-glycan structures.
  • the Trichoplusia ni cells grown in serum free medium in suspension culture were infected with a baculovirus vector encoding a murine IgG (Summers et al. (1987) A manual of methods for baculovirus vectors and insect cells culture procedures).
  • IgG includes an N-linked oligosaccharide attachment on each of the two heavy chains.
  • Heterologous IgG was purified from the culture supernatant and soluble cell lysates using a Protein A-Sepharose column.
  • N-linked oligosaccharides were isolated following protease digestion of IgG and treatment with glycoamidase A to release the N-glycans. Oligosaccharides were then derivatized with 2-aminopyridine (PA) at the reducing ends to provide fluorogenic properties for detection.
  • PA 2-aminopyridine
  • a DEAE column was used to separate oligosaccharides on the basis of carbohydrate acidity (first dimension). None of the oligosaccharides retained on this column were found to include sialic acid. Treatment of the acidic fractions with neuraminidase from Arthrobacter ureafaciens (known to cleave all known sialic acid linkages) failed to release any sialic acid, and ODS-chromatography of the fractions revealed several minor components different from all known sialylated oligosaccharides.
  • the second dimension used reverse phase HPLC with an ODS-silica column to fractionate the labeled oligosaccharides according to carbohydrate structure.
  • Supernatant (S) and lysate (L) IgGs oligosaccharides were separated into 6 and 10 fractions, respectively, labeled A-L in Figure 6.
  • Separation in the third and final dimension was accomplished using an amide column to isolate oligosaccharides on the basis of molecular size. Peak B from the ODS column was separated into two separate oligosaccharide fractions, and peak H was separated into three separate oligosaccharide fractions on the amide-column.
  • oligosaccharide purification After oligosaccharide purification, structures of unknown oligosaccharides were determined by comparing their positions on the 3-dimensional map with the positions of over 450 known oligosaccharides. Co-elution of an unknown sample with a known PA-oligosaccharide on the ODS and amide-silica columns was used to confirm the identity of an oligosaccharide. Digestion by glycosidases with specific cleavage sites (alp b ⁇ -L-fucosidase, bet ⁇ -galactosidase, beta-N- acetylglucosaminidase, and ⁇ - b ⁇ -mannosidase) followed by reseparation provided further confirmation.
  • oligosaccharide G All the oligosaccharides in the culture medium and cell lysates matched known carbohydrates except for oligosaccharide G.
  • the structure of oligosaccharide G was elucidated by treatment of the N-glycan with ⁇ /pb ⁇ -L-fucosidase, known to digest Fuc_ ⁇ //.b ⁇ l-6GIcNAc, followed by treatment with 13.5 M trifluoroacetic acid to remove the alphal, 3 linked fucose.
  • the de-alphal, 6- and de-alphal, 3-fucosylated oligosaccharide G co-eluted with a known oligosaccharide, allowing the identification of G.
  • the structure of oligosaccharide G is shown in Figure 7.
  • oligosaccharide G was further confirmed by l H-NMR and electrospray ionization (ESI) mass spectrometry (Hsu et al. (1997) J. Biol Chem. 272:9062-9070). Thus, the combination of these techniques can be used to elucidate both known and unknown oligosaccharides.
  • the initial processing in the T. ni cells appears to be similar to the mammalian pathway, including trimming of the terminal glucose and mannose residues.
  • the trimming process follows a linear pathway with the exception of two different forms of the Man 7 GlcNAc 2 (M7GN, in Figure 8 also observed in native insect glycoproteins (Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114) and IgG , from NS/0 cells (Ip et al. (1994) Arch. Biochem. Biophys. 308:387-399).
  • M7GN Man 7 GlcNAc 2
  • IgG IgG
  • GlcNAc GN
  • GlcNAc Tl N- acetylglusosaminyltransf erase I
  • Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114).
  • GlcNAc i Man 5 GlcN AC 2 must be a shortlived intermediate quickly processed by alpha-Man II, since this structure was not detected in the T. ni cell lysate.
  • Man 3 GlcNAc oligosaccharide At the GlcNAci, Man 3 GlcNAc oligosaccharide, several branching steps in the N-glycan processing pathway are possible in insect cells.
  • Complex glycoforms can be generated by the action of GlcNAc Til (N- acetylglucosaminyltransferase II) and Gal T (galactosyltransferase T) to provide oligosaccharides which include terminal GlcNAc (GN) and Gal (G) residues. None of the complex oligosaccharide structures included sialic acid indicating that sialylation is negligible or non-existent in these cells.
  • the intracellular N-glycans obtained from insect cells include more than 50% high-mannose type structures.
  • the fraction of intracellular complex oligosaccharides is less than 15% and only 8% include a terminal Gal residue.
  • the high level of high-mannose structures from intracellular sources indicates significantly less oligosaccharide processing for most of the intracellular immunoglobulins. Many of these intracellular immunoglobulins may not reach the compartments in which carbohydrate trimming takes place (Jarvis et al (1989) Mol. Cell. Biol. 9:214-223). High mannose glycoforms are also observed intracellulariy for mammalian cells (Jenkins et al. (1998) Cell Culture Engineering VI).
  • Example 1 Evaluation of N-glycosylation Pathway Enzymes The levels of N-linked oligosaccharide processing enzymes are measured using analytical assays to characterize carbohydrate processing in native and recombinant insect cells. These assays are used to compare the N-glycan processing capacity of different cell lines and to evaluate changes in processing and metabolite levels following metabolic engineering modifications.
  • HP AEC High Performance Anion Exchange Chromatography
  • HP AEC is used in combination with pulsed amperometric detection (HPAEC- PAD) or conductivity to detect metabolite levels in the CMP-SA pathway and to evaluate N-linked oligosaccharide processing enzymes essentially as described by (Lee et al. (1990) Anal. Biochem. 34:953-95 ', Lee et al. (1996) J. Chromatography A 720:137-149).
  • HPAEC- PAD pulsed amperometric detection
  • conductivity to detect metabolite levels in the CMP-SA pathway and to evaluate N-linked oligosaccharide processing enzymes essentially as described by (Lee et al. (1990) Anal. Biochem. 34:953-95 ', Lee et al. (1996) J. Chromatography A 720:137-149).
  • Shown in Figure 9 is an example of the use of HPAEC-PAD for measuring Gal T activity by following the lactose formation reaction:
  • Trichoplusia ni lysates were incubated in the presence of exogenously added CMP-SA and the fluorescent substrate, 4-methylumbelliferyl lactoside (Lac-MU). Negligible conversion of the substrate was observed, indicating the absence of endogenous sialyltransferase activity.
  • conversion of Lac-MU to the product sialyl LacMU was observed in cell lysates using Reverse Phase HPLC and a fluorescence detector ( Figure 10).
  • Lac-ABA o-aminobenzamide
  • HPLC and HP AEC is used in conjunction with other fluorometric methods detailed in the procedures to analyze the metabolites and enzymatic activities in insect cells.
  • DELFIA Dissociation Enhanced Lanthananide FluorommunoAssay
  • FIG. 11 depicts GlcNAc-BSA in (A) Boiled lysate; (B) T. ni; (C) Standard enzyme, 0.5 mU; (D) T.
  • Gal T activity level is increased significantly following infection with a baculovirus vector including a mammalian Gal T gene under the IE1 promoter or by using Sf-9 cells stably- transformed with the Gal T gene (cell lines are described in Jarvis et al (1996) Nature Biotech. 14:1288-1292; and Hollister et al. (1998) Glycobiology 5:473-480).
  • the DELFIA method is not limited to Gal T measurement. This technique is used to evaluate the activity of any processing enzyme which generates carbohydrate structures containing binding sites for a specific lectin or carbohydrate-specific antibodies (Taki et al. (1994) Anal. Biochem. 219:104-108, Rabina et /. (1997) Anal. Biochem. 246:459-470).
  • Example 2 Enhancing SA levels by Substrate Addition Because the conventional substrates in insect cell media are not efficiently converted to CMP-SA in insect cells as demonstrated by the low levels of CMP-SA, alternative substrates are added to the culture medium. Because sialic acid and CMP- SA are not permeable to cell membranes (Bennetts et al. (1981) J. Cell. Biol. 88:1- 15), they are not considered as appropriate substrates. However, other precursors in the CMP-SA pathway are inco ⁇ orated into cells and considered as substrates for the generation of CMP-SA in insect cells.
  • ManNAc has been added to mammalian tissue and cell cultures and enzymatically converted to SA and CMP-SA (Ferwerda et al. (1983) Biochem. J. 216:87-92, Gu et al. (1997) Improvement of the inter feron-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine, Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43). Consequently, external feeding of ManNAc is examined as one strategy to enhance CMP-SA levels in insect cells.
  • ManNAc is available commercially (Sigma Chemical Co.) or can be prepared chemically from the less expensive feedstock GlcNAc in vitro using sodium hydroxide (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393- 400). Initially, the levels of native cellular ManNAc, if any, is determined using HPAEC-PAD techniques (Lee et al. (1990) Anal. Biochem. 34:953-957, Lee et al. (1996) J. Chromatography A 720:137-149, Hardy et al. (1988) Anal. Biochem.
  • ManNAc The ability to increase intracellular ManNAc levels is evaluated by adding ManNAc to cell culture media. Inco ⁇ oration of exogenous ManNAc is quantified using unlabeled ManNAc if levels of native ManNAc are negligible, or 14 C- or 3 H-labeled ManNAc if significant levels of native ManNAc are present) (Bennetts et al. (1981) J Cell. Biol. 88:1-15, Kriesel et al. (1988) J. Biol. Chem. 263:11736-11742). The levels of radioactive ManNAc and other metabolites axe determined by collecting ManNAc peaks following HP AEC and measuring the radioactivity using scintillation countering.
  • the ManNAc To be effective as a substrate for sialylation, the ManNAc must be converted to SA and CMP-SA through intracellular pathways. This conversion is detected directly from externally added ManNAc by following an increase in internal SA and CMP-SA levels using HP AEC or thin layer chromatography (TLC) combined with liquid scintillation counting to detect the radiolabeled metabolites.
  • HP AEC techniques have been used to quantify cellular pools of CMP-SA in as few as 6 x 10 6 mammalian cells (Fritsch et ⁇ l. (1996) Journal of ' Chromatography A 727:223-230), and TLC has been used to evaluate conversion of 14 C labeled ManNAc to sialic acid in bacteria (Vann et al.
  • ManNAc a limiting step exists in the production of ManNAc from conventional insect cell media substrates. Different ManNAc feeding concentrations are tested and the effect on CMP-SA levels and insect cell viability evaluated to determine if there are any deleterious effects from feeding the ManNAc as substrate. Conversion of ManNAc to SA through the aldolase pathway requires pyruvate, and the addition of cytidine can enhance CMP- SA production from SA (Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43).
  • pyruvate and cytidine are optionally added to the medium to enhance conversion of ManNAc to CMP-SA (Tomita et al. (1995) Biochim. Biophys. Acta 1243:329-335, Thomas et al (1985) Biochim. Biophys. Acta 846:37-43).
  • a bioinformatics search of the cDNA libraries of HGS revealed a novel human CMP-SA synthetase gene based on its homology with the E. coli DNA sequence.
  • the bacterial enzyme includes a nucleotide binding site for CTP. This binding site contains a number of amino acids that are conserved among all known bacterial CMP-SA synthetase enzymes (See Stoughton et al, Biochem J. 15:397-402 (1999).
  • the identity of the human cDNA as a CMP-SA synthetase gene was confirmed by the presence of significant homology within this binding motif:
  • This human homologue commercially, publicly, or otherwise available for the pu ⁇ oses of this invention is cloned and expressed in insect cells.
  • the nucleotide and amino acid sequences of human CMP SA synthetase are shown in Figures 29 and 30 respectively.
  • Example 4 Isolation and Inhibition of glucosaminidase It is recognized that insect cells could possess additional N- acetylglucosaminidase enzymes other than the enzyme responsible for generating low-mannose structures, so both recombinant DNA and biochemical approaches are implemented to isolate the target N-acetylglucosaminidase gene. PCR techniques are used to isolate fragments of N-acetylglucosaminidase genes by the same strategies used in isolating ⁇ /pb ⁇ -mannosidase cDNAs from Sf-9 cells (Jarvis et al. (1997)
  • oligonucleotide primers are designed corresponding to regions of conserved amino acid sequence identified in all N-acetylglucosaminidases described thus far, from human to bacteria, including two lepidopteran insect enzymes (Zen et al. (1996) Insect Biochem. Mol. Biol. 26:435-444). These primers are used to amplify a fragment of the N-acetylglucosaminidase gene(s) from genomic DNA or cDNA of lepidopteran insect cell lines commercially, publicly, or otherwise available for the pu ⁇ oses of this invention.
  • FIG. 18 A putative N-acetylglucosaminidase gene fragment from Sf9 genomic DNA and from High FiveTM cell (Invitrogen Co ⁇ ., Carlsbad, CA, USA) cDNA has been identified ( Figure 18). Similar techniques are used to isolate cDNAs from other insect cell lines of interest. The identification of cDNAs for the Sf9 or High FiveTM N-acetylglucosaminidase facilitates the isolation of the gene in other insect cell lines.
  • Figure 18 depicts PCR amplification of Sf9 genomic DNA (A) or High
  • Sense primer #1 5'-T/C,T,I,C,A,C/T,T,G,G,C,A,C/T,- ⁇ /T/C,T,I,G,T,I,G,A-3' (SEQ ID NO:9)
  • Sense primer #2 5'-G,A,G/A,A/T,T,A/C/T,G,A,C/T,I,I,I,C,C,I,G,G/C,I,C,A-3' (SEQ ID NO: 10)
  • Antisense primer #2 5'-T,G,I,C/G,C,I,G,G,I,I,I,G/A,T,C,T/G/A,A,T/A,C/T,T,C-3' (SEQ ID NO: 11)
  • Antisense primer #3 5 * -A,C/A/G,C/T,T,C,G/A,T,C,I,C,C,I,C,C,I,I,I,G/A,T,G-3' (SEQ ID NO: 12)
  • the PCR amplified fragments are cloned and sequenced using the chain termination method (Sanger et al. (1977) Proc. Natl. Acad. Sci. USA 74:5463-5467). The results are used to design exact-match oligonucleotide primers to isolate an N- acetylglucosaminidase clone(s) from existing Sf9 and/or High FiveTM lambda ZAPII cDNA libraries by sibling selection and PCR (Jarvis et al. (1997) Glycobiology 7:113- 127, Kawar et al. (1997) Glycobiology 7:433-443).
  • the library is consecutively split into sub-pools that score positive in PCR screens until a positive sub-pool of approximately 2,000 clones is obtained. These clones are then screened by plaque hybridization (Benton et al. (1977) Science 196:180-182) using the cloned PCR fragment, and positive clones are identified and plaque purified. The cDNA(s) are then excised in vivo as a pBluescript-based subclone in E. coli.
  • the target N-acetylglucosaminidase is membrane bound, so it must be solubilized using a detergent such as Triton-X 100 prior to purification.
  • a detergent such as Triton-X 100 prior to purification.
  • the enzyme is purified by a combination of gel filtration, ion exchange, and affinity chromatography.
  • affinity chromatography the affinants 6- aminohexyl thio-N-acetylglucosaminide (Chipowsky et al. (1973) Carbohydr. Res. 31 :339-346) or BSA modified with thio-N-acetylglucosaminide (Lee et al. (1976) Biochemistry 15:3956-3963) is tried first.
  • 6-aminohexyl a-D-[2-(thio-2- amino-2-deoxy-b-D-glucosaminyl)-mannopyranodside or other thio-oligosaccharides are synthesized and used as affinants.
  • Affinity matrices are prepared using commercially available products.
  • the target enzyme is "anchored" to the membrane by a glycophosphoinositide.
  • a specific phospholipase C is used to release the active enzyme from the membrane, and the use of detergent for solubilization is avoided.
  • the purity of the enzyme is examined with SDS-PAGE and mass spectroscopy, and the activity of the enzyme characterized.
  • Once the enzyme is sufficiently purified its amino-terminal region is sequenced by conventional Edman degradation techniques, available commercially. If N-terminal blockage is encountered, the purified protein are digested, peptides purified, and these peptides are used to obtain internal amino acid sequences. The resulting sequence information is used to design degenerate oligonucleotide primers that are used, in turn, to isolate cDNAs as described above.
  • Isolated full-length cDNAs are sequenced, compared to other N- acetylglucosaminidase cDNAs, and expressed using known polyhedrin-based baculovirus vectors.
  • the overexpressed proteins are purified, their biochemical activities and substrate specificities characterized, and new polyclonal antisera is produced to establish the subcellular locations of the enzymes in insect cells.
  • the locations are optionally identified by using the antisera in conjunction with secretory pathway markers, including Golgi and endoplasmic reticulum specific dyes and GFP- tagged N-glycan processing enzymes commercially, publicly, or otherwise available for the pu ⁇ oses of this invention.
  • Example 5 Expression of the model glycoprotein transferrin The gene encoding human transferrin as described in Genbank accession No.
  • S95936 is cloned into the baculovirus vector, expressed in multiple insect cell lines, and purified to homogeneity.
  • the transferrin is purified to homogeneity, the structures of the oligosaccharides which are N-linked at two sites of the transferrin are analyzed using 3-dimensional HPLC mapping techniques. Over 450 N-glycans have been mapped with this technique. For example, characterization of the N-linked oligosaccharides attached to the heavy chain of secreted and intracellular IgG is described.
  • Sialylation is confirmed by treating the purified N-glycan with sialidase from A. ureafaciens and measuring the release of sialic acid using HPAEC-PAD.
  • Example 6 Cloning, expression, and characterization of the human sialic acid synthetase (SAS) gene and gene product.
  • SAS sialic acid synthetase
  • This example reports the cloning and characterization of a novel human gene having homology to the Escherichia coli sialic acid synthetase gene (neuB).
  • This human gene is ubiquitously expressed and encodes a 40 kD enzyme which results in N-acetylneuraminic acid ( ⁇ eu5Ac) and 2-keto-3-deoxy-D-g/ cero-D-g ⁇ / ⁇ cto-nononic acid (KDN) production in insect cells upon recombinant baculovirus infection.
  • the human enzyme uses N-acetylmannosamine-6-phosphate and mannose-6- phosphate as substrates to generate phosphorylated forms of ⁇ eu5Ac and KDN, respectively, but exhibits much higher activity toward the Neu5Ac phosphate product.
  • the E. coli sialic acid synthetase gene (Annunziato et al., J. Bacteriol 177, 312-319 (1995)) was used to search the human EST database of Human Genome Sciences, Inc. (Rockville, MD).
  • One EST with significant homology to the neuB gene was found in a human liver cDNA library and used to identify a full length cDNA ( Figure 35 A) with an ORF homologous to the bacterial synthetase over most of its length.
  • the putative synthetase consisted of 359 amino acids (SEQ ID NO:6) while the neuB gene product contained 346 amino acids (SEQ ID NO:8). Alignment of the human against the bacterial enzyme demonstrated that significant differences were found primarily in the N-terminus (Figure 35B). Overall, the two synthetases were found to be 36.1% identical and 56.1%> similar at the amino acid level.
  • SAS was inserted into baculovirus under the polh promoter using lacZ as a positive selection marker. After transfection and viral titering, the resulting virus (AcSAS) was used to infect Spodoptera frugiperda (Sf-9) cells followed by pulse labeling. An -40 kD band was observed in the Sf-9 lysates from cells infected by AcSAS ( Figure 36A, lane 5) and not in the mock infected control ( Figure 36A, lane 4). Furthermore, this band co-migrated with the protein produced in vitro. To verify SAS expression, the band was visualized in the non-nuclear fraction (Miyamoto et al., Mol. Cell. Biol.
  • the sialic acid content in cell lysates before and after filtration through a 10,000 MWCO membrane was determined by DMB labeling and HPLC separation.
  • the native sialic acid levels in Sf-9 cells grown without fetal bovine serum (FBS) supplementation are substantially lower than the levels found in CHO cells (Table 2; Figure 37A).
  • FBS fetal bovine serum
  • the sialic acid content of insect cells cultured in 10% FBS was determined.
  • the Neu5 Ac content of Sf-9 cells is nearly an order of magnitude lower than the content of CHO cells (Table 2).
  • the origin of the sialic acid detected in insect cells, whether natively produced or the result of contamination from the media, is not clear since even serum free insect cell media contains significant levels of sialic acid (data not shown).
  • CHO and Sf-9 cells were grown to confluency in T-75 flasks.
  • Cell lysates with and without 10,000 MWCO filtration were analyzed for sialic acid content following DMB derivatization and HPLC separation.
  • Sialic acid levels have been normalized based on lysate protein content. Dashes indicate sialic acid was not detectable.
  • Sialic acid levels were quantified in lysates of uninfected, A35 infected, and AcSAS infected Sf-9 cells grown in media with and without Man, mannosamine (ManN), or ManNAc supplementation (Table 3).
  • Man feeding resulted in detection of KDN slightly above background, and ManNAc feeding marginally increased Neu5Ac levels in uninfected and A35 infected cells (Table 3).
  • ManN supplementation had no effect on KDN levels but increased Neu5Ac levels (Table 3).
  • the most significant changes in sialic acid levels occurred with AcSAS infection.
  • AcSAS infection of Sf-9 cells led to large increases in KDN levels with slight enhancements upon Man or ManNAc supplementation. Both AcSAS infection and ManNAc feeding were required to obtain substantial Neu5Ac levels.
  • the mammalian pathway for Neu5Ac synthesis uses a phosphate intermediate (Jourdian et al., J Biol. Chem. 239, PC2714-PC2716 (1964); Kundig et al., J Biol. Chem. 241, 5619-5626 (1966); Watson et al., J. Biol. Chem. 241, 5627- 5636 (1966)) while the E. coli pathway directly converts ManNAc and PEP to Neu5Ac (Vann et al, Glycobiology 7, 697-701 (1997)).
  • in vitro assays were performed using lysates of infected Sf-9 cells and protein purified from the prokaryotic expression system.
  • Lysates or purified protein plus PEP and MnCl 2 were incubated with Man, mannose-6-phosphate (Man-6- P), ManNAc, or ManNAc-6-P followed by DMB labeling and HPLC analysis.
  • Assays were performed by incubating lysates with different substrate solution concentrations of Man-6-P and ManNAc-6-P in order to evaluate substrate preference. After incubation for a fixed time period, the samples were treated with AP, and DMB derivatives of Neu5Ac and KDN were quantified and compared (Table 4). When equimolar amounts of substrates are used, Neu5Ac production is significantly favored over KDN especially at higher equimolar concentrations (10 and 20 mM) of the two substrates. Only when the substrate concentration of ManNAc-6- P is substantially lower than the Man-6-P levels are production levels of the two sialic acids comparable.
  • the enzyme prefers ManNAc-6-P over Man-6-P in the production of phosphorylated forms of Neu5Ac and KDN, respectively.
  • the 40 kD sialic acid phosphate synthetase enzyme, SAS was expressed in cells.
  • SAS was identified based on homology with neuB whose enzyme product directly forms Neu5Ac from ManNAc and PEP (Vann et al., Glycobiology 7, 697-701 (1997)). Furthermore, insect cells produce Neu5Ac following recombinant SAS expression and ManNAc supplementation. However, mammalian cells are known only to produce Neu5Ac from ManNAc through a three-step pathway with phosphorylated intermediates. Therefore, in vitro assays were performed to determine the substrate specificity of SAS. Both AcSAS infected insect cell lysates and protein purified from the prokaryotic expression system were assayed using ManNAc and ManNAc-6-P as possible substrates.
  • Sf-9 cells natively possess the ability to complete the three-step mammalian pathway when only the sialic acid phosphate synthetase gene is provided. Sf-9 cells have been shown to have substantial ManNAc kinase ability (Effertz et al., J. Biol. Chem. 274, 28771-28778 (1999)), and phosphatase activity has also been detected in insect cells (Sukhanova et al, Genetika 34, 1239-1242 (1998)).
  • N-glycans of recombinant glycoproteins produced in insect cells lack significant levels of terminal sialic acid residues (Jarvis and Finn, Virology 212, 500-511 (1995); Ogonah et al, Bio/Technology 14, 197-202 (1996)).
  • insect cells may possess limited native sialic acid synthetic ability. Similar substrate supplementation results have been reported in mammalian cells, as cultivation in Man-rich or ManNAc-rich media enhanced the synthesis of native intracellular KDN and Neu5Ac, respectively (Angata et al., Biochem. Biophys. Res. Commun. 261, 326- 331 (1999)).
  • KDN encoding any enzyme with KDN synthetic ability.
  • KDN enzymatic activity has been characterized in trout testis, a tissue high in KDN content.
  • KDN is synthesized from Man in trout through a three- step pathway involving a synthetase with a Man-6-P substrate (Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)).
  • the fish synthetase enzyme partially purified from trout testis, was approximately 80 kD as compared to the human enzyme of 40 kD.
  • the ratio of Neu5Ac to KDN is on the order of 100:1 in blood cells and ovaries (Inoue et al., 1998), although this ratio may change during development and cancer.
  • the levels of free KDN in newborn fetal cord red blood cells are higher than those of maternal red blood cells (Inoue et al., J. Biol. Chem. 273, 27199-27204 (1998)).
  • a 4.2 fold increase in the ratio of free KDN to free Neu5 Ac was observed in ovarian tumor cells as compared to normal cells, and the ratio appears to increase with the extent of invasion or malignancy for ovarian adenocarcinomas (Inoue et al., J. Biol. Chem. 273, 27199-27204 (1998)).
  • nucleotide sugars from lysed cells were extracted with 75% ethanol, dried, resuspended in water, and filtered through a 10,000 molecular weight cut-off membrane. Samples were then separated on a Dionex Carbopac PA-1 column using a Shimadzu VP series HPLC. Nucleotide sugars were detected based upon their absorbance at 280 nm, and CMP sialic acid standards were shown to elute at approximately 7 minutes. These results demonstrate the ability to produce the desired oligosaccharide products in insect cells via introduction and expression of sialyltransferase enzymes.
  • the E. coli neuB coding sequence was used to query the Human Genome Sciences (Rockville, MD) cDNA database with BLAST software.
  • One EST clone, HMKAK61, from a human (liver) cDNA library demonstrated significant homology to neuB and was chosen for further characterization.
  • the tissue distribution profile was determined by Northern blot hybridization. Briefly, the cDNA was radio-labeled with [ 32 P]-dCTP using a RediPrimeTMII kit (Amersham/Pharmacia Biotech, Piscataway, NJ) following the manufacturer's directions.
  • the full length ORF was amplified by PCR using the following primers.
  • the forward primer, 5'- TGTAATACGACTCACTATAGGGCGG ⁇ ECCGCCATC ATGCCGCTGGAGCTG GAGC (SEQ ID NO: 13) contained a synthetic T7 promoter sequence (underlined), a BamHI site (italics), a KOZAK sequence (bold), and sequence corresponding to the first six codons of SAS.
  • GTACGG ⁇ CCTTATTAAGACTTGATTTTTTTGCC (SEQ ID NO: 14), contained an Asp 718 site (italics), two in-frame stop codons (underlined), and sequences representing the last six codons of SAS.
  • the PCR product was digested with BamHI and Asp 718 (Roche, Indianapolis, IN) and the resulting fragment cloned into the corresponding sites of the baculovirus transfer vector, pA2.
  • the plasmid (pA2-SAS) was transfected into Sf-9 cells to generate the recombinant baculovirus AcSAS as previously described (Coleman et al., Gene 190, 163-171 (1997)). Amplified virus was used to infect cells, and the gene product was radio- labeled with [ 35 S]-Met and [ 35 S]-Cys. Bands corresponding to the gene product were visualized by SDS-PAGE and autoradiography.
  • the PCR product was used as a template for in vitro transcription and translation using rabbit reticulocyte lysate (Promega, Madison, WI) in the presence of [ 35 S]-Met. Translation products were resolved by SDS-PAGE and visualized by autoradiography.
  • Sf-9 cells were seeded in serum- free media at a density of lxl 0 6 cells/ml in spinner flasks and infected at a multiplicity of infection of 1-2 with the recombinant virus.
  • a detergent fractionation procedure was employed (Miyamoto et al., Mol. Cell. Biol. 5, 2860-2865 (1985)) to separate nuclear from non- nuclear fractions.
  • Protein was resolved by SDS-PAGE, transferred to a ProBlottTM membrane (ABI, Foster City, CA), and visualized by Ponceau S staining. A prominent band at the expected MW of ⁇ 40 kD was visible and excised for protein microsequencing using an ABI-494 sequencer (PE Biosystems, Foster City, CA).
  • the mobile phase was an acetonitrile, methanol, and water mixture (9:7:84, v/v) with a flow rate of 0.7 ml/min.
  • Response factors of Neu5Ac and KDN were established with authentic standards based on peak areas for quantifying sample sialic acid levels.
  • Sialic acid content was normalized based on protein content measured with the Pierce (Rockford, IL) BCA assay kit and a Molecular Devices (Sunnyvale, CA) microplate reader.
  • Sf-9 (ATCC, Manassas, VA) cells were grown in Ex-CellTM 405 media (JRH BioScience, Lenexa, KS) with and without 10% FBS at 27°C.
  • CHO-K1 cells ATCC, Manassas, VA were cultured at 37°C in a humidified atmosphere with 5% CO 2 in Dulbecco's Modified Eagle Medium (Life Technologies, Rockville, MD) supplemented with 10% FBS, 100 U/ml penicillin, 100 ⁇ g/ml streptomycin, 100 ⁇ M MEM essential amino acids, and 4 mM L-glutamine (Life Technologies, Rockville, MD).
  • Lysates were prepared from A35 and AcSAS infected and uninfected Sf-9 cells cultured in T-75 flasks with and without 10 mM ManNAc supplementation.
  • ManNAc-6-P was prepared by acid hydrolysis of meningococcal Group A polysaccharide. The polysaccharide (15.5 mg) in 5.8 ml water was mixed with 770 mg of Dowex 50 H+ and heated for 1 hr. at 100°C. The filtered hydro lysate was dried in vacuo and the residue dissolved to give a solution of 50 mM ManNAc-6-P and stored frozen.
  • Substrate solutions containing 25 mM Man and ManNAc were also used. Boiled samples were used as negative controls. Following incubation, all samples were boiled 3 min., centrifuged for 10 min. at 12,000g, and split into two 10 ⁇ l aliquots. One aliquot was treated with 9 units of calf intestine alkaline phosphatase (Roche, Indianapolis, IN) along with 3 ⁇ l of accompanying buffer while the other aliquot was diluted with water and buffer. AP treated aliquots were incubated 4 hrs. at 37°C, and 10 ⁇ l of both AP treated and untreated samples were reacted with DMB as described above. 2 ⁇ l of the samples incubated with insect lysates and 10 ⁇ l of the samples incubated with bacterial protein were injected onto the HPLC for sialic acid analysis as described above.
  • Man-6-P and ManNAc-6-P concentrations in the substrate solution were varied from 1 to 20 mM.
  • In vitro assays were run with Sf-9 lysates as described above. Samples were treated with 7 ⁇ l buffer and 18 units of AP, incubated for 4 hrs. at 37°C, and analyzed for sialic acid content. Samples containing more than 1 mM ManNAc-6-P in the substrate solution produced high levels of sialic acid and were diluted 1 :5 before injection to avoid fluorescence detector signal saturation.
  • HP AEC Sf-9 cells were grown in T-75 flasks and then infected with A35 or AcSAS or left uninfected in the presence or absence of 10 mM ManNAc. After 80 hrs., cells were washed twice in PBS and sonicated. Aliquots (200 ⁇ l ) were filtered through 10,000 MWCO membranes, and 50 ⁇ l samples were treated with 12.5 ⁇ l aldolase solution [0.0055 U aldolase (ICN, Costa Mesa, CA), 1.4 mM NADH (Sigma, St.
  • Samples were normalized based on protein content by dilution with water, and 20 ⁇ l of each sample were analyzed. Ten ⁇ l of each sample were also derivatized with DMB and analyzed by HPLC as described above to confirm the elimination of sialic acids by aldolase treatment.
  • the applicant hereby requests that the application has been laid open to public inspection (by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent Office without having been laid open inspection, the furnishing of a sample shall only be effected to an expert in the art.
  • the request to this effect shall be filed by the applicant with the Norwegian Patent Office not later than at the time when the application is made available to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been filed by the applicant, any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on the list of recognized experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the individual case.
  • the applicant hereby requests that, until the application has been laid open to public inspection (by the Danish Patent Office), or has been finally decided upon by the Danish Patent office without having been laid open to public inspection, the furnishing of a sample shall only be effected to an expert in the art.
  • the request to this effect shall be filed by the applicant with the Danish Patent Office not later that at the time when the application is made available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by the applicant, any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on a list of recognized experts drawn up by the Danish Patent Office or any person by the applicant in the individual case.
  • the applicant hereby requests that, until the application has been laid open to public inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office without having been laid open to public inspection, the furnishing of a sample shall only be effected to an expert in the art.
  • the request to this effect shall be filed by the applicant with the International Bureau before the expiration of 16 months from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a request has been filed by the applicant any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent Office or any person approved by a applicant in the individual case.
  • the applicant hereby requests that until the date of a grant of a Netherlands patent or until the date on which the application is refused or withdrawn or lapsed, the microorganism shall be made available as provided in the 31 F( 1 ) of the Patent Rules only by the issue of a sample to an expert.
  • the request to this effect must be furnished by the applicant with the Netherlands Industrial Property Office before the date on which the application is made available to the public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, whichever of the two dates occurs earlier.

Abstract

Methods for manipulating carbohydrate processing pathways in cells of interest are provided. Methods are directed at manipulating multiple pathways involved with the sialylation reaction by using recombinant DNA technology and substrate feeding approaches to enable the production of sialylated glycoproteins in cells of interest. These carbohydrate engineering efforts encompass the implementation of new carbohydrate bioassays, the examination of a selection of insect cell lines and the use of bioinformatics to identify gene sequences for critical processing enzymes. The compositions comprise cells of interest producing sialylated glycoproteins. The methods and compositions are useful for heterologous expression of glycoproteins.

Description

ENGINEERING INTRACELLULAR SIALYLATION PATHWAYS
FIELD OF THE INVENTION The invention relates to methods and compositions for expressing sialylated glycoproteins in heterologous expression systems, particularly insect cells.
BACKGROUND OF THE INVENTION
While heterologous proteins are generally identical at the amino acid level, their post-translationally attached carbohydrate moieties often differ from the carbohydrate moieties found on proteins expressed in their natural host species. Thus, carbohydrate processing is specific and limiting in a wide variety of organisms including insect, yeast, mammalian, and plant cells.
The baculovirus expression vector has promoted the use of insect cells as hosts for the production of heterologous proteins (Luckow et al. (1993) Curr. Opin.
Biotech. 4:564-572, Luckow et al. (1995) Protein production and processing from baculovirus expression vectors). Commercially available cassettes allow rapid generation of recombinant baculovirus vectors containing foreign genes under the control of the strong, polyhedrin promoter. This expression system is often used to produce heterologous secreted and membrane-bound glycoproteins normally of mammalian origin.
However, post-translational processing events in the secretory apparatus of insect cells yield glycoproteins with covalently-linked oligosaccharide attachments that differ significantly from those produced by mammalian cells. While mammalian cells often generate complex oligosaccharides terminating in sialic acid (SA), insect cells typically produce truncated (paucimannosidic) and hybrid structures terminating in mannose (Man) or N-acetylglucosamine (GlcNAc) (Figure 1). The inability of insect cell lines to generate complex carbohydrates comprising sialic acid significantly limits the wider application of this expression system. The carbohydrate composition of an attached oligosaccharide, especially sialic acid, can affect a glycoprotein' s solubility, structural stability, resistance to protease degradation, biological activity, and in vivo circulation (Goochee et al. (1991) Bio/technology 9:1347-1355, Cumming et al. (1991) Glycobiology 1 :115-130, Opdenakker et al. (1993) FASEB J. 7:1330, Rademacher et α/. (1988) Ann. Rev. Biochem., Lis et al. (1993) Ewr. J. Biochem. 218:1-27). The terminal residues of a carbohydrate are particularly important for therapeutic proteins since the final sugar moiety often controls its in vivo circulatory half-life (Cumming et al. (1991)
Glycobiology 1 :115-130). Glycoproteins with oligosaccharides terminating in sialic acid typically remain in circulation longer due to the presence of receptors in hepatocytes and macrophages that bind and rapidly remove structures terminating in mannose (Man), N-acetylglucosamine (GlcNAc), and galactose (Gal), from the bloodstream (Ashwell et al. (1974) Giochem. Soe. Symp. 40:117-124, Goochee et al. (1991) Bio/technology 9:1347-1355, Opdenakker et al. (1993) FASEB J. 7:1330). Unfortunately, Man and GlcNAc are the residues most commonly found on the termini of glycoproteins produced by insect cells. The presence of sialic acid can also be important to the structure and function of a glycoprotein since sialic acid is one of the few sugars that is charged at physiological pH. The sialic acid residue is often involved in biological recognition events such as protein targeting, viral infection, cell adhesion, tissue targeting, and tissue organization (Brandley et al. (1986) J. of Leukocyte bio. 40:97-111, Varki et al. (1997) FASEB 11 :248-255, Goochee et al. (1991) Bio/technology 9:1347-1355, Lopez et al. (1997) Glycobiology 7:635-651, Opdenakker et al. (1993) FASEB J. 7:1330).
The composition of the attached oligosaccharide for a secreted or membrane- bound glycoprotein is dictated by the structure of the protein and by the post- translational processing events that occur in the endoplasmic reticulum and Golgi apparatus of the host cell. Since the secretory processing machinery in mammalian cells differs from that in insect cells, glycoproteins with very different carbohydrate structures are produced by these two host cells (Jarvis et al. (1995) Virology 212:500- 511, Maru et α/. (1996) J. Biol. Chem. 271 :16294-16299, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114). These differences in carbohydrate structure can have dramatic effects on the in vitro and in vivo properties of the resulting glycoprotein. For example, the in vitro activity of human thyrotropin (hTSH) expressed in insect cells was five times higher than the activity of the same glycoprotein produced from mammalian Chinese hamster ovary (CHO) cells (Grossman et al. (1997) Endocrinology 138:92-100). However, the in vivo activity of the insect cell-derived product was substantially lower due to its rapid clearance from injected rats. The drop in in vivo hTSH activity was linked to the absence of complex-type oligosaccharides terminating in sialic acid in the insect cell product (Grossman et al. (1997) Endocrinologyl 138:92-100).
N-glycosylation is highly significant to glycoprotein structure and function. In insect and mammalian cells N-glycosylation begins in the endoplasmic reticulum (ER) with the addition of the oligosaccharide, Glc3Man9GlcNAc onto the asparagine (Asn) residue in the consensus sequence Asn-X-Ser/Thr (Moremen, et al. (1994) Glycobiology 4:113-125, Varki et al. (1993) Glycobiology 3(2):97-130, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114). As the glycoprotein passes through the ER and Golgi apparatus, enzymes trim and add different sugars to this N-linked glycan. These carbohydrate modification steps can differ in mammalian and insect hosts. In mammalian cell lines, the initial trimming steps are followed by the enzyme-catalyzed addition of sugars including N-acetylglucosamine (GlcNAc), galactose (Gal), and sialic acid (SA) by the steps shown in Figure 2, and as described in Goochee et al. (1991) Bio/technology 9:1347-1355.
In insect cells, N-linked glycans attached to heterologous and homologous glycoproteins comprise either high-mannose (Man9.5GlcNAc2) or truncated
(paucimannosidic) (Man3. GlcNAc2) oligosaccharides; occasionally comprising alpha(l, 6)-fucose (Figure 3; Jarvis et al. (1989) Mol Cell. Biol. 9:214-223, Kuroda et al. (1990) Virology 174:418-329, Marz et α/. (1995) Glycoproteins 543-563, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114). These reports primarily directed to Sf-9 or Sf-21 cells from Spodoptera frugiperda, indicated that insect cells could trim N-linked oligosaccharides but could not elongate these trimmed structures to produce complex carbohydrates. Reports from other insect cell lines, including Tricoplusia ni (T. ni; High Five™) and Estigmena acrea (Ea-4), indicated the presence of limited levels of partially elongated hybrid (structures with one terminal Man branch and one branch with terminal Gal, GlcNAc, or another sugar; Figure 4a) and complex (structures with two non-Man termini; Figure 4b) N-linked oligosaccharides (Oganah et al. (1996) Bio/Technology 14:197- 202, Hsu et al. (1997) J. Biol. Chem. 272:9062-9070). Low levels of GlcNAc transferase I and II (GlcNAc TI and Til), fucosyltransferase, mannosidases I and II, and Gal transferase (Gal T) have been reported in these insect cells; indicating a limited capability for production of these hybrid and complex N-linked oligosaccharides in these cells (Velardo et al. (1993) J. Biol. Chem. 268:17902-17907, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114, van Die et al. (1996) Glycobiology 6:157-164).
However; most insect cell derived glycoproteins lack complex N-glycans. This absence may be attributed to the presence of the hexosaminidase N- acetylglucosammidase that cleaves GlcNAc attached to the alphail, 3) Man branch to generate paucimannosidic oligosaccharides (Licari et al. (1993) Biotech. Prog. 9:146- 152, Altmann et al. (1995) J. Biol. Chem. 270:17344-17349). Chemicals have been added in an attempt to inhibit this glycosidase activity, but significant levels of paucimannosidic structures remain even in the presence of these inhibitors (Wagner et al. (1996) J. Virology 70:4103-4109).
Manipulating carbohydrate processing in insect cells has been attempted; and in mammalian cells, the expression of sialyltransferases, galactosyltransferases and other enzymes is well established in order to enhance the level of oligosaccharide attachment (see U.S. Patent No. 5,047,335). However, in these cases, the presence of the necessary donor nucleotide substrates, most significantly the sialylation nucleotide, CMP-sialic acid, in the proper subcellular compartment has been assumed. Attempts to manipulate carbohydrate processing have been made by expressing single transferases such as N-Acetylglucosamine transferase I (GlcNAc Tl), galactose transferase (GAL T), or sialyltransferase (Lee et al. (1989) J Biol. Chem. 264:13848- 13855, Wagner et al. (1996) Glycobiology (5:165-175, Jarvis et al. (1996) Nature Biotech. 14:1288-1292, Hollister et al. (1998) Glycobiology 5:473-480, Smith et al. (1990) J. Biol. Chem. 265:6225-6234, Grabenhorst et al. (1995) Ewr. J. Biochem. 232:718-725). Introduction of a mammalian beta(l , 4)-GalT using viral vectors (Jarvis et al. (1995) Virology 212:500-511) or stably-transformed cell lines (Hollister et al. (1998) Glycobiology 5:473-480) indicates that both approaches can enhance the extent of complex glycosylation of foreign glycoproteins expressed in insect cells. GlcNAcTl co-expression can increase the number of recombinant glycoproteins with oligosaccharides containing GlcNAc on the Man alphail, 3) branch (Jarvis et al. (1996) Nature Biotech. 14:1288-1292, Jarvis et al. (1995) Virology 212:500-511, Hollister et al. (1998) Glycobiology 5:473-480; Wagner et al. (1996) Glycobiology 6:165-175). However, the production of complex carbohydrates comprising sialic acid has not been observed in these studies. Sialylation of a single recombinant protein (plasminogen) produced in baculovirus-infected insect cells has been reported (Davidson et al. (1990) Biochemistry 29:5584-5590), but findings appear to be specific to this glycoprotein. Conversely, many reports indicate the complete absence of any attached sialic acid on glycoproteins from all insect cell lines tested to date (Noss et al. (1993) Eur. J. Biochem. 217:913-919, Jarvis et al. (1995) Virology 212:500-511, Marz et al. (1995) Glycoproteins 543-563, Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114, Hsu et al. (1997) J. Biol. Chem. 272:9062-9070). The reason for this absence of sialylated glycoproteins was initially puzzling since polysialic acid structures were obtained in Drosophila embryos (Roth et al. (1992) Science 256:673-675). However, as demonstrated herein, it is now evident that insect cell lines generate very little sialic acid as compared to mammalian CHO cells (See Figure 16). With very little sialic acid, the insect cells cannot generate the donor nucleotide CMP-sialic acid essential for sialylation. A similar lack or limitation in donor nucleotide substrates may be observed in other eukaryotes as well. Thus, the co-expression of sialyltransferase and other transferases must be accompanied by the intracellular generation of the proper donor nucleotide substrates and the proper acceptor substrates in order for the production of sialylated and other complex glycoproteins in eukaryotes. In addition, sialic acid and CMP-sialic acid are not permeable to cells so these substrates can not be provided directly to the medium of the cultures (Bennett et al. (1981) J. Cell. Biol. 88:1-15).
The manipulation of post-translational processing is particularly relevant to biotechnology since recombinant DΝA products generated in different hosts are usually identical at the amino acid level and differ only in the attached carbohydrate composition (Goochee et al. (1991) Bio/technology 9:1347-1355). Engineering carbohydrate pathways is useful to make recombinant DΝA technology more versatile and expand the number of hosts that can generate particular glycoforms. This flexibility could ultimately lower biotechnology production costs since host efficiency would be the primary factor dictating which expression system is chosen rather than a host's capacity to produce a specific glycoform. Furthermore, carbohydrate engineering is useful to tailor a glycoprotein to include specific oligosaccharides that could alter biological activity, structural properties or circulatory targets. Such carbohydrate engineering efforts will provide a greater variety of recombinant glyco- products to the biotechnology industry.
Glycoproteins containing sialylated oligosaccharides would have improved in vivo circulatory half-lives that could lead to their increased utilization as vaccines and therapeutics. In particular, complex sialylated glycoproteins from insect cells would be more appropriate biological mimics of native mammalian glycoproteins in molecular recognition events in which sialic acid plays a role.
Therefore, manipulating carbohydrate processing pathways in insect and other eukaryotic cells so that the cells produce complex sialylated glycoproteins is useful for enhancing the value of heterologous expression systems and increasing the application of heterologous cell expression products as vaccines, therapeutics, and diagnostic tools; for increasing the variety of glycosylated products to be generated in heterologous hosts; and for lowering biotechnology production costs, since particular expression systems can be selected based on efficiency of production rather than the capacity to produce particular product glycoforms.
SUMMARY OF THE INVENTION
Compositions and methods for producing glycoproteins having sialylated oligosaccharides are provided. The compositions of the invention comprise enzymes involved in carbohydrate processing and production of nucleotide sugars, nucleotide sequences encoding such enzymes, and cells transformed with these nucleotide sequences. The compositions of the invention are useful in methods for producing complex sialylated glycoproteins in cells of interest including, but not limited to, mammalian cells and non-mammalian cells (e.g., insect cells).
The sialylation process involves the post-translational addition of a donor substrate, cytidine monophosphate-sialic acid (CMP-SA) onto a specific acceptor carbohydrate (GalGlcNAcMan-R) via an enzymatic reaction catalyzed by a sialyltransferase in the Golgi apparatus. Since one or more of these three reaction components (i.e., acceptor, donor substrate, and the enzyme sialyltransferase) is limiting or absent in certain cells of interest, methods are provided to enhance the production of the limiting components. Polynucleotide sequences encoding the enzymes used according to the methods of the invention are known or novel bacterial invertebrate, fungal, or mammalian sequences and/or fragments or variants thereof , that are optionally identified using bioinformatics searches. According to one embodiment of the invention, completion of the sialylation reaction is achieved by expressing a sialyltransferase enzyme, or a fragment or variant thereof, in the presence of acceptor and/or donor substrates. The invention also provides an assay for sialylation, wherein the structures and compositions of N-linked oligosaccharides attached to a model secreted glycoprotein, (e.g., transferrin), is elucidated using multidimensional chromatography. Cells of interest that have been recombinantly engineered to produce new forms of sialylated glycoproteins, higher concentrations of sialylated glycoproteins, and/or elevated concentrations of donor substrates (.g., nucleotides sugars) required for sialylation, as well as kits for expression of sialylated glycoproteins are also provided.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 depicts the typical differences in insect and mammalian carbohydrate structures.
Figure 2 depicts the enzymatic generation of a complex sialylated carbohydrate in mammalian cells.
Figure 3 depicts a Paucimannosidic oligosaccharide.
Figure 4a depicts a hybrid glycan from Estigmena acrea (Ea-4) insect cells. Figure 4b depicts a complex glycan from Estigmena acrea (Ea-4) insect cells.
Figure 5 depicts the nucleotide sugar production pathways in mammalian and E. coli cells leading to sialylation.
Figure 6 depicts a chromatogram of labeled oligosaccharides separated by reverse phase High Performance Liquid Chromatography (HPLC) on an ODS-silica column. Using this technique, oligosaccharides are fractionated according to their carbohydrate structures. Panel "L" represents cell lysate fractions and panel "S" represents cell supernatant fractions.
Figure 7 depicts the structure of Oligosaccharide G.
Figure 8 depicts the glycosylation pathway in Trichoplusia ni insect cells (High Five™ cells; Invitrogen Corp., Carlsbad, CA, USA).
Figure 9 depicts the chromatogram of a Galactose-transferase assay following High Performance Anion Exchange Chromatography (HP AEC), as described in the Examples and references cited therein.
Figure 10 depicts the chromatogram of a 2,3-Sialyltransferase assay following Reverse Phase-High Performance Liquid Chromatography (RP-HPLC), as described in the Examples.
Figure 11 depicts the results of a Galactose-transferase (Gal-T) assay of insect cell lysates performed using a Europium (Eu+3)-labeled Ricinus cummunis lectin (RCA 120) probe; which specifically binds Gal or GalNAc oligosaccharide structures as described in the Examples. Each column represents the Gal-T activity in a given sample; Column (A) represents boiled T. ni cell lysates, Column (B) represents normal T. ni cell lysates, Column (C) represents activity in 0.5 mU of enzyme standard, Column (D) represents lysate from T. ni cells infected with a baculovirus coding for GalT, Column (E) represents lysates from Sf-9 cells stably transfected with the GalT gene. Figure 12 depicts the product of reacting UDP-Gal-6-Naph with Dans- AE-GlcNAc in the presence of GalT.
Figure 12 depicts the reaction products resulting from incubation of UDP-Gal- 6-Naph and Dans-AE-GlcNAc in the presence of Galactose-transferase, as described in the "Experimental" section below.
Figure 13 depicts the distinguishing emission spectra of GalT assay reactants and products, as described in the "Experimental" section below. Irradiation of the naphthyl group in UDP-Gal-6-Naph at 260-290 nm ("ex") results in an emission peak at 320-370 nm ("em" dotted line) while irradiation of the Galactose-transferase reaction products at these same low wavelengths results in energy transfer to the dansyl group and an emission peak at 500-560 nm ("em" solid line).
Figure 14 depicts the oxidation reaction of sialic acid.
Figure 15 schematically depicts a new GlcNAc Tl assay utilizing a synthetic 6-aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTPA (Diethylenetriaminepentaacetic acid) and complexed with Eu+3 (see "Experimental" section below). This substrate is incubated with insect cell lysates or positive controls containing GlcNAc Tl and UDP-GlcNAc. Chemical inhibitors are added to minimize background N-acetylglucosaminidase activity. After the reaction, an excess of Crocus lectin CVL (Misaki et al. (1997) J. Biol. Chem. 272:25455- 25461), which specifically binds the trimannosyl core, is added. The amount of lectin required to bind all the trimannosyl glycoside (and hence all the Eu +3 label) in the absence of any GlcNAc binding is predetermined. Following an ultrafiltration step, the glycoside modified with GlcNAc (not binding CVL) appears in the filtrate. Measurement of the Eu+ fluorescence in the filtrate reflects the level of GlcNAc Tl activity in the culture lysates.
Figure 16 depicts a chromatogram of sialic acid levels in SF9 insect cells and CHO (Chinese hamster ovary) cells. In the panel labeled "Sf-9 Free Sialic Acid Levels" the known sialic acid standard elutes just prior to 10 minutes, while no corresponding sialic acid peak can be detected (above background levels) in Sf-9 cells. In the panel labeled "CHO sialic acid levels" the sialic acid standard elutes at approximately 9 minutes, while bound and free (released by acid hydrolysis) sialic acid peaks are observed at similar elution positions.
Figure 17 depicts how selective inhibition of N-acetylglucosaminidase allows for production of complex oligosaccharide structures.
Figure 18 depicts ethidium bromide-stained agarose gels following electrophoresis of PCR amplification products from Sf9 genomic DNA or High Five™ (Invitrogen Corp., Carlsbad, CA, USA) cell cDNA templates using degenerate primers corresponding to three different regions conserved within N- acetylglucosaminidases.
Figure 19 depicts two potential specific chemical inhibitors of N- acetylglucosaminidase.
Figure 20 schematically depicts that the overexpression of various glycosyltransferases leads to greater production of oligosaccharide acceptor substrates.
Figure 21 depicts three possible N-glycan acceptor structures which include the terminal Gal (G) acceptor residue required for subsequent sialylation.
Figure 22 depicts a structure of CMP-sialic acid (CMP-SA).
Figure 23 depicts a metabolic pathway for ManNAc (N-acetylmannosamine) from glucosamine and N-acetylglucosamine (GlcNAc).
Figure 24 depicts a ManNAc (N-acetylmannosamine) to sialic acid metabolic pathway. Figure 25 depicts the formation of CMP-sialic acid (CMP-S A) catalyzed by CMP-SA synthetase.
Figure 26 depicts detection of purified (P) transferrin (hTf) or transferrin from unpurified insect cell lysates (M) following separation on an SDS-PAGE gel, as described the Examples.
Figure 27 depicts the nucleotide sequence of human aldolase.
Figure 28 depicts the amino acid sequence of human aldolase encoded by the sequence shown in Figure 27.
Figure 29 depicts the nucleotide sequence of human CMP-SA synthetase (cytidine monophosphate-sialic acid synthetase)
Figure 30 depicts the amino acid sequence of human CMP-SA synthetase encoded by the sequence shown in Figure 29.
Figure 31 depicts the nucleotide sequence of human sialic acid synthetase
(human SA-synthetase; human SAS).
Figure 32 depicts the amino acid sequence of human SA-synthetase (SAS) encoded by the sequence shown in Figure 31.
Figure 33 depicts the types and quantities of oligosaccharide structures found on recombinant human transferrin in the presence and absence of Gal T overexpression.
Figure 34 depicts bacterial and mammalian sialic acid metabolic pathways. Figure 35 depicts human sialic acid synthetase (SAS) genetic information: (A) depicts an alignment of the polypeptide encoded by the human SAS polynucleotide open-reading frame; (B) shows the amino acid sequence homology between human SAS (top) and bacterial sialic acid synthetase (NeuB) (bottom).
Figure 36 (A) depicts an autoradiogram of human sialic acid synthetase gene products following gel electrophoresis. The lanes labeled "In Vitro" represent in vitro transcription and translation products of SAS cDNA (amplified via polymerase chain reaction (PCR)). Lane 1 ("pA2") depicts a negative control reaction in which pA2 plasmid (without the SAS cDNA) was PCR amplified, transcribed, translated, and radiolabled. Lane 2 ("pA2-SAS ") depicts a sample reaction in which pA2-SAS plasmid (containing the human SAS cDNA) was PCR amplified, transcribed, translated, and radiolabeled. Lane 3 ("Marker") depicts radiolabeled protein standards migrating at approximately 66, 46, 30, 21.5, and 14.3 kD. The lanes labeled "Pulse Label" show radioactive 35S pulse labeling of polypeptides from insect cells infected by virions not containing or containing the human SAS cDNA. Lane 4 ("A35") depicts a negative control reaction of radiolabled polypeptides from insect cells infected with virions not containing the SAS cDNA. Lane 5 ("AcSAS") depicts a sample reaction of radiolabeled polypeptides from insect cells infected with baculovirus containing the human SAS cDNA. Figure 36 (B) depicts an RNA (Northern) blot of human tissues (spleen, thymus, prostate, testis, ovary, small intestine, peripheral blood lymphocytes (PBL), colon, heart, brain, placenta, lung, liver, skeletal muscle, kidney, and pancreas) probed for sialic acid synthetase RNA transcripts. Transcript sizes (in kilobases) are indicated by comparison to the scale on the left side.
Figure 37 depicts chromatograms indicating the in vivo sialic acid content of various cells as monitored following DMB derivitization and reverse phase HPLC separation. Figure 37 (A) depicts the sialic acid content of lysed cell lines after filtration through a 10,000 MWCO membrane. The cell lines analyzed were Sf-9
(insect) cells in standard media, SF-9 cells supplemented with 10% FBS (fetal bovine serum), or CHO (Chinese Hamster Ovary) cells. The original chromatogram values have been divided by protein concentration to normalize chromatograms. The standards shown are Neu5Ac at 1000 fmol, Neu5Gc at 200 fmol, and KDN at 50 ftnol. Figure 37 (B) depicts a chromatogram of the sialic acid content of lysates from various Sf-9 cells. "AcSAS Infected" cell lysates were from Sf-9 cells infected with baculovirus containing the human SAS cDNA. The Neu5Ac and KDN "Standards" are shown at 1,000 fmol concentrations. "A35 Infected" cell lysates are from Sf-9 infected by baculovirus not containing the SAS cDNA. "Uninfected" cell lysates are from normal Sf-9 cells not infected by any baculovirus. Original chromatogram values have been divided by protein concentration to normalize chromatograms. Figure 37 (C) depicts a chromatogram of the sialic acid content from lysates of Sf-9 grown in media supplemented by 10 mM ManNAc; cells were infected or not infected with baculovirus as shown in Figure 37 (B). Original chromatogram values have been divided by protein concentrations to normalize chromatograms. Neu5Ac and KDN standards represent 1,000 fmol. Figure 37(D) HP AEC (high performance anion- exchange chromatography) analysis of lysates from Sf-9 cells infected with AcSAS or A35 baculovirus with and without aldolase treatment. Samples were diluted prior to column loading to normalize sialic acid quantities based on original sample protein concentration. Neu5Ac standard is shown at 250 pmol and KDN standard is shown at lOO pmol.
Figure 38 depicts chromatograms of in vitro assays for sialic acid phosphorylation activity. Assays were performed with and without alkaline phosphatase (AP) treatment. Figure 38 (A) depicts chromatogram results of a Neu5 Ac-9-phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS baculovirus (containing the human SAS cDNA). KDN and Neu5Ac standards are shown at 5000 fmol. Figure 38 (B) depicts chromatogram results of a KDN-9-phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS baculovirus (containing the human SAS cDNA). KDN and Neu5Ac standards are shown at 5000 fmol.
Figure 39 depicts a chromatogram demonstrating production of sialylated nucleotides in SF-9 insect cells following infection with CMP-SA synthetase and SA synthetase containing baculoviruses. Sf-9 cells were grown in six well plates and infected with baculovirus containing CMP-SA synthase and supplemented with 10 mM ManNAc ("CMP" line), with baculovirus containing CMP-SA synthase and SA synthase plus 10 mM ManNAc supplementation ("CMP+SA" line), or with no baculovirus and no ManNAc supplementation ("SF9" line).
DETAILED DESCRIPTION OF THE INVENTION
Compositions and methods for producing glycoproteins with sialylated oligosaccharides are provided. In particular, the carbohydrate processing pathways of cell lines of interest are manipulated to produce complex sialylated glycoproteins. Such sialylated glycoproteins find use as pharmaceutical compositions, vaccines, diagnostics, therapeutics, and the like.
Cells of interest include, but are not limited to, mammalian cells and non- mammalian cells, such as, for example, CHO, plant, yeast, bacterial, insect, and the like. The methods of the invention can be practiced with any cells of interest. By way of example, methods for the manipulation of insect cells are described fully herein. However, it is recognized that the methods may be applied to other cells of interest to construct processing pathways in any cell of interest for generating sialylated glycoproteins. Oligosaccharides on proteins are commonly attached to asparagine residues found within Asn-X-Ser/Thr consensus sequences; such asparagine-linked oligosaccharides are commonly referred to as "N-linked". The sialylation of N-linked glycans occurs in the Golgi apparatus by the following enzymatic mechanism: CMP- SA + GalGlcNAcMan-R sialyltransferase SAGalGlcNAcMan-R + CMP. The successful execution of this sialylation reaction depends on the presence of three elements: 1) the correct carbohydrate acceptor substrate (designated GalGlcNAcMan- R in the above reaction; where the acceptor substrate is a branched glycan, GalGlcNAcMan is comprised by at least one branch of the glycan, the Gal is a terminal Gal, and R is an N-linked glycan); 2) the proper donor nucleotide sugar, cytidine monophosphate-sialic acid (CMP-SA); and 3) a sialyltransferase enzyme. Each of these reaction components is limiting or missing in insect cells (Hooker et al. (1997) Monitoring the glycosylation pathway of recombinant human inter fer on- gamma produced by animal cells , Hsu et al. (1997) J. Biol. Chem. 272:9062-9070, Jarvis et al. (1995) Virology 212:500-511, Jenkins et al. (1998) Cell Culture Engineering VI , Oganah et al. (1996) Bio/Technology 14:197-202).
It will be apparent to those skilled in the art that where a cell of interest is manipulated according to the methods of the invention such that the cell produces a desired level of the donor substrate CMP-SA, and expresses a desired level of sialyltransferase; any oligosaccharide or monosaccharide, any compound containing an oligosaccharide or monosaccharide, any compatible aglycon (for example Gal- sphingosine), any asparagine (N)-linked glycan, any serine- or threonine-linked (O- linked) glycan, and any lipid containing a monosaccharide or oligosaccharide structure can be a proper acceptor substrate and can be sialylated within the cell of interest.
Accordingly, the methods of the invention may be applied to generate sialylated glycoproteins for which the acceptor substrate is not necessarily limited to the structure GalGlcNAcMan-R, although this structure is particularly recognized as an appropriate acceptor substrate structure for production of N-linked sialylated glycoproteins. Thus, according to the methods of the present invention, the acceptor substrate can be any glycan. Preferably, the acceptor substrate according to the methods of the invention is a branched glycan. Even more preferably, the acceptor substrate according to the methods of the invention is a branched glycan comprising a terminal Gal in at least one branch of the glycan. Yet even more preferably, the acceptor substrate according to the methoids of the invention has the structure GalGlcNAcMan in at least one branch of the glycan and the Gal is a terminal Gal. It will also be apparent to those skilled in the art that engineering the sialylation process into cells of interest according to the methods of the present invention requires the successful manipulation and integration of multiple interacting metabolic pathways involved in carbohydrate processing. These pathways include participation of glycosyltransferases, glycosidases, the donor nucleotide sugar (CMP- SA) synthetases, and sialic acid transferases. "Carbohydrate processing enzymes" of the invention are enzymes involved in any of the glycosyltransfer, glycosidase, CMP- SA synthesis, and sialic acid transfer pathways. Known carbohydrate engineering efforts have generally focused on the expression of transferases (Lee et al. (1989) J. Biol Chem. 264:13848-13855, Wagner et al. (1996) J. Virology 70:4103-4109, Jarvis et al. (1996) Nαtwre Biotech. 14:1288-1292, Hollister et al. (1998) Glycobiology 5:473-480, Smith et al. (1990) J. Biol. Chem. 265:6225-6234, Grabenhorst et al. (1995) Eur. J. Biochem. 232:718-725; U.S. Patent No. 5,047,335; International patent application publication number WO 98/06835). However, it is recognized in this invention that the mere insertion of one or more transferases into cells of interest does not ensure sialylation, as there are generally insufficient levels of the donor (CMP- SA) and the acceptor substrates, particularly GalGlcNAcMan-R.
The methods of the present invention permit manipulation of glycoprotein production in cells of interest by enhancing the production of donor nucleotide sugar substrate (CMP-SA) and optionally, by introducing and expressing sialyltransferase and/or acceptor substrates. By "cells of interest" is intended any cells in which the endogenous CMP-SA levels are not sufficient for the production of a desired level of sialylated glycoprotein in that cell. The cell of interest can be any eukaryotic or prokaryotic cell. Cells of interest include, for example, insect cells, fungal cells, yeast cells, bacterial cells, plant cells, mammalian cells, and the like. Human cells and cell lines are also included in the cells of interest and may be utilized according to the methods of the present invention to, for example, manipulate sialylated glycoproteins in human cells and/or cell lines, such as, for example, kidney, liver, and the like. By "desired level" is intended that the quantity of a biochemical comprised by the cell of interest is altered subsequent to subjecting the cell to the methods of the invention. In this manner, the invention comprises manipulating levels of CMP-SA and/or sialylated glycoprotein in the cell of interest. In a preferred embodiment of the invention, manipulating levels of CMP-SA and sialylated glycoprotein comprise increasing the levels to above endogenous levels. It is recognized that the increase can be from a non-detectable level to any detectable level; or the increase can be from a detected endogenous level to a higher level.
According to the present invention, production of the acceptor substrate is achieved by optionally screening a variety of cell lines for desirable processing enzymes, suppressing unfavorable cleavage reactions that generate truncated carbohydrates, and/or by enhancing expression of desired glycosyltransferase enzymes such as galactose transferase. Methods of enhancing expression of certain carbohydrate processing enzymes, including but not limited to, glycosyltransferases, are described in U.S. Patent No. 5,047,335 and International patent application publication number WO 98/06835, the contents of which are herein incorporated by reference. According to the present invention, production of the donor substrate, CMP-
SA, may be achieved by adding key precursors such as N-acetylmannosamine (ManNAc), N-acetylglucosamine (GlcNAc) and glucosamine to cell growth media, by enhancing expression of limiting enzymes in CMP-SA production pathway in the cells, or any combination thereof. For purposes of the present invention, by "enhancing expression" is intended to mean that the translated product of a nucleic acid encoding a desired protein is higher than the endogenous level of that protein in the host cell in which the nucleic acid is expressed. In a preferred embodiment of the invention, the biological activity of a desired carbohydrate processing enzyme is increased by enhancing expression of the enzyme.
For the purposes of the invention, by "suppressing activity" is intended to mean decreasing the biological activity of an enzyme. In this aspect, the invention encompasses reducing the endogenous expression of the enzyme protein, for example, by using antisense and/or ribozyme nucleic acid sequences corresponding to the amino acid sequences of the enzyme; gene knock-out mutagenesis; and/or by inhibiting the activity of the enzyme protein, for example, by using chemical inhibitors.
By "endogenous" is intended to mean the type and/or quantity of a biological function or a biochemical composition that is present in a naturally occurring or recombinant cell prior to manipulation of that cell according to the methods of the invention.
By "heterologous" is intended to mean the type and/or quantity of a biological function or a biochemical composition that is not present in a naturally occurring or recombinant cell prior to manipulation of that cell by the methods of the invention. For purposes the present invention, by "a heterologous polypeptide or protein" is meant as a polypeptide or protein expressed (i.e. synthesized) in a cell species of interest that is different from the cell species in which the polypeptide or protein is normally expressed (i.e. expressed in nature).
Methods for determining endogenous and heterologous functions and compositions relevant to the invention are provided herein; and otherwise encompass those methods known in the art.
Generation of Acceptor Carbohydrate Substrate: GalGlcNAcMan-R:
According to the methods of the present invention, production of the acceptor substrate glycan GalGlcNAcMan-R, is particularly desirable for the sialylation reaction of N-linked glycoproteins, moreover the terminal Gal is required. Thus, in one embodiment of the invention the cells of interest are manipulated (using techniques described herein or otherwise known in the art) to contain this substrate. For example, for insect cells which principally produce truncated carbohydrates terminating in Man or GlcNAc, such cells may routinely be manipulated to produce a significant fraction of complex oligosaccharides terminating in Gal. Three non limiting, non-exclusive approaches that may be routinely applied to produce a significant fraction of complex oligosaccharides terminating in Gal include: (1) developing screening assays to analyze a selection of insect cell lines for the presence of particular carbohydrate processing enzymes; (2) elevating production of Gal- terminated oligosaccharides by expressing specific enzymes relevant to carbohydrate processing pathways; and (3) suppressing carbohydrate processing pathways that produce truncated N-linked glycans which cannot serve as acceptors in downstream glycosyltransferase reactions.
Thus, in one embodiment, to produce GalGlcNAcMan-R acceptor substrates according to the methods of the invention, cell lines of interest are initially, and optionally, screened to identify cell lines with the desired endogenous carbohydrate production for subsequent metabolic manipulations. More particularly, the screening process includes characterizing cell lines for glycosyl transferase activity using techniques described herein or otherwise known in the art. Furthermore, it is recognized that any screened cell line could generate some paucimannosidic carbohydrates. Accordingly, the screening process also includes using techniques described herein or otherwise known in the art to characterize cell lines for particular glycosidase activity leading to production of paucimannosidic structures.
Thus, in another embodiment, for the production of the acceptor substrates, the invention encompasses utilizing methods described herein or otherwise known in the art to enhance the expression of one or more transferases. Such methods include, but are not limited to, methods that enhance expression of Gal T, GlcNAc -Tl and -TII or any combination thereof; for example, as described in International patent application publication number WO 98/06835 and U.S. Patent No. 5,047,335.
Thus, in another embodiment, concentrations of acceptor substrates are increased by using methods described herein or otherwise known in the art to suppress the activity of one or more endogenous glycosidases. By way of example, an endogenous glycosidase, the activity of which may be suppressed accoreding to the methods of the invention includes, but is not limited to, the hexosaminidase, N- acetylglucosaminidase (an enzyme that degrades the substrate required for oligosaccharide elongation).
Thus, the invention encompasses enhancing metabolic pathways that produce the desired acceptor carbohydrates and/or suppressing those pathways that produce truncated acceptors.
Characterizing cell lines using enzyme screening assay
The cell lines of interest produce different N-glycan structures. Thus, such cells can routinely be screened using techniques described herein or otherwise known in the art to determine the presence of carbohydrate processing enzymes of interest. In insect cells, for example, different insect cell lines produce very different N-glycan structures (Jarvis et al. (1995) Virology 212:500-511, Hsu et al. (1997) J. Biol. Chem. 272:9062-9070, Nishimura et al. (1996) Bioorg. Med. Chem. 4:91-96). However, only a few cell lines have been characterized, in part due to the lack of efficient screening assays. The present invention provides methods implementing fluorescence energy transfer and Europium fluorescence assays to screen a selection of different cells of interest, such as, for example, insect cell lines for the presence of critical carbohydrate processing enzymes. Analytical bioassays described herein or otherwise known in the art are also provided according to the methods of the present invention to detect the presence of favorable carbohydrate processing enzymes, including, but not limited to, galactosyl transferase (Gal T), GlcNAc transferase I (GlcNAc T I), and sialyltransferase; and to detect undesirable enzymes including, but not limited to, N-acetylglucosaminidase.
Where the cells of interest are insect cells, it will be immediately apparent that substantial diversity exists among established insect cell lines due to the range of species and tissues from which these lines were derived. Many of these lines can routinely be infected by the baculovirus, Autographa californica nuclear polyhedrosis virus (Aα NPV), and used for the production of heterologous proteins. However, only a few cell lines are routinely used for recombinant protein production using techniques described herein or otherwise known in the art. These cell lines will be immediately apparent by one skilled in the art. It is recognized that any cell line can be screened for specific carbohydrate processing enzymes, and manipulated for the purposes of the present invention. Examples of such cell lines include, but are not limited to, insect cell lines, including but not limited to, Spodoptera frugiperda (e.g. Sf-9 or Sf-21 cells), Trichoplusia ni (T. ni), and Estigmene acrea (Ea4). Spodoptera frugiperda lines (Sf-9 or Sf-21) are the most widely used cell lines and a significant amount information is known about the oligosaccharide processing in these cells. Trichoplusia ni (e.g. High Five™ cells; Invitrogen Corp., Carlsbad, CA, USA) cells have been shown to secrete high yields of heterologous proteins with attached hybrid and complex N-glycans (Davis et al (1993) In Vitro Cell. Dev. Biol. 29:842-846). Estigmena acrea (Ea-4) have been used to generate hybrid and complex N-linked oligosaccharides terminating in GlcNAc and Gal residues (Oganah et al. (1996) Bio/Technology 14:197-202).
Drosophila Schneider S2 cell lines represent another insect cell line used for the production of heterologous proteins. Though these cells cannot be infected by the AcNPV expression vector, they are used for production of heterologous proteins via an alternative technology known in the art. These cell lines represent other insect cell line candidates whose glycosylation processing characteristics may be modified to include sialylation. In insect cells, paucimannosidic structures are produced by a membrane-bound N-acetylglucosaminidase, which removes terminal GlcNAc residues from the alpha(l,3) arm of the trimannosyl core (Altmann et al. (1995) J. Biol. Chem. 270:17344-17349). This trimannosyl core structure lacks the proper termini required for conversion of side chains to sialylated complex structures; therefore, suppression of the N-acetylglucosaminidase activity can reduce or eliminate the formation of these undesired oligosaccharide structures, as illustrated in Figure 17.
To reduce the N-acetylglucosaminidase activity in the target insect cell line(s), the invention provides vectors encoding N-acetylglucosaminidase or other glucosaminidase cDNAs in the antisense orientation and/or, vectors encoding ribozymes and/or, vectors containing sequences capable of "knocking out" the N- acetylglucosaminidase other glucosaminidase genes via homologous recombination. Expression plasmids described herein or otherwise known in the art are constructed using techniques known in the art to produce stably-transformed insect cells that constitutively express the antisense construct and/or ribozyme construct to suppress translation of N-acetylglucosaminidase other glucosaminidases or alternatively, to use homologous recombination techniques known in the art are to "knock-out" the N- acetylglucosaminidase other glucosaminidase genes. Particular sequences to be used in the antisense and/or ribozyme construction are described herein, for example, in Example 4. Techniques described herein or otherwise known in the art may be routinely applied to analyze N-linked oligosaccharide structures and to determine if N-glycan processing is altered and of the number of paucimannosidic structures in these cells is reduced.
Antisense technology can be used to control gene expression through antisense DNA or RNA or through triple-helix formation. Antisense techniques are discussed, for example, in Okano, j. Neurochem. 56: 560 (1991); "Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Antisense technology can be used to control gene expression through antisense DNA or RNA, or through triple-helix formation. Antisense techniques are discussed for example, in Okano, J., Neurochem. 56:560 (1991);
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Triple helix formation is discussed in, for instance Lee et al., Nucleic Acids Research 6: 3073 (1979); Cooney et al., Science 241 : 456 (1988); and Dervan et al., Science 251 : 1360 (1991). The methods are based on binding of a polynucleotide to a complementary DNA or RNA. For example, the 5' coding portion of a polynucleotide that encodes the amino terminal portion of N- acetylglucosammidase and/or other glucosaminidases may be used to design antisense RNA oligonucleotides of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription thereby preventing transcription and the production of N- acetylglucosaminidase and/or other glucosaminidases. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into N-acetylglucosaminidase and/or other glucosaminidase polypeptides. The oligonucleotides described above can also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of N- acetylglucosaminidase and/or other glucosaminidases. In one embodiment, the N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids of the invention are produced intracellulariy by transcription from an exogenous sequence. For example, a vector or a portion thereof, is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector would contain a sequence encoding a N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids. Such a vector can remain episomal or become chromosomally integrated, as long as it can be transcribed to produce the desired antisense RNA. Such vectors can be constructed by recombinant DNA technology methods standard in the art. Vectors can be plasmid, viral, or others know in the art, used for replication and expression in insect, yeast, mammalian, and plant cells. Expression of the sequences encoding N-acetylglucosaminidase and/or other glucosaminidases, or fragments thereof, can be by any promoter known in the art to act in insect, yeast, mammalian, and plant cells. Such promoters can be inducible or constitutive. Such promoters include, but are not limited to, the baculovirus polyhedrin promoter (Luckow et al. (1993) Curr. Opin. Biotech. 4:564-572, Luckow et al. (1995)), the SV40 early promoter region (Bernoist and Chambon, Nature 29:304-310 (1981), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., Cell 22:787-797 (1 80), the herpes thymidine promoter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445 (1981), the regulatory sequences of the metallothionein gene (Brinster, et al., Nature 296:39-42 (1982)), etc.
The antisense nucleic acids of the invention comprise sequences complementary to at least a portion of an RNA transcript of N-acetylglucosaminidase and/or other glucosaminidase genes. However, absolute complementarity, although preferred, is not required. A sequence "complementary to at least a portion of an RNA," referred to herein, means a sequence having sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case of double stranded N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids, a single strand of the duplex DNA may thus be tested, or triplex formation may be assayed. The ability to hybridize will depend on both the degree of complementarity and the length of the antisense nucleic acid Generally, the larger the hybridizing nucleic acid, the more base mismatches with a N-acetylglucosaminidase and/or other glucosaminidase RNAs it may contain and still form a stable duplex (or triplex as the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by use of standard procedures to determine the melting point of the hybridized complex.
Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5' untranslated sequence up to and including the AUG initiation codon, should work most efficiently at inhibiting translation. However, sequences complementary to the 3' untranslated sequences of mRNAs have been shown to be effective at inhibiting translation of mRNAs as well. See generally, Wagner, R., 1994, Nature 372:333-335. Thus, oligonucleotides complementary to either the 5'- or 3'- non- translated, non-coding regions of N-acetylglucosaminidase and/or other glucosaminidases, could be used in an antisense approach to inhibit translation of endogenous N-acetylglucosaminidase and/or other glucosaminidase mRNAs. Oligonucleotides complementary to the 5' untranslated region of the mRNA should include the complement of the AUG start codon. Antisense oligonucleotides complementary to mRNA coding regions are less efficient inhibitors of translation but could be used in accordance with the invention. Whether designed to hybridize to the 5'-, 3'- or coding region of N-acetylglucosaminidase and/or other glucosaminidase mRNAs, antisense nucleic acids should be at least six nucleotides in length, and are preferably oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 nucleotides or at least 50 nucleotides. The polynucleotides of the invention can be DNA or RNA or chimeric mixtures or derivatives or modified versions thereof, single-stranded or double- stranded. The oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve stability of the molecule, hybridization, etc. The oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 6556; Lemaitre et al., Proc. Natl. Acad. Sci. 84:648-652 (1987); PCT Publication No. WO88/09810, published December 15, 1988), or hybridization-triggered cleavage agents (See, e.g., Krol et al., BioTechniques 6:958-976 (1988)) or intercalating agents. (See, e.g., Zon, Pharm. Res. 5:539-549 (1988)). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.
The antisense oligonucleotide may comprise at least one modified base moiety which is selected from the group including, but not limited to, 5-fluorouracil, 5- bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1 -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5 -methyl-2 -thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. The antisense oligonucleotide may also comprise at least one modified sugar moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose, and hexose.
In yet another embodiment, the antisense oligonucleotide comprises at least one modified phosphate backbone selected from the group including, but not limited to, a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof.
In yet another embodiment, the antisense oligonucleotide is an alpha-anomeric oligonucleotide. An alpha -anomeric oligonucleotide forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual beta-units, the strands run parallel to each other (Gautier et al., Nucl. Acids Res. 15:6625-6641 (1987)). The oligonucleotide is a 2-0-methylribonucleotide (Inoue et al., Nucl. Acids Res. 15:6131-6148 (1987)), or a chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215:327-330 (1997)).
Polynucleotides of the invention may be synthesized by standard methods known in the art, e.g. by use of an automated DNA synthesizer (such as are commercially available from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides may be synthesized by the method of Stein et al. (Nucl. Acids Res. 16:3209 (1988)), methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451 (1988)), etc.
While antisense nucleotides complementary to the N-acetylglucosaminidase and/or other glucosaminidase coding region sequences could be used, those complementary to the transcribed untranslated region are most preferred.
Potential N-acetylglucosaminidase or other glucosaminidase activity suppressors according to the invention also include catalytic RNA, or a ribozyme (See, e.g., PCT International Publication WO 90/11364, published October 4, 1990; Sarver et al, Science 247:1222-1225 (1990). While ribozymes that cleave mRNA at site specific recognition sequences can be used to destroy N-acetylglucosaminidase and/or other glucosaminidase mRNAs, the use of hammerhead ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target mRNA have the following sequence of two bases: 5'- UG-3'. The construction and production of hammerhead ribozymes is well known in the art and is described more fully in Haseloff and Gerlach, Nature 334:585-591 (1988). Preferably, the ribozyme is engineered so that the cleavage recognition site is located near the 5' end of the N-acetylglucosaminidase and/or other glucosaminidase mRNAs; i.e., to increase efficiency and minimize the intracellular accumulation of non-functional mRNA transcripts.
As in the antisense approach, the ribozymes of the invention can be composed of modified oligonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to cells which express N-acetylglucosaminidase and/or other glucosaminidases in vivo. DNA constructs encoding the ribozyme may be introduced into the cell in the same manner as described above for the introduction of antisense encoding DNA. A preferred method of delivery involves using a DNA construct "encoding" the ribozyme under the control of a strong constitutive promoter, such as, for example, pol III or pol II promoter, so that transfected cells will produce sufficient quantities of the ribozyme to destroy endogenous N-acetylglucosaminidase and/or other glucosaminidase messages and inhibit translation. Since ribozymes unlike antisense molecules, are catalytic, a lower intracellular concentration is required for efficiency.
Endogenous gene expression can also be reduced by inactivating or "knocking out" the N-acetylglucosaminidase and/or other glucosaminidase gene and/or its promoter using targeted homologous recombination. (E.g., see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51 :503-512 (1987); Thompson et al., Cell 5:313-321 (1989); each of which is incoφorated by reference herein in its entirety). For example, a mutant, non-functional polynucleotide of the invention, or a completely unrelated DNA sequence (such as for example, a sialic acid synthetase) flanked by DNA homologous to the endogenous polynucleotide sequence (either the coding regions or regulatory regions of the gene) can be used, with or without a selectable marker and/or a negative selectable marker, to transfect cells that express polypeptides of the invention in vivo. In another embodiment, techniques known in the art are used to generate knockouts in cells that contain, but do not express the gene of interest. Insertion of the DNA construct, via targeted homologous recombination, results in inactivation of the targeted gene. Such approaches are particularly suited in research and agricultural fields where modifications to embryonic stem cells can be used to generate animal offspring with an inactive targeted gene (e.g., see Thomas & Capecchi 1987 and Thompson 1989, supra). The contents of each of the documents recited in this paragraph is herein incorporated by reference in its entirety.
The use of chemical inhibitors is also within the scope of the present invention, in addition to, or as an alternative to, the antisense approach, and/or the ribozyme approach, and/or the gene "knock-out" approach, as means for suppressing glucosaminidase activity in insect cell cultures. Chemical inhibitors that may be used to suppress glucosaminidase activity include, but are not limited to, 2-acetamido- l,2,5-trideoxy-l,5 amino-D-glucitol can limit the N-acetylglucosaminidase activity in insect cells (Legler et al. (1991) Biochim. Biophys. Acta 1080:80-95, Wagner et al. (1996) J. Virology 70:4103-4109). In addition, a number of other N- acetylglucosaminidase inhibitors may also be used according to the present invention, including, but not limited to, nagastatin (with a Ki value in the 10"8 range) and GlcNAc-oxime (Ki in 0.45-22 mM) which are commercially, publicly, or otherwise available for the puφoses of the present invention (Nishimura et al. (1996) Bioorg. Med. Chem. 4:91-96, Aoyagi et al. (1992) J. Antibiotics 45:1404-1408).
The chemical inhibitors mentioned above do not distinguish between lysosomal N-acetylglucosaminidase and the target membrane-bound N- acetylglucosaminidase activity in the secretory compartment. Thus, a more specific inhibitor, based on the substrate structure, is provided to serve not merely as a competitive inhibitor, but also as an affinity labeling reagent. The chemical structure for two possible chemical compounds with specificity for inhibiting membrane-bound glucosaminidase one or both of which may be used according to the present invention, are shown in Figure 19. Subsequent to expression and purification of the N- acetylglucosaminidase, the effectiveness of these inhibitors may be tested and compared in in vitro and/or in vivo trials using techniques described herein or otherwise known in the art. As above, these chemical inhibitors are then used in addition to, or as an alternative to, antisense suppression, ribozyme suppression, and/or gene knock-out mutagenesis, of glucosaminidase activity in insect cells.
It is recognized that the suppression of glucosaminidase activity alone may not lead to production of the desired acceptor carbohydrate, if the enzymes responsible for generating structures terminating in Gal are lacking in particular cell lines. Thus, according to the methods of the present invention, Gal T activity in insect cells can be increased significantly by using techniques described described herein or otherwise known in the art to express a heterologous gene using a baculovirus construct containing nucleic acid sequences encoding Gal T or a fragment or variant thereof, or by stably transforming the cells with a gene coding for Gal T or a fragment or variant thereof. If N-glycan analysis indicates that lower than a desired level of the acceptor substrates are present even following glucosaminidase suppression, techniques described herein or otherwise known in the art may be applied to express glycosyltransferase enzymes as needed in insect cells to produce a larger fraction of the desired acceptor structures. Figure 20 depicts that the overexpression of various glycosyltransferases leads to greater production of acceptor substrates.
Alternatively, the expression of glycosyltransferases will serve to limit generation of paucimannosidic structures by generating unacceptable glucosaminidase substrates terminating in Gal, or by competing against the glucosaminidase reaction (Wagner et al. , Glycobiology 6: 165-175 (1996)).
Thus, the invention comprises expression of glycosyltransferases combined with, or as an alternative to, suppression of N-acetylglucosaminidase activity in selected insect cell lines to produce desired quantities of carbohydrates containing the correct Gal (G) acceptor substrate for sialylation. Figure 21 illustrates, without limitation, three examples of acceptor N-glycan structures that comprise the terminal Gal acceptor residue required for subsequent sialylation. Other desired carbohydrates structures with a branch terminating Gal are also possible and are encompassed by the invention.
Baculovirus expression vectors containing the coding sequence for GlcNAc - Tl and -Til, and Gal T or fragments or variants thereof, and stable transfectants overexpressing GlcNAc-TI and GlcNAc-TII, and Gal T, or fragments or variants thereof are known, can be routinely generated using techniques known in the art, and are commercially, publicly, or otherwise available for the puφoses of this invention. (See Jarvis et al. (1996) Nature Biotech. 14:1288-1292; Hollister et al. (1998) Glycobiology 8: 473-480; the contents of which are herein incoφorated by reference). In addition, stable transfectants expressing GlcNAc-TI and GlcNAc-TII can be routinely generated using techniques known in the art, if overexpression proves desirable.
Production and delivery of the Donor Substrate: CMP-Sialic Acid (CMP-SA') For production of the donor substrate, CMP-SA, the invention provides methods and compositions comprising expression of limiting enzymes in the CMP- SA production pathway; in addition, or as an alternative to, the feeding of precursor substrates.
To produce sialylated N-linked glycoproteins, the donor substrate, CMP-sialic acid (CMP-SA), must be synthesized. The structure of CMP-SA is shown in Figure 22. CMP-SA can be enzymatically synthesized from glucose or other simple sugars, glutamine, and nucleotides in mammalian cells and E coli using the metabolic pathways shown in Figure 5, and as described in Ferwerda et al. (1983) Biochem. J. 216:87-92; Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400; Schachter et al. (1973) Metabolic Conjugation and Metabolic Hydrolysis (New York Academic Press) 2-135.
In some mammalian tissues and cell lines, the production and delivery of CMP-SA limits the sialylation capacity of these cells (Gu et al. (1997) Improvement of the inter feron-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine). This problem is likely to be amplified in insect cells since negligible sialic acid levels are detected in Trichoplusia ni insect cells as compared to levels in Chinese Hamster Ovary (CHO) mammalian cells (Figure 16). Furthermore, negligible CMP-SA was observed in Sf-9 and Εa-4 insect cells when compared to CHO cells (Hooker et al. (1997) Monitoring the Glycosylation Pathway of Recombinant Human Interferon-Gamma Produced by Animal Cells, European Workshop on Animal Cell Engineering, Costa Brava, Spain; and Jenkins (1998) Restructuring the Carbohydrates of Recombinant Glycoproteins, Cell Culture Engineering VI, San Diego, CA). These findings are relevant in light of the previously published observation that polysialic acid can be detected in Drosophila embryos (Roth et al. (1992) Science 256:673-675) and the observation of sialylated glycoproteins produced by other insect cells (Davidson et al. (1990) Biochemistry 29:5584-5590). Production of sialic acid (S A), more specifically N-acetylneuraminic acid
(NeuAc), from the precursor substrate ManNAc can proceed through three alternative pathways shown in Figure 5. The principal pathway for the production of SA in E coli and other bacteria utilizes the phosphoenylpyruvate (PEP) and ManNAc to produce sialic acids in the presence of sialic acid synthetase (Vann et al. (1997) Glycobiology 7:697-701). A second pathway, observed in bacteria and mammals, involves the reversible conversion by aldolase (also named N-acetylneuraminate lyase) of ManNAc and pyruvate to sialic acid (Schachter et al. (1973) Metabolic Conjugation and metabolic Hydrolysis (New York Academic Press) 2-135, Lilley et al. (1992) Prot. Expr. and Pur. 3:434-440). The aldolation reaction equilibrates toward ManNAc but can be manipulated to favor the production of sialic acid by the addition of excess ManNAc or pyruvate in vitro (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400). The third pathway, observed only in mammalian tissue, begins with the ATP driven phosphorylation of ManNAc, and is followed by the enzymatic conversion of phosphorylated ManNAc to a phosphorylated form of sialic acid, from which the phosphate is removed in a subsequent step (van Rinsum et al. (1983) Biochem. J. 210:21-28, Schachter et al. (1973) Metabolic Conjugation and metabolic Hydrolysis (New York Academic Press) 2-135).
According to one embodiment of the invention, to overcome intracellular limitations of CMP-SA in mammalian cells, feeding of alternative precursor substrates may be applied to eliminate or reduce the need to produce CMP-SA from simple sugars (see Example 6). Since CMP-SA and its direct precursor, SA, are not permeable to cell membranes (Bennetts et al. (1981) J Cell. Biol. 88:1-15), these substrates cannot be added to the culture medium for uptake by the cell. However, other precursors, including N-acetylmannosamine (ManNAc), glucosamine, and N- acetylglucosamine (GlcNAc) when added to the culture medium are absorbed into mammalian cells (see Example 6). See, for example, Gu et al. (1997) Improvement of tbe inter feron-gamma sialylation in Chinese hamster ovary cell culture by feeding N- acetylmannosamine, Zanghi et al. (1997) European Workshop on Animal Cell Engineering, Ferwerda et al. (1983) Biochem. J. 216:87-92, Kohn et al. (1962) J. Biol. Chem. 237:304-308, Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43, Bennetts et al. (1981) J. Cell. Biol. 88:1-15. The substrates are then enzymatically converted to CMP-SA and incoφorated into homologous and heterologous glycoproteins (Gu et al. (1997) Improvement of the interferon-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine, Ferwerda et al. (1983) Biochem. J. 216:87-92, Kohn et al. (1962) J. Biol. Chem. 237:304-308, Bennetts et al. (1981) J. Cell Biol. 88:1-15).
To be incoφorated into oligosaccharides, sialic acid and cytidine triphosphate (CTP) must be converted to CMP-SA by the enzyme, CMP-sialic acid (CMP-SA) synthetase (Schachter et al. (1973) Metabolic Conjugation and metabolic Hydrolysis (New York Academic Press) 2-135):
Sialic Acid + CTP →CMP-SA + PPi
This enzyme has been cloned and sequenced from E. coli and used for the in vitro production of CMP-SA, as described in Zapata et al. (1989) J. Biol. Chem. 264:14769-14774, Kittleman et al. (1995) Appl. Microbiol. Biotechnol. 44:59-67, Ichikawa et al. (1992) Anal. Biochem. 202:215-238, Shames et al. (1991) Glycobiology 1:187-191; the contents of which are herein incoφorated by reference).
In eukaryotes, the activated sugar nucleotide, CMP-SA, must be transported into the Golgi lumen for sialylation to proceed (Deutscher et al. (1984) Cell 39:295- 299). Transport through the trans-Golgi membrane is facilitated by the CMP-SA transporter protein, which was identified by complementation cloning into sialylation deficient CHO cells (Εckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572- 7576). This mammalian gene has also been cloned and expressed in a functional form in the heterologous host, S. cerevisiae (Bernisone et al. (1997) J Biol. Chem. 272:12616-12619).
In addition to feeding of external precursor substrates such as ManNAc, GlcNAc, or glucosamine to increase CMP-SA levels, a supplementary approach in which CMP-SA transporter genes are introduced and expressed using routine recombinant DNA techniques may also be employed according to the methods of the present invention. These techniques are optionally combined with ManNAc, GlcNAc, or glucosamine feeding strategies described above, to maximize CMP-SA production.
Conversion of GlcNAc or glucosamine to ManNAc
Also according to the methods of the present invention, where the utilization of GlcNAc or glucosamine is preferred and ManNAc is not generated naturally in insect cells, ManNAc can be produced chemically using sodium hydroxide (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400). Alternatively, the enzymes that convert these substrates to ManNAc or fragments or variants of these enzymes, can be expressed in insect cells using techniques described herein or otherwise known in the art. The production of ManNAc from GlcNAc and glucosamine proceeds through the metabolic pathway shown in Figure 23.
Two approaches are provided to accomplish this conversion: (a) direct epimerization of GlcNAc; or (b) conversion of GlcNAc or glucosamine to UDP-N- acetylglucosamine (UDP-GlcNAc), and then ManNAc. According to one embodiment of the invention, approach (a) is achieved using the gene encoding a GlcNAc-2- epimerase isolated from pig kidney, or fragments or variants thereof, to directly convert GlcNAc to ManNAc (See Maru et al. (1996) J Biol. Chem. 271 :16294- 16299; the contents of which are herein incoφorated by reference). Additionally, the sequence for a homologue of this enzyme can be routinely obtained from bioinformatics databases, and cloned into baculovirus vectors, or stably integrated into insect cells using techniques described herein or otherwise known in the art. Alternatively, approach (b) requires insertion of the gene to convert UDP- GlcNAc to ManNAc. Engineering the production of UDP-GlcNAc from glucosamine or GlcNAc is likely not required since most insect cells comprise metabolic pathways to synthesize UDP-GlcNAc; as indicated by the presence of GlcNAc-containing oligosaccharides. According to one embodiment of the invention, the gene encoding a rat bifunctional enzyme coding for conversion of UDP-GlcNAc to ManNAc and ManNAc to ManNAc-6-P, or fragments or variants thereof is used to engineer the production of UDP-GlcNAc using techniques described herein or otherwise known in the art (Stasche et al. (1997) J. Biol. Chem. 272:24319-24324, the contents which are herein incoφorated by reference). In a specific embodiment, the segment of this enzyme responsible for conversion of UDP-GlNAc to ManNAc may be expressed independently in insect cells using techniques known in the art to produce ManNAc rather than ManNAc-6-P.
Conversion of ManNAc to SA
Once ManNAc is generated, it is converted to SA according to the methods of the invention. There are three possible metabolic pathways for the conversion of ManNAc to SA in bacteria and mammals, as shown in Figure 24. Negligible SA levels have previously been observed in insect cells (in the absence of exogenous supplementation of ManNAc to the culture media).
The conversion of ManNAc and PEP to S A using sialic acid synthetase is the predominant pathway for SA production in E. coli (Vann et al. (1997) Glycobiology 7:697-701). The E. coli sialic acid (SA) synthetase gene NeuB (SEQ ID NO:7 and 8) has been cloned and sequenced and is commercially, publicly, and/or otherwise available for the puφoses of the present invention. Additionally, as disclosed herein, the human sialic acid synthetase gene has also been cloned (cDNA clone HA5AA37), sequenced, and deposited with the American Type Culture Collection ("ATCC") on
February 24, 2000 and was given the ATCC Deposit Number . (The
ATCC is located at 10801 University Boulevard, Manassas, VA 20110-2209, USA. ATCC deposits were made pursuant to the terms of the Budapest Treaty on the international recognition of the deposit of microorganisms for puφoses of patent procedure.) Thus, for enhancing expression of SA synthetase according to certain embodiments of the invention, the nucleic acid compositions encoding a SA synthetase such as, for example, an E.coli and/or human sialic acid synthetase and/or a fragment or variant thereof, may be inserted into a host expression vector or into the host genome using techniques described herein or otherwise known in the art. According to the methods of the invention, the production of SA can also be achieved from ManNAc and pyruvate using an aldolase, such as, for example, bacterial aldolase (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400), or a human aldolase (as described herein) or fragment or variant thereof. The human aldolase gene has been cloned (cDNA clone HDPAK85), sequenced, and deposited with the American Type Culture Collection ("ATCC") on February 24, 2000 and was given the ATCC Deposit Number . Thus, the aldolase enzyme is considered as an alternative for converting ManNAc to SA. For enhancing expression of aldolase, the aldolase sequences can be amplified directly from E coli and human DNA using primers and PCR amplification as described in Mahmoudian et al. (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400); the contents of which are herein incoφorated by reference) and herein, and using techniques described herein or otherwise known in the art to enhance expression of aldolase, or a fragment or variant thereof. Since the aldolase reaction is reversible, high levels of added ManNAc and pyruvate, may be used according to the methods of the invention to drive this reversible reaction in the direction of the product SA (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400). In addition to the pathways which convert ManNAc to S A present in both prokaryotes and eukaryotes, an exclusively eukaryotic pathway may also employed according to the methods of the invention to convert ManNAc to SA through the phosphate intermediates ManNAc-6-phosphate and SA-9-phosphate. It is recognized that the mammalian enzymes (synthetase and phosphatase) responsible for converting ManNAc to SA through phosphate intermediates can be utilized for engineering this eukaryotic pathway into insect cells.
Conversion ofSA to CMP-SA
The methods of the invention also encompass the use of CMP-SA synthetase to enzymatically converts SA to CMP-SA (see, e.g., the reaction shown in Figure 25). However, insect cells, such as, for example, Sf9 insect cells, have negligible endogenous CMP-SA synthetase activity. Evidence of limited CMP-SA synthetase in insect cells is also demonstrated by increased SA levels found following substrate feeding and genetic manipulation without a concomitant increase in CMP-SA. Thus, specific embodiments of the invention provide methods for enhancing the expression of CMP-SA synthetase, and/or fragments or variants thereof. Bacterial CMP-SA synthetase has been cloned and sequenced as described in Zapata et al. (1989) J. Biol. Chem. 264:14769-14774; the contents of which are herein incoφorated by reference. Additionally, as described herein the gene encoding human CMP-SA synthetase has also been cloned (cDNA clone HWLLM34), sequenced and deposited with the American Type Culture Collection ("ATCC") on February 24, 2000 and was given the ATCC Deposit Number . Thus, in specific embodiments, the methods of the present invention provide for enhancing expression of bacterial or human CMP-SA synthetase or fragments, or variants thereof, in cells of interest, such as, for example, in insect cells, using techniques described herein, or otherwise known in the art.
Golgi transport of CMP-SA
CMP-SA must be delivered into the Golgi apparatus in order for sialylation to occur, and this transport process depends on the presence of the CMP-SA transporter protein (Deutscher et al. (1984) Cell 39:295-299). To determine if CMP-SA synthesized in insect cells is efficiently transported into the proper cellular compartment, insect cell vesicles are prepared and transport of CMP-SA is measured as described in (Bernisone et al. (1997) J Biol. Chem. 272:12616-12619) and/or using techniques otherwise known in the art. Where the native enzymatic transport is lower than desired, a transporter enzyme is cloned and expressed in insect cells using the known mammalian gene sequence (as described in Bernisone et al. (1997) J. Biol. Chem. 272:12616-12619, Eckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572- 7576; the contents of which are herein incoφorated by reference) and/or sequences otherwise known in the art. Corresponding sequences are available from bioinformatics databases for the puφoses of this invention. Localization of the protein to the Golgi is evaluated using an antibody generated against the heterologous protein using techniques known in the art in concert with commercially available fluorescent probes that identify the Golgi apparatus.
Expression cloning of multiple transcripts (for example, transcripts encoding CMP-SA pathway enzymes, glycosyl transferases, and ribozymes or anti-sense RNAs to suppress hexosaminidases) in a single cell line using techniques known in the art may be required to bring about the desired sialylation reactions and/or to optimize these reactions. Alternatively, co-infection of cells with multiple viruses using techniques known in the art can also be used to simultaneously produce multiple recombinant transcripts. In addition, plasmids that incoφorate multiple foreign genes including some under the control of the early promoter IE1 are commercially, publicly, or otherwise available for the puφoses of the invention, and can be used to create baculovirus constructs. The present invention encompasses using any of these techniques. The invention also encompasses using the above mentioned types of vectors to enable expression of desired carbohydrate processing enzymes in baculovirus infected insect cells prior to production of a heterologous glycoprotein of interest under control of the very late polyhedrin promoter. In this manner, once the desired polypeptide is synthesized essential N-glycan processing enzymes can facilitate N-glycan processing once the glycoprotein of interest.
Alternatively, genes for some of the enzymes may be incoφorated directly into the insect cell genome using vectors known in the art, such as, for example, vectors similar to those described in (Jarvis et al. (1990) Bio/Technology 8:950-955, Jarvis et al. (1995) Baculovirus Expr. Protocols ed. 39:187-202). Genomic integration eliminates the need to infect the cells with a large number of viral constructs. These constructs for genomic integration contain one or more early viral promoters, including Ac NPV IE1 and 39K, which provide constitutive expression in transfected insect cells (Jarvis et al. (1990) Bio/Technology 8:950-955). In addition, a sequential transformation strategy may routinely be developed for producing stable transformants that constitutively express up to four different heterologous genes simultaneously. These vectors and transformation techniques are provided for the puφoses of this invention. In this manner, incoφoration of plasmids containing heterologous genes into the insect cell genome combined with baculovirus infection integrates the metabolic pathways leading to efficient acceptor and donor substrate production in insect cells.
Generation of N-linked sialylated glycoproteins
The final step in the generation of sialylated glycoproteins or glycolipids in mammalian cells is the enzymatic transfer of sialic acid from the donor substrate, CMP-SA, onto an acceptor substrate in the Golgi apparatus; a reaction which is catalyzed by sialyltransferase. The sialic acid (SA) residues occurring in N-linked glycoproteins are alpha-linked to the 3 or 6 position of the Gal GlcNAc sugars (Tsuji, S. (1996) J. Biochem. 120:1-13). The SA < bα2-3GalGlcNAc linkage is found in heterologous glycoproteins expressed by CHO and human cells and the SA alpha2- 6GalGlcNAc linkage is found in many human glycoproteins (Goochee et al. (1991) Bio/technology 9: 1347-1355). The alpha2-3- and/or α/pbα2-6-sialyltransferase genes along with a number of other sialyltransferase genes have been cloned, sequenced and expressed as active heterologous proteins as described in Lee et al. (1989) J Biol. Chem. 264:13848-13855, Ichikawa et α/. (1992) Anal. Biochem. 202:215-238, Tsuji, S. (1996) J. Biochem. 120:1-13; U.S. Patent No. 5,047,335, the contents of which are herein incoφorated by reference. Any one or more of these genes, as well as fragments, and/or variants thereof may be introduced and expressed in cells of interest using techniques described herein or otherwise known in the art, and may be used according to the methods of the present invention to enhance the enzymatic transfer of sialic acid from the donor substrate. For generating N-Linked sialylated glycoproteins in insect cells, once the donor (CMP-SA) and acceptor (GalGlcNAc-R) substrates are produced as described above, the methods of the invention further comprise expression of a sialyltransferase or fragment or variant thereof, in the cells. The completion of the sialylation reaction can be verified by elucidating the N-glycan structures attached to a desired glycoprotein using techniques described herein or otherwise known in the art. It is recognized that evaluation of N-glycans attachments may also suggest additional metabolic engineering strategies that can further enhance the level of sialylation in insect cells.
It is observed that unmodified T. ni insect cell lysates failed to generate any sialylated compounds when incubated with the substrate, LacMU, and the nucleotide sugar, CMP-SA. Thus, it is concluded that these cells comprise negligible native sialyltransferase activity. However, infection of insect cells with a baculovirus containing alpha2,3 sialyltransferase provided significant enzymatic conversion of LacMU and CMP-SA to sialylLacMU. For the puφoses of the invention, heterologous sialyltransferase can be expressed using techniques described herein or otherwise known in the art either by co-infection with a virus coding for sialyltransferase, or fragment, or variant thereof, or by using stable transfectants expressing the enzyme. In addition to the 2,3 sialyltransferase baculovirus constructs, baculovirus vectors comprising sequences coding for alpha2,6 sialyltransferase and/or fragments or variants thereof as well as stably transformed insect cells stably expressing both gal T and sialyltransferase are commercially, or publicly available, and/or may routinely be generated using techniques described herein or otherwise known in the art. Evaluation of sialyltransferase activity is determined using the FRET or HPLC assays described herein and/or using other assays known in the art. Localization of the sialyltransferase to the Golgi is accomplished using anti- sialyltransferase antibodies commercially, publicly, or otherwise available for the puφose of this invention in concert with Golgi specific marker proteins.
For the puφoses of enhancing carbohydrate processing enzymes of the invention, suppressing activity of endogenous N-acetylglucosaminidase, expressing heterologous proteins in the cells of the invention, and constructing vectors for the puφoses of the invention; genetic engineering methods are known to those of ordinary skill in the art. For example, see Schneider, A. et al, (1998) Mol. Gen.
Genet. 257:308-318. Where the invention encompasses utilizing baculovirus based expression, such methods are known in the art, for example, as described in O'Riley et al. (1992) Baculovirus Expression Vectors, W.H. Freeman and Company, New York 1992. For the puφoses of enhancing carbohydrate processing enzymes of the invention, suppressing activity of endogenous N-acetylglucosaminidase, expressing heterologous proteins in the cells of the invention, and constructing vectors as described herein, known sequences can be utilized in the methods of the invention, including but not limited to the sequences described in GenSeq accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases (see also United States Patent Number 5,955,282; the contents of which are herein incoφorated by reference); and/or in Genbank accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E. coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E. coli SA synthetase, X17247 for human 2,6 sialyltransferase, L29553 for human 2,3 sialyltransferase, M13214 for bovine galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human GlcNAc T-II, D87969 for human CMP-SA transporter, and S95936 for human transferrin; and fragments or variants of the enzymes that display one or more of the biological activities of the enzymes (such biological activities may routinely be assayed using techniques described herein or otherwise known in the art). The sequences described above are readily accessible using the provided accession number in the NCBI Entrez database, known to the person of ordinary skill in the art.
Thus, one aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding a biologically active fragment or variant of the polypeptide having the amino acid sequence described in GenSeq accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases; and/or in Genbank accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E. coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E. coli SA synthetase, XI 7247 for human 2,6 sialyltransferase, L29553 for human 2,3 sialyltransferase, Ml 3214 for bovine galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human GlcNAc T-II, D87969 for human CMP-SA transporter, and/or S95936 for human transferrin; (b) nucleotide sequences encoding an antigenic fragment of the polypeptide having the amino acid sequence described in GenSeq accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases (see also United States Patent Number 5,955,282; the contents of which are herein incoφorated by reference); and/or in Genbank accession No. D83766 for GlcNAc -2-epimerase, Y07744 for the bifunctional rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E coli SA synthetase, XI 7247 for human 2,6 sialyltransferase, L29553 for human 2,3 sialyltransferase, M13214 for bovine galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human GlcNAc T-II, D87969 for human CMP-SA transporter, and/or S95936 for human transferrin; and (c) nucleotide sequences complementary to any of the nucleotide sequences in (a) or (b), above. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the above nucleotide sequences, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide that is complementary to any of the above nucleotide sequences. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Preferably, the nucleic acid sequences (including fragments or variants) that may be used according to the methods of the present invention encode a polypeptide having a biological activity. Such biological activity may routinely be assayed using techniques described herein or otherwise known in the art.
In addition to the sequences described above, the nucleotide sequences and amino acid sequences disclosed in Figures 27-32, and fragments and variants of these sequences may also be used according to the methods of the invention.
In one embodiment, specific enzyme polypeptides comprise the amino acid sequences shown in Figures 28, 30 and 32; or otherwise described herein. However, the invention also encompasses sequence variants of the polypeptide sequences shown in Figures 28, 30 and 32.
In a specific embodiment, one, two, three, four, five or more human polynucleotide sequences, or fragments, or variants thereof, and/or the polypeptides encoded thereby, are used according to the methods of the present invention to convert ManNAc to SA (see Example 6). Such polynucleotide and polypeptide sequences include, but are not limited to, sequences corresponding to human aldolase (SEQ ID NO:l and SEQ ID NO:2), human CMP-SA synthetase (SEQ ID NO:3 and SEQ ID NO:4), and human SA synthetase (SEQ ID NO:5 and SEQ ID NO:6); see also Figures 27 - 32. Thus, in certain embodiments the methods of present invention include the use of one or more novel isolated nucleic acid molecules comprising polynucleotides encoding polypeptides important to intracellular carbohydrate processing in humans. Such polynucleotide sequences include those disclosed in the figures and/or Sequence Listing and/or encoded by the human cDNA plasmids (Human CMP-Sialic Acid Synthetase, cDNA clone HWLLM34; Human Sialic Acid Synthetase, cDNA clone HA5AA37; and Human Aldolase cDNA clone HDPAK85) deposited with the American Type Culture Collection (ATCC) on February 24, 2000 and receiving accession numbers . The present invention further includes the use of polypeptides encoded by these polynucleotides. The present invention also provides for use of isolated nucleic acid molecules encoding fragments and variants of these polypeptides, and for the polypeptides encoded by these nucleic acids. Thus, one aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding human aldolase having the amino acid sequences as shown in SEQ ID NO:2; (b) nucleotide sequences encoding a biologically active fragment of the human aldolase polypeptide having the amino acid sequence shown in SEQ ID NO:2; (c) nucleotide sequences encoding an antigenic fragment of the human aldolase polypeptide having the amino acid sequence shown in SEQ ID NO:2; (d) nucleotide sequences encoding the human aldolase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human aldolase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (f) a nucleotide sequence encoding an antigenic fragment of the human aldolase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; and (g) nucleotide sequences complementary to any of the nucleotide sequences in (a) through (f), above. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g), above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Another aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of : (a) nucleotide sequences encoding human CMP-SA synthetase having the amino acid sequences as shown in SEQ ID NO:4; (b) nucleotide sequences encoding a biologically active fragment of human CMP-SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:4; (c) nucleotide sequences encoding an antigenic fragment of the human CMP-SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:4; (d) nucleotide sequences encoding the human CMP-SA synthetase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human CMP-SA synthetase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (f) a nucleotide sequence encoding an antigenic fragment of the human CMP-SA synthetase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; and (g) nucleotide sequences complementary to any of the nucleotide sequences in (a) through (f), above. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99%) identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Another aspect of the invention provides for use of isolated nucleic acid molecules comprising polynucleotides having nucleotide sequences selected from the group consisting of: (a) nucleotide sequences encoding human SA synthetase having the amino acid sequences as shown in SEQ ID NO:6; (b) nucleotide sequences encoding a biologically active fragment of the human SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:6; (c) nucleotide sequences encoding an antigenic fragment of the human SA synthetase polypeptide having the amino acid sequence shown in SEQ ID NO:6; (d) nucleotide sequences encoding the human SA synthetase polypeptide comprising the complete amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the human SA synthetase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; (f) a nucleotide sequence encoding an antigenic fragment of the human SA synthetase polypeptide having the amino acid sequence encoded by the plasmid contained in the ATCC Deposit; and (g) nucleotide sequences complementary to any of the nucleotide sequences in (a) through (f), above. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention. Further embodiments of the invention include use of isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 80%), 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Polypeptides encoded by such nucleic acids may also be used according to the methods of the present invention.
By a nucleic acid having a nucleotide sequence at least, for example, 95% "identical" to a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence of the nucleic acid is identical to the reference sequence except that the nucleotide sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the described polypeptide. In other words, to obtain a nucleic acid having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence. The query sequence may be an entire sequence, such as, for example, that shown of SEQ ID NO:l, the ORF (open reading frame), or any fragment as described herein.
As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention can be determined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245.) In a sequence alignment the query and subject sequences are both DNA sequences. An RNA sequence can be compared by converting U's to T's. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l , Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for 5' and 3' truncations of the subject sequence when calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by calculating the number of bases of the query sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score is what is used for the puφoses of the present invention. Only bases outside the 5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for the puφoses of manually adjusting the percent identity score.
For example, a 90 base subject sequence is aligned to a 100 base query sequence to determine percent identity. The deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases at 5' end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5' and 3' ends not matched/total number of bases in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 bases were perfectly matched the final percent identity would be 90%. In another example, a 90 base subject sequence is compared with a 100 base query sequence. This time the deletions are internal deletions so that there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only bases 5' and 3' of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are to made for the puφoses of the present invention.
By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to a query amino acid sequence of the present invention, it is intended that the amino acid sequence of the subject polypeptide is identical to the query sequence except that the subject polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, deleted (indels) or substituted with another amino acid. These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence. As a practical matter, whether any particular polypeptide is at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to, for example, the amino acid sequences of SEQ ID NO:2 or to the amino acid sequence encoded by the cDNA contained in a deposited clone can be determined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245(1990)). In a sequence alignment the query and subject sequences are either both nucleotide sequences or both amino acid sequences. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence due to N- or C- terminal deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the puφoses of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the puφoses of manually adjusting the percent identity score. That is, only query residue positions outside the farthest N- and C-terminal residues of the subject sequence. For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query sequence to determine percent identity. The deletion occurs at the N- terminus of the subject sequence and therefore, the FASTDB alignment does not show a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched the final percent identity would be 90%). In another example, a 90 residue subject sequence is compared with a 100 residue query sequence. This time the deletions are internal deletions so there are no residues at the N- or C-termini of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are to made for the puφoses of the present invention.
In another embodiment of the invention, to determine the percent homology of two amino acid sequences, or of two nucleic acids, the sequences are aligned for optimal comparison puφoses (e.g., gaps can be introduced in the sequence of one protein or nucleic acid for optimal alignment with the other protein or nucleic acid). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in one sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the other sequence, then the molecules are homologous at that position. As used herein, amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity". The percent homology between the two sequences is a function of the number of identical positions shared by the sequences (i.e., per cent homology equals the number of identical positions/total number of positions times 100).
Variants of above described sequences include a substantially homologous protein encoded by the same genetic locus in an organism, i.e., an allelic variant.
Variants also encompass proteins derived from other genetic loci in an organism, but having substantial homology to the proteins of Figures 27-32, or otherwise described WO 00/52135 g PCT/USOO/05313
herein. Variants also include proteins substantially homologous to the protein but derived from another organism, i.e., an ortholog. Variants also include proteins that are substantially homologous to the proteins that are produced by chemical synthesis. Variants also include proteins that are substantially homologous to the proteins that are produced by recombinant methods. As used herein, two proteins (or a region of the proteins) are substantially homologous when the amino acid sequences are at least about 55-60%), typically at least about 70-75%), more typically at least about 80-85%), and most typically at least about 90-95% or more homologous. A substantially homologous amino acid sequence, according to the present invention, will be encoded by a nucleic acid sequence hybridizing to the nucleic acid sequence, or portion thereof, of the sequence shown in Figures 27, 28, 31 or otherwise described herein under stringent conditions as more fully described below.
Orthologs, homologs, and allelic variants that are encompassed by the invention and that may be used according to the methods of the invention can be identified using methods well known in the art. These variants comprise a nucleotide sequence encoding a protein that is at least about 55%, typically at least about 70- 75%, more typically at least about 80-85%), and most typically at least about 90-95% or more homologous to the nucleotide sequence shown in Figures 27, 29, 31, or otherwise described herein, or a fragment of this sequence. Such nucleic acid molecules can readily be identified as being able to hybridize under stringent conditions, to the nucleotide sequence shown in Figures 27, 29, 31, or complementary sequence thereto, or otherwise described herein, or a fragment of the sequence. It is understood that stringent hybridization does not indicate substantial homology where it is due to general homology, such as poly A sequences, or sequences common to all or most proteins in an organism or class of proteins.
The invention also encompasses polypeptides having a lower degree of identity but having sufficient similarity so as to perform one or more of the same functions performed by the enzyme polypeptides described herein. Similarity is determined by conserved amino acid substitution. Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics (see Table 1). Conservative substitutions are likely to be phenotypically silent. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. Guidance concerning which amino acid changes are likely to be phenotypically silent are found in Bowie et al, Science 247:1306-1310 (1990). TABLE 1. Conservative Amino Acid Substitutions.
Figure imgf000051_0001
Both identity and similarity can be readily calculated (Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988;
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). Preferred computer program methods to determine identify and similarity between two sequences include, but are not limited to, GCG program package (Devereux, J. (1984) Nuc. Acids Res. 12(1):3S7), BLASTP, BLASTN, FASTA (Atschul, S.F. (1990) J. Molec. Biol. 215:403).
A variant polypeptide can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these. Variant polypeptides can be fully functional or can lack function in one or more activities. Thus, in the present case, variations can affect the function, for example, of one or more of the modules, domains, or functional subregions of the enzyme polypeptides of the invention. Preferably, polypeptide variants and fragments have the described activities routinely assayed via bioassays described herein or otherwise known in the art.
Fully functional variants typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional variants can also contain substitution of similar amino acids, which result in no change or an insignificant change in function. Alternatively, such substitutions may positively or negatively affect function to some degree.
Non- functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region. As indicated, variants can be naturally-occurring or can be made by recombinant means or chemical synthesis to provide useful and novel characteristics for the polypeptide.
Amino acids that are essential for function can be identified by methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al, Science 244:1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity. Sites that are critical can also be determined by structural analysis such as crystallization, nuclear magnetic resonance or photoaffmity labeling (Smith et al, J. Mol. Biol. 224:899-904 (1992); de Vos et al. Science 255:306-312 (1992)).
The invention further encompasses variant polynucleotides, and fragments thereof, that differ from the nucleotide sequence, such as, for example, those shown in Figures 27, 29, 31 or otherwise described herein, due to degeneracy of the genetic code and thus encode the same protein as that encoded by the nucleotide sequence shown in Figures 27, 29, 31 or otherwise described herein.
The invention also provides nucleic acid molecules encoding the variant polypeptides described herein. Such polynucleotides may be naturally occurring, such as allelic variants (same locus), homologs (different locus), and orthologs (different organism), or may be constructed by recombinant DNA methods or by chemical synthesis. Such non-naturally occurring variants may be made by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. Accordingly, as discussed above, the variants can contain nucleotide substitutions, deletions, inversions and insertions.
Variation can occur in either or both the coding and non-coding regions. The variations can produce both conservative and non-conservative amino acid substitutions.
"Polynucleotides" or "nucleic acids" that may be used according to the methods of the invention also include those polynucleotides capable of hybridizing, under stringent hybridization conditions, to sequences contained in SEQ ID NO:l, the complement thereof, or a cDNA within the deposited plasmids. As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for hybridization and washing under which nucleotide sequences encoding a receptor at least 55% homologous to each other typically remain hybridized to each other. The conditions can be such that sequences at least about 65%, at least about 70%, or at least about 75% or more homologous to each other typically remain hybridized to each other. Such stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45degrees C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50-65 degrees C. Also contemplated for use according to the methods of the invention are nucleic acid molecules that hybridize to a polynucleotide disclosed herein under lower stringency hybridization conditions. Changes in the stringency of hybridization and signal detection are primarily accomplished through the manipulation of formamide concentration (lower percentages of formamide result in lowered stringency); salt conditions, or temperature. For example, lower stringency conditions include an overnight incubation at 37 degree C in a solution comprising 6X SSPE (2 OX SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; followed by washes at 50 degree C with IXSSPE, 0.1% SDS. In addition, to achieve even lower stringency, washes performed following stringent hybridization can be done at higher salt concentrations (e.g. 5X SSC).
Note that variations in the above conditions may be accomplished through the inclusion and/or substitution of alternate blocking reagents used to suppress background in hybridization experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary formulations. The inclusion of specific blocking reagents may require modification of the hybridization conditions described above, due to problems with compatibility. Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary stretch of T (or U) residues, would not be included in the definition of "polynucleotide," since such a polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) stretch or the complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo-dT as a primer).
In one embodiment, an isolated nucleic acid molecule that hybridizes under stringent conditions to a sequence disclosed herein, or the complement thereof, such as, for example, the sequence of Figures 27, 29, 31, corresponds to a naturally- occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). The present invention also encompasses recombinant vectors, which include the isolated nucleic acid molecules and polynucleotides that may be used according to the methods of the present invention, and to host cells containing the recombinant vectors and/or nucleic acid molecules, as well as to methods of making such vectors and host cells and for using them for production of glycosylation enzyme by recombinant techniques. Polypeptides produced by such methods are also provided.
The invention encompasses utilizing vectors for the maintenance (cloning vectors) or vectors for expression (expression vectors) of the desired polynucleotides encoding the carbohydrate processing of the invention, or those encoding proteins to be sialylated by the methods of the invention and/or by expression of the proteins the cells of the invention. The vectors can function in prokaryotic or eukaryotic cells or in both (shuttle vectors).
In one embodiment, one or more of the polynucleotide sequences used according to the methods of the invention are inserted into commercially, publicly, or otherwise available baculovirus expression vectors for enhanced expression of the corresponding enzyme. In another non-exclusive embodiment, one ore more of the polynucleotides used according to the methods of the invention are inserted into other viral vectors or for generation of stable insect cell lines. Techniques known in the art, such as, for example, HP AEC and HPLC techniques, may be routinely used to evaluate the enzymatic activity of these enzymes from both eukaryotic and bacterial sources to determine which source is best for generating SA in insect cells.
Generally, expression vectors contain cis-acting regulatory regions that are operably linked in the vector to the polynucleotide to be expressed, or other relevant polynucleotides such that transcription of the polynucleotides is allowed in a host cell. The polynucleotides can be introduced into the host cell with a separate polynucleotide capable of affecting transcription. Thus, the second polynucleotide may provide a trans-acting factor interacting with the cis-regulatory control region to allow transcription of the polynucleotides from the vector. Alternatively, a transacting factor may be supplied by the host cell. Finally, a trans-acting factor can be produced from the vector itself.
It is understood, however, that in some embodiments, transcription of the polynucleotides can occur in a cell-free system. The regulatory sequence to which the polynucleotides described herein can be operably linked include, for example, promoters for directing mRNA transcription. These promoters include, but are not limited to, baculovirus promoters including, but not limited to, 1E0, 1E1, 1E2, 39k, 35k, egt, ME53, ORF 142, PE38, p6.9, capsid, gp64 polyhedrin, plO, basic and core; and insect cell promoters including, but not limited to, Drosophila actin, metallothionine, and the like. Where the host cell is not an insect cell, such promoters include, but are not limited to, the left promoter from bacteriophage lambda, the lac, TRP, and TAC promoters from E. coli, promoters from Actinomycetes, including Nocardia, and Streptomyces. Promoters may be isolated, if they have not already been isolated, by standard promoter identification and trapping methods known in the art, see, for example, in Sambrook et al. , Molecular Cloning: A Laboratory Manual. 2nd. ed. , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989).
It would be understood by a person of ordinary skill in the art that the choice of promoter would depend upon the choice of host cell. Similarly, the choice of host cell will depend upon the use of the host cell. Accordingly, host cells can be used for simply amplifying, but not expressing, the nucleic acid. However, host cells can also be used to produce desirable amounts of the desired polypeptide. In this embodiment, the host cell is simply used to express the protein , er se. For example, amounts of the protein could be produced that enable its purification and subsequent use, for example, in a cell free system. In this case, the promoter is compatible with the host cell. Host cells can be chosen from virtually any of the known host cells that are manipulated by the methods of the invention to produce the desired glycosylation patterns. These could include mammalian, bacterial, yeast, filamentous fungi, or plant cells.
In addition to control regions that promote transcription, expression vectors may also include regions that modulate transcription, such as repressor binding sites and enhancers.
In addition to containing sites for transcription initiation and control, expression vectors can also contain sequences necessary for transcription termination and, in the transcribed region a ribosome binding site for translation. Other regulatory control elements for expression include initiation and termination codons as well as polyadenylation signals. The person of ordinary skill in the art would be aware of the numerous regulatory sequences that are useful in expression vectors. Such regulatory sequences are described, for example, in Sambrook et al, cited above.
Depending on the choice of a host cell, a variety of expression vectors can be used to express the polynucleotide. Such vectors include chromosomal, episomal, and particularly virus-derived vectors, for example, AαMNPV, OpMNPV, BmNPV, HzMNPV, and RoMNPV. Vectors may also be derived from combinations of these sources such as those derived from plasmid and bacteriophage genetic elements, e.g. cosmids and phagemids. Appropriate cloning and expression vectors for prokaryotic and eukaryotic hosts are described in Sambrook et al. , Molecular Cloning: A
Laboratory Manual 2nd. ed. , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989).
The regulatory sequence may provide constitutive expression in one or more host cells or may provide for inducible expression in one or more cell types such as by temperature, nutrient additive, or exogenous factor such as a hormone or other ligand. A variety of vectors providing for constitutive and inducible expression in prokaryotic and eukaryotic hosts are well known to those of ordinary skill in the art.
The polynucleotides can be inserted into the vector nucleic acid using techniques known in the art. Generally, the DNA sequence that will ultimately be expressed is joined to an expression vector by cleaving the DNA sequence and the expression vector with one or more restriction enzymes and then ligating the fragments together. Procedures for restriction enzyme digestion and ligation are well known to those of ordinary skill in the art.
Specific expression vectors are described herein for the puφoses of the invention; for example, AcΛ NPV. Other expression vectors listed herein are not intended to be limiting, and are merely provided by way of example. The person of ordinary skill in the art would be aware of other vectors suitable for maintenance, propagation, or expression of the polynucleotides described herein. These are found for example in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Any cell type or expression system can be used for the puφoses of the invention including but not limited to, for example, baculovirus systems (O'Riley et al. (1992) Baculovirus Expression Vectors, W.H. Freeman and Company, New York 1992) and Drosophila-derived systems (Johansen et al. (1989) Genes Dev 3(6):882-889).
The invention also encompasses vectors in which the nucleic acid sequences described herein are cloned into the vector in reverse orientation, but operably linked to a regulatory sequence that permits transcription of antisense RNA. Thus, an antisense transcript can be produced to all, or to a portion, of the polynucleotide sequences described herein, including both coding and non-coding regions. Expression of this antisense RNA is subject to each of the parameters described above in relation to expression of the sense RNA (regulatory sequences, constitutive or inducible expression, tissue-specific expression).
The recombinant host cells are prepared by introducing the vector constructs described herein into the cells by techniques readily available to the person of ordinary skill in the art. These include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, lipofection, and other techniques such as those found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). Where secretion of the polypeptide is desired, appropriate secretion signals known in the art are incoφorated into the vector using techniques known in the art. The signal sequence can be endogenous to the polypeptides or heterologous to these polypeptides.
Where the polypeptide is not secreted into the medium, the desired protein can be isolated from the host cell by techniques known in the art, such as, for example, standard disruption procedures, including freeze thaw, sonication, mechanical disruption, use of lysing agents and the like. The polypeptide can then be recovered and purified by well-known purification methods including, but not limited to, ammonium sulfate precipitation, acid extraction, anion or cationic exchange chromatography, phosphocellulose chromatography, hydrophobic-interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography, and high performance liquid chromatography. Furthermore, for suppressing activity of endogenous N-acetylglucosaminidase, the invention encompasses utilizing the sequences deduced from the fragment identified in Figure 18, and described in Example 4. More particularly, in this aspect, the invention comprises utilization of the glucosaminidase nucleotide sequences which are produced by using primers, such as, for example, those primer combinations described in Example 4. These nucleotide sequences may be used in the construction and expression of anti-sense RNA, ribozymes, or homologous recombination (gene "knock-out") constructs, using methods readily available to those skilled in the art, to reduce or eliminate in vivo glucosaminidase activity. Cell lines produced by the methods of the invention can be tested by expressing a model recombinant glycoprotein in such cell lines and assessing the N- glycans attached therein using techniques described herein or otherwise known in the art. The assessment can be done, for example, by 3 -dimensional HPLC techniques. In the Examples of the invention, human transferrin is used as a model target glycoprotein, since this glycoprotein is sialylated in humans and extensive oligosaccharide structural information for the protein is available (Montreuil et al. (1997) Glycoproteins II Ed. 203-242). In this manner, cell lines with superior processing characteristics are identified. Such a cell line can then be evaluated for its growth rate, product yields, and capacity to grow in suspension culture (Lindsay et al. (1992) Biotech, and Bioeng. 39:614-618, Reuveny et al. (1992) Ann. NY Acad. Sci. 665:320, Reuveny et al. (1993) Appl. Microbiol. Biotechnol. 38:619-623, Reuveny et al. (1993) Biotechnol. Bioeng. 42:235-239).
The invention encompasses expressing heterologous proteins in the cells of the invention and/or according to the methods of the invention for any puφose benefiting from such expression. Such a puφose includes, but is not limited to, increasing the in vivo circulatory half life of a protein; producing a desired quantity of the protein; increasing the biological function of the protein including, but not limited to, enzyme activity, receptor activity, binding capacity, antigenicity, therapeutic property, capacity as a vaccine or a diagnostic tool, and the like. Such proteins may be naturally occurring chemically synthesized or recombinant proteins. Examples of proteins that benefit from the heterologous expression of the invention include, but are not limited to, transferrin, plasminogen, Na+, K+- ATPase , thyrotropin, tissue plasminogen activator, erythropoietin, interleukins, and interferons. Other examples of such proteins include, but are not limited to, those described in International patent application publication number WO 98/06835, the contents of which are herein incoφorated by reference. In one embodiment, proteins that benefit from the heterologous expression of the invention are mammalian proteins. In this aspect, mammals include but are not limited to, cats, dogs, rats, mice, cows, pigs, non-human primates, and humans.
It is recognized that the heterologous expression of the invention not only encompasses proteins that are sialylated in their native source; but also those that are not sialylated as such, and benefit from the expression in the cells of and/or according to the methods of the invention.
It is recognized that proteins that are not sialylated in their native source, can be altered by known genetic engineering methods so that the heterologous expression of the protein according to the invention will result in sialylation of the protein. Such methods include, but are not limited to, the genetic engineering methods described herein. In this aspect, it is further recognized that altering the proteins could encompass engineering into the protein targeting signals to ensure targeting of the proteins to the ER and Golgi apparatus for sialylation, where such signals are needed. It is also recognized that the cells of the invention contain proteins, which are not sialylated prior to manipulation of the cells according to the methods of the invention, but are sialylated subsequent to the manipulation. In this manner, the invention also encompasses proteins that have amino acid sequences that are endogenous to the cells of the invention, but are sialylated as a result manipulation of the cells according to the methods of the invention. It is recognized that the analysis of the N-glycans produced according to the methods of the invention may suggest additional strategies to further enhance the sialylation of glycoproteins in insect cells. If the production of Gal containing carbohydrate acceptor structures is low relative to those containing GlcNAc, then the levels of Gal transferase expression are increased by integrating multiple copies of this gene into the insect cell genome or by expressing Gal T under a stronger promoter using techniques described herein or otherwise known in the art. Additionally, or alternatively, substrate feeding strategies are used to enhance the levels of UDP-Gal for this carbohydrate processing reaction. In contrast, if the fraction of carbohydrate structures terminating in Gal is high and the fraction with terminal SA is low, then sialyltransferase or CMP-SA production is enhanced. Examination of sialyltransferase activity using techniques described herein or otherwise known in the art, such as, for example, FRET or HPLC and CMP-SA levels using HP AEC, is used to determine which step is the metabolic limiting step to sialylation. These metabolic limitations are overcome by increasing expression of specific enzymes or by altering substrate feeding strategies or a combination thereof.
ASSAYS
Having generally described the invention, the same will be more readily understood by reference to the following assays and examples, which are provided by way of illustration and are not intended as limiting.
Analytical bioassays are implemented to evaluate enzymatic activities in the N-glycosylation pathway of insect cells. In order to screen a larger selection of insect cells for particular oligosaccharide processing enzymes, bioassays in which multiple samples can be analyzed simultaneously are advantageous. Consequently, bioassays based on fluorescence energy transfer (FRET) and time-resolved fluorometry of europium (Eu) are designed to screen native and recombinant insect cell lines for carbohydrate processing enzymes in a format that can handle multiple samples. Fluorescence assays are especially useful in detecting limiting steps in carbohydrate processing due to their sensitivity and specificity. FRET and Eu assays detect enzymatic activities at levels as low as 10"14 M, which is greater than the sensitivity obtained with I25I. In addition, the use of substrates modified with fluorophores enables the measurement of one specific enzyme activity in an insect cell lysate, and multiple samples can be analyzed simultaneously in a microtiter plate configuration used in an appropriate fluorometer. With these assays, insect cell lines are rapidly screened for the presence of processing enzymes including Gal, GlcNAc, and sialic acid transferases to identify limiting enzymes in N-glycosylation in native and recombinant cells. Fluorescence energy transfer (FRET) assays
Glycosyl transferase activity assays are based on the principle of fluorescence energy transfer (FRET), which has been used to study glycopeptide conformation (Rice et al (1991) Biochemistry 30:6646-6655) and to develop endo-type glycosidase assays (Lee et al (1995) Anal. Biochem. 230:31-36).
Gal T assay
The fluorescent compound, UDP-Gal-6-Naph, synthesized by consecutive reactions of galactose oxidase (generating 6-oxo compound) and reductive amination with naphthylamine, is found to be effective as a substrate for Gal transferase. When UDP-Gal-6-Naph is reacted with an acceptor carrying a dansyl group (Dans-AE- GlcNAc) in the presence of Gal-T, a product is created that can transfer energy (Figure 12). While irradiation of the naphthyl group in UDP-Gal-6-Naph at 260-290 nm ("ex" in Figure 13) results in the usual emission at 320-370 nm ("em" dotted line in Figure 13), irradiation of the product at these same low wavelengths results in energy transfer to the dansyl group and emission at 500-560 nm ("em" solid line in Figure 13). Assay sensitivity is as great as the fluorometer allows (pico- to femtomol range) and exceeds that of radioisotopes. In addition, multiple samples can be monitored simultaneously in the fluorometer, allowing a number of cell lines to be evaluated rapidly for Gal T activity.
Sialyltransferase assay
A sialyltransferase assay is designed using similar FRET technology described in the above example for Gal T. The 3-carbon tail (exocyclic chain) of sialic acid (in particular, its glycoside) can be readily oxidized with mild periodate to yield an aldehyde (Figure 14). This intermediate is reductively aminated to generate a fluorescently tagged sialic acid (after removal of its aglycon), which is then modified to form a fluorescently modified CMP-sialic acid (See also Lee et al. (1994) Anal. Biochem. 216:358-364, Brossamer et al. (1994) Methods Enzymol 247:153-177). The acceptor substrate is modified as described above to include the dansyl group. Then the FRET approach is used to measure either alpha(2, 3) or alpha(2, 6) sialyltransferase activity since these enzymes should utilize the modified CMP-SA as donor substrate to generate a product with altered fluorescent emission characteristics.
The choice of the fluorescent donor and acceptor pair can be flexible. The above examples are given using naphthyl-dansyl pairs, but other fluorescent combinations may be even more sensitive (Wu et al. (1994) Anal. Biochem. 250:260- 262).
Europium (Eu ') fluorescence assays.
An example of the use of Eu+3 fluorescence for the evaluation of Gal T activity is provided herein in the N-linked oligosaccharides from insect cells. The same techniques are used to develop enzymatic assay for transferases such as GlcNAc Tl and glycosidases such as N-acetylglucosaminidase. Further enhancements in sensitivity are obtained with the advent of the super-sensitive Eu-chelator, BHHT (4, 4*- bis (1", 1", I", 2", 2", 3", 3'-heptatluro-4", 6"-hexanedione-6*-yl)-chlorosulfo-o- teφhenyl) (Yuan et al. (1998) Anal. Chem. 70:596-601), which allows detection down to the lower fmol range.
GlcNac-TI Assay
A new GlcNAc-TI assay, illustrated in Figure 15, utilizes a synthetic 6- aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTPA (Diethylenetriaminepentaacetic acid) and complexed with Eu+3. This substrate is then incubated with insect cell lysates or positive controls containing GlcNAc Tl and UDP-GlcNAc. Addition of chemical inhibitors are used to minimize background N- acetylglucosaminidase activity. After the reaction, an excess of Crocus lectin CVL (Misaki et al. (1997) J Biol. Chem. 272:25455-25461), which specifically binds the trimannosyl core, is added. The amount of the lectin required to bind all the trimannosyl glycoside (and hence all the Eu+3 label) in the absence of any GlcNAc binding is predetermined. The reacted mixture is then filtered through a 10,000 molecular weight cut off (MWCO) microfuge ultrafiltration cup. The glycoside modified with GlcNAc does not bind CVL and appears in the filtrate. Measurement of the Eu +3 fluorescence in the filtrate reflects the level of GlcNAc Tl activity in the culture lysates. N-acetylglucosaminidase assay
An assay for N-acetylglucosaminidase activity is developed using a different lectin, GS-II, which is specific for GlcNAc. The substrate is prepared by modification of the same trimannosyl core glycoside described above using in vitro purified GlcNAc Tl, which results in addition of a GlcNAc_betα(l-2) residue to the Man_alpha(l-3) residue. Following incubation with insect cell lysates, enzymatic hydrolysis by N-acetylglucosaminidase removes GlcNAc from the substrate resulting in the tri-mannosyl core product. The product is not susceptible to lectin binding and thus escapes into the filtrate. Evaluation of Eu +3 fluorescence in the filtrate provides a measure of the N-acetylglucosaminidase activity. Alternatively, enhanced binding of the Eu-bound trimannosyl core to the Crocus lectin described above can be used as another assay for N-acetylglucosaminidase activity.
Characterization of N-linked Oligosaccharides from Insect Cells
Carbohydrate structure elucidation of the N-glycans of a recombinant glycoprotein, IgG, purified from Trichoplusia ni (High Five™ cells; Invitrogen Coφ., Carlsbad, CA, USA) has been undertaken (Davis et al. (1993) In Vitro Cell. Dev. Biol. 29:842-846; Hsu et al. (1997) J. Biol. Chem. 272:9062-9070). The recombinant glycoprotein, immunoglobulin G (IgG), was purified from both intracellular and extracellular (secreted) sources and all the attached N-glycans determined using three dimensional HPLC techniques. The composition of these structures provided insights into the carbohydrate processing pathways present in insect cells and allowed a comparison of intracellular and secreted N-glycan structures. The Trichoplusia ni cells grown in serum free medium in suspension culture were infected with a baculovirus vector encoding a murine IgG (Summers et al. (1987) A manual of methods for baculovirus vectors and insect cells culture procedures). IgG includes an N-linked oligosaccharide attachment on each of the two heavy chains. Heterologous IgG was purified from the culture supernatant and soluble cell lysates using a Protein A-Sepharose column. N-linked oligosaccharides were isolated following protease digestion of IgG and treatment with glycoamidase A to release the N-glycans. Oligosaccharides were then derivatized with 2-aminopyridine (PA) at the reducing ends to provide fluorogenic properties for detection.
Three-dimensional HPLC analysis, was performed to elucidate the N-linked oligosaccharide structures attached to the heavy chain of IgG (Tomiya et al. (1988) Anal. Biochem. 171 :73-90, Takahashi et al. (1992) Handbook of Endoglycosidases and Glycoamidases Ed. 199-332). This technique separates oligosaccharides by three successive HPLC steps and enables the identification of structures by comparison of elution conditions with those of known standards.
A DEAE column was used to separate oligosaccharides on the basis of carbohydrate acidity (first dimension). None of the oligosaccharides retained on this column were found to include sialic acid. Treatment of the acidic fractions with neuraminidase from Arthrobacter ureafaciens (known to cleave all known sialic acid linkages) failed to release any sialic acid, and ODS-chromatography of the fractions revealed several minor components different from all known sialylated oligosaccharides.
The second dimension used reverse phase HPLC with an ODS-silica column to fractionate the labeled oligosaccharides according to carbohydrate structure. Supernatant (S) and lysate (L) IgGs oligosaccharides were separated into 6 and 10 fractions, respectively, labeled A-L in Figure 6. Separation in the third and final dimension was accomplished using an amide column to isolate oligosaccharides on the basis of molecular size. Peak B from the ODS column was separated into two separate oligosaccharide fractions, and peak H was separated into three separate oligosaccharide fractions on the amide-column. After oligosaccharide purification, structures of unknown oligosaccharides were determined by comparing their positions on the 3-dimensional map with the positions of over 450 known oligosaccharides. Co-elution of an unknown sample with a known PA-oligosaccharide on the ODS and amide-silica columns was used to confirm the identity of an oligosaccharide. Digestion by glycosidases with specific cleavage sites (alp bα-L-fucosidase, betα-galactosidase, beta-N- acetylglucosaminidase, and <- bα-mannosidase) followed by reseparation provided further confirmation. All the oligosaccharides in the culture medium and cell lysates matched known carbohydrates except for oligosaccharide G. The structure of oligosaccharide G was elucidated by treatment of the N-glycan with α/pbα-L-fucosidase, known to digest Fuc_α//.bαl-6GIcNAc, followed by treatment with 13.5 M trifluoroacetic acid to remove the alphal, 3 linked fucose. The de-alphal, 6- and de-alphal, 3-fucosylated oligosaccharide G co-eluted with a known oligosaccharide, allowing the identification of G. The structure of oligosaccharide G is shown in Figure 7.
The structure of oligosaccharide G was further confirmed by l H-NMR and electrospray ionization (ESI) mass spectrometry (Hsu et al. (1997) J. Biol Chem. 272:9062-9070). Thus, the combination of these techniques can be used to elucidate both known and unknown oligosaccharides.
The carbohydrates attached to IgG from the culture medium and intracellular lysate were identified and the levels present in each source were quantified. These structures were then used in conjunction with previous studies of oligosaccharide processing in insect cells (Altmann et al. (1996) Trends in Glycoscience and
Glycotechnology 8:101-114) to generate a detailed map of N-linked oligosaccharide processing in Trichoplusia ni insect cells. The pathway and the levels of the oligosaccharides from secreted and intracellular sources are detailed in Figure 8.
The initial processing in the T. ni cells appears to be similar to the mammalian pathway, including trimming of the terminal glucose and mannose residues. The trimming process follows a linear pathway with the exception of two different forms of the Man7GlcNAc2 (M7GN, in Figure 8 also observed in native insect glycoproteins (Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114) and IgG , from NS/0 cells (Ip et al. (1994) Arch. Biochem. Biophys. 308:387-399). The presence of these two Man7 forms suggests the possible existence of an alternative processing pathway that yields Man7GlcNAc through the action of endo-alpha- mannosidase. Following cleavage of the mannose residues, GlcNAc (GN) is added to the alphal, 3 branch of Man5GlcNAC2 by GlcNAc Tl (N- acetylglusosaminyltransf erase I) (Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-114). However, GlcNAc i Man5 GlcN AC2 must be a shortlived intermediate quickly processed by alpha-Man II, since this structure was not detected in the T. ni cell lysate. At the GlcNAci, Man3 GlcNAc oligosaccharide, several branching steps in the N-glycan processing pathway are possible in insect cells. Complex glycoforms can be generated by the action of GlcNAc Til (N- acetylglucosaminyltransferase II) and Gal T (galactosyltransferase T) to provide oligosaccharides which include terminal GlcNAc (GN) and Gal (G) residues. None of the complex oligosaccharide structures included sialic acid indicating that sialylation is negligible or non-existent in these cells.
The production of these complex glycoforms must compete with an alternative processing pathway that is catalyzed by N-acetylglucosaminidase (Altmann et al. (1995) J. Biol. Chem. 270:17344-17349) (see Branch Points in Figure 8), leading to the production of hybrid and paucimannosidic structures. While the complex-type N- glycans represent 35%) of the total secreted glycoforms (supernatant % in Figure 8), the majority of secreted N-glycans are either paucimannosidic (35%) or hybrid structures (30%). Furthermore, those complex structures with a branch terminating in Gal represent less than 20% of the total secreted glycoforms and no structures were observed with terminal Gal on both branches of the N-glycan.
In contrast to the secreted glycoforms, the intracellular N-glycans (lysate % in Figure 8) obtained from insect cells include more than 50% high-mannose type structures. The fraction of intracellular complex oligosaccharides is less than 15% and only 8% include a terminal Gal residue. The high level of high-mannose structures from intracellular sources indicates significantly less oligosaccharide processing for most of the intracellular immunoglobulins. Many of these intracellular immunoglobulins may not reach the compartments in which carbohydrate trimming takes place (Jarvis et al (1989) Mol. Cell. Biol. 9:214-223). High mannose glycoforms are also observed intracellulariy for mammalian cells (Jenkins et al. (1998) Cell Culture Engineering VI).
Examples
Example 1 : Evaluation of N-glycosylation Pathway Enzymes The levels of N-linked oligosaccharide processing enzymes are measured using analytical assays to characterize carbohydrate processing in native and recombinant insect cells. These assays are used to compare the N-glycan processing capacity of different cell lines and to evaluate changes in processing and metabolite levels following metabolic engineering modifications.
High Performance Anion Exchange Chromatography (HP AEC) assay for galactose transferase
HP AEC is used in combination with pulsed amperometric detection (HPAEC- PAD) or conductivity to detect metabolite levels in the CMP-SA pathway and to evaluate N-linked oligosaccharide processing enzymes essentially as described by (Lee et al. (1990) Anal. Biochem. 34:953-95 ', Lee et al. (1996) J. Chromatography A 720:137-149). Shown in Figure 9 is an example of the use of HPAEC-PAD for measuring Gal T activity by following the lactose formation reaction:
UDP - Gal + Glc Gall Lac + UDP
The peak labeled "Lac" indicates the formation of the product lactose (Lac).
Many of the enzymes involved in N-glycosylation (e.g., aldolase, CMP-NeuAc synthetase, sialyltransferase) and metabolic intermediates (e.g., sialic acid, CMP- sialic acid, ManNAc, ManN Ac-6-phosphate) in the CMP-SA production pathway are measured using this form of chromatography, essentially as described by Lee et al. (1990) Anal. Biochem. 34:953-957, Lee et al. (1996) J Chromatography A 720:137- 149, Hardy et al. (1988) Anal. Biochem. 170:54-62, Townsend et al. (1988) Anal. Biochem. 174:459-470, Kiang et al. (1997) Anal. Biochem. 245:97-101.
Reverse phase High Performance Liquid Chromatography (HPLC) for sialyltransferase
To detect native sialyltransferase enzyme activity, Trichoplusia ni lysates were incubated in the presence of exogenously added CMP-SA and the fluorescent substrate, 4-methylumbelliferyl lactoside (Lac-MU). Negligible conversion of the substrate was observed, indicating the absence of endogenous sialyltransferase activity. However, following infection of the insect cells with a baculovirus encoding human o/ .ba2-3-sialyltransferase, conversion of Lac-MU to the product sialyl LacMU was observed in cell lysates using Reverse Phase HPLC and a fluorescence detector (Figure 10). For higher sensitivity, Lac-PA (PA=2-aminopyridine) or Lac- ABA (ABA=o-aminobenzamide) are used as substrates. HPLC and HP AEC is used in conjunction with other fluorometric methods detailed in the procedures to analyze the metabolites and enzymatic activities in insect cells.
Dissociation Enhanced Lanthananide FluorommunoAssay (DELFIA) for GalT
The previous chromatography techniques have one limitation in that only one sample can be handled at a time. When samples from several cell lines must be assayed, a method such as DELFIA is advantageous since a multiwell fluorometer can simultaneously examine activities in many samples on a microtiter plate (Hemmila et al. (1984) Anal. Biochem. 137:335-343). The application of such a technique for the measurement of Gal T activity in several different insect cell lysates and controls is shown in Figure 11. First, the wells of the microtiter plate are coated with the substrate GlcNAc-BSA (Stowell et al. (1993) Meth. in Carb. Chem. 9:178-181). After incubation with Gal T and UDP-Gal, the well is washed and the Gal residue newly attached to GlcNAc-BSA is measured with europium (Eu+3)-labeled Ricinus cummunis lectin, which specifically binds Gal or GalNAc structures. The sensitivity of Eu fluorescence under appropriate conditions can reach the fmol range and match or eclipse that of radioiodides (Kawasaki et al. (1997) Anal. Biochem. 250:260-262). Figure 11 depicts GlcNAc-BSA in (A) Boiled lysate; (B) T. ni; (C) Standard enzyme, 0.5 mU; (D) T. ni insect cells infected with a baculovirus coding for GalT (E) Sf-9 cells stably transfected with GalT gene. The increase in Gal T activity in untreated cell lysates (B in Figure 11) relative to boiled lysates (A) indicates that T. ni cells have low but measurable endogenous Gal T activity. The Gal T activity level is increased significantly following infection with a baculovirus vector including a mammalian Gal T gene under the IE1 promoter or by using Sf-9 cells stably- transformed with the Gal T gene (cell lines are described in Jarvis et al (1996) Nature Biotech. 14:1288-1292; and Hollister et al. (1998) Glycobiology 5:473-480).
The DELFIA method is not limited to Gal T measurement. This technique is used to evaluate the activity of any processing enzyme which generates carbohydrate structures containing binding sites for a specific lectin or carbohydrate-specific antibodies (Taki et al. (1994) Anal. Biochem. 219:104-108, Rabina et /. (1997) Anal. Biochem. 246:459-470).
Example 2: Enhancing SA levels by Substrate Addition Because the conventional substrates in insect cell media are not efficiently converted to CMP-SA in insect cells as demonstrated by the low levels of CMP-SA, alternative substrates are added to the culture medium. Because sialic acid and CMP- SA are not permeable to cell membranes (Bennetts et al. (1981) J. Cell. Biol. 88:1- 15), they are not considered as appropriate substrates. However, other precursors in the CMP-SA pathway are incoφorated into cells and considered as substrates for the generation of CMP-SA in insect cells.
Incorporation and conversion of N-acetylmannosamine (ManNAc)
ManNAc has been added to mammalian tissue and cell cultures and enzymatically converted to SA and CMP-SA (Ferwerda et al. (1983) Biochem. J. 216:87-92, Gu et al. (1997) Improvement of the inter feron-gamma sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine, Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43). Consequently, external feeding of ManNAc is examined as one strategy to enhance CMP-SA levels in insect cells. ManNAc is available commercially (Sigma Chemical Co.) or can be prepared chemically from the less expensive feedstock GlcNAc in vitro using sodium hydroxide (Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393- 400). Initially, the levels of native cellular ManNAc, if any, is determined using HPAEC-PAD techniques (Lee et al. (1990) Anal. Biochem. 34:953-957, Lee et al. (1996) J. Chromatography A 720:137-149, Hardy et al. (1988) Anal. Biochem.
170:54-62, Townsend et al. (1988) Anal. Biochem. 174:459-470, Kiang et al. (1997) Anal. Biochem. 245:97-101). The ability to increase intracellular ManNAc levels is evaluated by adding ManNAc to cell culture media. Incoφoration of exogenous ManNAc is quantified using unlabeled ManNAc if levels of native ManNAc are negligible, or 14C- or 3H-labeled ManNAc if significant levels of native ManNAc are present) (Bennetts et al. (1981) J Cell. Biol. 88:1-15, Kriesel et al. (1988) J. Biol. Chem. 263:11736-11742). The levels of radioactive ManNAc and other metabolites axe determined by collecting ManNAc peaks following HP AEC and measuring the radioactivity using scintillation countering.
To be effective as a substrate for sialylation, the ManNAc must be converted to SA and CMP-SA through intracellular pathways. This conversion is detected directly from externally added ManNAc by following an increase in internal SA and CMP-SA levels using HP AEC or thin layer chromatography (TLC) combined with liquid scintillation counting to detect the radiolabeled metabolites. HP AEC techniques have been used to quantify cellular pools of CMP-SA in as few as 6 x 106 mammalian cells (Fritsch et αl. (1996) Journal of 'Chromatography A 727:223-230), and TLC has been used to evaluate conversion of 14C labeled ManNAc to sialic acid in bacteria (Vann et al. (1997) Glycobiology 7:697-701). If the addition of ManNAc leads to a significant increase in the CMP-SA levels, a limiting step exists in the production of ManNAc from conventional insect cell media substrates. Different ManNAc feeding concentrations are tested and the effect on CMP-SA levels and insect cell viability evaluated to determine if there are any deleterious effects from feeding the ManNAc as substrate. Conversion of ManNAc to SA through the aldolase pathway requires pyruvate, and the addition of cytidine can enhance CMP- SA production from SA (Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43). Thus, pyruvate and cytidine are optionally added to the medium to enhance conversion of ManNAc to CMP-SA (Tomita et al. (1995) Biochim. Biophys. Acta 1243:329-335, Thomas et al (1985) Biochim. Biophys. Acta 846:37-43).
Alternative Substrates
Other precursors substrates such as N-acetylglucosamine (GlcNAc) and glucosamine are converted to SA and CMP-SA through the ManNAc pathway in eukaryotic cells (Pederson et al. (1992) Cancer Res. 52:3782-3786, Kohn et al. (1962) J. Biol. Chem. 237:304-308). The disposition of these alternative precursor substrates are monitored using HP AEC analysis using techniques known in the art and compared with ManNAc feeding strategies to determine which substrate provides for the most efficient production of CMP-SA, in particular insect cells. Example 3 : Purification and cloning of CMP-SA synthetase
A bioinformatics search of the cDNA libraries of HGS revealed a novel human CMP-SA synthetase gene based on its homology with the E. coli DNA sequence. The bacterial enzyme includes a nucleotide binding site for CTP. This binding site contains a number of amino acids that are conserved among all known bacterial CMP-SA synthetase enzymes (See Stoughton et al, Biochem J. 15:397-402 (1999). The identity of the human cDNA as a CMP-SA synthetase gene was confirmed by the presence of significant homology within this binding motif:
bacterial sequence: IIAIIPARSGSKGL identity/homology + A+I AR GSKG+ human cDNA: LAALILARGGSKGI
This human homologue commercially, publicly, or otherwise available for the puφoses of this invention is cloned and expressed in insect cells. The nucleotide and amino acid sequences of human CMP SA synthetase are shown in Figures 29 and 30 respectively.
Example 4: Isolation and Inhibition of glucosaminidase It is recognized that insect cells could possess additional N- acetylglucosaminidase enzymes other than the enzyme responsible for generating low-mannose structures, so both recombinant DNA and biochemical approaches are implemented to isolate the target N-acetylglucosaminidase gene. PCR techniques are used to isolate fragments of N-acetylglucosaminidase genes by the same strategies used in isolating α/pbα-mannosidase cDNAs from Sf-9 cells (Jarvis et al. (1997)
Glycobiology 7:113-127, Kawar et al. (1997) Glycobiology 7:433-443). Degenerate oligonucleotide primers are designed corresponding to regions of conserved amino acid sequence identified in all N-acetylglucosaminidases described thus far, from human to bacteria, including two lepidopteran insect enzymes (Zen et al. (1996) Insect Biochem. Mol. Biol. 26:435-444). These primers are used to amplify a fragment of the N-acetylglucosaminidase gene(s) from genomic DNA or cDNA of lepidopteran insect cell lines commercially, publicly, or otherwise available for the puφoses of this invention. A putative N-acetylglucosaminidase gene fragment from Sf9 genomic DNA and from High Five™ cell (Invitrogen Coφ., Carlsbad, CA, USA) cDNA has been identified (Figure 18). Similar techniques are used to isolate cDNAs from other insect cell lines of interest. The identification of cDNAs for the Sf9 or High Five™ N-acetylglucosaminidase facilitates the isolation of the gene in other insect cell lines. Figure 18 depicts PCR amplification of Sf9 genomic DNA (A) or High
Five™cell cDNA (B) with degenerate primers corresponding to three different regions conserved within N-acetylglucosaminidases. These regions were designated 1, 2, and 3, from 5 to 3'. Lane 1 (sense primer 1 and antisense primer 2); Lanes 2 (sense primer 1 and antisense primer 3); Lanes 3 (sense primer 2 and antisense primer 3). M (size standards with sizes indicated in Kbp). The results show that specific fragments of N- acetylglucosaminidase genes were amplified from both DNAs (lanes A2 and B3). The specificity of the reactions is indicated by the fact that different primer pairs produced different amplification products from different templates. The primer sequences utilized in amplifying the putative N-acetylglucosaminidase gene were as follows:
Sense primer #1: 5'-T/C,T,I,C,A,C/T,T,G,G,C,A,C/T,-\/T/C,T,I,G,T,I,G,A-3' (SEQ ID NO:9)
Sense primer #2: 5'-G,A,G/A,A/T,T,A/C/T,G,A,C/T,I,I,I,C,C,I,G,G/C,I,C,A-3' (SEQ ID NO: 10)
Antisense primer #2: 5'-T,G,I,C/G,C,I,G,G,I,I,I,G/A,T,C,T/G/A,A,T/A,C/T,T,C-3' (SEQ ID NO: 11)
Antisense primer #3: 5*-A,C/A/G,C/T,T,C,G/A,T,C,I,C,C,I,C,C,I,I,I,G/A,T,G-3' (SEQ ID NO: 12)
The PCR amplified fragments are cloned and sequenced using the chain termination method (Sanger et al. (1977) Proc. Natl. Acad. Sci. USA 74:5463-5467). The results are used to design exact-match oligonucleotide primers to isolate an N- acetylglucosaminidase clone(s) from existing Sf9 and/or High Five™ lambda ZAPII cDNA libraries by sibling selection and PCR (Jarvis et al. (1997) Glycobiology 7:113- 127, Kawar et al. (1997) Glycobiology 7:433-443). The library is consecutively split into sub-pools that score positive in PCR screens until a positive sub-pool of approximately 2,000 clones is obtained. These clones are then screened by plaque hybridization (Benton et al. (1977) Science 196:180-182) using the cloned PCR fragment, and positive clones are identified and plaque purified. The cDNA(s) are then excised in vivo as a pBluescript-based subclone in E. coli.
Isolation ofN-acetylglucosaminidases using biochemical techniques Since insect cells may have multiple N-acetylglucosaminidases, a biochemical purification approach is also used to broaden the search for the cDNA encoding the target enzyme. A polyclonal antiserum against a Manduca sexta N- acetylglucosaminidase (Koga et al. (1983) Manduca sexta Comparative Biochemistry and Physiology 74:515-520) is used to examine Sf9 and High Five™ cells for cross- reactivity. This antiserum is used to probe for the N-acetylglucosaminidase during biochemical isolation techniques. In addition, specific assays for N- acetylglucosaminidase described earlier are used to monitor enzyme activity in isolated biochemical fractions.
The target N-acetylglucosaminidase is membrane bound, so it must be solubilized using a detergent such as Triton-X 100 prior to purification. Once solubilized, the enzyme is purified by a combination of gel filtration, ion exchange, and affinity chromatography. For affinity chromatography, the affinants 6- aminohexyl thio-N-acetylglucosaminide (Chipowsky et al. (1973) Carbohydr. Res. 31 :339-346) or BSA modified with thio-N-acetylglucosaminide (Lee et al. (1976) Biochemistry 15:3956-3963) is tried first. If necessary, 6-aminohexyl a-D-[2-(thio-2- amino-2-deoxy-b-D-glucosaminyl)-mannopyranodside or other thio-oligosaccharides are synthesized and used as affinants. Affinity matrices are prepared using commercially available products.
Alternatively, the target enzyme is "anchored" to the membrane by a glycophosphoinositide. In such a case, a specific phospholipase C is used to release the active enzyme from the membrane, and the use of detergent for solubilization is avoided. The purity of the enzyme is examined with SDS-PAGE and mass spectroscopy, and the activity of the enzyme characterized. Once the enzyme is sufficiently purified, its amino-terminal region is sequenced by conventional Edman degradation techniques, available commercially. If N-terminal blockage is encountered, the purified protein are digested, peptides purified, and these peptides are used to obtain internal amino acid sequences. The resulting sequence information is used to design degenerate oligonucleotide primers that are used, in turn, to isolate cDNAs as described above.
Expression of glucosaminidase
Isolated full-length cDNAs are sequenced, compared to other N- acetylglucosaminidase cDNAs, and expressed using known polyhedrin-based baculovirus vectors. The overexpressed proteins are purified, their biochemical activities and substrate specificities characterized, and new polyclonal antisera is produced to establish the subcellular locations of the enzymes in insect cells. The locations are optionally identified by using the antisera in conjunction with secretory pathway markers, including Golgi and endoplasmic reticulum specific dyes and GFP- tagged N-glycan processing enzymes commercially, publicly, or otherwise available for the puφoses of this invention. Evaluation of the N-glycan structures on secreted glycoproteins from insect cells overexpressing the glucosaminidase gene demonstrates the involvement of this enzyme in N-glycan processing as opposed to lysosomal degradation, a common activity for other glucosaminidases.
Example 5: Expression of the model glycoprotein transferrin The gene encoding human transferrin as described in Genbank accession No.
S95936 is cloned into the baculovirus vector, expressed in multiple insect cell lines, and purified to homogeneity. Figure 26 shows SDS-PAGE of transferrin from insect cells (M=unpurified lysates, P=purified protein). Similar techniques are used to express and purify this glycoprotein in the target cell line(s) of interest following manipulation of the glycosyltransferase and CMP-SA production pathways. Once the transferrin is purified to homogeneity, the structures of the oligosaccharides which are N-linked at two sites of the transferrin are analyzed using 3-dimensional HPLC mapping techniques. Over 450 N-glycans have been mapped with this technique. For example, characterization of the N-linked oligosaccharides attached to the heavy chain of secreted and intracellular IgG is described.
Confirmation of particular carbohydrate structures is provided by treating the oligosaccharides with glycosidases and re-eluting through the HPLC columns.
Additional structural information on unknown oligosaccharides are obtained using mass spectrometry and NMR techniques previously used for analysis of IgG glycoforms (Hsu et al. (1997) J. Biol. Chem. 272:9062-9070).
These analytical techniques allow the identification and quantification of N- glycans to determine if a fraction of these structures are sialylated oligosaccharides.
Sialylation is confirmed by treating the purified N-glycan with sialidase from A. ureafaciens and measuring the release of sialic acid using HPAEC-PAD.
The present invention now will be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
Like numbers refer to like elements throughout. Many modifications and other embodiments of the invention will come to mind to one skilled in the art to which this invention pertains having the benefit of the teachings presented in the foregoing descriptions and the associated drawings.
Therefore, it is to be understood that the invention is not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for puφoses of limitation.
Example 6: Cloning, expression, and characterization of the human sialic acid synthetase (SAS) gene and gene product.
This example reports the cloning and characterization of a novel human gene having homology to the Escherichia coli sialic acid synthetase gene (neuB). This human gene is ubiquitously expressed and encodes a 40 kD enzyme which results in N-acetylneuraminic acid (Νeu5Ac) and 2-keto-3-deoxy-D-g/ cero-D-gα/αcto-nononic acid (KDN) production in insect cells upon recombinant baculovirus infection. In vitro the human enzyme uses N-acetylmannosamine-6-phosphate and mannose-6- phosphate as substrates to generate phosphorylated forms of Νeu5Ac and KDN, respectively, but exhibits much higher activity toward the Neu5Ac phosphate product.
In order to identify genes involved in sialic acid biosynthesis in eukaryotes, homology searches of a human expressed sequence tag (EST) database were performed using the E. coli sialic acid synthetase gene. A cDNA of approximately 1 kb with a predicted open reading frame (ORF) of 359 amino acids was identified. Northern blot analysis indicated that the mRNA is ubiquitously expressed, and in vitro transcription and translation along with recombinant expression in insect cells demonstrated that the human sialic acid synthetase (SAS) gene encodes a 40 kD protein. SAS rescued an E. coli neuB mutant although less efficiently than neuB. Neu5 Ac production in insect culture supplemented with ManNAc further supported the role of SAS in sialic acid biosynthesis. In addition to Neu5Ac, a second sialic acid, KDN, was generated suggesting that the human enzyme has broad substrate specificity. The human enzyme (SAS), unlike its E. coli homologue, uses phosphorylated substrates to generate phosphorylated sialic acids and thus likely represents the previously described sialic acid-9-phosphate synthetase of mammalian cells (Watson et al., J. Biol. Chem. 241, 5627-5636 (1966)).
Identification of a Human Sialic Acid Synthetase Gene
The E. coli sialic acid synthetase gene (Annunziato et al., J. Bacteriol 177, 312-319 (1995)) was used to search the human EST database of Human Genome Sciences, Inc. (Rockville, MD). One EST with significant homology to the neuB gene was found in a human liver cDNA library and used to identify a full length cDNA (Figure 35 A) with an ORF homologous to the bacterial synthetase over most of its length. The putative synthetase consisted of 359 amino acids (SEQ ID NO:6) while the neuB gene product contained 346 amino acids (SEQ ID NO:8). Alignment of the human against the bacterial enzyme demonstrated that significant differences were found primarily in the N-terminus (Figure 35B). Overall, the two synthetases were found to be 36.1% identical and 56.1%> similar at the amino acid level.
The product of a cDNA amplification with a T7 promoter was expressed by in vitro transcription and translation using rabbit reticulocyte lysates. The generation of an ~40 kD protein, consistent with a predicted molecular weight of 40.3 kD, confirmed the existence of an ORF (Figure 36A, lane 2). The negative control, namely the vector without an insert, did not produce a protein product (Figure 36A, lane 1). Northern blot analysis was performed on poly-A+ RNA blots representing a selection of human tissues (Figure 36B). The full-length cDNA was radio-labeled and used as probe. A band of expected size, ~1.3 kb, was observed in all tissues tested suggesting that the putative synthetase is ubiquitously expressed.
Expression and Purification of Human Sialic Acid Synthetase
SAS was inserted into baculovirus under the polh promoter using lacZ as a positive selection marker. After transfection and viral titering, the resulting virus (AcSAS) was used to infect Spodoptera frugiperda (Sf-9) cells followed by pulse labeling. An -40 kD band was observed in the Sf-9 lysates from cells infected by AcSAS (Figure 36A, lane 5) and not in the mock infected control (Figure 36A, lane 4). Furthermore, this band co-migrated with the protein produced in vitro. To verify SAS expression, the band was visualized in the non-nuclear fraction (Miyamoto et al., Mol. Cell. Biol. 5, 2860-2865 (1985)) after electrophoretic transfer to a ProBlott™ membrane and Ponceau S staining (data not shown) and excised for amino acid sequencing. The five N-terminal amino acids were identical to the second through sixth amino acids of the predicted protein (data not shown). Interestingly, the initiator methionine was also removed from the purified recombinant E. coli sialic acid synthetase (Vann et al., 1997).
In Vivo Activity of Human Sialic Acid Synthetase
Covalent labeling of sialic acids with the fluorescent reagent l,2-diamino-4,5- methylene dioxybenzene dihydrochloride (DMB) allows very specific and sensitive sialic acid detection (Hara et al., Anal. Biochem. 179, 162-166 (1989); Manzi et al., Anal. Biochem. 188, 20-32 (1990)). The DMB reaction products are identified after separation by reverse phase HPLC chromatography. Using this technique, sialic acid standards were measured in quantities as low as 50 fmol (data not shown). Sialic acid levels of an insect cell line (Sf-9) and a mammalian cell line (Chinese hamster ovary, CHO) were compared (Table 2). The sialic acid content in cell lysates before and after filtration through a 10,000 MWCO membrane was determined by DMB labeling and HPLC separation. The native sialic acid levels in Sf-9 cells grown without fetal bovine serum (FBS) supplementation are substantially lower than the levels found in CHO cells (Table 2; Figure 37A). To ensure that the low sialic acid content was not due to the absence of serum, the sialic acid content of insect cells cultured in 10% FBS was determined. Even with FBS addition, the Neu5 Ac content of Sf-9 cells is nearly an order of magnitude lower than the content of CHO cells (Table 2). The origin of the sialic acid detected in insect cells, whether natively produced or the result of contamination from the media, is not clear since even serum free insect cell media contains significant levels of sialic acid (data not shown).
Table 2. Sialic Acid Content of CHO and Sf-9 Cell Lines
KDN (fmol/μg protein) Neu5Ac (fmol/μg protein)
+ Filtration - Filtration + Filtration - Filtration
Sf-9 20 30
Sf-9 + FBS 80 600
CHO 70 100 900 4,200
CHO and Sf-9 cells were grown to confluency in T-75 flasks. Cell lysates with and without 10,000 MWCO filtration were analyzed for sialic acid content following DMB derivatization and HPLC separation. Sialic acid levels have been normalized based on lysate protein content. Dashes indicate sialic acid was not detectable.
The lack of large sialic acid pools in Sf-9 cells grown in serum-free media facilitated the detection of sialic acids produced by recombinant enzymes. In order to examine the production of sialic acids from cells infected with recombinant virus, Sf- 9 cells were infected with AcSAS and a negative control virus, A35. The A35 virus was generated by recombining a transfer vector without a gene inserted downstream of t e polh promoter. Low levels of Neu5Ac were observed in lysates from insect cells infected by either virus (Figure 37B) indicating additional Neu5Ac was not produced following the expression of SAS. However, a significant new peak was seen in AcSAS lysates at 12.5 min. that was not observed in A35 negative control lysates (Figure 37B). Published chromatograms suggested the unknown early eluting peak could be N-glycolylneuraminic acid (Νeu5Gc) or KDN (Inoue et al., 1998). The elution time of the unknown peak was the same as DMB-derivatized KDN standard (Figure 37B) and co-chromatographed with authentic DMB-KDN (data not shown) confirming KDN generation in AcSAS infected Sf-9 cells. KDN was not detected in uninfected Sf-9 cells either with or without FBS supplementation (Table 2).
In a further attempt to demonstrate Neu5Ac synthetic functionality, the culture media was supplemented with ManNAc, the metabolic precursor of Neu5Ac. In addition to a DMB-KDN peak, a prominent peak eluting at 17.5 min. corresponding with that of the Neu5Ac standard was observed from the lysates of ManNAc supplemented Sf-9 cells infected with AcSAS (Figure 37C). Neu5Ac quantities were more than 100 times lower in the uninfected lysates and even less in A35 infected lysates (Table 2).
Sialic acid levels were quantified in lysates of uninfected, A35 infected, and AcSAS infected Sf-9 cells grown in media with and without Man, mannosamine (ManN), or ManNAc supplementation (Table 3). In uninfected cells, Man feeding resulted in detection of KDN slightly above background, and ManNAc feeding marginally increased Neu5Ac levels in uninfected and A35 infected cells (Table 3). ManN supplementation had no effect on KDN levels but increased Neu5Ac levels (Table 3). The most significant changes in sialic acid levels occurred with AcSAS infection. AcSAS infection of Sf-9 cells led to large increases in KDN levels with slight enhancements upon Man or ManNAc supplementation. Both AcSAS infection and ManNAc feeding were required to obtain substantial Neu5Ac levels.
Table 3. Sialic Acid Content of Sf-9 with Media Supplementation
KDN . fmol/ug protein) Neu5Ac (fmol/μg protein)
Feeding: None Man ManN ManNAc None Man ManN ManNAc No Infection 20 30 20 60 140
A35 . . . . 80 80 100 120
AcSAS 5,300 7,600 5,200 6,300 50 40 200 27,000 Uninfected, A35 infected, and AcSAS infected Sf-9 cells were grown in unsupplemented media and media that was supplemented with 10 mM Man, ManN, or ManNAc. Cell lysates were analyzed for KDN and Neu5Ac content using DMB derivatization and HPLC separation. Sialic acid levels have been normalized based on lysate protein content. Dashes indicate sialic acid was not detectable.
The presence of KDN and Neu5Ac in AcSAS lysates has been confirmed by high-performance anion-exchange chromatography (HP AEC) with a pulsed amperometric detector (Figure 37D). When culture media is supplemented with ManNAc, peaks with elution times corresponding to authentic KDN and Neu5Ac standards are seen in AcSAS infected lysates that are absent in A35 infected lysates. Neu5 Ac aldolase has been demonstrated previously to break Neu5 Ac into ManNAc and pyruvic acid (Comb and Roseman, J. Biol. Chem. 235, 2529-2537 (I960)) and KDN into Man and pyruvic acid (Nadano et al., J Biol. Chem. 261, 1 1550-11557 (1986)). KDN and Neu5 Ac disappear from the AcSAS lysates after aldolase treatment (Figure 37D). A similar disappearance of the sialic acid peaks following aldolase treatment was observed using DMB-labeling and HPLC analysis (data not shown).
In Vitro Activity of Human Sialic Acid Synthetase
The mammalian pathway for Neu5Ac synthesis uses a phosphate intermediate (Jourdian et al., J Biol. Chem. 239, PC2714-PC2716 (1964); Kundig et al., J Biol. Chem. 241, 5619-5626 (1966); Watson et al., J. Biol. Chem. 241, 5627- 5636 (1966)) while the E. coli pathway directly converts ManNAc and PEP to Neu5Ac (Vann et al, Glycobiology 7, 697-701 (1997)). In order to determine which substrates are used by the human enzyme, in vitro assays were performed using lysates of infected Sf-9 cells and protein purified from the prokaryotic expression system. Lysates or purified protein plus PEP and MnCl2 (Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)) were incubated with Man, mannose-6-phosphate (Man-6- P), ManNAc, or ManNAc-6-P followed by DMB labeling and HPLC analysis.
AcSAS infected cell lysates incubated with ManNAc-6-P and PEP produced a peak eluting at 5.5 min (Figure 38A) consistent with phosphorylated sugars. In previous studies, phosphorylated KDN was detected as DMB-KDN after alkaline phosphatase (AP) treatment and DMB derivatization (Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)). Similarly, the peak eluting at 5.5 min. was exchanged for one that eluted at the same time as authentic Neu5Ac following AP treatment (Figure 38A). Likewise, an early eluting peak from the incubation mixture containing Man-6- P yielded a KDN peak after AP treatment (Figure 38B). No sialic acid products were detected when A35 infected cell lysates were used in the equivalent assays or when Man or ManNAc were used as substrates (data not shown).
Assays were performed by incubating lysates with different substrate solution concentrations of Man-6-P and ManNAc-6-P in order to evaluate substrate preference. After incubation for a fixed time period, the samples were treated with AP, and DMB derivatives of Neu5Ac and KDN were quantified and compared (Table 4). When equimolar amounts of substrates are used, Neu5Ac production is significantly favored over KDN especially at higher equimolar concentrations (10 and 20 mM) of the two substrates. Only when the substrate concentration of ManNAc-6- P is substantially lower than the Man-6-P levels are production levels of the two sialic acids comparable. When the ManNAc-6-P concentration is 1 mM and the Man-6-P level is 20 mM, the Neu5Ac:KDN production ratio approaches unity. Therefore, the enzyme prefers ManNAc-6-P over Man-6-P in the production of phosphorylated forms of Neu5Ac and KDN, respectively.
Table 4. Competitive Formation of Neu5Ac and KDN
Concentration in Substrate Solution (mM) Final Concentration (pmol/μl) Neu5Ac/KDN Man-6-P ManNAc-6-P KDN Neu5Ac Ratio i 1 8 33 4.2
5 1 19 47 2.5
10 1 33 53 1.6 20 1 56 60 1.1
5 5 14 190 14
10 10 18 440 24 20 20 16 820 51 20 5 40 300 7.6 20 10 18 470 25 Lysates from AcSAS infected Sf-9 cells were incubated with substrate solutions containing the indicated concentrations of Man-6-P and ManNAc-6-P. After incubation and AP treatment, samples were analyzed for KDN and Neu5Ac content using DMB derivatization and HPLC separation. Neu5Ac and KDN concentrations of the final solution (50 μl) and the Neu5Ac/KDN ratio are reported.
Discussion of Human Sialic Acid Synthetase Characterization
We have identified the sequence of a human sialic acid phosphate synthetase gene, SAS, whose protein product condenses ManNAc-6-P or Man-6-P with PEP to form Neu5Ac and KDN phosphates, respectively. To our knowledge, this is the first report of the cloning of a eukaryotic sialic acid phosphate synthetase gene. Despite the importance of sialic acids in many biological recognition phenomena, sialic acid phosphate synthetase genes have not been cloned because the enzymes they encode are unstable and difficult to purify (Watson et al., J. Biol. Chem. 241, 5627-5636 (1966); Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)). Even the E. coli sialic acid synthetase enzyme, whose sequence is known, has low specific activity and is unstable (Vann et al., Glycobiology 7, 697-701 (1997)).
Consequently, a bioinformatics approach based on the E. coli synthetase sequence was used to identify a putative human gene 36%> identical and 56% similar to neuB. In vitro transcription and translation verified an open reading frame which encoded a 359 amino acid protein. In addition, Northern blots revealed ubiquitous transcription of the human synthetase gene in a selection of human tissues. The wide distribution of S-4SmRNA is consistent with the detection of sialic acids in many different mammalian tissues (Inoue and Inoue, Sialobiology and Other Novel Forms of Glycosylation (Osaka, Japan: Gakushin Publishing) pp.57-67 (1999)).
Using the baculovirus expression system, the 40 kD sialic acid phosphate synthetase enzyme, SAS, was expressed in cells. The use of Sf-9 cells which have little if any native sialic acid greatly facilitated the detection of sialic acids and the characterization of SAS. However, Neu5Ac was observed only when insect cells were infected with AcSAS and the cell culture media was supplemented with ManNAc, a sialic acid precursor. This ManNAc feeding requirement indicates that Sf-9 cells may lack sizeable ManNAc pools and synthetic pathways.
SAS was identified based on homology with neuB whose enzyme product directly forms Neu5Ac from ManNAc and PEP (Vann et al., Glycobiology 7, 697-701 (1997)). Furthermore, insect cells produce Neu5Ac following recombinant SAS expression and ManNAc supplementation. However, mammalian cells are known only to produce Neu5Ac from ManNAc through a three-step pathway with phosphorylated intermediates. Therefore, in vitro assays were performed to determine the substrate specificity of SAS. Both AcSAS infected insect cell lysates and protein purified from the prokaryotic expression system were assayed using ManNAc and ManNAc-6-P as possible substrates. A rapidly eluting DMB derivatized product, typical of a phosphorylated sialic acid, was observed only when ManNAc-6-P was used as the substrate. Furthermore, this peak disappears with the appearance of an unsubstituted DMB-Neu5Ac peak following AP treatment. SAS therefore condenses PEP and ManNAc-6-P to form a Neu5 Ac phosphate product. Although the exact position of the phosphorylated carbon on the product has not yet been specified, SAS is likely the sialic acid phosphate synthetase enzyme of the previously described three-step mammalian pathway (Kundig et al, J Biol. Chem. 241, 5619-5626 (1966); Watson et al., J. Biol. Chem. 241, 5627-5636 (1966); Jourdian et al., J. Biol. Chem. 239, PC2714-PC2716 (1964)). Despite little if any native pools of sialic acids, Sf-9 cells natively possess the ability to complete the three-step mammalian pathway when only the sialic acid phosphate synthetase gene is provided. Sf-9 cells have been shown to have substantial ManNAc kinase ability (Effertz et al., J. Biol. Chem. 274, 28771-28778 (1999)), and phosphatase activity has also been detected in insect cells (Sukhanova et al, Genetika 34, 1239-1242 (1998)).
The capacity to produce sialic acids in Sf-9 cells following AcSAS infection and ManNAc supplementation at levels even higher than those seen in a mammalian cell lines such as CHO may help overcome a major limitation of the baculovirus expression system. N-glycans of recombinant glycoproteins produced in insect cells lack significant levels of terminal sialic acid residues (Jarvis and Finn, Virology 212, 500-511 (1995); Ogonah et al, Bio/Technology 14, 197-202 (1996)). The lack of sialylation on human thyrotropin produced by the baculovirus expression system resulted in rapid in vivo thyrotropin clearance as compared to thyrotropin produced by a mammalian system (Grossmann et al., Endocrinology 138, 92-100 (1997)). Generation of significant sialic acid pools along with expression of other genes such as sialyltransferases may lead to production of significant levels of sialylated glycoproteins in insect cells.
Another interesting observation was the occurrence of a second DMB reactive peak in AcSAS infected Sf-9 lysates. This peak has been identified as KDN, a deaminated Neu5Ac. We subsequently demonstrated that the SAS enzyme generates KDN phosphate from Man-6-P and PEP in vitro. While Neu5Ac production in insect cells requires both AcSAS infection and ManNAc supplementation, only AcSAS infection is necessary for KDN synthesis. Therefore, significant substrate pools for the generation of KDN already exist in insect cells or are present in the media. In addition, mannose feeding increased KDN production even further. Interestingly, Man feeding of the uninfected insect cells increased KDN levels above background, and ManNAc feeding also led to higher Neu5Ac levels in uninfected cells. Therefore, insect cells may possess limited native sialic acid synthetic ability. Similar substrate supplementation results have been reported in mammalian cells, as cultivation in Man-rich or ManNAc-rich media enhanced the synthesis of native intracellular KDN and Neu5Ac, respectively (Angata et al., Biochem. Biophys. Res. Commun. 261, 326- 331 (1999)).
This study is the first report of a eukaryotic gene encoding any enzyme with KDN synthetic ability. Recently, KDN enzymatic activity has been characterized in trout testis, a tissue high in KDN content. KDN is synthesized from Man in trout through a three- step pathway involving a synthetase with a Man-6-P substrate (Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)). However, the fish synthetase enzyme, partially purified from trout testis, was approximately 80 kD as compared to the human enzyme of 40 kD. Furthermore, KDN and Neu5Ac phosphate synthesis in trout were likely catalyzed by two separate synthetase activities (Angata et al., J. Biol. Chem. 274, 22949-22956 (1999)) while the current study indicates that both products were generated from a single human enzyme with broad substrate specificity. Neu5Ac, usually bound to glycoconjugates, is the predominant sialic acid found in mammalian tissue, but KDN, primarily found free in the ethanol soluble fractions, has also been detected all human tissues examined so far (Inoue and Inoue, Sialobiology and Other Novel Forms of Glycosylation (Osaka, Japan: Gakushin Publishing, pp.57-67 (1999)). The ratio of Neu5Ac to KDN is on the order of 100:1 in blood cells and ovaries (Inoue et al., 1998), although this ratio may change during development and cancer. The levels of free KDN in newborn fetal cord red blood cells are higher than those of maternal red blood cells (Inoue et al., J. Biol. Chem. 273, 27199-27204 (1998)). Furthermore, a 4.2 fold increase in the ratio of free KDN to free Neu5 Ac was observed in ovarian tumor cells as compared to normal cells, and the ratio appears to increase with the extent of invasion or malignancy for ovarian adenocarcinomas (Inoue et al., J. Biol. Chem. 273, 27199-27204 (1998)).
Because the KDN Neu5Ac ratio has biological significance, we performed competitive in vitro assays with insect cell lysates using both ManNAc-6-P and Man- 6-P as substrates. SAS demonstrated a preference for phosphorylated Neu5 Ac over phosphorylated KDN synthesis in vitro, although the concentrations of the particular substrates relative to the enzyme level altered this production ratio. Thus changes in the ratios of free KDN to Neu5 Ac observed in different developmental states and cancer tissue may reflect variability either in the levels of specific substrates or the amount of active enzyme present in vivo. The identification of the SAS genetic sequence and characterization of the enzyme it encodes should help further our understanding of sialic acid biosynthesis as well as the roles sialic acids play in development and disease states.
In Figure 39 the production of sialylated nucleotides in SF-9 insect cells following infection with human CMP-SA synthetase and SA synthetase containing baculoviruses is demonstrated. Sf-9 cells were grown in six well plates and infected with baculovirus containing CMP-SA synthase and supplemented with 10 mM ManNAc ("CMP" line), baculovirus containing CMP-SA synthase and SA synthase plus 10 mM ManNAc supplementation ("CMP+SA" line), or no baculovirus and no ManNAc supplementation ("SF9" line). The nucleotide sugars from lysed cells were extracted with 75% ethanol, dried, resuspended in water, and filtered through a 10,000 molecular weight cut-off membrane. Samples were then separated on a Dionex Carbopac PA-1 column using a Shimadzu VP series HPLC. Nucleotide sugars were detected based upon their absorbance at 280 nm, and CMP sialic acid standards were shown to elute at approximately 7 minutes. These results demonstrate the ability to produce the desired oligosaccharide products in insect cells via introduction and expression of sialyltransferase enzymes.
Materials and Method of Example 6 Gene Characterization
The E. coli neuB coding sequence was used to query the Human Genome Sciences (Rockville, MD) cDNA database with BLAST software. One EST clone, HMKAK61, from a human (liver) cDNA library demonstrated significant homology to neuB and was chosen for further characterization. The tissue distribution profile was determined by Northern blot hybridization. Briefly, the cDNA was radio-labeled with [32P]-dCTP using a RediPrime™II kit (Amersham/Pharmacia Biotech, Piscataway, NJ) following the manufacturer's directions. Multiple tissue Northern blots containing poly-A+ RNA (Clontech, Palo Alto, CA) were pre-hybridized at 42°C for 4 hours and then hybridized overnight with radio-labeled probe at lxl 06 CPM/ml. The blots were sequentially washed twice for 15 min. at 42°C and once for 20 min. at 65°C in 0.1X SSC, 0.1% SDS and subsequently autoradiographed.
Baculovirus Cloning and Protein Expression
The full length ORF was amplified by PCR using the following primers. The forward primer, 5'- TGTAATACGACTCACTATAGGGCGGΛ ECCGCCATC ATGCCGCTGGAGCTG GAGC (SEQ ID NO: 13) contained a synthetic T7 promoter sequence (underlined), a BamHI site (italics), a KOZAK sequence (bold), and sequence corresponding to the first six codons of SAS. The minus strand primer, 5'-
GTACGGΓ CCTTATTAAGACTTGATTTTTTTGCC (SEQ ID NO: 14), contained an Asp 718 site (italics), two in-frame stop codons (underlined), and sequences representing the last six codons of SAS.
After amplification, the PCR product was digested with BamHI and Asp 718 (Roche, Indianapolis, IN) and the resulting fragment cloned into the corresponding sites of the baculovirus transfer vector, pA2. Following DNA sequence confirmation, the plasmid (pA2-SAS) was transfected into Sf-9 cells to generate the recombinant baculovirus AcSAS as previously described (Coleman et al., Gene 190, 163-171 (1997)). Amplified virus was used to infect cells, and the gene product was radio- labeled with [35S]-Met and [35S]-Cys. Bands corresponding to the gene product were visualized by SDS-PAGE and autoradiography. Alternatively, the PCR product was used as a template for in vitro transcription and translation using rabbit reticulocyte lysate (Promega, Madison, WI) in the presence of [35S]-Met. Translation products were resolved by SDS-PAGE and visualized by autoradiography. For protein production, Sf-9 cells were seeded in serum- free media at a density of lxl 06 cells/ml in spinner flasks and infected at a multiplicity of infection of 1-2 with the recombinant virus. A detergent fractionation procedure was employed (Miyamoto et al., Mol. Cell. Biol. 5, 2860-2865 (1985)) to separate nuclear from non- nuclear fractions. Protein was resolved by SDS-PAGE, transferred to a ProBlott™ membrane (ABI, Foster City, CA), and visualized by Ponceau S staining. A prominent band at the expected MW of ~40 kD was visible and excised for protein microsequencing using an ABI-494 sequencer (PE Biosystems, Foster City, CA).
Neu5Ac/KDN Detection Sialic acid was measured by the procedure of Hara et al. (Anal. Biochem. 179,
162-166 (1989). Ten microliters of sample were treated with 200 μl DMB (Sigma Chemicals, St. Louis, MO) solution (7.0 mM DMB in 1.4 M acetic acid, 0.75 M β- mercaptoethanol, and 18 mM sodium hydrosulfite) at 50°C for 2.5 hrs, from which 10 μl was used for HPLC analysis on a Shimadzu (Columbia, MD) VP series HPLC using a Waters (Milford, MA) Spherisorb 5 μm ODS2 column. Peaks were detected using a Shimadzu RF-10AXL fluorescence detector with 448 nm emission and 373 nm excitation wavelengths. The mobile phase was an acetonitrile, methanol, and water mixture (9:7:84, v/v) with a flow rate of 0.7 ml/min. Response factors of Neu5Ac and KDN were established with authentic standards based on peak areas for quantifying sample sialic acid levels. Sialic acid content was normalized based on protein content measured with the Pierce (Rockford, IL) BCA assay kit and a Molecular Devices (Sunnyvale, CA) microplate reader. Cell Culture and Sialic Acid Quantification
Sf-9 (ATCC, Manassas, VA) cells were grown in Ex-Cell™ 405 media (JRH BioScience, Lenexa, KS) with and without 10% FBS at 27°C. CHO-K1 cells (ATCC, Manassas, VA) were cultured at 37°C in a humidified atmosphere with 5% CO2 in Dulbecco's Modified Eagle Medium (Life Technologies, Rockville, MD) supplemented with 10% FBS, 100 U/ml penicillin, 100 μg/ml streptomycin, 100 μM MEM essential amino acids, and 4 mM L-glutamine (Life Technologies, Rockville, MD). Cells were grown to confluency in T-75 flasks, washed twice with PBS, and lysed in 0.05 M bicine, pH 8.5, with 1 mM DTT (Vann et al., Glycobiology 7, 697- 701 (1997)) using a Tekmar Sonic Disruptor (Cincinnati, OH). For determination of sialic acid content, 10 μl of lysates with and without 10,000 MWCO microfiltration (Millipore, Bedford, MA) were analyzed by DMB derivatization as described above. Sugar substrate feeding was studied by plating approximately 106 Sf-9 cells on each well of a six well plate. Media was replaced with 2 ml fresh media supplemented with 10 mM sterile-filtered Man, ManN, or ManNAc. Cells were left uninfected or infected with 20 μl of the appropriate (A35 or AcSAS) amplified baculovirus stock. Cells were harvested at 80 hours post infection by separating the pellet from the media by centrifugation and washing twice with PBS. Cells were lysed and analyzed for sialic acid content as described above.
In vitro Activity
In vitro activity assays were based on the procedure of Angata et al. (J. Biol. Chem. 274, 22949-22956 (1999)). Lysates were prepared from A35 and AcSAS infected and uninfected Sf-9 cells cultured in T-75 flasks with and without 10 mM ManNAc supplementation. After washing twice with PBS, cells were lysed on ice with 25 strokes of a tight-fitting Dounce homogenizer (Wheaton, Millville, NJ) in 2.5 ml lysis buffer [50mM HEPES pH = 7.0 with 1 mM DTT, leupeptin (1 μg/ml), antipain (0.5 μg/ml), benzamidine-HCl (15.6 μg/ml), aprotinin (0.5 μg/ml), chymostatin (0.5 μg/ml), and 1 mM phenylmethylsulfonylfluoride]. 5 μl of substrate solution was incubated with either 20 μl insect cell lysate (30 min.) or purified E. coli protein (60 min.) at 37°C. The substrate solution contained 10 mM MnCl2, 20 mM PEP, and either 5 mM ManNAc-6-P or 25 mM Man-6-P (Sigma, St. Louis, MO). ManNAc-6-P was prepared by acid hydrolysis of meningococcal Group A polysaccharide. The polysaccharide (15.5 mg) in 5.8 ml water was mixed with 770 mg of Dowex 50 H+ and heated for 1 hr. at 100°C. The filtered hydro lysate was dried in vacuo and the residue dissolved to give a solution of 50 mM ManNAc-6-P and stored frozen. Substrate solutions containing 25 mM Man and ManNAc were also used. Boiled samples were used as negative controls. Following incubation, all samples were boiled 3 min., centrifuged for 10 min. at 12,000g, and split into two 10 μl aliquots. One aliquot was treated with 9 units of calf intestine alkaline phosphatase (Roche, Indianapolis, IN) along with 3 μl of accompanying buffer while the other aliquot was diluted with water and buffer. AP treated aliquots were incubated 4 hrs. at 37°C, and 10 μl of both AP treated and untreated samples were reacted with DMB as described above. 2 μl of the samples incubated with insect lysates and 10 μl of the samples incubated with bacterial protein were injected onto the HPLC for sialic acid analysis as described above.
For substrate competition experiments, Man-6-P and ManNAc-6-P concentrations in the substrate solution were varied from 1 to 20 mM. In vitro assays were run with Sf-9 lysates as described above. Samples were treated with 7 μl buffer and 18 units of AP, incubated for 4 hrs. at 37°C, and analyzed for sialic acid content. Samples containing more than 1 mM ManNAc-6-P in the substrate solution produced high levels of sialic acid and were diluted 1 :5 before injection to avoid fluorescence detector signal saturation.
Analysis with Aldolase Using HP AEC Sf-9 cells were grown in T-75 flasks and then infected with A35 or AcSAS or left uninfected in the presence or absence of 10 mM ManNAc. After 80 hrs., cells were washed twice in PBS and sonicated. Aliquots (200 μl ) were filtered through 10,000 MWCO membranes, and 50 μl samples were treated with 12.5 μl aldolase solution [0.0055 U aldolase (ICN, Costa Mesa, CA), 1.4 mM NADH (Sigma, St. Louis, MO), 0.5 M HEPES pH 7.5, 0.7 U lactate dehydrogenase (Roche, Indianapolis, IN)] or left untreated and incubated at 37°C for one hour (Lilley et al., 1992). Samples were analyzed by HP AEC with a Dionex (Sunnyvale, CA) BioLC system using a pulsed amperometric detector (PAD-II) on a Carbopac PA-1 column. The initial elution composition was 50% A (200 mM NaOH), 45% B (water), and 5% C (IM NaOAc, 200 mM NaOH) with a linear gradient to 50% A, 25% B, and 25% C at 20 min. A 6 min. 50% A and 50 % C washing followed. Samples were normalized based on protein content by dilution with water, and 20 μl of each sample were analyzed. Ten μl of each sample were also derivatized with DMB and analyzed by HPLC as described above to confirm the elimination of sialic acids by aldolase treatment.
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 136.-0
The indications made below relate to the microorganism referred to in the description on page 33 ) hne 21
B. IDENTIFICATIONOFDEPOSΓΓ Further deposits are identified on an additional sheet | |
Name of depositary institution American Type Culture Collection
Address of depositary institution (including postal code and country) 10801 University Boulevard Manassas, Virginia 201 10-2209 United States of America
Date of deposit Accession Number
24 February 2000 unknown
C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet |_j
D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)
Europe
In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be made available until the publication of the mention of the grant of the European patent or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28 (4) EPC)
E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)
The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications eg, "Accession Number of Deposit")
Figure imgf000092_0001
ATCC Deposit No.: unassigned
CANADA
The applicant requests that, until either a Canadian patent has been issued on the basis of an application or the application has been refused, or is abandoned and no longer subject to reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of a sample of the deposited biological material referred to in the application to an independent expert nominated by the Commissioner, the applicant must, by a written statement, inform the International Bureau accordingly before completion of technical preparations for publication of the international application.
NORWAY
The applicant hereby requests that the application has been laid open to public inspection (by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent Office without having been laid open inspection, the furnishing of a sample shall only be effected to an expert in the art. The request to this effect shall be filed by the applicant with the Norwegian Patent Office not later than at the time when the application is made available to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been filed by the applicant, any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on the list of recognized experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the individual case.
AUSTRALIA
The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the application, to a person who is a skilled addressee without an interest in the invention (Regulation 3.25(3) of the Australian Patents Regulations).
FINLAND
The applicant hereby requests that, until the application has been laid open to public inspection (by the National Board of Patents and Regulations), or has been finally decided upon by the National Board of Patents and Registration without having been laid open to public inspection, the furnishing of a sample shall only be effected to an expert in the art.
UNITED KINGDOM
The applicant hereby requests that the furnishing of a sample of a microorganism shall only be made available to an expert. The request to this effect must be filed by the applicant with the International Bureau before the completion of the technical preparations for the international publication of the application. ATCC Deposit No.:unassigned
DENMARK
The applicant hereby requests that, until the application has been laid open to public inspection (by the Danish Patent Office), or has been finally decided upon by the Danish Patent office without having been laid open to public inspection, the furnishing of a sample shall only be effected to an expert in the art. The request to this effect shall be filed by the applicant with the Danish Patent Office not later that at the time when the application is made available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by the applicant, any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on a list of recognized experts drawn up by the Danish Patent Office or any person by the applicant in the individual case.
SWEDEN
The applicant hereby requests that, until the application has been laid open to public inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office without having been laid open to public inspection, the furnishing of a sample shall only be effected to an expert in the art. The request to this effect shall be filed by the applicant with the International Bureau before the expiration of 16 months from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a request has been filed by the applicant any request made by a third party for the furnishing of a sample shall indicate the expert to be used. That expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent Office or any person approved by a applicant in the individual case.
NETHERLANDS
The applicant hereby requests that until the date of a grant of a Netherlands patent or until the date on which the application is refused or withdrawn or lapsed, the microorganism shall be made available as provided in the 31 F( 1 ) of the Patent Rules only by the issue of a sample to an expert. The request to this effect must be furnished by the applicant with the Netherlands Industrial Property Office before the date on which the application is made available to the public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, whichever of the two dates occurs earlier.

Claims

WHAT IS CLAIMED IS:
1. A cell of interest producing the donor substrate CMP-SA above endogenous levels.
2. A cell of interest producing an acceptor substrate, the donor substrate CMP-SA, and expressing the enzyme sialyltransferase; wherein said acceptor substrate is a glycan.
3. The cell of claim 2 wherein said glycan is a branched glycan comprising GalGlcNAcMan by at least one branch of said glycan and said Gal is a terminal Gal.
4. The cell of claim 3 wherein said glycan is an asparagine-linked glycan.
5. A cell of interest producing sialylated glycoprotein above endogenous levels.
6. The cell of claim 5, wherein said glycoprotein is asparagine (N)-linked.
7. The cell of claim 5, wherein said glycoprotein is heterologous.
8. The cell of claim 7, wherein said heterologous glycoprotein is mammalian.
9. The cell of claim 5, wherein said mammalian glycoprotein is selected from the group consisting of plasminogen, transferrin, Na+,K+- ATPase, and thyrotropin.
10. The cell of claim 5, wherein said cell expresses at least one enzyme selected from the group consisting of: a) GlcNAc-2 epimerase; b) an enzyme catalyzing conversion of UDP-GlcNAc to ManNAc; c) sialic acid synthetase; d) aldolase; e) CMP-SA synthetase; f) CMP-SA transporter; and wherein said expression is above endogenous levels.
11. The cell of claim 10, wherein said cell expresses enzyme (a).
12. The cell of claim 11, wherein said enzyme is human.
13. The cell of claim 10, wherein said cell expresses enzyme (b).
14. The cell of claim 13, wherein said enzyme is human.
15. The cell of claim 10, wherein said cell expresses enzyme (c).
16. The cell of claim 15, wherein said cell expresses the enzyme of SEQ ID NO:6.
17. The cell of claim 10, wherein said cell expresses enzyme (d).
18. The cell of claim 17, wherein said cell expresses the enzyme of SEQ ID NO:2.
19. The cell of claim 10, wherein said cell expresses enzyme (e).
20. The cell of claim 19, wherein said cell expresses the enzyme of SEQ ID NO:4.
21. The cell of claim 10, wherein said cell expresses enzyme (f).
22. The cell of claim 21, wherein said enzyme is human.
23. The cell of claim 10 wherein said cell further expresses at least one enzyme selected from the group consisting of: i) Gal T; ii) GlcNAc Tl; iii) GlcNAc Til; iv) sialyltransferase; and wherein said expression is above endogenous levels.
24. The cell of claim 10, wherein activity of endogenous N- acetylglucosaminidase is suppressed.
25. A kit for expression of sialylated glycoproteins, comprising the cell of any of claims 1-24.
26. A method for manipulating glycoprotein production in an insect cell, said method comprising enhancing expression of at least one enzyme selected from the group consisting of: a) GlcNAc-2 epimerase; b) an enzyme catalyzing conversion of UDP-GlcNAc to ManNAc; c) sialic acid synthetase; d) aldolase; e) CMP-SA synthetase; f) CMP-SA transporter; and wherein the expression of each enzyme expressed is enhanced to above endogenous levels.
27. The method of claim 26, wherein expression of enzyme (a) is enhanced.
28. The method of claim 27, wherein said enzyme is human.
29. The method of claim 26, wherein expression of enzyme (b) is enhanced.
30. The method of claim 29, wherein said enzyme is human.
31. The method of claim 26, wherein expression of enzyme (c) is enhanced.
32. The method of claim 31 , wherein said enzyme has the sequence of SEQ ID NO:6.
33. The method of claim 26, wherein expression of enzyme (d) is enhanced.
34. The method of claim 33, wherein said enzyme has the sequence of SEQ ID NO:2.
35. The method of claim 26, wherein expression of enzyme (e) is enhanced.
36. The method of claim 35, wherein said enzyme has the sequence of SEQ ID NO:4.
37. The method of claim 26, wherein expression of enzyme (f) is enhanced.
38. The method of claim 37, wherein said enzyme is human.
39. The method of claim 26, further comprising enhancing expression of at least one enzyme selected from the group consisting of: : i) Gal T; iii) GlcNAc Tl; iii) GlcNAc Til; iv) sialyltransferase; and wherein the expression of each enzyme expressed is enhanced to above endogenous levels.
40. The method of claims 26 or 39, further comprising suppressing activity of endogenous N-acetylglucosaminidase.
41. A method for producing sialylated glycoproteins, said method comprising expressing a heterologous protein in an insect cell manipulated according to the method of any of claims 26-40.
42. The method of claim 41 , wherein said heterologous protein is mammalian.
43. The method of claim 42, wherein said mammalian protein is selected from the group plasminogen, transferrin, Na+, K+- ATPase, thyrotropin.
44. A method for producing a sialylated glycoprotein in a cell of interest said method comprising: a) determining the carbohydrate substrates in said cell; b) transforming said cell with enzymes to produce necessary precursor substrates; and c) constructing a processing pathway in said cell to produce a sialylated glycoprotein.
45. The method of claim 44 wherein said cell is selected from the group consisting of yeast, insect, fungal, plant, and bacterial cells. SEQUENCE LISTING
<110> Human Genome Sciences, Inc.
<120> Engineering Intracellular Sialylation Pathways
<130> PF509.PCT
<140> Unassigned <141> 2000-03-01
<150> 60/122,582 <151> 1999-12-07
<150> 60/169,624 <151> 1999-12-08
<160> 8
<170> Patentln Ver. 2.1
<210> 1
<211> 1429
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1) .. (693)
<400> 1 atg gcc ttc cca aag aag aaa ctt cag ggt ctt gtg get gca ace ate 48
Met Ala Phe Pro Lys Lys Lys Leu Gin Gly Leu Val Ala Ala Thr He
1 5 10 15 acg cca atg act gag aat gga gaa ate aac ttt tea gta att ggt cag 96 Thr Pro Met Thr Glu Asn Gly Glu He Asn Phe Ser Val He Gly Gin 20 25 30 tat gtg gat tat ctt gtg aaa gaa cag gga gtg aag aac att ttt gtg 144 Tyr Val Asp Tyr Leu Val Lys Glu Gin Gly Val Lys Asn He Phe Val 35 40 45 aat ggc aca aca gga gaa ggc ctg tec ctg age gtc tea gag cgt cgc 192 Asn Gly Thr Thr Gly Glu Gly Leu Ser Leu Ser Val Ser Glu Arg Arg 50 55 60 cag gtt gca gag gag tgg gtg aca aaa ggg aag gac aag ctg gat cag 240 Gin Val Ala Glu Glu Trp Val Thr Lys Gly Lys Asp Lys Leu Asp Gin 65 70 75 80 gtg ata att cac gta gga gca ctg age ttg aag gag tea cag gaa ctg 288 Val He He His Val Gly Ala Leu Ser Leu Lys Glu Ser Gin Glu Leu 85 90 95 gcc caa cat gca gca gaa ata gga get gat ggc ate get gtc att gca 336 Ala Gin His Ala Ala Glu He Gly Ala Asp Gly He Ala Val He Ala 100 105 110 ccg ttc ttc etc aag cca tgg ace aaa gat ate ctg att aat ttc eta 384 Pro Phe Phe Leu Lys Pro Trp Thr Lys Asp He Leu He Asn Phe Leu 115 120 125 aag gaa gtg get get gcc gcc cct gcc ctg cca ttt tat tac tat cac 432 Lys Glu Val Ala Ala Ala Ala Pro Ala Leu Pro Phe Tyr Tyr Tyr His 130 135 140 att cct gcc ttg aca ggg gta aag att cgt get gag gag ttg ttg gat 480 He Pro Ala Leu Thr Gly Val Lys He Arg Ala Glu Glu Leu Leu Asp 145 150 155 160 ggg att ctg gat aag ate ccc ace ttc caa ggg ctg aaa ttc agt gat 528 Gly He Leu Asp Lys He Pro Thr Phe Gin Gly Leu Lys Phe Ser Asp 165 170 175 aca gat etc tta gac ttc ggg caa tgt gtt gat cag aat cgc cag caa 576 Thr Asp Leu Leu Asp Phe Gly Gin Cys Val Asp Gin Asn Arg Gin Gin 180 185 190 cag ttt get ttc ctt ttt ggg gtg gat gag caa ctg ttg agt get ctg 624 Gin Phe Ala Phe Leu Phe Gly Val Asp Glu Gin Leu Leu Ser Ala Leu 195 200 205 gtg atg gga gca act gga gca gtg ggc agt ttt gta tec aga gat tta 672 Val Met Gly Ala Thr Gly Ala Val Gly Ser Phe Val Ser Arg Asp Leu 210 215 220 tea act ttg ttg tea aac tag gttttggagt gtcacagacc aaagccatca 723 Ser Thr Leu Leu Ser Asn 225 230 tgactctggt ctctgggatt ccaatgggcc caccccggct tccactgcag aaagcctcca 783 gggagtttac tgatagtgct gaagctaaac tgaagagcct ggatttcctt tctttcactg 843 atttaaagga tggaaacttg gaagctggta gctagtgcct ctctatcaaa tcagggtttg 903 caccttgaga cataatctac cttaaatagt gcattttttt ctcagggaat tttagatgaa 963 cttgaataaa ctetcctage aaatgaaate teacaataag cattgaggta ccttttgtga 1023 geettaaaaa gtcttatttt gtgaaggggc aaaaactcta ggagteacaa ctetcagtca 1083 ttcatttcac agattttttt gtggagaaat ttctgtttat atggatgaaa tggaatcaag 1143 aggaaaattg taattgatta attccatctg tctttaggag ctctcattat ctcggtctct 1203 ggttcctaat cctattttaa agttgtctaa ttttaaaeca ctataatatg tcttcatttt 1263 aataaatatt catttggaat ctaggaaaac tctgagctac tgcatttagg caggcacttt 1323 aataccaaac tgtaacatgt ctcaactgta tacaactcaa aatacaccag ctcatttggc 1383 tgctcagtct aactctagaa tggatgcttt tgaattcatt tcgatg 1429
<210> 2 <211> 230 <212 > PRT
<213> Homo sapiens
<400> 2
Met Ala Phe Pro Lys Lys Lys Leu Gin Gly Leu Val Ala Ala Thr He
1 5 10 15
Thr Pro Met Thr Glu Asn Gly Glu He Asn Phe Ser Val He Gly Gin
20 25 30
Tyr Val Asp Tyr Leu Val Lys Glu Gin Gly Val Lys Asn He Phe Val
35 40 45
Asn Gly Thr Thr Gly Glu Gly Leu Ser Leu Ser Val Ser Glu Arg Arg
50 55 60
Gin Val Ala Glu Glu Trp Val Thr Lys Gly Lys Asp Lys Leu Asp Gin 65 70 75 80
Val He He His Val Gly Ala Leu Ser Leu Lys Glu Ser Gin Glu Leu
85 90 95
Ala Gin His Ala Ala Glu He Gly Ala Asp Gly He Ala Val He Ala
100 105 110
Pro Phe Phe Leu Lys Pro Trp Thr Lys Asp He Leu He Asn Phe Leu
115 120 125
Lys Glu Val Ala Ala Ala Ala Pro Ala Leu Pro Phe Tyr Tyr Tyr His
130 135 140
He Pro Ala Leu Thr Gly Val Lys He Arg Ala Glu Glu Leu Leu Asp 145 150 155 160
Gly He Leu Asp Lys He Pro Thr Phe Gin Gly Leu Lys Phe Ser Asp
165 170 175
Thr Asp Leu Leu Asp Phe Gly Gin Cys Val Asp Gin Asn Arg Gin Gin
180 185 190
Gin Phe Ala Phe Leu Phe Gly Val Asp Glu Gin Leu Leu Ser Ala Leu
195 200 205
Val Met Gly Ala Thr Gly Ala Val Gly Ser Phe Val Ser Arg Asp Leu
210 215 220
Ser Thr Leu Leu Ser Asn 225 230
<210> 3
<211> 1305
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1) .. (1305)
<400> 3 atg gac teg gtg gag aag ggg gcc gcc ace tec gtc tec aac ccg egg 48
Met Asp Ser Val Glu Lys Gly Ala Ala Thr Ser Val Ser Asn Pro Arg 1 5 10 15 ggg cga ccg tec egg ggc egg ccg ccg aag ctg cag cgc aac tct cgc 96 Gly Arg Pro Ser Arg Gly Arg Pro Pro Lys Leu Gin Arg Asn Ser Arg 20 25 30 ggc ggc cag ggc cga ggt gtg gag aag ccc ccg cac ctg gca gcc eta 144 Gly Gly Gin Gly Arg Gly Val Glu Lys Pro Pro His Leu Ala Ala Leu 35 40 45 att ctg gcc egg gga ggc age aaa ggc ate ccc ctg aag aac att aag 192 He Leu Ala Arg Gly Gly Ser Lys Gly He Pro Leu Lys Asn He Lys 50 55 60 cac ctg gcg ggg gtc ccg etc att ggc tgg gtc ctg cgt gcg gcc ctg 240 His Leu Ala Gly Val Pro Leu He Gly Trp Val Leu Arg Ala Ala Leu 65 70 75 80 gat tea ggg gcc ttc cag agt gta tgg gtt teg aca gac cat gat gaa 288 Asp Ser Gly Ala Phe Gin Ser Val Trp Val Ser Thr Asp His Asp Glu 85 90 95 att gag aat gtg gcc aaa caa ttt ggt gca caa gtt cat cga aga agt 336 He Glu Asn Val Ala Lys Gin Phe Gly Ala Gin Val His Arg Arg Ser 100 105 110 tct gaa gtt tea aaa gac age tct ace tea eta gat gcc ate ata gaa 384 Ser Glu Val Ser Lys Asp Ser Ser Thr Ser Leu Asp Ala He He Glu 115 120 125 ttt ctt aat tat yat aat gag gkt gac att gta gga aat att caa get 432 Phe Leu Asn Tyr Xaa Asn Glu Xaa Asp He Val Gly Asn He Gin Ala 130 135 140 act tct yea tgt tta cat cct act gat ctt caa aaa gtt gca gaa atg 480 Thr Ser Xaa Cys Leu His Pro Thr Asp Leu Gin Lys Val Ala Glu Met 145 150 155 160 att cga gaa gaa gga tat gat tct gkt ttc tct gtt gtg aga cgc cat 528 He Arg Glu Glu Gly Tyr Asp Ser Xaa Phe Ser Val Val Arg Arg His 165 170 175 cag ttt cga tgg agt gaa att cag aaa gga gtt cgt gaa gtg ace gaa 576 Gin Phe Arg Trp Ser Glu He Gin Lys Gly Val Arg Glu Val Thr Glu 180 185 190 cct ctg aat tta aat cca get aaa egg cct cgt cga caa gac tgg gat 624 Pro Leu Asn Leu Asn Pro Ala Lys Arg Pro Arg Arg Gin Asp Trp Asp 195 200 205 gga gaa tta tat gaa aat ggc tea ttt tat ttt get aaa aga cat ttg 672 Gly Glu Leu Tyr Glu Asn Gly Ser Phe Tyr Phe Ala Lys Arg His Leu 210 215 220 ata gag atg ggt tac ttg cag ggt gga aaa tgg cat act acg aaa tgc 720 He Glu Met Gly Tyr Leu Gin Gly Gly Lys Trp His Thr Thr Lys Cys 225 230 235 240 gag ctg gaa cat agt gtg gat ata gat gtg gat att gat tgg cct att 768 Glu Leu Glu His Ser Val Asp He Asp Val Asp He Asp Trp Pro He 245 250 255 gca gag caa aga gta tta aga tat ggc tat ttt ggc aaa gag aag ctt 816 Ala Glu Gin Arg Val Leu Arg Tyr Gly Tyr Phe Gly Lys Glu Lys Leu 260 265 270 aag gaa ata aaa ctt ttg gtt tgc aat att gat gga tgt etc ace aat 864 Lys Glu He Lys Leu Leu Val Cys Asn He Asp Gly Cys Leu Thr Asn 275 280 285 ggc cac att tat gta tea gga gac caa aaa gaa ata ata tct tat gat 912 Gly His He Tyr Val Ser Gly Asp Gin Lys Glu He He Ser Tyr Asp 290 295 300 gta aaa gat get att ggg ata agt tta tta aag aaa agt ggt att gag 960 Val Lys Asp Ala He Gly He Ser Leu Leu Lys Lys Ser Gly He Glu 305 310 315 320 gtg agg eta ate tea gaa agg gcc tgt tea aag cag acg ctg tct tct 1008 Val Arg Leu He Ser Glu Arg Ala Cys Ser Lys Gin Thr Leu Ser Ser 325 330 335 tta aaa ctg gat tgc aaa atg gaa gtc agt gta tea gac aag eta gca 1056 Leu Lys Leu Asp Cys Lys Met Glu Val Ser Val Ser Asp Lys Leu Ala 340 345 350 gtt gta gat gaa tgg aga aaa gaa atg ggc ctg tgc tgg aaa gaa gtg 1104 Val Val Asp Glu Trp Arg Lys Glu Met Gly Leu Cys Trp Lys Glu Val 355 360 365 gca tat ctt gga aat gaa gtg tct gat gaa gag tgc ttg aag aga gtg 1152 Ala Tyr Leu Gly Asn Glu Val Ser Asp Glu Glu Cys Leu Lys Arg Val 370 375 380 ggc eta agt ggc get cct get gat gcc tgt tec tac gcc cag aag get 1200 Gly Leu Ser Gly Ala Pro Ala Asp Ala Cys Ser Tyr Ala Gin Lys Ala 385 390 395 400 gtt gga tac att tgc aaa tgt aat ggt ggc cgt ggt gcc ate cga gaa 1248 Val Gly Tyr He Cys Lys Cys Asn Gly Gly Arg Gly Ala He Arg Glu 405 410 415 ttt gca gag cac att tgc eta eta atg gaa aaa gtt aat aat tea tgc 1296 Phe Ala Glu His He Cys Leu Leu Met Glu Lys Val Asn Asn Ser Cys 420 425 430 caa aaa tag 1305
Gin Lys
435
<210> 4
<211> 434
<212> PRT
<213> Homo sapiens
<400> 4
Met Asp Ser Val Glu Lys Gly Ala Ala Thr Ser Val Ser Asn Pro Arg
1 5 10 15
Gly Arg Pro Ser Arg Gly Arg Pro Pro Lys Leu Gin Arg Asn Ser Arg
20 25 30
Gly Gly Gin Gly Arg Gly Val Glu Lys Pro Pro His Leu Ala Ala Leu
35 40 45
He Leu Ala Arg Gly Gly Ser Lys Gly He Pro Leu Lys Asn He Lys
50 55 60
His Leu Ala Gly Val Pro Leu He Gly Trp Val Leu Arg Ala Ala Leu 65 70 75 80
Asp Ser Gly Ala Phe Gin Ser Val Trp Val Ser Thr Asp His Asp Glu 85 90 95
He Glu Asn Val Ala Lys Gin Phe Gly Ala Gin Val His Arg Arg Ser
100 105 110
Ser Glu Val Ser Lys Asp Ser Ser Thr Ser Leu Asp Ala He He Glu
115 120 125
Phe Leu Asn Tyr Xaa Asn Glu Xaa Asp He Val Gly Asn He Gin Ala
130 135 140
Thr Ser Xaa Cys Leu His Pro Thr Asp Leu Gin Lys Val Ala Glu Met 145 150 155 160
He Arg Glu Glu Gly Tyr Asp Ser Xaa Phe Ser Val Val Arg Arg His
165 170 175
Gin Phe Arg Trp Ser Glu He Gin Lys Gly Val Arg Glu Val Thr Glu
180 185 190
Pro Leu Asn Leu Asn Pro Ala Lys Arg Pro Arg Arg Gin Asp Trp Asp
195 200 205
Gly Glu Leu Tyr Glu Asn Gly Ser Phe Tyr Phe Ala Lys Arg His Leu
210 215 220
He Glu Met Gly Tyr Leu Gin Gly Gly Lys Trp His Thr Thr Lys Cys 225 230 235 240
Glu Leu Glu His Ser Val Asp He Asp Val Asp He Asp Trp Pro He
245 250 255
Ala Glu Gin Arg Val Leu Arg Tyr Gly Tyr Phe Gly Lys Glu Lys Leu
260 265 270
Lys Glu He Lys Leu Leu Val Cys Asn He Asp Gly Cys Leu Thr Asn
275 280 285
Gly His He Tyr Val Ser Gly Asp Gin Lys Glu He He Ser Tyr Asp
290 295 300
Val Lys Asp Ala He Gly He Ser Leu Leu Lys Lys Ser Gly He Glu 305 310 315 320
Val Arg Leu He Ser Glu Arg Ala Cys Ser Lys Gin Thr Leu Ser Ser
325 330 335
Leu Lys Leu Asp Cys Lys Met Glu Val Ser Val Ser Asp Lys Leu Ala
340 345 350
Val Val Asp Glu Trp Arg Lys Glu Met Gly Leu Cys Trp Lys Glu Val
355 360 365
Ala Tyr Leu Gly Asn Glu Val Ser Asp Glu Glu Cys Leu Lys Arg Val
370 375 380
Gly Leu Ser Gly Ala Pro Ala Asp Ala Cys Ser Tyr Ala Gin Lys Ala 385 390 395 400
Val Gly Tyr He Cys Lys Cys Asn Gly Gly Arg Gly Ala He Arg Glu
405 410 415
Phe Ala Glu His He Cys Leu Leu Met Glu Lys Val Asn Asn Ser Cys
420 425 430
Gin Lys
<210> 5
<211> 1080
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1) .. (1080)
<400> 5 atg ccg ctg gag ctg gag ctg tgt ccc ggg cgc tgg gtg ggc ggg caa 4!
Met Pro Leu Glu Leu Glu Leu Cys Pro Gly Arg Trp Val Gly Gly Gin 1 5 10 15 cac ccg tgc ttc ate att gcc gag ate ggc cag aac cac cag ggc gac 96 His Pro Cys Phe He He Ala Glu He Gly Gin Asn His Gin Gly Asp 20 25 30 ctg gac gta gcc aag cgc atg ate cgc atg gcc aag gag tgt ggg get 144 Leu Asp Val Ala Lys Arg Met He Arg Met Ala Lys Glu Cys Gly Ala 35 40 45 gat tgt gcc aag ttc cag aag agt gag eta gaa ttc aag ttt aat egg 192 Asp Cys Ala Lys Phe Gin Lys Ser Glu Leu Glu Phe Lys Phe Asn Arg 50 55 60 aaa gcc ttg gag agg cca tac ace teg aag cat tec tgg ggg aag acg 240 Lys Ala Leu Glu Arg Pro Tyr Thr Ser Lys His Ser Trp Gly Lys Thr 65 70 75 80 tac ggg gag cac aaa cga cat ctg gag ttc age cat gac cag tac agg 288 Tyr Gly Glu His Lys Arg His Leu Glu Phe Ser His Asp Gin Tyr Arg 85 90 95 gag ctg cag agg tac gcc gag gag gtt ggg ate ttc ttc act gcc tct 336 Glu Leu Gin Arg Tyr Ala Glu Glu Val Gly He Phe Phe Thr Ala Ser 100 105 110 ggc atg gat gag atg gca gtt gaa ttc ctg cat gaa ctg aat gtt cca 384 Gly Met Asp Glu Met Ala Val Glu Phe Leu His Glu Leu Asn Val Pro 115 120 125 ttt ttc aaa gtt gga tct gga gac act aat aat ttt cct tat ctg gaa 432 Phe Phe Lys Val Gly Ser Gly Asp Thr Asn Asn Phe Pro Tyr Leu Glu 130 135 140 aag aca gcc aaa aaa ggt cgc cca atg gtg ate tec agt ggg atg cag 480 Lys Thr Ala Lys Lys Gly Arg Pro Met Val He Ser Ser Gly Met Gin 145 150 155 160 tea atg gac ace atg aag caa gtt tat cag ate gtg aag ccc etc aac 528 Ser Met Asp Thr Met Lys Gin Val Tyr Gin He Val Lys Pro Leu Asn 165 170 175 ccc aac ttc tgc ttc ttg cag tgt ace age gca tac ccg etc cag cct 576 Pro Asn Phe Cys Phe Leu Gin Cys Thr Ser Ala Tyr Pro Leu Gin Pro 180 185 190 gag gac gtc aac ctg egg gtc ate teg gaa tat cag aag etc ttt cct 624 Glu Asp Val Asn Leu Arg Val He Ser Glu Tyr Gin Lys Leu Phe Pro 195 200 205 gac att ccc ata ggg tat tct ggg cat gaa aca ggc ata gcg ata tct 672 Asp He Pro He Gly Tyr Ser Gly His Glu Thr Gly He Ala He Ser 210 215 220 gtg gcc gca gtg get ctg ggg gcc aag gtg ttg gaa cgt cac ata act 720 Val Ala Ala Val Ala Leu Gly Ala Lys Val Leu Glu Arg His He Thr 225 230 235 240 ttg gac aag ace tgg aag ggg agt gac cac teg gcc teg ctg gag cct 768 Leu Asp Lys Thr Trp Lys Gly Ser Asp His Ser Ala Ser Leu Glu Pro 245 250 255 gga gaa ctg gcc gag ctg gtg egg tea gtg cgt ctt gtg gag cgt gcc 816 Gly Glu Leu Ala Glu Leu Val Arg Ser Val Arg Leu Val Glu Arg Ala 260 265 270 ctg ggc tec cca ace aag cag ctg ctg ccc tgt gag atg gcc tgc aat 864 Leu Gly Ser Pro Thr Lys Gin Leu Leu Pro Cys Glu Met Ala Cys Asn 275 280 285 gag aag ctg ggc aag tct gtg gtg gcc aaa gtg aaa att ccg gaa ggc 912 Glu Lys Leu Gly Lys Ser Val Val Ala Lys Val Lys He Pro Glu Gly 290 295 300 ace att eta aca atg gac atg etc ace gtg aag gtg ggt gag ccc aaa 960 Thr He Leu Thr Met Asp Met Leu Thr Val Lys Val Gly Glu Pro Lys 305 310 315 320 gcc tat cct cct gaa gac ate ttt aat eta gtg ggc aag aag gtc ctg 1008 Ala Tyr Pro Pro Glu Asp He Phe Asn Leu Val Gly Lys Lys Val Leu 325 330 335 gtc act gtt gaa gag gat gac ace ate atg gaa gaa ttg gta gat aat 1056 Val Thr Val Glu Glu Asp Asp Thr He Met Glu Glu Leu Val Asp Asn 340 345 350 cat ggc aaa aaa ate aag tct taa 1080
His Gly Lys Lys He Lys Ser
355 360
<210> 6
<211> 359
<212> PRT
<213> Homo sapiens
<400> 6
Met Pro Leu Glu Leu Glu Leu Cys Pro Gly Arg Trp Val Gly Gly Gin
1 5 10 15
His Pro Cys Phe He He Ala Glu He Gly Gin Asn His Gin Gly Asp
20 25 30
Leu Asp Val Ala Lys Arg Met He Arg Met Ala Lys Glu Cys Gly Ala
35 40 45
Asp Cys Ala Lys Phe Gin Lys Ser Glu Leu Glu Phe Lys Phe Asn Arg
50 55 60
Lys Ala Leu Glu Arg Pro Tyr Thr Ser Lys His Ser Trp Gly Lys Thr 65 70 75 80
Tyr Gly Glu His Lys Arg His Leu Glu Phe Ser His Asp Gin Tyr Arg
85 90 95
Glu Leu Gin Arg Tyr Ala Glu Glu Val Gly He Phe Phe Thr Ala Ser
100 105 110
Gly Met Asp Glu Met Ala Val Glu Phe Leu His Glu Leu Asn Val Pro
115 120 125
Phe Phe Lys Val Gly Ser Gly Asp Thr Asn Asn Phe Pro Tyr Leu Glu
130 135 140
Lys Thr Ala Lys Lys Gly Arg Pro Met Val He Ser Ser Gly Met Gin 145 150 155 160
Ser Met Asp Thr Met Lys Gin Val Tyr Gin He Val Lys Pro Leu Asn 165 170 175
Pro Asn Phe Cys Phe Leu Gin Cys Thr Ser Ala Tyr Pro Leu Gin Pro 180 185 190
Glu Asp Val Asn Leu Arg Val He Ser Glu Tyr Gin Lys Leu Phe Pro 195 200 205
Asp He Pro He Gly Tyr Ser Gly His Glu Thr Gly He Ala He Ser
210 215 220
Val Ala Ala Val Ala Leu Gly Ala Lys Val Leu Glu Arg His He Thr
225 230 235 240
Leu Asp Lys Thr Trp Lys Gly Ser Asp His Ser Ala Ser Leu Glu Pro 245 250 255
Gly Glu Leu Ala Glu Leu Val Arg Ser Val Arg Leu Val Glu Arg Ala 260 265 270
Leu Gly Ser Pro Thr Lys Gin Leu Leu Pro Cys Glu Met Ala Cys Asn 275 280 285
Glu Lys Leu Gly Lys Ser Val Val Ala Lys Val Lys He Pro Glu Gly 290 295 300
Thr He Leu Thr Met Asp Met Leu Thr Val Lys Val Gly Glu Pro Lys
305 310 315 320
Ala Tyr Pro Pro Glu Asp He Phe Asn Leu Val Gly Lys Lys Val Leu 325 330 335
Val Thr Val Glu Glu Asp Asp Thr He Met Glu Glu Leu Val Asp Asn 340 345 350
His Gly Lys Lys He Lys Ser 355
<210> 7
<211> 1059
<212 > DNA
<213 > Homo sapiens
<220 >
<221> CDS
<222> ( 1 ) . . ( 1041 )
<400> 7 atg agt aat ata tat ate gtt get gaa att ggt tgc aac cat aat ggt 48 Met Ser Asn He Tyr He Val Ala Glu He Gly Cys Asn His Asn Gly 1 5 10 15 agt gtt gat att gca aga gaa atg ata tta aaa gcc aaa gag gcc ggt 96 Ser Val Asp He Ala Arg Glu Met He Leu Lys Ala Lys Glu Ala Gly 20 25 30 gtt aat gca gta aaa ttc caa aca ttt aaa get gat aaa tta att tea 144 Val Asn Ala Val Lys Phe Gin Thr Phe Lys Ala Asp Lys Leu He Ser 35 40 45 get att gca cct aag gca gag tat caa ata aaa aac aca gga gaa tta 192 Ala He Ala Pro Lys Ala Glu Tyr Gin He Lys Asn Thr Gly Glu Leu 50 55 60 gaa tct cag tta gaa atg aca aaa aag ctt gaa atg aag tat gac gat 240 Glu Ser Gin Leu Glu Met Thr Lys Lys Leu Glu Met Lys Tyr Asp Asp 65 70 75 80 tat etc cat eta atg gaa tat gca gtc agt tta aat tta gat gtt ttt 288 Tyr Leu His Leu Met Glu Tyr Ala Val Ser Leu Asn Leu Asp Val Phe 85 90 95 tct ace cct ttt gac gaa gac tct att gat ttt tta gca tct ttg aaa 336 Ser Thr Pro Phe Asp Glu Asp Ser He Asp Phe Leu Ala Ser Leu Lys 100 105 110 caa aaa ata tgg aaa ate cct tea ggt gag tta ttg aat tta ccg tat 384 Gin Lys He Trp Lys He Pro Ser Gly Glu Leu Leu Asn Leu Pro Tyr 115 120 125 ctt gaa aaa ata gcc aag ctt ccg ate cct gat aag aaa ata ate ata 432 Leu Glu Lys He Ala Lys Leu Pro He Pro Asp Lys Lys He He He 130 135 140 tea aca gga atg get act att gat gag ata aaa cag tct gtt tct att 480 Ser Thr Gly Met Ala Thr He Asp Glu He Lys Gin Ser Val Ser He 145 150 155 160 ttt ata aat aat aaa gtt ccg gtt ggt aat att aca ata tta cat tgc 528 Phe He Asn Asn Lys Val Pro Val Gly Asn He Thr He Leu His Cys 165 170 175 aat act gaa tat cca acg ccc ttt gag gat gta aac ctt aat get att 576 Asn Thr Glu Tyr Pro Thr Pro Phe Glu Asp Val Asn Leu Asn Ala He 180 185 190 aat gat ttg aaa aaa cac ttc cct aag aat aac ata ggc ttc tct gat 624 Asn Asp Leu Lys Lys His Phe Pro Lys Asn Asn He Gly Phe Ser Asp 195 200 205 cat tct age ggg ttt tat gca get att gcg gcg gtg cct tat gga ata 672 His Ser Ser Gly Phe Tyr Ala Ala He Ala Ala Val Pro Tyr Gly He 210 215 220 act ttt att gaa aaa cat ttc act tta gat aaa tct atg tct ggc cca 720 Thr Phe He Glu Lys His Phe Thr Leu Asp Lys Ser Met Ser Gly Pro 225 230 235 240 gat cat ttg gcc tea ata gaa cct gat gaa ctg aaa cat ctt tgt att 768 Asp His Leu Ala Ser He Glu Pro Asp Glu Leu Lys His Leu Cys He 245 250 255 ggg gtc agg tgt gtt gaa aaa tct tta ggt tea aat agt aaa gtg gtt 816 Gly Val Arg Cys Val Glu Lys Ser Leu Gly Ser Asn Ser Lys Val Val 260 265 270 aca get tea gaa agg aag aat aaa ate gta gca aga aag tct att ata 864 Thr Ala Ser Glu Arg Lys Asn Lys He Val Ala Arg Lys Ser He He 275 280 285 get aaa aca gag ata aaa aaa ggt gag gtt ttt tea gaa aaa aat ata 912 Ala Lys Thr Glu He Lys Lys Gly Glu Val Phe Ser Glu Lys Asn He 290 295 300 aca aca aaa aga cct ggt aat ggt ate agt ccg atg gag tgg tat aat 960 Thr Thr Lys Arg Pro Gly Asn Gly He Ser Pro Met Glu Trp Tyr Asn 305 310 315 320 tta ttg ggt aaa att gca gag caa gac ttt att cca gat gaa tta ata 1008 Leu Leu Gly Lys He Ala Glu Gin Asp Phe He Pro Asp Glu Leu He 325 330 335 att cat age gaa ttc aaa aat cag ggg gaa taa tgagaacaaa aattattg 1059 He His Ser Glu Phe Lys Asn Gin Gly Glu 340 345
<210> 8
<211> 346
<212> PRT
<213> Homo sapiens
<400> 8
Met Ser Asn He Tyr He Val Ala Glu He Gly Cys Asn His Asn Gly
1 5 10 15
Ser Val Asp He Ala Arg Glu Met He Leu Lys Ala Lys Glu Ala Gly
20 25 30
Val Asn Ala Val Lys Phe Gin Thr Phe Lys Ala Asp Lys Leu He Ser
35 40 45
Ala He Ala Pro Lys Ala Glu Tyr Gin He Lys Asn Thr Gly Glu Leu
50 55 60
Glu Ser Gin Leu Glu Met Thr Lys Lys Leu Glu Met Lys Tyr Asp Asp 65 70 75 80
Tyr Leu His Leu Met Glu Tyr Ala Val Ser Leu Asn Leu Asp Val Phe
85 90 95
Ser Thr Pro Phe Asp Glu Asp Ser He Asp Phe Leu Ala Ser Leu Lys
100 105 110
Gin Lys He Trp Lys He Pro Ser Gly Glu Leu Leu Asn Leu Pro Tyr
115 120 125
Leu Glu Lys He Ala Lys Leu Pro He Pro Asp Lys Lys He He He
130 135 140
Ser Thr Gly Met Ala Thr He Asp Glu He Lys Gin Ser Val Ser He 145 150 155 160
Phe He Asn Asn Lys Val Pro Val Gly Asn He Thr He Leu His Cys
165 170 175
Asn Thr Glu Tyr Pro Thr Pro Phe Glu Asp Val Asn Leu Asn Ala He
180 185 190
Asn Asp Leu Lys Lys His Phe Pro Lys Asn Asn He Gly Phe Ser Asp
195 200 205
His Ser Ser Gly Phe Tyr Ala Ala He Ala Ala Val Pro Tyr Gly He
210 215 220
Thr Phe He Glu Lys His Phe Thr Leu Asp Lys Ser Met Ser Gly Pro 225 230 235 240
Asp His Leu Ala Ser He Glu Pro Asp Glu Leu Lys His Leu Cys He
245 250 255
Gly Val Arg Cys Val Glu Lys Ser Leu Gly Ser Asn Ser Lys Val Val
260 265 270
Thr Ala Ser Glu Arg Lys Asn Lys He Val Ala Arg Lys Ser He He
275 280 285
Ala Lys Thr Glu He Lys Lys Gly Glu Val Phe Ser Glu Lys Asn He
290 295 300
Thr Thr Lys Arg Pro Gly Asn Gly He Ser Pro Met Glu Trp Tyr Asn 305 310 315 320
Leu Leu Gly Lys He Ala Glu Gin Asp Phe He Pro Asp Glu Leu He
325 330 335
He His Ser Glu Phe Lys Asn Gin Gly Glu 340 345
PCT/US2000/005313 1999-03-02 2000-03-01 Engineering intracellular sialylation pathways WO2000052135A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA2363297A CA2363297C (en) 1999-03-02 2000-03-01 Engineering intracellular sialylation pathways
EP00913684A EP1399538A2 (en) 1999-03-02 2000-03-01 Engineering intracellular sialylation pathways
JP2000602747A JP2003524395A (en) 1999-03-02 2000-03-01 Manipulation of the intracellular sialylation pathway
AU35083/00A AU3508300A (en) 1999-03-02 2000-03-01 Engineering intracellular sialylation pathways

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US12258299P 1999-03-02 1999-03-02
US60/122,582 1999-03-02
US16962499P 1999-12-08 1999-12-08
US60/169,624 1999-12-08

Publications (3)

Publication Number Publication Date
WO2000052135A2 true WO2000052135A2 (en) 2000-09-08
WO2000052135A9 WO2000052135A9 (en) 2001-10-11
WO2000052135A3 WO2000052135A3 (en) 2004-01-08

Family

ID=26820692

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/005313 WO2000052135A2 (en) 1999-03-02 2000-03-01 Engineering intracellular sialylation pathways

Country Status (5)

Country Link
EP (1) EP1399538A2 (en)
JP (1) JP2003524395A (en)
AU (1) AU3508300A (en)
CA (1) CA2363297C (en)
WO (1) WO2000052135A2 (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001059075A1 (en) * 2000-02-08 2001-08-16 Genentech, Inc. Improved sialylation of glycoproteins
WO2003074687A1 (en) * 2002-03-07 2003-09-12 Eidgenössische Technische Hochschule Zürich System and method for the production of recombinant glycosylated proteins in a prokaryotic host
WO2004063370A1 (en) * 2003-01-15 2004-07-29 Kazuhito Fujiyama Process for producing glycoprotein having animal type sugar chain
WO2005080585A1 (en) * 2004-02-13 2005-09-01 Glycotope Gmbh Highly active glycoproteins-process conditions and an efficient method for their production
US6949372B2 (en) 1999-03-02 2005-09-27 The Johns Hopkins University Engineering intracellular sialylation pathways
WO2005090552A2 (en) * 2004-03-17 2005-09-29 Glycofi, Inc. Method of engineering a cytidine monophosphate-sialic acid synthetic pathway in fungi and yeast
WO2007006570A3 (en) * 2005-07-12 2007-07-19 Greenovation Biotech Gmbh Improvements in or relating to protein production
US7479549B2 (en) 2005-02-23 2009-01-20 Jaques John Scott T Recombinant canine thyroid stimulating hormone and methods of production and use thereof
KR100938026B1 (en) 2002-03-07 2010-01-21 아이드게노쉬쉐 테흐니쉐 호흐슐레 쥬리히 System and method for the production of recombinant glycosylated protein in a prokaryotic host
US7863020B2 (en) 2000-06-28 2011-01-04 Glycofi, Inc. Production of sialylated N-glycans in lower eukaryotes
EP2412815A1 (en) * 2010-07-27 2012-02-01 Universite De Rouen N-glycosylation in transformed Phaeodactylum tricornutum
US8592165B2 (en) 2006-11-10 2013-11-26 Glycotope Gmbh Carbohydrate specific cellular immunity inducing microorganisms and fractions thereof
US8753864B2 (en) 2005-05-11 2014-06-17 Eth Zurich Recombinant N-glycosylated proteins from procaryotic cells
US8846342B2 (en) 2009-11-19 2014-09-30 Glycovaxyn Ag Biosynthetic system that produces immunogenic polysaccharides in prokaryotic cells
US8871491B2 (en) 2010-05-06 2014-10-28 Glycovaxyn Ag Capsular gram-positive bacteria bioconjugate vaccines
US8895014B2 (en) 2008-02-20 2014-11-25 Glycovaxyn Ag Bioconjugates made from recombinant N-glycosylated proteins from procaryotic cells
US9051356B2 (en) 2006-09-10 2015-06-09 Glycotope Gmbh Use of human cells of myeloid leukaemia origin for expression of antibodies
US9700610B2 (en) 2011-08-22 2017-07-11 Glycotope Gmbh Microorganisms carrying a tumor antigen
US11872289B2 (en) 2018-05-18 2024-01-16 Daiichi Sankyo Co., Ltd. Anti-MUC1 antibody-drug conjugate

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG152290A1 (en) * 2004-05-04 2009-05-29 Univ Singapore Method for expressing sialylated glycoproteins in mammalian cells and cells thereof
JP5918697B2 (en) * 2010-10-15 2016-05-18 Jcrファーマ株式会社 Method for producing a glycoprotein in which the non-reducing end of the sugar chain is a mannose residue

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MUNSTER. ET. AL.: 'Mammalian cytidine 5'-monophosphate N-acetylneuraminic acid synthetase: A nuclear protein with avolutionarly conserved structural motifs.' PROC. NATL. ACAD. SCI. USA vol. 95, August 1998, pages 9140 - 9145, XP002928328 *
See also references of EP1399538A2 *

Cited By (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6949372B2 (en) 1999-03-02 2005-09-27 The Johns Hopkins University Engineering intracellular sialylation pathways
US8546105B2 (en) 1999-03-02 2013-10-01 The University Of Wyoming Engineering intracellular sialylation pathways
US7776565B2 (en) 1999-03-02 2010-08-17 The John Hopkins University Engineering intracellular sialylation pathways
WO2001059075A1 (en) * 2000-02-08 2001-08-16 Genentech, Inc. Improved sialylation of glycoproteins
US7863020B2 (en) 2000-06-28 2011-01-04 Glycofi, Inc. Production of sialylated N-glycans in lower eukaryotes
KR100938026B1 (en) 2002-03-07 2010-01-21 아이드게노쉬쉐 테흐니쉐 호흐슐레 쥬리히 System and method for the production of recombinant glycosylated protein in a prokaryotic host
WO2003074687A1 (en) * 2002-03-07 2003-09-12 Eidgenössische Technische Hochschule Zürich System and method for the production of recombinant glycosylated proteins in a prokaryotic host
US8703471B2 (en) 2002-03-07 2014-04-22 ETH Zürich System and method for the production of recombinant glycosylated proteins in a prokaryotic host
WO2004063370A1 (en) * 2003-01-15 2004-07-29 Kazuhito Fujiyama Process for producing glycoprotein having animal type sugar chain
US8999671B2 (en) 2003-02-20 2015-04-07 Glycofi, Inc. Production of sialylated N-glycans in lower eukaryotes
US8268609B2 (en) 2003-02-20 2012-09-18 Glycofi, Inc. Production of sialylated N-glycans in lower eukaryotes
US8609370B2 (en) * 2004-02-13 2013-12-17 Glycotope Gmbh Highly active glycoproteins-process conditions and an efficient method for their production
AU2005215889B2 (en) * 2004-02-13 2011-11-10 Glycotope Gmbh Highly active glycoproteins-process conditions and an efficient method for their production
AU2005215889B9 (en) * 2004-02-13 2011-11-17 Glycotope Gmbh Highly active glycoproteins-process conditions and an efficient method for their production
WO2005080585A1 (en) * 2004-02-13 2005-09-01 Glycotope Gmbh Highly active glycoproteins-process conditions and an efficient method for their production
AU2005224672B2 (en) * 2004-03-17 2011-06-02 Glycofi, Inc. Method of engineering a cytidine monophosphate-sialic acid synthetic pathway in fungi and yeast
EP2365089A1 (en) * 2004-03-17 2011-09-14 GlycoFi, Inc. Method of engineering a cytidine monophosphate-sialic acid synthetic pathway in fungi and yeast
WO2005090552A2 (en) * 2004-03-17 2005-09-29 Glycofi, Inc. Method of engineering a cytidine monophosphate-sialic acid synthetic pathway in fungi and yeast
WO2005090552A3 (en) * 2004-03-17 2006-01-26 Glycofi Inc Method of engineering a cytidine monophosphate-sialic acid synthetic pathway in fungi and yeast
US7838492B2 (en) 2005-02-23 2010-11-23 Jaques John Scott T Recombinant canine thyroid stimulating hormone and methods of production and use thereof
US7479549B2 (en) 2005-02-23 2009-01-20 Jaques John Scott T Recombinant canine thyroid stimulating hormone and methods of production and use thereof
US8753864B2 (en) 2005-05-11 2014-06-17 Eth Zurich Recombinant N-glycosylated proteins from procaryotic cells
US9551019B2 (en) 2005-05-11 2017-01-24 ETH Zürich Recombinant N-glycosylated proteins from procaryotic cells
US7741539B2 (en) 2005-07-12 2010-06-22 Greenovation Biotech Gmbh Transformed plant cell expressing five mammalian proteins involved in sialylation and a protein involved in galactosylation
WO2007006570A3 (en) * 2005-07-12 2007-07-19 Greenovation Biotech Gmbh Improvements in or relating to protein production
AU2007248485B2 (en) * 2006-05-05 2013-01-24 Glycofi, Inc. Production of sialylated N-glycans in lower eukaryotes
US10280230B2 (en) 2006-09-10 2019-05-07 Glycotope Gmbh Use of human cells of myeloid leukemia origin for expression of antibodies
US9051356B2 (en) 2006-09-10 2015-06-09 Glycotope Gmbh Use of human cells of myeloid leukaemia origin for expression of antibodies
US8592165B2 (en) 2006-11-10 2013-11-26 Glycotope Gmbh Carbohydrate specific cellular immunity inducing microorganisms and fractions thereof
US9494587B2 (en) 2006-11-10 2016-11-15 Glycotope Gmbh Microorganisms or fractions thereof capable of activating cellular immunity against carbohydrates
US8895014B2 (en) 2008-02-20 2014-11-25 Glycovaxyn Ag Bioconjugates made from recombinant N-glycosylated proteins from procaryotic cells
US11944675B2 (en) 2008-02-20 2024-04-02 Glaxosmithkline Biologicals Sa Bioconjugates made from recombinant N-glycosylated proteins from procaryotic cells
US10835592B2 (en) 2008-02-20 2020-11-17 Glaxosmithkline Biologicals Sa Bioconjugates made from recombinant N-glycosylated proteins from procaryotic cells
US8846342B2 (en) 2009-11-19 2014-09-30 Glycovaxyn Ag Biosynthetic system that produces immunogenic polysaccharides in prokaryotic cells
US9764018B2 (en) 2009-11-19 2017-09-19 Glycovaxyn Ag Biosynthetic system that produces immunogenic polysaccharides in prokaryotic cells
US8871491B2 (en) 2010-05-06 2014-10-28 Glycovaxyn Ag Capsular gram-positive bacteria bioconjugate vaccines
US9585950B2 (en) 2010-05-06 2017-03-07 Glycovaxyn Ag Capsular gram-positive bacteria bioconjugate vaccines
US10307473B2 (en) 2010-05-06 2019-06-04 Glaxosmithkline Biologicals Sa Capsular gram-positive bacteria bioconjugate vaccines
EP2412815A1 (en) * 2010-07-27 2012-02-01 Universite De Rouen N-glycosylation in transformed Phaeodactylum tricornutum
US9587245B2 (en) 2010-07-27 2017-03-07 Universite De Rouen N-glycosylation in transformed Phaeodactylum tricornutum
WO2012013337A1 (en) * 2010-07-27 2012-02-02 Universite De Rouen N-glycosylation in tranformed phaeodactylum tricornutum
US9700610B2 (en) 2011-08-22 2017-07-11 Glycotope Gmbh Microorganisms carrying a tumor antigen
US11872289B2 (en) 2018-05-18 2024-01-16 Daiichi Sankyo Co., Ltd. Anti-MUC1 antibody-drug conjugate

Also Published As

Publication number Publication date
AU3508300A (en) 2000-09-21
EP1399538A4 (en) 2004-03-24
CA2363297C (en) 2011-08-09
JP2003524395A (en) 2003-08-19
EP1399538A2 (en) 2004-03-24
CA2363297A1 (en) 2000-09-08
WO2000052135A3 (en) 2004-01-08
WO2000052135A9 (en) 2001-10-11

Similar Documents

Publication Publication Date Title
US6949372B2 (en) Engineering intracellular sialylation pathways
CA2363297C (en) Engineering intracellular sialylation pathways
Harrison et al. Protein N‐glycosylation in the baculovirus–insect cell expression system and engineering of insect cells to produce “mammalianized” recombinant glycoproteins
Aumiller et al. A transgenic insect cell line engineered to produce CMP–sialic acid and sialylated glycoproteins
Shi et al. Protein N-glycosylation in the baculovirus-insect cell system
Lawrence et al. Cloning and expression of the humanN-acetylneuraminic acid phosphate synthase gene with 2-Keto-3-deoxy-d-glycero-d-galacto-nononic acid biosynthetic ability
DK2533834T3 (en) DRUG DELIVERY DEVICES
US20170159095A1 (en) Method of production of recombinant glycoproteins with increased circulatory half-life in mammalian cells
Kim et al. Expression of a functional Drosophila melanogaster N-acetylneuraminic acid (Neu5Ac) phosphate synthase gene: evidence for endogenous sialic acid biosynthetic ability in insects
Geisler et al. Substrate specificities and intracellular distributions of three N-glycan processing enzymes functioning at a key branch point in the insect N-glycosylation pathway
Clark et al. Gene‐expression profiles for five key glycosylation genes for galactose‐fed CHO cells expressing recombinant IL‐4/13 cytokine trap
US6323332B1 (en) Sulfotransferase for HNK-1 glycan
Zheng et al. Molecular cloning and characterization of a novel α1, 2-fucosyltransferase (CE2FT-1) from Caenorhabditis elegans
WO2001042492A1 (en) Engineering intracellular sialylation pathways
US5856159A (en) Production of galactosyltransferase
JP2003047467A (en) Chondroitin synthetase
US20070154982A1 (en) Mammalian cell lines modified for the production of recombinant glycoproteins
WO2008127359A2 (en) An insect cell line for production of recombinant glycoproteins with sulfated complex n-glycans
Zheng et al. A novel α1, 2-fucosyltransferase (CE2FT-2) in Caenorhabditis elegans generates H-type 3 glycan structures
WO2001059075A1 (en) Improved sialylation of glycoproteins
KR20070060439A (en) MAMMALIANIZATION OF GLYCOPROTEINS PRODUCED IN INSECT CELLS EXPRESSING STABLY HUMAN beta;1,4-GALACTOSYLTRANSFERASE AND RAT alpha;2,6-SIALYLTRANSFERASE AND METHOD THEREOF
US20030104574A1 (en) Mutant cell lines and methods for producing enhanced levels of recombinant proteins
Lawrence Engineering sialic acid metabolism into insect cells
Chen Ammonium toxicity and amino acid protection in Chinese hamster ovary cells
Kawar Characterization of a lepidopteran insect alpha1, 2-mannosidase: cDNA cloning, biosynthesis, subcellular localization, and substrate specificity

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
ENP Entry into the national phase in:

Ref country code: CA

Ref document number: 2363297

Kind code of ref document: A

Format of ref document f/p: F

Ref document number: 2363297

Country of ref document: CA

ENP Entry into the national phase in:

Ref country code: JP

Ref document number: 2000 602747

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2000913684

Country of ref document: EP

AK Designated states

Kind code of ref document: C2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 1/35-35/35, DRAWINGS, REPLACED BY NEW PAGES 1/40-40/40; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 2000913684

Country of ref document: EP