WO1998020111A1 - Improved expression vectors - Google Patents

Improved expression vectors Download PDF

Info

Publication number
WO1998020111A1
WO1998020111A1 PCT/US1997/020528 US9720528W WO9820111A1 WO 1998020111 A1 WO1998020111 A1 WO 1998020111A1 US 9720528 W US9720528 W US 9720528W WO 9820111 A1 WO9820111 A1 WO 9820111A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
heterologous nucleic
promoter
encodes
bacterial
Prior art date
Application number
PCT/US1997/020528
Other languages
French (fr)
Inventor
Jody Schultz
Gary Hermanson
Original Assignee
Cytel Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cytel Corporation filed Critical Cytel Corporation
Priority to HU0001650A priority Critical patent/HUP0001650A3/en
Priority to AU52524/98A priority patent/AU718382B2/en
Priority to NZ335628A priority patent/NZ335628A/en
Priority to IL12984397A priority patent/IL129843A0/en
Priority to JP52185798A priority patent/JP2001503274A/en
Priority to CA002271230A priority patent/CA2271230A1/en
Priority to DE69738514T priority patent/DE69738514D1/en
Priority to EP97947444A priority patent/EP0946711B1/en
Publication of WO1998020111A1 publication Critical patent/WO1998020111A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • C12N15/72Expression systems using regulatory sequences derived from the lac-operon
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P20/00Technologies relating to chemical industry
    • Y02P20/50Improvements relating to the production of bulk chemicals
    • Y02P20/582Recycling of unreacted starting or intermediate materials

Definitions

  • the present invention relates to improved vectors and fermentation protocols for expression of recombinant proteins in bacterial cells.
  • the invention can be used for expression of any desired protein, including enzymes useful in the enzymatic synthesis of oligosaccharides.
  • Recombinant DNA techniques using bacterial, fungal, mammalian, or insect cells as expression hosts are particularly useful means for producing large quantities of polypeptides.
  • Recombinant production of desired proteins generally involves transfecting host cells with an expression vector that contains signals which, when operably linked to a gene encoding the protein, control expression of the gene. The cells are grown under conditions suitable for expression of the recombinant protein.
  • the expression control signals typically include a promoter, which influences the rate at which a gene located downstream of the promoter is transcribed into RNA and determines the transcriptional start site.
  • Expression control signals are chosen so as to be functional in the host cell used for production of the desired protein.
  • the bacterium E. coli is commonly used to produce recombinant proteins in high yields.
  • Numerous references disclose methods of using E. coli and other bacteria to produce proteins recombinantly. (see, e.g., U.S. Pat. No. 4,565,785; U.S. Pat. No. 4,673,641; U.S. Pat. No. 4,738,921; U.S. Pat. No. 4,795,706; and U.S. Pat. No. 4,710,473).
  • the present invention provides isolated recombinant nucleic acid constructs that comprise a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide.
  • the constructs are useful for expressing a desired polypeptide in bacterial host cells at high levels.
  • the dual promoters comprise a first component derived from a t ⁇ c-related promoter and a second promoter component obtained from a bacterial gene or operon that encodes an enzyme or enzymes involved in galactose metabolism.
  • a number of galactose promoters can be used as the second promoter component; an exemplary promoter is a UDPgalactose-4-epimerase (also known as UDPglucose-4-epimerase) promoter such as that derived from Streptococcus thermophilus.
  • UDPgalactose-4-epimerase also known as UDPglucose-4-epimerase
  • expression vectors which include a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide.
  • the expression vectors can further comprise other components such as a selectable marker.
  • the constructs can also comprise an origin of replication sequence that functions in E. coli or other host cell.
  • a preferred construct of the invention, the plasmid pTGK (described in detail below) was deposited with the American Type Culture Collection under Accession No. 98059 on May 22, 1996.
  • the invention also provides a bacterial cell that contains a recombinant expression cassette that includes the dual bacterial promoter unit operably linked to a heterologous nucleic acid.
  • the expression cassette can be integrated into the genome of the host cell or be present on an independently replicating plasmid.
  • a preferred bacterial cell is E. coli.
  • the invention further provides methods of making a desired polypeptide. The methods involve culturing in an appropriate medium bacterial cells that contain a recombinant expression cassette having a dual bacterial promoter unit operably linked to a heterologous nucleic acid under conditions that allow expression of the heterologous nucleic acid. Typically E. coli are used as the host cell.
  • polypeptides include hormones, growth factors, and the like, as well as enzymes useful in the synthesis of carbohydrates.
  • enzymes include CMP-sialic acid synthetase, UDP-glucose pyrophosphorylase, adenylate kinase, pyruvate kinase, sialic acid aldolase, UDP-GlcNAc pyrophosphorylase, and myokinase, galactosyltransferase, and N-acetyl glucosaminyltransferase.
  • Figure 1 is a map of the plasmid pPHOX2.
  • Figure 2 shows the spacing between the ribosome binding site (Shine- Dalgarno sequence) and the initiation codon of the recombinant gene in the vectors of the invention (SEQ ID NO:2).
  • Figure 3 is a map of the plasmid pPHOX2/galE/Kan.
  • Figure 4 is a map of the plasmid pTGK. This plasmid is essentially the same as pPHOX2/galE/Kan, except that the 5' Xbal site of the promoter region and the Hin ⁇ lll site in the kanamycin resistance gene (kan 1 ) have been deleted.
  • Figures 5A and 5B show that one can maintain optimal spacing between the ribosome binding site and the initiation codon of the gene to be expressed in vectors of the invention by amplifying the DNA to be expressed using as primers oligonucleotides that begin with the ATG initiation codon.
  • Figure 5A shows the relationship between the ribosome binding site (RBS) and the Srfi restriction site in the plasmid pTGKS (SEQ ID
  • FIG. 3 shows the nucleotide sequence of one embodiment of a dual tac-gal promoter of the invention, flanked by pPHOX2 sequences, as indicated (SEQ ID NO:l).
  • SEQ ID NO:l The -35 and -10 consensus sequences of both the tac and galE promoters are shown, as are the locations of Xbal and Hwdlll sites which are useful for inserting a recombinant gene for expression.
  • the present invention provides improved promoters and vectors for the recombinant expression of desired polypeptides.
  • the promoters and vectors of the invention are particularly suited for the expression of recombinant proteins in bacterial hosts, such as E. coli.
  • fermentation protocols for obtaining high levels of recombinant protein expression using the promoters and vectors of the invention are also provided.
  • nucleic acid refers to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to nucleic acids in manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence includes the complementary sequence thereof.
  • operably linked refers to functional linkage between a nucleic acid expression control sequence (such as a promoter, signal sequence, or array of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence affects transcription and/or translation of the nucleic acid corresponding to the second sequence.
  • a nucleic acid expression control sequence such as a promoter, signal sequence, or array of transcription factor binding sites
  • recombinant when used with reference to a cell indicates that the cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a heterologous nucleic acid.
  • Recombinant cells can express genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also express genes that are found in the native form of the cell, but wherein the genes are modified and re-introduced into the cell by artificial means.
  • heterologous sequence or a “heterologous nucleic acid”, as used herein, is one that originates from a foreign source (or species) or, if from the same source, is modified from its original form.
  • a heterologous nucleic acid operably linked to a promoter is from a source different from that from which the promoter was derived, or, if from the same source, is modified from its original form.
  • a UDPglucose 4-epimerase gene promoter can be linked to a structural gene encoding a polypeptide other than native UDPglucose 4-epimerase.
  • Modification of the heterologous sequence may occur, e.g., by treating the DNA with a restriction enzyme to generate a DNA fragment that is capable of being operably linked to the promoter.
  • Techniques such as site-directed mutagenesis are also useful for modifying a heterologous sequence.
  • a “recombinant expression cassette” or simply an “expression cassette” is a nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements that are capable of affecting expression of a structural gene in hosts that are compatible with such sequences.
  • Expression cassettes include at least promoters and optionally, transcription termination signals.
  • the recombinant expression cassette includes a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter (e.g., a dual promoter that contains a t ⁇ c promoter component and a g ⁇ l promoter component). Additional factors necessary or helpful in effecting expression may also be used as described herein.
  • an expression cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell.
  • isolated is meant to refer to material which is substantially or essentially free from components which normally accompany the material as found in its native state.
  • an isolated protein for example, does not include materials normally associated with their in situ environment.
  • isolated proteins of the invention are at least about 80% pure, usually at least about 90%, and preferably at least about 95% pure as measured by band intensity on a silver stained gel or other method for determining purity.
  • polypeptides are purified from transgenic cells.
  • UDPglucose-4-epimerase (E.C. 5.1.3.2) is also known as UDPgalactose-4- epimerase and phosphoribulose epimerase. These terms are used interchangeably herein.
  • Two polynucleotides or polypeptides are said to be "identical” if the sequence of nucleotides or amino acid residues in the two sequences is the same when aligned for maximum correspondence.
  • Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman Proc. Natl.
  • polypeptide identity as applied to polypeptides means that a polypeptide comprises a sequence that is at least 80% identical, preferably 90%, more preferably 95% or more, to a reference sequence over a comparison window of about 20 residues to about 600 residues—typically about 50 to about 500 residues usually about 250 to 300 residues. The values of percent identity are determined using the programs above.
  • substantially identical or “substantial sequence identity” as applied to nucleic acid sequences and as used herein denote a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 85 percent sequence identity, preferably at least 90 to 95 percent sequence identity, and more preferably at least 99 percent sequence identity as compared to a reference sequence over a comparison window of at least 20 nucleotide positions, frequently over a window of at least 25-50 nucleotides, wherein the percentage of sequence identity is calculated by comparing the reference sequence to the polynucleotide sequence which may include deletions or additions which total 20 percent or less of the reference sequence over the window of comparison.
  • the reference sequence may be a subset of a larger sequence.
  • stringent conditions are sequence-dependent and will be different in different circumstances.
  • stringent conditions are selected to be about 5° to about 20° C, usually about 10° C to about 15° C, lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
  • Tm is the temperature (under defined ionic strength and pH) at which 50%o of the target sequence hybridizes to a perfectly matched probe.
  • stringent conditions will be those in which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least about 60° C.
  • stringent conditions will include an initial wash in 6X SSC at 42° C followed by one or more additional washes in 0.2X SSC at a temperature of at least about 55° C, typically about 60° C and often about 65° C.
  • Nucleotide sequences are also substantially identical for purposes of this invention when the polypeptides which they encode are substantially identical. Thus, where one nucleic acid sequence encodes essentially the same polypeptide as a second nucleic acid sequence, the two nucleic acid sequences are substantially identical, even if they would not hybridize under stringent conditions due to silent substitutions permitted by the genetic code (see, Darnell et al. (1990) Molecular Cell Biology, Second Edition Scientific American Books W.H. Freeman and Company New York for an explanation of codon degeneracy and the genetic code).
  • Protein purity or homogeneity can be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualization upon staining. For certain purposes high resolution will be needed and HPLC or a similar means for purification utilized.
  • the practice of this invention involves the construction of recombinant nucleic acids and the expression of genes in transfected bacterial cells.
  • Molecular cloning techniques to achieve these ends are known in the art.
  • a wide variety of cloning and in vitro amplification methods suitable for the construction of recombinant nucleic acids such as expression vectors are well-known to persons of skill. Examples of these techniques and instructions sufficient to direct persons of skill through many cloning exercises are found in Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology volume 152 Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in Molecular Biology, F.M.
  • the invention provides expression cassettes that are useful for expressing recombinant genes in bacterial cells, such as E. coli, at high levels. Also provided are vectors that include the expression cassettes, as well as fermentation protocols for using the expression cassettes to obtain expression of a desired heterologous protein.
  • the expression cassettes of the invention contain a dual promoter operably linked to a heterologous nucleic acid that encodes a desired gene product, typically a polypeptide.
  • the dual promoters include a tac promoter component linked to a promoter component obtained from a gene or genes that encode enzymes involved in galactose metabolism (e.g., a promoter from a UDPgalactose 4-epimerase gene (galE)).
  • the dual tac-gal promoter provides a level of expression that is greater than that provided by either promoter alone.
  • the dual promoter can have a synergistic effect, having a greater than additive effect on expression level.
  • the expression cassettes can include other sequences such as ribosome binding sites for translational initiation and transcription/translation terminator sequences.
  • one or more selectable marker genes e.g., antibiotic-resistance genes
  • the vectors may comprise other sequences to allow the vector to be cloned in prokaryotic hosts, such as a broad host range prokaryote origin of replication.
  • prokaryotic hosts such as a broad host range prokaryote origin of replication.
  • tac promoter which is a combination of the lac and trp promoters.
  • An example of an expression vector that contains the tac promoter is pKK223-3 (Brosius and Holy, Proc. Nat 'I. Acad. Sci. USA 81 : 6929 (1984)); this vector is commercially available (Pharmacia Biotech, Inc., Piscataway NJ).
  • Variants of the tac promoter such as trc (Amann et al., Gene 69: 301 (1988), are also useful as a first component of the claimed dual promoters.
  • a second component of the dual promoters of the invention is a promoter obtained from a gene or genes that encode enzymes involved in galactose metabolism. In bacteria, such genes are often clustered, with more than one gal gene present in close proximity to others.
  • UDPgalactose-4-epimerase also known as UDPglucose-4-epimerase
  • mutarotase aldose 1- epimerase
  • coli four gal genes are linked (galE, gat ⁇ (UDPglucose-hexose-1- phosphate uridylyltransferase), galK (galactokinase), and galM) (Bouffard et al., J. Mol. Biol. 244: 269-278 (1994)).
  • the gal operon of Klebsiella pneumoniae also includes several genes (in the order galE, galT, and galK.) (Peng et al., J. Biochem. 112: 604- 608 (1992)), while that of Hemophilus influenzae includes, in order, galT, galK, and galM in a single operon (Maskell et al., Mol.
  • the gal operon of Streptomyces lividans has the gene order gall, galE, galK (Adams et al., J. Bacteriol. 170) 203-212 (1988)).
  • Galactose operons are expressed under the control of one or more promoters.
  • the S. lividans gal operon includes two promoters, one (galPl) that is galactose-inducible and directs transcription of the galT, galE, and galK genes and a second promoter (galP2) that is located within the operon just upstream of the galE gene and is constitutively expressed (Fornwald et al., Proc. Nat'l. Acad. Sci. USA 84: 2130-2134 (1987)).
  • the E. coli gal operon also includes, in addition to an inducible promoter, a constitutive promoter positioned upstream of the galE gene (Id.).
  • the dual promoter includes a promoter from a bacterial UDPgalactose 4-epimerase (galE) gene.
  • galE UDPgalactose 4-epimerase
  • UDPgalactose 4-epimerase gene described by Poolman et al. is a particular example of a gene from which one can obtain a promoter that is useful in the present invention.
  • Promoters from UDPglucose 4-epimerase genes of other organisms can be used in the present invention, so long as the promoters function in E. coli or other desired bacterial host cell.
  • Exemplary organisms that have genes encoding UDPglucose 4- epimerase include E. coli, K. pneumoniae, S. lividans, and E. stewartii, as well as Salmonella and Streptococcus species.
  • gal genes and their promoters may be accomplished by a number of techniques well known to those skilled in the art. For instance, oligonucleotide probes that selectively hybridize to the exemplified UDPglucose 4-epimerase gene or promoter described below can be used to identify the desired gene in DNA isolated from another organism. The use of such hybridization techniques for identifying homologous genes is well known in the art and need not be described further.
  • the promoters obtained are typically identical to or show substantial sequence identity to the exemplary glucose epimerase promoter described below.
  • polynucleotides having the nucleotide sequence of the desired promoter fragments can be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al., Cold Spring Harbor Symp. Quant. Biol. 47:411-418 (1982), and Adams et al, J. Am. Chem. Soc. 105:661 (1983). One can then obtain double stranded DNA fragments either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.
  • the tac and gal promoters that comprise the dual promoters of the invention can be identical to the corresponding promoters in the wild-type bacterial cells, or can be modified as desired, such as by insertion, deletion, or substitution of nucleotides. Such modifications will maintain those portions of the promoter sequences that are necessary for promoter function.
  • the -10 and -35 consensus sequences for promoters of gram-positive and gram-negative bacteria see, e.g., Graves et al., J. Biol. Chem. 261 : 11409- 11415 (1986); Singer and Berg, Genes & Genomes, University Science Books, Mill Valley, CA, 1991, pp. 140-143) will be maintained to the extent necessary to obtain expression.
  • Nucleotides that are important for proper function of a promoter in E. coli are shown, for example, in Singer and Berg at page 143. Regions of the promoter sequences that are not essential to promoter function can be modified as desired, for example, to facilitate cloning by inserting a restriction site adjacent to or within the promoter regions. Both the tac and the gal promoter components will generally have a binding site for the cAMP receptor protein (CRP, which is the product of the crp gene).
  • CRP cAMP receptor protein
  • cAMP is widely known as a signal for carbon source availability, with its levels being inversely correlated with the energetic state of the cell as evidenced by growth of cells on poor carbon sources (e.g., fructose, glycerol, acetate) eliciting higher cAMP levels than growth on a good carbon source (e.g., glucose).
  • cAMP regulates gene expression by binding to CRP. This high affinity binding produces a conformational change in the dimeric complex, which can then bind to a specific DNA site upstream of the binding site of RNA polymerase. Transcription is activated by accelerating the initial binding (increasing K B ) of the E ⁇ 70 form of RNA polymerase, at least in the case of the gal operon.
  • the tac promoter component of the dual promoters of the invention is generally located upstream of the gal promoter component.
  • the two promoter components are separated by about 0.1 to 2 kb of DNA. More preferably, about 0.5 to 1.5 kb separate the two promoter components. In a most preferred embodiment, the tac promoter component is located about 1 kb upstream of the gal promoter component.
  • the source of the DNA separating the two promoter components is not particularly critical, so long as the DNA does not contain sequences that interfere with gene expression, such as transcription terminators.
  • the tac promoter component is separated from the gal promoter component by DNA obtained from the native 5' flanking region of the gal promoter component. In a preferred embodiment, about one kb of DNA from the 5' flanking region of the S. thermophilus galE gene separate the tac promoter component from the gal promoter component.
  • the nucleotide sequence of a preferred dual promoter is shown in SEQ ID NO:l. As shown, the dual promoter is inserted into the Xbal site of the pPHOX2 expression vector, destroying the upstream Xbal site. Nucleotides 1-146 and 1490-1561 of the sequence are from pPHOX2. The -35 and -10 consensus sequences of the tac promoter are at nucleotides 362-367 and 384-389, respectively. The galE promoter consensus sequences are at nucleotides 1438-1443 (-35) and 1462-1467 (-10). A ribosome binding site (RBS) is found at nucleotides 1483-1488.
  • RBS ribosome binding site
  • the vectors of the invention can also contain a nucleic acid sequence that enables the vector to replicate independently in one or more selected host cells.
  • this sequence is one that enables the vector to replicate independently of the host chromosomal DNA, and includes origins of replication or autonomously replicating sequences.
  • origins of replication or autonomously replicating sequences are well known for a variety of bacteria. For instance, the origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria.
  • the vectors also comprise selectable marker genes to allow selection of bacterial cells bearing the desired construct. These genes encode a protein necessary for the survival or growth of transformed host cells grown in a selective culture medium. Host cells not transformed with the vector containing the selection gene will not survive in the culture medium. Typical selection genes encode proteins that confer resistance to antibiotics or other toxins, such as ampicillin, neomycin, kanamycin, chloramphenicol, or tetracycline. Alternatively, selectable markers may encode proteins that complement auxotrophic deficiencies or supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. A number of selectable markers are known to those of skill in the art and are described for instance in Sambrook et al.
  • a preferred selectable marker for use in using the dual tac-lac promoter to express a desired polypeptide is a kanamycin resistance marker (Vieira and Messing, Gene 19: 259 (1982)).
  • Use of kanamycin selection is advantageous over, for example, ampicillin selection because ampicillin is quickly degraded by ⁇ -lactamase in culture medium, thus removing selective pressure and allowing the culture to become overgrown with cells that do not contain the vector.
  • Plasmids containing one or more of the above listed components employs standard ligation techniques as described in the reference cited above. Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in the form desired to generate the plasmids required. To confirm correct sequences in plasmids constructed, the plasmids are analyzed by standard techniques such as by restriction endonuclease digestion, and/or sequencing according to known methods.
  • a number of bacterial host cells can be used with the vectors of the invention.
  • useful bacteria include Escherichia, Enter obacter, Azotobacter, Erwinia, Bacillus, Pseudomonas, Klebsielia, Proteus, Salmonella, Serratia, Shigella, Rhizobia, Vitreoscilla, and Paracoccus.
  • Suitable E. coli hosts include the following strains: JM101, RR1, DH5 ⁇ , and others. These examples are illustrative rather than limiting.
  • transformation is done using standard techniques appropriate to such cells. Suitable techniques include calcium treatment employing calcium chloride, polyethylene glycol, or electroporation.
  • the invention also provides methods for using the dual tac-gal promoters to obtain high level expression of a desired polypeptide.
  • Host cells are transformed with vectors containing the dual promoter expression cassettes and cultured in culture medium under conditions appropriate for expression of the desired polypeptide.
  • the cells can be grown in shake flasks or other containers, although for large-scale preparation of the polypeptide growth in a fermentor is preferred.
  • galactose is added to the nutrient medium at an appropriate time in the growth cycle to induce increased expression of the desired polypeptide.
  • growth of the host cells can be initiated in culture medium containing fructose (0.25%> final concentration) as the carbon source; other sugars (e.g., glycerol, acetate) that cause an increase in intracellular cAMP (adenosine 3',5'-cyclic monophosphate) concentration can also be used as a carbon source.
  • cAMP adenosine 3',5'-cyclic monophosphate
  • a solution of fructose and galactose final concentration 3%> fructose, 0.6%> galactose is added to the medium (in fed-batch mode for a fermentor).
  • the galactose increases the level of expression from the dual tac-gal promoter expression cassettes of the invention.
  • the feed rate of fructose/galactose solution can be increased during the growth cycle (in a stepped or ramped fashion) as the culture becomes dense with cell growth.
  • fructose/galactose solution is fed through the end of the growth cycle.
  • the dual promoters of the invention are useful for expression of any desired polypeptide or protein at very high yields.
  • the polypeptides may be homologous to the bacterial host cell, or preferably, are heterologous to the host cell.
  • Many polypeptides produced using the claimed dual promoters will be enzymatically active.
  • certain polypeptides, such as those that require glycosylation or other eukaryote-specific processing for activity may not be produced in active form in bacterial host cells, the inactive polypeptides nevertheless find use as, for example, immunogens for induction of antibodies, molecular weight markers, and the like.
  • Exemplary bacterial polypeptides that one can express using the dual promoters include ⁇ -lactamase, carbohydrate metabolizing enzymes, alkaline phosphatase, restriction enzymes, DNA and RNA polymerases, ligases, kinases, endo- and exonucleases, and the like.
  • Exemplary fungal polypeptides include ligninases, proteases, glycosyltransferases, and the like.
  • Exemplary mammalian polypeptides that one can express in bacterial host cells using the claimed dual promoters include hormones such as insulin, growth hormones (including human growth hormone and bovine growth hormone), tissue-type plasminogen activator (t-PA), renin, clotting factors such as factor VIII and factor IX, bombesin, thrombin, hemopoietic growth factor, serum albumin, receptors for hormones or growth factors, interleukins, colony stimulating factors, T-cell receptors, MHC polypeptides, viral antigens, glycosyltransferases, and the like.
  • hormones such as insulin, growth hormones (including human growth hormone and bovine growth hormone), tissue-type plasminogen activator (t-PA), renin, clotting factors such as factor VIII and factor IX, bombesin, thrombin, hemopoietic growth factor, serum albumin, receptors for hormones or growth factors, interleukins, colony stimulating factors, T-cell receptors, MHC polypeptid
  • This list of enzymes is exemplary, not exclusive, as the dual promoters of the invention are useful for obtaining transcription of any nucleic acid expression unit that is operably linked to the dual promoters.
  • Such expression units include not only those that encode polypeptides, but also those for which the desired product is a nucleic acid, for example, an antisense RNA.
  • the vectors are particularly useful for expressing enzymes that are useful in the enzymatic synthesis of carbohydrates.
  • the use of enzymatic synthesis of carbohydrate offers advantages over chemical methods due to the virtually complete stereoselectivity and linkage specificity offered by enzymes (Ito et al, Pure Appl. Chem., 65:753 (1993); U.S. Patents 5,352,670, and 5,374,541).
  • a number of glycosyltransferase cycles (for example, sialyltransferase cycles, galactosyltransferase cycles, and fucosyltransferase cycles) are described in U.S. Patent No. 5,374,541 and WO 9425615 A.
  • Exemplary enzymes useful in the synthesis of carbohydrates that one can express using the claimed dual promoters also include CMP-sialic acid synthetase, UDP-glucose pyrophosphorylase, adenylate kinase, pyruvate kinase, sialic acid aldolase, UDP-GlcNAc pyrophosphorylase, myokinase, galactosyltransferases, glycosyltransferases encoded by the los locus o ⁇ Neisseria gonorrhoeae (see, e.g., international application WO 96/10086) and N-acetyl glucosaminyltransferases.
  • Any of the enzymes described in these references and used in these cycles can be recombinantly expressed using the vectors of the invention.
  • a typical example of a glycosyltransferase cycle for which the required enzymes can be produced using the claimed dual tac-gal promoters is a galactosyltransferase cycle.
  • the reaction medium for a galactosyltransferase cycle will preferably contain, in addition to a galactosyltransferase, donor substrate, acceptor sugar and divalent metal cation, a donor substrate recycling system comprising at least 1 mole of glucose- 1 -phosphate per each mole of acceptor sugar, a phosphate donor, a kinase capable of transferring phosphate from the phosphate donor to nucleoside diphosphates, and a pyrophosphorylase capable of forming UDP-glucose from UTP and glucose- 1 -phosphate and catalytic amounts of UDP and a UDP-galactose-4-epimerase.
  • a galactosyltransferase is the principal enzyme in this cycle.
  • Exemplary galactosyltransferases include ⁇ (l,3) galactosyltransferase, ⁇ (l,4) galactosyltransferase (E.C. No. 2.4.1.90, see, e.g., Narimatsu et al, Proc. Nat 'I. Acad. Sci. USA 83: 4720-4724 (1986)), ⁇ (l,3) galactosyltransferase (E.C. No. 2.4.1.151, see, e.g., Dabkowski et al, Transplant Proc.
  • kinase for example, pyruvate kinase
  • epimerase for example, UDP-galactose-4-epimerase
  • pyrophosphorylase for example, glucose pyrophosphorylase
  • the DNA encoding the polypeptide of interest may be expressed as a fusion with another polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature polypeptide.
  • the signal sequence may be a component of the vector, or it may be a part of the polypeptide DNA that is inserted into the vector.
  • the heterologous signal sequence selected should be one that is recognized and processed (i.e., cleaved by a signal peptidase) by the host cell.
  • the signal sequence is substituted by a bacterial signal sequence.
  • a signal sequence can facilitate purification of the desired polypeptide by directing secretion of the desired protein from the cell into the extracellular medium.
  • the polypeptides produced by prokaryote cells may not necessarily fold properly.
  • the expressed polypeptides may first be denatured and then renatured. This can be accomplished by solubilizing the bacterially produced proteins in a chaotropic agent such as guanidine HC1 and reducing all the cysteine residues with a reducing agent such as beta-mercaptoethanol.
  • the polypeptides are then renatured, either by slow dialysis or by gel filtration.
  • Detection of expressed polypeptides is achieved by methods known in the art as radioimmunoassays, Western blotting techniques, immunoprecipitation, or activity assays. Purification from E. coli can be achieved following procedures described in U.S. Patent No. 4,511,503.
  • Plasmid pPHOX2 ( Figure 1) comprises a phosphate-starvation inducible promoter of the alkaline phosphatase gene (phoA), which increases transcription of genes under its control when phosphate levels become extremely low.
  • This plasmid contains a phoA promoter as described in WO 94/12636, as well as a rrnB ribosomal terminator (obtained from pKK223-3, Pharmacia Biotech).
  • the galactose-inducible promoter from the UDP-galactose-4-epimerase gene (galE) of Streptococcus thermophilus (Poolman et al, J. Bacteriol.
  • the expression plasmid pTGK was constructed as follows. First, a fragment of the plasmid pHPl/tac (described in Poolman et al, J. Bacteriol. 172:4037-4047 (1990)) was amplified by polymerase chain reaction (PCR) using Pfu polymerase and Xbal primers at the 5' and 3' ends.
  • PCR polymerase chain reaction
  • the amplified fragment contained a tac promoter approximately one kb upstream of the galactose-inducible promoter from the UDPgalactose-4-epimerase gene (galE) of Streptococcus thermophilus (Poolman et al, supra.).
  • the 5' primer (5'-1)
  • pPHOX2/galE The orientation of the dual tac-galE promoter was checked by BamHl digestion. The resulting plasmid is called pPHOX2/galE.
  • kanamycin resistance gene pPHOX2 has an ampicillin-resistance gene encoded by ⁇ -lactamase. Ampicillin is added to the culture to maintain the plasmid, but it is quickly degraded by ⁇ -lactamase, losing its effectiveness. In cells with a strong selective pressure against making the recombinant protein (e.g., as with CMP-sialic acid synthetase), overgrowth of cells without the plasmid can occur. To alleviate this problem, the plasmid was re-engineered to include a kanamycin resistance (Kan r ) gene, which gives a stronger selection since the encoded protein acts at the level of the membrane transport system.
  • Kan r kanamycin resistance
  • the 1.3 kb Kan r gene from plasmid pUC4K (Vieira and Messing, Gene 19:259 (1982)) was digested EcoRI and inserted into the unique Ec RI site of the pPUOXllgalE plasmid. Colonies were selected by kanamycin resistance. The resulting plasmid is called pPHOX2/galE/Kan ( Figure 3).
  • An oligonucleotide (ATGCATAAACTTTTGCCATTCTCAC; ⁇ H3 (SEQ ID NO:7)) was designed to change the AAG codon of HmdIII (AAGCTT) to delete the restriction site but keep the same amino acid codon (lysine, AAA).
  • the first PCR reaction amplified DNA from plasmid pUC4K using the ⁇ H3 oligonucleotide and the Ml 3 forward primer (New England Biolabs), generating a 620 bp fragment.
  • a second PCR reaction amplified DNA from pUC4K using the Ml 3 reverse primer (New England Biolabs) and the fragment from the first PCR, generating a 1.3 kb fragment.
  • the second fragment was further amplified by PCR with the forward and reverse primers.
  • This fragment was digested with Ec ⁇ RI (and HmdIII to cut nonrecombinants) and ligated to an isolated linear fragment of a partial EcoBI digest of plasmid pP ⁇ OX2/galE ⁇ Xba.
  • a NhellHin ⁇ lll digest was used to determine the correct insertion site of the EcoRl fragment.
  • This vector which is called pTGK ( Figure 4), was deposited with the American Type Culture Collection on May 22, 1996 and has been assigned Accession No. 98059.
  • pTGK pTGK vector
  • pTGK is digested with Xbal and the ends are filled in with Klenow polymerase.
  • a CCCGGG oligonucleotide is ligated to the blunt ends, recircularizing the plasmid, after which it is digested with Xbal to remove non-recombinants.
  • Digestion with Srfi or Smal which make a blunt-ended cut between the CCC and GGG, results in blunt ends to which one can ligate a blunt-ended fragment obtained by PCR or other methods.
  • Srfi or Smal which make a blunt-ended cut between the CCC and GGG
  • This vector is called pTGKS.
  • the tac promoter has a significant effect on expression levels
  • tac promoter contributes to expression of genes under the control of the dual tac-gal promoter.
  • constructs expressing GlcNAc transferase or Gal transferase (Gotschlich, E.C, J. Exp. Med. 180: 2181- 2190 (1994)).
  • the galE promoter was present in all constructs. Strains were grown in M9 + fructose + galactose and assayed for GlcNAc or Gal transferase activity. Results, shown in Table 3, demonstrate that the tac promoter contributes significantly to expression levels.
  • the tac promoter was deleted from the pyruvate kinase construct. This construct was transformed into E. coli and compared to strains containing either both promoters or the tac promoter only.
  • the results of this experiment demonstrate that the combined contribution of the tac and galE promoters is greater than the sum of their individual activities. Addition of galactose increases expression levels.
  • the E. coli expression vector pTGK has been used to produce numerous recombinant proteins, including CMP-sialic acid synthetase from E. coli, UDP-glucose pyrophosphorylase from Bacillus subtilis, adenylate kinase from E. coli, pyruvate kinase from Bacillus stear other mophilus, sialic acid aldolase from E. coli, UDP-GlcNAc pyrophosphorylase from E.
  • cells are grown initially in medium containing a small amount of fructose as the carbon source. Once the cells are proliferating, usually about 5-6 hours after inoculation, a solution of galactose and fructose is fed into the medium in fed-batch mode. The feed rate can be increased during the fermentation (in a stepped or ramped fashion) as the culture becomes dense with cell growth. The carbon source feed continues through to the end of the fermentation. If desired, the polypeptide of interest is then purified from the medium (in the case of a secreted protein) or from the harvested cells.
  • ATACTCGCCT TATGCCTACT TTAACGAACA TTCTATCTTC TTTTATGGTA AGCACGAACC 660
  • MOLECULE TYPE DNA

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Plant Pathology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Virology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Image Generation (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)

Abstract

The present invention provides isolated recombinant nucleic acid constructs comprising a dual bacterial promoter operably linked to a heterologous nucleic acid which encodes a desired polypeptide. The constructs are useful for the expression of the desired polypeptides in bacterial cells at high levels.

Description

IMPROVED EXPRESSION VECTORS
CROSS REFERENCE TO RELATED APPLICATIONS
This application is a continuation-in-part of US Provisional Application No. 60/029,545, filed November 8, 1996, which is incorporated herein by reference.
FIELD OF THE INVENTION
The present invention relates to improved vectors and fermentation protocols for expression of recombinant proteins in bacterial cells. The invention can be used for expression of any desired protein, including enzymes useful in the enzymatic synthesis of oligosaccharides.
BACKGROUND OF THE INVENTION The production of biologically active polypeptides and proteins is important economically for the manufacture of human and animal pharmaceutical formulations, enzymes, and other specialty chemicals. Recombinant DNA techniques using bacterial, fungal, mammalian, or insect cells as expression hosts are particularly useful means for producing large quantities of polypeptides. Recombinant production of desired proteins generally involves transfecting host cells with an expression vector that contains signals which, when operably linked to a gene encoding the protein, control expression of the gene. The cells are grown under conditions suitable for expression of the recombinant protein. The expression control signals typically include a promoter, which influences the rate at which a gene located downstream of the promoter is transcribed into RNA and determines the transcriptional start site. Expression control signals are chosen so as to be functional in the host cell used for production of the desired protein. For instance, the bacterium E. coli is commonly used to produce recombinant proteins in high yields. Numerous references disclose methods of using E. coli and other bacteria to produce proteins recombinantly. (see, e.g., U.S. Pat. No. 4,565,785; U.S. Pat. No. 4,673,641; U.S. Pat. No. 4,738,921; U.S. Pat. No. 4,795,706; and U.S. Pat. No. 4,710,473). For recombinantly produced proteins that are intended for commercial use, in particular, it is desirable to obtain a high level of expression of the desired protein from the host cells. Increasing the amount of desired protein produced per cell can reduce costs of production due to the decreased volume of cells that must be grown to obtain a given amount of product, and also can facilitate purification because the desired product makes up a larger percentage of the total protein produced by the host cells. Therefore, a need exists for expression control signals that are capable of expressing a desired protein at high levels. The present invention fulfills this and other needs.
SUMMARY OF THE INVENTION The present invention provides isolated recombinant nucleic acid constructs that comprise a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide. The constructs are useful for expressing a desired polypeptide in bacterial host cells at high levels. The dual promoters comprise a first component derived from a tαc-related promoter and a second promoter component obtained from a bacterial gene or operon that encodes an enzyme or enzymes involved in galactose metabolism. A number of galactose promoters can be used as the second promoter component; an exemplary promoter is a UDPgalactose-4-epimerase (also known as UDPglucose-4-epimerase) promoter such as that derived from Streptococcus thermophilus. Also provided by the invention are expression vectors which include a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide. The expression vectors can further comprise other components such as a selectable marker. The constructs can also comprise an origin of replication sequence that functions in E. coli or other host cell. A preferred construct of the invention, the plasmid pTGK (described in detail below) was deposited with the American Type Culture Collection under Accession No. 98059 on May 22, 1996.
The invention also provides a bacterial cell that contains a recombinant expression cassette that includes the dual bacterial promoter unit operably linked to a heterologous nucleic acid. The expression cassette can be integrated into the genome of the host cell or be present on an independently replicating plasmid. A preferred bacterial cell is E. coli. The invention further provides methods of making a desired polypeptide. The methods involve culturing in an appropriate medium bacterial cells that contain a recombinant expression cassette having a dual bacterial promoter unit operably linked to a heterologous nucleic acid under conditions that allow expression of the heterologous nucleic acid. Typically E. coli are used as the host cell. The methods and constructs can be used to express a wide variety of polypeptides in bacterial cells. Exemplary polypeptides include hormones, growth factors, and the like, as well as enzymes useful in the synthesis of carbohydrates. Such enzymes include CMP-sialic acid synthetase, UDP-glucose pyrophosphorylase, adenylate kinase, pyruvate kinase, sialic acid aldolase, UDP-GlcNAc pyrophosphorylase, and myokinase, galactosyltransferase, and N-acetyl glucosaminyltransferase.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a map of the plasmid pPHOX2.
Figure 2 shows the spacing between the ribosome binding site (Shine- Dalgarno sequence) and the initiation codon of the recombinant gene in the vectors of the invention (SEQ ID NO:2).
Figure 3 is a map of the plasmid pPHOX2/galE/Kan.
Figure 4 is a map of the plasmid pTGK. This plasmid is essentially the same as pPHOX2/galE/Kan, except that the 5' Xbal site of the promoter region and the Hinάlll site in the kanamycin resistance gene (kan1) have been deleted.
Figures 5A and 5B show that one can maintain optimal spacing between the ribosome binding site and the initiation codon of the gene to be expressed in vectors of the invention by amplifying the DNA to be expressed using as primers oligonucleotides that begin with the ATG initiation codon. Figure 5A shows the relationship between the ribosome binding site (RBS) and the Srfi restriction site in the plasmid pTGKS (SEQ ID
NO: 3). Digestion with Srfi leaves a blunt end to which a blunt-ended recombinant gene that begins with the ATG initiation codon can be ligated as shown in Figure 5B (SEQ ID NO:4). Figure 6 shows the nucleotide sequence of one embodiment of a dual tac-gal promoter of the invention, flanked by pPHOX2 sequences, as indicated (SEQ ID NO:l). The -35 and -10 consensus sequences of both the tac and galE promoters are shown, as are the locations of Xbal and Hwdlll sites which are useful for inserting a recombinant gene for expression.
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides improved promoters and vectors for the recombinant expression of desired polypeptides. The promoters and vectors of the invention are particularly suited for the expression of recombinant proteins in bacterial hosts, such as E. coli. Also provided are fermentation protocols for obtaining high levels of recombinant protein expression using the promoters and vectors of the invention.
Definitions Much of the nomenclature and general laboratory procedures required in this application can be found in Sambrook et al. , Molecular Cloning: A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989. The manual is hereinafter referred to as "Sambrook et al."
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Singleton et al. (1994) Dictionary of Microbiology and Molecular Biology, second edition, John Wiley and Sons (New York) provides one of skill with a general dictionary of many of the terms used in this invention. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are described. For purposes of the present invention, the following terms are defined below.
The term "nucleic acid" refers to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to nucleic acids in manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence includes the complementary sequence thereof.
The term "operably linked" refers to functional linkage between a nucleic acid expression control sequence (such as a promoter, signal sequence, or array of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence affects transcription and/or translation of the nucleic acid corresponding to the second sequence. The term "recombinant" when used with reference to a cell indicates that the cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a heterologous nucleic acid. Recombinant cells can express genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also express genes that are found in the native form of the cell, but wherein the genes are modified and re-introduced into the cell by artificial means.
A "heterologous sequence" or a "heterologous nucleic acid", as used herein, is one that originates from a foreign source (or species) or, if from the same source, is modified from its original form. Thus, a heterologous nucleic acid operably linked to a promoter is from a source different from that from which the promoter was derived, or, if from the same source, is modified from its original form. For example, a UDPglucose 4-epimerase gene promoter can be linked to a structural gene encoding a polypeptide other than native UDPglucose 4-epimerase. Modification of the heterologous sequence may occur, e.g., by treating the DNA with a restriction enzyme to generate a DNA fragment that is capable of being operably linked to the promoter. Techniques such as site-directed mutagenesis are also useful for modifying a heterologous sequence.
A "recombinant expression cassette" or simply an "expression cassette" is a nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements that are capable of affecting expression of a structural gene in hosts that are compatible with such sequences. Expression cassettes include at least promoters and optionally, transcription termination signals. Typically, the recombinant expression cassette includes a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter (e.g., a dual promoter that contains a tαc promoter component and a gαl promoter component). Additional factors necessary or helpful in effecting expression may also be used as described herein. For example, an expression cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. The term "isolated" is meant to refer to material which is substantially or essentially free from components which normally accompany the material as found in its native state. Thus, an isolated protein, for example, does not include materials normally associated with their in situ environment. Typically, isolated proteins of the invention are at least about 80% pure, usually at least about 90%, and preferably at least about 95% pure as measured by band intensity on a silver stained gel or other method for determining purity. In the present invention polypeptides are purified from transgenic cells.
UDPglucose-4-epimerase (E.C. 5.1.3.2) is also known as UDPgalactose-4- epimerase and phosphoribulose epimerase. These terms are used interchangeably herein. Two polynucleotides or polypeptides are said to be "identical" if the sequence of nucleotides or amino acid residues in the two sequences is the same when aligned for maximum correspondence. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection.
The term "substantial identity" as applied to polypeptides means that a polypeptide comprises a sequence that is at least 80% identical, preferably 90%, more preferably 95% or more, to a reference sequence over a comparison window of about 20 residues to about 600 residues—typically about 50 to about 500 residues usually about 250 to 300 residues. The values of percent identity are determined using the programs above.
The terms "substantial identity" or "substantial sequence identity" as applied to nucleic acid sequences and as used herein denote a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 85 percent sequence identity, preferably at least 90 to 95 percent sequence identity, and more preferably at least 99 percent sequence identity as compared to a reference sequence over a comparison window of at least 20 nucleotide positions, frequently over a window of at least 25-50 nucleotides, wherein the percentage of sequence identity is calculated by comparing the reference sequence to the polynucleotide sequence which may include deletions or additions which total 20 percent or less of the reference sequence over the window of comparison. The reference sequence may be a subset of a larger sequence.
Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Stringent conditions are sequence-dependent and will be different in different circumstances. Generally, stringent conditions are selected to be about 5° to about 20° C, usually about 10° C to about 15° C, lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50%o of the target sequence hybridizes to a perfectly matched probe. Typically, stringent conditions will be those in which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least about 60° C. For instance in a standard Southern hybridization procedure, stringent conditions will include an initial wash in 6X SSC at 42° C followed by one or more additional washes in 0.2X SSC at a temperature of at least about 55° C, typically about 60° C and often about 65° C.
Nucleotide sequences are also substantially identical for purposes of this invention when the polypeptides which they encode are substantially identical. Thus, where one nucleic acid sequence encodes essentially the same polypeptide as a second nucleic acid sequence, the two nucleic acid sequences are substantially identical, even if they would not hybridize under stringent conditions due to silent substitutions permitted by the genetic code (see, Darnell et al. (1990) Molecular Cell Biology, Second Edition Scientific American Books W.H. Freeman and Company New York for an explanation of codon degeneracy and the genetic code).
Protein purity or homogeneity can be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualization upon staining. For certain purposes high resolution will be needed and HPLC or a similar means for purification utilized.
The practice of this invention involves the construction of recombinant nucleic acids and the expression of genes in transfected bacterial cells. Molecular cloning techniques to achieve these ends are known in the art. A wide variety of cloning and in vitro amplification methods suitable for the construction of recombinant nucleic acids such as expression vectors are well-known to persons of skill. Examples of these techniques and instructions sufficient to direct persons of skill through many cloning exercises are found in Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology volume 152 Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in Molecular Biology, F.M. Ausubel et al, eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1994 Supplement) (Ausubel). Examples of protocols sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated techniques are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36- 47; The Journal Of NIH Research (1991) 3, 81-94; (Kwoh et al. (1989) Proc. Natl. Acad. Set USA 86, 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al.
(1989) J Clin. Chem. 35, 1826; Landegren et al, (1988) Science 241, 1077-1080; Van Brunt (1990) Biotechnology 8, 291-294; Wu and Wallace, (1989) Gene 4, 560; and Barringer et al.
(1990) Gene 89, 117. Improved methods of cloning in vitro amplified nucleic acids are described in Wallace et al, U.S. Pat. No. 5,426,039.
Description of the Invention
The invention provides expression cassettes that are useful for expressing recombinant genes in bacterial cells, such as E. coli, at high levels. Also provided are vectors that include the expression cassettes, as well as fermentation protocols for using the expression cassettes to obtain expression of a desired heterologous protein. The expression cassettes of the invention contain a dual promoter operably linked to a heterologous nucleic acid that encodes a desired gene product, typically a polypeptide. The dual promoters include a tac promoter component linked to a promoter component obtained from a gene or genes that encode enzymes involved in galactose metabolism (e.g., a promoter from a UDPgalactose 4-epimerase gene (galE)). The dual tac-gal promoter provides a level of expression that is greater than that provided by either promoter alone. The dual promoter can have a synergistic effect, having a greater than additive effect on expression level. To obtain high level expression of a cloned gene, the expression cassettes can include other sequences such as ribosome binding sites for translational initiation and transcription/translation terminator sequences. To allow selection of cells comprising the constructs, one or more selectable marker genes (e.g., antibiotic-resistance genes) are conveniently included in the expression vectors. The vectors may comprise other sequences to allow the vector to be cloned in prokaryotic hosts, such as a broad host range prokaryote origin of replication. One of skill will recognize that each of these vector components can be modified without substantially affecting their function.
One component of the dual promoters of the invention is a tac promoter, which is a combination of the lac and trp promoters. An example of an expression vector that contains the tac promoter is pKK223-3 (Brosius and Holy, Proc. Nat 'I. Acad. Sci. USA 81 : 6929 (1984)); this vector is commercially available (Pharmacia Biotech, Inc., Piscataway NJ). Variants of the tac promoter, such as trc (Amann et al., Gene 69: 301 (1988), are also useful as a first component of the claimed dual promoters.
A second component of the dual promoters of the invention is a promoter obtained from a gene or genes that encode enzymes involved in galactose metabolism. In bacteria, such genes are often clustered, with more than one gal gene present in close proximity to others. For example, in Streptococcus thermophilus the genes encoding UDPgalactose-4-epimerase (galE; also known as UDPglucose-4-epimerase) and aldose 1- epimerase (mutarotase) (galM) are closely linked (Poolman et al., J. Bacteriol 172:4037- 4047 (1990)). In E. coli, four gal genes are linked (galE, gatϊ (UDPglucose-hexose-1- phosphate uridylyltransferase), galK (galactokinase), and galM) (Bouffard et al., J. Mol. Biol. 244: 269-278 (1994)). Similarly, the gal operon of Klebsiella pneumoniae also includes several genes (in the order galE, galT, and galK.) (Peng et al., J. Biochem. 112: 604- 608 (1992)), while that of Hemophilus influenzae includes, in order, galT, galK, and galM in a single operon (Maskell et al., Mol. Microbiol. 6: 3051-3063 (1992)) and the gal operon of Streptomyces lividans has the gene order gall, galE, galK (Adams et al., J. Bacteriol. 170) 203-212 (1988)). Galactose operons are expressed under the control of one or more promoters. For example, the S. lividans gal operon includes two promoters, one (galPl) that is galactose-inducible and directs transcription of the galT, galE, and galK genes and a second promoter (galP2) that is located within the operon just upstream of the galE gene and is constitutively expressed (Fornwald et al., Proc. Nat'l. Acad. Sci. USA 84: 2130-2134 (1987)). The E. coli gal operon also includes, in addition to an inducible promoter, a constitutive promoter positioned upstream of the galE gene (Id.).
In a preferred embodiment, the dual promoter includes a promoter from a bacterial UDPgalactose 4-epimerase (galE) gene. The Streptococcus thermophilus
UDPgalactose 4-epimerase gene described by Poolman et al. (J. Bacteriol 172: 4037-4047 (1990)) is a particular example of a gene from which one can obtain a promoter that is useful in the present invention. Promoters from UDPglucose 4-epimerase genes of other organisms can be used in the present invention, so long as the promoters function in E. coli or other desired bacterial host cell. Exemplary organisms that have genes encoding UDPglucose 4- epimerase include E. coli, K. pneumoniae, S. lividans, and E. stewartii, as well as Salmonella and Streptococcus species.
The isolation of gal genes and their promoters may be accomplished by a number of techniques well known to those skilled in the art. For instance, oligonucleotide probes that selectively hybridize to the exemplified UDPglucose 4-epimerase gene or promoter described below can be used to identify the desired gene in DNA isolated from another organism. The use of such hybridization techniques for identifying homologous genes is well known in the art and need not be described further. The promoters obtained are typically identical to or show substantial sequence identity to the exemplary glucose epimerase promoter described below.
Alternatively, polynucleotides having the nucleotide sequence of the desired promoter fragments can be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al., Cold Spring Harbor Symp. Quant. Biol. 47:411-418 (1982), and Adams et al, J. Am. Chem. Soc. 105:661 (1983). One can then obtain double stranded DNA fragments either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.
The tac and gal promoters that comprise the dual promoters of the invention can be identical to the corresponding promoters in the wild-type bacterial cells, or can be modified as desired, such as by insertion, deletion, or substitution of nucleotides. Such modifications will maintain those portions of the promoter sequences that are necessary for promoter function. For example, the -10 and -35 consensus sequences for promoters of gram-positive and gram-negative bacteria (see, e.g., Graves et al., J. Biol. Chem. 261 : 11409- 11415 (1986); Singer and Berg, Genes & Genomes, University Science Books, Mill Valley, CA, 1991, pp. 140-143) will be maintained to the extent necessary to obtain expression. Nucleotides that are important for proper function of a promoter in E. coli are shown, for example, in Singer and Berg at page 143. Regions of the promoter sequences that are not essential to promoter function can be modified as desired, for example, to facilitate cloning by inserting a restriction site adjacent to or within the promoter regions. Both the tac and the gal promoter components will generally have a binding site for the cAMP receptor protein (CRP, which is the product of the crp gene). cAMP is widely known as a signal for carbon source availability, with its levels being inversely correlated with the energetic state of the cell as evidenced by growth of cells on poor carbon sources (e.g., fructose, glycerol, acetate) eliciting higher cAMP levels than growth on a good carbon source (e.g., glucose). cAMP regulates gene expression by binding to CRP. This high affinity binding produces a conformational change in the dimeric complex, which can then bind to a specific DNA site upstream of the binding site of RNA polymerase. Transcription is activated by accelerating the initial binding (increasing KB) of the Eσ70 form of RNA polymerase, at least in the case of the gal operon.
The tac promoter component of the dual promoters of the invention is generally located upstream of the gal promoter component. Generally, the two promoter components are separated by about 0.1 to 2 kb of DNA. More preferably, about 0.5 to 1.5 kb separate the two promoter components. In a most preferred embodiment, the tac promoter component is located about 1 kb upstream of the gal promoter component. The source of the DNA separating the two promoter components is not particularly critical, so long as the DNA does not contain sequences that interfere with gene expression, such as transcription terminators. For example, in one embodiment the tac promoter component is separated from the gal promoter component by DNA obtained from the native 5' flanking region of the gal promoter component. In a preferred embodiment, about one kb of DNA from the 5' flanking region of the S. thermophilus galE gene separate the tac promoter component from the gal promoter component.
The nucleotide sequence of a preferred dual promoter is shown in SEQ ID NO:l. As shown, the dual promoter is inserted into the Xbal site of the pPHOX2 expression vector, destroying the upstream Xbal site. Nucleotides 1-146 and 1490-1561 of the sequence are from pPHOX2. The -35 and -10 consensus sequences of the tac promoter are at nucleotides 362-367 and 384-389, respectively. The galE promoter consensus sequences are at nucleotides 1438-1443 (-35) and 1462-1467 (-10). A ribosome binding site (RBS) is found at nucleotides 1483-1488. To facilitate insertion of a gene to be expressed downstream of the dual promoter, the RBS is followed by a Xbal restriction site and a Hindlll site is present in the pPHOX2 sequence just 3' of the Xbal site. The vectors of the invention can also contain a nucleic acid sequence that enables the vector to replicate independently in one or more selected host cells. Generally, this sequence is one that enables the vector to replicate independently of the host chromosomal DNA, and includes origins of replication or autonomously replicating sequences. Such sequences are well known for a variety of bacteria. For instance, the origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria.
The vectors also comprise selectable marker genes to allow selection of bacterial cells bearing the desired construct. These genes encode a protein necessary for the survival or growth of transformed host cells grown in a selective culture medium. Host cells not transformed with the vector containing the selection gene will not survive in the culture medium. Typical selection genes encode proteins that confer resistance to antibiotics or other toxins, such as ampicillin, neomycin, kanamycin, chloramphenicol, or tetracycline. Alternatively, selectable markers may encode proteins that complement auxotrophic deficiencies or supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. A number of selectable markers are known to those of skill in the art and are described for instance in Sambrook et al. , supra. A preferred selectable marker for use in using the dual tac-lac promoter to express a desired polypeptide is a kanamycin resistance marker (Vieira and Messing, Gene 19: 259 (1982)). Use of kanamycin selection is advantageous over, for example, ampicillin selection because ampicillin is quickly degraded by β-lactamase in culture medium, thus removing selective pressure and allowing the culture to become overgrown with cells that do not contain the vector.
Construction of suitable vectors containing one or more of the above listed components employs standard ligation techniques as described in the reference cited above. Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in the form desired to generate the plasmids required. To confirm correct sequences in plasmids constructed, the plasmids are analyzed by standard techniques such as by restriction endonuclease digestion, and/or sequencing according to known methods.
A number of bacterial host cells can be used with the vectors of the invention. Examples of useful bacteria include Escherichia, Enter obacter, Azotobacter, Erwinia, Bacillus, Pseudomonas, Klebsielia, Proteus, Salmonella, Serratia, Shigella, Rhizobia, Vitreoscilla, and Paracoccus. Suitable E. coli hosts include the following strains: JM101, RR1, DH5α, and others. These examples are illustrative rather than limiting. Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. Suitable techniques include calcium treatment employing calcium chloride, polyethylene glycol, or electroporation. The invention also provides methods for using the dual tac-gal promoters to obtain high level expression of a desired polypeptide. Host cells are transformed with vectors containing the dual promoter expression cassettes and cultured in culture medium under conditions appropriate for expression of the desired polypeptide. The cells can be grown in shake flasks or other containers, although for large-scale preparation of the polypeptide growth in a fermentor is preferred. To obtain the maximum level of expression, galactose is added to the nutrient medium at an appropriate time in the growth cycle to induce increased expression of the desired polypeptide. For example, growth of the host cells can be initiated in culture medium containing fructose (0.25%> final concentration) as the carbon source; other sugars (e.g., glycerol, acetate) that cause an increase in intracellular cAMP (adenosine 3',5'-cyclic monophosphate) concentration can also be used as a carbon source. Approximately 5-6 hours after the culture is initiated, or once the cells have reached an appropriate density (ca. 3-6 A600), a solution of fructose and galactose (final concentration 3%> fructose, 0.6%> galactose) is added to the medium (in fed-batch mode for a fermentor). The galactose increases the level of expression from the dual tac-gal promoter expression cassettes of the invention. The feed rate of fructose/galactose solution can be increased during the growth cycle (in a stepped or ramped fashion) as the culture becomes dense with cell growth. Preferably, fructose/galactose solution is fed through the end of the growth cycle.
The dual promoters of the invention are useful for expression of any desired polypeptide or protein at very high yields. The polypeptides may be homologous to the bacterial host cell, or preferably, are heterologous to the host cell. For example, one can express yeast, fungal, mammalian, and plant proteins at very high levels using the dual promoters. Many polypeptides produced using the claimed dual promoters will be enzymatically active. Although certain polypeptides, such as those that require glycosylation or other eukaryote-specific processing for activity, may not be produced in active form in bacterial host cells, the inactive polypeptides nevertheless find use as, for example, immunogens for induction of antibodies, molecular weight markers, and the like. Exemplary bacterial polypeptides that one can express using the dual promoters include β-lactamase, carbohydrate metabolizing enzymes, alkaline phosphatase, restriction enzymes, DNA and RNA polymerases, ligases, kinases, endo- and exonucleases, and the like. Exemplary fungal polypeptides include ligninases, proteases, glycosyltransferases, and the like. Exemplary mammalian polypeptides that one can express in bacterial host cells using the claimed dual promoters include hormones such as insulin, growth hormones (including human growth hormone and bovine growth hormone), tissue-type plasminogen activator (t-PA), renin, clotting factors such as factor VIII and factor IX, bombesin, thrombin, hemopoietic growth factor, serum albumin, receptors for hormones or growth factors, interleukins, colony stimulating factors, T-cell receptors, MHC polypeptides, viral antigens, glycosyltransferases, and the like. This list of enzymes is exemplary, not exclusive, as the dual promoters of the invention are useful for obtaining transcription of any nucleic acid expression unit that is operably linked to the dual promoters. Such expression units include not only those that encode polypeptides, but also those for which the desired product is a nucleic acid, for example, an antisense RNA.
The vectors are particularly useful for expressing enzymes that are useful in the enzymatic synthesis of carbohydrates. The use of enzymatic synthesis of carbohydrate offers advantages over chemical methods due to the virtually complete stereoselectivity and linkage specificity offered by enzymes (Ito et al, Pure Appl. Chem., 65:753 (1993); U.S. Patents 5,352,670, and 5,374,541). A number of glycosyltransferase cycles (for example, sialyltransferase cycles, galactosyltransferase cycles, and fucosyltransferase cycles) are described in U.S. Patent No. 5,374,541 and WO 9425615 A. Other glycosyltransferase cycles are described in Ichikawa et al. J. Am. Chem. Soc. 114:9283 (1992), Wong et al, J. Org. Chem. 57: 4343 (1992), DeLuca et α/., J. Am. Chem. Soc. 117:5869-5870 (1995), and Ichikawa et al. in Carbohydrates and Carbohydrate Polymers. Yaltami, ed. (ATL Press, 1993). Exemplary enzymes useful in the synthesis of carbohydrates that one can express using the claimed dual promoters also include CMP-sialic acid synthetase, UDP-glucose pyrophosphorylase, adenylate kinase, pyruvate kinase, sialic acid aldolase, UDP-GlcNAc pyrophosphorylase, myokinase, galactosyltransferases, glycosyltransferases encoded by the los locus oϊ Neisseria gonorrhoeae (see, e.g., international application WO 96/10086) and N-acetyl glucosaminyltransferases. Any of the enzymes described in these references and used in these cycles can be recombinantly expressed using the vectors of the invention. A typical example of a glycosyltransferase cycle for which the required enzymes can be produced using the claimed dual tac-gal promoters is a galactosyltransferase cycle. The reaction medium for a galactosyltransferase cycle will preferably contain, in addition to a galactosyltransferase, donor substrate, acceptor sugar and divalent metal cation, a donor substrate recycling system comprising at least 1 mole of glucose- 1 -phosphate per each mole of acceptor sugar, a phosphate donor, a kinase capable of transferring phosphate from the phosphate donor to nucleoside diphosphates, and a pyrophosphorylase capable of forming UDP-glucose from UTP and glucose- 1 -phosphate and catalytic amounts of UDP and a UDP-galactose-4-epimerase. A galactosyltransferase is the principal enzyme in this cycle. Exemplary galactosyltransferases include β(l,3) galactosyltransferase, β(l,4) galactosyltransferase (E.C. No. 2.4.1.90, see, e.g., Narimatsu et al, Proc. Nat 'I. Acad. Sci. USA 83: 4720-4724 (1986)), α(l,3) galactosyltransferase (E.C. No. 2.4.1.151, see, e.g., Dabkowski et al, Transplant Proc. 25:2921 (1993) and Yamamoto et al Nature 345:229- 233 (1990)) and α(l,4) galactosyltransferase (E.C. No. 2.4.1.38). Other enzymes used in the galactosyltransferase cycle include a kinase (for example, pyruvate kinase), an epimerase (for example, UDP-galactose-4-epimerase), and a pyrophosphorylase (for example, glucose pyrophosphorylase). DNA encoding all of these enzymes can be expressed using the vectors of the invention.
In some embodiments, the DNA encoding the polypeptide of interest may be expressed as a fusion with another polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature polypeptide. In general, the signal sequence may be a component of the vector, or it may be a part of the polypeptide DNA that is inserted into the vector. The heterologous signal sequence selected should be one that is recognized and processed (i.e., cleaved by a signal peptidase) by the host cell. For bacterial host cells that do not recognize and process the native polypeptide signal sequence, the signal sequence is substituted by a bacterial signal sequence. A signal sequence can facilitate purification of the desired polypeptide by directing secretion of the desired protein from the cell into the extracellular medium.
The polypeptides produced by prokaryote cells may not necessarily fold properly. During purification from E. coli, the expressed polypeptides may first be denatured and then renatured. This can be accomplished by solubilizing the bacterially produced proteins in a chaotropic agent such as guanidine HC1 and reducing all the cysteine residues with a reducing agent such as beta-mercaptoethanol. The polypeptides are then renatured, either by slow dialysis or by gel filtration. U.S. Patent No. 4,511,503.
Detection of expressed polypeptides is achieved by methods known in the art as radioimmunoassays, Western blotting techniques, immunoprecipitation, or activity assays. Purification from E. coli can be achieved following procedures described in U.S. Patent No. 4,511,503.
EXAMPLES
The following examples are offered to illustrate, but not to limit the present invention.
Example 1
Construction of pTGK
Construction of pPHOX2/gaIE
Plasmid pPHOX2 (Figure 1) comprises a phosphate-starvation inducible promoter of the alkaline phosphatase gene (phoA), which increases transcription of genes under its control when phosphate levels become extremely low. This plasmid contains a phoA promoter as described in WO 94/12636, as well as a rrnB ribosomal terminator (obtained from pKK223-3, Pharmacia Biotech). The galactose-inducible promoter from the UDP-galactose-4-epimerase gene (galE) of Streptococcus thermophilus (Poolman et al, J. Bacteriol. 172:4037-4047 (1990)) and the tac promoter were inserted into pPHOX2. The expression plasmid pTGK was constructed as follows. First, a fragment of the plasmid pHPl/tac (described in Poolman et al, J. Bacteriol. 172:4037-4047 (1990)) was amplified by polymerase chain reaction (PCR) using Pfu polymerase and Xbal primers at the 5' and 3' ends. The amplified fragment contained a tac promoter approximately one kb upstream of the galactose-inducible promoter from the UDPgalactose-4-epimerase gene (galE) of Streptococcus thermophilus (Poolman et al, supra.). The 5' primer (5'-
GCTCTAGACGATCCGTCCGGCGTA-3'; (SEQ ID NO:5)) was designed to hybridize to the pBR322 vector region upstream of the promoter on pHPl/tac, while the 3' primer (5'- ATTCTAGACCTCCTTTCTCAGAAAAAACAATT-3'; (SEQ ID NO:6)) was designed to hybridize to a sequenced region of the galE promoter containing the Shine-Dalgarno ribosome binding site. The optimal spacing between the galE ribosome binding site and the initiation codon of the recombinant gene was maintained (Figure 2). This amplified 1.3 kb DNA fragment, which encompassed both the tac and galE promoters, was digested with Xbal and inserted into Tjαl-digested pPHOX2 (Figure 1), which comprises a phosphate-starvation inducible promoter of the alkaline phosphatase gene (phoA, described in WO 94/12636), as well as a rrnB ribosomal terminator (obtained from pKK223-3,
Pharmacia Biotech). The orientation of the dual tac-galE promoter was checked by BamHl digestion. The resulting plasmid is called pPHOX2/galE.
Addition of a kanamycin resistance gene pPHOX2 has an ampicillin-resistance gene encoded by β-lactamase. Ampicillin is added to the culture to maintain the plasmid, but it is quickly degraded by β-lactamase, losing its effectiveness. In cells with a strong selective pressure against making the recombinant protein (e.g., as with CMP-sialic acid synthetase), overgrowth of cells without the plasmid can occur. To alleviate this problem, the plasmid was re-engineered to include a kanamycin resistance (Kanr) gene, which gives a stronger selection since the encoded protein acts at the level of the membrane transport system.
The 1.3 kb Kanr gene from plasmid pUC4K (Vieira and Messing, Gene 19:259 (1982)) was digested EcoRI and inserted into the unique Ec RI site of the pPUOXllgalE plasmid. Colonies were selected by kanamycin resistance. The resulting plasmid is called pPHOX2/galE/Kan (Figure 3).
Vector improvements for ease of cloning
Multiple restriction sites for Xbal and Hindlll in the pHOX2/galE/Kan plasmid made cloning cumbersome since the recombinant gene is inserted using these two sites. Therefore, the Xbal site at the 5' end of the galE promoter fragment and a H dIII site in the Kanr gene were removed. To delete the Xbal site, the pPHOX2/galE plasmid was partially digested with Xbal and the linearized plasmid was isolated. The cut Xbal site was filled in with Klenow polymerase to make a blunt fragment and religated. Colonies were screened by restriction mapping to identify those having a plasmid that lacked the 5' Xbal site (plasmid pPHOX2/galEΔXba).
An oligonucleotide (ATGCATAAACTTTTGCCATTCTCAC; ΔH3 (SEQ ID NO:7)) was designed to change the AAG codon of HmdIII (AAGCTT) to delete the restriction site but keep the same amino acid codon (lysine, AAA). The first PCR reaction amplified DNA from plasmid pUC4K using the ΔH3 oligonucleotide and the Ml 3 forward primer (New England Biolabs), generating a 620 bp fragment. A second PCR reaction amplified DNA from pUC4K using the Ml 3 reverse primer (New England Biolabs) and the fragment from the first PCR, generating a 1.3 kb fragment. The second fragment was further amplified by PCR with the forward and reverse primers. This fragment was digested with EcøRI (and HmdIII to cut nonrecombinants) and ligated to an isolated linear fragment of a partial EcoBI digest of plasmid pPΗOX2/galEΔXba. A NhellHinάlll digest was used to determine the correct insertion site of the EcoRl fragment. This vector, which is called pTGK (Figure 4), was deposited with the American Type Culture Collection on May 22, 1996 and has been assigned Accession No. 98059.
Modifications of the pTGK vector
A number of modifications can be made to the pTGK vector. For example one can modify the vector to facilitate cloning and expression of blunt PCR fragments. To do this, pTGK is digested with Xbal and the ends are filled in with Klenow polymerase. A CCCGGG oligonucleotide is ligated to the blunt ends, recircularizing the plasmid, after which it is digested with Xbal to remove non-recombinants. Digestion with Srfi or Smal, which make a blunt-ended cut between the CCC and GGG, results in blunt ends to which one can ligate a blunt-ended fragment obtained by PCR or other methods. By using as primers for PCR oligonucleotides that begin with the ATG of the initiation codon, the optimal spacing between the initiation codon and the ribosome binding site can be maintained (Figure 5). This vector is called pTGKS.
Example 2 Analysis of Expression from Dual tac-gal Promoter
This experiment tested the ability of galactose to induce expression of the S. thermophilus UDP-gal-4-epimerase gene (galE) in E. coli using pHPl, which contains the gene's natural promoter as well as the tac promoter. E. coli strain JM101 containing pHPl was grown overnight in LB or M9 medium which was supplemented as indicated below. All cultures were incubated overnight at 37°C with agitation and attained a cell density equivalent to an A600 of approximately 2-3. Cells were harvested upon reaching stationary phase and disrupted by French pressure cell treatment. UDP-galactose-4-epimerase activity was assayed as described in Kalckar et al, Proc. Nat'l. Acad. Sci. USA 45: 1776 (1959). A unit is defined as a μmole of substrate utilized per minute.
The results of this experiment, which are presented in Table 1, demonstrated that galactose induces expression of the epimerase gene in E. coli.
Table 1
Growth medium U/L, Epimerase
LB 850
LB, 10 mM Galactose 2000
M9, 0.5%) Fructose, 1 mM Galactose 2200
M9, 0.5% Fructose, 10 mM Galactose 3120
M9, 1%) Fructose, 10 mM Galactose 3340
M9, 1.2%) Glycerol, 10 mM Galactose 2860
To determine whether expression of genes other than the native galE gene is inducible by galactose when under the control of the dual promoter, we inserted a gene encoding GlcNAc transferase into pTGK. This vector was transformed into E. coli JMlOl, which was grown in M9 medium containing either fructose, fructose and galactose, or glucose as a carbon source. As shown in Table 2, expression levels were fairly low for GlcNAcT; the resulting high experimental error rendered the data inconclusive as to galactose inducibility.
Table 2
U/L. GlcNAc T pTGK/GlcNAcT
M9, Fructose 15
M9, Fructose + Galactose 16
M9, Glucose 12 The tac promoter has a significant effect on expression levels
To determine whether the tac promoter contributes to expression of genes under the control of the dual tac-gal promoter, we deleted the tac promoter from constructs expressing GlcNAc transferase or Gal transferase (Gotschlich, E.C, J. Exp. Med. 180: 2181- 2190 (1994)). The galE promoter was present in all constructs. Strains were grown in M9 + fructose + galactose and assayed for GlcNAc or Gal transferase activity. Results, shown in Table 3, demonstrate that the tac promoter contributes significantly to expression levels.
Table 3
U/L
A. GlcNAcT constructs
PTGK (Ptac, PgalE, Kan) 16
PGK (PBalE, Kan) 0.5
B. GalT constructs pTGK (Ptac, Pga]E, Kan) 7
PGK (PgalE, Kan) 3
Effect of galE and tac promoters on pyruvate kinase expression levels
Because the relatively low expression levels in Table 2 above prevented a statistically meaningful conclusion as to galactose inducibility of expression of a heterologous gene under the control of the dual promoter, a more highly expressed enzyme (pyruvate kinase) was chosen for the following experiments. The galE promoter was deleted from a pTGK plasmid that contained a pyruvate kinase construct, leaving the plasmid with only the tac promoter. As a control, we used the same construct but with both promoters. The ribosome binding site and spacing was identical in both constructs. The results of these experiments, shown in Table 4, demonstrate that the presence of the galE promoter region has a significant effect on expression of pyruvate kinase. Interestingly, galactose induction was observed for both constructs, including that which lacked the galE promoter. Table 4
U/L, Pvr. kinase
Figure imgf000023_0001
M9, 0.5% Fructose 1202 3167
M9, 0.5% Fructose + 10 mM Galactose 1902 4377
M9, 0.5% Glucose 1208 2625
In another experiment, the tac promoter was deleted from the pyruvate kinase construct. This construct was transformed into E. coli and compared to strains containing either both promoters or the tac promoter only. The results of this experiment, which are presented in Table 5, demonstrate that the combined contribution of the tac and galE promoters is greater than the sum of their individual activities. Addition of galactose increases expression levels.
Table 5
U/L, Pyr. kinase i-£alE i p-tac — p tac.galE
M9, 0.5% Fructose 403 1420 3181
M9, 0.5%) Fructose + 10 mM Galactose 429 1839 3635
Example 3 Expression of Recombinant Genes
The E. coli expression vector pTGK has been used to produce numerous recombinant proteins, including CMP-sialic acid synthetase from E. coli, UDP-glucose pyrophosphorylase from Bacillus subtilis, adenylate kinase from E. coli, pyruvate kinase from Bacillus stear other mophilus, sialic acid aldolase from E. coli, UDP-GlcNAc pyrophosphorylase from E. coli, rabbit muscle myokinase, Neisseria βl,4- galactosyltransferase, and Neisseria N-acetyl glucosaminyltransferase. High yields have been obtained for all of these proteins. For example, 10,000,000 U rabbit muscle myokinase were produced per kg of cells, and 3,500,000 U of pyruvate kinase per kg of cells were expressed from pTGK. Example 4 Bacterial Fermentation Protocol using pTGK
Preparation of medium
1. Weigh out the following ingredients in a 2 liter beaker:
60g Na2HPO4 Sigma S0876
30g KH2PO4 Sigma P5379
5g NaCl J.T. Baker 3628-05 50g (NH4)2SO4 J.T. Baker 0792R
2. Add 1 liter distilled water and mix to dissolve.
3. Weigh out the following ingredients in a 2 liter beaker: 120g NZAmine A Quest Intl.
50g Yeast extract Difco 2ml Mazu PPG Chemical DF 204
4. Add 1 liter distilled water and mix to dissolve.
5. Add solutions from steps 2 and 4 to fermentor (e.g., New Brunswick BioFlow IV). Make to 10 liters with distilled water.
6. Autoclave fermentor for 60 min. using steam-in-place sterilization. Sterilize ports for 15 min.
7. Weigh out fructose for 50%> solution:
400g Fructose Sigma F0127
8. Make to 800 ml with distilled water and mix. Transfer to a 1 liter bottle.
9. Weigh out galactose for 20% solution: lOOg Galactose Sigma G0625
10. Make to 500 ml with distilled water and mix. Transfer to 1 liter bottle.
11. Weigh out MgSO4 for 0.5 M solution:
6g MgS04 Sigma M7506
12. Make to 100 ml with distilled water and mix. Transfer to 200 ml bottle. 13. Weigh out CaCl2 for 1 M solution:
Hg CaCl2 J.T. Baker 1311-01 14. Make to 100 ml with distilled water and mix. Transfer to 200 ml bottle.
15. Autoclave the following for 45 min:
50%) fructose (step 8) 20%) galactose (step 10)
0.5 M MgCl2 (step 12) 1 M CaCl2 (step 14) 1 liter bottle equipped with tubing for Feed pump #1
16. Weigh out kanamycin for 25 mg/ml solution: 0.5 g Kanamycin Sigma K4000
17. Make to 20 ml with distilled water and mix. Filter sterilize through 0.2 micron sterile filter.
18. Weigh out FeSO4 immediately before inoculating culture in fermentor: 1.0g FeSO4 Sigma F7002 19. Make to 10 ml with distilled water and mix. Filter sterilize through 0.2 micron sterile filter. 20. Hook up 50% NH4OH solution to Feed pump #2.
Fermentor parameters for a New Brunswick BioFlow IV Fermentor
1. Calibrate the dissolved oxygen (D.O.) probe, if necessary. Starting D.O. should be 100%.
2. Set D.O. to proportional integral derivative (P.I.D.) with a set value of 20%>.
3. Set agitation to P.I.D. with a set value of 300 rpm. Change P.I.D. to D.O. setting and set at 800 rpm. The 800 value will revert back to 300 in a few seconds. This instructs the agitation to start at 300 rpm but will increase the rpm up to 800 in order to keep D.O. at 20%.
4. Set pH to P.I.D. with a set value of 6.8.
5. Set Feed #2 to base setting (feed pump #2 controls the NH4OH addition). 6. Set temperature to P.I.D. with a set value of 37 C.
7. Set air within range of 4.3 to 4.7 liters per min. Inoculation of fermentor
Prepare the feed bottle:
1. Add 600 ml of fructose solution and 300 ml of galactose solution to the autoclaved feed bottle. Hook up bottle to Feed #1 on fermentor.
2. Determine the absorbance at 600 nm of the grown inoculum. Dilute culture 1/10 in water in a 0.5 ml glass cuvette. Blank with water.
3. Add the following ingredients to a sterile 1 liter flask: 50 ml Fructose 100 ml MgSO4
1 ml CaCl2
20 ml Kanamycin
2.5 ml FeSO4
Add to the fermentor when cooled. 4. Add inoculum to fermentor.
Expression of Desired Protein
To obtain a protein of interest using the claimed dual promoters using a fermentor, cells are grown initially in medium containing a small amount of fructose as the carbon source. Once the cells are proliferating, usually about 5-6 hours after inoculation, a solution of galactose and fructose is fed into the medium in fed-batch mode. The feed rate can be increased during the fermentation (in a stepped or ramped fashion) as the culture becomes dense with cell growth. The carbon source feed continues through to the end of the fermentation. If desired, the polypeptide of interest is then purified from the medium (in the case of a secreted protein) or from the harvested cells.
The above examples are provided to illustrate the invention but not to limit its scope. Other variants of the invention will be readily apparent to one of ordinary skill in the art and are encompassed by the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference for all purposes. SEQUENCE LISTING
(1) GENERAL INFORMATION:
(l) APPLICANT:
(A) NAME: Cytel Corporation
(B) STREET: 3525 John Hopkins Court
(C) CITY: San Diego
(D) STATE: California
(E) COUNTRY: USA
(F) POSTAL CODE (ZIP) : 92121
(G) TELEPHONE: (619) 552-2794 (H) TELEFAX: (619) 552-3049
( I ) TELEX :
(n) TITLE OF INVENTION: Improved Expression Vectors (in) NUMBER OF SEQUENCES: 7
(lv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Townsend and Townsend and Crew LLP
(B) STREET: Two Embarcadero Center, Eighth Floor
(C) CITY: San Francisco
(D) STATE: California
(E) COUNTRY: USA
(F) ZIP: 94111-3834
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy d sk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US Not yet assigned
(B) FILING DATE: Not yet assigned
(C) CLASSIFICATION:
(vn) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 60/029,545
(B) FILING DATE: 08-NOV-1996
(vm) ATTORNEY/AGENT INFORMATION:
(A) NAME: Smith, Timothy L.
(B) REGISTRATION NUMBER: 35,367
(C) REFERENCE/DOCKET NUMBER: 014137-009610PC
(IX) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 576-0200
(B) TELEFAX: (415) 576-0300
(2) INFORMATION FOR SEQ ID NO:l:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1561 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: DNA
(ix) FEATURE:
(A) NAME/KEY: promoter
(B) LOCATION: 362..389
(D) OTHER INFORMATION: /note= "tac promoter" ( ix ) FEATURE :
(A) NAME/KEY: promoter
(B) LOCATION: 1438..1467
(D) OTHER INFORMATION: /note= "galE promoter"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
GTAAAGAAGT TATGGAGCNT CTTNGTCAGT AAAAAGTTAT TTTTTTCAAC AGCGTTCATA 60
AAGTGTCACG GCCGGAGAAT TATAGTCGCT TGGTTTTTAT TTTTTAAGTA TTGGTAACTA 120
GTACGCAAGT TCACGTAAAA AGGGTAACTA GATAGACGAN GGTCCGGNGT AGAGGATCCG 180
GGCTTATCGA CTGCACGGTG CACCAATGCT TCTGGGTCAG GCAGCCATCG GAAGCTGTGG 240
TATGGCTGTG CAGGTCGTAA ATCACTGCAT AATTCGTGTC GCTCAAGGCG CACTCCCGTT 300
CTGGATAATG TTTTTTGCGC CGACATCATA ACGGTTCTGG CAAATATTCT GAAATGAGCT 360
GTTGACAATT AATCATCGGC TCGTATAATG TGTGGAATTG TGAGCGGATA ACAATTTCAC 420
ACAGGAAACA GAATTCCCGG GGATCCGTCG ACCTGCAGCT AAAAATGCGG TAGCTTCTGA 480
TTATCCAAAA TGCCAACTTT GTATGGAAAA TGAAGGTTAT TTGGGTCGCA TTAATCACCC 540
AGCCCGCAGC AATCACCGTG TTGTTCGTTT CCAAATGGAA GACAAGGAGT GGGGCTTCCA 600
ATACTCGCCT TATGCCTACT TTAACGAACA TTCTATCTTC TTTTATGGTA AGCACGAACC 660
AATGCACATC AGTCCATTGA CGTTTGGCCG TCTCCTAACA ATTGTTGAAG CATTCCCCTG 720
GTTACTTCGC AGGTTCAAAT GCCGATCTTC CAATTGTAGG TGGTTCAATT CTTACACATG 780
AACACTATCA AGGTGGTCGC CATACCTTCC CAATGGAAGT AGCAGGCATT AAAGAAAAAG 840
TTAGCTTTGA TGGTTACTCT GATGTTGAGG CTGGCATCGT TAATTGGCCT ATGTCTGTTC 900
TTCGTCTAAG AAGTGAAGAC AAGGGAAGAC TTATCGCTCT TGCAACTAAA ATCCTAAATT 960
GCTGGCGTGG TTATTCAGAC GAAAAAGCTG GGGTCTTGGC TGAGTCTGAT GGACAACCTC 1020
ACCACACCAT TACTCCAATT GCTCGTAGAA AAGACGGCAA ATTTGAATTG GATTTGGTTC 1080
TTCGTGACAA TCAAACTTCT GAAGAATATC CAGACGGTAT CTATCACCCA CATAAAGATG 1140
TTCAACATAT TAAGAAAGAA AATATTGGTT TGATTGAAGT TATGGGATTG GCCATTCTTC 1200
CACCTCGTTT GAAAACAGAA CTTAAAGATG TTGAAGATTA TCTATTAGGT CAAGGTAACC 1260
AAGTTGCTCC AATTCACCAA GAATGGGCAG ATGAACTCAA AGCTCAAATC CGAATATTAC 1320
GGCTGAGGAA GTGACAGAAG TTGTTCGACA ATCTGTTGCA GATATCTTTG CTCGTGTACT 1380
AGAAGATGCA GGTGTTTATA AGACTAATAG TGAAGGCTTG GATCAGTTTA AAGCATTTGT 1440
AGATTTTGTA AATTTAGCTG ATTAATTGTT TTTTCTGAAG AAAGGAGGTC TAGAGTCGAC 1500
CTGCAGGCAT GCAAGCTTCT GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC TGATACAGAT 1560
T 1561
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ll) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: GAAAAGAAGT CTAGANNNAT GNNNNNN 27
(2) INFORMATION FOR SEQ ID NO: 3:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ll) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: GAAAAGAAGT CTAGCCCGGG CTAGA 25
(2) INFORMATION FOR SEQ ID NO: 4:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: GAAAAGAAGT CTAGCCCATG NNNNNNN 27
(2) INFORMATION FOR SEQ ID NO: 5:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: GCTCTAGACG ATCCGTCCGG CGTA 24
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: ATTCTAGACC TCCTTTCTCA GAAAAAACAA TT 32
(2) INFORMATION FOR SEQ ID NO : 7 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: ATGCATAAAC TTTTGCCATT CTCAC 25

Claims

WHAT IS CLAIMED IS:
1. A recombinant nucleic acid construct comprising a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide, wherein the dual bacterial promoter comprises a tac promoter component and a gal promoter component.
2. The recombinant nucleic acid of claim 1 , wherein the tac promoter component is a trc promoter.
3. The recombinant nucleic acid of claim 1 , wherein the gal promoter component is a bacterial UDPgalactose-4-epimerase promoter.
4. The recombinant nucleic acid construct of claim 3, wherein the gal promoter component is from Streptococcus thermophilus.
5. The recombinant nucleic acid construct of claim 1, wherein the dual promoter results in a higher level of expression of the desired polypeptide than either the tac promoter component or the gal promoter component individually.
6. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes a bacterial polypeptide.
7. The recombinant nucleic acid construct of claim 6, wherein the heterologous nucleic acid is obtained from a los locus of Neisseria gonorrhoeae.
8. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes a mammalian polypeptide.
9. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes a fungal polypeptide.
10. The recombinant nucleic acid construct of claim 1, wherein the heterologous nucleic acid encodes a plant polypeptide.
11. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes CMP-sialic acid synthetase.
12. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes a UDP-glucose pyrophosphorylase.
13. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes adenylate kinase.
14. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes pyruvate kinase.
15. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes sialic acid aldolase.
16. The recombinant nucleic acid construct of claim 27, wherein the heterologous nucleic acid encodes UDP-GlcNAc pyrophosphorylase.
17. The recombinant nucleic acid construct of claim 1 , wherein the heterologous nucleic acid encodes rabbit muscle myokinase.
18. An expression vector which comprises a selectable marker and a recombinant nucleic acid construct comprising a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide, wherein the dual bacterial promoter comprises a tac promoter component and a gal promoter component.
19. The expression vector of claim 18, wherein the selectable marker is a kanamycin resistance gene.
20. The expression vector of claim 18, which further comprises an origin of replication sequence which functions in E. coli.
21. A plasmid which is substantially identical to a plasmid deposited with the American Type Culture Collection under Accession No. 98059.
22. The plasmid of claim 21 , wherein the plasmid is identical to a plasmid deposited with the American Type Culture Collection under Accession No. 98059.
23. A recombinant nucleic acid construct comprising a Streptococcus thermophilus UDPglucose-4-epimerase promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide.
24. A bacterial cell comprising a recombinant expression cassette comprising a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide, wherein the dual bacterial promoter comprises a tac promoter component and a gal promoter component.
25. The bacterial cell of claim 24, wherein the gal promoter component is from Streptococcus thermophilus.
26. The bacterial cell of claim 24, which is E. coli.
27. The bacterial cell of claim 24, wherein the recombinant expression cassette is located on an independently replicating plasmid.
28. The bacterial cell of claim 27, wherein the plasmid further comprises a kanamycin resistance gene.
29. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes CMP-sialic acid synthetase.
30. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes UDP-glucose pyrophosphorylase.
31. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes adenylate kinase.
32. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes pyruvate kinase.
33. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes sialic acid aldolase.
34. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes UDP-GlcNAc pyrophosphorylase.
35. The bacterial cell of claim 24, wherein the heterologous nucleic acid encodes rabbit muscle myokinase.
36. The bacterial cell of claim 24, wherein the heterologous nucleic acid is obtained from a los locus of Neisseria gonorrhoeae.
37. A method of making a desired polypeptide, the method comprising culturing in an appropriate medium bacterial cells comprising a recombinant expression cassette comprising a dual bacterial promoter operably linked to a heterologous nucleic acid that encodes a desired polypeptide under conditions that allow expression of the desired polypeptide, wherein the dual bacterial promoter comprises a tac promoter component and a gal promoter component.
38. The method of claim 37, wherein the gal promoter component is from Streptococcus thermophilus.
39. The method of claim 37, wherein the bacterial cells are E. coli.
40. The method of claim 37, wherein the bacterial cells are cultured in a medium comprising kanamycin.
41. The method of claim 37, wherein expression of the desired polypeptide is induced by the presence of galactose in the medium.
42. The method of claim 37, wherein the heterologous nucleic acid encodes UDP-glucose pyrophosphorylase.
43. The method of claim 37, wherein the heterologous nucleic acid encodes adenylate kinase.
44. The method of claim 37, wherein the heterologous nucleic acid encodes pyruvate kinase.
45. The method of claim 37, wherein the heterologous nucleic acid encodes sialic acid aldolase.
46. The method of claim 37, wherein the heterologous nucleic acid encodes UDP-GlcNAc pyrophosphorylase.
47. The method of claim 37, wherein the heterologous nucleic acid encodes rabbit muscle myokinase.
48. The method of claim 37, wherein the heterologous nucleic acid encodes CMP-sialic acid synthetase.
49. The method of claim 37, wherein the heterologous nucleic acid is obtained from a los locus of Neisseria gonorrhoeae.
PCT/US1997/020528 1996-11-08 1997-11-07 Improved expression vectors WO1998020111A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
HU0001650A HUP0001650A3 (en) 1996-11-08 1997-11-07 Improved expression vectors
AU52524/98A AU718382B2 (en) 1996-11-08 1997-11-07 Improved expression vectors
NZ335628A NZ335628A (en) 1996-11-08 1997-11-07 Improved expression vectors comprising a nucleic acid construct with tac and gal bacterial promoters
IL12984397A IL129843A0 (en) 1996-11-08 1997-11-07 Improved expression vectors
JP52185798A JP2001503274A (en) 1996-11-08 1997-11-07 Improved expression vector
CA002271230A CA2271230A1 (en) 1996-11-08 1997-11-07 Improved expression vectors
DE69738514T DE69738514D1 (en) 1996-11-08 1997-11-07 IMPROVED EXPRESSION VECTORS
EP97947444A EP0946711B1 (en) 1996-11-08 1997-11-07 Improved expression vectors

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US2954596P 1996-11-08 1996-11-08
US60/029,545 1996-11-08

Publications (1)

Publication Number Publication Date
WO1998020111A1 true WO1998020111A1 (en) 1998-05-14

Family

ID=21849585

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1997/020528 WO1998020111A1 (en) 1996-11-08 1997-11-07 Improved expression vectors

Country Status (13)

Country Link
US (1) US6117651A (en)
EP (1) EP0946711B1 (en)
JP (1) JP2001503274A (en)
KR (1) KR20000068934A (en)
CN (1) CN1244214A (en)
AT (1) ATE386126T1 (en)
AU (1) AU718382B2 (en)
CA (1) CA2271230A1 (en)
DE (1) DE69738514D1 (en)
HU (1) HUP0001650A3 (en)
IL (1) IL129843A0 (en)
NZ (1) NZ335628A (en)
WO (1) WO1998020111A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004009793A2 (en) 2002-07-23 2004-01-29 Neose Technologies, Inc. Synthesis of glycoproteins using bacterial gycosyltransferases
EP1539794A2 (en) * 2002-07-03 2005-06-15 Dow Global Technologies Inc. Benzoate- and anthranilate-inducible promoters
WO2007120932A2 (en) 2006-04-19 2007-10-25 Neose Technologies, Inc. Expression of o-glycosylated therapeutic proteins in prokaryotic microorganisms
EP2484759A2 (en) 2004-02-04 2012-08-08 BioGeneriX AG Methods of refolding mammalian glycosyltransferases

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002064804A2 (en) * 2001-02-13 2002-08-22 University Of Florida A bi-directional dual promoter complex with enhanced promoter activity for transgene expression in eukaryotes
US7172865B2 (en) * 2001-08-13 2007-02-06 Saint Louis University Rapid and sensitive assay for the detection and quantification of coregulators of nucleic acid binding factors
US7214660B2 (en) 2001-10-10 2007-05-08 Neose Technologies, Inc. Erythropoietin: remodeling and glycoconjugation of erythropoietin
US8008252B2 (en) 2001-10-10 2011-08-30 Novo Nordisk A/S Factor VII: remodeling and glycoconjugation of Factor VII
US7173003B2 (en) 2001-10-10 2007-02-06 Neose Technologies, Inc. Granulocyte colony stimulating factor: remodeling and glycoconjugation of G-CSF
AU2004236174B2 (en) 2001-10-10 2011-06-02 Novo Nordisk A/S Glycopegylation methods and proteins/peptides produced by the methods
US7157277B2 (en) 2001-11-28 2007-01-02 Neose Technologies, Inc. Factor VIII remodeling and glycoconjugation of Factor VIII
CN1314810C (en) * 2002-01-31 2007-05-09 圣路易大学 Rapid and sensitive assay for the detection and quantification of coregulators of nucleic acid binding factors
MXPA04012496A (en) 2002-06-21 2005-09-12 Novo Nordisk Healthcare Ag Pegylated factor vii glycoforms.
US7803777B2 (en) 2003-03-14 2010-09-28 Biogenerix Ag Branched water-soluble polymers and their conjugates
US8791070B2 (en) 2003-04-09 2014-07-29 Novo Nordisk A/S Glycopegylated factor IX
CA2524936A1 (en) 2003-05-09 2004-12-02 Neose Technologies, Inc. Compositions and methods for the preparation of human growth hormone glycosylation mutants
US9005625B2 (en) 2003-07-25 2015-04-14 Novo Nordisk A/S Antibody toxin conjugates
CA2536026A1 (en) * 2003-08-22 2005-05-06 Nucleonics Inc. Eukariotic expression systems for expression of inhibitory rna in multiple intracellular compartments
US8633157B2 (en) 2003-11-24 2014-01-21 Novo Nordisk A/S Glycopegylated erythropoietin
US20080305992A1 (en) 2003-11-24 2008-12-11 Neose Technologies, Inc. Glycopegylated erythropoietin
WO2005056760A2 (en) * 2003-12-03 2005-06-23 Neose Technologies, Inc. Glycopegylated follicle stimulating hormone
US20060040856A1 (en) 2003-12-03 2006-02-23 Neose Technologies, Inc. Glycopegylated factor IX
US7956032B2 (en) 2003-12-03 2011-06-07 Novo Nordisk A/S Glycopegylated granulocyte colony stimulating factor
ES2560657T3 (en) 2004-01-08 2016-02-22 Ratiopharm Gmbh O-linked glycosylation of G-CSF peptides
WO2005067601A2 (en) * 2004-01-09 2005-07-28 Neose Technologies, Inc. Vectors for recombinant protein expression in e.coli
US20080300173A1 (en) 2004-07-13 2008-12-04 Defrees Shawn Branched Peg Remodeling and Glycosylation of Glucagon-Like Peptides-1 [Glp-1]
JP4804467B2 (en) * 2004-08-23 2011-11-02 アルナイラム ファーマシューティカルズ, インコーポレイテッド Multiple RNA polymerase III promoter expression construct
EP1799249A2 (en) 2004-09-10 2007-06-27 Neose Technologies, Inc. Glycopegylated interferon alpha
DK2586456T3 (en) 2004-10-29 2016-03-21 Ratiopharm Gmbh Conversion and glycopegylation of fibroblast growth factor (FGF)
EP1858543B1 (en) 2005-01-10 2013-11-27 BioGeneriX AG Glycopegylated granulocyte colony stimulating factor
EP2386571B1 (en) 2005-04-08 2016-06-01 ratiopharm GmbH Compositions and methods for the preparation of protease resistant human growth hormone glycosylation mutants
EP1888098A2 (en) 2005-05-25 2008-02-20 Neose Technologies, Inc. Glycopegylated erythropoietin formulations
US20080255026A1 (en) * 2005-05-25 2008-10-16 Glycopegylated Factor 1X Glycopegylated Factor Ix
US20070105755A1 (en) 2005-10-26 2007-05-10 Neose Technologies, Inc. One pot desialylation and glycopegylation of therapeutic peptides
US20090048440A1 (en) 2005-11-03 2009-02-19 Neose Technologies, Inc. Nucleotide Sugar Purification Using Membranes
EP2049144B8 (en) 2006-07-21 2015-02-18 ratiopharm GmbH Glycosylation of peptides via o-linked glycosylation sequences
US20100075375A1 (en) 2006-10-03 2010-03-25 Novo Nordisk A/S Methods for the purification of polypeptide conjugates
CN101796063B (en) 2007-04-03 2017-03-22 拉蒂奥法姆有限责任公司 methods of treatment using glycopegylated G-CSF
UA100692C2 (en) 2007-05-02 2013-01-25 Мериал Лимитед Dna-plasmids having increased expression and stability
ES2551123T3 (en) 2007-06-12 2015-11-16 Ratiopharm Gmbh Improved process for the production of nucleotide sugars
US8207112B2 (en) 2007-08-29 2012-06-26 Biogenerix Ag Liquid formulation of G-CSF conjugate
MX2010009154A (en) 2008-02-27 2010-09-09 Novo Nordisk As Conjugated factor viii molecules.
US9616114B1 (en) 2014-09-18 2017-04-11 David Gordon Bermudes Modified bacteria having improved pharmacokinetics and tumor colonization enhancing antitumor activity
US11180535B1 (en) 2016-12-07 2021-11-23 David Gordon Bermudes Saccharide binding, tumor penetration, and cytotoxic antitumor chimeric peptides from therapeutic bacteria
US11129906B1 (en) 2016-12-07 2021-09-28 David Gordon Bermudes Chimeric protein toxins for expression by therapeutic bacteria
EP3938518A4 (en) 2019-03-12 2023-01-11 Terra Bioworks, Inc. Expression vector
WO2023205693A2 (en) * 2022-04-19 2023-10-26 Worcester Polytechnic Institute Inducible promoters

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4565785A (en) * 1978-06-08 1986-01-21 The President And Fellows Of Harvard College Recombinant DNA molecule
US4673641A (en) * 1982-12-16 1987-06-16 Molecular Genetics Research And Development Limited Partnership Co-aggregate purification of proteins
US4710473A (en) * 1983-08-10 1987-12-01 Amgen, Inc. DNA plasmids
US4738921A (en) * 1984-09-27 1988-04-19 Eli Lilly And Company Derivative of the tryptophan operon for expression of fused gene products
US4795706A (en) * 1985-01-31 1989-01-03 Eli Lilly And Company Novel expression control sequences
US5304472A (en) * 1992-11-20 1994-04-19 Genentech, Inc. Method of controlling polypeptide production in bacterial cells
US5342763A (en) * 1992-11-23 1994-08-30 Genentech, Inc. Method for producing polypeptide via bacterial fermentation
US5545553A (en) * 1994-09-26 1996-08-13 The Rockefeller University Glycosyltransferases for biosynthesis of oligosaccharides, and genes encoding them

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GENE, 1995, Volume 154, FUSI et al., "Expression of a Synthetic Gene Encoding P2 Ribonuclease from the Extreme Thermoacidophilic Archaebacterium Sulfolobus Solfataricus in Mesophylic Hosts", pages 99-103. *
JOURNAL OF BACTERIOLOGY, July 1990, Vol. 172, No. 7, POOLMAN et al., "Carbohydrate Utilization in Streptococcus Thermophilus: Characterization of the Genes for Aldose 1-Epimerase (Mutarotase) and UDPglucose 4-Epimerase", pages 4037-4047. *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1539794A2 (en) * 2002-07-03 2005-06-15 Dow Global Technologies Inc. Benzoate- and anthranilate-inducible promoters
EP1539794B1 (en) * 2002-07-03 2008-12-31 Dow Global Technologies Inc. Benzoate- and anthranilate-inducible promoters
US7794972B2 (en) 2002-07-03 2010-09-14 Pfenex Inc. Benzoate-and anthranilate-inducible promoters
WO2004009793A2 (en) 2002-07-23 2004-01-29 Neose Technologies, Inc. Synthesis of glycoproteins using bacterial gycosyltransferases
EP2484759A2 (en) 2004-02-04 2012-08-08 BioGeneriX AG Methods of refolding mammalian glycosyltransferases
WO2007120932A2 (en) 2006-04-19 2007-10-25 Neose Technologies, Inc. Expression of o-glycosylated therapeutic proteins in prokaryotic microorganisms

Also Published As

Publication number Publication date
IL129843A0 (en) 2000-02-29
NZ335628A (en) 2000-10-27
JP2001503274A (en) 2001-03-13
HUP0001650A3 (en) 2001-09-28
KR20000068934A (en) 2000-11-25
US6117651A (en) 2000-09-12
AU5252498A (en) 1998-05-29
AU718382B2 (en) 2000-04-13
ATE386126T1 (en) 2008-03-15
CN1244214A (en) 2000-02-09
EP0946711A4 (en) 2004-10-27
HUP0001650A2 (en) 2000-09-28
EP0946711A1 (en) 1999-10-06
DE69738514D1 (en) 2008-03-27
CA2271230A1 (en) 1998-05-14
EP0946711B1 (en) 2008-02-13

Similar Documents

Publication Publication Date Title
AU718382B2 (en) Improved expression vectors
Winans Transcriptional induction of an Agrobacterium regulatory gene at tandem promoters by plant-released phenolic compounds, phosphate starvation, and acidic growth media
US5922583A (en) Methods for production of recombinant plasmids
JP2000050888A (en) New escherichia coli/host vector system selected by complementation of auxotrophy, but not selected by utilization of antibiotics
WO2007022623A1 (en) Regulation of heterologous recombinant protein expression in methylotrophic and methanotrophic bacteria
Bonekamp et al. Mechanism of UTP‐modulated attenuation at the pyrE gene of Escherichia coli: an example of operon polarity control through the coupling of translation to transcription.
US5726039A (en) Vectors and transformed host cells for recombinant protein production at reduced temperatures
WO1992018631A1 (en) Methods and nucleic acid sequences for the expression of the cellulose synthase operon
CA2622710C (en) Hybrid portable origin of replication plasmids
WO1999027117A1 (en) Cold-inducible expression vector
US6030807A (en) Highly regulable promoter for heterologous gene expression
Bae et al. The Rhizobium meliloti trpE (G) gene is regulated by attenuation, and its product, anthranilate synthase, is regulated by feedback inhibition
JP3696247B2 (en) Process for producing recombinant proteins, plasmids and modified cells
TWI510623B (en) Regulation of inducible promoters
EP0228726B1 (en) Method for preparing proteins using transformed lactic acid bacteria
US5654169A (en) Vectors and transformed host cells for recombinant protein production at reduced temperatures
JP3620831B2 (en) Lactic acid bacteria shuttle vector
US7807460B2 (en) Expression vector system regulated by σ32 and methods for using it to produce recombinant protein
KR100211002B1 (en) Process for the preparation of glutarylacylase in large quantities
CA2015046C (en) Recombinant dna and expression vector
Blanco et al. Construction of hybrid plasmids containing the Escherichia coli uxaB gene: analysis of its regulation and direction of transcription
US5830720A (en) Recombinant DNA and expression vector for the repressible and inducible expression of foreign genes
US6844169B1 (en) Constructs for controlled expression of recombinant proteins in prokaryotic cells
MXPA99004331A (en) Improved expression vectors
JP3058186B2 (en) Novel cloning and / or expression vectors, their production method and their use

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 97180609.8

Country of ref document: CN

AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH KE LS MW SD SZ UG ZW AT BE CH DE DK ES FI FR GB GR IE IT LU MC

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 335628

Country of ref document: NZ

ENP Entry into the national phase

Ref document number: 2271230

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 1019997004107

Country of ref document: KR

ENP Entry into the national phase

Ref document number: 1998 521857

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: PA/a/1999/004331

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 1997947444

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 52524/98

Country of ref document: AU

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 1997947444

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 52524/98

Country of ref document: AU

WWP Wipo information: published in national office

Ref document number: 1019997004107

Country of ref document: KR

WWG Wipo information: grant in national office

Ref document number: 1019997004107

Country of ref document: KR

WWG Wipo information: grant in national office

Ref document number: 1997947444

Country of ref document: EP