WO2004104165A2 - Ura5 gene and methods for stable genetic integration in yeast - Google Patents
Ura5 gene and methods for stable genetic integration in yeast Download PDFInfo
- Publication number
- WO2004104165A2 WO2004104165A2 PCT/US2004/013488 US2004013488W WO2004104165A2 WO 2004104165 A2 WO2004104165 A2 WO 2004104165A2 US 2004013488 W US2004013488 W US 2004013488W WO 2004104165 A2 WO2004104165 A2 WO 2004104165A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- acid sequence
- seq
- polypeptide
- sequence
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H21/00—Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
- C07H21/04—Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with deoxyribosyl as saccharide radical
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/905—Stable introduction of foreign DNA into chromosome using homologous recombination in yeast
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1077—Pentosyltransferases (2.4.2)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- This invention relates to novel genes isolated in yeast.
- the invention also relates to plasmids, which are particularly useful for stable genetic integration into the yeast genome.
- the present invention also relates to novel yeast strains in the expression of heterologous proteins and to methods of generating the novel strains.
- Yeast strains such as Pichia pastoris, are commonly used for the - production of heterologous proteins.
- P. pastoris has become a popular model system for the study of peroxisome biogenesis (Gould et al, Yeast 8:613-628 (1992)), autophagy (Tuttle and Dunn, J. Cell Sci. 108:25-35 (1995); Sakai et al., J. Cell Biol. 141:625-636 (1998)) and the organization and biogenesis of the organelles of the secretory pathway (Rossanese et al., J. Cell Biol. 145 : 69-81
- Toxicity of the T-u ⁇ fl3 gene appears to be a host specific problem, however, as the gene may be conditionally lethal with certain gene disruptions that are otherwise not lethal.
- the gene is also toxic in the absence of the counterselecting agent methomyl, therefore, the counterselection step must be performed immediately.
- a separate gene is required for the initial positive selection step, and the agent used for the counterselection step, methomyl, is light sensitive and breaks down rapidly in aqueous solutions. The system is therefore more complicated than the URA3 system described above, in which the same gene is responsible both for the initial selection of Ura + protofrophs and for the subsequent counterselection of Ura " auxotrophs.
- the S. cerevisiae URA5 gene was cloned by complementation of a non- reverting E. colipyrE mutant that was blocked in orotate-phosphoribosyl transferase activity. Montigny et al, Mol. Gen. Genet. 215:455-462 (1989). Yeast cells lacking this gene displayed a leaky phenotype, however, indicating that, in S. cerevisiae, another protein possesses orotate-phosphoribosyl transferase activity. See Jund and Lacroute, J. Bacteriol 109:196-202 (1972). The URA5 gene has also been identified in Kluyveromyces lactis.
- auxotrophic strains of P. pastoris suffer the further disadvantage that the respective auxotrophic marker genes have the potential to revert.
- a high reversion rate decreases the usefulness of auxotrophic strains, because revertant colonies are misidentified as false-positive fransformants.
- the present invention provides isolated polynucleotides comprising or consisting of nucleic acid sequences selected from the group consisting of the coding sequences of the P. pastoris URA5 gene, a fragment of the P. pastoris SEC65 gene and a fragment of the P. pastoris SCS7 gene; nucleic acid sequences that are degenerate variants of these sequences; and related nucleic acid sequences and fragments.
- the invention also provides vectors and host cells comprising these isolated polynucleotides.
- the invention further provides isolated polypeptides comprising or consisting of polypeptide sequences selected from the group consisting of sequences encoded by the P. pastoris URA5 gene, by a fragment of the P.
- the invention also provides host cells comprising a disruption, deletion or mutation of a nucleic acid sequence selected from the group consisting of the coding sequence of the P. pastoris URA5 gene, a nucleic acid sequence that is a degenerate variant of the coding sequence of the P. pastoris URA5 gene and related nucleic acid sequences and fragments, in which the host cells have a reduced activity of the polypeptide encoded by the nucleic acid sequence compared to a host cell without the disruption, deletion or mutation.
- the invention further provides methods for the genetic integration of a heterologous nucleic acid sequence in a host cell. These methods comprise the step of disrupting a host gene encoding orotate-phosphoribosyl transferase by introduction of a disrupted, deleted or mutated nucleic acid sequence derived from a sequence selected from the group consisting of the coding sequence of the P. pastoris URA5 gene, a nucleic acid sequence that is a degenerate variant of the coding sequence of the P. pastoris URA5 gene and related nucleic acid sequences and fragments.
- the invention provides methods for the genetic integration of a heterologous nucleic acid sequence in a host cell lacking orotate-phosphoribosyl transferase activity. These methods comorise the sten of intrndur.incr a spi ⁇ pinrA nf interest into the host cell in linkage with a sequence encoding orotate- phosphoribosyl transferase activity selected from the group consisting of the coding sequence of the P. pastoris URA5 gene, a nucleic acid sequence that is a degenerate variant of the coding sequence of the P. pastoris URA5 gene and related nucleic acid sequences and fragments.
- Fig. 1 shows a 1947 bp £/7 45-containing genomic fragment (Sau3A-Sspl) of P. pastoris (SEQ ID NO:l), including the URA5 coding sequence (SEQ ID NO:2) and its encoded polypeptide (SEQ ID NO:3), the sequence complementary to the 3' fragment of the SEC65 coding sequence (SEQ ED NO:4) and its encoded polypeptide (SEQ ID NO:5), and the 3' fragment of the SCS7 coding sequence (SEQ ID NO:6) and its encoded polypeptide (SEQ ID NO:7).
- Fig. 2 shows an alignment of sequences used to design degenerate primers.
- the C/i 45-related sequences are URA5 from S. cerevisiae (SEQ ID NO:2)
- URA10 from S. cerevisiae SEQ ID NO:9
- URA5 froml lactis SEQ ID NO: 10
- Y. lipolytica SEQ ID NO: 11
- S. pombe SEQ ID NO: 12
- T. reesei SEQ ID NO:13
- E. coli SEQ ID NO:14
- P. aeruginosa SEQ ID NO:15
- H. influenzae SEQ ID NO: 16
- the URA5 sequence from P. pastoris (residues 27 - 80 ofSEQ ID O:3) is shown for comparison.
- the SEC65-related sequences are from S. cerevisiae (S ⁇ Q ID NO: 17), K.
- Fig. 3 depicts some of the degenerate ohgonucleotides used in cloning of the P.pastoris URA5 gene.
- Fig. 4 shows restriction maps of plasmid pJN266 (including a recyclable URA3 cassette, which may be used to disrupt aKEXl locus); plasmid pJN315 (including the P.
- Fig. 5 shows restriction maps of plasmid pJN395 (including a P. pastoris URA5 disruption cassette marked with a kanamycin-resistance gene); plasmid pJN396 (including the P. pastoris URA5 gene flanked by lacZ direct repeats); plasmid pJN398 (including a recyclable URA5 cassette, which may be used to knock out an OCH7 locus); and plasmid pJN407 (including a P. pastoris URA5- K. lactis UDP-GlcNAc Transporter cassette, which may be used for stable integration into an OCH1 locus).
- Fig. 6 shows the use of a P. pastoris URA5- K. lactis UDP-GlcNAc Transporter cassette in the stable integration of the UDP-GlcNAc Transporter into the OCH7 locus.
- polynucleotide or “nucleic acid molecule” refers to a polymeric form of nucleotides of at least 10 bases in length.
- the term includes DNA molecules (e.g., cDNA or genomic or synthetic DNA) and RNA molecules (e.g. , mRNA or synthetic RNA), as well as analogs of DNA or RNA containing non-natural nucleotide analogs, non-native internucleoside bonds, or both.
- the nucleic acid can be in any topological conformation.
- nucleic acid can be single-stranded, double-stranded, triple-stranded, quadruplexed, partially double-stranded, branched, hairpinned, circular, or in a padlocked conformation.
- a "nucleic acid comprising SEQ ID NO:X” refers to a nucleic acid, at least a portion of which has either (i) the sequence of SEQ ID NO:X, or (ii) a sequence complementary to SEQ ID NO:X. The choice between the two is dictated by the context. For instance, if the nucleic acid is used as a probe, the choice between the two is dictated by the requirement that the probe be complementary to the desired target.
- An "isolated” or “substantially pure” nucleic acid or polynucleotide is one which is substantially separated from other cellular components that naturally accompany the native polynucleotide in its natural host cell, e.g., ribosomes, polymerases and genomic sequences with which it is naturally associated.
- the term embraces a nucleic acid or polynucleotide that (1) has been removed from its naturally occurring environment, (2) is not associated with all or a portion of a polynucleotide in which the "isolated polynucleotide” is found in nature, (3) is operatively linked to a polynucleotide which it is not linked to in nature, or (4) does not occur in nature.
- isolated or substantially pure also can be used in reference to recombinant or cloned DNA isolates, chemically synthesized polynucleotide analogs, or polynucleotide analogs that are biologically synthesized by heterologous systems.
- isolated does not necessarily require that the nucleic acid or polynucleotide so described has itself been physically removed from its native environment.
- an endogenous nucleic acid sequence in the genome of an organism is deemed “isolated” herein if a heterologous sequence is placed adjacent to the endogenous nucleic acid sequence, such that the expression of this endogenous nucleic acid sequence is altered.
- a heterologous sequence is a sequence that is not naturally adjacent to the endogenous nucleic acid sequence, whether or not the heterologous sequence is itself endogenous (originating from the same host cell or progeny thereof) or exogenous (originating from a different host cell or progeny thereof).
- a promoter sequence can be substituted (e.g., by homologous recombination) for the native promoter of a gene in the genome of a host cell, such that this gene has an altered expression pattern.
- This gene would now become “isolated” because it is separated from at least some of the sequences that naturally flank it.
- a nucleic acid is also considered “isolated” if it contains any modifications that do not naturally occur to the corresponding nucleic acid in a genome.
- an endogenous coding sequence is considered “isolated” if it contains an insertion, deletion or a point mutation introduced artificially, e.g., by human intervention.
- an “isolated nucleic acid” also includes a nucleic acid integrated into a host cell chromosome at a heterologous site and a nucleic acid construct present as an episome. Moreover, an “isolated nucleic acid” can be substantially free of other cellular material, or substantially free of culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. [0029] As used herein, the phrase "degenerate variant" of a reference nucleic acid sequence encompasses nucleic acid sequences that can be translated, according to the standard genetic code, to provide an amino acid sequence identical to that translated from the reference nucleic acid sequence.
- degenerate oligonucleotide or “degenerate primer” is used to signify an oligonucleotide capable of hybridizing with target nucleic acid sequences that are not necessarily identical in sequence but that are homologous to one another within one or more particular segments.
- target nucleic acid sequences that are not necessarily identical in sequence but that are homologous to one another within one or more particular segments.
- percent sequence identity or “identical” in the context of nucleic acid sequences refers to the residues in the two sequences which are the same when aligned for maximum correspondence.
- the length of sequence identity comparison maybe over a stretch of at least about nine nucleotides, usually at least about 20 nucleotides, more usually at least about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about 32 nucleotides, and preferably at least about 36 or more nucleotides.
- FASTA FASTA
- Gap or Bestfit programs in Wisconsin Package Version 10.0, Genetics Computer Group (GCG), Madison, Wisconsin.
- GCG Genetics Computer Group
- percent sequence identity between nucleic acid sequences can be determined using FASTA with its default parameters (a word size of 6 and the NOP AM factor for the scoring matrix) or using Gap with its default parameters as provided in GCG Version 6.1, herein incorporated by reference.
- sequences can be compared using the computer program, BLAST (Altschul et al, J. Mol. Biol. 215:403-410 (1990); Gish and States, Nature Genet. 3:266-272 (1993); Madden et al, Meth. Enzymol. 266:131-141 (1996); Altschul et al, Nucleic Acids Res.
- nucleic acid or fragment thereof indicates that, when optimally aligned with appropriate nucleotide insertions or deletions with another nucleic acid (or its complementary strand), there is nucleotide sequence identity in at least about 50%, more preferably 60% of the nucleotide bases, usually at least about 70%, more usually at least about 80%, preferably at least about 90%, and more preferably at least about 95%, 96%, 97%, 98% or 99% of the nucleotide bases, as measured by any well-known algorithm of sequence identity, such as FASTA, BLAST or Gap, as discussed above.
- sequence identity such as FASTA, BLAST or Gap
- nucleic acid or fragment thereof hybridizes to another nucleic acid, to a strand of another nucleic acid, or to the complementary strand thereof, under stringent hybridization conditions.
- Stringent hybridization conditions and “stringent wash conditions” in the context of nucleic acid hybridization experiments depend upon a number of different physical parameters. Nucleic acid hybridization will be affected by such conditions as salt concentration, temperature, solvents, the base composition of the hybridizing species, length of the complementary regions, and the number of nucleotide base mismatches between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the art. One having ordinary skill in the art knows how to vary these parameters to achieve a particular stringency of hybridization.
- “stringent hybridization” is performed at about 25 °C below the thermal melting point (T m ) for the specific DNA hybrid under a particular set of conditions.
- “Stringent washing” is performed at temperatures about 5 °C lower than the T m for the specific DNA hybrid under a particular set of conditions.
- the T m is the temperature at which 50% of the target sequence hybridizes to a perfectly matched probe.
- stringent conditions are defined for solution phase hybridization as aqueous hybridization (i.e., free of formamide) in 6X SSC (where 20X SSC contains 3.0 M NaCl and 0.3 M sodium citrate), 1% SDS at 65 °C for 8-12 hours, followed by two washes in 0.2X SSC, 0.1% SDS at 65 °C for 20 minutes. It will be appreciated by the skilled worker that hybridization at 65 °C will occur at different rates depending on a number of factors including the length and percent identity of the sequences which are hybridizing.
- the nucleic acids (also referred to as polynucleotides) of this invention may include both sense and antisense strands of RNA, cDNA, genomic DNA, and synthetic forms and mixed polymers of the above. They may be modified chemically or biochemically or may contain non-natural or derivatized nucleotide include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucle
- nucleic acid sequences include, for example, those in which peptide linkages substitute for phosphate linkages in the backbone of the molecule.
- modifications can include, for example, analogs in which the ribose ring contains a bridging moiety or other structure such as the modifications found in "locked" nucleic acids.
- mutated when applied to nucleic acid sequences means that nucleotides in a nucleic acid sequence may be inserted, deleted or changed compared to a reference nucleic acid sequence. A single alteration may be made at a locus (a point mutation) or multiple nucleotides may be inserted, deleted or changed at a single locus.
- one or more alterations may be made at any number of loci within a nucleic acid sequence.
- a nucleic acid sequence may be mutated by any method known in the art including but not limited to mutagenesis techniques such as "error-prone PCR" (a process for performing PCR under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product; see, e.g., Leung et al, Technique, 1:11-15 (1989) and Caldwell and Joyce, PCR Methods Applic.
- vector as used herein is intended to refer to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- vector is a "plasmid”, which refers to a circular double stranded DNA loop into which additional DNA segments may be ligated.
- vectors include cosmids, bacterial artificial chromosomes (BAC) and yeast artificial chromosomes (YAC).
- BAC bacterial artificial chromosome
- YAC yeast artificial chromosome
- Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome (discussed in more detail below).
- Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., vectors having an origin of rephcation which functions in the host cell).
- Other vectors can be integrated into the genome of a host cell upon introduction into the host cell, and are thereby replicated along with the host genome.
- certain preferred vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "recombinant expression vectors" (or simply, "expression vectors").
- sequence of interest or “gene of interest” refers to a nucleic acid sequence, typically encoding a protein, that is not normally produced in the host cell.
- the methods disclosed herein allow one or more sequences of interest or genes of interest to be stably integrated into a host cell genome.
- Non-limiting examples of sequences of interest include sequences encoding one or more polypeptides having an enzymatic activity, e.g., an enzyme which affects N-glycan synthesis in a host such as mannosyltransferases, N- acetylglucosaminyltransferases, UDP-N-acetylglucosamine transporters, galactosyltransferases and sialyltransferases.
- Other non-limiting examples include sequences encoding one or more polypeptides having an enzymatic activity, e.g., an enzyme which affects O-glycan synthesis in a host such as protein- mannosyltransferase (PMT) genes.
- PMT protein- mannosyltransferase
- Still other sequences encode proteins of interest such as kringle domains of the human plasminogen, erythropoietin, cytokines such as interferon- ⁇ , interferon- ⁇ , interferon- ⁇ , interferon- ⁇ , and granulocyte-CSF, coagulation factors such as factor VIII, factor IX, and human protein C, soluble IgE receptor -chain, IgG, IgG fragments, IgM, urokinase, chymase, and urea trypsin inhibitor, IGF-binding protein, epidermal growth factor, growth hormone-releasing factor, annexin V fusion protein, angiostatin, vascular endothelial growth factor-2, myeloid progenitor inhibitory factor- 1, osteoprotegerin, ⁇ -1 antitrypsin, D ⁇ ase II and - feto proteins.
- proteins of interest such as kringle domains of the human plasminogen, erythropoietin
- marker sequence refers to a nucleic acid sequence capable of expressing an activity that allows either positive or negative selection for the presence or absence of the sequence within a host cell.
- the P. pastoris URA5 gene is a marker gene because its presence can be selected for by the ability of cells containing the gene to grow in the absence of uracil. Its presence can also be selected against by the inability of cells containing the gene to grow in the presence of 5-FOA. Marker sequences or genes do not necessarily need to display both positive and negative selectability.
- Non-limiting examples of marker sequences or genes from P.pastoris include ADE1, ARG4, HIS4 and URA3.
- “Operatively linked” expression control sequences refers to a linkage in which the expression control sequence is contiguous with the gene of interest to control the gene of interest, as well as expression confrol sequences that act in trans or at a distance to confrol the gene of interest.
- expression control sequence refers to polynucleotide sequences which are necessary to affect the expression of coding sequences to which they are operatively linked. Expression control sequences are sequences which control the transcription, post-transcriptional events and translation of nucleic acid sequences.
- Expression confrol sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (e.g., ribosome binding sites); sequences that enhance protein stability; and when desired, sequences that enhance protein secretion.
- control sequences differs depending upon the host organism; in prokaryotes, such confrol sequences generally include promoter, ribosomal binding site, and transcription termination sequence.
- control sequences is intended to include, at a minimum, all components whose presence is essential for expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences.
- recombinant host cell (or simply “host cell”), as used herein, is intended to refer to a cell into which a recombinant vector has been introduced. It should be understood that such terms are intended to refer not only to the particular subject cell but to the progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term "host cell” as used herein.
- a recombinant host cell may be an isolated cell or cell line grown in culture or may be a cell which resides in a living tissue or organism.
- polypeptide refers to a short polypeptide, e.g. , one that is typically less than about 50 amino acids long and more typically less than about 30 amino acids long.
- the term as used herein encompasses analogs and mimetics that mimic structural and thus biological function.
- polypeptide encompasses both naturally-occurring and non- naturally-occurring proteins, and fragments, mutants, derivatives and analogs thereof.
- a polypeptide may be monomeric or polymeric. Further, a polypeptide may comprise a number of different domains each of which has one or more distinct activities.
- isolated protein or "isolated polypeptide” is a protein or polypeptide that by virtue of its origin or source of derivation (1) is not associated with naturally associated components that accompany it in its native state, (2) exists in a purity not found in nature, where purity can be adjudged with respect to the presence of other cellular material (e.g., is free of other proteins from the same species) (3) is expressed by a cell from a different species, or (4) does not occur in nature (e.g., it is a fragment of a polypeptide found in nature or it includes amino acid analogs or derivatives not found in nature or linkages other than standard peptide bonds).
- polypeptide that is chemically synthesized or synthesized in a cellular system different from the cell from which it naturally originates will be “isolated” from its naturally associated components.
- a polypeptide or protein may also be rendered substantially free of naturally associated components by isolation, using protein purification techniques well known in the art.
- isolated does not necessarily require that the protein, polypeptide, peptide or oligopeptide so described has been physically removed from its native environment.
- polypeptide fragment refers to a polypeptide that has a deletion, e.g., an amino-terminal and/or carboxy-terminal deletion compared to a full-length polypeptide.
- the polypeptide fragment is a contiguous sequence in which the amino acid sequence of the fragment is identical to the corresponding positions in the naturally-occurring sequence. Fragments typically are at least 5, 6, 7, 8, 9 or 10 amino acids long, preferably at least 12, 14, 16 or 18 amino acids long, more preferably at least 20 amino acids long, more preferably at least 25, 30, 35, 40 or 45, amino acids, even more preferably at least 50 or 60 amino acids long, and even more preferably at least 70 amino acids long.
- a “modified derivative” refers to polypeptides or fragments thereof that are substantially homologous in primary structural sequence but which include, e.g., in vivo or in vitro chemical and biochemical modifications or which incorporate amino acids that are not found in the native polypeptide. Such modifications include, for example, acetylation, carboxylation, phosphorylation, glycosylation, ubiquitination, labeling, e.g., with radionuclides, and various enzymatic modifications, as will be readily appreciated by those skilled in the art.
- a variety of methods for labeling polypeptides and of substituents or labels useful for such purposes are well known in the art, and include radioactive isotopes such as 125 1, 32 P, 35 S, and 3 H, ligands which bind to labeled antiligands (e.g., antibodies), fluorophores, chemiluminescent agents, enzymes, and antiligands which can serve as specific binding pair members for a labeled ligand.
- the choice of label depends on the sensitivity required, ease of conjugation with the primer, stability requirements, and available instrumentation.
- Methods for labeling polypeptides are well known in the art. See, e.g., Ausubel et al, Current Protocols in Molecular Biology, Greene Publishing Associates (1992, and Supplements to 2002) (hereby incorporated by reference).
- fusion protein refers to a polypeptide comprising a polypeptide or fragment coupled to heterologous amino acid sequences. Fusion proteins are useful because they can be constructed to contain two or more desired functional elements from two or more different proteins.
- a fusion protein comprises at least 10 contiguous amino acids from a polypeptide of interest, more preferably at least 20 or 30 amino acids, even more preferably at least 40, 50 or 60 amino acids, yet more preferably at least 75, 100 or 125 amino acids. Fusions that include the entirety of the proteins of the present invention have particular utility.
- the heterologous polypeptide included within the fusion protein of the present invention is at least 6 amino acids in length, often at least 8 amino acids in length, and usefully at least 15, 20, and 25 amino acids in length.
- Fusions that include larger polypeptides, such as an IgG Fc region, and even entire proteins, such as the green fluorescent protein (“GFP") chromophore-containing proteins, have particular utility. Fusion proteins can be produced recombinantly by constructing a nucleic acid sequence which encodes the polypeptide or a fragment thereof in frame with a nucleic acid sequence encoding a different protein or peptide and then expressing the fusion protein. Alternatively, a fusion protein can be produced chemically by crosslinking the polypeptide or a fragment thereof to another protein.
- GFP green fluorescent protein
- the term “antibody” refers to a polypeptide, at least a portion of which is encoded by at least one immunoglobulin gene, or fragment thereof, and that can bind specifically to a desired target molecule.
- the term includes naturally-occurring forms, as well as fragments and derivatives.
- Fragments within the scope of the term “antibody” include those produced by digestion with various proteases, those produced by chemical cleavage and/or chemical dissociation and those produced recombinantly, so long as the fragment remains capable of specific binding to a target molecule. Among such fragments are Fab, Fab', Fv, F(ab') 2 , and single chain Fv (scFv) fragments.
- Derivatives within the scope of the term include antibodies (or fragments thereof) that have been modified in sequence, but remain capable of specific binding to a target molecule, including: interspecies chimeric and humanized antibodies; antibody fusions; heteromeric antibody complexes and antibody fusions, such as diabodies (bispecific antibodies), single-chain diabodies, and intrabodies (see, e.g., Intracellular Antibodies: Research and Disease Applications, (Marasco, ed., Springer- Verlag New York, Inc., 1998), the disclosure of which is incorporated herein by reference in its entirety).
- non-peptide analog refers to a compound with properties that are analogous to those of a reference polypeptide.
- a non-peptide compound may also be termed a "peptide mimetic” or a "peptidomimetic”.
- a "polypeptide mutant” or “mutein” refers to a polypeptide whose sequence contains an insertion, duplication, deletion, rearrangement or substitution of one or more amino acids compared to the amino acid sequence of a native or wild-type protein.
- a mutein may have one or more amino acid point substitutions, in which a single amino acid at a position has been changed to another amino acid, one or more insertions and/or deletions, in which one or more amino acids are inserted or deleted, respectively, in the sequence of the naturally-occurring protein, and/or truncations of the amino acid sequence at either or both the amino or carboxy termini.
- a mutein may have the same but preferably has a different biological activity compared to the naturally-occurring protein.
- a mutein has at least 65% overall sequence homology to its wild-type counterpart. Even more preferred are muteins having at least 70%, 75%, 80%, 85% or 90% overall sequence homology to the wild-type protein.
- a mutein exhibits at least 95% sequence identity, even more preferably 98%, even more preferably 99% and even more preferably 99.9% overall sequence identity. Sequence homology may be measured by any common sequence analysis algorithm, such as Gap or Bestfit.
- Amino acid substitutions can include those which: (1) reduce susceptibility to proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for forming protein complexes, (4) alter binding affinity or enzymatic activity, and (5) confer or modify other physicochemical or functional properties of such analogs.
- Examples of unconventional amino acids include: 4-hydroxyproline, ⁇ -carboxyglutamate, e-NNN-trimethyllysine, e-N-acetyllysine, O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine, 5-hydroxylysine, N-methylarginine, and other similar amino acids and imino acids (e.g., 4-hydroxyproline).
- the left-hand end corresponds to the amino terminal end and the right-hand end corresponds to the carboxy-terminal end, in accordance with standard usage and convention.
- a protein has "homology” or is “homologous” to a second protein if the nucleic acid sequence that encodes the protein has a similar sequence to the nucleic acid sequence that encodes the second protein.
- a protein has homology to a second protein if the two proteins have "similar” amino acid sequences.
- the term "homologous proteins” is defined to mean that the two proteins have similar amino acid sequences.
- a homologous protein is one that exhibits at least 65% sequence homology to the wild type protein, more preferred is at least 70% sequence homology. Even more preferred are homologous proteins that exhibit at least 75%, 80%, 85% or 90% sequence homology to the wild type protein.
- a homologous protein exhibits at least 95%, 98%, 99% or 99.9% sequence identity.
- homology between two regions of amino acid sequence is interpreted as implying similarity in function.
- Sequence homology for polypeptides is typically measured using sequence analysis software. See, e.g., the Sequence Analysis Software Package of the Genetics Computer Group (GCG), University of Wisconsin Biotechnology Center, 910 University Avenue, Madison, Wisconsin 53705. Protein analysis software matches similar sequences using a measure of homology assigned to various substitutions, deletions and other modifications, including conservative amino acid substitutions. For instance, GCG contains programs such as "Gap” and "Bestfit” which can be used with default parameters to determine sequence homology or sequence identity between closely related polypeptides, such as homologous polypeptides from different species of organisms or between a wild-type protein and a mutein thereof.
- GCG Genetics Computer Group
- Bestfit programs
- BLAST Altschul et al, J. Mol. Biol. 215:403-410 (1990); Gish and States, Nature Genet. 3:266-272 (1993); Madden et al, Meth. Enzymol. 266:131-141 (1996); Altschul et al, Nucleic Acids Res. 25:3389-3402 (1997); Zhang and Madden, Genome Res. 7:649-656 (1997)), especially blastp or tblastn (Altschul et al, Nucleic Acids Res. 25:3389-3402 (1997)).
- Preferred parameters for BLASTp are:
- the length of polypeptide sequences compared for homology will generally be at least about 16 amino acid residues, usually at least about 20 residues, more usually at least about 24 residues, typically at least about 28 residues, and preferably more than about 35 residues.
- database searching using amino acid sequences can be measured by algorithms other than blastp known in the art.
- polypeptide sequences can be compared using FASTA, a program in GCG Version 6.1. FASTA provides alignments and percent sequence identity of the regions of the best overlap between the query and search sequences. Pearson, Methods Enzymol. 183:63-98 (1990) (herein incorporated by reference).
- percent sequence identity between amino acid sequences can be determined using FASTA with its default parameters (a word size of 2 and the PAM250 scoring matrix), as provided in GCG Version 6.1, herein incorporated by reference.
- Specific binding refers to the ability of two molecules to bind to each other in preference to binding to other molecules in the environment.
- “specific binding” discriminates over adventitious binding in a reaction by at least two-fold, more typically by at least 10-fold, often at least 100-fold.
- the affinity or avidity of a specific binding reaction, as quantified by a dissociation constant is about 10 "7 M or stronger (e.g., about 10 "8 M, 10 "9 M or even stronger).
- region refers to a physically contiguous portion of the primary structure of a biomolecule. In the case of proteins, a region is defined by a contiguous portion of the amino acid sequence of that protein.
- domain refers to a structure of a biomolecule that contributes to a known or suspected function of the biomolecule. Domains may be co-extensive with regions or portions thereof; domains may also include distinct, non-contiguous regions of a biomolecule. Examples of protein domains include, but are not limited to, an Ig domain, an extracellular domain, a transmembrane domain, and a cytoplasmic domain.
- molecule means any compound, including, but not limited to, a small molecule, peptide, protein, sugar, nucleotide, nucleic acid, lipid, etc., and such a compound can be natural or synthetic.
- all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Exemplary methods and materials are described below, although methods and materials similar or equivalent to those described herein can also be used in the practice of the present invention and will be apparent to those of skill in the art. All publications and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will confrol.
- the present invention provides isolated nucleic acid molecules that include the URA5 gene from P. pastoris and variants thereof.
- the full-length nucleic acid sequence for this gene which encodes the enzyme orotate- phosphoribosyl transferase (OPRTase, EC 2.4.2.10), has been identified and comipnr H a « set forth in Fie. 1.
- SEQ ID NO:l Included within the cloned genomic sequence (SEQ ID NO:l) is a coding sequence for orotate-phosphoribosyl transferase (SEQ ID NO:2).
- SEQ ID NO:3 The encoded amino acid sequence is also set forth in Fig. 1 (SEQ ID NO:3).
- the URA5 gene is particularly useful as a reuseable, selectable and counterselectable marker.
- nucleic acid molecules capable of promoting the stable genetic integration of heterologous genes (i.e. genes of interest) into a host genome.
- the combination of the URA5 marker and nucleic acids capable of promoting stable genetic integration enables extensive strain modification. It will be readily apparent to a skilled artisan that the repeated use of the methods disclosed herein allows multiple genes to be disrupted in various loci and further allows the insertion at these sites of any gene or genes of interest. Genes inserted by the disclosed approaches become stably integrated at a selected region in the genomic DNA of the host cells.
- the invention provides an isolated nucleic acid molecule having a nucleic acid sequence comprising or consisting of a wild-type P. pastoris URA5 coding sequence (SEQ ID NO:2), and homologs, variants and derivatives thereof.
- the invention also provides a nucleic acid molecule comprising or consisting of a sequence which is a degenerate variant of the wild- type P. pastoris URA5 gene.
- the invention provides a nucleic acid molecule comprising or consisting of a sequence which is a variant of the P. pastoris URA5 gene having at least 65% identity to the wild-type gene.
- the nucleic acid sequence can preferably have at least 70%, 75% or 80% identity to the wild-type gene. Even more preferably, the nucleic acid sequence can have 85%, 90%, 95%, 98%, 99%, 99.9% or even higher identity to the wild-type gene.
- the nucleic acid molecule of the invention encodes a polypeptide having the amino acid sequence of SEQ ID NO:3. Also provided is a nucleic acid molecule encoding a polypeptide sequence that is at least 65% identical to SEQ ID NO:3. Typically the nucleic acid molecule of the invention encodes a polypeptide sequence of at least 70%, 75% or 80% identity to SEQ ID NO:3.
- the encoded polypeptide is 85%, 90% or 95% identical to SEQ ID NO:3, and the identity can even more preferably be 98%, 99%, 99.9% or even higher.
- the invention provides a fragment of the SEC65 gene from P. pastoris. This fragment, which is located downstream from and in the opposite orientation to the URA5 gene, has been identified as set forth in Fig. 1 (SEQ ID NO:4). The amino acid sequence encoded by the SEC65 fragment is also set forth in Fig. 1 (SEQ ID NO:5). Accordingly, the present invention provides isolated nucleic acid molecules that include a wild-type SEC65 gene fragment from P. pastoris and homologs, variants and derivatives thereof.
- the invention provides an isolated nucleic acid molecule having a nucleic acid sequence comprising or consisting of a fragment of the wild-type P. pastoris SEC65 gene (SEQ ID NO:4), and homologs, variants and derivatives thereof.
- the nucleic acid sequence is a degenerate variant of the P. pastoris SEC65 gene fragment.
- the nucleic acid sequence is a variant of the P. pastoris SEC65 gene fragment having at least 65% identity to the wild-type gene fragment.
- the nucleic acid sequence can preferably have at least 70%, 75% or 80% identity to the wild-type gene fragment.
- the nucleic acid sequence can have 85%, 90%, 95%, 98%, 99%, 99.9% or even higher identity to the wild-type gene fragment.
- the nucleic acid molecule of the invention encodes a polypeptide having the amino acid sequence of SEQ ID NO:5. Also provided is a nucleic acid molecule encoding a polypeptide sequence that is at least 65% identical to SEQ ID NO:5. Typically, the nucleic acid molecule of the invention encodes a polypeptide sequence of at least 70%, 75% or 80% identity to SEQ ID NO:5. Preferably, the encoded polypeptide is 85%, 90% or 95% identical to SEQ ID NO:5, and the identity can even more preferably be 98%, 99%, 99.9% or even higher.
- the invention provides a fragment of the SCS7 gene from P. pastoris.
- This fragment which is located upstream from and in the same orientation as the URA5 gene, is identified as set forth in Fig. 1 (SEQ ID NO: 6).
- the amino acid sequence encoded by the SCS7 fragment is also set forth in Fig. 1 (SEQ ID NO:7).
- the present invention thus provides isolated nucleic acid molecules that include a P. pastoris wild-type SCS7 gene fragment and variants thereof.
- the invention provides an isolated nucleic acid molecule having a nucleic acid sequence comprising or consisting of a fragment of the wild-type P. pastoris SCS7 gene (SEQ ID NO:6), and homologs, variants and derivatives thereof.
- the nucleic acid sequence is a degenerate variant of the P. pastoris SCS7 gene fragment.
- the nucleic acid sequence is a variant of the P. pastoris SCS7 gene fragment having at least 65% identity to the wild-type gene fragment.
- the nucleic acid sequence can preferably have at least 70%, 75% or 80% identity to the wild-type gene fragment.
- the nucleic acid sequence can have 85%, 90%, 95%, 98%, 99%, 99.9% or even higher identity to the wild-type gene fragment.
- the nucleic acid molecule of the invention encodes a polypeptide having the amino acid sequence of SEQ ID NO:7. Also provided is a nucleic acid molecule encoding a polypeptide sequence that is at least 65% identical to SEQ ID NO:7.
- the nucleic acid moleucle of the invention encodes a polypeptide sequence of at least 70%, 75% or 80% identity to SEQ ID NO:7.
- the encoded polypeptide is 85%, 90% or 95% identical to SEQ ID NO:7, and the identity can even more preferably be 98%, 99%, 99.9% or even higher.
- the invention also provides nucleic acid molecules that hybridize under stringent conditions to the above-described nucleic acid molecules.
- stringent hybridizations are performed at about 25 °C below the thermal melting point (T m ) for the specific DNA hybrid under a particular set of conditions, where the T m is the temperature at which 50% of the target sequence hybridizes to a perfectly matched probe.
- Stringent washing is performed at temperatures about 5 °C lower than the T m for the specific DNA hybrid under a particular set of conditions.
- Nucleic acid molecules comprising a fragment of any one of the above- described nucleic acid sequences are also provided. These fragments preferably contain at least 20 contiguous nucleotides. More preferably the fragments of the nucleic acid sequences contain at least 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or even more contiguous nucleotides.
- the nucleic acid sequence fragments of the present invention display utility in a variety of systems and methods.
- the fragments may be used as probes in various hybridization techniques.
- the target nucleic acid sequences may be either DNA or RNA.
- the target nucleic acid sequences may be fractionated (e.g., by gel electrophoresis) prior to the hybridization, or the hybridization may be performed on samples in situ.
- nucleic acid probes of known sequence find utility in determining chromosomal structure (e.g. , by Southern blotting) and in measuring gene expression (e.g., by Northern blotting).
- sequence fragments are preferably detectably labeled, so that their specific hydridization to target sequences can be detected and optionally quantified.
- nucleic acid fragments of the present invention may be used in a wide variety of blotting techniques not specifically described herein.
- nucleic acid sequence fragments disclosed herein also find utility as probes when immobilized on microarrays.
- Methods for creating microarrays by deposition and fixation of nucleic acids onto support substrates are well known in the art. Reviewed in DNA Microarrays : A Practical Approach (Practical Approach Series), Schena (ed.), Oxford University Press (1999) (ISBN: 0199637768); Nature Genet. 21(l)(suppl):l-60 (1999); Microarray Biochip: Tools and Technology, Schena (ed.), Eaton Publishing Company/BioTechniques Books Division (2000) (ISBN: 1881299376), the disclosures of which are incorporated herein by reference in their entireties.
- microarrays comprising nucleic acid sequence fragments, such as the nucleic acid sequence fragments disclosed herein, are well-established utility for sequence fragments in the field of cell and molecular biology.
- sequence fragments immobilized on microarrays are described in Gerhold et al, Trends Biochem. Sci. 24:168-173
- isolated nucleic acid molecules encoding a polypeptide having orotate-phosphoribosyl transferase activity are provided.
- enzyme activities can be measured in various ways.
- the pyrophosphorolysis of OMP may be followed spectroscopically. Grubmeyer et al, J. Biol. Chem. 268:20299-20304 (1993). Additional examples of substrates useful for the specfroscopic assay of orotate-phosphoribosyl transferase activity are also known in the art. Shostak et al, Anal Biochem. 191:365-369 (1990).
- the activity of the enzyme can be followed using chromatographic techniques, such as by high performance liquid chromatography. Chung and Sloan, J Chromatogr. 371:71-81 (1986). Other methods and techniques may also be suitable for the measurement of enzyme activity, as would be known by one of skill in the art.
- the invention also provides recombinant DNA molecules comprising a cassette containing the P. pastoris URA5 gene, or a homolog, variant or derivative thereof, flanked by direct repeat sequences.
- the direct repeat sequences are of sufficient length to mediate efficient homologous recombination, thereby providing a means for deleting the URA5 marker from the host cell in preparation for another round of transformation using the URA5 gene as a positive selection marker.
- the direct repeat sequences are preferably at least 200 nucleotides in length (see, e.g., Wilson et al, Yeast; 16: 65-70 (2000)).
- the direct repeat sequences are from around 200 nucleotides to around 1,100 nucleotides, but they may be even longer.
- the direct repeat sequences are derived from hisG segments of Salmonella.
- the direct repeats are obtained from segments of the lacZ reading frame.
- One of skill in the art will readily appreciate that virtually any other direct repeat sequences may also be used to provide flanking sequences for recombination according to this aspect of the invention.
- the £/7 45-containing cassettes of the invention comprise URA5 sequences with flanking direct repeat sequences which mediate subsequent excission of URA5 sequences from the host. Such URA5 cassettes allow for both selection and counterselection for the URA5 gene activity.
- the positive selection step is based on relieving auxofrophy to uracil, and the counterselection is based on the acquisition of resistance to 5-FOA in uracil prototrophs. Boeke et al, Mol. Gen. Genet. 197:345-346 (1984).
- the present invention provides a recombinant nucleic acid molecule comprising a P. pastoris URA5 gene flanked by direct repeats (e.g., lacZ- URA5-lacZ, a "URA5 cassette"), which, upon expression, allows for selection and counterselection in a URA5 ' host.
- yeast transformed with the P. pastoris URA5 cassette have integrated the URA5 gene, e.g., into the host genome, at a selected location by homologous recombination between host and recombinant nucleic acid sequences.
- the host is deleted for endogenous URA5 sequences to discourage homologous recombination into an endogenous URA5 locus.
- the URA5 cassette-containing recombinant nucleic acid molecule preferably comprises sequences which target integration of URA5 and other desired sequences into a select location of the yeast host. As described, such transformants are selected on the basis of conversion from Ura " to Ura " phenotypes.
- the direct repeats flanking the URA5 marker gene then facilitate homologous recombination events which delete the internal URA5 marker. Cells that have undergone such an event revert back to Ura " and are selected by their ability to grow in the presence of 5-FOA.
- this method provides for efficient, stable integration of heterologous sequences into a host cell.
- this marker gene is relatively small, only about 1 kb.
- the small size of the marker allows for construction of smaller plasmids.
- the small size should reduce the rate of gene conversion of the auxotrophic marker gene during transformation in a Ura ' host strain which is not deleted for URA5 sequences. This undesirable outcome can account for 10- 50% of transformed colonies in the case of the HIS4 marker. Higgins and Cregg, Meth. Mol. Biol, 103:1-15 (1998).
- a lower rate of gene conversion should increase the fraction of transformants having knock-ins at the desired target site. The P.
- the isolated nucleic acid molecules of the instant invention may additionally include a sequence or gene of interest.
- a sequence or gene of interest typically encodes a protein that is not normally produced in the host cell.
- yeast transformed with the sequence or gene of interest have stably integrated the sequence or gene of interest, e.g., into the host genome, at a selected location by homologous recombination between host and recombinant nucleic acid sequences.
- the sequence or gene of interest may be preferably linked to one or more expression control sequences, so that the protein encoded by the sequence can be expressed under appropriate conditions in host cells that contain the isolated nucleic acid molecule.
- the invention additionally provides isolated nucleic acid molecules encoding a fragment of the P. pastoris SEC65 protein.
- the S. cerevisiae homolog of this protein is related to mammalian SRP19, a subunit of the signal recognition particle, and is thought to have similar function. Hann et al, Nature 356:532-533 (1992); Stirling and Hewitt, N ⁇ twre 356:534-537 (1992). Mutations in the S.
- S. cerevisiae SEC65 gene can cause temperature-sensitive cell growth and defects in the translocation of several secreted and membrane-bound proteins.
- the S. cerevisiae SEC65 protein is required for the stable association of another subunit, SRP54p, with the signal recognition particle.
- SRP54p subunit of SRP54p
- Overexpression of SRP54p suppresses both growth and protein translocation defects in cells carrying a temperature-sensitive defect in the SEC65 gene.
- Nucleic acid molecules encoding a fragment of the P. pastoris SEC65 gene can be used to identify the full-length gene and can further be used to probe the expression and functional activity of the encoded protein. Such activities may include structural and functional roles in the P. pastoris signal recognition particle and related effects on protein translocation across the endoplasmic reticulum.
- the invention further provides isolated nucleic acid molecules encoding a fragment of the P. pastoris SCS7 protein. Mutants of S. cerevisiae that lack the S. cerevisiae homolog of SCS7 fail to accumulate an inositolphosphorylceramide species, IPC-C, which is the predominant form found in wild-type cells.
- the full-length S. cerevisiae SCS7 gene encodes a protein that contains both a cytochrome b5-like domain and a domain that resembles the family of cytochrome b5-dependent enzymes that use iron and oxygen to catalyse desaturation or hydroxylation of fatty acids and sterols. Id.
- the encoded protein is therefore likely to be the enzyme that hydroxylates the C26-fatty acid of IPC-C. Effects of mutations in the SCS7 gene on the lipid composition of a cell can be measured as described in Haak et al, J. Biol Chem.
- the isolated nucleic acid molecules encoding a fragment of the P. pastoris SCS7 protein of the present invention can be used to identify and characterize the full-length form of the SCS7 gene.
- the isolated nucleic acid molecules of the invention can also be used to measure expression of the SCS7 gene and to further characterize the structure and function of this gene and its encoded protein and the effects of alterations in this gene on cellular metabolism.
- degenerate Ohgonucleotides Useful for Cloning of P.pastoris URA5 are provided. These ohgonucleotides are capable of amplifying different portions of the P. pastoris URA5 gene. They can also bind to and amplify portions of the S. cerevisiae URA 10 gene. That the ohgonucleotides only amplify the URA5 gene in P. pastoris suggests that this organism does not posses the URA10 gene . The ohgonucleotides anneal to positions of the URA5 gene as shown in Fig. 3. Such ohgonucleotides are also useful in hybridization and amplification experiments.
- vectors including expression vectors, which comprise the above nucleic acid molecules of the invention, as described further herein.
- the vectors include the isolated nucleic acid molecules described above.
- the vectors of the invention include the above-described nucleic acid molecules operably linked to one or more expression control sequences.
- the vectors of the instant invention may thus be used to express a polypeptide having orotate-phosphoribosyl transferase activity.
- the vectors of the invention may also include an element which ensures that they are stably maintained at a single copy in each cell (e.g., a centromere-like sequence such as "CEN").
- the autonomously replicating vector may optionally comprise an element which enables the vector to be replicated to higher than one copy per host cell (e.g., an autonomously replicating sequence or "ARS").
- ARS autonomously replicating sequence
- the vectors are non- autonomously replicating, integrative vectors designed to function as gene disruption or replacement cassettes.
- An example of an integrative vector of this type comprises at least at portion of a heterologous target gene linked to P.
- the integrative vectors of the invention may include additionally heterologous sequences encoding proteins having desirable properties, e.g., those encoding glycosylation enzymes, so that the desired sequences can be introduced into the host cell genome as a result of the integration. These sequences remain in the host cell genome even after the OPT-encoding sequences have been deleted by recombination between flanking direct repeat sequences.
- isolated polypeptides (including muteins, allelic variants, fragments, derivatives, and analogs) encoded by the nucleic acid molecules of the invention are provided.
- the isolated polypeptide comprises the polypeptide sequence corresponding to SEQ ID NOs:3, 5 or 7.
- the isolated polypeptide comprises a polypeptide sequence at least 65% identical to SEQ ID NOs:3, 5 or 7.
- the isolated polypeptide of the invention has at least 70%, 75% or 80%o identity to SEQ ID NOs:3, 5 or 7. More preferably, the identity is 85%, 90% or 95%, but the identity to SEQ ID NOs:3, 5 or 7 can be 98%, 99%, 99.9% or even higher.
- isolated polypeptides comprising a fragment of the above-described polypeptide sequences are provided. These fragments preferably include at least 20 contiguous amino acids, more preferably at least 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or even more contiguous amino acids.
- the polypeptides of the present invention also include fusions between the above-described polypeptide sequences and heterologous polypeptides.
- the heterologous sequences can, for example, include heterologous sequences designed to facilitate purification and or visualization of recombinantly-expressed proteins.
- Other non-limiting examples of protein fusions include those that permit display of the encoded protein on the surface of a phage or a cell, fusions to intrinsically fluorescent proteins, such as green fluorescent protein (GFP), and fusions to the IgG Fc region.
- GFP green fluorescent protein
- host cells transformed with the nucleic carry the nucleic acid sequences of the invention on vectors, which may but need not be freely replicating vectors (see below).
- the nucleic acids have been integrated into the genome of the host cells.
- the host cells of the invention have been mutated by recombination with a disruption, deletion or mutation of the isolated nucleic acid of the invention so that the activity of orotate-phosphoribosyl transferase activity in the host cell is reduced compared to a host cell lacking the mutation.
- the host cell of the invention is preferably Pichia pastoris or Pichia methanolica, but other host cells, especially yeast cells, are also encompassed within the scope of the invention.
- host cells defective in orotate- phosphoribosyl transferase activity are used to integrate one or more sequences or genes of interest into the host cell genome using nucleic acid molecules and/or methods of the invention.
- the sequences or genes of interest are integrated so as to disrupt an endogenous gene of the host cell.
- Cells containing the integration are identified by the recovery of uracil prototrophy due to the concomitant integration of a gene encoding P. pastoris orotate- phosphoribosyl transferase.
- uracil auxotrophs of the modified host cells are provided by selection of cells in which the P. pastoris orotate-phosphoribosyl transferase gene has been excised by homologous recombination.
- the invention provides isolated antibodies, including fragments and derivatives thereof, that bind specifically to the isolated polypeptides and polypeptide fragments of the present invention or to one or more of the polypeptides encoded by the isolated nucleic acids of the present invention.
- the antibodies of the present invention maybe specific for linear epitopes, discontinuous epitopes or conformational epitopes of such polypeptides or polypeptide fragments, either as present on the polypeptide in its native conformation or, in some cases, as present on the polypeptides as denatured, as, e.g., by solubilization in SDS.
- Fab fragments provided by the instant invention
- Fab' fragments provided by the instant invention
- Fv fragments
- F(ab') 2 single chain Fv fragments.
- bind specifically and “specific binding” is here intended the ability of the antibody to bind to a first molecular species in preference to binding to other molecular species with which the antibody and first molecular species are admixed.
- An antibody is said specifically to "recognize” a first molecular species when it can bind specifically to that first molecular species.
- the degree to which an antibody can discriminate as among molecular species in a mixture will depend, in part, upon the conformational relatedness of the species in the mixture; typically, the antibodies of the present invention will discriminate over adventitious binding to unrelated polypeptides by at least two-fold, more typically by at least 5 -fold, typically by more than 10-fold, 25-fold, 50-fold, 75-fold, and often by more than 100-fold, and on occasion by more than 500-fold or 1000-fold.
- the affinity or avidity of an antibody (or antibody multimer, as in the case of an IgM pentamer) of the present invention for a polypeptide or polypeptide fragment of the present invention will be at least about 1 x 10 '6 M, typically at least about 5 x 10 '7 M, usefully at least about 1 x 10 "7 M, with affinities and avidities of 1 x 10 "8 M, 5 x 10 "9 M, 1 x 10 "10 M and even stronger proving especially useful.
- the isolated antibodies of the present invention may be naturally- occurring forms, such as IgG, IgM, IgD, IgE, and IgA, from any mammalian species.
- antibodies are usefully obtained from species including rodents — typically mouse, but also rat, guinea pig, and hamster — lagomorphs, typically rabbits, and also larger mammals, such as sheep, goats, cows, and horses.
- rodents typically mouse, but also rat, guinea pig, and hamster — lagomorphs, typically rabbits, and also larger mammals, such as sheep, goats, cows, and horses.
- the animal is typically affirmatively immunized, according to standard immunization protocols, with the polypeptide or polypeptide fragment of the present invention.
- Virtually all fragments of 8 or more contiguous amino acids of the polypeptides of the present invention may be used effectively as immunogens when conjugated to a carrier, typically a protein such as bovine thyroglobulin, keyhole limpet hemocyanin, or bovine serum albumin, conveniently using a bifunctional linker, hnmunogenicity may also be conferred by fusion of the polypeptide and polypeptide fragments of the present invention to other moieties.
- peptides of the present invention can be produced by solid phase synthesis on a branched polylysine core matrix; these multiple antigenic peptides (MAPs) provide high purity, increased avidity, accurate chemical definition and improved safety in vaccine development.
- MAPs multiple antigenic peptides
- Protocols for immunization are well-established in the art. Such protocols often include multiple immunizations, either with or without adjuvants such as Freund's complete adjuvant and Freund's incomplete adjuvant.
- Antibodies of the present invention may be polyclonal or monoclonal, with polyclonal antibodies having certain advantages in immunohistochemical detection of the proteins of the present invention and monoclonal antibodies having advantages in identifying and distinguishing particular epitopes of the proteins of the present invention.
- the antibodies of the present invention may be produced using any art-accepted technique.
- Host cells for recombinant antibody production can be prokaryotic or eukaryotic.
- Prokaryotic hosts are particularly useful for producing phage displayed antibodies, as is well known in the art.
- Eukaryotic cells including mammalian, insect, plant and fungal cells, are also useful for expression of the antibodies, antibody fragments, and antibody derivatives of the present invention.
- Antibodies of the present invention can also be prepared by cell free translation. [0112]
- the isolated antibodies of the present invention, including fragments and derivatives thereof, can usefully be labeled.
- the antibodies of the present invention may usefully be labeled with an enzyme.
- the antibodies may be labeled with colloidal gold or with a fluorophore.
- the antibodies of the present invention may usefully be labeled with biotin.
- the antibodies of the present invention may usefully be labeled with radioisotopes, such as 33 P,
- a method for the genetic integration of a heterologous nucleic acid sequence into the genome of a host cell is provided.
- a host gene encoding orotate-phosphoribosyl transferase is disrupted by the introduction of a disrupted, deleted or otherwise mutated nucleic acid sequence derived from the P. pastoris URA5 gene disclosed herein.
- disrupted host cells having a point mutation, rearrangement, insertion or preferably a deletion including a "marked deletion", in which a heterologous selectable sequence has replaced the deleted URAS sequence
- Host cells disrupted in the URA5 gene and consequently lacking in orotate-phosphoribosyl transferase activity serve as suitable hosts for further embodiments of the invention in which heterologous sequences may be introduced into the host cell genome by targeted integration.
- a heterologous nucleic acid sequence is introduced into a yeast host cell lacking orotate-phosphoribosyl transferase (OPT) activity (i.e., UraS " ).
- OPT orotate-phosphoribosyl transferase
- the heterologous nucleic acid sequences introduced using this method are linked to a nucleic acid sequence that encodes the P. pastoris OPT activity, preferably on a vector.
- cells containing heterologous sequences linked to the OPT-encoding sequences of the invention may be selected based on their ability to grow in the absence of added uracil.
- the method comprises the step of introducing into a competent Ura5 " host cell an autonomously replicating vector which is passed from vector comprises heterologous nucleic acid sequences of interest linked to P. pastoris OPT-encoding sequences and optionally comprises an element which ensures that it is stably maintained at a single copy in each cell (e.g., a centromerelike sequence such as "CEN").
- the autonomously replicating vector may optionally comprise an element which enables the vector to be replicated to higher than one copy per host cell (e.g., an autonomously replicating sequence or "ARS").
- the vector is a non-autonomously replicating, integrative vector which is designed to function as a gene disruption or replacement cassette.
- An integrative vector of the invention comprises one or more regions comprising "target gene sequences" (sequences which can undergo homologous recombination with sequences at a desired genomic site in the host cell) linked to P. pastoris OPT-encoding sequences of the invention which are preferably flanked by direct repeat sequences (see below).
- the OPT-encoding sequences may be adj acent to the target gene sequences (e.g. , a gene replacement cassette) or may be engineered to disrupt the target gene sequences (e.g., a gene disruption cassette).
- a host gene that encodes an undesirable activity (e.g., an enzymatic activity) maybe mutated (e.g., interrupted) by targeting a P. pastoris OPT-encoding replacement or disruption cassette of the invention into the host gene by homologous recombination.
- an undesired glycosylation enzyme activity e.g., an initiating mannosylfransferase activity such as OCHl
- OCHl mannosylfransferase activity
- the target gene replacement or disruption cassette of the invention further comprises direct repeat sequences flanking the P. pastoris orotate-phosphoribosyl transferase gene.
- direct repeat sequences flanking the P. pastoris orotate-phosphoribosyl transferase gene The properties of such direct repeat sequences have already been described.
- the direct repeat sequences flanking the orotate-phosphoribosyl transferase gene promote the excision of the OPT-encoding gene out of the host genome.
- Cells lacking orotate-phosphoribosyl transferase activity are conveniently counterselected for their ability to grow in medium containing 5-FOA.
- a gene encoding a heterologous protein is engineered in linkage to the P. pastoris URAS gene within the gene replacement or disruption cassette.
- the cassette is integrated into a locus of the host genome which encodes an undesirable activity, such as an enzymatic activity.
- the cassette is integrated into a host gene which encodes an initiating mannosylfransferase activity such as the OCH7 gene.
- the cassette further comprises one or more genes encoding desirable glycosylation enzymes, including but not limited to mannosyltransferases, N- acetylglucosaminyltransferases (GnTs), UDP-N-acetylglucosamine transporters, galactosyltransferases (GalTs), sialyltransferases (STs) and protein- mannosyltransferases (PMTs).
- the cassette comprises one or more genes encoding useful therapeutic proteins, e.g., kringle domains of the human plasminogen, erythropoietin, cytokines such as, but not limited to, interferon- ⁇ , interferon- ⁇ , interferon- ⁇ , interferon- ⁇ , and granulocyte- CSF, coagulation factors such as factor VIII, factor IX, and human protein C, soluble IgE receptor -chain, IgG, IgG fragments, IgM, urokinase, chymase, and urea trypsin inhibitor, IGF-binding protein, epidermal growth factor, growth hormone-releasing factor, annexin V fusion protein, angiostatin, vascular endothelial growth factor-2, myeloid progenitor inhibitory factor-1, osteoprotegerin, ⁇ -1 antitrypsin, D ⁇ ase II and - feto proteins.
- useful therapeutic proteins e.g., kring
- the engineered cassette is useful for "knocking-in” genes encoding such glycosylation enzymes and other sequences of interest in strains of yeast cells to produce glycoproteins with human-like glycosylations and other useful proteins of interest. Representative methods for producing human-like glycoproteins are described in WO 02/00879 and are incorporated by reference herein.
- Escherichia coli strain DH5 ⁇ (Invitrogen, Carlsbad, CA) was used for recombinant DNA work.
- P. pastoris strains NRRL Y-l 1430 (wild-type) and JC308 (adel arg4 his4 ura3) (Lin Cereghino et al, Gene 263:159-169 (2001)) were used for construction of yeast strains.
- PCR reactions were performed according to supplier recommendations using either ExTaq (TaKaRa, Madison, WI), Taq Poly (Promega, Madison, WI) or Pfu Turbo (Stratagene, Cedar Creek, TX). Restriction and modification enzymes were from New England Biolabs (Beverly, MA) or Promega.
- PCR analysis of the modified yeast strains was as follows. A single colony was resuspended in 100 ⁇ l breaking buffer (100 mM NaCl, 10 mM Tris, pH 8.0, 1 mM EDTA). After addition of 100 mg of acid washed glass beads and 100 ⁇ l of phenol-chloroform, the solution was vortexed for 1 min. The mixture was then centrifuged for 5 min at full speed in a microcentrifuge, the supernatant recovered, and the genomic DNA was precipitated by addition of 1 ml ice cold ethanol. Following a wash with 70% ethanol, the pellet was resuspended in 10 ⁇ l breaking buffer, and 0.5 to 1 ⁇ l were used for PCR analysis. Cloning of the P. pastoris URAS gene
- a preferred method involves using the sequence homology of the existing S. cerevisiae URAS gene in combination with conservation of gene order in a variety of yeast species.
- Two genes, URA5 and SEC65, are located adjacent to one another in opposite orientations in at least four yeast species: S. cerevisiae, K. lactis, C. albicans and Y. lipolytica. Sanchez and Dominguez, Yeast 18:807-813 rt. other microorganisms (Fig. 2), and these sequences were used to design degenerate primers, e.g., using the CODEHOP strategy.
- URA5-1 (Fig. 3) (SEQ ID NO:23) and Sec65-1 (AAGAGATTTCAAGTTTTGTACCCADKNTAYTTYGA) (SEQ ID NO:29), were used to amplify a 1.1 kb DNA fragment from P. pastoris genomic DNA.
- URA5-1 is on the top strand starting from amino acid 27.
- This PCR fragment was then cloned into the pCR2.1-TOPO vector (Invitrogen, Carlsbad, CA) and sequenced.
- the 1100 bp fragment generated by PCR shows high homology on one end to URAS of S.
- the derived nucleotide sequence was used to search the partial genomic sequence of P. pastoris, as provided by Integrated Genomics, Inc. (Chicago, IL). Results of this search identified an overlapping DNA fragment that includes an additional 0.9 kb DNA sequence adjacent to the primer site. Within this sequence is the predicted initiation codon for protein translation. The predicted initiation codon is preceded by about 150 nucleotides of upstream regulatory sequences (including promotor sequences) and about 0.7 kb of the 3' region of a gene with high homology to S. cerevisiae SCS7 (Fig.
- the protein sequence derived by translation of the P. pastoris URAS gene shows about 64% identity and about 78% similarity to the URA5 gene from S. cerevisiae, and also displays high homology to URAS genes from other species.
- the complete 1947 bp fragment is shown in Fig. 1.
- URA5 usins alternative degenerate oligonucleotides [0125] Degenerate primers were designed using the CODEHOP strategy. Rose et al, Nucleic Acids Res. 26:1628-1635 (1998).
- URA5-1 (SEQ ID NO:23) is a degenerate form of the coding strand, starting from the codon encoding amino acid 27.
- URA5-2 (SEQ ID NO:24) is a degenerate form of the coding strand starting from the codon encoding amino acid 66.
- URA5-3 (SEQ ID NO:25) is the partial complement of URA5-2.
- URA5-4 (SEQ ID NO:26) is a degenerate form of the coding strand, starting from the codon encoding amino acid 105.
- URA5-5 (SEQ ID NO:27) is the partial complement of URA5-4.
- URA5-6 (SEQ ID NO:28) is a degenerate form of the non-coding strand, designed to hybridize to the segment of the coding strand starting at the codon encoding amino acid 130.
- the sequence of and positions within URAS bound by the ohgonucleotides are illustrated in Fig. 3.
- the cloned URA5 gene may be used to generate a construct to disrupt the URA5 gene from the genome of P. pastoris.
- Host cells with a disrupted URAS gene were created using a P. pastoris URA5 disruption cassette as follows.
- Ura5-55 (GGGATATCGGCCTTTGTTGATGCAAGTTTTACGTGGATC ) (SEQ ID NO:30) and Ura5-53p
- Ura5-35p GACGCGTCGACGGTCTTTTCAACAAAGCTCCATTAGTGAG
- Ura5-33 ohgonucleotides Ura5-35p (GACGCGTCGACGGTCTTTTCAACAAAGCTCCATTAGTGAG) (SEQ ID NO:32) and Ura5-33
- Plasmid pJN266 (Fig. 4) consists of two fragments segments flank a P. pastoris GAPDH promotor, a S. cerevisiae CYC1 transcriptional terminator expression cassette ("CYC1 TT") and a S. cerevisiae URA3 auxotrophic marker cassette. All regions of this plasmid are flanked by multiple restriction sites and can be individually replaced.
- the expression cassette contains a multiple cloning site for the insertion of heterologous genes.
- auxotrophic marker cassettes Two reusable auxotrophic marker cassettes were constructed based on the approach described by Lu et al, Appl. Microbiol Biotechnol. 49:141-146 (1998) and Alani et al, Genetics 116:541-545 (1987), using direct repeats from segments of the lacZ reading frame as recombination sites.
- As counterselectable auxotrophic markers a 2 kb DNA fragment containing the P. pastoris URA3 gene or a 1 kb fragment harboring the P. pastoris URA5 gene were used. Both marker cassettes were then inserted into a P. pastoris OCHl knockout plasmid. The P. pastoris £/2 45-containing plasmid was then modified further to generate a plasmid that includes the heterologous gene for the UDP-N-acetylglucosamine transporter of K. lactis.
- the first step in plasmid construction involved creating a set of universal plasmids containing D ⁇ A regions of the KEX1 gene of P. pastoris (Boehm et al, Yeast 15:563-572 (1999)) as space holders for the 5' and 3' regions of the genes to be knocked out.
- the plasmids also contained the S. cerevisiae URA3 gene, flanked by bacterial direct repeat sequences (Alani et al, Genetics 116: 541-545 (1987)) as a space holder for the auxotrophic markers and an expression cassette with a multiple cloning site for insertion of a foreign gene.
- a 0.9-kb fragment of the P. pastoris KEXl-5' region was amplified by PCR using primers Kex 55 __
- ATTGATTGAAATAGGGACAA (SEQ ID NO:39) with the plasmid pGAPZ-A (Invitrogen, Carlsbad, CA) as a template.
- the amplified segment was cloned into the BamHI, Sphl sites of pUC19 (New England Biolabs, Beverly, MA).
- the resulting plasmid was cut with Spel and Sphl.
- the CYC1 transcriptional terminator region was amplified using primers Cyc 5 fCCTTGCTAGCTTAATTAACCGCGGCACGTCCGACGGCGGCCCACGGGT CCCA) (SEQ ID NO:40) and Cyc 3
- pJN266 (GGACATGCATGCGGATCCCTTAAGAGCCGGCAGCTTGCAAATTAAAGC CTTCGAGCGTCCC) (SEQ ID NO:41) with plasmid pPICZ-A (Invitrogen, Carlsbad, CA) as a template.
- the amplified segment was cloned into the cut plasmid to create pJN261.
- the expression cassette was generated by digestion of this plasmid with BamHI. This fragment was cloned either into pJN263 (supra) to generate plasmid, pJN265, or into pJN264 (supra) to generate plasmids pJN266 and pJN267, depending on orientation of the insert.
- the map of pJN266 is shown in (Fig. 4).
- a knockout plasmid for the P. pastoris OCHl gene was created by digesting pJN263 with Sail and Spel. A 2.9 kb DNA fragment of the OCHl -5' region, amplified using the primers Och55
- the P. pastoris gene disruption cassettes for URA3 and URA5 were constructed using a strategy similar to that described in Lu et al, Appl. Microbiol. Biotechnol 49:141-146 (1998).
- a 2.0-kb Pstl, Spel fragment of the P. pastoris URA3 gene was inserted into the Pstl, Xbal sites of pUC19 (New England Biolabs, Beverly, MA) to create pJN306.
- a 0.7-kb Sad, PvuIIUNA fragment of the lacZ open reading frame from E. coli see, e.g., Kalnins et al, EMBOJ.
- Ura5Comp5 GCTCTAGAGGGACTTATCTGGGTCCAGACGATGTG
- Ura5Comp3 CGGGATCCGCCGCCGTGCCCAAAGCTCCGAAACAG
- pJN299 was digested with Pmel and Aflll and treated with T4 DNA polymerase. Following digestion of pJN315 (Fig. 4) with Sad and Sphl, and digestion of pJN396 (Fig. 5) with EcoRI and Sphl, each of the auxotrophic marker cassettes was blunt-ended with T4 DNA polymerase and ligated into the pJN299 backbone. This yielded plasmids pJN329 (URA3) and pJN398a (URAS), respectively.
- Plasmid pJN398 was further modified by digestion with Spel and Notl and blunt ended using T4 DNA polymerase.
- FIG. 6 A schematic of the disruption and marker recycling steps occurring in the stable integration of the UDP-GlcNAc Transporter into the OCH7 locus using the P. pastoris URA5- K. lactis UDP-GlcNAc Transporter cassette is shown in Fig. 6.
- Fig. 6 A schematic of the disruption and marker recycling steps occurring in the stable integration of the UDP-GlcNAc Transporter into the OCH7 locus using the P. pastoris URA5- K. lactis UDP-GlcNAc Transporter cassette is shown in Fig. 6.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Mycology (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04785511A EP1633855A4 (en) | 2003-05-16 | 2004-04-30 | Ura5 gene and methods for stable genetic integration in yeast |
CA002525954A CA2525954A1 (en) | 2003-05-16 | 2004-04-30 | Ura5 gene and methods for stable genetic integration in yeast |
JP2006532526A JP2007530007A (en) | 2003-05-16 | 2004-04-30 | URA5 gene and method for stable gene integration into yeast |
AU2004242082A AU2004242082B2 (en) | 2003-05-16 | 2004-04-30 | URA5 gene and methods for stable genetic integration in yeast |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US47143503P | 2003-05-16 | 2003-05-16 | |
US60/471,435 | 2003-05-16 | ||
US10/454,125 | 2003-06-03 | ||
US10/454,125 US7514253B2 (en) | 2003-05-16 | 2003-06-03 | URA5 gene and methods for stable genetic integration in yeast |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2004104165A2 true WO2004104165A2 (en) | 2004-12-02 |
WO2004104165A9 WO2004104165A9 (en) | 2006-03-02 |
WO2004104165A3 WO2004104165A3 (en) | 2007-11-22 |
Family
ID=33423332
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2004/013488 WO2004104165A2 (en) | 2003-05-16 | 2004-04-30 | Ura5 gene and methods for stable genetic integration in yeast |
Country Status (6)
Country | Link |
---|---|
US (5) | US7514253B2 (en) |
EP (2) | EP2489740A1 (en) |
JP (2) | JP2007530007A (en) |
AU (1) | AU2004242082B2 (en) |
CA (1) | CA2525954A1 (en) |
WO (1) | WO2004104165A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7326681B2 (en) | 2000-06-28 | 2008-02-05 | Glycofi, Inc. | Methods for producing modified glycoproteins |
JP2009537144A (en) * | 2006-05-19 | 2009-10-29 | グライコフィ, インコーポレイテッド | Recombinant vector |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7449308B2 (en) | 2000-06-28 | 2008-11-11 | Glycofi, Inc. | Combinatorial DNA library for producing modified N-glycans in lower eukaryotes |
US7863020B2 (en) | 2000-06-28 | 2011-01-04 | Glycofi, Inc. | Production of sialylated N-glycans in lower eukaryotes |
US7598055B2 (en) * | 2000-06-28 | 2009-10-06 | Glycofi, Inc. | N-acetylglucosaminyltransferase III expression in lower eukaryotes |
US8697394B2 (en) * | 2000-06-28 | 2014-04-15 | Glycofi, Inc. | Production of modified glycoproteins having multiple antennary structures |
US7332299B2 (en) | 2003-02-20 | 2008-02-19 | Glycofi, Inc. | Endomannosidases in the modification of glycoproteins in eukaryotes |
CN101903531B (en) * | 2007-12-19 | 2013-09-18 | 格利科菲公司 | Yeast strains for protein production |
CN101945998B (en) * | 2008-02-20 | 2013-09-18 | 格利科菲公司 | Vectors and yeast strains for protein production |
US8067339B2 (en) * | 2008-07-09 | 2011-11-29 | Merck Sharp & Dohme Corp. | Surface display of whole antibodies in eukaryotes |
JP2010017131A (en) * | 2008-07-10 | 2010-01-28 | Mitsui Eng & Shipbuild Co Ltd | Bacteria of genus moorella and primer |
MX2011001706A (en) | 2008-08-12 | 2011-03-24 | Glycofi Inc | Improved vectors and yeast strains for protein production: ca2+ atpase overexpression. |
EP2401378B1 (en) | 2009-02-25 | 2013-08-14 | Merck Sharp & Dohme Corp. | Metabolic engineering of a galactose assimilation pathway in the glycoengineered yeast pichia pastoris |
CN104031961A (en) | 2009-10-16 | 2014-09-10 | 默沙东公司 | Method for producing a mature human erythropoietin and composition comprising the mature human erythropoietin obtained from the method |
EP2539430B1 (en) | 2010-02-24 | 2016-09-14 | Merck Sharp & Dohme Corp. | Method for increasing n-glycosylation site occupancy on therapeutic glycoproteins produced in pichia pastoris |
JP5733570B2 (en) | 2011-05-23 | 2015-06-10 | ソニー株式会社 | Image processing apparatus, image processing method, program, and recording medium |
EP2925345B1 (en) | 2012-12-03 | 2018-09-05 | Merck Sharp & Dohme Corp. | Method for making o-glycosylated carboxy terminal portion (ctp) peptide-based insulin and insulin analogues |
US9944689B2 (en) | 2013-03-07 | 2018-04-17 | The General Hospital Corporation | Human CTLA4 mutants and use thereof |
JP6488666B2 (en) * | 2013-11-22 | 2019-03-27 | Agc株式会社 | Cloning vector, expression vector and method for producing transformant |
JP6499587B2 (en) | 2013-11-22 | 2019-04-10 | Jmtcエンザイム株式会社 | Transformant, method for producing the same, and method for producing lactic acid |
EP3205721B1 (en) | 2014-10-10 | 2019-08-28 | JMTC Enzyme Corporation | Transformant and method for producing same, and method for producing lactic acid |
EP3927731A4 (en) | 2019-02-19 | 2022-12-28 | The Regents of the University of Colorado, a body corporate | Bispecific immunotoxins targeting human cd25+ccr4+ tumors and regulatory t-cells |
CN111850017A (en) * | 2019-04-30 | 2020-10-30 | 广州华真医药科技有限公司 | URA3 gene-based expression vector and construction method thereof |
CN115838645B (en) * | 2022-09-15 | 2024-03-08 | 天津大学 | Yeast strain for high production of orotic acid and application thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1992017595A1 (en) * | 1991-04-01 | 1992-10-15 | The Salk Institute Biotechnology/Industrial Associates, Inc. | Genes which influence pichia proteolytic activity, and uses therefor |
DE19934408A1 (en) * | 1999-07-22 | 2001-01-25 | Consortium Elektrochem Ind | New fungal orotic acid phosphoribosyltransferase gene, useful as a selection marker in recombinant protein expression systems |
-
2003
- 2003-06-03 US US10/454,125 patent/US7514253B2/en not_active Expired - Lifetime
-
2004
- 2004-04-30 WO PCT/US2004/013488 patent/WO2004104165A2/en active Application Filing
- 2004-04-30 EP EP11195621A patent/EP2489740A1/en not_active Withdrawn
- 2004-04-30 CA CA002525954A patent/CA2525954A1/en not_active Abandoned
- 2004-04-30 JP JP2006532526A patent/JP2007530007A/en active Pending
- 2004-04-30 EP EP04785511A patent/EP1633855A4/en not_active Withdrawn
- 2004-04-30 AU AU2004242082A patent/AU2004242082B2/en not_active Ceased
-
2009
- 2009-01-22 US US12/321,512 patent/US8062879B2/en not_active Expired - Fee Related
-
2011
- 2011-03-16 JP JP2011058168A patent/JP2011115182A/en not_active Ceased
- 2011-10-07 US US13/268,061 patent/US8524479B2/en not_active Expired - Fee Related
-
2013
- 2013-08-01 US US13/956,747 patent/US20140051172A1/en not_active Abandoned
-
2014
- 2014-12-11 US US14/567,459 patent/US20150087013A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of EP1633855A4 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7326681B2 (en) | 2000-06-28 | 2008-02-05 | Glycofi, Inc. | Methods for producing modified glycoproteins |
JP2009537144A (en) * | 2006-05-19 | 2009-10-29 | グライコフィ, インコーポレイテッド | Recombinant vector |
Also Published As
Publication number | Publication date |
---|---|
US20150087013A1 (en) | 2015-03-26 |
US20040229306A1 (en) | 2004-11-18 |
JP2007530007A (en) | 2007-11-01 |
EP1633855A2 (en) | 2006-03-15 |
EP1633855A4 (en) | 2008-08-20 |
WO2004104165A3 (en) | 2007-11-22 |
US8062879B2 (en) | 2011-11-22 |
AU2004242082A1 (en) | 2004-12-02 |
US8524479B2 (en) | 2013-09-03 |
CA2525954A1 (en) | 2004-12-02 |
AU2004242082B2 (en) | 2008-12-11 |
US20140051172A1 (en) | 2014-02-20 |
US7514253B2 (en) | 2009-04-07 |
JP2011115182A (en) | 2011-06-16 |
US20090203105A1 (en) | 2009-08-13 |
US20120116058A1 (en) | 2012-05-10 |
WO2004104165A9 (en) | 2006-03-02 |
EP2489740A1 (en) | 2012-08-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8062879B2 (en) | URA5 gene and methods for stable genetic integration in yeast | |
EP1696864B1 (en) | Methods for eliminating mannosylphosphorylation of glycans in the production of glycoproteins | |
US7479389B2 (en) | ARG1, ARG2, ARG3, HIS1, HIS2, HIS5, HIS6 genes and methods for stable genetic integration | |
AU2005238308B8 (en) | Methods for reducing or eliminating alpha-mannosidase resistant glycans in the production of glycoproteins | |
EP2316963B1 (en) | Combinatorial DNA library for producing modified N-glycans in lower eukaryotes | |
CA2876864A1 (en) | Methods for eliminating mannosylphosphorylation of glycans in the production of glycoproteins | |
JP5752582B2 (en) | Methods for reducing or eliminating alpha-mannosidase resistant glycans in the production of glycoproteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
ENP | Entry into the national phase |
Ref document number: 2525954 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006532526 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004242082 Country of ref document: AU |
|
ENP | Entry into the national phase |
Ref document number: 2004242082 Country of ref document: AU Date of ref document: 20040430 Kind code of ref document: A |
|
REEP | Request for entry into the european phase |
Ref document number: 2004785511 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004785511 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2004242082 Country of ref document: AU |
|
COP | Corrected version of pamphlet |
Free format text: PAGES 1, 5, 6, 11, 21, 22, 32, 36, 39, 41, 42, 43, 46, DESCRIPTION, REPLACED BY CORRECT PAGES 1, 5, 6, 11, 21, 22, 32, 36, 39, 41, 42, 43, 46; PAGES 48, 51, 52, 53, CLAIMS, REPLACED BY CORRECT PAGES 48, 51, 52, 53 |
|
WWP | Wipo information: published in national office |
Ref document number: 2004785511 Country of ref document: EP |