EP0763117A1 - Synthetic leader peptide sequences - Google Patents

Synthetic leader peptide sequences

Info

Publication number
EP0763117A1
EP0763117A1 EP95922439A EP95922439A EP0763117A1 EP 0763117 A1 EP0763117 A1 EP 0763117A1 EP 95922439 A EP95922439 A EP 95922439A EP 95922439 A EP95922439 A EP 95922439A EP 0763117 A1 EP0763117 A1 EP 0763117A1
Authority
EP
European Patent Office
Prior art keywords
seq
ser
sequence
leu
thr
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP95922439A
Other languages
German (de)
French (fr)
Other versions
EP0763117B1 (en
Inventor
Thomas Borglum Kjeldsen
Knud Vad
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novo Nordisk AS
Original Assignee
Novo Nordisk AS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novo Nordisk AS filed Critical Novo Nordisk AS
Publication of EP0763117A1 publication Critical patent/EP0763117A1/en
Application granted granted Critical
Publication of EP0763117B1 publication Critical patent/EP0763117B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/62Insulins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • C12N15/625DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/036Fusion polypeptide containing a localisation/targetting motif targeting to the medium outside of the cell, e.g. type III secretion
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/70Fusion polypeptide containing domain for protein-protein interaction
    • C07K2319/74Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
    • C07K2319/75Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor containing a fusion for activation of a cell surface receptor, e.g. thrombopoeitin, NPY and other peptide hormones

Definitions

  • the present invention relates to synthetic leader peptide sequences for secreting polypeptides in yeast.
  • Yeast organisms produce a number of proteins which are synthesized intracellularly, but which have a function outside the cell. Such extracellular proteins are referred to as secreted proteins. These secreted proteins are expressed initially inside the cell in a precursor or a pre-protein form containing a presequence ensuring effective direction of the expressed product across the membrane of the endoplasmic reticulum (ER) .
  • the presequence normally named a signal peptide, is generally cleaved off from the desired product during translocation. Once entered in the secretory pathway, the protein is transported to the Golgi apparatus.
  • the protein can follow different routes that lead to compartments such as the cell vacuole or the cell membrane, or it can be routed out of the cell to be secreted to the external medium (Pfeffer, S.R. and Rothman, J.E. Ann.Rev.Biochem. 56 (1987) 829-852).
  • European published patent application No. 88 632 describes a process by which proteins heterologous to yeast are expressed, processed and secreted by transforming a yeast organism with an expression vehicle harbouring DNA encoding the desired protein and a signal peptide, preparing a culture of the transformed organism, growing the culture and recovering the protein from the culture medium.
  • the signal peptide may be the signal peptide of the desired protein itself, a heterologous signal peptide or a hybrid of native and heterologous signal peptide.
  • a problem encountered with the use of signal peptides hetero ⁇ logous to yeast might be that the heterologous signal peptide does not ensure efficient translocation and/ ⁇ r cleavage after 5 the signal peptide.
  • the Saccharomyces cerevisiae MF ⁇ l ( ⁇ -factor) is synthesized as a prepro form of 165 amino acids comprising signal- or prepeptide of 19 amino acids followed by a "leader” or propeptide of 64 amino acids, encompassing three N-linked 10 glycosylation sites followed by (LysArg( (Asp/Glu)Ala) 2-3 ⁇ ⁇ factor) 4 (Kurjan, J. and Herskowitz, I. Cell 30 (1982) 933- 943) .
  • the signal-leader part of the preproMF ⁇ l has been widely employed to obtain synthesis and secretion of heterologous proteins in S_ cerevisiae.
  • EP 206783 discloses a system for the secretion of polypeptides from S_ cerevisiae using an ⁇ -factor leader sequence which has been truncated to eliminate the four ⁇ -factor units present on the native leader sequence so as to leave the leader peptide itself fused to a heterologous poiypeptide via the ⁇ -factor processing site LysArgGluAlaGluAla. This construction is indicated to lead to an efficient processing of smaller peptides (less than 50 amino acids) .
  • the native ⁇ -factor leader sequence has been truncated to leave one or two of the ⁇ -factor units between the leader peptide and the poiypeptide.
  • a number of secreted proteins are routed so as to be exposed to a proteolytic processing system which can cleave the peptide bond at the carboxy end of two consecutive basic amino acids.
  • This enzymatic activity is in S_ cerevisiae encoded by the KEX 2 gene (Julius, D.A. et al., Cell 37 (1984b) 1075). Processing of the product by the KEX 2 protease is needed for the secretion of active S_:_ cerevisiae mating factor ⁇ l (MF ⁇ l or ⁇ - factor) whereas KEX 2 is not involved in the secretion of ac- tive S_ cerevisiae mating factor a.
  • the present invention relates to a DNA expression cassette comprising the following sequence:
  • SP is a DNA sequence encoding a signal peptide
  • LS is a DNA sequence encoding a leader peptide with the general formula I:
  • X 1 is a peptide bond or a codable amino acid
  • X 2 is a peptide bond, a codable amino acid or a sequence of up to 4 codable amino acids which may be the same or different;
  • Z is a codable amino acid except Pro;
  • X 3 is a sequence of from 4 to 30 codable amino acids which may be the same or different;
  • PS is a E)NA sequence encoding a processing site; *gene* is a DNA sequence encoding a poiypeptide; T is a terminator sequence; and i is 0 or 1.
  • leader peptide is understood to indicate a peptide whose function is to allow the expressed poiypeptide to be directed from the endoplasmic reticulum to the Golgi apparatus and further to a secretory ve ⁇ sicle for secretion into the medium, (i.e. exportation of the expressed poiypeptide across the cell wall or at least through the cellular membrane into the periplasmic space of the cell) .
  • synthetic used in connection with leader peptides is intended to indicate that the leader peptide is one not found in nature.
  • signal peptide is understood to mean a presequence which is predominantly hydrophobic in nature and present as an N-terminal sequence of the precursor form of an extracellular protein expressed in yeast.
  • the function of the signal peptide is to allow the expressed protein to be secreted to enter the endoplasmic reticulum.
  • the signal peptide is normally cleaved off in the course of this process.
  • the signal peptide may be heterologous or homologous to the yeast organism producing the protein.
  • poiypeptide is intended to indicate a heterologous poiypeptide, i.e. a poiypeptide which is not pro ⁇ pokerd by the host yeast organism in nature as well as a homologous poiypeptide, i.e. a poiypeptide which is produced by the host yeast organism in nature and any preform thereof.
  • the expression cassette of the present invention encodes a heterologous poiypeptide.
  • a codable amino acid is intended to indicate an amino acid which can be coded for by a triplet ("codon") of nucleotides.
  • the present invention relates to a process for producing a poiypeptide in yeast, the process comprising culturing a yeast cell, which is capable of expressing a poiypeptide and which is transformed with a yeast expression vector as described above including a leader peptide sequence of the invention, in a suitable medium to obtain expression and secretion of the poiypeptide, after which the poiypeptide is recovered from the medium.
  • Fig. 1 schematically shows the plasmid pAK492
  • Fig. 2 shows part of the DNA sequence encoding the signal peptide/leader/MI3 insulin precursor
  • Fig. 3 shows the construction of the plasmid pAK546
  • Fig. 4 shows the amino acid sequence of the leader SEQ ID No. 4 and the DNA sequence encoding it
  • Fig. 5 shows the DNA sequence of S. cerevisiae expression plasmid pAK546 encoding the YAP3 signal peptide, leader SEQ ID No. 4 and the MI3 insulin precursor and the encoded amino acid sequence;
  • Fig. 6 shows the amino acid sequence of the leader SEQ ID No. 6 and the' DNA sequence encoding it
  • Fig. 7 shows the amino acid sequence of the leader SEQ ID No. 8 and the DNA sequence encoding it
  • Fig. 8 shows the amino acid sequence of the leader SEQ ID No. 17 and the DNA sequence encoding it
  • Fig. 9 shows the amino acid sequence of the leader SEQ ID No. 16 and the DNA sequence encoding it
  • Fig. 10 shows the amino acid sequence of the leader SEQ ID No. 19 and the DNA sequence encoding it
  • Fig. 11 shows the amino acid sequence of the leader SEQ ID No. 20 and the DNA sequence encoding it
  • Fig. 12 shows the amino acid sequence of the leader SEQ ID No. 21 and the DNA sequence encoding it;
  • Fig. 13 shows the DNA fragment of pAK527 used as the direct template in the construction of SEQ ID Nos. 4 and 6;
  • Fig. 14 shows the DNA fragment of pAK531 used as the direct template in the construction of SEQ ID No. 8;
  • Fig. 15 shows the DNA fragment of pAK555 used as the direct template in the construction of SEQ ID Nos. 16 and 17;
  • Fig. 16 shows the DNA fragment of pAK559 used as the direct template in the construction of SEQ ID Nos. 19 and 20;
  • Fig. 17 shows the DNA fragment of pAK562 used as the direct template in the construction of SEQ ID No. 21;
  • Fig. 18 shows the amino acid sequence of the leader SEQ ID No. 27 and the DNA sequence SEQ ID No. 66 encoding it;
  • Fig. 19 shows the amino acid sequence SEQ ID No. 70 of an N- terminally extended MI3 insulin precursor and the DNA sequence SEQ ID No. 71 encoding it;
  • Fig. 20 shows the amino acid sequence of the leader SEQ ID No. 67 and the DNA sequence SEQ ID No. 69 encoding it;
  • Fig. 21 shows the DNA fragment SEQ ID No. 72 of pAK614 used as the direct template in the construction of SEQ ID No. 27;
  • Fig. 22 shows the DNA fragment SEQ ID No. 73 of pAK625 used as the direct template in the construction of SEQ ID No. 67. DETAILED DISCLOSURE OF THE INVENTION
  • X 1 in general formula I designates an amino acid
  • it is preferably Ser, Thr or Ala
  • X 2 in general formula I designates one amino acid
  • it is preferably Ser, Thr or Ala
  • X 2 in general formula I designates a sequence of two amino acids
  • it is preferably Serlle
  • X 2 in general formula I designates a sequence of three amino acids
  • it is preferably SerAlalle
  • X 2 in general formula I designates a sequence of four amino acids it is preferably SerPheAlaThr.
  • X 3 is an amino acid sequence of the general formula II
  • X 4 is a sequence of from 1 to 21 codable amino acids which may be the same or different
  • X 5 is Pro or one of the amino acid sequences ValAsnLeu or LeuAlaAsnValAlaMetAla
  • X 6 is a sequence of from 1 to 8 codable amino acids which may be the same or different.
  • X4 is preferably an amino acid sequence which includes one or more of the motifs LeuValAsnLeu, SerValAsnLeu, MetAlaAsp, ThrGluSer, ArgPheAlaThr or ValAlaMetAla; or X 4 is an amino acid sequence which includes the sequence AsnSerThr or AsnThrThr; or X 4 is an amino acid sequence which includes the sequence (Ser/Leu)ValAsnLeu, (Ser/Leu)ValAsnLeuMetAlaAsp,
  • X 4 is an amino acid sequence which includes the sequence Asn(Thr/Ser)ThrLeu, Asn(Thr/Ser)ThrLeuAsnLeu or Asn(Thr/Ser)ThrLeuValAsnLeu; or any combination thereof.
  • X 5 is preferably Pro or an amino acid sequence which includes the sequence ValAsnLeu, LeuAlaAsnValAlaMetAla, LeuAspValValAsnLeuProGly or LeuAspValValAsnLeuIleSerMet.
  • X 6 ' in general formula II designates one amino acid, it is preferably Ala, Gly, Leu, Thr, Val or Ser.
  • X 6 ' in general formula II designates a sequence of two amino acids, it is preferably GlyAla or SerAla.
  • X 6 ' in general formula II designates a sequence of three amino acids, it is preferably AlaValAla.
  • X 6 , in general formula II designates a sequence of eight amino acids, it is preferably GlyAlaAspSerLysThrValGlu.
  • Examples of preferred leader peptides coded for by the DNA sequence LS are:
  • SEQ ID No. 27 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAspVal- ValAsnLeuIleSerMetAla;
  • leader peptides coded for by the DNA sequence LS are: SEQ ID No. 15 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AspLeuAlaValGlyLeuProGlyAla;
  • SEQ ID No. 16 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeuProGly- Ala;
  • SEQ ID No. 24 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAsp- ValValAsnLeuProGlyAla;
  • the signal sequence may encode any signal peptide which ensures an effective direction of the expressed poiypeptide into the secretory pathway of the cell.
  • the signal peptide may be a naturally occurring signal peptide or functional parts thereof or it may be a synthetic peptide. Suitable signal peptides have been found to be the ⁇ -factor signal peptide, the signal peptide of mouse salivary amylase, a modified carboxypeptidase signal peptide, the yeast BAR1 signal peptide or the Humicola lanu ⁇ inosa lipase signal peptide or a derivative thereof.
  • the mouse salivary amylase signal sequence is described by Hagenb ⁇ chle, 0. et al., Nature 289 (1981) 643- 646.
  • the carboxypeptidase signal sequence is described by Vails, L.A. et al., Cell 48 (1987) 887-897.
  • the BAR1 signal peptide is disclosed in WO 87/02670.
  • the yeast aspartic protease 3 signal peptide is described in Danish patent application No. 0828/93.
  • the yeast processing site encoded by the DNA sequence PS may suitably be any paired combination of Lys and Arg, such as LysArg, ArgLys, ArgArg or LysLys which permits processing of the poiypeptide by the KEX2 protease of Saccharomyces cerevisiae or the equivalent protease in other yeast species (Julius, D.A. et al., Cell 37 (1984) 1075). If KEX2 processing is not convenient, e.g. if it would lead to cleavage of the poiypeptide product, e.g.
  • a processing site for another protease may be selected comprising an amino acid combination which is not found in the poiypeptide product, e.g. the processing site for FX a , IleGluGlyArg (cf. Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York, 1989) .
  • the protein produced by the method of the invention may be any protein which may advantageously be produced in yeast.
  • proteins are heterologous proteins such as aprotinin, tissue factor pathway inhibitor or other protease inhibitors, insulin or insulin precursors, human or bovine growth hormone, interleukin, glucagon, GLP-1, IGF-I, IGF-II, tissue plasminogen activator, transforming growth factor ⁇ or ⁇ , platelet-derived growth factor, enzymes or a functional analogue thereof.
  • the term "functional analogue” is meant to indicate a protein with a similar function as the native protein (this is intended to be understood as relating to the nature, rather than the level of biological activity of the native protein) .
  • the protein may be structurally similar to the native protein and may be derived from the native protein by addition of one or more amino acids to either or both the C- and N-terminal end of the native protein, substitution of one or more amino acids at one or a number of different sites in the native amino acid sequence, deletion of one or more amino acids at either or both ends of the native protein or at one or several sites in the amino acid sequence, or insertion of one or more amino acids at one or more sites in the native amino acid sequence.
  • modifications are well known for several of the proteins mentioned above.
  • precursors or intermediates for other proteins may be produced by the method of the invention.
  • MI3 insulin precursor which comprises the amino acid sequence B(l- 29)AlaAlaLysA(l-21) wherein A(l-21) is the A chain of human insulin and B(l-29) is the B chain of human insulin in which Thr(B30) is missing.
  • Preferred DNA constructs encoding leader sequences are as shown in Figs. 4 - 12 or suitable modifications thereof.
  • suitable modifications of the DNA sequence are nucleotide substitutions which do not give rise to another amino acid sequence of the protein, but which may correspond to the codon usage of the yeast organism into which the DNA construct is inserted or nucleotide substitutions which do give rise to a different amino acid sequence and therefore, possibly, a different protein structure.
  • Other examples of possible modifications are insertion of one or more codons into the sequence, addition of one or more codons at either end of the sequence and deletion of one or more codons at either end of or within the sequence.
  • P, SP, LS, *gene*, T and i may be any vector which is capable of replicating in yeast organisms.
  • the promoter may be any DNA sequence which shows transcriptional activity in yeast and may be derived from genes encoding proteins either homologous or heterologous to yeast.
  • the promoter is preferably derived from a gene encoding a protein homologous to yeast. Examples of suitable promoters are the Saccharo yces cerevisiae MF ⁇ l, TPI, ADH or PGK promoters.
  • TPI terminator cf. Alber, T. and Kawasaki, G., J. Mol. APPI. Genet. 1 (1982) 419-434.
  • the recombinant expression vector of the invention further comprises a DNA sequence enabling the vector to replicate in yeast.
  • yeast sequences are the yeast plasmid 2 ⁇ replication genes REP 1-3 and origin of replication.
  • the vector may also comprise a selectable marker, e.g. the Schizo- saccharomvces pombe TPI gene as described by Russell, P.R., Gene 40 (1985) 125-130.
  • the vector may be constructed either by first preparing a DNA construct containing the entire sequence 5'-P-SP-LS-PS- *gene*-(T) ⁇ -3' and subsequently inserting this fragment into a suitable expression vector, or by sequentially inserting DNA fragments into a suitable vector containing genetic information for the individual elements (such as the promoter sequence, the signal peptide , the leader sequence GlnProIle(Asp/Glu) (Asp/GluJX 1 (Glu/Asp)X 2 AsnZ(Thr/Ser)X 3 , the processing site, the poiypeptide, and, if present, the terminator sequence) followed by ligation.
  • genetic information for the individual elements such as the promoter sequence, the signal peptide , the leader sequence GlnProIle(Asp/Glu) (Asp/GluJX 1 (Glu/Asp)X 2 AsnZ(Thr/Ser)X 3 , the processing site, the poiypeptide, and, if present, the
  • the yeast organism used in the method of the invention may be any suitable yeast organism which, on cultivation, produces large amounts of the desired poiypeptide.
  • suitable yeast organisms may be strains of the yeast species
  • the transformation of the yeast cells may for instance be effected by protoplast formation followed by transformation in a manner known per se.
  • the medium used to cultivate the cells may be any conventional medium suitable for growing yeast organisms.
  • the secreted poiypeptide a significant proportion of which will be present in the medium in correctly processed form, may be recovered from the medium by conventional procedures including separating the yeast cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. ammonium sulphate, followed by purification by a variety of chromatographic procedures, e.g. ion exchange chromatography, affinity chromatography or the like.
  • a salt e.g. ammonium sulphate
  • All expression plasmids are of the C-POT type. Such plasmids are described in EP patent application No. 171 142 and are characterized in containing the Schizosaccharo yces pombe triose phosphate isomerase gene (POT) for the purpose of plasmid selection and stabilization.
  • POT Schizosaccharo yces pombe triose phosphate isomerase gene
  • a plasmid containing the POT-gene is available from a deposited E. coli strain (ATCC 39685) .
  • the plasmids furthermore contain the S. cerevisiae triose phosphate isomerase promoter and terminator (P TPI and T TPI ) . They are identical to pMT742 (Egel-Mitani, M. et al.. Gene 73 (1988) 113-120) (see Fig. 1) except for the region defined by the EcoR I-Xba I restriction sites encompassing the coding region for signal/
  • the plasmids pAK527, pAK531, pAK555, pAK559, pAK562, pAK614 and pAK625 were used as DNA templates in the PCR reactions applied in the construction of the leaders described in the examples.
  • the synthetic DNA fragments serving as the direct template are shown in Figs. 13 - 17. With the exception of the shown DNA regions the plasmids are identical to pAK492 shown in Fig. 1.
  • Synthetic DNA fragments were synthesized on an automatic DNA synthesizer (Applied Biosystems model 380A) using phosphoramidite chemistry and commercially available reagents (Beaucage, S.L. and Caruthers, M.H., Tetrahedron Letters 22 (1981) 1859-1869). All other methods and materials used are common state of the art knowledge (see, e.g. Sambrook, J., Fritsch, E.F. and Maniatis, T. , Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York, 1989) .
  • the leader SEQ ID No. 4 has the following amino acid sequence: GlnProIleAspAspGluAsnThrThrSerValAsnLeuProVal
  • PCR polymerase chain reactions
  • the purified PCR DNA fragment was dissolved in 10 ⁇ l of water 25 and restriction endonuclease buffer and cut with the restriction endonucleases Asp 718 and Hind III in a total volume of 15 ⁇ l according to standard techniques (Sambrook, J. , Fritsch, E.F. and Maniatis, T. , op.cit. ) .
  • the 167 bp Asp 718/Hind III DNA fragment was subjected to electrophoresis on 30 agarose gel and purified using The Gene Clean Kit as described.
  • the S__. cerevisiae expression plasmid pAK492 shown in Fig.
  • plasmid pAK492 is a derivative of the previously described plasmid pMT742 in which the fragment encoding the signal/leader/insulin precursor has been replaced by the EcoR I-Xba I fragment shown in Fig. 2. This fragment has been synthesized on an Applied Biosystems DNA synthesizer in accordance with the manufacturer's instructions.
  • the plasmid pAK492 was cut with the restriction endonucleases Asp 718 and Xba I and the vector fragment of 10986 bp was isolated.
  • the plasmid pAK492 was cut with the restriction endonucleases Hind III and Xba I and the DNA fragment of 140 bp encoding part of the MI3 insulin precursor was isolated.
  • the three DNA fragments were ligated together using T4 DNA ligase under standard conditions (Sambrook, J., Fritsch, E.F. and Maniatis, T., op.cit. ) .
  • the ligation mixture was then transformed into a competent E. coli strain (R-, M+) and transformants were identified by ampicillin resistance. Plasmids were isolated from the resulting E. coli colonies using standard DNA iniprep technique (Sambrook, J., Fritsch, E.F. and Maniatis, T. , op.cit. ) . checked with appropriate restrictions endonucleases i.e. EcoR I, Xba I, Nco I and Hind III.
  • the selected plasmid, pAK546, was shown by DNA sequencing analysis (Sequenase, U.S. Biochemical Corp.) using the primer # 94 to contain a DNA sequence encoding the leader SEQ ID No. 4. For the DNA sequence encoding the leader SEQ ID No. 4, see Fig. 4) .
  • the plasmid pAK546 was transformed into S_ cerevisiae strain MT663 as described in European published patent application No. 214 826 and the resulting strain was named yAK546.
  • the DNA sequence of the protein coding region of the expression plasmid is given in Fig. 5.
  • the leader SEQ ID No. 6 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuProAla
  • the following oligonucleotide was synthesised:
  • the polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 331 was used insted of oligonucleotide # 333.
  • the Asp 718/Hind III DNA fragment of 168 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK531 was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 6.
  • the plasmid pAK531 was transformed into S_ cerevisiae strain MT663 as described in European patent application 86306721.1 and the resulting strain was named yAK531.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • the leader SEQ ID No. 8 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuProGlyAla
  • the Asp 718/Hind III DNA fragment of 171 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK547, was shown by DNA sequencing
  • the leader SEQ ID No. 17 has the following amino acid sequence: GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeuProGlyAla
  • the polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 376 was used insted of oligonucleotide # 333 and plasmid pAK555 (Fig. 30 15) was used as template.
  • the Asp 718/Hind III DNA fragment of 180 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK561 was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 17.
  • Fig. 8 For the DNA sequence encoding the leader SEQ ID No. 17, see Fig. 8.
  • the plasmid pAK561 was transformed into S_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK561.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • EXAMPLE 5 Synthesis of the leader SEQ ID No. 16 for expression of the MI3 insulin precursor in S_ ⁇ . cerevisiae (strain yAK559) .
  • the leader SEQ ID No. 16 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerlleAsnThrThrLeuValAsnLeuProGlyAla
  • the polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 375 was used insted of oligonucleotide # 333 and plasmid pAK555 (Fig. 15) was used as template.
  • the Asp 718/Hind III DNA fragment of 222 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK559 was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 16.
  • the plasmid pAK559 was transformed into ⁇ _ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK559.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • the leader SEQ ID No. 19 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerArgPheAlaThrAsnThrThrLeuValAsnLeuProLeu
  • the polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 384 was used insted of oligonucleotide # 333 and plasmid pAK559 (Fig. 16) was used as template.
  • the Asp 718/Hind III DNA fragment of 228 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S__. cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK580 was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 19.
  • the plasmid pAK580 was transformed into S_j_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK580.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • the leader SEQ ID No. 20 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerI1eAsnThrThrLeuVa1AsnLeuAlaAsnVa1AlaMetAla
  • the polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 390 was used insted of oligonucleotide # 333 and plasmid pAK559 (Fig. 16) was used as template.
  • the Asp 718/Hind III DNA fragment of 231 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK583, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 20.
  • the plasmid pAK583 was transformed into S_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK583.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • the leader SEQ ID No. 21 has the following amino acid sequence: GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuValAsnLeuProGlyAla
  • the polymerase chain reaction was performed as described in Example 1 with the exception that oligonucleotide # 401 was used insted of oligonucleotide # 333 and plasmid pAK562 (Fig. 17) was used as template.
  • the Asp 718/Hind III DNA fragment of 183 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1.
  • the Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1.
  • the selected plasmid, pAK586, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 21, see Fig. 12.
  • the plasmid pAK586 was transformed into ⁇ _. cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK586.
  • the DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
  • YPD medium Ferman, F. et al.. Methods in Yeast Genetics. Cold Spring Harbor Laboratory Press, 1981
  • Table 1 the expression levels of the insulin precursor, MI3, obtained by use of selected leader sequences according to the present invention, are given as a percentage of the level obtained with transformants of pMT742, utilizing the MF ⁇ (l) leader of S_j_ cerevisiae.
  • the leader SEQ ID No. 27 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerArgPheAlaThrAsnThrThrLeuAspValValAsnLeuIleSerMetAla
  • oligonucleotide # 440 was used instead of oligonucleotide # 333 and plasmid pAK614 was used as template.
  • oligonucleotide # 441 was used instead of oligonucleotide #
  • the purified PCR DNA fragment was isolated and digested with the restriction endonucleases Asp 718 and Hind III as described in Example 1.
  • the Asp 718/Hind III DNA fragment of 268 bp was subjected to electrophoresis on agarose gel and
  • pAK616 30 plasmid, pAK616, was shown by DNA sequencing analysis, as described in Example 1, to contain the DNA sequence encoding the leader SEQ ID No. 27.
  • SEQ ID No. 66 encoding the leader SEQ ID No. 27, see Fig. 18.
  • the Asp 718/Hind III DNA fragment of 268 bp from pAK616 was isolated and ligated with the 10986 bp Asp 718/Xba I DNA fragment from pAK601 and the 140 bp DNA fragment Hind Ill/Xba I from pAK464 (encoding an extended version of Asp B28 human insulin) and named pAK 625.
  • the 180 bp Asp 718/Nco I DNA fragment from pAK625 was isolated and ligated with the 221 bp Nco I/Xba I DNA fragment from pJB146 (encoding and extended version of the insulin precursor) and the 10824 bp Asp 718/Xba I DNA fragment from pAK601 and the resulting plasmid was named pAK677.
  • the plasmid pAK677 was transformed into S_j_ cerevisiae strain MT663 as described in European patent application 86306721.1 and the resulting strain was named yAK677.
  • the DNA sequence encoding the signal peptide is as described in Fig. 5.
  • the DNA sequence coding for the extended MI3 insulin precursor is as described in Fig. 19.
  • the leader SEQ ID No. 67 has the following amino acid sequence:
  • the Asp 718/Nco I DNA fragment of 190 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1 expect that the 10824 bp Asp 718/Xba I vector DNA fragment was isolated from and from pAK601.
  • the 190 bp Asp 718/Nco I DNA fragment was subcloned into the S__ . cerevisiae expression plasmid as described in Example 1, expect that the 221 bp DNA fragment Nco I/Xba I (encoding an extended version of the MI3 insulin precursor) was isolated from pAK677 and used instead of the Hind Ill/Xba I DNA fragment.
  • the selected plasmid was shown by DNA sequencing analysis as described in Example 1 to contain the DNA sequence encoding the leader SEQ ID No. 67 and named pAK680.
  • SEQ ID No. 69 encoding the leader SEQ ID No. 67, see Fig. 20.
  • the plasmid pAK680 was transformed into S__ . cerevisiae strain
  • MT663 as ' described in European patent application 86306721.1 and the resulting strain was named yAK680.
  • the DNA sequence encoding the leader is as described in Fig. 5 and the extended insulin precursor MI3 DNA sequence is as described in Fig. 19.
  • YPD medium Ferman, F. et al., Methods in Yeast Genetics. Cold Spring Harbor Laboratory Press, 1981
  • MOLECULE TYPE peptide
  • xi SEQUENCE DESCRIPTION: SEQ ID N0:23:
  • MOLECULE TYPE cDNA
  • xi SEQUENCE DESCRIPTION: SEQ ID N0:29: TAAATCTATA ACTACAAAAA ACACATA 27
  • MOLECULE TYPE cDNA
  • cDNA SEQUENCE DESCRIPTION: SEQ ID N0:30: GACTCTCTTA ACTGGCAAGT TGACA 25
  • MOLECULE TYPE cDNA
  • cDNA SEQUENCE DESCRIPTION: SEQ ID N0:31: AAGTACAAAG CTTCAACCAA GTGAGAACCA CACAAGTGTT GGTTAACGAA TCTCTT 56
  • MOLECULE TYPE cDNA
  • cDNA SEQUENCE DESCRIPTION: SEQ ID N0:33: GAATCTCTTA GCTGGCAAGT TGACAGAAGT AGTGTTAGTT TCAGAGTCGT CAATT 55
  • MOLECULE TYPE cDNA
  • Xi SEQUENCE DESCRIPTION: SEQ ID N0:34: AACGAATCTC TTAGCACCTG GCAAGTTGAC AGAAGT 36
  • MOLECULE TYPE cDNA
  • MOLECULE TYPE cDNA
  • cDNA SEQUENCE DESCRIPTION: SEQ ID N0:38: AACGAATCTC TTAGCCATGG CAACGTTAGC CAAGTTAACC AAAGT 45
  • MOLECULE TYPE cDNA
  • cDNA SEQUENCE DESCRIPTION: SEQ ID N0:39: AACGAATCTC TTAGCACCTG GCAAGTTGAC CAAAGTAGTG TTGATAGCAG ATTCAGTGTC 60 G 61
  • CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser 20 25 30
  • CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser 20 25 30
  • CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT ATC AAC ACT ACT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser He Asn Thr Thr 20 25 30
  • MOLECULE TYPE cDNA
  • Xi SEQUENCE DESCRIPTION: SEQ ID N0:65: CAAGTACAAA GCTTCAACCA AGTGGGAACC GCACAAGTGT TGGTTAACGA ACTT 54
  • MOLECULE TYPE cDNA

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Mycology (AREA)
  • Toxicology (AREA)
  • Endocrinology (AREA)
  • Diabetes (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Peptides Or Proteins (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines Containing Material From Animals Or Micro-Organisms (AREA)

Abstract

The present invention relates to a DNA expression cassette comprising the following sequence: 5'-P-SP-LS-PS-*gene*-(T)i-3'wherein P is a promoter sequence, SP is a DNA sequence encoding a signal peptide, LS is a DNA sequence encoding a leader peptide of formula I: GinProIle(Asp/Glu)(Asp/Glu)X1(Glu/Asp)X2AsnZ(Thr/Ser)X3 (SEQ ID NO: 77) (I) wherein X1 is a peptide bond or a codable amino acid; X2 is a peptide bond, a codable amino acid or a sequence of up to 4 codable amino acids which may be the same or different; Z is a codable amino acid except Pro; and X3 is a sequence of from 4 to 30 codable amino acids which may be the same or different; PS is a DNA sequence encoding a processing site; *gene* is a DNA sequence encoding a polypeptide; T is a terminator sequence; and i is 0 or 1.

Description

SYNTHETIC LEADER PEPTIDE SEQUENCES
FIELD OP INVENTION
The present invention relates to synthetic leader peptide sequences for secreting polypeptides in yeast.
BACKGROUND OF THE INVENTION
Yeast organisms produce a number of proteins which are synthesized intracellularly, but which have a function outside the cell. Such extracellular proteins are referred to as secreted proteins. These secreted proteins are expressed initially inside the cell in a precursor or a pre-protein form containing a presequence ensuring effective direction of the expressed product across the membrane of the endoplasmic reticulum (ER) . The presequence, normally named a signal peptide, is generally cleaved off from the desired product during translocation. Once entered in the secretory pathway, the protein is transported to the Golgi apparatus. From the Golgi the protein can follow different routes that lead to compartments such as the cell vacuole or the cell membrane, or it can be routed out of the cell to be secreted to the external medium (Pfeffer, S.R. and Rothman, J.E. Ann.Rev.Biochem. 56 (1987) 829-852).
Several approaches have been suggested for the expression and secretion in yeast of proteins heterologous to yeast. European published patent application No. 88 632 describes a process by which proteins heterologous to yeast are expressed, processed and secreted by transforming a yeast organism with an expression vehicle harbouring DNA encoding the desired protein and a signal peptide, preparing a culture of the transformed organism, growing the culture and recovering the protein from the culture medium. The signal peptide may be the signal peptide of the desired protein itself, a heterologous signal peptide or a hybrid of native and heterologous signal peptide.
A problem encountered with the use of signal peptides hetero¬ logous to yeast might be that the heterologous signal peptide does not ensure efficient translocation and/όr cleavage after 5 the signal peptide.
The Saccharomyces cerevisiae MFαl (α-factor) is synthesized as a prepro form of 165 amino acids comprising signal- or prepeptide of 19 amino acids followed by a "leader" or propeptide of 64 amino acids, encompassing three N-linked 10 glycosylation sites followed by (LysArg( (Asp/Glu)Ala) 2-3α~ factor)4 (Kurjan, J. and Herskowitz, I. Cell 30 (1982) 933- 943) . The signal-leader part of the preproMFαl has been widely employed to obtain synthesis and secretion of heterologous proteins in S_ cerevisiae.
15 Use of signal/leader peptides homologous to yeast is known from i.a. US patent specification No. 4,546,082, European published patent applications Nos. 116 201, 123 294, 123 544, 163 529 and 123 289 and DK patent application No. 3614/83.
In EP 123 289 utilization of the S_j_ cerevisiae a-factor pre- 20 cursor is described whereas WO 84/01153 indicates utilization of the S_ cerevisiae invertase signal peptide and DK 3614/83 utilization of the £ _ cerevisiae PH05 signal peptide for secretion of foreign proteins.
US patent specification No. 4,546,082, EP 16 201, 123 294, 123 25544 and 163 529 describe processes by which the α-factor signal-leader from S_ cerevisiae (MFαl or MFα2) is utilized in the secretion process of expressed heterologous proteins in yeast. By fusing a DNA sequence encoding the £_;_ cerevisiae MFαl signal/leader sequence at the 5• end of the gene for the 30 desired protein secretion and processing of the desired protein was demonstrated. EP 206783 discloses a system for the secretion of polypeptides from S_ cerevisiae using an α-factor leader sequence which has been truncated to eliminate the four α-factor units present on the native leader sequence so as to leave the leader peptide itself fused to a heterologous poiypeptide via the α-factor processing site LysArgGluAlaGluAla. This construction is indicated to lead to an efficient processing of smaller peptides (less than 50 amino acids) . For the secretion and processing of larger polypeptides, the native α-factor leader sequence has been truncated to leave one or two of the α-factor units between the leader peptide and the poiypeptide.
A number of secreted proteins are routed so as to be exposed to a proteolytic processing system which can cleave the peptide bond at the carboxy end of two consecutive basic amino acids. This enzymatic activity is in S_ cerevisiae encoded by the KEX 2 gene (Julius, D.A. et al., Cell 37 (1984b) 1075). Processing of the product by the KEX 2 protease is needed for the secretion of active S_:_ cerevisiae mating factor αl (MFαl or α- factor) whereas KEX 2 is not involved in the secretion of ac- tive S_ cerevisiae mating factor a.
Secretion and correct processing of a poiypeptide intended to be secreted is obtained in some cases when culturing a yeast organism which is transformed with a vector constructed as indicated in the references given above. In many cases, how- ever, the level of secretion is very low or there is no se¬ cretion, or the proteolytic processing may be incorrect or incomplete. It is therefore the object of the present invention to provide leader peptides which ensure a more efficient expression and/or processing of polypeptides.
SUMMARY OF THE INVENTION
Surprisingly, a new type of leader peptide has been found which allows secretion in high yield of a poiypeptide in yeast. Accordingly, the present invention relates to a DNA expression cassette comprising the following sequence:
5•-P-SP-LS-PS-*gene*-(T) -3' wherein P is a promoter sequence,
SP is a DNA sequence encoding a signal peptide,
LS is a DNA sequence encoding a leader peptide with the general formula I:
GlnProIle(Asp/Glu) (Asp/Glu)X1(Glu/Asp)X2AsnZ(Thr/Ser)X3 (I) wherein
X1 is a peptide bond or a codable amino acid;
X2 is a peptide bond, a codable amino acid or a sequence of up to 4 codable amino acids which may be the same or different; Z is a codable amino acid except Pro; and
X3 is a sequence of from 4 to 30 codable amino acids which may be the same or different;
PS is a E)NA sequence encoding a processing site; *gene* is a DNA sequence encoding a poiypeptide; T is a terminator sequence; and i is 0 or 1.
In the present context, the expression "leader peptide" is understood to indicate a peptide whose function is to allow the expressed poiypeptide to be directed from the endoplasmic reticulum to the Golgi apparatus and further to a secretory ve¬ sicle for secretion into the medium, (i.e. exportation of the expressed poiypeptide across the cell wall or at least through the cellular membrane into the periplasmic space of the cell) . The term "synthetic" used in connection with leader peptides is intended to indicate that the leader peptide is one not found in nature. The term "signal peptide" is understood to mean a presequence which is predominantly hydrophobic in nature and present as an N-terminal sequence of the precursor form of an extracellular protein expressed in yeast. The function of the signal peptide is to allow the expressed protein to be secreted to enter the endoplasmic reticulum. The signal peptide is normally cleaved off in the course of this process. The signal peptide may be heterologous or homologous to the yeast organism producing the protein.
The expression "poiypeptide" is intended to indicate a heterologous poiypeptide, i.e. a poiypeptide which is not pro¬ duced by the host yeast organism in nature as well as a homologous poiypeptide, i.e. a poiypeptide which is produced by the host yeast organism in nature and any preform thereof. In a preferred embodiment, the expression cassette of the present invention encodes a heterologous poiypeptide.
The expression "a codable amino acid" is intended to indicate an amino acid which can be coded for by a triplet ("codon") of nucleotides.
When, in the amino acid sequences given in the present specification, the three letter codes of two amino acids, separated by a slash, are given in brackets, e.g. (Asp/Glu) , this is intended to indicate that the sequence has either the one or the other of these amino acids in the pertinent position.
In a further aspect, the present invention relates to a process for producing a poiypeptide in yeast, the process comprising culturing a yeast cell, which is capable of expressing a poiypeptide and which is transformed with a yeast expression vector as described above including a leader peptide sequence of the invention, in a suitable medium to obtain expression and secretion of the poiypeptide, after which the poiypeptide is recovered from the medium. BRIEF DESCRIPTION OF THE DRAWINGS
The present invention is further illustrated with reference to the appended drawings wherein
Fig. 1 schematically shows the plasmid pAK492;
Fig. 2 shows part of the DNA sequence encoding the signal peptide/leader/MI3 insulin precursor;
Fig. 3 shows the construction of the plasmid pAK546;
Fig. 4 shows the amino acid sequence of the leader SEQ ID No. 4 and the DNA sequence encoding it;
Fig. 5 shows the DNA sequence of S. cerevisiae expression plasmid pAK546 encoding the YAP3 signal peptide, leader SEQ ID No. 4 and the MI3 insulin precursor and the encoded amino acid sequence;
Fig. 6 shows the amino acid sequence of the leader SEQ ID No. 6 and the' DNA sequence encoding it;
Fig. 7 shows the amino acid sequence of the leader SEQ ID No. 8 and the DNA sequence encoding it;
Fig. 8 shows the amino acid sequence of the leader SEQ ID No. 17 and the DNA sequence encoding it;
Fig. 9 shows the amino acid sequence of the leader SEQ ID No. 16 and the DNA sequence encoding it;
Fig. 10 shows the amino acid sequence of the leader SEQ ID No. 19 and the DNA sequence encoding it;
Fig. 11 shows the amino acid sequence of the leader SEQ ID No. 20 and the DNA sequence encoding it; Fig. 12 shows the amino acid sequence of the leader SEQ ID No. 21 and the DNA sequence encoding it;
Fig. 13 shows the DNA fragment of pAK527 used as the direct template in the construction of SEQ ID Nos. 4 and 6;
Fig. 14 shows the DNA fragment of pAK531 used as the direct template in the construction of SEQ ID No. 8;
Fig. 15 shows the DNA fragment of pAK555 used as the direct template in the construction of SEQ ID Nos. 16 and 17;
Fig. 16 shows the DNA fragment of pAK559 used as the direct template in the construction of SEQ ID Nos. 19 and 20; and
Fig. 17 shows the DNA fragment of pAK562 used as the direct template in the construction of SEQ ID No. 21;
Fig. 18 shows the amino acid sequence of the leader SEQ ID No. 27 and the DNA sequence SEQ ID No. 66 encoding it;
Fig. 19 shows the amino acid sequence SEQ ID No. 70 of an N- terminally extended MI3 insulin precursor and the DNA sequence SEQ ID No. 71 encoding it;
Fig. 20 shows the amino acid sequence of the leader SEQ ID No. 67 and the DNA sequence SEQ ID No. 69 encoding it;
Fig. 21 shows the DNA fragment SEQ ID No. 72 of pAK614 used as the direct template in the construction of SEQ ID No. 27; and
Fig. 22 shows the DNA fragment SEQ ID No. 73 of pAK625 used as the direct template in the construction of SEQ ID No. 67. DETAILED DISCLOSURE OF THE INVENTION
When X1 in general formula I designates an amino acid, it is preferably Ser, Thr or Ala. When X2 in general formula I designates one amino acid, it is preferably Ser, Thr or Ala. When X2 in general formula I designates a sequence of two amino acids, it is preferably Serlle. When X2 in general formula I designates a sequence of three amino acids, it is preferably SerAlalle. When X2 in general formula I designates a sequence of four amino acids it is preferably SerPheAlaThr. In a preferred embodiment, X3 is an amino acid sequence of the general formula II
X4-X5-X6 (II)
wherein X4 is a sequence of from 1 to 21 codable amino acids which may be the same or different, X5 is Pro or one of the amino acid sequences ValAsnLeu or LeuAlaAsnValAlaMetAla, and X6 is a sequence of from 1 to 8 codable amino acids which may be the same or different.
In general formula II, X4 is preferably an amino acid sequence which includes one or more of the motifs LeuValAsnLeu, SerValAsnLeu, MetAlaAsp, ThrGluSer, ArgPheAlaThr or ValAlaMetAla; or X4 is an amino acid sequence which includes the sequence AsnSerThr or AsnThrThr; or X4 is an amino acid sequence which includes the sequence (Ser/Leu)ValAsnLeu, (Ser/Leu)ValAsnLeuMetAlaAsp,
(Ser/Leu)ValAsnLeuMetAlaAspAsp, (Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSer, (Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSerlle or (Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSerArgPheAlaThr; or X4 is an amino acid sequence which includes the sequence Asn(Thr/Ser)ThrLeu, Asn(Thr/Ser)ThrLeuAsnLeu or Asn(Thr/Ser)ThrLeuValAsnLeu; or any combination thereof.
In general formula II, X5 is preferably Pro or an amino acid sequence which includes the sequence ValAsnLeu, LeuAlaAsnValAlaMetAla, LeuAspValValAsnLeuProGly or LeuAspValValAsnLeuIleSerMet.
When X6' in general formula II, designates one amino acid, it is preferably Ala, Gly, Leu, Thr, Val or Ser. When X6' in general formula II, designates a sequence of two amino acids, it is preferably GlyAla or SerAla. When X6' in general formula II, designates a sequence of three amino acids, it is preferably AlaValAla. When X6, in general formula II, designates a sequence of eight amino acids, it is preferably GlyAlaAspSerLysThrValGlu.
Examples of preferred leader peptides coded for by the DNA sequence LS are:
SEQ ID No. 1 GlnProIleAspGluAspAsnAspThrSerValAsnLeuProAla;
SEQ ID No. 2 GlnProIleAspAspGluAsnThrThrSerValAsnLeuProAla;
SEQ ID No. 3 GlnProIleAspAspGluSerAsnThrThrSerValAsnLeuProAla;
SEQ ID No. 4 GlnProIleAspAspGluAsnThrThrSerValAsnLeuProVal;
SEQ ID No. 5 GlnProIleAspAspThrGluAsnThrThrSerValAsnLeuProAla;
SEQ ID No. 6 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuPro- Ala;
SEQ ID No. 7 GlnProIleAspAspGluAsnThrThrSerValAsnLeuMetAla;
SEQ ID No. 8 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuPro- GlyAla; SEQ ID No. 9 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- Ala;
SEQ ID No. 10 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnValPro- Thr;
SEQ ID No. 11 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnValPro-
Thr;
SEQ ID No. 12 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuPro- Thr;
SEQ ID No. 13 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnValPro- GlyAla;
SEQ ID No. 14 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaProAlaValAla;
SEQ ID No. 15 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AspLeuAlaValGlyLeuProGlyAla;
SEQ ID No'. 16 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet-
AlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeuProGly- Ala;
SEQ ID No. 17 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- ProGlyAla;
SEQ ID No. 18 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnLeuPro-
GlyAla;
SEQ ID No. 19 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuValAsn- LeuProLeu;
SEQ ID No. 20 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet-
AlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeuAlaAsn- ValAlaMetAla;
SEQ ID No. 21 GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuValAsn- LeuProGlyAla;
SEQ ID No. 22 GlnProIleAspAspThrGluSerPheAlaThrAsnThrThrLeuVal- AsnLeuProGlyAla;
SEQ ID No. 23 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuVal- AsnLeuProLeu;
SEQ ID No. 24 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAsp-
ValValAsnLeuProGlyAla;
SEQ ID No. 25 GlnProIleAspAspThrGluSerAlaAlalleAsnThrThrLeuVal- AsnLeuProGlyAla;
SEQ ID No. 26 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuValAsn-
LeuAlaAsnValAlaMetAla;
SEQ ID No. 27 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAspVal- ValAsnLeuIleSerMetAla;
SEQ ID No. 28 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet-
AlaAsnThrThrGluSerArgPheAlaThrAsnThrThrLeuAspVal- ValAsnLeuIleSerMetAla; and
SEQ ID No. 67 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAlaLeu- AspValValAsnLeuIleSerMetAlaLysArg.
Particularly preferred leader peptides coded for by the DNA sequence LS are: SEQ ID No. 15 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AspLeuAlaValGlyLeuProGlyAla;
SEQ ID No. 16 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeuProGly- Ala;
SEQ ID No. 17 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- ProGlyAla;
SEQ ID No. 18 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnLeuPro- GlyAla;
SEQ ID No. 19 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet-
AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuValAsn- LeuProLeu;
SEQ ID No. 20 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSer11eAsnThrThrLeuVa1AsnLeuAlaAsn- ValAlaMetAla;
SEQ ID No. 21 GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuValAsn- LeuProGlyAla;
SEQ ID No. 22 GlnProIleAspAspThrGluSerPheAlaThrAsnThrThrLeuVal- AsnLeuProGlyAla;
SEQ ID No. 23 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu-
MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuVal- AsnLeuProLeu;
SEQ ID No. 24 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAsp- ValValAsnLeuProGlyAla;
SEQ ID No. 25 GlnProIleAspAspThrGluSerAlaAlalleAsnThrThrLeuVal- AsnLeuProGlyAla; SEQ ID No. 26 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuValAsn- LeuAlaAsnVa1AlaMetAla; and
SEQ ID No. 28 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAsnThrThrGluSerArgPheAlaThrAsnThrThrLeuAspVal-
ValAsnLeuIleSerMetAla.
SEQ ID No. 67 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMet- AlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeuAlaLeu- AspValValAsnLeuIleSerMetAlaLysArg.
The signal sequence (SP) may encode any signal peptide which ensures an effective direction of the expressed poiypeptide into the secretory pathway of the cell. The signal peptide may be a naturally occurring signal peptide or functional parts thereof or it may be a synthetic peptide. Suitable signal peptides have been found to be the α-factor signal peptide, the signal peptide of mouse salivary amylase, a modified carboxypeptidase signal peptide, the yeast BAR1 signal peptide or the Humicola lanuσinosa lipase signal peptide or a derivative thereof. The mouse salivary amylase signal sequence is described by Hagenbύchle, 0. et al., Nature 289 (1981) 643- 646. The carboxypeptidase signal sequence is described by Vails, L.A. et al., Cell 48 (1987) 887-897. The BAR1 signal peptide is disclosed in WO 87/02670. The yeast aspartic protease 3 signal peptide is described in Danish patent application No. 0828/93.
The yeast processing site encoded by the DNA sequence PS may suitably be any paired combination of Lys and Arg, such as LysArg, ArgLys, ArgArg or LysLys which permits processing of the poiypeptide by the KEX2 protease of Saccharomyces cerevisiae or the equivalent protease in other yeast species (Julius, D.A. et al., Cell 37 (1984) 1075). If KEX2 processing is not convenient, e.g. if it would lead to cleavage of the poiypeptide product, e.g. due to the presence of two consecutive basic amino acid internally in the desired product,, a processing site for another protease may be selected comprising an amino acid combination which is not found in the poiypeptide product, e.g. the processing site for FXa, IleGluGlyArg (cf. Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York, 1989) .
The protein produced by the method of the invention may be any protein which may advantageously be produced in yeast. Examples of such proteins are heterologous proteins such as aprotinin, tissue factor pathway inhibitor or other protease inhibitors, insulin or insulin precursors, human or bovine growth hormone, interleukin, glucagon, GLP-1, IGF-I, IGF-II, tissue plasminogen activator, transforming growth factor α or β , platelet-derived growth factor, enzymes or a functional analogue thereof. In the present context, the term "functional analogue" is meant to indicate a protein with a similar function as the native protein (this is intended to be understood as relating to the nature, rather than the level of biological activity of the native protein) . The protein may be structurally similar to the native protein and may be derived from the native protein by addition of one or more amino acids to either or both the C- and N-terminal end of the native protein, substitution of one or more amino acids at one or a number of different sites in the native amino acid sequence, deletion of one or more amino acids at either or both ends of the native protein or at one or several sites in the amino acid sequence, or insertion of one or more amino acids at one or more sites in the native amino acid sequence. Such modifications are well known for several of the proteins mentioned above. Also, precursors or intermediates for other proteins may be produced by the method of the invention. An example of such a precursor is the MI3 insulin precursor which comprises the amino acid sequence B(l- 29)AlaAlaLysA(l-21) wherein A(l-21) is the A chain of human insulin and B(l-29) is the B chain of human insulin in which Thr(B30) is missing. Preferred DNA constructs encoding leader sequences are as shown in Figs. 4 - 12 or suitable modifications thereof. Examples of suitable modifications of the DNA sequence are nucleotide substitutions which do not give rise to another amino acid sequence of the protein, but which may correspond to the codon usage of the yeast organism into which the DNA construct is inserted or nucleotide substitutions which do give rise to a different amino acid sequence and therefore, possibly, a different protein structure. Other examples of possible modifications are insertion of one or more codons into the sequence, addition of one or more codons at either end of the sequence and deletion of one or more codons at either end of or within the sequence.
The recombinant expression vector carrying the expression casette
5•-P-SP-LS-PS-*gene*-(T) ^-3•
wherein P, SP, LS, *gene*, T and i are as defined above may be any vector which is capable of replicating in yeast organisms. The promoter may be any DNA sequence which shows transcriptional activity in yeast and may be derived from genes encoding proteins either homologous or heterologous to yeast. The promoter is preferably derived from a gene encoding a protein homologous to yeast. Examples of suitable promoters are the Saccharo yces cerevisiae MFαl, TPI, ADH or PGK promoters.
The sequences shown above should preferably also be operably connected to a suitable terminator, e.g. the TPI terminator (cf. Alber, T. and Kawasaki, G., J. Mol. APPI. Genet. 1 (1982) 419-434) .
The recombinant expression vector of the invention further comprises a DNA sequence enabling the vector to replicate in yeast. Examples of such sequences are the yeast plasmid 2μ replication genes REP 1-3 and origin of replication. The vector may also comprise a selectable marker, e.g. the Schizo- saccharomvces pombe TPI gene as described by Russell, P.R., Gene 40 (1985) 125-130.
The methods used to ligate the sequence 5*-P-SP-LS-PS-*gene*- (T)^-3• and to insert it into suitable yeast vectors containing the information necessary for yeast replication, are well known to persons skilled in the art (cf., for instance, Sambrook, J., Fritsch, E.F. and Maniatis, T. , op.cit.) ♦ It will be understood that the vector may be constructed either by first preparing a DNA construct containing the entire sequence 5'-P-SP-LS-PS- *gene*-(T)^-3' and subsequently inserting this fragment into a suitable expression vector, or by sequentially inserting DNA fragments into a suitable vector containing genetic information for the individual elements (such as the promoter sequence, the signal peptide , the leader sequence GlnProIle(Asp/Glu) (Asp/GluJX1(Glu/Asp)X2AsnZ(Thr/Ser)X3, the processing site, the poiypeptide, and, if present, the terminator sequence) followed by ligation.
The yeast organism used in the method of the invention may be any suitable yeast organism which, on cultivation, produces large amounts of the desired poiypeptide. Examples of suitable yeast organisms may be strains of the yeast species
Saccharomvces cerevisiae. Saccharomvces kluvveri.
Schizosaccharomvces pombe or Saccharomvces uvarum. The transformation of the yeast cells may for instance be effected by protoplast formation followed by transformation in a manner known per se. The medium used to cultivate the cells may be any conventional medium suitable for growing yeast organisms. The secreted poiypeptide, a significant proportion of which will be present in the medium in correctly processed form, may be recovered from the medium by conventional procedures including separating the yeast cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. ammonium sulphate, followed by purification by a variety of chromatographic procedures, e.g. ion exchange chromatography, affinity chromatography or the like.
The invention is further described in the following examples which are not to be construed as limiting the scope of the invention as claimed.
EXAMPLES
Plasmids and DNA material
All expression plasmids are of the C-POT type. Such plasmids are described in EP patent application No. 171 142 and are characterized in containing the Schizosaccharo yces pombe triose phosphate isomerase gene (POT) for the purpose of plasmid selection and stabilization. A plasmid containing the POT-gene is available from a deposited E. coli strain (ATCC 39685) . The plasmids furthermore contain the S. cerevisiae triose phosphate isomerase promoter and terminator (PTPI and TTPI) . They are identical to pMT742 (Egel-Mitani, M. et al.. Gene 73 (1988) 113-120) (see Fig. 1) except for the region defined by the EcoR I-Xba I restriction sites encompassing the coding region for signal/leader/product.
The plasmids pAK527, pAK531, pAK555, pAK559, pAK562, pAK614 and pAK625 were used as DNA templates in the PCR reactions applied in the construction of the leaders described in the examples. The synthetic DNA fragments serving as the direct template are shown in Figs. 13 - 17. With the exception of the shown DNA regions the plasmids are identical to pAK492 shown in Fig. 1.
Synthetic DNA fragments were synthesized on an automatic DNA synthesizer (Applied Biosystems model 380A) using phosphoramidite chemistry and commercially available reagents (Beaucage, S.L. and Caruthers, M.H., Tetrahedron Letters 22 (1981) 1859-1869). All other methods and materials used are common state of the art knowledge (see, e.g. Sambrook, J., Fritsch, E.F. and Maniatis, T. , Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York, 1989) .
EXAMPLE 1
Synthesis of the leader SEQ ID No. 4 for expression of the MI3 insulin precursor in S_j_ cerevisiae (strain yAK546) .
The leader SEQ ID No. 4 has the following amino acid sequence: GlnProIleAspAspGluAsnThrThrSerValAsnLeuProVal
The following oligonucleotides were synthesised:
# 94 5•-TAAATCTATAACTACAAAAAACACATA-3 • SEQ ID No. 29
# 333 5•-GACTCTCTTAACTGGCAAGTTGACA-3 ' SEQ ID No. 30 # 312 5•-AAGTACAAAGCTTCAACCAAGTGAGAACCACACAAGTGTT GGTTAACGAATCTCTT-3 • SEQ ID No. 31
# 1845 5•-CATACACAATATAAACGACGG-3 • SEQ ID No. 32
The following polymerase chain reactions (PCR) were performed using the Gene Amp PCR reagent kit (Perkin Elmer, 761 Main Avewalk, CT 06859, USA) according to the manufacturers instructions. During the reaction, the PCR mixtures were overlayed with 100 μl of mineral oil (Sigma Chemical CO, St. Louis MO, USA) :
Polymerase chain reaction No. 1
5 μl of oligonucleotide # 94 (50 pmol) 5 μl of oligonucleotide # 333 (50 pmol) 10 μl of 10X PCR buffer 16 μl of dNTP mix 0.5 μl of Taq enzyme 0.5 μl of pAK527 plasmid (Fig. 13) as template (0.2 μg of DNA) 63 μl of water
A total of 12 cycles were performed, one cycle was 94βC for 1 min; 37°C for 2 in; 72°C for 3 min. The PCR mixture was then loaded onto a 2% agarose gel and electrophoresis was performed 5 using standard techniques (Sambrook, J., Fritsch, E.F. and Maniatis, T. , op.cit. ) . The resulting DNA fragment was cut out of the agarose gel and isolated using the Gene Clean kit (Bio 101 inc., PO BOX 2284, La Jolla, CA 92038, USA) according to the manufacturers instructions.
10 Polymerase chain reaction No. 2
5 μl of oligonucleotide # 312 (50 pmol) 5 μl of oligonucleotide # 94 (50 pmol) 10 μl of 10X PCR buffer 16 μl of dNTP mix 150.5 μl of Taq enzyme
10 μl of purified DNA fragment from PCR No. 1 53.5 μl of water
A total of 12 cycles were performed, one cycle was 94°C for 1 min; 37°C for 2 min; 72"C for 3 min. 20 The DNA fragment from polymerase chain reaction No. 2 was isolated and purified using the Gene Clean kit (Bio 101 inc. , PO BOX 2284, La Jolla, CA 92038, USA) according to the manufacturers instructions.
The purified PCR DNA fragment was dissolved in 10 μl of water 25 and restriction endonuclease buffer and cut with the restriction endonucleases Asp 718 and Hind III in a total volume of 15 μl according to standard techniques (Sambrook, J. , Fritsch, E.F. and Maniatis, T. , op.cit. ) . The 167 bp Asp 718/Hind III DNA fragment was subjected to electrophoresis on 30 agarose gel and purified using The Gene Clean Kit as described. The S__. cerevisiae expression plasmid pAK492 (shown in Fig. 1) is a derivative of the previously described plasmid pMT742 in which the fragment encoding the signal/leader/insulin precursor has been replaced by the EcoR I-Xba I fragment shown in Fig. 2. This fragment has been synthesized on an Applied Biosystems DNA synthesizer in accordance with the manufacturer's instructions. The plasmid pAK492 was cut with the restriction endonucleases Asp 718 and Xba I and the vector fragment of 10986 bp was isolated. The plasmid pAK492 was cut with the restriction endonucleases Hind III and Xba I and the DNA fragment of 140 bp encoding part of the MI3 insulin precursor was isolated. The three DNA fragments were ligated together using T4 DNA ligase under standard conditions (Sambrook, J., Fritsch, E.F. and Maniatis, T., op.cit. ) . The ligation mixture was then transformed into a competent E. coli strain (R-, M+) and transformants were identified by ampicillin resistance. Plasmids were isolated from the resulting E. coli colonies using standard DNA iniprep technique (Sambrook, J., Fritsch, E.F. and Maniatis, T. , op.cit. ) . checked with appropriate restrictions endonucleases i.e. EcoR I, Xba I, Nco I and Hind III. The selected plasmid, pAK546, was shown by DNA sequencing analysis (Sequenase, U.S. Biochemical Corp.) using the primer # 94 to contain a DNA sequence encoding the leader SEQ ID No. 4. For the DNA sequence encoding the leader SEQ ID No. 4, see Fig. 4) . The plasmid pAK546 was transformed into S_ cerevisiae strain MT663 as described in European published patent application No. 214 826 and the resulting strain was named yAK546. The DNA sequence of the protein coding region of the expression plasmid is given in Fig. 5.
EXAMPLE 2
Synthesis of the leader SEQ ID No. 6 for expression of the MI3 insulin precursor in S_;_ cerevisiae (strain yAK531) .
The leader SEQ ID No. 6 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuProAla The following oligonucleotide was synthesised:
# 331 5'-GAATCTCTTAGCTGGCAAGTTGACAGAAGTAGTGTTAG
TTTCAGAGTCGTCAATT-3 • SEQ ID No. 33
The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 331 was used insted of oligonucleotide # 333.
The Asp 718/Hind III DNA fragment of 168 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK531, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 6. For the DNA sequence encoding the leader SEQ ID No. 6, see Fig. 6. The plasmid pAK531 was transformed into S_ cerevisiae strain MT663 as described in European patent application 86306721.1 and the resulting strain was named yAK531. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 3
Synthesis of the leader SEQ ID No. 8 for expression of the MI3 insulin precursor in ∑ . cerevisiae (strain yAK547) .
The leader SEQ ID No. 8 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuProGlyAla
The following oligonucleotide was synthesised:
# 345 δ^-AACGAATCTCTTAGCACCTGGCAAGTTGACAGAAGT-S' SEQ ID No. 34 The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 345 was used insted of oligonucleotide # 333 and plasmid pAK531 (Fig. 14) was used as template.
5 The Asp 718/Hind III DNA fragment of 171 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK547, was shown by DNA sequencing
10 analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 8. For the DNA sequence encoding the leader SEQ ID No. 8, see Fig. 7. The plasmid pAK547 was transformed into S_j_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting
15 strain was named yAK547. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 4
Synthesis of the leader SEQ ID No. 17 for expression of the MI3 20 insulin precursor in S_ cerevisiae (strain yAK561) .
The leader SEQ ID No. 17 has the following amino acid sequence: GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsnLeuProGlyAla
The following oligonucleotide was synthesised:
25 # 376 5'-AACGAATCTCTTAGCACCTGGCAAGTTGACCAAAGTAG
TGTTGATAGATTCAGTGTCGTC-3 ' SEQ ID No. 35
The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 376 was used insted of oligonucleotide # 333 and plasmid pAK555 (Fig. 30 15) was used as template. The Asp 718/Hind III DNA fragment of 180 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK561, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 17. For the DNA sequence encoding the leader SEQ ID No. 17, see Fig. 8. The plasmid pAK561 was transformed into S_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK561. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 5 Synthesis of the leader SEQ ID No. 16 for expression of the MI3 insulin precursor in S_Ϊ. cerevisiae (strain yAK559) .
The leader SEQ ID No. 16 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerlleAsnThrThrLeuValAsnLeuProGlyAla
The following oligonucleotide was synthesised:
# 375 5'-AACGAATCTCTTAGCACCTGGCAAGTTAACCAAAGTAGT
GTTGATAGATTCAGTGTCGTCAGCCATCAAGTTGAC-31 SEQ ID No. 36
The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 375 was used insted of oligonucleotide # 333 and plasmid pAK555 (Fig. 15) was used as template.
The Asp 718/Hind III DNA fragment of 222 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK559, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 16. For the DNA sequence encoding the leader SEQ ID No. 16, see Fig. 9. The plasmid pAK559 was transformed into ∑ _ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK559. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 6
Synthesis of the leader SEQ ID No. 19 for expression of the MI3 insulin precursor in S_ cerevisiae (strain yAK580) .
The leader SEQ ID No. 19 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerArgPheAlaThrAsnThrThrLeuValAsnLeuProLeu
The following oligonucleotide was synthesised:
# 384 5'-AACGAATCTCTTCAATGGCAAGTTAACCAAAGTAGTGT TAGTAGCGAATCTAGATTCAGTGTCGTCAGCCAT-31 SEQ ID No. 37
The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 384 was used insted of oligonucleotide # 333 and plasmid pAK559 (Fig. 16) was used as template.
The Asp 718/Hind III DNA fragment of 228 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S__. cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK580, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 19. For the DNA sequence encoding the leader SEQ ID No. 19, see Fig. 10. The plasmid pAK580 was transformed into S_j_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK580. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 7
Synthesis of the leader SEQ ID No. 20 for expression of the MI3 insulin precursor in S_j_ cerevisiae (strain yAK583) .
The leader SEQ ID No. 20 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerI1eAsnThrThrLeuVa1AsnLeuAlaAsnVa1AlaMetAla
The following oligonucleotide was synthesised:
# 390 5•-AACGAATCTCTTAGCCATGGCAACGTTAGCCAAGTTAA
CCAAAGT-31 SEQ ID No. 38
The polymerase chain reaction was performed as described in Example 1 with the expection that oligonucleotide # 390 was used insted of oligonucleotide # 333 and plasmid pAK559 (Fig. 16) was used as template.
The Asp 718/Hind III DNA fragment of 231 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK583, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 20. For the DNA sequence encoding the leader SEQ ID No. 20, see Fig. 11. The plasmid pAK583 was transformed into S_ cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK583. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5.
EXAMPLE 8
Synthesis of the leader SEQ ID No. 21 for expression of the MI3 insulin precursor in S__. cerevisiae (strain yAK586) .
The leader SEQ ID No. 21 has the following amino acid sequence: GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuValAsnLeuProGlyAla
The following oligonucleotide was synthesised:
# 401 5'-AACGAATCTCTTAGCACCTGGCAAGTTGACCAAAGTAG
TGTTGATAGCAGATTCAGTGTCG-3 ' SEQ ID No. 39
The polymerase chain reaction was performed as described in Example 1 with the exception that oligonucleotide # 401 was used insted of oligonucleotide # 333 and plasmid pAK562 (Fig. 17) was used as template.
The Asp 718/Hind III DNA fragment of 183 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_j_ cerevisiae expression plasmid as described in Example 1. The selected plasmid, pAK586, was shown by DNA sequencing analysis, as described in Example 1, to contain a DNA sequence encoding the leader SEQ ID No. 21, see Fig. 12. The plasmid pAK586 was transformed into ∑_. cerevisiae strain MT663 as described in European patent application No. 86306721.1 and the resulting strain was named yAK586. The DNA sequences encoding the signal peptide and the insulin precursor MI3 were the same as those shown in Fig. 5. EXAMPLE 9
Expression of the MI3 insulin precursor using selected leader sequences according to the present invention.
Yeast strains harbouring plasmids as described above, were grown in YPD medium (Sherman, F. et al.. Methods in Yeast Genetics. Cold Spring Harbor Laboratory Press, 1981) . For each strain 6 individual 5 ml cultures were shaken at 30°C for 72 hours, with a final OD600 of approx. 15. After centrifugation the supernatant was removed for HPLC analysis by which method the concentration of secreted insulin precursor was measured by a method described by Snel, L. et al. Chromatographia 24 (1987) 329-332.
In Table 1 the expression levels of the insulin precursor, MI3, obtained by use of selected leader sequences according to the present invention, are given as a percentage of the level obtained with transformants of pMT742, utilizing the MFα(l) leader of S_j_ cerevisiae.
Table 1
Leader Expression level, %
MT748 α-leader 100
SEQ ID No. 15 87
SEQ ID No. 16 215
SEQ ID No. 17 157
SEQ ID No. 19 166
SEQ ID No. 20 86
SEQ ID No. 21 145
SEQ ID No. 22 137
SEQ ID No. 23 121 EXAMPLE 10
Synthesis of the leader SEQ ID No. 27 for expression of the extended MI3 insulin precursor in S_ cerevisiae (strain yAK677) .
5 .
The leader SEQ ID No. 27 has the following amino acid sequence: GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr- GluSerArgPheAlaThrAsnThrThrLeuAspValValAsnLeuIleSerMetAla
The following oligonucleotides were synthesised:
10 # 440 5'-GGTTAACGAACTTTGGAGCTTCAGCTTCAGCTTCTTCTCTCTTAGCCAT
GGAGATCAAGTTAACAACATCCAAAGTAGTGTT-3' SEQ ID No. 64 and
# 441 5•-CAAGTACAAAGCTTCAACCAAGTGGGAACCGCACAAGTGTTGGTTAACG
AACTT-3' SEQ ID No. 65
15 Polymerase chain reactions were performed as described in Example in 1 with the exception that oligonucleotide # 440 was used instead of oligonucleotide # 333 and plasmid pAK614 was used as template. For the second polymerase chain reaction, oligonucleotide # 441 was used instead of oligonucleotide #
20312.
The purified PCR DNA fragment was isolated and digested with the restriction endonucleases Asp 718 and Hind III as described in Example 1. The Asp 718/Hind III DNA fragment of 268 bp was subjected to electrophoresis on agarose gel and
25 purified as described in Example 1. The Asp 718/Hind III DNA fragment was subcloned into the S_. cerevisiae expression plasmid as described in Example 1, with the exception that the 140 bp Hind III/Xba I DNA fragment was derived from pAK602 and encodes AspB28 human insulin. The selected
30 plasmid, pAK616, was shown by DNA sequencing analysis, as described in Example 1, to contain the DNA sequence encoding the leader SEQ ID No. 27. For the DNA sequence, SEQ ID No. 66, encoding the leader SEQ ID No. 27, see Fig. 18. The Asp 718/Hind III DNA fragment of 268 bp from pAK616 was isolated and ligated with the 10986 bp Asp 718/Xba I DNA fragment from pAK601 and the 140 bp DNA fragment Hind Ill/Xba I from pAK464 (encoding an extended version of AspB28 human insulin) and named pAK 625. The 180 bp Asp 718/Nco I DNA fragment from pAK625 was isolated and ligated with the 221 bp Nco I/Xba I DNA fragment from pJB146 (encoding and extended version of the insulin precursor) and the 10824 bp Asp 718/Xba I DNA fragment from pAK601 and the resulting plasmid was named pAK677. The plasmid pAK677 was transformed into S_j_ cerevisiae strain MT663 as described in European patent application 86306721.1 and the resulting strain was named yAK677. With the exception of the DNA sequence encoding the leader, the DNA sequence encoding the signal peptide is as described in Fig. 5. The DNA sequence coding for the extended MI3 insulin precursor is as described in Fig. 19.
EXAMPLE l'l
Synthesis of the leader SEQ ID No. 67 for expression of the extended MI3 insulin precursor in S_j_ cerevisiae (yAK680)
The leader SEQ ID No. 67 has the following amino acid sequence:
GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeuMetAlaAspAspThr GluSerArgPheAlaThrAsnThrThrLeuAlaLeuAspValValAsnLeuIleSerMet Ala
The following oligonucleotide was synthesised:
# 577 5•-TCTCTTAGCCATGGAGATCAAGTTAACAACATCCAAAG
CCAAAGTAGTGTT-3 ' SEQ ID No. 68 The PCR was performed as described in Example in 1 with the exception that oligonucleotide # 577 was used instead of oligonucleotide # 333 and plasmid pAK625 was used as template and the second PCR was not performed. The PCR fragment was digested with the restriction endonucleases Asp 718 and Nco I as described in Example 1.
The Asp 718/Nco I DNA fragment of 190 bp was subjected to electrophoresis on agarose gel and purified as described in Example 1 expect that the 10824 bp Asp 718/Xba I vector DNA fragment was isolated from and from pAK601. The 190 bp Asp 718/Nco I DNA fragment was subcloned into the S__. cerevisiae expression plasmid as described in Example 1, expect that the 221 bp DNA fragment Nco I/Xba I (encoding an extended version of the MI3 insulin precursor) was isolated from pAK677 and used instead of the Hind Ill/Xba I DNA fragment. The selected plasmid was shown by DNA sequencing analysis as described in Example 1 to contain the DNA sequence encoding the leader SEQ ID No. 67 and named pAK680. For the DNA sequence, SEQ ID No. 69, encoding the leader SEQ ID No. 67, see Fig. 20. The plasmid pAK680 was transformed into S__. cerevisiae strain
MT663 as 'described in European patent application 86306721.1 and the resulting strain was named yAK680. With the exception of the DNA sequence encoding the leader, the DNA sequence encoding the signal peptide is as described in Fig. 5 and the extended insulin precursor MI3 DNA sequence is as described in Fig. 19.
EXAMPLE 12
Expression of N-terminally extended MI3 insulin precursors using the leader sequences SEQ ID No. 27 and SEQ ID No. 67 according to the present invention.
Yeast strains harbouring plasmids as described above, were grown in YPD medium (Sherman, F. et al., Methods in Yeast Genetics. Cold Spring Harbor Laboratory Press, 1981) . For each strain 6 individual 5 ml cultures were shaken at 30°C for 72 hours, with a final OD600 of approximately 15. After centrifugation the supernatant was removed for HPLC analysis by which method the concentration of secreted insulin precursor was measured by a method described by Snel, L. et al. Chromatographia 24 (1987) 329-332.
In Table 2 the expression levels of some N-terminally extended MI3 insulin precursors, obtained by use of the leader sequences SEQ ID No. 27 and SEQ ID No. 67 according to the present invention, are given as a percentage of the level obtained with transformants of pMT742, utilizing the MFα(l) leader of S_ cerevisiae.
Table 2
Strain Signal peptide Leader Extension Relative to MT748
MT748 α α
yAK675 YAP3 SEQ ID EEAEAEAP 251%
No. 27 K
yAK677 YAP3 SEQ ID EEAEAEAE 224%
No. 27 PK
yAK681 YAP3 SEQ ID EEAEAEAP 248%
No. 67 K
yAK680 YAP3 SEQ ID EEAEAEAE 362%
No. 67 PK SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: Novo Nordisk A/S
(B) STREET: Novo Alle
(C) CITY: DK-2880 Bagsvaerd (E) COUNTRY: Denmark
(G) TELEPHONE: +4544448888
(H) TELEFAX: +4544490555
(I) TELEX: 37173
(ii) TITLE OF INVENTION: SYNTHETIC LEADERS PEPTIDE SEQUENCES
(iii) NUMBER OF SEQUENCES: 73
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Novo Nordisk A/S
Corporate Patents
(B) STREET: Novo Alle
(C) CITY: DK-2880 Bagsvaerd (E) COUNTRY: Denmark
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: DK 0705/94 and US 08/282,852
(B) FILING DATE: 16-JUN-1994 and 29-JUL-1994
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Jørgensen, Dan et al .
(C) REFERENCE/DOCKET NUMBER: 4085-WO.DJ
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: +4544448888
(B) TELEFAX: +4544493256
(2) INFORMATION FOR SEQ ID N0:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
Gi n Pro H e Asp Gl u Asp Asn Asp Thr Ser Val Asn Leu Pro Al a 1 5 10 15
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2:
Gin Pro He Asp Asp Glu Asn Thr Thr Ser Val Asn Leu Pro Ala 1 5 10 15
(2) INFORMATION FOR SEQ ID N0:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3:
Gin Pro He Asp Asp Gl u Ser Asn Thr Thr Ser Val Asn Leu Pro Al a 1 5 10 15
(2) INFORMATION FOR SEQ ID N0:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4:
Gin Pro H e Asp Asp Gl u Asn Thr Thr Ser Val Asn Leu Pro Val 1 5 10 15
(2) INFORMATION FOR SEQ ID NO: 5: (i ) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5:
Gin Pro He Asp Asp Thr Gl u Asn Thr Thr Ser Val Asn Leu Pro Ala 1 5 10 15
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
Gin Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
Al a
(2) INFORMATION FOR SEQ ID N0:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7:
Gin Pro He Asp Asp Gl u Asn Thr Thr Ser Val Asn Leu Met Ala 1 5 10 15
(2) INFORMATION FOR SEQ ID N0:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
Gly Ala
(2) INFORMATION FOR SEQ ID N0:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Ala
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
_
(i i ) MOLECULE TYPE: peptide
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Val Pro 1 5 10 15
Thr
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Leu Val Asn Val Pro 1 5 10 15
Thr
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Gi n Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
Thr
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Leu Val Asn Val Pro 1 5 10 15
Gly Al a
(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Pro Al a Val Al a 20
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Asp Leu Al a Val Gly Leu Pro Gly Al a 20 25
(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Asp Asp Thr Gl u Ser H e Asn Thr Thr Leu Val Asn Leu Pro Gly 20 25 30
Al a
(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: Gin Pro He Asp Asp Thr Gl u Ser He Asn Thr Thr Leu Val Asn Leu 1 5 10 15
Pro Gly Ala
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
Gi n Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Leu Val Asn Leu Pro 1 5 10 15
Gly Ala
(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Ala Asp Asp Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Val Asn 20 25 30
Leu Pro Leu 35
(2) INFORMATION FOR SEQ ID N0:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Asp Asp Thr Gl u Ser H e Asn Thr Thr Leu Val Asn Leu Al a Asn 20 25 30
Val Al a Met Ala 35
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21:
Gin Pro He Asp Asp Thr Glu Ser Ala He Asn Thr Thr Leu Val Asn 1 5 10 15
Leu Pro Gly Ala 20
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
Gin Pro He Asp Asp Thr Gl u Ser Phe Al a Thr Asn Thr Thr Leu Val 1 5 10 15
Asn Leu Pro Gly Ala 20
(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID N0:23:
Gin Pro He Asp Asp Thr Gl u Ser He Asn Thr Thr Leu Val Asn Leu 1 5 10 15
Met Ala Asp Asp Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Val 20 25 30
Asn Leu Pro Leu 35
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24:
Gin Pro He Asp Asp Thr Gl u Ser H e Asn Thr Thr Leu Val Asn Leu 1 5 10 15
Met Ala Asp Asp Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Asp 20 25 30
Val Val Asn Leu Pro Gly Ala
35
(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25:
Gi n Pro H e Asp Asp Thr Gl u Ser Al a Al a H e Asn Thr Thr Leu Val 1 5 10 15
Asn Leu Pro Gly Ala 20
(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Asp Asp Thr Gl u Ser Arg Phe Al a Thr Asn Thr Thr Leu Val Asn 20 25 30
Leu Al a Asn Val Al a Met Al a 35
(2) INFORMATION FOR SEQ ID N0:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:27:
Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Asp Asp Thr Gl u Ser Arg Phe Al a Thr Asn Thr Thr Leu Asp Val 20 25 30
Val Asn Leu He Ser Met Al a 35
(2) INFORMATION FOR SEQ ID NO:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28:
Gin Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Al a Asn Thr Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Asp Val 20 25 30 Val Asn Leu He Ser Met Al a 35
(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: TAAATCTATA ACTACAAAAA ACACATA 27
(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: GACTCTCTTA ACTGGCAAGT TGACA 25
(2) INFORMATION FOR SEQ ID NO:31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: AAGTACAAAG CTTCAACCAA GTGAGAACCA CACAAGTGTT GGTTAACGAA TCTCTT 56
(2) INFORMATION FOR SEQ ID NO:32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: CATACACAAT ATAAACGACG G 21
(2) INFORMATION FOR SEQ ID N0:33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 55 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: GAATCTCTTA GCTGGCAAGT TGACAGAAGT AGTGTTAGTT TCAGAGTCGT CAATT 55
(2) INFORMATION FOR SEQ ID N0:34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (Xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: AACGAATCTC TTAGCACCTG GCAAGTTGAC AGAAGT 36
(2) INFORMATION FOR SEQ ID N0:35:
(i) SEQUENCE CHARACTERISTICS: -
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: AACGAATCTC TTAGCACCTG GCAAGTTGAC CAAAGTAGTG TTGATAGATT CAGTGTCGTC 60
(2) INFORMATION FOR SEQ ID N0:36: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:36: AACGAATCTC TTAGCACCTG GCAAGTTAAC CAAAGTAGTG TTGATAGATT CAGTGTCGTC 60 AGCCATCAAG TTGAC 75
(2) INFORMATION FOR SEQ ID NO:37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 72 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:37: ' AACGAATCTC TTCAATGGCA AGTTAACCAA AGTAGTGTTA GTAGCGAATC TAGATTCAGT 60 GTCGTCAGCC AT 72
(2) INFORMATION FOR SEQ ID NO:38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: AACGAATCTC TTAGCCATGG CAACGTTAGC CAAGTTAACC AAAGT 45
(2) INFORMATION FOR SEQ ID N0:39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:39: AACGAATCTC TTAGCACCTG GCAAGTTGAC CAAAGTAGTG TTGATAGCAG ATTCAGTGTC 60 G 61
(2) INFORMATION FOR SEQ ID N0:40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 372 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 82..351
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40:
GAATTCATTC AAGAATAGTT CAAACAAGAA GATTACAAAC TATCAATTTC ATACACAATA 60
TAAACGACGG GTACCAAAAT A ATG AAA CTG AAA ACT GTA AGA TCT GCG GTC 111
Met Lys Leu Lys Thr Val Arg Ser Ala Val 1 5 10
CH TCG TCA CTC TTT GCA TCT CAG GTC CTT GGC CAA CCA ATA GAC GAA 159 Leu Ser Ser Leu Phe Ala Ser Gin Val Leu Gly Gin Pro He Asp Glu 15 20 25
GAC AAC GAC ACT TCT TCC ATG GCT AAG AGA TTC GTT AAC CAA CAC TTG 207 Asp Asn Asp Thr Ser Ser Met Ala Lys Arg Phe Val Asn Gin His Leu 30 35 40
TGC GGT TCC CAC TTG GTT GAA GCT TTG TAC TTG GTT TGC GGT GAA AGA 255 Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg 45 50 55
GGT TTC TTC TAC ACT CCT AAG GCT GCT AAG GGT ATT GTC GAG CAA TGC 303 Gly Phe Phe Tyr Thr Pro Lys Ala Ala Lys Gly He Val Glu Gin Cys 60 65 70
TGT ACC TCC ATC TGC TCC TTG TAC CAA TTG GAA AAC TAC TGC AAC TAGACGCAGC 358
Cys Thr Ser He Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 75 80 85 90
CCGCAGGCTC TAGA 372
(2) INFORMATION FOR SEQ ID N0:41:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 89 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val Leu Gly Gin Pro He Asp Glu Asp Asn Asp Thr Ser Ser 20 25 30
Met Al a Lys Arg Phe Val Asn Gi n Hi s Leu Cys Gly Ser Hi s Leu Val 35 40 45
Glu Al a Leu Tyr Leu Val Cys Gly Gl u Arg Gly Phe Phe Tyr Thr Pro 50 55 60
Lys Al a Al a Lys Gly H e Val Gl u Gi n Cys Cys Thr Ser H e Cys Ser 65 70 75 80
Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 85
(2) INFORMATION FOR SEQ ID N0:42:
(i) SEQUENCE CHARACTERISTICS: •(A) LENGTH: 45 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..45
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
CAA CCA ATT GAC GAC GAA AAC ACT ACT TCT GTC AAC TTG CCA GTT 45
Gin Pro He Asp Asp Gl u Asn Thr Thr Ser Val Asn Leu Pro Val 1 5 10 15
(2) INFORMATION FOR SEQ ID NO:43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:43:
Gi n Pro He Asp Asp Gl u Asn Thr Thr Ser Val Asn Leu Pro Val 1 5 10 15
(2) INFORMATION FOR SEQ ID NO:44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 297 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..276
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44:
ATG AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA 48 Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
TCT CAG GTC CTT GGC CAA CCA ATT GAC GAC GAA AAC ACT ACT TCT GTC 96 Ser Gin Val Leu Gly Gin Pro He Asp Asp Glu Asn Thr Thr Ser Val 20 25 30
AAC TTG CCA GTT AAG AGA TTC GTT AAC CAA CAC TTG TGT GGT TCT CAC 144 Asn Leu Pro Val Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser His 35 40 45
TTG GTT GAA GCT TTG TAC TTG GTT TGC GGT GAA AGA GGT TTC TTC TAC 192 Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr 50 55 60
ACT CCT AAG GCT GCT AAG GGT ATT GTC GAA CAA TGC TGT ACC TCC ATC 240 Thr Pro Lys Ala Ala Lys Gly He Val Glu Gin Cys Cys Thr Ser He 65 70 75 80
TGC TCC TTG TAC CAA TTG GAA AAC TAC TGC AAC TAGACGCAGC CCGCAGGCTC 293 Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 85 90
TAGA 297
(2) INFORMATION FOR SEQ ID NO:45:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 91 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: l i near
(i i ) MOLECULE TYPE: pepti de
(xi ) SEQUENCE DESCRIPTION: SEQ ID N0:45:
Met Lys Leu Lys Thr Val Arg Ser Al a Val Leu Ser Ser Leu Phe Al a 1 5 10 15
Ser Gi n Val Leu Gly Gi n Pro H e Asp Asp Gl u Asn Thr Thr Ser Val 20 25 30
Asn Leu Pro Val Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser His 35 40 45
Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr 50 55 60
Thr Pro Lys Ala Ala Lys Gly He Val Glu Gin Cys Cys Thr Ser He 65 70 75 80
Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 85 90
(2) INFORMATION FOR SEQ ID N0:46:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..51
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:46:
CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT GTC AAC TTG CCA 48
Gin Pro He Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
GCT 51
Al a
(2) INFORMATION FOR SEQ ID NO:47:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:47:
Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
Ala
(2) INFORMATION FOR SEQ ID NO:48:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..54
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48:
CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT GTC AAC TTG CCA 48
Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Pro 1 5 10 15
GGT GCT < 54
Gly Ala
(2) INFORMATION FOR SEQ ID NO:49:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 57 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..57
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49:
CAA CCA ATT GAC GAC ACT GAA TCT ATC AAC ACT ACT TTG GTC AAC TTG 48
Gin Pro He Asp Asp Thr Glu Ser He Asn Thr Thr Leu Val Asn Leu 1 5 10 15 CCA GGT GCT 57 Pro Gly Ala
(2) INFORMATION FOR SEQ ID N0:50:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 99 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..99
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:50:
CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT GTC AAC TTG ATG 48 Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
GCT GAC GAC ACT GAA TCT ATC AAC ACT ACT TTG GTT AAC TTG CCA GGT 96 Ala Asp Asp Thr Glu Ser He Asn Thr Thr Leu Val Asn Leu Pro Gly 20 25 30
GCT 99
Ala
(2) INFORMATION FOR SEQ ID N0:51:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 105 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..105
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51:
CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT GTC AAC TTG ATG 48
Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15 GCT GAC GAC ACT GAA TCT AGA TTC GCT ACT AAC ACT ACT TTG GTT AAC 96 Ala Asp Asp Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Val Asn 20 25 30
TTG CCA TTG 105
Leu Pro Leu
35
(2) INFORMATION FOR SEQ ID NO:52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 108 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..108
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:52:
CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT GTC AAC TTG ATG 48 Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
GCT GAC GAC ACT GAA TCT ATC AAC ACT ACT TTG GTT AAC TTG GCT AAC 96 Ala Asp Asp Thr Glu Ser He Asn Thr Thr Leu Val Asn Leu Ala Asn 20 25 30
GTT GCC ATG GCT 108
Val Ala Met Ala 35
(2) INFORMATION FOR SEQ ID NO:53:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..60
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:53: CAA CCA ATT GAC GAC ACT GAA TCT GCT ATC AAC ACT ACT TTG GTC AAC 48 Gi n Pro H e Asp Asp Thr Gl u Ser Al a H e Asn Thr Thr Leu Val Asn 1 5 10 15
TTG CCA GGT GCT 60 Leu Pro Gly Ala
20
(2) INFORMATION FOR SEQ ID N0:54:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 276 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 113..274
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:54:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCATT CAAGAATAGT TCAAACAAGA 60
AGATTACAAA CTATCAATTT CATACACAAT ATAAACGACG GGTACCAAAA TA ATG 115
Met 1
AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA TCT 163 Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala Ser 5 10 15
CAG GTC CTT GGC CAA CCA ATT GAC GAC GAA AAC ACT ACT TCT GTT AAC 211 Gin Val Leu Gly Gin Pro He Asp Asp Glu Asn Thr Thr Ser Val Asn 20 25 30
TTG CCA GCT AAG AGA TTC GTT AAC CAA CAC TTG TGC GGT TCC CAC TTG 259 Leu Pro Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser His Leu 35 40 45
GTT GAA GCT TTG TAC TT 276
Val Glu Ala Leu Tyr 50
(2) INFORMATION FOR SEQ ID NO:55:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val Leu Gly Gin Pro He Asp Asp Glu Asn Thr Thr Ser Val 20 25 30
Asn Leu Pro Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser His 35 40 45
Leu Val Glu Ala Leu Tyr 50
(2) INFORMATION FOR SEQ ID NO:56:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 282 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 113..280
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:56:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCATT CAAGAATAGT TCAAACAAGA 60
AGATTACAAA CTATCAATTT CATACACAAT ATAAACGACG GGTACCAAAA TA ATG 115
Met 1
AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA TCT 163 Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala Ser 5 10 15
CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser 20 25 30
GTC AAC TTG CCA GCT AAG AGA TTC GTT AAC CAA CAC TTG TGC GGT TCC 259 Val Asn Leu Pro Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser 35 40 45
CAC TTG GTT GAA GCT TTG TAC TT 282
His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID NO:57: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0.-57:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val Leu Gly Gi n Pro H e Asp Asp Thr Gl u Ser Asn Thr Thr 20 25 30
Ser Val Asn Leu Pro Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly 35 40 45
Ser His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID N0:58:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 282 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
( i) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 113..280
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:58:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCATT CAAGAATAGT TCAAACAAGA 60
AGATTACAAA CTATCAATTT CATACACAAT ATAAACGACG GGTACCAAAA TA ATG 115
Met 1
AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA TCT 163 Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala Ser 5 10 15
CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser 20 25 30
GTC AAC TTG ATG GCT AAG AGA TTC GTT AAC CAA CAC TTG TGC GGT TCC 259 Val Asn Leu Met Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser 35 40 45 CAC TTG GTT GAA GCT TTG TAC TT 282 His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID NO:59:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:59:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr 20 25 30
Ser Val Asn Leu Met Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly 35 40 45
Ser His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID N0:60:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 330 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 113..328
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:60:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCATT CAAGAATAGT TCAAACAAGA 60
AGATTACAAA CTATCAATTT CATACACAAT ATAAACGACG GGTACCAAAA TA ATG 115
Met 1
AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA TCT 163 Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala Ser 5 10 15 CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT AAC ACT ACT TCT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser 20 25 30
GTC AAC TTG ATG GCT GAC GAC ACT GAA TCT ATC AAC ACT ACT TTG GTT 259 Val Asn Leu Met Ala Asp Asp Thr Glu Ser He Asn Thr Thr Leu Val 35 40 45
AAC TTG CCA GGT GCT AAG AGA TTC GTT AAC CAA CAC TTG TGC GGT TCC 307
Asn Leu Pro Gly Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly Ser
50 55 60 65
CAC TTG GTT GAA GCT TTG TAC TT 330 His Leu Val Glu Ala Leu Tyr 70
(2) INFORMATION FOR SEQ ID NO:61:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 72 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:61:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val *Leu Gly Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr 20 25 30
Ser Val Asn Leu Met Ala Asp Asp Thr Glu Ser He Asn Thr Thr Leu 35 40 45
Val Asn Leu Pro Gly Ala Lys Arg Phe Val Asn Gin His Leu Cys Gly 50 55 60
Ser His Leu Val Glu Ala Leu Tyr 65 70
(2) INFORMATION FOR SEQ ID N0:62:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 288 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS (B) LOCATION: 113..286
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCATT CAAGAATAGT TCAAACAAGA 60
AGATTACAAA CTATCAATTT CATACACAAT ATAAACGACG GGTACCAAAA TA ATG 115
Met 1
AAA CTG AAA ACT GTA AGA TCT GCG GTC CTT TCG TCA CTC TTT GCA TCT 163 Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala Ser 5 10 15
CAG GTC CTT GGC CAA CCA ATT GAC GAC ACT GAA TCT ATC AAC ACT ACT 211 Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser He Asn Thr Thr 20 25 30
TTG GTC AAC TTG CCA GGT GCT AAG AGA TTC GTT AAC CAA CAC TTG TGC 259 Leu Val Asn Leu Pro Gly Ala Lys Arg Phe Val Asn Gin His Leu Cys 35 40 45
GGT TCC CAC TTG GTT GAA GCT TTG TAC TT 288 Gly Ser His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID NO:63:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 58 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:63:
Met Lys Leu Lys Thr Val Arg Ser Ala Val Leu Ser Ser Leu Phe Ala 1 5 10 15
Ser Gin Val Leu Gly Gin Pro He Asp Asp Thr Glu Ser He Asn Thr 20 25 30
Thr Leu Val Asn Leu Pro Gly Ala Lys Arg Phe Val Asn Gin His Leu 35 40 45
Cys Gly Ser His Leu Val Glu Ala Leu Tyr 50 55
(2) INFORMATION FOR SEQ ID NO:64:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 82 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:64: GGTTAACGAA CTTTGGAGCT TCAGCTTCAG CTTCTTCTCT CTTAGCCATG GAGATCAAGT 60 TAACAACATC CAAAGTAGTG TT 82
(2) INFORMATION FOR SEQ ID N0:65:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (Xi) SEQUENCE DESCRIPTION: SEQ ID N0:65: CAAGTACAAA GCTTCAACCA AGTGGGAACC GCACAAGTGT TGGTTAACGA ACTT 54
(2) INFORMATION FOR SEQ ID N0:66:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:66: CAACCAATTG ACGACACTGA ATCTAACACT ACTTCTGTCA ACTTGATGGC TGACGACACT 60 GAATCTAGAT TCGCTACTAA CACTACTTTG GATGTTGTTA ACTTGATCTC CATGGCT 117
(2) INFORMATION FOR SEQ ID N0:67:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:67: Gin Pro He Asp Asp Thr Glu Ser Asn Thr Thr Ser Val Asn Leu Met 1 5 10 15
Ala Asp Asp Thr Glu Ser Arg Phe Ala Thr Asn Thr Thr Leu Ala Leu 20 25 30
Asp Val Val Asn Leu He Ser Met Ala 35 40
(2) INFORMATION FOR SEQ ID NO:68:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N0:68: TCTCTTAGCC ATGGAGATCA AGTTAACAAC ATCCAAAGCC AAAGTAGTGT T 51
(2) INFORMATION FOR SEQ ID NO:69:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69: CAACCAATTG ACGACACTGA ATCTAACACT ACTTCTGTCA ACTTGATGGC TGACGACACT 60 GAATCTAGAT TCGCTACTAA CACTACTTTG GCTTTGGATG TTGTTAACTT GATCTCCATG 120 GCT 123
(2) INFORMATION FOR SEQ ID NO:70:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:70: Lys Arg Gl u Gl u Al a Gl u Al a Gl u Al a Gl u Pro Lys Phe Val Asn Gin 1 5 10 15
His Leu Cys Gly Ser His Leu Val Gl u Al a Leu Tyr Leu Val Cys Gly 20 25 30
Gl u Arg Gly Phe Phe Tyr Thr Pro Lys Al a Al a Lys Gly He Val Glu 35 40 45
Gin Cys Cys Thr Ser H e Cys Ser Leu Tyr Gin Leu Gl u Asn Tyr Cys 50 55 60
Asn 65
(2) INFORMATION FOR SEQ ID N0:71:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 219 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:71: AAGAGAGAAG AAGCTGAAGC TGAAGCTGAA CCAAAGTTCG TTAACCAACA CTTGTGTGGT 60 TCTCACTTGG TTGAAGCTTT GTACTTGGTT TGCGGTGAAA GAGGTTTCTT CTACACTCCT 120 AAGGCTGCTA AGGGTATTGT CGAACAATGC TGTACCTCCA TCTGCTCCTT GTACCAATTG 180 GAAAACTACT GCAACTAGAC GCAGCCCGCA GGCTCTAGA 219
(2) INFORMATION FOR SEQ ID N0:72:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 348 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:72: TTAAATCTAT AACTACAAAA AACACATACA GGAATTCCAT TCAAGAATAG TTCAAACAAG 60 AAGATTACAA ACTATCAATT TCATACACAA TATAAACGAC GGTACCAAAA TAATGAAACT 120 GAAAACTGTA AGATCTGCGG TCCTTTCGTC ACTCTTTGCA TCTCAGGTCC TTGGCCAACC 180 AATTGACGAC ACTGAATCTA ACACTACTTC TGTCAACTTG ATGGCTGACG ACACTGAATC 240 TAGATTCGCT ACTAACACTA CTTTGGTTAA CTTGGCTAAC GTTGCCAACC AACACTTGTG 300 TGGTTCTCAC TTGGTTGAAG CTTTGTACTT ATGGCTAAGA GATTCGTT 348
(2) INFORMATION FOR SEQ ID NO:73:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 379 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:73:
TTAAATCTAT AACTACAAAA AACACATACA GGAATTCCAT TCAAGAATAG TTCAAACAAG 60
AAGATTACAA ACTATCAATT TCATACACAA TATAAACGAC GGTACCAAAA TAATGAAACT 120
GAAAACTGTA AGATCTGCGG TCCTTTCGTC ACTCTTTGCA TCTCAGGTCC TTGGCCAACC 180
AATTGACGAC ACTGAATCTA ACACTACTTC TGTCAACTTG ATGGCTGACG ACACTGAATC 240
TAGATTCGCT ACTAACACTA CTTTGGATGT TGTTAACTTG ATCTCCATGG CTAAGAGAGA 300
AGAAGCTGAA GCTGAAGCTG AACCAAAGTT CGTTAACCAA CACTTGTGTG GTTCTCACTT 360
GGTTGAAGCT TTGTACTTG 379

Claims

1. A DNA expression cassette comprising the following sequence:
5'-P-SP-LS-PS-*gene*-(T) ^-3'
5 wherein
P is a promoter sequence,
SP is a DNA sequence encoding a signal peptide,
LS is a DNA sequence encoding a leader peptide with the general formula I:
10 GlnProIle(Asp/Glu) (Asp/GluJX1(Glu/Asp)X2AsnZ(Thr/Ser)X3 (I)
wherein
X1 is a peptide bond or a codable amino acid; X2 is a peptide bond, a codable amino acid or a sequence of up to 4 codable amino acids which may be the same or 15 different;
Z is a codable amino acid except Pro; and
X3 is a'sequence of from 4 to 30 codable amino acids which may be the same or different;
PS is a DNA sequence encoding a processing site; 20 *gene* is a DNA sequence encoding a poiypeptide; T is a terminator sequence; and i is 0 or 1.
2. An expression cassette according to claim 1, wherein X1, in general formula I, is Ser, Thr or Ala.
253. An expression cassette according to claim 1, wherein X2, in general formula I, is Ser, Thr or Ala.
4. An expression cassette according to claim 1, wherein X2, in general formula I, is Serlle. 5. An expression cassette accordi.ng to clai•m 1, wherei-n X2, in general formula I, is SerAlalle.
6. An expression cassette according to claim 1, wherein X , in general formula I, is SerPheAlaThr.
57. An expression cassette according to claim 1, wherein X3, in general formula I, is an amino acid sequence of the general formula II
X4-X5-X6 (II)
wherein 10 X4 is a sequence of from 1 to 21 codable amino acids;
X5 is Pro or an amino acid sequence including the amino acid sequence ValAsnLeu, LeuAlaAsnValAlaMetAla, LeuAspValValAsnLeuProGly, or LeuAspValValAsnLeuIleSerMet; and X6 is a sequence of from 1 to 8 codable amino acids.
158. An expression cassette according to claim 7, wherein X4, in general formula II, is an amino acid sequence including one or more of the motifs LeuValAsnLeu, SerValAsnLeu, MetAlaAsp, ThrGluSer, ArgPheAlaThr and ValAlaMetAla.
9. An expression cassette according to claim 7 , wherein X4, 20 in general formula II, is an amino acid sequence including the sequence AsnSerThr or AsnThrThr.
10. An expression cassette according to claim 7, wherein X4, in general formula II, is an amino acid sequence including the sequence
25 (Ser/Leu)ValAsnLeu,
(Ser/Leu)Va1AsnLeuMetAlaAsp,
(Ser/Leu)ValAsnLeuMetAlaAspAsp,
(Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSer,
(Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSerlle, or 30 (Ser/Leu)ValAsnLeuMetAlaAspAspThrGluSerArgPheAlaThr. 11. An expression cassette according to claim 7, wherein X4 in general formula II, is an amino acid sequence including the sequence
Asn(Thr/Ser)ThrLeu, 5 Asn(Thr/Ser)ThrLeuAsnLeu, or Asn(Thr/Ser)ThrLeuValAsnLeu.
123 An expression cassette according to claim 7, wherein X5, in general formula II, is Pro.
13. An expression cassette according to claim 7, wherein X5, 10 in general formula II, is the amino acid sequence ValAsnLeu.
14. An expression cassette according to claim 7, wherein X5, in general formula II, is the amino acid sequence LeuAlaAsnValAlaMetAla.
15. An expression cassette according to claim 7, wherein X5, 15 in general formula II, is the amino acid sequence
LeuAspValValAsnLeuProGly.
16. An expression cassette according to claim 7, wherein X5, in general formula II, is the amino acid sequence LeuAspValValAsnLeuIleSerMet.
20 17. An expression cassette according to claim 7, wherein X6, in general formula II, is Ala, Gly, Leu, Thr, Val or Ser.
18. An expression cassette according to claim 7, wherein X6, in general formula II, is GlyAla or SerAla.
19. An expression cassette according to claim 7, wherein X6, 25 in general formula II, is AlaValAla.
20. An expression cassette according to claim 7, wherein X6, in general formula II, is GlyAlaAspSerLysThrValGlu. 21. An expression cassette according to claim 1, wherein the leader peptide coded for by the DNA sequence LS is selected from the group comprising:
SEQ ID No. 1 GlnProIleAspGluAspAsnAspThrSerValAsnLeuProAla;
SEQ ID No. 2 GlnProIleAspAspGluAsnThrThrSerValAsnLeuProAla;
SEQ ID No. 3 GlnProIleAspAspGluSerAsnThrThrSerValAsnLeuPro- Ala;
SEQ ID No. 4 GlnProIleAspAspGluAsnThrThrSerValAsnLeuProVal;
SEQ ID No. 5 GlnProIleAspAspThrGluAsnThrThrSerValAsnLeuPro- Ala;
SEQ ID No. 6 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- ProAla;
SEQ ID No. 7 GlnProIleAspAspGluAsnThrThrSerValAsnLeuMetAla;
SEQ ID No. 8 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu ProGlyAla;
SEQ ID No. 9 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAla;
SEQ ID No. 10 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnVal- ProThr;
SEQ ID No. 11 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnVal-
ProThr;
SEQ ID No. 12 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- ProThr; SEQ ID No. 13 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnVal- ProGlyAla;
SEQ ID No. 14 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaProAlaValAla;
SEQ ID No. 15 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu-
MetAspLeuAlaValGlyLeuProGlyAla;
SEQ ID No. 16 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- ProGlyAla;
SEQ ID No. 17 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsn-
LeuProGlyAla;
SEQ ID No. 18 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnLeu- ProGlyAla;
SEQ ID No. 19 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu
ValAsnLeuProLeu;
SEQ ID No. 20 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- AlaAsnValAlaMetAla;
SEQ ID No. 21 GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuVal-
AsnLeuProGlyAla;
SEQ ID No. 22 GlnProIleAspAspThrGluSerPheAlaThrAsnThrThr- LeuValAsnLeuProGlyAla;
SEQ ID No. 23 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsn- LeuMetAlaAspAspThrGluSerArgPheAlaThrAsnThrThr-
LeuValAsnLeuProLe ; SEQ ID No. 24 GlnProIleAspAspThrGluSerlleAsnThrThrLeuVal-
AsnLeuMetAlaAspAspThrGluSerArgPheAlaThrAsnThr- ThrLeuAspValValAsnLeuProGlyAla;
SEQ ID No. 25 GlnProIleAspAspThrGluSerAlaAlalleAsnThrThrLeu- ValAsnLeuProGlyAla;
SEQ ID No. 26 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu- ValAsnLeuAlaAsnValAlaMetAla;
SEQ ID No. 27 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu-
AspVal-ValAsnLeuIleSerMetAla;
SEQ ID No. 28 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAsnThrThrGluSerArgPheAlaThrAsnThrThrLeu- AspValValAsnLeuIleSerMetAla; and
SEQ ID No. 67 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu-
MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu- AlaLeuAspValValAsnLeuIleSerMetAlaLysArg.
Particularly preferred leader peptides coded for by the DNA sequence LS are:
SEQ ID No. 15 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu
MetAspLeuAlaValGlyLeuProGlyAla;
SEQ ID No. 16 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- ProGlyAla;
SEQ ID No. 17 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsn-
LeuProGlyAla;
SEQ ID No. 18 GlnProIleAspAspThrGluSerAsnThrThrLeuValAsnLeu ProGlyAla;
SEQ ID No. 19 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu- ValAsnLeuProLeu;
SEQ ID No. 20 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu-
MetAlaAspAspThrGluSerlleAsnThrThrLeuValAsnLeu- AlaAsnValAlaMetAla;
SEQ ID No. 21 GlnProIleAspAspThrGluSerAlalleAsnThrThrLeuVal- AsnLeuProGlyAla;
SEQ ID No. 22 GlnProIleAspAspThrGluSerPheAlaThrAsnThrThrLeu-
ValAsnLeuProGlyAla;
SEQ ID No. 23 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsn- LeuMetAlaAspAspThrGluSerArgPheAlaThrAsnThrThr- LeuValAsnLeuProLeu;
SEQ ID No. 24 GlnProIleAspAspThrGluSerlleAsnThrThrLeuValAsn-
LeuMetAlaAspAspThrGluSerArgPheAlaThrAsnThrThr- LeuAspValValAsnLeuProGlyAla;
SEQ ID No. 25 GlnProIleAspAspThrGluSerAlaAlalleAsnThrThrLeu- ValAsnLeuProGlyAla;
SEQ ID No. 26 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu-
MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu- ValAsnLeuAlaAsnValAlaMetAla; and
SEQ ID No. 28 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAsnThrThrGluSerArgPheAlaThrAsnThrThrLeu- AspValValAsnLeuIleSerMetAla.
SEQ ID No. 67 GlnProIleAspAspThrGluSerAsnThrThrSerValAsnLeu- MetAlaAspAspThrGluSerArgPheAlaThrAsnThrThrLeu- AlaLeuAspValValAsnLeuIleSerMetAlaLysArg.
22. An expression cassette according to claim 1, wherein SP is a DNA sequence encoding the α-factor signal peptide, the signal peptide of mouse salivary amylase, the
5 carboxypeptidase signal peptide, the yeast aspartic protease 3 signal peptide or the yeast BAR1 signal peptide.
23. An expression cassette according to claim 1, wherein PS is a DNA sequence encoding LysArg, ArgLys, ArgArg, LysLys or Ile-GluGlyArg.
1024. An expression cassette according to claim 1, wherein the poiypeptide is selected from the group consisting of aprotinin, tissue factor pathway inhibitor, or other protease inhibitors, insulin or insulin precursors, insulin-like growth factor I, insulin-like growth factor II, human or
15 bovine growth hormone, interleukin, glucagon, glucagon-like peptide 1, tissue plasminogen activator, transforming growth factor α or β , platelet-derived growth factor, enzymes, or a functional analogue thereof.
25. A yeast expression vector comprising an expression 0 cassette according to any of the preceding claims.
26. A yeast cell which is capable of expressing a poiypeptide and which is transformed with a yeast expression vector according to claim 25.
27. A process for producing a poiypeptide in yeast, the 5 process comprising culturing a yeast cell, which is capable of expressing the desired poiypeptide and which is transformed with a yeast expression vector according to claim 25, in a suitable medium to obtain expression and secretion of the poiypeptide, after which the poiypeptide is recovered 0 from the medium.
SUBSTITUTESHEET
EP95922439A 1994-06-16 1995-06-16 Synthetic leader peptide sequences Expired - Lifetime EP0763117B1 (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
DK70594 1994-06-16
DK70594 1994-06-16
DK705/94 1994-06-16
US28285294A 1994-07-29 1994-07-29
US282852 1994-07-29
PCT/DK1995/000249 WO1995034666A1 (en) 1994-06-16 1995-06-16 Synthetic leader peptide sequences

Publications (2)

Publication Number Publication Date
EP0763117A1 true EP0763117A1 (en) 1997-03-19
EP0763117B1 EP0763117B1 (en) 2001-11-14

Family

ID=26064490

Family Applications (1)

Application Number Title Priority Date Filing Date
EP95922439A Expired - Lifetime EP0763117B1 (en) 1994-06-16 1995-06-16 Synthetic leader peptide sequences

Country Status (20)

Country Link
US (2) US5639642A (en)
EP (1) EP0763117B1 (en)
JP (1) JP3676369B2 (en)
KR (1) KR100251054B1 (en)
CN (1) CN1115413C (en)
AT (1) ATE208823T1 (en)
AU (1) AU693569B2 (en)
BR (1) BR9508033A (en)
CZ (1) CZ290069B6 (en)
DE (1) DE69523915T2 (en)
DK (1) DK0763117T3 (en)
ES (1) ES2168367T3 (en)
FI (1) FI965005A (en)
HU (1) HU221118B1 (en)
MX (1) MX9606492A (en)
NO (1) NO965362L (en)
PL (1) PL317722A1 (en)
PT (1) PT763117E (en)
UA (1) UA40648C2 (en)
WO (1) WO1995034666A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2460795C1 (en) * 2011-07-06 2012-09-10 Федеральное государственное унитарное предприятие "Государственный научно-исследовательский институт генетики и селекции промышленных микроорганизмов" (ФГУП "ГосНИИгенетика") Method for microbiological synthesis of secreted human somatotropin and yeast strain saccharomyces cerevisiae - secreted human somatotropin producer

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2686899B1 (en) * 1992-01-31 1995-09-01 Rhone Poulenc Rorer Sa NOVEL BIOLOGICALLY ACTIVE POLYPEPTIDES, THEIR PREPARATION AND PHARMACEUTICAL COMPOSITIONS CONTAINING THEM.
US6500645B1 (en) 1994-06-17 2002-12-31 Novo Nordisk A/S N-terminally extended proteins expressed in yeast
ZA9610456B (en) * 1995-12-20 1997-06-20 Novo Nordisk As N-terminally extended proteins expressed in yeast
US6017731A (en) 1996-12-13 2000-01-25 Chiron Corporation Method for expression of heterologous proteins in yeast
AU7873798A (en) * 1996-12-20 1998-07-17 Novo Nordisk A/S N-terminally extended proteins expressed in yeast
JP2001527387A (en) 1997-01-24 2001-12-25 ノボ ノルディスク アクティーゼルスカブ Synthetic leader peptide sequence
CA2205076A1 (en) 1997-05-14 1998-11-14 Jim Hu Episomal expression cassettes for gene therapy
EP1049790A1 (en) 1998-01-23 2000-11-08 Novo Nordisk A/S Process for making desired polypeptides in yeast
ATE474930T1 (en) 1999-12-29 2010-08-15 Novo Nordisk As METHOD FOR PRODUCING INSULIN PRECURSORS AND ANALOGUES OF INSULIN PRECURSORS WITH IMPROVED FERMENTATION YIELD IN YEAST
US20050100991A1 (en) * 2001-04-12 2005-05-12 Human Genome Sciences, Inc. Albumin fusion proteins
JP2003530838A (en) 2000-04-12 2003-10-21 ヒューマン ゲノム サイエンシズ インコーポレイテッド Albumin fusion protein
WO2002097038A2 (en) * 2001-05-25 2002-12-05 Human Genome Sciences, Inc. Chemokine beta-1 fusion proteins
WO2005003296A2 (en) 2003-01-22 2005-01-13 Human Genome Sciences, Inc. Albumin fusion proteins
CA2471363C (en) 2001-12-21 2014-02-11 Human Genome Sciences, Inc. Albumin fusion proteins
CA2484556A1 (en) * 2001-12-21 2003-07-24 Human Genome Sciences, Inc. Albumin fusion proteins
GB0217033D0 (en) * 2002-07-23 2002-08-28 Delta Biotechnology Ltd Gene and polypeptide sequences
WO2005047508A1 (en) 2003-11-14 2005-05-26 Novo Nordisk A/S Processes for making acylated insulin
JP5697831B2 (en) 2003-12-03 2015-04-08 ノヴォ ノルディスク アー/エス Single chain insulin
NZ548612A (en) * 2004-02-09 2009-11-27 Human Genome Sciences Inc Albumin fusion proteins comprising tandem GLP-1
US7527947B2 (en) 2004-06-14 2009-05-05 Novozymes A/S Signal peptide for producing a polypeptide
EP1917363B1 (en) 2005-08-16 2011-06-22 Novo Nordisk A/S Method for making mature insulin polypeptides
US20090068158A1 (en) * 2005-12-09 2009-03-12 Medin Jeffrey A Thymidylate kinase mutants and uses thereof
CA2566267A1 (en) * 2005-12-09 2007-06-09 University Health Network Thymidylate kinase mutants and uses thereof
ES2387955T3 (en) 2006-02-27 2012-10-04 Novo Nordisk A/S Insulin derivatives
WO2007098607A1 (en) 2006-03-03 2007-09-07 Amorfix Life Sciences Ltd. Methods and compositions to treat and detect misfolded-sod1 mediated diseases
PT2074141T (en) 2006-09-22 2016-11-10 Novo Nordisk As Protease resistant insulin analogues
EP2069502B1 (en) 2006-09-27 2014-02-26 Novo Nordisk A/S Method for making maturated insulin polypeptides
CA2584494A1 (en) * 2007-03-27 2008-09-27 Jeffrey A. Medin Vector encoding therapeutic polypeptide and safety elements to clear transduced cells
EP2150618B1 (en) 2007-05-04 2017-10-11 University Health Network Il-12 immunotherapy for cancer
WO2008139496A1 (en) * 2007-05-16 2008-11-20 Bigtec Private Limited Recombinant human insulin and a method thereof
JP5688969B2 (en) 2007-07-16 2015-03-25 ノボ・ノルデイスク・エー/エス Pegylated insulin analogues are stable against proteases
PL2254906T3 (en) 2008-03-18 2017-04-28 Novo Nordisk A/S Protease stabilized, acylated insulin analogues
US8568709B2 (en) 2008-03-20 2013-10-29 University Health Network Thymidylate kinase fusions and uses thereof
WO2011089170A2 (en) 2010-01-22 2011-07-28 Novo Nordisk A/S Process for preparing fgf-21 with low degree of o-glycosylation
US8815541B2 (en) 2009-11-25 2014-08-26 Novo Nordisk A/S Method for making polypeptides
WO2014078819A2 (en) 2012-11-16 2014-05-22 Transposagen Biopharmaceuticals, Inc. Site-specific enzymes and methods of use
US9707276B2 (en) 2012-12-03 2017-07-18 Merck Sharp & Dohme Corp. O-glycosylated carboxy terminal portion (CTP) peptide-based insulin and insulin analogues
US20160115216A1 (en) 2013-06-07 2016-04-28 Novo Nordisk A/S Method for Making Mature Insulin Polypeptides
US20170151281A1 (en) 2015-02-19 2017-06-01 Batu Biologics, Inc. Chimeric antigen receptor dendritic cell (car-dc) for treatment of cancer
US10562951B2 (en) 2015-03-10 2020-02-18 Merck Sharp & Dohme Corp. Process for preparing recombinant insulin using microfiltration
CA2989827A1 (en) 2015-06-17 2016-12-22 Poseida Therapeutics, Inc. Compositions and methods for directing proteins to specific loci in the genome
CN105418755A (en) * 2015-12-28 2016-03-23 珠海冀百康生物科技有限公司 Quick-acting insulin aspart precursor protein and preparation method for quick-acting insulin
WO2019126578A1 (en) 2017-12-20 2019-06-27 Poseida Therapeutics, Inc. Compositions and methods for directing proteins to specific loci in the genome
RU2020128190A (en) * 2018-02-09 2022-03-09 Цзянсу Хэнжуй Медицин Ко., Лтд. CODON-OPTIMIZED PRECURSOR GENE AND SIGNAL PEPTIDE GENE OF HUMAN INSULIN ANALOGUE
JP2023549140A (en) 2020-11-04 2023-11-22 マイエロイド・セラピューティクス,インコーポレーテッド Engineered chimeric fusion protein compositions and methods of use thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1340772C (en) * 1987-12-30 1999-09-28 Patricia Tekamp-Olson Expression and secretion of heterologous protiens in yeast employing truncated alpha-factor leader sequences
US5037743A (en) * 1988-08-05 1991-08-06 Zymogenetics, Inc. BAR1 secretion signal
DK300090D0 (en) * 1990-12-19 1990-12-19 Novo Nordisk As PROCEDURE FOR PREPARING LEADER SEQUENCES

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9534666A1 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2460795C1 (en) * 2011-07-06 2012-09-10 Федеральное государственное унитарное предприятие "Государственный научно-исследовательский институт генетики и селекции промышленных микроорганизмов" (ФГУП "ГосНИИгенетика") Method for microbiological synthesis of secreted human somatotropin and yeast strain saccharomyces cerevisiae - secreted human somatotropin producer

Also Published As

Publication number Publication date
EP0763117B1 (en) 2001-11-14
US5639642A (en) 1997-06-17
US5795746A (en) 1998-08-18
DK0763117T3 (en) 2002-02-18
DE69523915T2 (en) 2002-08-22
HUT76378A (en) 1997-08-28
HU221118B1 (en) 2002-08-28
AU2733495A (en) 1996-01-05
NO965362L (en) 1997-02-13
AU693569B2 (en) 1998-07-02
CN1115413C (en) 2003-07-23
WO1995034666A1 (en) 1995-12-21
BR9508033A (en) 1999-06-29
PL317722A1 (en) 1997-04-28
ES2168367T3 (en) 2002-06-16
HU9603477D0 (en) 1997-02-28
ATE208823T1 (en) 2001-11-15
NO965362D0 (en) 1996-12-13
DE69523915D1 (en) 2001-12-20
CZ290069B6 (en) 2002-05-15
UA40648C2 (en) 2001-08-15
FI965005A0 (en) 1996-12-13
PT763117E (en) 2002-05-31
CN1154143A (en) 1997-07-09
CZ364196A3 (en) 1997-05-14
JPH10501413A (en) 1998-02-10
KR100251054B1 (en) 2000-04-15
FI965005A (en) 1997-02-13
MX9606492A (en) 1997-03-29
JP3676369B2 (en) 2005-07-27

Similar Documents

Publication Publication Date Title
EP0763117B1 (en) Synthetic leader peptide sequences
AU660161B2 (en) A method of constructing synthetic leader sequences
US6214547B1 (en) Synthetic leader peptide sequences
JP3730255B2 (en) N-terminal extended protein expressed in yeast
US6500645B1 (en) N-terminally extended proteins expressed in yeast
WO1989002463A1 (en) Synthetic yeast leader peptides
WO1995002059A1 (en) A dna construct encoding the yap3 signal peptide
EP0946735B1 (en) N-terminally extended proteins expressed in yeast
EP0868523B1 (en) Vector for expression of n-terminally extended proteins in yeast cell
CA2192942C (en) Synthetic leader peptide sequences
RU2167939C2 (en) Method of polypeptide producing in yeast
CA2337635A1 (en) Method of making proteins in transformed yeast cells

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19970116

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LI LU NL PT SE

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

17Q First examination report despatched

Effective date: 20010411

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LI LU NL PT SE

REF Corresponds to:

Ref document number: 208823

Country of ref document: AT

Date of ref document: 20011115

Kind code of ref document: T

RIN1 Information on inventor provided before grant (corrected)

Inventor name: VAD, KNUD

Inventor name: KJELDSEN, THOMAS BOERGLUM

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 69523915

Country of ref document: DE

Date of ref document: 20011220

REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

REG Reference to a national code

Ref country code: CH

Ref legal event code: NV

Representative=s name: R. A. EGLI & CO. PATENTANWAELTE

REG Reference to a national code

Ref country code: DK

Ref legal event code: T3

ET Fr: translation filed
REG Reference to a national code

Ref country code: PT

Ref legal event code: SC4A

Free format text: AVAILABILITY OF NATIONAL TRANSLATION

Effective date: 20020213

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2168367

Country of ref document: ES

Kind code of ref document: T3

REG Reference to a national code

Ref country code: GR

Ref legal event code: EP

Ref document number: 20020400614

Country of ref document: GR

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: AT

Payment date: 20030611

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: LU

Payment date: 20030612

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: PT

Payment date: 20030623

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GR

Payment date: 20030626

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: BE

Payment date: 20030919

Year of fee payment: 9

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040616

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040616

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20041216

BERE Be: lapsed

Owner name: *NOVO NORDISK A/S

Effective date: 20040630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20050105

REG Reference to a national code

Ref country code: PT

Ref legal event code: MM4A

Free format text: LAPSE DUE TO NON-PAYMENT OF FEES

Effective date: 20041216

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20140611

Year of fee payment: 20

Ref country code: IE

Payment date: 20140610

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: CH

Payment date: 20140612

Year of fee payment: 20

Ref country code: IT

Payment date: 20140610

Year of fee payment: 20

Ref country code: DE

Payment date: 20140611

Year of fee payment: 20

Ref country code: SE

Payment date: 20140611

Year of fee payment: 20

Ref country code: ES

Payment date: 20140513

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DK

Payment date: 20140610

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: NL

Payment date: 20140610

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20140609

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69523915

Country of ref document: DE

REG Reference to a national code

Ref country code: DK

Ref legal event code: EUP

Effective date: 20150616

REG Reference to a national code

Ref country code: NL

Ref legal event code: V4

Effective date: 20150616

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20150615

REG Reference to a national code

Ref country code: IE

Ref legal event code: MK9A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20150615

REG Reference to a national code

Ref country code: SE

Ref legal event code: EUG

REG Reference to a national code

Ref country code: ES

Ref legal event code: FD2A

Effective date: 20150925

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20150616

Ref country code: ES

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20150617