IE852440L - Signal for protein transport - Google Patents

Signal for protein transport

Info

Publication number
IE852440L
IE852440L IE852440A IE244085A IE852440L IE 852440 L IE852440 L IE 852440L IE 852440 A IE852440 A IE 852440A IE 244085 A IE244085 A IE 244085A IE 852440 L IE852440 L IE 852440L
Authority
IE
Ireland
Prior art keywords
dna
sequence
gene
dna sequence
vector
Prior art date
Application number
IE852440A
Other versions
IE63262B1 (en
Inventor
Jaochem Engels
Michael Leineweber
Eugen Uhlmann
Waldemar Wetekam
Original Assignee
Hoechst Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hoechst Ag filed Critical Hoechst Ag
Publication of IE852440L publication Critical patent/IE852440L/en
Publication of IE63262B1 publication Critical patent/IE63262B1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/62Insulins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • C12N15/625DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/036Fusion polypeptide containing a localisation/targetting motif targeting to the medium outside of the cell, e.g. type III secretion

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Endocrinology (AREA)
  • Diabetes (AREA)
  • Physics & Mathematics (AREA)
  • Toxicology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Plant Pathology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Immobilizing And Processing Of Enzymes And Microorganisms (AREA)
  • Saccharide Compounds (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Medicines Containing Material From Animals Or Micro-Organisms (AREA)

Abstract

The DNA of a natural signal sequence is modified by incorporation of cleavage sites for endonucleases and is thus made suitable for incorporation in any desired vectors by the building block principle. The vectors modified in this way then bring about transport of the encoded protein out of the cytoplasm. <IMAGE>

Description

6 3 26.? In the cell, proteins are synthesized on the ribosomes which are located in the cytoplasm. Proteins which are transported out of the cytoplasm carry on the amino -terminal end a relatively short peptide chain which is 5 eliminated enzymatically on passage through the cytoplasmic membrane, whereupon the "mature" protein is produced. This short peptide sequence is called a "signal peptide" or a presequence or leader sequence.
The signal sequence located at the amino-terminal end has 10 already been characterized for a large number of secretory proteins. In general, it is composed of a hydrophobic region of about 10 to 20 amino acids, which is called the core and to whose ami no- terminal end a short peptide sequence (the pre-core) is bonded, this 15 usually having one positively charged amino acid (or several) . Between the carboxy-terminal end of the hydrophobic region and the ami no- terminal end of the "mature" transported protein there is a short peptide sequence (the post-core) which contains the splice site 2 0 and ensures that the spatial arrangement is favorable.
It is known, from U.S. Patent 4,411,994, to couple the gene for a protein which is to be expressed with a bacterial gene which codes for an extracellular or periplasmic carrier protein, in order thus to bring about 25 the transport of the desired protein out of the cytoplasm. It is necessary for this process to isolate a bacterial gene, which is intrinsic to the host, for a periplasmic, outer membrane protein or am extracellular protein. This gene is then cut with a restriction enzyme, 30 the gene for the protein which is to be transported is inserted into the cleavage site which has been produced, and the host cell is transformed with a vector which contains the fusion gene thus formed. The isolation of the natural gene and its characterization for the selec-35 tion of suitable cleavage sites is extremely complex. This complexity is avoided according to the invention by making use of a synthetic signal sequence.
Thus the invention relates to a synthetic signal sequence for the transport of proteins in expression systems, wherein the DNA essentially corresponds to a natural signal sequence but has one or more cleavage sites for 5 endonucleases which the natural DNA does not contain. Further aspects of the invention and preferred embodiments are presented below or are set out in the patent claims.
The DNA should "essentially" correspond to that of a 10 natural signal sequence. This is to be understood to mean that the expressed signal peptide is substsuitially or completely identical to the natural signal peptide; in the latter case therefore the only difference existing at the DNA level is that the synthetic DNA has at least one 15 cleavage site which the natural DNA sequence does not contain. This incorporation of the cleavage site according to the invention thus means that there is a more or less extensive deviation from the natural sequence, it being necessary under certain circumstances to have 2 0 recourse to codons which are known to be less preferred by the particular host organism. However, surprisingly, this is not associated with any expression disadvantage. On the contrary, the specific "tailoring" of the synthetic gene is associated with so many advantages that 25 any disadvantage owing to the use of "unnatural" codons is, in general, overcompensated by far. In fact, it has emerged that replacement of the start codon GTG, which occurs in the gene for alkaline phosphatase in E. coli, by ATG leads to a great increase in expression. A 3 0 particular advantage of the invention is that the host cell has to produce less ballast protein because the gene which is to be expressed can be directly linked to the 3' end of the synthetic DNA signal sequence. Furthermore, advantages accrue insofar as it is possible in the 3 5 construction of the synthetic DNA to provide DNA sequences, which protrude at the ends, for certain restriction recognition sites which allow cloning of this sequence and, in the case of disparate recognition sites, permit defined incorporation into a cloning vector. This makes possible incorporation into any desired vectors by the "modular construction principle".
Internal recognition sites for restriction enzymes permit 5 any desired homologous or heterologous genes to be coupled on in the correct reading f reuse. It is also possible via these internal cleavage sites to introduce in a straightforward manner modifications in the DNA of the signal sequences, which lead to presequences which do 10 not occur in nature.
These internal cleavage sites are advantageously placed in the regions upstream and downstream of the hydrophobic region, in particular in the post-core region, it being possible to modify the splice site and/or its adjacent 15 region. Of course, it is also possible to modify the core region in a manner known per se.
Taking known rules into account (G. von Heijne, J. Mol. Biol. 173 (1984) 243 - 251) it is possible, via suitable cleavage sites in the gene section which codes for the 2 0 carboxy-terminal part of the prepeptide, to plan the signal peptidase splice site in such a manner that there is expression not of a fusion protein but directly of the desired, generally eukaryotic, peptide in its natural form. In general, genes of natural origin do not allow 25 processing of this type.
Suitable signal sequences are in principle all signal sequences known from the literature (M.E.E. Watson; Nucleic Acids Res. 12 (1984), 5145 - 5164), modifications thereof and "idealized" signal sequences derived 30 therefrom (D. Perlman and H.O. Halvorson; J. Mol. Biol. 167 (1983) , 391 - 409).
Preferred host organisms are E. coli, Streptomyces, Staphylococcus species such as S. aureus, Bacillus species such as B. subtilis, B.. amyloliquifaciens, B. cereus or B. licheniformis, Pseudomonas, Saccharomyces, Spodoptera frugiperda and cell lines of higher organisms, such as plant or animal cells.
In principle, it is possible to obtain by transport 5 expression all those proteins of prokaryotic or eukaryotic origin which can pass through the membrane. » However, preferred peptide products are those of pharmaceutical significance, such as hormones, lymphokines, interferons, blood-coagulation factors and vaccines, which in nature are also encoded as peptides with an amino-terminal presequence. However, in the prokaryotic host organisms this eukaryotic presequence is not, as a rule, eliminated by the signal peptidases intrinsic to the host.
In E. coli, the genes for the periplasmic and outer-membrane proteins are suitable for transport expression, the former directing the product into the periplasm whereas the latter tend to direct onto the outer membrane.
The example which is given is the DNA signal sequence of the periplasmic protein alkaline phosphatase, which is capable of high-level expression in E. coli, but there is no intention to restrict the invention to this.
The presequence including the first twenty amino acids of 25 alkaline phosphatase of E. coli is show below: 10 Met-Lys-Gln-Ser-Thr-Ile-Ala-Leu-Ala-Leu-Leu-Pro-Leu-Leu- 20 25 Phe-Thr-Pro-Val-Thr-Lys-Ala-Arg-Thr-Pro-Glu-Met-Pro-Val- 35 40 Leu-Glu-Asn-Arg-Ala-Ala-Gln-Gly-Asn-Ile-Thr-Ala-Pro It has emerged that up to about 40, usually about 20, additional amino acids of the mature protein suffice for t = preferred splice site of the signal peptidase correct processing. However, in many cases fewer additional amino acids also suffice, for example about 10, advantageously about 5. Since a shorter protein chain means fewer demands on the protein biosynthesis system of 5 the host cell, an advantageous embodiment of the invention is set out in DNA sequence I (appendix) which codes for the presequence of alkaline phosphatase and an additional 5 amino acids of the mature protein. Apart from a few triplet modifications - namely those which 10 introduce unique restriction enzyme cleavage sites and replace the start codon 6T6 by AT6 - DNA sequence I corresponds to the natural sequence for alkaline phosphatase. At the ends of the coding strand are located "protruding" DNA sequences corresponding to the restrie-15 tion endonuclease EcoR I, which permit incorporation into conventional cloning vectors, for example the commercially available plasmids such as pBR 322, pUC 8 or pUC 12. In addition, a number of other unique cleavage sites for restriction enzymes have been incorporated 20 within the gene of DNA sequence I, and these, on the one hand, make it possible for heterologous genes to be coupled on at the correct site and in the desired reading frame and, on the other hand, also permit modifications to be carried out: Restriction enzyme Cut after nucleotide No. (in the coding strand) Sau 3 A 19 Pvu I 22 Hpa II 54 ) (present in the 3 0 Nci I 54 ) natural gene) Alu I 66 Hph I 68 Ava II 70 Of course, it is also possible to construct the protrud-3 5 ing sequences in such a manner that they correspond to different restriction enzymes, and this then permits incorporation into suitable vectors in a defined orientation. In this context, the expert will give consideration to whether the complexity associated with the construction of the gene and its specific incorporation is more important them the additional work of 5 selection associated with incorporation in both orientation directions when the protruding ends are identical.
DNA sequence I can be constructed of 6 oligonucleotides 26-31 bases in length by first synthesizing them chemically and then linking them enzymatically via sticky 10 ends of 6 nucleotides. Incorporation of the synthetic gene into cloning vectors, for example into the commercially available plasmids mentioned, is carried out in a manner known per se.
As an example of the expression of a eukaryotic gene in E. coli using a presequence according to the invention, the synthesis of monkey proinsulin is described below: a DNA sequence is constructed in which the DNA sequence I, followed by the proinsulin gene (W. Wetekam et al., Gene 19 (1982) 179-183) , is arranged downstream of a chemically synthesized regulation region, composed of a bacterial promoter, a lac operator and a ribosome binding site (German Patent Application P 34 30 683.8), and 6 -14 nucleotides away from the ribosome binding site onto a following recognition sequence for EcoR I. The expressed proinsulin fusion peptide contains an additional 9 amino acids on the amino-terminal end, and these can be eliminated enzymatically or chemically.
The incorporation of the synthetic gene into pUC 8 and the construction of expression plasmids which contain 30 eukaryotic genes coupled to DNA sequence I are carried out in a manner known per se. In this context, reference may be made to the textbook by Maniatis (Molecular Cloning, Maniatis et al., Cold Spring Harbor, 1982). The transformation of the hybrid plasmids thus obtained into 35 suitable host organisms, advantageously E. coli, is likewise known per se and is described in detail in the abovementioned textbook. The isolation of the expressed proteins and their purification is likewise described.
In the examples which follow some more embodiments of the invention are specifically illustrated, from which the 5 large number of possible modifications (and combinations) is evident to the expert. Unless otherwise specified, percentage data in these examples relate to weight.
Examples 1. Chemical synthesis of a single-stranded oligo-10 nucleotide The synthesis of the structural units of the gene is illustrated by the example of structural unit la of the gene, which comprises nucleotides 1 - 2 9 of the coding strand. The nucleoside at the 3' end, in the present case 15 therefore guanosine (nucleotide No. 29) , is covalently bonded via the 3'-hydroxy group to silica gel (FRACTOSIL, supplied by Merck), by known methods (M.J. Gait et al., Nucleic Acids Res. 8 (1980) 1081 - 1096)). For this purpose, first the silica gel is reacted with 3-tri-20 ethoxysilylpropylamine with elimination of ethanol and formation of a Si-O-Si linkage. The guanisone is reacted as the NJ'-isobutyryl-3' -0-succinoyl-5' -dimethoxytrityl ether with the modified carrier in the presence of paranitrophenol and N,N'-dicyclohexylcarbodiimide, the 25 free carboxyl group of the succinoyl group acylating the amino radical of the propylamine group.
In the synthetic steps which follow, the base component is used as the monomethyl ester of the 5'-O-dimethoxy-tritylnucleoside-3'-phosphorous acid dialkylamide or 30 chloride, the adenine being in the form of the N*-benzoyl compound, the cytosine being in the form of the ^-benzoyl compound, the guanine being in the form of the N2-isobutyryl compound, and the thymine, which contains no amino group, being without a protective group. 50 mg of the polymeric carrier containing 2 /imol of boxind guanosine are treated successively with the following agents: a) b) c) d) e) f) g) h) i) j) k) 1) In this context, the term "phosphite" is to be understood to be the monomethyl ester of the deoxyribose-3' -mono-phosphorous acid, the third valency being saturated by chlorine or a tertiary amino group, for example a morpholino radical. The yields in each synthetic step can be determined after the detritylation reaction (b) in each case by spectrophotometry, measuring the absorption of the dimethoxytrityl cation at a wavelength of 496 nm.
When the synthesis of the oligonucleotide is complete, the methyl phosphate protective groups on the oligomer are eliminated using p-thiocresol and triethylamine. The oligonucleotide is then removed from the solid carrier by 3 5 treatment with ammonia for 3 hours. Treatment of the oligomers with concentrated ammonia for 2 to 3 days nitromethane saturated zinc bromide solution in nitromethane containing 1% water methanol tetrahydrofuran acetonitrile 40 fimol of the appropriate nucleoside phosphite and 200 ^mol of tetrazole in 0.5 ml of anhydrous acetonitrile (5 minutes) % acetic anhydride in tetrahydrofuran containing 40% lutidine and 10% dimethylaminopyridine (2 minutes) t e t rahydro f uran tetrahydrofuran containing 20% water and 40% lutidine 3% iodine in collidine/water/tetrahydrofuran in the ratio by volume 5:4:1 tetrahydrofuran and methanol. quantitatively eliminates the amino protective groups on the bases. The crude product thus obtained is purified by high-pressure liquid chromatography (HPLC) or by poly-acrylamide gel electrophoresis.
The other structural units lb - If of the gene are synthesized entirely correspondingly, their nucleotide sequence being evident from DNA sequence II (appendix). 2. Enzymatic linkage of the single-stranded oligonucleotides to give DNA sequence I The terminal oligonucleotides la and If are not phos-phorylated. This prevents oligomerization via the protruding ends. For the phosphorylation of oligonucleotides lb, Ic, Id and Ie, in each case 1 nmol of these compounds is treated with 5 nmol of adenosine triphosphate and four 15 units of T4 polynucleotide kinase in 20 fil of 50 mM Tris-HC1 buffer (pH 7.6), 10 mM magnesium chloride and 10 mM dithiothreitol (DTT) at 37°C for 30 minutes. The enzyme is inactivated by heating at 95°C for five minutes. The oligonucleotides la to If are then combined and 20 hybridized to give the double strand by heating them in a 20 mM KCl solution and then slowly (over the course of 2 hours) cooling to 16°C. The ligation to give the DNA fragment according to DNA sequence I is carried out by reaction in 40 /zl of 50 mM Tris-HCl buffer (20 mM 25 magnesium chloride and 10 mM DTT) using 100 units of T4 DNA ligase, at 15°C over the course of 18 hours.
The purification of the gene fragment is carried out by gel electrophoresis on a 10% polyacrylamide gel (without addition of urea, 20 x 40 cm, 1 mm thick) , the marker 30 substance used being 0X 174 DNA (supplied by BRL) cut with Hinf I, or pBR 322 cut with Hae III. 3. Incorporation of the gene fragment into pUC 8 The commercially available plasmid pUC 8 is opened in a known manner and in accordance with the manufacturer's data using the restriction endonuclease EcoR I. The digestion mixture is fractionated by electrophoresis on a 5% polyacrylamide gel in a known manner, and the DNA is 5 visualized by staining with ethidium bromide or by radioactive labeling ("nick translation" method of Maniatis, loc. cit.). The plasmid band is then cut out of the acrylamide gel and separated from the polyacrylamide by electrophoresis. 4. Incorporation of DNA sequence I into an expression plasmid The expression plasmid pWI 6 having the information for monkey proinsulin is constructed as follows: fig of the plasmid pBR 322 are digested with the 15 restriction endonucleases Eco RI and Pvu II and then the Eco RI cleavage site is filled in by a fill-in reaction using Klenow polymerase. Following fractionation by gel electrophoresis in a 5% polyacrylamide gel, the plasmid fragment of length 2293 bp can be obtained by electro- 2 0 elution (Figure 1).
The monkey preproinsulin DNA integrated in the plasmid pBR 322 (Wetekam et al., Gene 19 (1982) 179 - 183) is isolated therefrom by digestion using the restriction endonucleases Hind III and Mst I (as a fragment of about 25 1250 bp) and recloned into the plasmid pUC 9 as follows: the plasmid pUC 9 is cleaved with the enzyme Bam HI, the cleavage site is filled in in a standard fill-in reaction using Klenow polymerase (large fragment), subsequent cleavage with the restriction enzyme Hind III is carried 3 0 out, and the DNA is separated from the other DNA fragments by gel electrophoresis in a 5% polyacrylamide gel. The isolated insulin DNA fragment of length about 1250 bp is integrated into the opened plasmid.
To remove the untranslated region and the presequence, ■lithe pUC 9 plasmid thus modified is digested with Hae III, and the fragment of length 143 bp is digested with Bal 31 under limiting enzyme conditions to eliminate the last two nucleotides from the presequence. This results in the 5 first codon on the amino-terminal end being TTT, which represents phenylalanine as the first amino acid of the B chain.
An adaptor which is specific for Eco RI is now ligated onto this fragment in a blunt-end ligation reaction: a) 5' AAT TAT GAA TTC GCA ATG Eco RI TA CTT AAG CGT TAC b) 5' AAT TAT GAA TTC GCA AGA Eco RI TA CTT AAG CGT TCT In order to prevent polymerization of the adaptors they are used unphosphorylated in the ligation reaction (this being indicated in the figures by Eco RI', in the same way as recognition sequences inactivated by, for example, filling in) . The adaptor a) has a codon for methionine at 15 the end, and the adaptor b) has the codon for arginine. Thus, the gene product obtained by variant a) is amenable to removal of the bacterial portion by cleavage with cyanogen bromide, whereas variant b) allows trypsin cleavage.
The ligation product is digested with Mbo II. After fractionation by gel electrophoresis, a DNA fragment of length 79 bp having the information for amino acids Nos. 1 to 21 of the B chain is obtained.
The gene for the remaining information for the proinsulin 25 molecule (including a G-C sequence from the cloning and 21 bp from the pBR 322 connected to the stop codon) is obtained from the pUC 9 plasmid having the complete information for monkey preproinsulin by digestion with Mbo Il/Sma I and isolation of a DNA fragment of length about 240 bp. The correct ligation product of length about 320 bp (including the adaptor of 18 bp) is obtained by ligation of the two proinsulin fragments. This pro-5 insulin DNA fragment thus constructed can now be ligated together with a regulation region via the Eco RI-negative cleavage site.
Figure 2 shows the entire reaction sequence, where A, B and C denote the DNA for the particular peptide chains of 10 the proinsulin molecule, Ad denotes the (dephosphoryl-ated) adaptor (a or b) and Prae denotes the DNA for the presequence of monkey preproinsulin.
A chemically synthesized regulation region composed of a recognition sequence for Bam EI, the lac operator (0) , a 15 bacterial promoter (P) and a ribosomal binding site (RB) , and having an AT6 start codon, 6 to 14 nucleotides away from the RB and having a connected recognition sequence for Eco RI (Figure 3) is ligated, via the common Eco RI overlapping region, with the proinsulin gene fragment 2 0 obtained according to the previous example. It is advantageous to choose the following synthetic regulation region (DNA sequence Ila from Table 2, corresponding to German Patent Application P 34 30 683.8): 5' GATCCTAAATAAATTCTTGACATTTTTTAAA 3' 3' GATTTATTTAAGAACTGTAAAAAATTT 5 ' (Bam HI) P ' TAATTTGGTATAATGTGTGGAATTGTGAGCG 3' 3' ATTAAACCATATTACACACCTTAACACTCGC 5' 0 ' GAATAACAATTTCACAGAGGATCTAG 3' 3' CTTATTGTTAAAGTGTCTCCTAGATCTTAA 5' RB (Eco RI) The other synthetic regulation regions specified in Table 2 can be used likewise. However, it is also possible to choose a natural or derived (Perlman et al., loc. cit.) signal sequence known from the literature.
TABLE 2 Synthetic regulation region (coding strand): ' GOATCCTAAATAAATTCTTGACATTT1TTAA2TAATTTGGTATAAT0T3T 1aAATTG5GAGCG6T7ACAATT8C9C10GllG12T13TAl'lTT15 (ATG) 3' 1 = T or G 7 = A or C 1 1 = AG or GA 2 = A or C 8 = T or direct bond 12 = A or G 3 = G or C 9 = A or T AGA 13 = C or T A = G or A 1 0 = : A, TTTAAA, AAGCTT U = GAA or AGC = T or C o r AAGCTA = C or direct bond 1 6 = C, GA o r GAA H DNA sequences 11 a-h • I 1 2 6 7 8 11 12 13 1M 3 = 0 Ila T A T GAA A T A AG A C OAA c 0 b T A T GAA A T TTTAAA AG A C GAA c 9 = A c 0 C T GAA A T TTTAAA AO A C GAA c d G C T GAA A - AAGCTT AO A C GAA c e G C C GAA A — AAGCTT AO A C OAA c f G C T C C — AAGCTT AO A C OAA c G G C T GA C — AAGCTT AG A C OAA c h G C T GA C — AAGCTA OA 0 T AOC — .♦ C Following double digestion with Sma I/Bam EI and a fill-in reaction of the Bam HI cleavage site with the Klenow fragment, the ligation product (about 420 bp) is isolated by gel electrophoresis.
The fragment thus obtained can then, by a blunt-end ligation, be ligated into the pBR 322 part-plasmid of Figure 1 (Figure 4) . The hybrid plasmid pWI 6 is obtained.
After transformation into the E. coli strain EB 101 and 10 selection on ampicillin plates, the plasmid DNA of individual clones was tested for the integration of a 420 bp fragment having the regulation region and the proinsulin gene shortened by Bal 31. In order to demonstrate the correct shortening of the proinsulin gene by 15 Bal 31 (Figure 2) , the plasmids having the integrated proinsulin gene fragment were sequenced starting from the Eco RI cleavage site. Of 60 sequenced clones, three had the desired shortening by two nucleotides (Figure 4). 1 of the plasmid pWI 6 is cut with the restriction 2 0 enzyme Eco RI and then ligated together in the presence of 30 ng of DNA sequence I, at 16°C in 6 hours. After transformation into E. coli HB 101, plasmids are isolated from individual clones and tested for integration of DNA sequence I by means of restriction enzyme analysis. 7% of 25 the clones contained the plasmid pWI 6 with integrated DNA sequence I.
The direction of this integration reaction can be unambiguously determined by standard methods of restriction enzyme analysis via double digestion with Hind III/Pvu I. 3 0 The plasmid pWI 6 having a DNA sequence I integrated in the correct direction of reading to the proinsulin gene is shown as pWIP 1 in Figure 5.
This plasmid can then be transformed into various E. coli strains in order to test the synthetic capacity of the individual strains.
The expression of the presequence-proinsulin gene fusion in £. coli is determined as follows: 1 ml of a bacterial culture induced with IPTG (isopropyl 5 /5-D-thiogalactopyranoside) is stopped using PMSF (phenyl- methylsulfonyl fluoride) in a final concentration of SxlO"4 M at an optical density ODS00 of 1.0 and at an induction time of 1 hour, cooled in ice and spun down. The cell sediment is then washed in 1 ml of buffer (10 mM 10 Tris-HCl, pH 7.6; 40 mM NaCl), spun down and resuspended in 200 /xl of buffer (20% sucrose; 20 mM Tris-HCl, pH 8.0; 2 mM EDTA) , incubated at room temperature for 10 minutes, spun down and immediately resuspended in 500 ^1 of double-distilled H30. After incubation in ice for 10 minutes, the shock-lysed bacteria are spun down and the supernatant is frozen. The proinsulin content of this supernatant is tested by a standard insulin RIA (Amersham) .
The bacterial sediment is resuspended once more in 200 /xl 20 of lysozyme buffer (20% sucrose; 2 mg/ml lysozyme; 20 mM Tris-HCl, pH 8.0; 2 mM EDTA), incubated in ice for 30 minutes, sonicated 3 x 10 seconds and then spun down. The supernatant resulting from this is tested for the content of proinsulin, as a plasma fraction, in a radio-25 immunoassay.
Individual bacterial clones which contain the plasmid pWIP 1 were examined for their synthetic capacity and their ability to transport the proinsulin-presequence product. It was possible to demonstrate that all the 30 bacterial clones, as expected, transported about 90% of the produced proinsulin into the periplasmic space. About 10% of the RIA activity of proinsulin was still found in the plasma fraction.
DNA sequence I Triplet No. 12 3 Amino acid No. Met Lys Gin Nucleotide No. .5 10 Coding strand 5' AA TTC ATG AAA CAA non-cod. strand 3' g TAC TTT GTT 4 Ser 6 Thr lie 7 Ala 8 Leu 9 Ala Leu 11 Leu 12 13 Pro Leu 20 25 30 35 40 AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT 14 15 16 17 18 19 20 21 22 23 Leu Phe Thr Pro Val Thr Lys Ala Arg Thr 45 50 55 60 65 70 CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG 24 25 26 Pro Glu Met 75 80 84 CCA GAA ATG G 3' GGT CTT TAC CTT AA 5' DNA sequence II: 4 Ia > ' AA TTC ATG AAA CAA AGC ACG ATC GCA CTG 3' G TAC TTT GTT TCG TGC TAG CGT GAC Eco RI 4 lb t CA CTC CGT GAG TTA AAT « CCG GGC - Ic TTA AAT CTG GAC TTT AAA Id - ACC TGG CCG GGC STG AC A CAC TGT AAA TTT « GCT CGA Ie - CGG GCC ACC TGG - If ► Eco RI CCA GAA ATG G GGT CTT TAC CTT AA

Claims (8)

- 19 - Patent Claims
1. A synthetic DNA for the transport of proteins in expression systems, which DNA codes for a natural 5 signal sequence but has one or more cleavage sites for endonucleases which the natural DNA does not contain, and which DNA contains at the 3' end at least 5 and up to about 40 of the amino-terminal coding codons of that structural gene which 10 naturally follows the signal sequence, the amino acid sequence of the required protein being connected thereto.
2. A DNA as claimed in claim 1, which contains internal cleavage sites upstream and/or downstream of the 15 hydrophobic region.
3. A DNA as claimed in claim 1 or 2, which essentially codes for the natural signal sequence of the alkaline phosphatase of E. coli.
4. DNA sequence I 20 Triplet No. 12 3 Amino acid No. Met Lys Gin Nucleotide No. 5 10 Coding strand 5' AA TTC ATG AAA CAA Noncoding strand 3' G TAC TTT GTT 25 4 5 6 7 8 9 10 11 12 13 Ser Thr lie Ala Leu Ala Leu Leu Pro Leu 15 20 25 30 35 40 AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT 30 14 15 16 17 18 19 20 21 22 23 Leu Phe Thr Pro Val Thr Lys Ala Arg Thr - 20 - 45 50 55 60 65 70 CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG 24 25 26 Pro Glu Met 75 80 84 CCA GAA ATG G 3' GGT CTT TAC CTT AA A process for the transport expression of eukaryotic, prokaryotic or viral proteins in prokaryotic cells, which comprises coupling onto the gene for the proteins which is to be transported onto a DNA sequence as claimed in claim 1 to 4, incorporating this fusion gene into a vector, and transforming therewith a host cell which tramsports the expressed protein out of the cytoplasm. The process as claimed in claim 5, wherein the synthetic DNA sequence encodes a signal protein intrinsic to the host. A hybrid vector comprising a DNA sequence as claimed in claim 1 to 4. A hybrid plasmid containing the DNA sequence I as claimed in claim 4 inserted in an Eco RI cleavage site. A host organism containing a vector as claimed in claim 7 or 8. E. coli containing a vector as claimed in claim 7 or 8.
5. A synthetic DNA as claimed in claim 1, substantially as hereinbefore described and exemplified
6. A process as claimed in claim 5, substantially as hereinbefore described and exemplified.
7. A hybrid vector as claimed in claim 7, substantially as hereinbefore described and exemplified F. R. KELLY & CO., AGENT
8.S FOR THE APPLICANTS.
IE244085A 1984-10-06 1985-10-04 A synthetic signal sequence for the transport of proteins in expression systems IE63262B1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE19843436818 DE3436818A1 (en) 1984-10-06 1984-10-06 SYNTHETIC SIGNAL SEQUENCE FOR TRANSPORTING PROTEINS IN EXPRESSION SYSTEMS

Publications (2)

Publication Number Publication Date
IE852440L true IE852440L (en) 1986-04-06
IE63262B1 IE63262B1 (en) 1995-04-05

Family

ID=6247347

Family Applications (1)

Application Number Title Priority Date Filing Date
IE244085A IE63262B1 (en) 1984-10-06 1985-10-04 A synthetic signal sequence for the transport of proteins in expression systems

Country Status (15)

Country Link
EP (1) EP0177827B1 (en)
JP (1) JPS6188883A (en)
AT (1) ATE97445T1 (en)
AU (1) AU595486B2 (en)
CA (1) CA1340280C (en)
DE (2) DE3436818A1 (en)
DK (1) DK175511B1 (en)
ES (1) ES8605579A1 (en)
GR (1) GR852405B (en)
HU (1) HU197355B (en)
IE (1) IE63262B1 (en)
IL (1) IL76573A (en)
NZ (1) NZ213717A (en)
PH (1) PH30819A (en)
PT (1) PT81253B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0196864A3 (en) * 1985-03-25 1988-03-23 Cetus Corporation Alkaline phosphatase-mediated processing and secretion of recombinant proteins, dna sequences for use therein and cells transformed using such sequences
IT1196484B (en) * 1986-07-11 1988-11-16 Sclavo Spa YEAST EXPRESSION AND SECRETION VECTOR, USEFUL FOR THE PREPARATION OF HETEROLOGICAL PROTEINS
US5426036A (en) * 1987-05-05 1995-06-20 Hoechst Aktiengesellschaft Processes for the preparation of foreign proteins in streptomycetes
DE68917759T2 (en) * 1988-03-18 1995-04-27 Wang Laboratories Distributed reference and change table for a virtual storage system.
CA2520415A1 (en) 2003-03-27 2004-10-14 Ptc Therapeutics, Inc. Methods of identifying compounds that target trna splicing endonuclease and uses of said compounds as anti-proliferative agents
JP4634372B2 (en) 2003-03-27 2011-02-16 ピーティーシー セラピューティクス,インコーポレーテッド Methods for identifying compounds that target tRNA splicing endonucleases and their use as antifungal agents
CA2531321A1 (en) 2003-07-02 2005-01-13 Ptc Therapeutics, Inc. Rna processing protein complexes and uses thereof

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5939899A (en) * 1982-08-27 1984-03-05 Gakuzo Tamura Novel vector
US4711844A (en) * 1983-03-09 1987-12-08 Cetus Corporation Modified signal peptides
NZ207925A (en) * 1983-04-25 1988-05-30 Genentech Inc Yeast expression vehicle consisting of a yeast promoter and signal peptide encoding region linked to a heterologus peptide coding region; expression and culture
AU3011684A (en) * 1983-05-19 1984-12-04 Public Health Research Institute Of The City Of New York, Inc., The Expression and secretion vectors & method of constructing vectors
US4663280A (en) * 1983-05-19 1987-05-05 Public Health Research Institute Of The City Of New York Expression and secretion vectors and method of constructing vectors
JPS6030687A (en) * 1983-08-01 1985-02-16 Wakunaga Seiyaku Kk Dna gene, its preparation and plasmid containing the same
DE3587205T2 (en) * 1984-07-30 1993-08-26 Wakunaga Seiyaku Kk METHOD FOR PRODUCING A PROTEIN AND VECTOR TO BE USED FOR IT, RECOMBINANT DNA AND TRANSFORMED CELL.

Also Published As

Publication number Publication date
DK175511B1 (en) 2004-11-15
PT81253A (en) 1985-11-01
DE3436818A1 (en) 1986-04-10
EP0177827A2 (en) 1986-04-16
JPS6188883A (en) 1986-05-07
PT81253B (en) 1987-11-30
CA1340280C (en) 1998-12-22
ES8605579A1 (en) 1986-03-16
IE63262B1 (en) 1995-04-05
GR852405B (en) 1986-02-04
ES547600A0 (en) 1986-03-16
ATE97445T1 (en) 1993-12-15
DK453285A (en) 1986-04-07
AU4833385A (en) 1986-04-10
PH30819A (en) 1997-10-17
DK453285D0 (en) 1985-10-04
IL76573A (en) 1992-06-21
AU595486B2 (en) 1990-04-05
EP0177827A3 (en) 1987-12-02
DE3587660D1 (en) 1993-12-23
IL76573A0 (en) 1986-02-28
HU197355B (en) 1989-03-28
EP0177827B1 (en) 1993-11-18
HUT40164A (en) 1986-11-28
NZ213717A (en) 1989-01-06

Similar Documents

Publication Publication Date Title
FI91883C (en) Process for the preparation of polypeptide with hirudin activity
Urdea et al. Chemical synthesis of a gene for human epidermal growth factor urogastrone and its expression in yeast.
DK172618B1 (en) Fusion proteins, their preparation and use
EP0046039B2 (en) Synthetic urogastrone gene, corresponding plasmid recombinants, transformed cells, production thereof and urogastrone expression
IL95495A (en) Fusion proteins their preparation and use
AU607209B2 (en) Secretion of insulin-like growth factor-i in e. coli
KR920009505B1 (en) Method for preparation of polypeptide having human gamma inf activity
EP0163406B1 (en) Novel polypeptide expression method
US5489517A (en) Secretion of insulin-like growth factor-I in E. coli
DK166681B1 (en) DNA SEQUENCES, DNA SEQUENCES AND DNA OLIGONUCLEOTIDES FOR USE IN THE PREPARATION OF THE FIRST DNA SEQUENCES, HYBRID PLASMIDS CONTAINING DNA SEQUENCE, HOUSE CELLS WITH 2
DK171940B1 (en) Method for Preparation of Polypeptides with C-Terminal Carboxamide Group, Polypeptides for Use in the Method, Fusion Proteins Thereto and Suitable DNA Sequences, Plasmids and Host Organisms
IE852440L (en) Signal for protein transport
CA1224167A (en) Preparation of secretin
US5545564A (en) Vector for expression and secretion of polypeptide microorganism transformed by the vector and production of polypeptide with the same microorganism
AU600229B2 (en) Modification of the DNA sequemce between the shine-dalgarno sequence and the start codon of the TRP operon to increase protein expression

Legal Events

Date Code Title Description
MK9A Patent expired