CA1340280C - Synthetic signal sequence for the transport of proteins in expression systems - Google Patents
Synthetic signal sequence for the transport of proteins in expression systemsInfo
- Publication number
- CA1340280C CA1340280C CA000492345A CA492345A CA1340280C CA 1340280 C CA1340280 C CA 1340280C CA 000492345 A CA000492345 A CA 000492345A CA 492345 A CA492345 A CA 492345A CA 1340280 C CA1340280 C CA 1340280C
- Authority
- CA
- Canada
- Prior art keywords
- signal sequence
- dna
- sequence
- gene
- coli
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 58
- 108010076504 Protein Sorting Signals Proteins 0.000 title claims abstract description 27
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 24
- 238000003776 cleavage reaction Methods 0.000 claims abstract description 21
- 230000007017 scission Effects 0.000 claims abstract description 21
- 239000013598 vector Substances 0.000 claims abstract description 15
- 210000000805 cytoplasm Anatomy 0.000 claims abstract description 6
- 108010042407 Endonucleases Proteins 0.000 claims abstract description 3
- 102000004533 Endonucleases Human genes 0.000 claims abstract description 3
- 108020004414 DNA Proteins 0.000 claims description 31
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 27
- 239000013612 plasmid Substances 0.000 claims description 27
- 241000588724 Escherichia coli Species 0.000 claims description 13
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 claims description 12
- 210000004027 cell Anatomy 0.000 claims description 12
- 230000032258 transport Effects 0.000 claims description 11
- 108020004705 Codon Proteins 0.000 claims description 9
- 102000002260 Alkaline Phosphatase Human genes 0.000 claims description 7
- 108020004774 Alkaline Phosphatase Proteins 0.000 claims description 7
- 238000000034 method Methods 0.000 claims description 6
- 230000004927 fusion Effects 0.000 claims description 5
- 230000002209 hydrophobic effect Effects 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 238000011144 upstream manufacturing Methods 0.000 claims description 3
- 108010067390 Viral Proteins Proteins 0.000 claims description 2
- 230000008878 coupling Effects 0.000 claims description 2
- 238000010168 coupling process Methods 0.000 claims description 2
- 238000005859 coupling reaction Methods 0.000 claims description 2
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 2
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 238000010348 incorporation Methods 0.000 abstract description 12
- 238000010276 construction Methods 0.000 abstract description 6
- 108010076181 Proinsulin Proteins 0.000 description 20
- 239000012634 fragment Substances 0.000 description 19
- 108091008146 restriction endonucleases Proteins 0.000 description 15
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- 150000001413 amino acids Chemical class 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 108090000765 processed proteins & peptides Proteins 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000033228 biological regulation Effects 0.000 description 7
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 241000282693 Cercopithecidae Species 0.000 description 6
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 6
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000001502 gel electrophoresis Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 229920002401 polyacrylamide Polymers 0.000 description 5
- OISVCGZHLKNMSJ-UHFFFAOYSA-N 2,6-dimethylpyridine Chemical compound CC1=CC=CC(C)=N1 OISVCGZHLKNMSJ-UHFFFAOYSA-N 0.000 description 4
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- 108010066381 preproinsulin Proteins 0.000 description 4
- 210000003705 ribosome Anatomy 0.000 description 4
- -1 succinoyl group Chemical group 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 3
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 108700005078 Synthetic Genes Proteins 0.000 description 3
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 229940029575 guanosine Drugs 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108010077805 Bacterial Proteins Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241001131785 Escherichia coli HB101 Species 0.000 description 2
- 101150087426 Gnal gene Proteins 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- 101710116435 Outer membrane protein Proteins 0.000 description 2
- 108010090127 Periplasmic Proteins Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 229910021529 ammonia Inorganic materials 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- HPYNZHMRTTWQTB-UHFFFAOYSA-N dimethylpyridine Natural products CC1=CC=CN=C1C HPYNZHMRTTWQTB-UHFFFAOYSA-N 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000009585 enzyme analysis Methods 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 229910052740 iodine Inorganic materials 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 210000004379 membrane Anatomy 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- LYGJENNIWJXYER-UHFFFAOYSA-N nitromethane Chemical compound C[N+]([O-])=O LYGJENNIWJXYER-UHFFFAOYSA-N 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 210000001322 periplasm Anatomy 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- OJMIONKXNSYLSR-UHFFFAOYSA-N phosphorous acid Chemical compound OP(O)O OJMIONKXNSYLSR-UHFFFAOYSA-N 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 125000006239 protecting group Chemical group 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 239000013049 sediment Substances 0.000 description 2
- 238000004904 shortening Methods 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 108010087967 type I signal peptidase Proteins 0.000 description 2
- IMVJCTWESJIARS-QWWZWVQMSA-N (2r,3r)-2,3-bis(sulfanyl)butane-1,4-diol Chemical compound OC[C@@H](S)[C@H](S)CO IMVJCTWESJIARS-QWWZWVQMSA-N 0.000 description 1
- WYTZZXDRDKSJID-UHFFFAOYSA-N (3-aminopropyl)triethoxysilane Chemical compound CCO[Si](OCC)(OCC)CCCN WYTZZXDRDKSJID-UHFFFAOYSA-N 0.000 description 1
- CPEONABTMRSIKA-UHFFFAOYSA-N 1,4$l^{2}-oxazinane Chemical compound C1COCC[N]1 CPEONABTMRSIKA-UHFFFAOYSA-N 0.000 description 1
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 1
- WLHCBQAPPJAULW-UHFFFAOYSA-N 4-methylbenzenethiol Chemical compound CC1=CC=C(S)C=C1 WLHCBQAPPJAULW-UHFFFAOYSA-N 0.000 description 1
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 1
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- 102000008072 Lymphokines Human genes 0.000 description 1
- 108010074338 Lymphokines Proteins 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- 229910002808 Si–O–Si Inorganic materials 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000473945 Theria <moth genus> Species 0.000 description 1
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 229940117913 acrylamide Drugs 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- MDFFNEOEWAXZRQ-UHFFFAOYSA-N aminyl Chemical compound [NH2] MDFFNEOEWAXZRQ-UHFFFAOYSA-N 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- HOPRXXXSABQWAV-UHFFFAOYSA-N anhydrous collidine Natural products CC1=CC=NC(C)=C1C HOPRXXXSABQWAV-UHFFFAOYSA-N 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 239000003114 blood coagulation factor Substances 0.000 description 1
- 229940019700 blood coagulation factors Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- UTBIMNXEDGNJFE-UHFFFAOYSA-N collidine Natural products CC1=CC=C(C)C(C)=N1 UTBIMNXEDGNJFE-UHFFFAOYSA-N 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000012043 crude product Substances 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000006642 detritylation reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- KNWQLFOXPQZGPX-UHFFFAOYSA-N methanesulfonyl fluoride Chemical compound CS(F)(=O)=O KNWQLFOXPQZGPX-UHFFFAOYSA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- PSHKMPUSSFXUIA-UHFFFAOYSA-N n,n-dimethylpyridin-2-amine Chemical compound CN(C)C1=CC=CC=N1 PSHKMPUSSFXUIA-UHFFFAOYSA-N 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 125000006308 propyl amino group Chemical group 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 229940070376 protein Drugs 0.000 description 1
- 238000000163 radioactive labelling Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000002798 spectrophotometry method Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- GFYHSKONPJXCDE-UHFFFAOYSA-N sym-collidine Natural products CC1=CN=C(C)C(C)=C1 GFYHSKONPJXCDE-UHFFFAOYSA-N 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 125000001302 tertiary amino group Chemical group 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 150000003536 tetrazoles Chemical class 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- VNDYJBBGRKZCSX-UHFFFAOYSA-L zinc bromide Chemical class Br[Zn]Br VNDYJBBGRKZCSX-UHFFFAOYSA-L 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
- C12N15/625—DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/036—Fusion polypeptide containing a localisation/targetting motif targeting to the medium outside of the cell, e.g. type III secretion
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Endocrinology (AREA)
- Physics & Mathematics (AREA)
- Toxicology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Diabetes (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
- Saccharide Compounds (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Immobilizing And Processing Of Enzymes And Microorganisms (AREA)
Abstract
The DNA of a natural signal sequence is modified by incorporation of cleavage sites for endonucleases and can thus be incorporated in any desired vectors by the modular construction principle. The vectors modified in this way then bring about transport of the coded protein out of the cytoplasm.
Description
- 1 - 1 e~)280 A synthetic s;gnal sequence for the transport of proteins in express;on systems In the cell, proteins are synthesized on the ribosomes which are located in the cytoplasm. Proteins wh;ch are transported out of the cytoplasm carry on the amino ter-minal end a relatively short peptide chain which is elimi-nated enzymatically on passage through the cytoplasmic membrane, whereupon the mature protein is produced.
This short peptide sequence is called a "signal peptide"
or a presequence or leader sequence.
The signal sequence located at the amino terminal end has already been characterized for a large number of secretory proteins. In general, it is composed of a hydrophobic region of about 10 to 20 amino acids, which is called the core and to whose amino term;nal end a short peptide se-quence (the pre-core) is bonded, this usually hav;ng one posit;vely charged am;no acid (or several). Between the carboxy terminal end of the hydrophobic region and the amino terminal end of the mature transported protein there is a short pept;de sequence (the post-core) wh;ch contains the splice site and ensures that the spatial arrangement is favorable.
It is known, from U.S. Patent 4,411,994, to couple the gene for a protein which is to be expressed with a bacte-rial gene which codes for an extracellular or periplasmiccarrier protein in order thus to bring about the transport of the desired protein out of the cytoplasm. It is neces-sary for this process to isolate a bacterial gene, which is intrinsic to the host, for a periplasmic, outer mem-brane protein or an extracellular protein. This gene isthen cut with a restriction enzyme, the gene for the pro-tein which is to be transported is inserted into the cut which has been produced, and the host cell ;s transformed iO~80 th a vector ~hich contains the fusion gene thus formed.
The ;solation of the natural gene and its characterization for the selection of suitable cleavage sites is extremely complex. This complexity is avoided according to the in-S vention by making use of a synthetic signal sequence.
Thus the invention relates to a synthetic signal sequence for the transport of proteins in expression systems, which comprises DNA essentially corresponding to a natural signal sequence but having one or nore cleavage sites for endonucleases ~hich are not present in the natu-ral DNA.
The invention further relates to DNA of the Formula I (see page 17).
The invention further relates to a process for he transport expression of eukaryotic, prokaryotic or viral proteins in prokaryotic and eukaryotic cells, which comprises coupling the gene for the protein which is to be transported onto a DNA sequence as described above, incorporating this fusion gene into a vector, and transforming therewith a host cell which transports the expressed protein out of the cytoplasm.
The invention further relates to a hybrid vector comprising a DNA sequence as described above and a host organism containing such vector.
The invention will now be described in further detail by reference to the appended drawings:
Figure 1 shows the digestion of the plasmid p~ 322 with the restriction endonucleases EcoR I and Pvu II and then the filling in of the EcoR I
cleavage site.
Figure 2 shows the plasmid pUC 9 containing the monkey preproinsulin DNA
and the reaction sequence for the construction of the proinsulin DNA fragment.
Figure 3 shows the ligation of the chemically synthesized regulation region with the proinsulin DNA fragment.
--~ Figure 4 shows how the hybrid plasmid pVI 6 is obtained.
Figure 5 shows the plasmid pWI Pl having a DNA sequence I integrated in the 3r correct direction of reading to t-he proinsulin gene.
- 2a - 13~0280 The DNA should "essentially" correspond to that of a natural signal sequence. This is to be understood to mean that the expressed signal peptide is substantially or completely identical to the natural signal peptide, in the latter case therefore the only difference existing at the DNA level is that the synthetic DNA has at least one cleavage site that the natural DNA sequence does not contain. This incorporation of the cleavage site accor-ding to the invention thus means that there is a, more or 1n less extensive, difference from the natural sequence, it being necessary under certain circumstances to have re-course to codons uhich are knoYn to be less preferred by the particular host organism. Ho~ever, surprisingly, this is not associated ~ith any expression disadvantage.
On the contrary, the specific "making to measure" of the synthetic gene is associated with so many advantages that any disadvantage o~ing to the use of "unnatural" codons is, in general, overcompensated by far. In fact, it has emerged that replacement of the start codon GTG, uhich occurs in the gene for alkaline phosphatase in E. coli, by ATG leads to a great increase in expression. A parti-cular advantage of the invention is that the host cell has to produce less ballast protein because the gene ~hich is to be expressed can be directly linked to the 3' end of ~ 10280 the synthetic DNA s;gnal sequence. Furthermore, advan-tages accrue in so far as it is possible in the con-struction of the synthetic DNA to provide DNA sequences, ~hich protrude at the ends, for certain restriction recog-5 nition sites ~hich allo~ cloning of this sequence and, inthe case of disparate recognition sites, permit defined incorporation into a cloning vector. This makes possible incorporation to any desired vectors by the "modular construction principle".
10 Internal recognition sites for restriction enzymes per-mit any desired homologous or heterologous genes to be coupled on in the correct reading frame. It is also possible via these internal cleavage sites to introduce in a straightfor~ard manner modifications in the DNA of the 15 signal sequences, ~hich lead to presequences ~hich do not occur in nature.
These internal cleavage sites are advantageously placed in the regions upstream and do~nstream of the hydrophobic region, in particular in the post-core region, it being 20 possible to modify the splice site and/or its adjacent region. Of course, it is also possible to modify the core region in a manner kno~n per se.
Taking kno~n rules into account (G. von Heijne, J. Mol.
Biol. 173 (1984) 243-251) it is possible, via suitable 25 cleavage sites in the gene section ~hich codes for the carboxy terminal part of the prepeptide, to plan the sig-nal peptidase splice site in such a manner that there is expression not of a fusion protein but directly of the desired, generally eukaryotic, peptide in its natural 30 form. In general, genes of natural origin do not allo~
processing of this type.
Suitable signal sequences are in principle all signal sequences knoun from the literature (M.E.E. l~atson; Nucleic Acids Res. 12 (1984), 5145 - 5164), modifications thereof 35 and "idealized" signal sequences derived therefrom 4 ~3~0280 (D. Perlman and HØ Halvorson~ J. Mol. Biol. 167 (1983), 391 - 409).
Preferred host organisms are E. coli, Streptomyces, Staphy-lococcus species, such as S. aureus, Bacillus species, such as B. subtilis, P. amyloliquifaciens, B. cereus or B. licheniformis, Pseudomonas, Saccharomyces, Spodoptera frugiperda and cell lines of higher organisms, such as plant or animal cells.
In principle, it is possible to obtain by transport 1û expression all those proteins of prokaryotic or eukaryotic origin ~hich can pass through the membrane. Ho~ever, pep-tide products ~hich are of pharmaceutical significance, such as hormones, lymphokines, interferons, blood-coagu-lation factors and vaccines, ~hich in nature are also coded as peptides ~ith an amino-terminal presequence are preferred. Ho~ever, in the prokaryotic host organisms this eukaryotic presequence is not, as a rule, eliminated by the signal peptidases intrinsic to the host.
In E. coli, the genes for the periplasmic and outer-membrane proteins are suitable for transport expression, the former directing the product into the periplasm ~here-as the latter tend to direct onto the outer membrane.
The example ~hich is given is the DNA signal sequence of the periplasmic protein alkaline phosphatase, ~hich is very readily expressed in E. coli, but there is no inten-tion to restrict the invention to this.
The presequence including the first t~enty amino acids of alkaline phosphatase of E. coli is shoun belou:
l 5 lO
Met-Lys-Gln-Ser-Thr-Ile-Ala-Leu-Ala-Leu-Leu-Pro-Leu-Leu-Phe-Thr-Pro-Val-Thr-Lys-Ala-Arg-Thr-Pro-Glu-Met-Pro-Val-3~ 35 40 Leu-Glu-Asn-Arg-Ala-Ala-Gln-Gly-Asn-Ile-Thr-Ala-Pro .. ..
- S - 1~402~0 = preferred splice site of the si~nal peptidase It has emerged that up to about 40, usually about 20, additional amino acids of the mature protein suffice for correct processin~. Houever, in many cases feuer addi-tional amino acids also suffice, for example about 10,advantageously about 5. Since a shorter protein chain means lcss stress on the protein biosynthesis system of the host cell, an advantageous embodiment of the invention is set out in DNA sequence I (see page l7)uhich codes for the presequence of alkaline phosphatase and an additional 5 amino acids of the perfect protein. Apart from a fe~
triplet modifications - namely those uhich introduce unique restriction enzyme cleavage sites and replace the start codon GTG by ATG - DNA sequence I corresponds to the natural sequence for alkaline phosphatase. At the ends of the codin~ strand are located protruding DNA sequences corresponding to the restriction endonuclease EcoR I, uhich permit incorporation into conventional cloning vec-tors, for example the commercially available plasmids such as p9R 322, pUC 8 or pUC 12. In addition, a number of other unique cleavage sites for restriction enzymes have been incorporated ~ithin the gene of DNA sequence I, and these, on the one hand, make it possible to couple heterologous genes onto the correct site and in the desired reading frame and, on the other hand, permit modifications to be carried out:
Restriction enzyme Cut after nucleotide No.
tin the coding strand) Sau 3 A 19 Pvu I 22 Hpa II 54 ) (present in the Ncl I ~4 ) natural gene) Alu I 66 Hph I 68 Ava II 70 Of course, it is also possible to construct the protruding 1~40280 sequences ;n such a manner that they correspond to diffe-rent restriction enzymes, and this then permits incorpor-ation into suitabLe vectors in a defined orientation. In this context, the expert will give consideration to whether the complexity associated with the construction of the gene and its specific incorporation is more important than the additional work of selection associated with incorporation in both orientations when the protruding ends are identical.
DNA sequence I can be constructed of 6 oligonucleotides 26 - 31 bases in length by first synthesizing them chemi-cally and then linking them enzymatically via sticky ends of 6 nucleotides. Incorporation of the synthetic gene into cloning vectors, for example into the commercially available plasmids mentioned, is carried out in a manner known per se.
As an example for the expression of a eukaryotic gene in E. coli using a presequence according to the invention, the synthesis of monkey proinsulin is described below: a DNA sequence is constructed in which the DNA sequence I, followed by the proinsulin gene (W. Wetekam et al., Gene 19 (1982) 179-183), is located on a connecting recognition site for EcoR I and downstream of a chemically syn-thesized regulation region, composed of a bacterial promoter, a lac operator and a ribosomal binding site (German Patent Application P 34 30 683.8), and 6 - 14 nucleotides away from the ribosomal binding site. The expressed proinsulin fusion peptide contains an additional 9 amino acids on the amino terminal end, and these can be eliminated enzymatically or chemically.
The incorporation of the synthetic gene into pUC 8 and the construction of expression plasmids which contain the eukaryotic genes coupled to DNA sequence I are carried out in a manner known per se. In this context, reference may be made to the textbook by Maniatis (Molecular Cloning, Maniatis et al., Cold Spring Harbor, 1982). The .. . . .
~ 7 - 1~ 4n ~8 transformation of the hybrid pLasmids thus obtained into suitable host organisms, advantageously E. coli, is Likewise known per se and is described in detail in the abovementioned textbook. The isolation of the expressed proteins and their purification is likewise described.
In the examples which follow some more embodiments of the invention are specifically illustrated, from which is evident to the expert the large number of possible modi-fications (and combinations). Unless otherwise specified, percentage data in these examples relate to weight.
Examples 1. Chemical synthesis of a single-stranded oligonucleotide The synthesis of the structural units of the gene is illustrated by the example of structural unit Ia of the gene, which comprises nucleotides 1 - 29 of the coding strand. The nucleoside at the 3' end, in the present case therefore guanosine (nucleotide No. 29), is co-valently bonded via the 3'-hydroxy group, by known methods (M.J. Gait et al., Nucleic Acids Res. 8 (1980) 1081 - 1096) to silica gel (FRACTOSIL, supplied by Merck).
For this purpose, first the silica gel is reacted with 3-triethoxysilylpropylamine with elimination of ethanol and formation of a Si-O-Si bond. The guanosine is reacted as the N2 -isobutyryl-3'-0-succinoyl-5'-dimethoxytrityl ether with the modified carrier in the presence of para-nitrophenol and N,N'-dicyclohexylcarbodiimide, the free carboxy group of the succinoyl group acylating the amino radical of the propylamine group.
In the synthetic steps which follow, the base component is used as the monomethyl ester of the 5'-0-dimethoxy-tritylnucleoside-3'-phosphorous acid dialkylamide or chloride, the adenine being in the form of the N6-benzoyl compound, the cytosine being in the form of the N4-benzoyl compound, the guanine being in the form of theN2-iso-~ fraol~la~k 134~2~0 -- 8 --butyryl compound, and the thymine, ~hich contains no amino group, be;ng ~ithout a protective group.
50 mg of the polymeric carrier containing 2 ~mol of bound guanosine are treated successively ~ith the follo~ing agents:
a) nitromethane b) saturated zinc bromide solution in nitromethane con-taining 1% ~ater c) methanol d) tetrahydrofuran e) acetonitrile f) 40 ~mol of the appropriate nucleoside phosphite and 200 ~mol of tetrazole in 0.5 ml of anhydrous aceto-nitrile (5 minutes) 9) 20X acetic anhydride in tetrahydrofuran containing 40% lutidine and 10% dimethylaminopyridine (2 minutes) h) tetrahydrofuran i) tetrahydrofuran containing 20% uater and 40X lutidine j) 3X iodine in collidine/~ater/tetrahydrofuran in the ratio by volume 5 : 4 : 1 k) tetrahydrofuran and 1) methanol.
In this context, the term "phosphite" is to be understood to be the monomethyl ester of the deoxyribose-3'-mono-phosphorous acid, the third valency being saturated bychloride or a tertiary amino group, for example a morpho-lino radical. The yields in each synthetic step can be determined after the detritylation reaction (b) in each case by spectrophotometry, measuring the absorption of the dimethoxytrityl cation at a wavelength of 496 nm.
When the synthesis of the oligonucleotide is complete, the methyl phosphate protective groups on the oligomer are eliminated using p-thiocresol and triethylamine. The oligonucleotide is then removed from the solid carrier by treatment uith ammonia for 3 hours. Treatment of the 13402~0 _ 9 _ oligomers uith concentrated ammonia for 2 to 3 days quanti-tatively eliminates the amino protective groups on the bases. The crude product thus obtained is purified by high-pressure liquid chromatography (HPLC) or by poly-acrylamide gel electrophoresis.
The other structural units Ib - If of the gene are synthe-sized entirely correspondingly, their nucleotide sequences being evident from DNA sequence II(see page 18).
This short peptide sequence is called a "signal peptide"
or a presequence or leader sequence.
The signal sequence located at the amino terminal end has already been characterized for a large number of secretory proteins. In general, it is composed of a hydrophobic region of about 10 to 20 amino acids, which is called the core and to whose amino term;nal end a short peptide se-quence (the pre-core) is bonded, this usually hav;ng one posit;vely charged am;no acid (or several). Between the carboxy terminal end of the hydrophobic region and the amino terminal end of the mature transported protein there is a short pept;de sequence (the post-core) wh;ch contains the splice site and ensures that the spatial arrangement is favorable.
It is known, from U.S. Patent 4,411,994, to couple the gene for a protein which is to be expressed with a bacte-rial gene which codes for an extracellular or periplasmiccarrier protein in order thus to bring about the transport of the desired protein out of the cytoplasm. It is neces-sary for this process to isolate a bacterial gene, which is intrinsic to the host, for a periplasmic, outer mem-brane protein or an extracellular protein. This gene isthen cut with a restriction enzyme, the gene for the pro-tein which is to be transported is inserted into the cut which has been produced, and the host cell ;s transformed iO~80 th a vector ~hich contains the fusion gene thus formed.
The ;solation of the natural gene and its characterization for the selection of suitable cleavage sites is extremely complex. This complexity is avoided according to the in-S vention by making use of a synthetic signal sequence.
Thus the invention relates to a synthetic signal sequence for the transport of proteins in expression systems, which comprises DNA essentially corresponding to a natural signal sequence but having one or nore cleavage sites for endonucleases ~hich are not present in the natu-ral DNA.
The invention further relates to DNA of the Formula I (see page 17).
The invention further relates to a process for he transport expression of eukaryotic, prokaryotic or viral proteins in prokaryotic and eukaryotic cells, which comprises coupling the gene for the protein which is to be transported onto a DNA sequence as described above, incorporating this fusion gene into a vector, and transforming therewith a host cell which transports the expressed protein out of the cytoplasm.
The invention further relates to a hybrid vector comprising a DNA sequence as described above and a host organism containing such vector.
The invention will now be described in further detail by reference to the appended drawings:
Figure 1 shows the digestion of the plasmid p~ 322 with the restriction endonucleases EcoR I and Pvu II and then the filling in of the EcoR I
cleavage site.
Figure 2 shows the plasmid pUC 9 containing the monkey preproinsulin DNA
and the reaction sequence for the construction of the proinsulin DNA fragment.
Figure 3 shows the ligation of the chemically synthesized regulation region with the proinsulin DNA fragment.
--~ Figure 4 shows how the hybrid plasmid pVI 6 is obtained.
Figure 5 shows the plasmid pWI Pl having a DNA sequence I integrated in the 3r correct direction of reading to t-he proinsulin gene.
- 2a - 13~0280 The DNA should "essentially" correspond to that of a natural signal sequence. This is to be understood to mean that the expressed signal peptide is substantially or completely identical to the natural signal peptide, in the latter case therefore the only difference existing at the DNA level is that the synthetic DNA has at least one cleavage site that the natural DNA sequence does not contain. This incorporation of the cleavage site accor-ding to the invention thus means that there is a, more or 1n less extensive, difference from the natural sequence, it being necessary under certain circumstances to have re-course to codons uhich are knoYn to be less preferred by the particular host organism. Ho~ever, surprisingly, this is not associated ~ith any expression disadvantage.
On the contrary, the specific "making to measure" of the synthetic gene is associated with so many advantages that any disadvantage o~ing to the use of "unnatural" codons is, in general, overcompensated by far. In fact, it has emerged that replacement of the start codon GTG, uhich occurs in the gene for alkaline phosphatase in E. coli, by ATG leads to a great increase in expression. A parti-cular advantage of the invention is that the host cell has to produce less ballast protein because the gene ~hich is to be expressed can be directly linked to the 3' end of ~ 10280 the synthetic DNA s;gnal sequence. Furthermore, advan-tages accrue in so far as it is possible in the con-struction of the synthetic DNA to provide DNA sequences, ~hich protrude at the ends, for certain restriction recog-5 nition sites ~hich allo~ cloning of this sequence and, inthe case of disparate recognition sites, permit defined incorporation into a cloning vector. This makes possible incorporation to any desired vectors by the "modular construction principle".
10 Internal recognition sites for restriction enzymes per-mit any desired homologous or heterologous genes to be coupled on in the correct reading frame. It is also possible via these internal cleavage sites to introduce in a straightfor~ard manner modifications in the DNA of the 15 signal sequences, ~hich lead to presequences ~hich do not occur in nature.
These internal cleavage sites are advantageously placed in the regions upstream and do~nstream of the hydrophobic region, in particular in the post-core region, it being 20 possible to modify the splice site and/or its adjacent region. Of course, it is also possible to modify the core region in a manner kno~n per se.
Taking kno~n rules into account (G. von Heijne, J. Mol.
Biol. 173 (1984) 243-251) it is possible, via suitable 25 cleavage sites in the gene section ~hich codes for the carboxy terminal part of the prepeptide, to plan the sig-nal peptidase splice site in such a manner that there is expression not of a fusion protein but directly of the desired, generally eukaryotic, peptide in its natural 30 form. In general, genes of natural origin do not allo~
processing of this type.
Suitable signal sequences are in principle all signal sequences knoun from the literature (M.E.E. l~atson; Nucleic Acids Res. 12 (1984), 5145 - 5164), modifications thereof 35 and "idealized" signal sequences derived therefrom 4 ~3~0280 (D. Perlman and HØ Halvorson~ J. Mol. Biol. 167 (1983), 391 - 409).
Preferred host organisms are E. coli, Streptomyces, Staphy-lococcus species, such as S. aureus, Bacillus species, such as B. subtilis, P. amyloliquifaciens, B. cereus or B. licheniformis, Pseudomonas, Saccharomyces, Spodoptera frugiperda and cell lines of higher organisms, such as plant or animal cells.
In principle, it is possible to obtain by transport 1û expression all those proteins of prokaryotic or eukaryotic origin ~hich can pass through the membrane. Ho~ever, pep-tide products ~hich are of pharmaceutical significance, such as hormones, lymphokines, interferons, blood-coagu-lation factors and vaccines, ~hich in nature are also coded as peptides ~ith an amino-terminal presequence are preferred. Ho~ever, in the prokaryotic host organisms this eukaryotic presequence is not, as a rule, eliminated by the signal peptidases intrinsic to the host.
In E. coli, the genes for the periplasmic and outer-membrane proteins are suitable for transport expression, the former directing the product into the periplasm ~here-as the latter tend to direct onto the outer membrane.
The example ~hich is given is the DNA signal sequence of the periplasmic protein alkaline phosphatase, ~hich is very readily expressed in E. coli, but there is no inten-tion to restrict the invention to this.
The presequence including the first t~enty amino acids of alkaline phosphatase of E. coli is shoun belou:
l 5 lO
Met-Lys-Gln-Ser-Thr-Ile-Ala-Leu-Ala-Leu-Leu-Pro-Leu-Leu-Phe-Thr-Pro-Val-Thr-Lys-Ala-Arg-Thr-Pro-Glu-Met-Pro-Val-3~ 35 40 Leu-Glu-Asn-Arg-Ala-Ala-Gln-Gly-Asn-Ile-Thr-Ala-Pro .. ..
- S - 1~402~0 = preferred splice site of the si~nal peptidase It has emerged that up to about 40, usually about 20, additional amino acids of the mature protein suffice for correct processin~. Houever, in many cases feuer addi-tional amino acids also suffice, for example about 10,advantageously about 5. Since a shorter protein chain means lcss stress on the protein biosynthesis system of the host cell, an advantageous embodiment of the invention is set out in DNA sequence I (see page l7)uhich codes for the presequence of alkaline phosphatase and an additional 5 amino acids of the perfect protein. Apart from a fe~
triplet modifications - namely those uhich introduce unique restriction enzyme cleavage sites and replace the start codon GTG by ATG - DNA sequence I corresponds to the natural sequence for alkaline phosphatase. At the ends of the codin~ strand are located protruding DNA sequences corresponding to the restriction endonuclease EcoR I, uhich permit incorporation into conventional cloning vec-tors, for example the commercially available plasmids such as p9R 322, pUC 8 or pUC 12. In addition, a number of other unique cleavage sites for restriction enzymes have been incorporated ~ithin the gene of DNA sequence I, and these, on the one hand, make it possible to couple heterologous genes onto the correct site and in the desired reading frame and, on the other hand, permit modifications to be carried out:
Restriction enzyme Cut after nucleotide No.
tin the coding strand) Sau 3 A 19 Pvu I 22 Hpa II 54 ) (present in the Ncl I ~4 ) natural gene) Alu I 66 Hph I 68 Ava II 70 Of course, it is also possible to construct the protruding 1~40280 sequences ;n such a manner that they correspond to diffe-rent restriction enzymes, and this then permits incorpor-ation into suitabLe vectors in a defined orientation. In this context, the expert will give consideration to whether the complexity associated with the construction of the gene and its specific incorporation is more important than the additional work of selection associated with incorporation in both orientations when the protruding ends are identical.
DNA sequence I can be constructed of 6 oligonucleotides 26 - 31 bases in length by first synthesizing them chemi-cally and then linking them enzymatically via sticky ends of 6 nucleotides. Incorporation of the synthetic gene into cloning vectors, for example into the commercially available plasmids mentioned, is carried out in a manner known per se.
As an example for the expression of a eukaryotic gene in E. coli using a presequence according to the invention, the synthesis of monkey proinsulin is described below: a DNA sequence is constructed in which the DNA sequence I, followed by the proinsulin gene (W. Wetekam et al., Gene 19 (1982) 179-183), is located on a connecting recognition site for EcoR I and downstream of a chemically syn-thesized regulation region, composed of a bacterial promoter, a lac operator and a ribosomal binding site (German Patent Application P 34 30 683.8), and 6 - 14 nucleotides away from the ribosomal binding site. The expressed proinsulin fusion peptide contains an additional 9 amino acids on the amino terminal end, and these can be eliminated enzymatically or chemically.
The incorporation of the synthetic gene into pUC 8 and the construction of expression plasmids which contain the eukaryotic genes coupled to DNA sequence I are carried out in a manner known per se. In this context, reference may be made to the textbook by Maniatis (Molecular Cloning, Maniatis et al., Cold Spring Harbor, 1982). The .. . . .
~ 7 - 1~ 4n ~8 transformation of the hybrid pLasmids thus obtained into suitable host organisms, advantageously E. coli, is Likewise known per se and is described in detail in the abovementioned textbook. The isolation of the expressed proteins and their purification is likewise described.
In the examples which follow some more embodiments of the invention are specifically illustrated, from which is evident to the expert the large number of possible modi-fications (and combinations). Unless otherwise specified, percentage data in these examples relate to weight.
Examples 1. Chemical synthesis of a single-stranded oligonucleotide The synthesis of the structural units of the gene is illustrated by the example of structural unit Ia of the gene, which comprises nucleotides 1 - 29 of the coding strand. The nucleoside at the 3' end, in the present case therefore guanosine (nucleotide No. 29), is co-valently bonded via the 3'-hydroxy group, by known methods (M.J. Gait et al., Nucleic Acids Res. 8 (1980) 1081 - 1096) to silica gel (FRACTOSIL, supplied by Merck).
For this purpose, first the silica gel is reacted with 3-triethoxysilylpropylamine with elimination of ethanol and formation of a Si-O-Si bond. The guanosine is reacted as the N2 -isobutyryl-3'-0-succinoyl-5'-dimethoxytrityl ether with the modified carrier in the presence of para-nitrophenol and N,N'-dicyclohexylcarbodiimide, the free carboxy group of the succinoyl group acylating the amino radical of the propylamine group.
In the synthetic steps which follow, the base component is used as the monomethyl ester of the 5'-0-dimethoxy-tritylnucleoside-3'-phosphorous acid dialkylamide or chloride, the adenine being in the form of the N6-benzoyl compound, the cytosine being in the form of the N4-benzoyl compound, the guanine being in the form of theN2-iso-~ fraol~la~k 134~2~0 -- 8 --butyryl compound, and the thymine, ~hich contains no amino group, be;ng ~ithout a protective group.
50 mg of the polymeric carrier containing 2 ~mol of bound guanosine are treated successively ~ith the follo~ing agents:
a) nitromethane b) saturated zinc bromide solution in nitromethane con-taining 1% ~ater c) methanol d) tetrahydrofuran e) acetonitrile f) 40 ~mol of the appropriate nucleoside phosphite and 200 ~mol of tetrazole in 0.5 ml of anhydrous aceto-nitrile (5 minutes) 9) 20X acetic anhydride in tetrahydrofuran containing 40% lutidine and 10% dimethylaminopyridine (2 minutes) h) tetrahydrofuran i) tetrahydrofuran containing 20% uater and 40X lutidine j) 3X iodine in collidine/~ater/tetrahydrofuran in the ratio by volume 5 : 4 : 1 k) tetrahydrofuran and 1) methanol.
In this context, the term "phosphite" is to be understood to be the monomethyl ester of the deoxyribose-3'-mono-phosphorous acid, the third valency being saturated bychloride or a tertiary amino group, for example a morpho-lino radical. The yields in each synthetic step can be determined after the detritylation reaction (b) in each case by spectrophotometry, measuring the absorption of the dimethoxytrityl cation at a wavelength of 496 nm.
When the synthesis of the oligonucleotide is complete, the methyl phosphate protective groups on the oligomer are eliminated using p-thiocresol and triethylamine. The oligonucleotide is then removed from the solid carrier by treatment uith ammonia for 3 hours. Treatment of the 13402~0 _ 9 _ oligomers uith concentrated ammonia for 2 to 3 days quanti-tatively eliminates the amino protective groups on the bases. The crude product thus obtained is purified by high-pressure liquid chromatography (HPLC) or by poly-acrylamide gel electrophoresis.
The other structural units Ib - If of the gene are synthe-sized entirely correspondingly, their nucleotide sequences being evident from DNA sequence II(see page 18).
2. Enzymatic linkage of the single-stranded oligonucleo-tides to give DNA sequence I
The terminal oligonucleotides Ia and If are not phosphory-lated. This prevents oligomerization via the protruding ends. For the phosphorylation of oligonucleotides Ib, Ic, Id and Ie, in each case 1 nmol of these compounds is treated ~ith 5 nmol of adenosine triphosphate and 4 units of T4 polynucleotide kinase in 2û ~ul of 50 mM tris.HCl buffer (pH 7.6), 10 mM magnesium chloride and 10 mM di-thiothreitol (DTT) at 37~C for 30 minutes. The enzyme is inactivated by heating at 95~C for 5 minutes. The oligonucleotides Ia to If are then combined and hybridized to give the double strand by heating them in a 20 mM KCl solution and then slo~ly (over the course of 2 hours) cooling to 1~~C. The ligation to give the DNA fragment according to DNA sequence I is carried out by reaction in 40~ul of 50 mM tris.HCl buffer (20 mM magnesium chloride and 10 mM DTT) using 100 units of T4 DNA ligase, at 15~C over the course of 18 hours.
The purification of the gene fragment is carried out by gel electrophoresis on a 10X polyacrylamide gel (uithout addition of urea, 20 x 40 cm, 1 mm thick), the marker sub-stance used being OX 174 DNA (supplied by BRL) cut uith Hinf I, or pPR 322 cut ~ith Hae III.
- lO - 13~0283 3. Incorporation of the gene fragment in pUC 8 The commerc;ally available plasmid pUC 8 is opened in a known manner and in accordance with the manufacturer's data using the restriction endonuclease EcoR I. The digestion mixture is fractionated by electrophoresis on a 5% polyacrylamide gel in a known manner, and the DNA is visualized by staining with ethidium bromide or by radio-active labeling ("Nick translation" method of Maniatis, loc. cit.). The plasmid band is then cut out of the acryl-amide gel and separated from the polyacrylamide by electro-phoresis.
The terminal oligonucleotides Ia and If are not phosphory-lated. This prevents oligomerization via the protruding ends. For the phosphorylation of oligonucleotides Ib, Ic, Id and Ie, in each case 1 nmol of these compounds is treated ~ith 5 nmol of adenosine triphosphate and 4 units of T4 polynucleotide kinase in 2û ~ul of 50 mM tris.HCl buffer (pH 7.6), 10 mM magnesium chloride and 10 mM di-thiothreitol (DTT) at 37~C for 30 minutes. The enzyme is inactivated by heating at 95~C for 5 minutes. The oligonucleotides Ia to If are then combined and hybridized to give the double strand by heating them in a 20 mM KCl solution and then slo~ly (over the course of 2 hours) cooling to 1~~C. The ligation to give the DNA fragment according to DNA sequence I is carried out by reaction in 40~ul of 50 mM tris.HCl buffer (20 mM magnesium chloride and 10 mM DTT) using 100 units of T4 DNA ligase, at 15~C over the course of 18 hours.
The purification of the gene fragment is carried out by gel electrophoresis on a 10X polyacrylamide gel (uithout addition of urea, 20 x 40 cm, 1 mm thick), the marker sub-stance used being OX 174 DNA (supplied by BRL) cut uith Hinf I, or pPR 322 cut ~ith Hae III.
- lO - 13~0283 3. Incorporation of the gene fragment in pUC 8 The commerc;ally available plasmid pUC 8 is opened in a known manner and in accordance with the manufacturer's data using the restriction endonuclease EcoR I. The digestion mixture is fractionated by electrophoresis on a 5% polyacrylamide gel in a known manner, and the DNA is visualized by staining with ethidium bromide or by radio-active labeling ("Nick translation" method of Maniatis, loc. cit.). The plasmid band is then cut out of the acryl-amide gel and separated from the polyacrylamide by electro-phoresis.
4. Incorporation of DNA sequence I into an expression plasmid The expression plasmid p~I 6 having the information for monkey proinsulin is constructed as follows:
10 ~ug of the plasmid pBR 322 are digested with the res-triction endonucleases EcoR I and Pvu II and then the EcoRI cleavage site is filled in a fill-in reaction using Klenow polymerase. Following fractionation by gel electro-phoresis in a 5% polyacrylamide gel, the plasmid fragmentof length 2293 Bp can be obtained by electroelution (Figure 1).
The monkey preproinsulin DNA integrated in the plasmid p8R 322 (~etekam et al., Gene 19 (1982) 179 - 183) is isolated by digestion using the restriction endonucleases Hind III and Mst I (as a fragment of about 1250 Bp) and recloned into the plasmid pUC 9 as follows: the plasmid pUC 9 is cleaved with the enzyme Bam HI, the cleav~e ~ite is filled in a standard fill-in reaction using Klenow polymerase ("large fragment"), subsequent cleavage with the restriction enzyme Hind III is carried out, and the DNA is separated from the other DNA fragments by gel electrophoresis in a 5% polyacrylamide gel. The isolated insulin DNA fragment of length about 1250 Bp is integrated .
" 3.3~280 into the opened plasm;d.
To remove the untranslated region and the presequence, the pUC 9 plasmid thus modified is digested u;th Hae III, and the fragment of length 143 Bp is digested ~ith 8al 31 under limiting enzyme conditions to eliminate the last t~o nucleotides from the presequence. This results in the first codon on the amino terminal end being TTT, ~hich represents phenylalanine as the first amino acid of the B chain.
An adaptor uhich is specific for Eco RI is no~ ligated onto this fragment in a blunt-end ligation reaction:
a) 5' AAT TAT GAA TTC GCA ATG
Eco RI TA CTT AAG CGT TAC
b) 5' AAT TAT GAA TTC GCA AGA
Eco RI TA CTT AAG CGT TCT
In order to prevent polymerization of the adaptors they are used unphosphorylated in the ligation reaction (this being indicated in the figures by Eco RI-, in the same uay as recognition sequences inactivated by, for example, filling in). The adaptor a) has a codon for methionine at the end, and the adaptor b) has a codon for arginine.
Thus, the gene product obtained by variant a) is amenable to removal of the bacterial contribution by cleavage with cyanogen bromide, ~hereas variant b) allo~s trypsin cleav-age.
The ligation product is digested u;th Mbo II. After frac-tionation by gel electrophoresis, a DNA fragment of length 79 Bp having the information for amino acids Nos. 1 to 21 of the B chain is obtained.
The gene for the remaining information for the proinsulin molecule ~including a G-C sequence from the cloning and 21 Bp from the pBR 322 connected to the stop codon) is - - 12 - ~34~280 obtained from the pUC 9 plasmid having the complete information for monkey preproinsulin by digestion ~ith Mbo II/Sma I and isolation of a DNA fragment of length about 240 Bp. The correct ligation product of length about 320 Bp (including the adaptor of 18 Bp) is obtained by ligation of the t~o proinsulin fragments. This pro-insulin DNA fragment thus constructed can no~ be ligated together ~ith a regulation region via the Eco RI negative cleavage site.
Figure 2 sho~s the entire reaction sequence, ~here A, B
and C denote the DNA for the particular peptide chains of the proinsulin molecule, Ad denotes the (dephosphoryl-ated) adaptor (a or b) and Pre denotes the DNA for the presequence of monkey preproinsulin.
A chemically synthesized regulation region composed of a recognition sequence for Bam HI, the lac operator (0), a bacterial promoter (P) and a ribosomal binding site (RB), and having an ATG start codon, 6 to 14 nucleotides a~ay from the RB and having a connected recognition sequence for Eco RI (Figure 3) is Ligated, via the common Eco RI
overlapping region, ~ith the proinsulin gene fragment obtained according to the previous example. It is advan-tageous to choose the follo~ing synthetic regulation region (DNA sequence IIa from Table 2, corresponding to German Patent Application P 34 30 683.8):
. . .. .
~ 13 ~ 1~2~
10 ~ug of the plasmid pBR 322 are digested with the res-triction endonucleases EcoR I and Pvu II and then the EcoRI cleavage site is filled in a fill-in reaction using Klenow polymerase. Following fractionation by gel electro-phoresis in a 5% polyacrylamide gel, the plasmid fragmentof length 2293 Bp can be obtained by electroelution (Figure 1).
The monkey preproinsulin DNA integrated in the plasmid p8R 322 (~etekam et al., Gene 19 (1982) 179 - 183) is isolated by digestion using the restriction endonucleases Hind III and Mst I (as a fragment of about 1250 Bp) and recloned into the plasmid pUC 9 as follows: the plasmid pUC 9 is cleaved with the enzyme Bam HI, the cleav~e ~ite is filled in a standard fill-in reaction using Klenow polymerase ("large fragment"), subsequent cleavage with the restriction enzyme Hind III is carried out, and the DNA is separated from the other DNA fragments by gel electrophoresis in a 5% polyacrylamide gel. The isolated insulin DNA fragment of length about 1250 Bp is integrated .
" 3.3~280 into the opened plasm;d.
To remove the untranslated region and the presequence, the pUC 9 plasmid thus modified is digested u;th Hae III, and the fragment of length 143 Bp is digested ~ith 8al 31 under limiting enzyme conditions to eliminate the last t~o nucleotides from the presequence. This results in the first codon on the amino terminal end being TTT, ~hich represents phenylalanine as the first amino acid of the B chain.
An adaptor uhich is specific for Eco RI is no~ ligated onto this fragment in a blunt-end ligation reaction:
a) 5' AAT TAT GAA TTC GCA ATG
Eco RI TA CTT AAG CGT TAC
b) 5' AAT TAT GAA TTC GCA AGA
Eco RI TA CTT AAG CGT TCT
In order to prevent polymerization of the adaptors they are used unphosphorylated in the ligation reaction (this being indicated in the figures by Eco RI-, in the same uay as recognition sequences inactivated by, for example, filling in). The adaptor a) has a codon for methionine at the end, and the adaptor b) has a codon for arginine.
Thus, the gene product obtained by variant a) is amenable to removal of the bacterial contribution by cleavage with cyanogen bromide, ~hereas variant b) allo~s trypsin cleav-age.
The ligation product is digested u;th Mbo II. After frac-tionation by gel electrophoresis, a DNA fragment of length 79 Bp having the information for amino acids Nos. 1 to 21 of the B chain is obtained.
The gene for the remaining information for the proinsulin molecule ~including a G-C sequence from the cloning and 21 Bp from the pBR 322 connected to the stop codon) is - - 12 - ~34~280 obtained from the pUC 9 plasmid having the complete information for monkey preproinsulin by digestion ~ith Mbo II/Sma I and isolation of a DNA fragment of length about 240 Bp. The correct ligation product of length about 320 Bp (including the adaptor of 18 Bp) is obtained by ligation of the t~o proinsulin fragments. This pro-insulin DNA fragment thus constructed can no~ be ligated together ~ith a regulation region via the Eco RI negative cleavage site.
Figure 2 sho~s the entire reaction sequence, ~here A, B
and C denote the DNA for the particular peptide chains of the proinsulin molecule, Ad denotes the (dephosphoryl-ated) adaptor (a or b) and Pre denotes the DNA for the presequence of monkey preproinsulin.
A chemically synthesized regulation region composed of a recognition sequence for Bam HI, the lac operator (0), a bacterial promoter (P) and a ribosomal binding site (RB), and having an ATG start codon, 6 to 14 nucleotides a~ay from the RB and having a connected recognition sequence for Eco RI (Figure 3) is Ligated, via the common Eco RI
overlapping region, ~ith the proinsulin gene fragment obtained according to the previous example. It is advan-tageous to choose the follo~ing synthetic regulation region (DNA sequence IIa from Table 2, corresponding to German Patent Application P 34 30 683.8):
. . .. .
~ 13 ~ 1~2~
5' GATCCTAAATAAATTCTTGACA~ AAA 3' 3' GATTTATTTAAGAACTGTAAAAAATTT 5' (Ba~ HI) P
5' TAATTTGGTATAATGTGTGGAATTGTGAGCG 3' 3' ATTAAACCATATTACACACCTTAACACTCGC 5' 5' GAATAACAATTTCACAGAGGATCTAG 3' 3' CTTATTGTTAAAGTGTCTCCTAGATCTTAA 5' RB (Eco RI) The other synthetic regulation regions specified in Table 2 can be used like~ise. Ho~ever, it ;s also pos-sible to choose a natural or derived ~Perlman et al., loc.
cit.) signal sequence kno~n from the literature.
.. . . . . ..
- 1 4 - 1 3 ~ ~ 2 8 0 C~ C~ 'C
ll ll ll a~
~ U~
., a~
.
~ ~ Cl: ~ ~ C C ~ C~
~c~
O ~U a:
O u L
O O ~ O
~ N ~ ~ V~
3 ~
.. 3 c c ~: ~ u u ~ c~ c c ~ c c c~ o o o o ~s co r cr o 11 11 11 "
~ ~ ~ c ~ ' J ~I: C~ ~ ~
~ ~ c ~ ~ o U ~
~ ~ ~ O O O O O r ~ c~ c~ c~ c~ U C~
~ C~ C~ ~ ~ ~ ~ ~ ~. V~
c 1l 1l 1l 1l 1l 1l ~ ~ D C~ ~ ~ ~ bDS
Z H
- 15 - ~ 2~3 Follo~ing double digestion ~ith Sma I/Bam HI and a fill-in reaction of the Bam HI cleavage site ~ith the Kleno~
fragment, the ligation product (about 420 Bp) is isolated by gel electrophoresis.
The fragment thus obtained can then, by a blunt-end liga-tion, be ligated into the pBR 322 part-plasmid of Figure 1 (Figure 4). The hybrid plasmid pWI 6 is obtained.
After transformation into the E. coli strain HB 101 and selection on ampicillin plates, the plasmid DNA of indi-vidual clones uas tested for the integration of a 420 Bpfragment having the regulation region and the proinsulin gene shortened by Bal 31. In order to demonstrate the correct shortening of the proinsulin gene by Bal 31 (Figure 2), the plasmids having the integrated proinsulin gene fragment uere sequenced starting from the Eco RI
cleavage site. Of 60 sequenced clones, three had the desired shortening by t~o nucleotides (Figure 4).
1 JUg of the plasmid pWI 6 is cut ~ith the restriction enzyme Eco RI and then ligated together in the presence of 30 ng of DNA sequence I, at 16~C in 6 hours. After transformation into E. coli HB 101, plasmids are isolated from individual clones and tested for integration of DNA
sequence I by means of restriction enzyme analysis. 7%
of the clones contained the plasmid pWI 6 with integrated DNA sequence I.
The direction of this integration reaction can be unambi-guously determined by standard methods of restriction enzyme analysis via double digestion uith Hind III/
Pvu I. The plasmid pWI 6 having a DNA sequence I inte-grated in the correct direction of reading to the pro-insulin gene is sho~n as pWIP 1 in Figure 5.
This plasmid can then be transformed into various E. coli strains in order to test the synthetic capacity of the individual strains.
- 16 - 13~2~0 The express;on of the presequence-proinsulin gene fusion in E. coli is determined as follows:
1 ml of a bacterial culture induced with IPTG (isopropyl ~-D-thiogalactopyranoside) is stopped using PMSF ~phenyl-5 methylsulfonyl fluoride) in a final concentration of 5x10-4 M at an opt;cal density of OD600 of 1.0 and at an induction time of 1 hour, cooled in ice and spun do~n.
The cell sediment is then washed in 1 ml of buffer (1û mM
tris.HCl, pH 7.6; 40 mM NaCl), spun down and resuspended 10 in 200/ul of buffer (20X sucrose; 20 mM tris.HCl, pH 8.0;
2 mM EDTA), incubated at room temperature for 10 minutes, spun down and immediately resuspended in 500jul of double-distilled H20. After incubation in ice for 10 minutes, the shock-lysed bacteria are spun down and the supernatant 15 is frozen. The proinsuLin content of this supernatant is tested by a standard insulin RIA (Amersham).
The bacterial sediment is resuspended once more in 200~ul of lysozyme buffer (20Z sucrose; 2 mg/ml lysozyme; 20 mM
tris.HCl, pH 8.0; 2 mM EDTA), incubated in ice for 20 30 minutes, sonicated 3 x 10 seconds and then spun down.
The supernatant resulting from this is tested for the con-tent of proinsulin ("plasma fractionn) in a radio-immunoassay.
Individual bacterial clones which contain the plasmid 25 pWIP 1 were examined for their synthetic capacity and their ability to transport the proinsulin-presequence product. It was possible to demonstrate that all the bacterial clones, as expected, transported about 90Z of the produced proinsulin into the periplasmic space. About 30 10Z of the RIA activity of proinsulin was still found in the plasma fraction.
.
- 17 - ~.? 10 280 DNA sequenceI
Triplet No. l 2 3 A~lno acid No. Met Lys Gln Nucleoti~ No. 5 lO
Codin~ strand 5' AA TTC ATG AAA CAA
non-cod. strand 3' G TAC TTT GTT
4 5 6 7 8 9 10 ll 12 13 Ser Thr Ile Ala Leu Ala Leu Leu Pro Leu AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA
TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT
Leu Phe Thr Pro Val Thr Lys Ala Arg Thr CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC
GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG
Pro Glu Met CCA GAA ATG G 3' GGT CTT TAC CTT AA 5' 18 - ~3 ~ 0 2 ~ 0 DNA sequence II:
Ia 5 ' AA TTC ATG AAA CAA AGC ACG ATC GCA CTG
3 ' G TAC TTT GTT TCG TGC TAG CGT GAC
Eco RI ~ Ib Ic IGCA CTC TTA CCG TTA CTG TTT ACC CCG
CGT GAG AAT GGC AAT GAC AAA TGG GGC
~ ~ Id Ie ~ Eco RI
hTG ACA AAA GCT CGG ACC CCA GAA ATG G
CAC TGT TTT CGA GCC TGG GGT CTT TAC CTT AA
~ ~ If
5' TAATTTGGTATAATGTGTGGAATTGTGAGCG 3' 3' ATTAAACCATATTACACACCTTAACACTCGC 5' 5' GAATAACAATTTCACAGAGGATCTAG 3' 3' CTTATTGTTAAAGTGTCTCCTAGATCTTAA 5' RB (Eco RI) The other synthetic regulation regions specified in Table 2 can be used like~ise. Ho~ever, it ;s also pos-sible to choose a natural or derived ~Perlman et al., loc.
cit.) signal sequence kno~n from the literature.
.. . . . . ..
- 1 4 - 1 3 ~ ~ 2 8 0 C~ C~ 'C
ll ll ll a~
~ U~
., a~
.
~ ~ Cl: ~ ~ C C ~ C~
~c~
O ~U a:
O u L
O O ~ O
~ N ~ ~ V~
3 ~
.. 3 c c ~: ~ u u ~ c~ c c ~ c c c~ o o o o ~s co r cr o 11 11 11 "
~ ~ ~ c ~ ' J ~I: C~ ~ ~
~ ~ c ~ ~ o U ~
~ ~ ~ O O O O O r ~ c~ c~ c~ c~ U C~
~ C~ C~ ~ ~ ~ ~ ~ ~. V~
c 1l 1l 1l 1l 1l 1l ~ ~ D C~ ~ ~ ~ bDS
Z H
- 15 - ~ 2~3 Follo~ing double digestion ~ith Sma I/Bam HI and a fill-in reaction of the Bam HI cleavage site ~ith the Kleno~
fragment, the ligation product (about 420 Bp) is isolated by gel electrophoresis.
The fragment thus obtained can then, by a blunt-end liga-tion, be ligated into the pBR 322 part-plasmid of Figure 1 (Figure 4). The hybrid plasmid pWI 6 is obtained.
After transformation into the E. coli strain HB 101 and selection on ampicillin plates, the plasmid DNA of indi-vidual clones uas tested for the integration of a 420 Bpfragment having the regulation region and the proinsulin gene shortened by Bal 31. In order to demonstrate the correct shortening of the proinsulin gene by Bal 31 (Figure 2), the plasmids having the integrated proinsulin gene fragment uere sequenced starting from the Eco RI
cleavage site. Of 60 sequenced clones, three had the desired shortening by t~o nucleotides (Figure 4).
1 JUg of the plasmid pWI 6 is cut ~ith the restriction enzyme Eco RI and then ligated together in the presence of 30 ng of DNA sequence I, at 16~C in 6 hours. After transformation into E. coli HB 101, plasmids are isolated from individual clones and tested for integration of DNA
sequence I by means of restriction enzyme analysis. 7%
of the clones contained the plasmid pWI 6 with integrated DNA sequence I.
The direction of this integration reaction can be unambi-guously determined by standard methods of restriction enzyme analysis via double digestion uith Hind III/
Pvu I. The plasmid pWI 6 having a DNA sequence I inte-grated in the correct direction of reading to the pro-insulin gene is sho~n as pWIP 1 in Figure 5.
This plasmid can then be transformed into various E. coli strains in order to test the synthetic capacity of the individual strains.
- 16 - 13~2~0 The express;on of the presequence-proinsulin gene fusion in E. coli is determined as follows:
1 ml of a bacterial culture induced with IPTG (isopropyl ~-D-thiogalactopyranoside) is stopped using PMSF ~phenyl-5 methylsulfonyl fluoride) in a final concentration of 5x10-4 M at an opt;cal density of OD600 of 1.0 and at an induction time of 1 hour, cooled in ice and spun do~n.
The cell sediment is then washed in 1 ml of buffer (1û mM
tris.HCl, pH 7.6; 40 mM NaCl), spun down and resuspended 10 in 200/ul of buffer (20X sucrose; 20 mM tris.HCl, pH 8.0;
2 mM EDTA), incubated at room temperature for 10 minutes, spun down and immediately resuspended in 500jul of double-distilled H20. After incubation in ice for 10 minutes, the shock-lysed bacteria are spun down and the supernatant 15 is frozen. The proinsuLin content of this supernatant is tested by a standard insulin RIA (Amersham).
The bacterial sediment is resuspended once more in 200~ul of lysozyme buffer (20Z sucrose; 2 mg/ml lysozyme; 20 mM
tris.HCl, pH 8.0; 2 mM EDTA), incubated in ice for 20 30 minutes, sonicated 3 x 10 seconds and then spun down.
The supernatant resulting from this is tested for the con-tent of proinsulin ("plasma fractionn) in a radio-immunoassay.
Individual bacterial clones which contain the plasmid 25 pWIP 1 were examined for their synthetic capacity and their ability to transport the proinsulin-presequence product. It was possible to demonstrate that all the bacterial clones, as expected, transported about 90Z of the produced proinsulin into the periplasmic space. About 30 10Z of the RIA activity of proinsulin was still found in the plasma fraction.
.
- 17 - ~.? 10 280 DNA sequenceI
Triplet No. l 2 3 A~lno acid No. Met Lys Gln Nucleoti~ No. 5 lO
Codin~ strand 5' AA TTC ATG AAA CAA
non-cod. strand 3' G TAC TTT GTT
4 5 6 7 8 9 10 ll 12 13 Ser Thr Ile Ala Leu Ala Leu Leu Pro Leu AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA
TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT
Leu Phe Thr Pro Val Thr Lys Ala Arg Thr CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC
GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG
Pro Glu Met CCA GAA ATG G 3' GGT CTT TAC CTT AA 5' 18 - ~3 ~ 0 2 ~ 0 DNA sequence II:
Ia 5 ' AA TTC ATG AAA CAA AGC ACG ATC GCA CTG
3 ' G TAC TTT GTT TCG TGC TAG CGT GAC
Eco RI ~ Ib Ic IGCA CTC TTA CCG TTA CTG TTT ACC CCG
CGT GAG AAT GGC AAT GAC AAA TGG GGC
~ ~ Id Ie ~ Eco RI
hTG ACA AAA GCT CGG ACC CCA GAA ATG G
CAC TGT TTT CGA GCC TGG GGT CTT TAC CTT AA
~ ~ If
Claims (16)
1. A synthetic signal sequence for the transport of proteins in expression systems which comprises DNA
essentially corresponding to a natural signal sequence but having one or more cleavage sites for endonucleases which the natural DNA does not contain.
essentially corresponding to a natural signal sequence but having one or more cleavage sites for endonucleases which the natural DNA does not contain.
2. A signal sequence as claimed in claim 1, which contains internal cleavage sites upstream or downsteam or upstream and downstream of the hydrophobic region.
3. A signal sequence as claimed in claim 1, which essentially corresponds to the natural signal sequence of alkaline phosphatase of E. coli.
4. A signal sequence as claimed in claim 1, which contains at the 3' end up to about 40 of the amino-terminal codons of the adjacent structural gene following downstream.
5. A signal sequence as claimed in claim 2, which essentially corresponds to the natural signal sequence of alkaline phosphatase of E. coli.
6. A signal sequence as claimed in claim 2, which contains at the 3' end up to about 40 of the amino-terminal codons of the structural gene following downstream.
7. A signal sequence as claimed in claim 3, which contains at the 3' end up to about 40 of the amino-terminal codons of the structural gene following downstream.
8. DNA of the formula I: 5 10 5 ' AA TTC ATG AAA CAA
3' G TAC TTT GTT
AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA
TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT
CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC
GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG
CCA GAA ATG G 3 ' GGT CTT TAC CTT AA 5 '
3' G TAC TTT GTT
AGC ACG ATC GCA CTG GCA CTC TTA CCG TTA
TCG TGC TAG CGT GAC CGT GAG AAT GGC AAT
CTG TTT ACC CCG GTG ACA AAA GCT CGG ACC
GAC AAA TGG GGC CAC TGT TTT CGA GCC TGG
CCA GAA ATG G 3 ' GGT CTT TAC CTT AA 5 '
9. A process for the transport expression of eukaryotic, prokaryotic or viral proteins in prokaryotic and eukaryotic cells, which comprises coupling the gene for the protein which is to be transported onto a DNA sequence as claimed in claim 1, incorporating this fusion gene into a vector, and transforming therewith a host cell which transports the expressed protein out of the cytoplasm.
10. The process as claimed in claim 9, wherein the synthetic DNA signal sequence codes for a protein intrinsic to the host.
11. A hybrid vector comprising a DNA sequence as claimed in claim 1.
12. A hybrid vector as claimed in claim 11, which is a hybrid plasmid containing the DNA sequence I as claimed in claim 8, inserted in an Eco RI cleavage site.
13. A host cell containing a vector as claimed in claim 11.
14. A host cell containing a vector as claimed in claim 12.
15. A host cell as claimed in claim 13, which is of the species E.
coli.
coli.
16. A host cell as claimed in claim 14, which is of the species E.
coli.
coli.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DEP3436818.3 | 1984-10-06 | ||
| DE19843436818 DE3436818A1 (en) | 1984-10-06 | 1984-10-06 | SYNTHETIC SIGNAL SEQUENCE FOR TRANSPORTING PROTEINS IN EXPRESSION SYSTEMS |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA1340280C true CA1340280C (en) | 1998-12-22 |
Family
ID=6247347
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA000492345A Expired - Fee Related CA1340280C (en) | 1984-10-06 | 1985-10-04 | Synthetic signal sequence for the transport of proteins in expression systems |
Country Status (15)
| Country | Link |
|---|---|
| EP (1) | EP0177827B1 (en) |
| JP (1) | JPS6188883A (en) |
| AT (1) | ATE97445T1 (en) |
| AU (1) | AU595486B2 (en) |
| CA (1) | CA1340280C (en) |
| DE (2) | DE3436818A1 (en) |
| DK (1) | DK175511B1 (en) |
| ES (1) | ES8605579A1 (en) |
| GR (1) | GR852405B (en) |
| HU (1) | HU197355B (en) |
| IE (1) | IE63262B1 (en) |
| IL (1) | IL76573A (en) |
| NZ (1) | NZ213717A (en) |
| PH (1) | PH30819A (en) |
| PT (1) | PT81253B (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7829503B2 (en) | 2003-03-27 | 2010-11-09 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-fungal agents |
| US7939468B2 (en) | 2003-03-27 | 2011-05-10 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-proliferative agents |
| US8278085B2 (en) | 2003-07-02 | 2012-10-02 | Ptc Therapeutics, Inc. | RNA processing protein complexes and uses thereof |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0196864A3 (en) * | 1985-03-25 | 1988-03-23 | Cetus Corporation | Alkaline phosphatase-mediated processing and secretion of recombinant proteins, dna sequences for use therein and cells transformed using such sequences |
| IT1196484B (en) * | 1986-07-11 | 1988-11-16 | Sclavo Spa | YEAST EXPRESSION AND SECRETION VECTOR, USEFUL FOR THE PREPARATION OF HETEROLOGICAL PROTEINS |
| US5426036A (en) * | 1987-05-05 | 1995-06-20 | Hoechst Aktiengesellschaft | Processes for the preparation of foreign proteins in streptomycetes |
| DE68917759T2 (en) * | 1988-03-18 | 1995-04-27 | Wang Laboratories | Distributed reference and change table for a virtual storage system. |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5939899A (en) * | 1982-08-27 | 1984-03-05 | Gakuzo Tamura | Novel vector |
| US4711844A (en) * | 1983-03-09 | 1987-12-08 | Cetus Corporation | Modified signal peptides |
| NZ207925A (en) * | 1983-04-25 | 1988-05-30 | Genentech Inc | Yeast expression vehicle consisting of a yeast promoter and signal peptide encoding region linked to a heterologus peptide coding region; expression and culture |
| US4663280A (en) * | 1983-05-19 | 1987-05-05 | Public Health Research Institute Of The City Of New York | Expression and secretion vectors and method of constructing vectors |
| AU3011684A (en) * | 1983-05-19 | 1984-12-04 | Public Health Research Institute Of The City Of New York, Inc., The | Expression and secretion vectors & method of constructing vectors |
| JPS6030687A (en) * | 1983-08-01 | 1985-02-16 | Wakunaga Seiyaku Kk | Dna gene, its preparation and plasmid containing the same |
| DE3587205T2 (en) * | 1984-07-30 | 1993-08-26 | Wakunaga Seiyaku Kk | METHOD FOR PRODUCING A PROTEIN AND VECTOR TO BE USED FOR IT, RECOMBINANT DNA AND TRANSFORMED CELL. |
-
1984
- 1984-10-06 DE DE19843436818 patent/DE3436818A1/en not_active Withdrawn
-
1985
- 1985-09-23 AT AT85112043T patent/ATE97445T1/en not_active IP Right Cessation
- 1985-09-23 DE DE85112043T patent/DE3587660D1/en not_active Expired - Lifetime
- 1985-09-23 EP EP85112043A patent/EP0177827B1/en not_active Expired - Lifetime
- 1985-09-30 HU HU853761A patent/HU197355B/en unknown
- 1985-10-03 JP JP60221120A patent/JPS6188883A/en active Pending
- 1985-10-04 AU AU48333/85A patent/AU595486B2/en not_active Expired
- 1985-10-04 IE IE244085A patent/IE63262B1/en not_active IP Right Cessation
- 1985-10-04 ES ES547600A patent/ES8605579A1/en not_active Expired
- 1985-10-04 DK DK198504532A patent/DK175511B1/en not_active IP Right Cessation
- 1985-10-04 IL IL76573A patent/IL76573A/en not_active IP Right Cessation
- 1985-10-04 NZ NZ213717A patent/NZ213717A/en unknown
- 1985-10-04 PT PT81253A patent/PT81253B/en unknown
- 1985-10-04 CA CA000492345A patent/CA1340280C/en not_active Expired - Fee Related
- 1985-10-04 GR GR852405A patent/GR852405B/el unknown
- 1985-10-14 PH PH32879A patent/PH30819A/en unknown
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7829503B2 (en) | 2003-03-27 | 2010-11-09 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-fungal agents |
| US7939468B2 (en) | 2003-03-27 | 2011-05-10 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-proliferative agents |
| US8278085B2 (en) | 2003-07-02 | 2012-10-02 | Ptc Therapeutics, Inc. | RNA processing protein complexes and uses thereof |
Also Published As
| Publication number | Publication date |
|---|---|
| DE3587660D1 (en) | 1993-12-23 |
| IE852440L (en) | 1986-04-06 |
| EP0177827B1 (en) | 1993-11-18 |
| AU4833385A (en) | 1986-04-10 |
| DE3436818A1 (en) | 1986-04-10 |
| DK453285A (en) | 1986-04-07 |
| ATE97445T1 (en) | 1993-12-15 |
| JPS6188883A (en) | 1986-05-07 |
| EP0177827A2 (en) | 1986-04-16 |
| ES547600A0 (en) | 1986-03-16 |
| IL76573A (en) | 1992-06-21 |
| HU197355B (en) | 1989-03-28 |
| HUT40164A (en) | 1986-11-28 |
| EP0177827A3 (en) | 1987-12-02 |
| DK453285D0 (en) | 1985-10-04 |
| NZ213717A (en) | 1989-01-06 |
| AU595486B2 (en) | 1990-04-05 |
| IE63262B1 (en) | 1995-04-05 |
| PH30819A (en) | 1997-10-17 |
| DK175511B1 (en) | 2004-11-15 |
| GR852405B (en) | 1986-02-04 |
| ES8605579A1 (en) | 1986-03-16 |
| PT81253B (en) | 1987-11-30 |
| PT81253A (en) | 1985-11-01 |
| IL76573A0 (en) | 1986-02-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA1341124C (en) | Genetic engineering process for the preparation of hirudins, and means for carrying out this process | |
| AU607209B2 (en) | Secretion of insulin-like growth factor-i in e. coli | |
| DK172618B1 (en) | Fusion proteins, their preparation and use | |
| IL95495A (en) | Fusion proteins their preparation and use | |
| US5489517A (en) | Secretion of insulin-like growth factor-I in E. coli | |
| KR920009505B1 (en) | Method for preparation of polypeptide having human gamma inf activity | |
| CA1297813C (en) | Process for production of insulin-like growth factor i | |
| Sung et al. | Short synthetic oligodeoxyribonucleotide leader sequences enhance accumulation of human proinsulin synthesized in Escherichia coli. | |
| IE851319L (en) | Genetically engineered interleukin 2 | |
| CA1340280C (en) | Synthetic signal sequence for the transport of proteins in expression systems | |
| AU595805B2 (en) | Releasing desired peptides from fusion polypeptides using endopeptidases | |
| DK171940B1 (en) | Method for Preparation of Polypeptides with C-Terminal Carboxamide Group, Polypeptides for Use in the Method, Fusion Proteins Thereto and Suitable DNA Sequences, Plasmids and Host Organisms | |
| AU617999B2 (en) | A genetic engineering process for the preparation of angiogenins | |
| Hinton et al. | The molybdenum-pterin binding protein is encoded by a multigene family in Clostridium pasteurianum | |
| CA1224167A (en) | Preparation of secretin | |
| EP0207165A1 (en) | Polypeptide secretion-causing vector, microorganisms transformed by said vector, and process for preparing polypeptide using said microorganisms | |
| Cravador et al. | Total DNA synthesis and cloning in Escherichia coli of a gene coding for the human growth hormone releasing factor | |
| AU600229B2 (en) | Modification of the DNA sequemce between the shine-dalgarno sequence and the start codon of the TRP operon to increase protein expression | |
| HU200366B (en) | Process for producing modified hygromycin resistance gene |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MKLA | Lapsed |
Effective date: 20131223 |