MXPA97006607A - Methods and compositions for secretion of heterolo polypeptides - Google Patents
Methods and compositions for secretion of heterolo polypeptidesInfo
- Publication number
- MXPA97006607A MXPA97006607A MXPA/A/1997/006607A MX9706607A MXPA97006607A MX PA97006607 A MXPA97006607 A MX PA97006607A MX 9706607 A MX9706607 A MX 9706607A MX PA97006607 A MXPA97006607 A MX PA97006607A
- Authority
- MX
- Mexico
- Prior art keywords
- atg
- ttt
- ctt
- seq
- tct
- Prior art date
Links
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 66
- 230000028327 secretion Effects 0.000 title claims abstract description 59
- 239000000203 mixture Substances 0.000 title description 6
- 101710011036 stiI Proteins 0.000 claims description 83
- 150000007523 nucleic acids Chemical class 0.000 claims description 49
- 108020004707 nucleic acids Proteins 0.000 claims description 47
- 229920001850 Nucleic acid sequence Polymers 0.000 claims description 10
- 230000014621 translational initiation Effects 0.000 claims description 9
- 235000019687 Lamb Nutrition 0.000 claims description 5
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 230000003248 secreting Effects 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 235000008469 Oxalis tuberosa Nutrition 0.000 claims 1
- 240000000645 Oxalis tuberosa Species 0.000 claims 1
- 241000124008 Mammalia Species 0.000 abstract description 2
- 101700054060 phoA Proteins 0.000 description 40
- 101710016966 phoA2 Proteins 0.000 description 40
- 150000001413 amino acids Chemical group 0.000 description 35
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 29
- 238000010276 construction Methods 0.000 description 27
- 108020004705 Codon Proteins 0.000 description 22
- 241000237955 Nassarius Species 0.000 description 22
- 241000588724 Escherichia coli Species 0.000 description 20
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 16
- 238000010586 diagram Methods 0.000 description 16
- 102100014231 IGF1 Human genes 0.000 description 12
- 101700074337 IGF1 Proteins 0.000 description 12
- 102100016449 CCL5 Human genes 0.000 description 10
- 108010055166 Chemokine CCL5 Proteins 0.000 description 10
- 102100016021 INSRR Human genes 0.000 description 10
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 9
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 9
- 239000000499 gel Substances 0.000 description 9
- 230000029087 digestion Effects 0.000 description 8
- 230000000694 effects Effects 0.000 description 7
- 239000002609 media Substances 0.000 description 7
- 102000004169 proteins and genes Human genes 0.000 description 7
- 108090000623 proteins and genes Proteins 0.000 description 7
- FPPNZSSZRUTDAP-UWFZAAFLSA-N Carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 6
- 229960003669 Carbenicillin Drugs 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 229920001405 Coding region Polymers 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 229940014598 TAC Drugs 0.000 description 5
- 210000004027 cells Anatomy 0.000 description 5
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 4
- 102000004594 DNA Polymerase I Human genes 0.000 description 4
- 108010017826 DNA Polymerase I Proteins 0.000 description 4
- -1 OmpA Proteins 0.000 description 4
- 229920000978 Start codon Polymers 0.000 description 4
- 102100015249 VEGFA Human genes 0.000 description 4
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 description 4
- 102000006635 beta-Lactamases Human genes 0.000 description 4
- 108020004256 beta-Lactamases Proteins 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- OZAIFHULBGXAKX-UHFFFAOYSA-N precursor Substances N#CC(C)(C)N=NC(C)(C)C#N OZAIFHULBGXAKX-UHFFFAOYSA-N 0.000 description 4
- YXFVVABEGXRONW-UHFFFAOYSA-N toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 4
- 108010000521 Human Growth Hormone Proteins 0.000 description 3
- 102000002265 Human Growth Hormone Human genes 0.000 description 3
- 239000000854 Human Growth Hormone Substances 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000000326 densiometry Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 101500002601 human Epidermal growth factor Proteins 0.000 description 3
- 229940116978 human epidermal growth factor Drugs 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 239000010452 phosphate Substances 0.000 description 3
- 235000021317 phosphate Nutrition 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-Nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 2
- 101700032127 ETX1 Proteins 0.000 description 2
- 101700029730 ETX2 Proteins 0.000 description 2
- 231100000655 Enterotoxin Toxicity 0.000 description 2
- 102100006425 GAPDH Human genes 0.000 description 2
- 101710008404 GAPDH Proteins 0.000 description 2
- 102000015271 Intercellular Adhesion Molecule-1 Human genes 0.000 description 2
- 108010064593 Intercellular Adhesion Molecule-1 Proteins 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N L-serine Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- GUBGYTABKSRVRQ-YOLKTULGSA-N Maltose Natural products O([C@@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@H]1CO)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 GUBGYTABKSRVRQ-YOLKTULGSA-N 0.000 description 2
- 108020004999 Messenger RNA Proteins 0.000 description 2
- 101700080605 NUC1 Proteins 0.000 description 2
- 210000001322 Periplasm Anatomy 0.000 description 2
- OFVLGDICTFRJMM-WESIUVDSSA-N Tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 2
- 229960002180 Tetracycline Drugs 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K [O-]P([O-])([O-])=O Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 108010006025 bovine growth hormone Proteins 0.000 description 2
- 101710023118 btfP Proteins 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 239000000147 enterotoxin Substances 0.000 description 2
- ANUSOIHIIPAHJV-UHFFFAOYSA-N fenticlor Chemical compound OC1=CC=C(Cl)C=C1SC1=CC(Cl)=CC=C1O ANUSOIHIIPAHJV-UHFFFAOYSA-N 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 229920002106 messenger RNA Polymers 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000006011 modification reaction Methods 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101700006494 nucA Proteins 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 239000003638 reducing agent Substances 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 230000002103 transcriptional Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 1
- VHJLVAABSRFDPM-UHFFFAOYSA-N 1,4-dimercaptobutane-2,3-diol Chemical compound SCC(O)C(O)CS VHJLVAABSRFDPM-UHFFFAOYSA-N 0.000 description 1
- RFCVXVPWSPOMFJ-UHFFFAOYSA-N 2-[(2-azaniumyl-3-phenylpropanoyl)amino]-4-methylpentanoate Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N 2-mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-Phosphoglyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- XZKIHKMTEMTJQX-UHFFFAOYSA-N 4-Nitrophenyl dihydrogen phosphate Chemical compound OP(O)(=O)OC1=CC=C([N+]([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-N 0.000 description 1
- 101700080270 ADH2 Proteins 0.000 description 1
- 101700012170 ADH5 Proteins 0.000 description 1
- 101700033574 ADH6 Proteins 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 1
- 238000008940 Alkaline Phosphatase assay kit Methods 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- SJUXYGVRSGTPMC-UHFFFAOYSA-N Asparaginyl-Alanine Chemical compound OC(=O)C(C)NC(=O)C(N)CC(N)=O SJUXYGVRSGTPMC-UHFFFAOYSA-N 0.000 description 1
- 102100003754 CCNL2 Human genes 0.000 description 1
- 101700041552 CCNL2 Proteins 0.000 description 1
- 101710022848 CYPADH Proteins 0.000 description 1
- 229920000062 Coding strand Polymers 0.000 description 1
- GZCGUPFRVQAUEE-KCDKBNATSA-N D-(+)-Galactose Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)C=O GZCGUPFRVQAUEE-KCDKBNATSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 102000019622 EC 2.7.1.2 Human genes 0.000 description 1
- 108010021582 EC 2.7.1.2 Proteins 0.000 description 1
- 102000013009 EC 2.7.1.40 Human genes 0.000 description 1
- 108020005115 EC 2.7.1.40 Proteins 0.000 description 1
- 108010011939 EC 4.1.1.1 Proteins 0.000 description 1
- 102000012288 EC 4.2.1.11 Human genes 0.000 description 1
- 108010022181 EC 4.2.1.11 Proteins 0.000 description 1
- 108010051696 Growth Hormone Proteins 0.000 description 1
- 102000018997 Growth Hormone Human genes 0.000 description 1
- 102000005548 Hexokinases Human genes 0.000 description 1
- 108020002022 Hexokinases Proteins 0.000 description 1
- 102000004218 Insulin-like growth factor I Human genes 0.000 description 1
- 108090000723 Insulin-like growth factor I Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- 101700073929 LEP Proteins 0.000 description 1
- 229940053278 LTA Drugs 0.000 description 1
- GUBGYTABKSRVRQ-UUNJERMWSA-N Lactose Natural products O([C@@H]1[C@H](O)[C@H](O)[C@H](O)O[C@@H]1CO)[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1 GUBGYTABKSRVRQ-UUNJERMWSA-N 0.000 description 1
- JPNRPAJITHRXRH-UHFFFAOYSA-N Lysyl-Asparagine Chemical compound NCCCCC(N)C(=O)NC(C(O)=O)CC(N)=O JPNRPAJITHRXRH-UHFFFAOYSA-N 0.000 description 1
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 1
- HGCNKOLVKRAVHD-RYUDHWBXSA-N Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-RYUDHWBXSA-N 0.000 description 1
- 230000036740 Metabolism Effects 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 241000692885 Nymphalis antiopa Species 0.000 description 1
- 101710042135 PPi-PFK Proteins 0.000 description 1
- 102000035443 Peptidases Human genes 0.000 description 1
- 108091005771 Peptidases Proteins 0.000 description 1
- 102000030951 Phosphotransferases Human genes 0.000 description 1
- 108091000081 Phosphotransferases Proteins 0.000 description 1
- 102000014961 Protein Precursors Human genes 0.000 description 1
- 108010078762 Protein Precursors Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 210000003705 Ribosomes Anatomy 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 229920000401 Three prime untranslated region Polymers 0.000 description 1
- 102000005924 Triose-phosphate isomerases Human genes 0.000 description 1
- 108020003073 Triose-phosphate isomerases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000000295 complement Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000000593 degrading Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic Effects 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 230000002068 genetic Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 230000002414 glycolytic Effects 0.000 description 1
- 239000000122 growth hormone Substances 0.000 description 1
- 239000001963 growth media Substances 0.000 description 1
- 101710007341 hil-6 Proteins 0.000 description 1
- 230000002209 hydrophobic Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 101710018422 lepB1 Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- CSNNHWWHGAXBCP-UHFFFAOYSA-L magnesium sulphate Substances [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000035786 metabolism Effects 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 230000000051 modifying Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 101710036149 pfk-1.2 Proteins 0.000 description 1
- 101700087004 pfkA1 Proteins 0.000 description 1
- 101700035696 pfkB Proteins 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 101700021597 phoE Proteins 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 230000000306 recurrent Effects 0.000 description 1
- 238000003303 reheating Methods 0.000 description 1
- 108091007521 restriction endonucleases Proteins 0.000 description 1
- 101700056160 secE Proteins 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 101710018401 sipP Proteins 0.000 description 1
- 101710018491 sipS Proteins 0.000 description 1
- 101710018425 sipS2 Proteins 0.000 description 1
- 101710018489 sipT Proteins 0.000 description 1
- 101710018502 sipU Proteins 0.000 description 1
- 101710018490 sipV Proteins 0.000 description 1
- 101710018315 sipW Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming Effects 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N α-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
Abstract
The present invention relato the unexpected result that signal sequences and mutants with reduced translation force provide essentially complete processing and high expression levels of a polypeptide of interest and comparison with wild-type signal sequences, and that many polypeptides of mammal require a narrow range of translation levels to obtain maximum secretion. A set of signal sequence vectors provides a range of translation powers to optimize the expression of a polypeptide of interest
Description
METHODS AND CCMPOSICTCMEf3 FOR SECRETION DB QTJ é IDOg
AMPO DE AI TUVEMCTQW
This invention relates to signal sequences for the secretion of heterologous polypeptides from bacteria.
"F- &? CpTPfT" w "" t ^ "« ran-m-arai v TfryTCft RtffACTCM? P.V
The secretion of heterologous polypeptides in the periplasmic space of E. coli and other prokaryotes or in their culture media is subjected to various parameters. Typically, reactors for secretion of a polypeptide of interest are engineered to place the DNA encoding a 5 'secretory signal sequence relative to the DNA encoding the polypeptide of interest. Two main recurrent problems prevent the secretion of such polypeptides. First, the signal sequence is often incompletely processed or removed, and secondly, the amount of secreted polypeptide is often low or undetectable. The charms for solving these problems fall into three main areas: trying several different signal sequences,
REF: 25568 mutating the amino acid sequence of the signal sequence and altering the secretory pathway within the host bacterium. Many signal sequences are available for the first approach, in order to solve the secretion problems. atson (Nucleic Acids Reaearph 12: 5145-5164
(1984)) describes a compilation of signal sequences. U.S. Patent 4,963,495 describes the expression and secretion of mature eukaryotic protein in the periplasmic space of a host organism using a prokaryotic secretion signal sequence DNA attached at its 3 'end to the 5' end of DNA encoding the mature protein. In particular, the DNA encoding the enterotoxin signals of E. coli, especially STII, is preferred. Chang et al. (Gene 55: 189-196 (1987)) describes the use of the STII signal sequence to secrete hGH in E. COli. Gray et al. (Gene 39: 247-245 (1985)) describes the use of the natural signal sequence of human growth hormone and the use of the alkaline phosphatase promoter of E. coli and a signal sequence for the secretion of growth hormone human in E. coli. Wong et al. (Gene 68: 193-203 (1988)) describes the secretion of insulin-like growth factor-1 (IGF-1) fused to the LamB and OmpF secretion leader sequences in E. coli, and the improvement of processing efficiency of these signal sequences in the presence of a prlA4 mutation.
Fujimoto et al. (Bioteph 8: 77-86 (1988)) describes the use of four different E. coli enterotoxin signal sequences, STI, STII, LT-A and LT-B, for the secretion of human epidermal growth factor (hEGF) in E. coli. Denefle et al. (Gene 85: 499-510 (1989)) describes the use of OmpA and phoA signal peptides for the secretion of mature human interleukin lß. The mutagenesis of the signal sequence, in general, has not been especially useful for resolving secretion problems. For example, Morio a-Fujimoto et al. (. Biol. Chem. 266: 1728-1732 (1991)) describes amino acid changes in the LTA signal sequence that increase the amount of human epidermal growth factor secreted in E. coli. Goldstein et al. (J. Bact. 172: 1225-1231 (1990)) describes the substitution of amino acids in the hydrophobic region of OmpA, produces secretion of nuclease A, but not of TEM ß-lactamase. Matteucci et al. (Biotech 4: 51-55 (1986)) describes mutations in the human growth hormone signal sequence that improve the secretion of hGH. Lehnhardt et al. (J. Biol. Chem. 262: 1716-1719 (1987) describes the effect of deletion mutations on the OmpA signal peptide on the secretion of nuclease A and of ß-lactamase TEM.Finally, attempts to improve heterologous secretion In E. coli, by modulating the host's machinery, up to now they have had an improved success in solving secretion problems, for example van Dijl et al. (Mol. Gen. Genet. 227: 40-48 (1991)) describes the effects of the overproduction of the peptidase I signal (SPase I) of E. coli in the processing of precursors Klein et al.
(Protein Engineer n 5: 511-517 (1992) describes the mutagenesis of the LamB signal sequence which has little effect on the secretion of bovine somatotropin, and the bovine somatotropin secretion properties appear to be determined by the mature protein instead of changes in the signal sequence Perez-Perez et al. (Bio / Technology 12: 179-180 (1994)) describes that by providing an E. coli host with additional copies of prlA4 (secY allele) and secE genes, which encode for the major components of the "translocator", ie, the molecular apparatus that physically moves proteins through the membrane, increases the ratio of mature hIL-6 relative to the precursor from 1.2 to 10.8 US Patent Number 5,232,840 describes sites novel binding sites for ribosome useful in improving the production of proteins in bacteria by an improved and / or more efficient translation US Pat. No. 5,082,783 describes s Improved ecretion of heterologous proteins by hosts such as yeast by using promoters of at most intermediate strength with heterologous DNA secretion signal sequences. European patent application number 84308928.5, filed on December 19, 1984, discloses expression elements of promoter-ribosome binding sites of general utility for expression of high level heterologous genes. The present invention describes an unexpected result that altered translation initiation regions with reduced translational force provide essentially complete processing and high levels of secretion of a polypeptide of interest compared to wild-type signal sequences, and that many polypeptides of mammal require a narrow range of translation levels to obtain maximum secretion. A set of vectors with variant translation initiation regions provide a range of translational forces to optimize the secretion of a polypeptide of interest.
BRIEF DESCRIPTION OF IA TUBHd?
One aspect of the invention is a method for optimizing the secretion of a heterologous polypeptide of interest in a cell comprising comparing the expression levels of the polypeptide under the control of a set of nucleic acid variants of a translation start region, in wherein the set of variants represents a range of translational forces, and determining the optimal translational force for the production of mature polypeptide, in which the translational force is less than the translational force of the wild-type translation start region. In a further aspect of the invention, the variants are signal sequence variants, especially variants of the STII signal sequence.
BRIEF, DESCRIPTION DB THE DEESHOS
Figure 1 shows the sequence of the phoA promoter, trp and the Shine-Dalgarno regions of STII and the STII signal sequence. Figure 2 is a diagram showing the relevant characteristics of the plasmid pLS33. Figure 3 is a diagram showing the construction of the library, pSTIIBK. Figure 4 is a graph showing comparison of the level of IGF-1 expression, as measured by the amount of IGF-1 detected in culture supernatants, for pLS33, pSTIIBK # 131 and pSTIIC. Experiments 1 to 8 represent measurements taken on 8 separate dates.
Figure 5 is a diagram showing the construction of the pSTIIC plasmid. Figure 6 is a diagram showing the construction of the plasmid pSTIILys. Figure 7 is a diagram showing the construction of the ppho21 plasmid. Figure 8 is a diagram showing the construction of ppho31 plasmid. Figure 9 is a diagram showing the construction of the ppho41 plasmid. Figure 10 is a diagram showing the construction of the ppho51 plasmid. Figure 11 is a diagram showing the relevant features of the pSTIICBK library. Figure 12 is a diagram showing the construction of the pSTBKphoA library. Figure 13 is a graph showing phoA activity in isolates from the pSTBKphoA library. Figure 14 shows the nucleotide sequences of the STII signal sequence variants listed. Figure 15 is a diagram showing the construction of plasmid pNT3PST116. Figure 16 is a diagram showing the construction of plasmid pST116pho.
Figure 17 is a diagram showing the relevant characteristics of the "category A" plasmids used in the examples. Figure 18 is a diagram showing the relevant characteristics of the "category B" plasmids used in the examples. Figure 19 is a photograph of a polypeptide gel stained with Coomassie blue showing the secretion of extracellular domains 1 and 2 of mature ICAM-1 in E. coli under the control of variant STII signal sequences. The TIR of the relative force 9 is provided by the ppho31 STII variant; the TIR of the relative force 3 is provided by the ppho41 STII variant. The precursor and mature forms of the polypeptide are indicated in the figures. Figure 20 is a photograph of a polypeptide gel stained with Coomassie blue showing the secretion of mature NT3 in E. coli under control of the STII variant signal sequences. The TIR of the relative force 9 is provided by the ppho31 STII variant; the TIR of the relative force 7 is provided by the ppho21 STII variant; the TIR of the relative force 3 is provided by the ppho41 STII variant; the TIR of the relative force 1 is provided by the variant pphodl STII. The mature form of the polypeptide is indicated in the figure.
Figure 21 is a photograph of a polypeptide gel stained with Coomassie blue showing the secretion of mature RANTES in E. coli under control of the signal sequences of the STII variant. When read from the left to the right in the figure, the relative strength IRRs 9 are provided by the ppho31 and pSTBKphoA variants # 116 STII; the TIR of the relative force 7 is provided by the ppho21 STII variant; in the IRR of the relative force 4 it is provided by the variant pSTBKphoA # 81 STII; the relative strength IRR 3 is provided by the ppho41 STII variant; the relative strength IRR 2 is provided by the variant pSTBKphoA # 107 STII; the relative strength IRRs 1 are provided by pSTBKphoA # 86 and by the STII pphodl variants. The mature form of the polypeptide is indicated in the figure.
I DETAI JADA.
A. DBFIMICIQNES
The "translation initiation region" or TIR, as used herein, refers to a region of RNA (or its encoding DNA) determined by the site and efficiency of translation initiation of a gene of interest. (See, for example, McCarthy et al., Trenda in Genetics 6: 78-85 (1990)).
An IRR for a particular gene may extend beyond the ribosome binding site (rbs) to include the 5 'and 3' sequences relative to the rbs. The rbs are defined to include, in a minimal way, the Shine-Dalgarno region and the start codon, plus the bases between them, but may include the expansion of the protected mRNA of the digestion of ribonuclease by bound ribosomes. Therefore, a TIR may include an untranslated leader or the end of an upstream cistron and thus a traditional stop codon. A "secretion signal sequence" or "signal sequence" as used herein, refers to a sequence present in the amino terminus of a polypeptide that directs its translocation through a membrane. Typically, a precursor polypeptide is processed by breaking the signal sequence to generate a mature polypeptide. The term "translational force", as used herein refers to a measurement of a polypeptide secreted in a control system, wherein one or more variants of a TIR are used to direct the secretion of a polypeptide encoded by a indicator gene, and the results are compared with wild-type IRR or with some other control under the same culture and test conditions. For example, in these experiments, the translational force is measured by the use of alkaline phosphatase as the reporter gene expressed under a basal level control for the phoA promoter, in which the secretion of the phoA polypeptide is directed by variants of the STII signal. The amount of mature alkaline phosphatase present in the host is a measure of the amount of secreted polypeptide, and can be quantified in relation to a negative control. Without being limited to one theory, therefore, the "translational force" as used herein may, for example, include a measure of mRNA stability, ribosome binding efficiency at the ribosome site, and a translocation mode through a membrane. As used herein, "polypeptide" generally refers to peptides and polypeptides having at least about 2 amino acids.
B. ÉTODQfi < ? gwgp ^.? eg
The present invention demonstrates that translational force is a critical factor in determining whether many heterologous polypeptides are secreted in significant amounts. Therefore, for a given TIR, a series of amino acid or nucleic acid sequence variants can be generated with a range of translational forces, thereby providing a convenient means by which to adjust this factor for optimal secretion of many different polypeptides. The use of an indicator gene expressed under the control of these variants, such as phoA, provides a method for quantifying the relative translational forces of different translation initiation regions. TIR variants or mutants can be provided in the background of a plasmid vector, whereby a set of plasmids into which a gene of interest can be inserted, and its expression can be measured, so as to establish an optimal range of translational forces for maximum expression of the mature polypeptide. Thus, for example, signal sequences can be used from any prokaryotic or eukaryotic organism. Preferably, the signal sequence is STII, OmpA, phoE, LamB, MBP or phoA. The mutagenesis of the TIR is performed by conventional techniques that result in codon changes which can alter the amino acid sequence, although silent changes in the nucleotide sequence are preferred. Alterations in the IRR may include, for example, alterations in the number or separation of the Shine-Dalgarno sequence, together with alterations in the signal sequence. A preferred method for generating mutant signal sequences is the generation of a "codon library" at the beginning of a coding sequence that does not change the amino acid sequence of the signal sequence (ie, silent changes). This can be done by changing the position of the third nucleotide of each codon; additionally, some amino acids such as leucine, serine and arginine, have first and second multiple positions that can add complexity in the elaboration of the bank. This method of mutagenesis is described in detail in Yansura et al. (METHODS: A Companion to MethoHa in Ensymol 4: 151-158 (1992)). Basically, a DNA fragment encoding the signal sequence at the beginning of the mature polypeptide is synthesized so that the third position (and possibly the first and second, as described above) of each of the first 6 to 12 codons is altered The additional nucleotides downstream of these codons provide a site for the binding of a complementary primer used in making the lower strand. The treatment of the upper coding strand and the lower strand primer with DNA polymerase I (Klenow) will result in a set of DNA duplex fragments containing randomized codons. The primers are designed to contain useful cloning sites that can then be used to insert the DNA fragments into an appropriate vector, thereby allowing amplification of the codon library. Alternative methods include, for example, substitution of all rbs with random nucleotides (Wilson et al., BioTechniques 17: 944-952 (1994)) and the use of phage display libraries (see, for example, Barbas et al., Proc. Nat. Acad. Sci. USA 89: 4457-4461 (1992), Garrard et al., Gene 128: 103-109 (1993)). Typically, the TIR variants will be provided in a plasmid vector with elements suitable for expression of a gene of interest. For example, a typical construct will contain a 5 'promoter relative to the signal sequence, a 3' restriction enzyme recognition site relative to the signal sequence for insertion of a gene of interest or a reporter gene, and a selectable marker , such as a marker of drug resistance, for selection and / or maintenance of the bacteria transformed with the resulting plasmids. Promoters suitable for use with prokaryotic hosts include β-lactamase and lactose promoter system (Chang, et al., Nature 275: 617-624 (1978)).; and Goeddel et al., Nature 281: 544-548 (1979)), the alkaline phosphatase and tryptophan (trp) promoter system (Goeddel, Nucleic Acid Res. 8 (18): 4057-4074 (1980) and EP 36,776 ) and hybrid promoters such as the tac promoter (de Boer et al., Proc. Nati. Acad. Sci. USA. 80: 21-25 (1983).
Promoter sequences suitable for use in yeast hosts include the promoters for 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255 (24): 12073-80 (1980) or other glycolytic enzymes (Hess et al. , -T. Adv. Enzyme Reg. 7: 149-67 (1968)) and Holland Riorhemistry 17: 4900-4907 (1978)), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6. -phosphate isomerase, 3-phosphoglyceratomutase, pyruvate kinase, triosephosphate isomerase, phosphoglucosisomerase and glucokinase. Other yeast promoters, which are inducible promoters that have the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocitochrome C, acid phosphatase, degrading enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde -3-phosphate dehydrogenase and enzymes responsible for the use of maltose and galactose. Suitable vectors and promoters for use in yeast expression are further described in Hitzeman et al., EP 73,657A. Advantageously, yeast promoters used with yeast promoters can also be used advantageously. Any indicator gene can be used which can be quantified in some way. Thus, for example, the production of alkaline phosphatase can be quantified as a measure of the secreted level of the product of the gene £ llQ_A. Other examples include, for example, the genes for β-lactamase. Preferably, a set of vectors with a range of translational forces is generated in which the DNA encoding a polypeptide of interest can be inserted. This limited set provides a comparison of secreted levels of polypeptides. The secreted level of polypeptides can be determined, for example, by functional assays for the polypeptide of interest, if available, by radioimmunoassays
(RIA), enzyme-linked immunoassays (ELISA), or by PAGE and visualization of the correct molecular weight of the polypeptide of interest. Vectors constructed in this way can be used to transform an appropriate host. Preferably, the host is a prokaryotic host. Most preferably, the host is E. coli. Additional details of the invention can be found in the following examples, which further define the scope of the invention. All references mentioned herein are expressly incorporated by reference in their entirety.
EXAMPLES
A. CONSTITUTION OF ÜN
All of the plasmids described in this patent application are constructed from the basic backbone of pBR322 (Sutcliffe, Cold Spring Harb Symp Ouant Biol 43: 77-90 (1978)). Although the gene of interest expressed in each case varies, the transcriptional and translational sequences required for the expression of each gene are provided by the phoA promoter and by the Shine-Dalgarno sequence for trp (Chang et al., Gañí-i 55: 189 -196 (1987)). Additionally, in the indicated cases, a second Shine-Dalgarno sequence is also present, the Shine-Dalgarno sequence for STII (Picken et al., Infect Immun 42 (1): 269-275 (1983)). The secretion of the polypeptide is directed by the STII signal sequence or variants thereof (Picken et al., Tnfect Immun 42 (1): 269-275 (1983)). The phoA promoter, the Shine-Dalgarno sequences for trp and STII and the wild-type STII signal sequence are provided in Figure 1.
B CQMSIRP? PS33 ION
Plasmid pLS33 is derived from phGH1 (Chang et al., Gene 55: 189-196 (1987)), which is constructed for the expression of des (1, 3) -IGF-I. In plasmid pLS33, the gene encoding this insulin-like growth factor-1 version (altered from the original sequence (Elmblad et al., Third European Consensus on Biotechnology III, Weinheim: Verlag Chemie, pp.287-292 ( 1984)) by the removal of the first three amino acids in the N-terminal part) replaces the gene that codes for human growth hormone. The construction of pLS33 maintains the sequences for the phoA promoter, the Shine-Dalgarno regions for trp and STII, and the wild-type STII signal sequence described for phGH1. However, the 3 'end subsequent to the stop codon for des (1,3) -IGF-1 is altered compared to that described for phGH1. In the case of pLS33, immediately downstream of the termination codon a HindIII restriction site has been placed, followed by the methionine start codon of the tetracycline resistance gene of pBR322.
(Sutcliffe, Cold Spring Harb Symp Ouant EIQI 43: 77-90
(1978)). A diagram of plasmid pLS33 is provided in Figure 2.
A plasmid library containing a variable codon library of the STII signal sequence (pSTIIBK) is constructed to examine and search for improved nucleotide sequences of this signal. The vector fragment for the construction of pSTIIBK is generated by isolating the larger fragment when pLS33 is digested with Xbal and BstEII. This vector fragment contains the sequences coding for the phoA promoter, the Shine-Dalgarno sequence for trp and the amino acids 16-67 of des (1,3) -IGF-I. The coding region for amino acids 3-15 of des (1,3) -IGF-I is provided by isolating the DralII-BstEII fragment (approximately 45 bp) from another IGF-I expression plasmid, pLS33LamB. Variations in the nucleotide sequence for the STII signal are derived from two strands of synthetic DNA listed below:
'- GCATGTCTAGAATT ATG AA £ AA £ A? X MU QOH SS CJÜ Ü GCN? Cfl ATG m GJTJJ TC X £ H? IH GCT ACA AAC GCG TAT GCC ACTCT - 3' (SEQ ID NO: l) 3 '- CGA TGT TTG CGC ATA CGG TGAGACACGCCACGACTT -5' (SEQ ID NO: 2) R: A, GY: T, CH; A, T, C N: G, A, T, C
These two strands of synthetic DNA are aligned or spliced and treated with DNA polymerase I (Klenow fragment) to form duplex DNA of approximately 101 bp. This duplex DNA is then digested with Xbal and DralII to generate the approximately 82 bp fragment encoding the STII signal sequence with variable codons and the first two amino acids of des (1,3) -IGF-I. Then these fragments are ligated together as shown in Figure 3 to build the library, pSTIIBK.
? B PSTIIBKÉ1231
The plasmid library containing a variable codon library of the STII signal sequence (pSTIIBK) is examined for enhanced growth of transformants and increased secretion of IGF-1. Basically, the plasmids are transformed into the 27C7 host chain (see below) and examined for their increased ability to grow in a medium low in phosphate (see Chang et al., Supra) plus carbenicillin (50 μg / ml) based to DO600 cell density measurements. The candidate colonies are tested to determine increased levels of IGF-1 secretion as follows. The colonies are inoculated in 3-5 ml of LB plus carbenicillin (50 μg / ml) and grown at 37 ° C with shaking for about 5-15 hours. The cultures are diluted 1: 100 in 1-3 ml of low medium in phosphates plus carbenicillin (50 μg / ml) and induced with shaking for 24 hours at 37 ° C. The induced cultures are centrifuged in microcentrifuge tubes for 5 minutes. The supernatants are diluted in IGF RIA diluent and stored at -20 ° C until the assay is performed. The amount of IGF-1 secreted in the medium is measured by radioimmunoassay. The expression level of IGF-1, measured by the amount of IGF-1 detected in culture supernatants, is compared for pLS33, pSTIIBK # 131 and pSTIIC, in Figure 4. Variant # 131 consistently improves IGF expression -1 with respect to the "original" or wild-type STII signal sequence. pSTIIC shows some slight improvement in expression with respect to the wild-type sequence. pSTIIBK # 131 differs from wild-type STII in 12 codons and in the deletion of a Shine-Dalgarno sequence. pSTIIC is constructed as described below as a control plasmid having only one Shine-Dalgarno sequence and three codon changes near the end of the 3 'part of the signal.
E. Q3MSTRDCCIÓW OF pSHIg
In pSTIIC, the sequence of
STII Shine-Dalgarno from plasmid pLS33. In addition, by incorporating silent mutations near the 3 'end of the STII signal, an Mlul site is produced in pSTIIC. The identical fragments described for the construction of pSTIIBK (the pLS33 vector and a DralII-BstEII fragment of approximately 45 bp from pLS33LamB) are used for the construction of this plasmid. However, the synthetic DNA differs from that described above for the construction of pSTIIBK. For the construction of pSTIIC, the synthetic DNA encoding the signal sequence of STII and the first two amino acids of des (1, 3) -IGF-1 is as follows:
'- CTAGAATT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT 3' - TTAA TAC TTT TTC TTA TAG CGT AAA GAA GAA CGT AGA TAC AAG CAÁ
Mlul TTT TCT ATT GCT ACA AAC GCG TAT GCC ACTCT - 3 '(SEQ ID NO: 3) AAA AGA TAA CGA TGT TTG CGC ATA CGG TG - 5' (SEQ ID NO: 4)
These fragments are ligated together as illustrated in Figure 5 to construct the plasmid pSTIIC.
F. CQlsyrKUCICN OF pSTIIIga
The plasmid pSTIILys contains a STII signal sequence that differs from the signal sequence of pSTIIC only in a nucleotide change at the position of the second codon. This signal sequence is constructed from synthetic DNA and placed in a vector based on pBR322 for the expression of the RANTES polypeptide (Schall et al., Im unol 141 (3): 1018-1025 (1988). Xbal-Mlul vector for this construct is isolated from plasmid pBK131Ran (a derivative of plasmid pSTIIBK # 131 with the gene encoding RANTES substituting the gene coding for des (1,3) -IGF-1.) This vector contains the promoter phoA, the Shine-Dalgarno sequence for trp, the last three amino acids of the STIIC signal sequence and the gene encoding the RANTES polypeptide As illustrated in figure 6, this fragment is then ligated with the following strands of DNA Synthetic to build the pSTIILys plasmid
(SEQ ID NO: 3):
'- CTAGAATT ATG AAG AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT 3 '-TTAA TTC TTC TTC TAG TAG CGT AAA GAA GAA CGT AGA TAC AAG CA TTT TCT ATT GCT ACA AA - 3' (SEQ ID. NO: 5) AAA AGA TAA CGA TGT TTG CGC-51 (SEQ ID NO: 6) G. caNSTagxxad PLASMIDES OF PHOSEAIASA
ALKALINE
In order to determine the quantitative TIR value for each of the described STII signal sequences, the alkaline phosphatase gene for E. coli is used as the reporter gene. In each of these constructs, the phoA gene is placed downstream with respect to the phoA promoter, the Shp-Dalgarno sequence of trp and a version of the STII signal sequence. Plasmids ppho21, ppho31, ppho41 and pphodl, contain the signal sequences derived from pSTIIC, pLS33, pSTIIBK # 131 and pSTIILys, respectively. In the case of ppho31, the construction also contains the Shine-Dalgarno STII region.
H. LUNtfi'U 'lON DB pphtál
The vector fragment for the construction of ppho21 is produced by digesting pBR322 with EcoRI and BamHl, and isolating the largest fragment. The phoA promoter, the Shp-Dalgarno sequence of trp and the signal sequence STII (amino acids 1-20) are provided by isolation of an approximately 484 bp fragment from pCN131Tsc followed by digestion with EcoRI and Mlul. An identical fragment of approximately 484 bp can also be generated from pSTIIC, a plasmid which has been previously described. The phoA gene fragment (approximately 1430 bp) coding for amino acids 24-450 of alkaline phosphatase is generated from plasmid pb0525 after digestion with Bspl286 and BamHI (Inouye et al., J. Bacteriol 146 (2): 668-675 (1981)). This Bspl286-BamHI fragment also contains 142 bp of SV40 DNA (Fiers et al., Nature 273: 113-120 (1978)) after the stop codon of alkaline phosphatase. Synthetic DNA is used to bind the STII signal sequence to the phoA gene. The sequence of this DNA encoding the last three amino acids of the signal sequence of STII and amino acids 1-23 of alkaline phosphatase is as follows:
'-CGCGTATGCCCGGACACCAGAAATGCCTGTTCTGGAAAACCGGGCTGCTCAGGGCGATATTACTG
3 '-ATACGGGCCTGTGGTCTTTACGGACAAGACCTTTTGGCCCGACGAGTCCCGCTATAATGAC CACCCGGCGGTGCT-3' (SEQ ID NO: 7) GTGGGCCGCC-5 '(SEQ ID NO: 8)
In order to facilitate the construction of this plasmid, the synthetic DNA is pre-ligated to the fragment
EcoRI-MluI of pCN131Tsc. This pre-ligation generates a new fragment of approximately 575 bp. As illustrated in Figure 7, the fragment generated from the pre-ligation is then ligated together with the other fragments described to build ppho21.
I. UU STRÜ ION DB ppho31
The vector fragment for the construction of this plasmid is the identical vector described for ppho21. The phoA promoter, the Shp-Dalgarno sequence of trp, the Shine-Dalgarno sequence of STII and the signal sequence of STII (amino acids 1-20) are generated from pJAL55. The necessary fragment (approximately 496 bp) of pJAL55 is isolated after digestion with EcoRI and Mlul. This EcoRI-MluI fragment only differs from the same region of pLS33 by an engineered site Mlul starting at amino acid 20 of the STII signal sequence (as described for pSTIIC). The last three amino acids of the STII signal sequence and the sequence encoding the phoA gene are provided by digesting the ppho21 plasmid with Mlul and BamHl, and isolating the approximately 1505 bp fragment. These fragments are linked together, as shown in Figure 8 to produce ppho31.
J. UNS RÜCCICB
The vector fragment for the construction of this plasmid is the identical vector described for ppho21. The phoA promoter, the Shp-Dalgarno sequence of trp and the STII signal sequence with the sequence codons pSTIIBK # 131 (amino acids 1-20) is provided by isolation of an EcoRI-MluI fragment of approximately 484 bp of pNGF131. An identical fragment can also be generated from pSTIIBK # 131. The last three amino acids of the STII signal sequence and the sequence encoding the phoA gene are provided by digesting the ppho21 plasmid with Mlul and BamHl, and isolating a fragment of approximately 1505 bp. As illustrated in Figure 9, these three fragments are then linked together to build ppho41.
K. atfl'KU JlQN DB pphoSl
The vector fragment for the construction of pphodl is generated by digesting the plasmid pLS18 with Xbal-BamHl and isolating the largest fragment. Plasmid pLS18 is a derivative of phGH1 (Chang et al., Gene 55: 189-196 (1987)) and an identical vector may have been generated which has used phGH1 instead of pLS18. This Xbal-BamHl vector contains the phoA promoter and the Shine-Dalgarno trp sequence. The STII signal sequence (amino acids 1-20) with the pSTIILys codons is provided by isolating a fragment of approximately 67 bp generated when pSTIILys are digested with Xbal and Mlul. The last three amino acids of the STII signal sequence and the sequence encoding the phoA gene are provided by digesting the ppho21 plasmid with Mlul and BamHl and isolating the approximately 150.5 bp fragment. In Figure 10 a diagram for the construction of pphodl is provided.
L. LOLStif MlK-iLlLQN DB fPTTTf? Y
A second variable codon library of the STII signal sequence, pSTIICBK, is constructed. This second codon library is designated solely to focus the codons closest to the start codon, met of the STII signal sequence. As illustrated in Figure 11, pSTIICBK is a plasmid based on pBR322 that contains the gene encoding the RANTES polypeptide (Schall et al.,? L_
Tmmunol 141 (3): 1018-1025 (1988)) under the control of the phoA promoter and the Shp-Dalgarno sequence of trp.
In this plasmid, the secretion of RANTES is driven by a STII signal sequence codon library derived from the following two strands of synthetic DNA: 5 '- GCATGTCTAGAATT ATG AAR AA £ MI? IH flCU TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCC-3 '(SEQ ID NO: 9) 3' -AGA TAA CGA TGT TTG CGC ATA CGG TGA -5 '(SEQ ID NO: 10) R: A, GY: T, CH: A, T, CN: G, A, T, c
These two strands of synthetic DNA are annealed and treated with DNA polymerase I (Fragment Klenow) to form DNA duplex of approximately 86 bp. This duplex DNA is then digested with Xbal and Mlul to generate an approximately 67 bp fragment encoding the first 20 amino acids of the STII signal sequence with variable codons at positions 2-6.
M. aa 'HU LQN DB pgTMtfrhoA
To increase the number of available STII signal sequences with different relative TIR forces, a convenient method of examining the codon library of pSTIICBK is required. Plasmid pSTBKphoA is constructed as a solution to this problem. In the pSTBKphoA plasmid, the STII codon library of pSTIICBK is inserted upstream of the phoA gene and downstream of the phoA promoter, and the Shpn-Dalgarno sequence of trp. In this way, the activity of phoA provides a means by which to discriminate or determine the difference between the various versions of the STII signal sequences. The vector fragment of this construct is generated by isolation of the larger fragment when pl31TGF is digested with Xbal and BamHl. An identical vector can also be generated from phGH1 (Chang et al., Gene 55: 189-196 (1987)). This vector contains the phoA promoter and the Shine-Dalgarno trp sequence. The codon library of the STII signal sequence is provided by isolating a fragment of approximately 67 bp generated from pSTIICBK after digestion with Xbal and Mlul. The last three amino acids of the STII signal sequence and the sequence encoding the phoA gene are provided by digesting ppho21 with Mlul and BamHl, and isolating a fragment of approximately 1505 bp. As illustrated in Figure 12, the fragments are then ligated together to construct pSTBKphoA.
N. SrCT.tyyrr? W DB pSTBRbhoA #a ?. ace. ? n? . n *
Plasmids pSTBKphoA # 81, 86, 107, 116 are selected from the codon library of pSTBKphoA based on their baseline level of phoA activity (Figure 13). As listed in Figure 14, each has a different nucleotide sequence that codes for the STII signal sequence.
O. r? TMH Pn? Pptfiw tm pan i «r > * >
This version of the STII signal sequence, ST116, combines the double sequence of Shine-Dalgarno described by Chang et al. (Gene 55: 189-196 (1987)) with the codons of the STII sequence selected from pSTBKphoA # 116. This signal sequence is initially constructed in a plasmid designed for the secretion of the NT3 pro region (pNT3PST116) and then transferred to a plasmid containing the phoA gene to obtain the relative TIR measurement (pST116pho).
P. CCNS ^ RDCTION DB pMT3PSTllg
The vector for this construct is generated by digesting plasmid pLS18 with Xbal and BamHl and isolating the largest fragment. Plasmid pLS18 is a derivative of phGH1 (Chang et al., Gene 55: 189-196 (1987)) and an identical vector could be generated from phGH1. This Xbal-BamHl vector contains the phoA promoter and the Shp-Dalgarno sequence of trp. A fragment (approximately 682 bp) containing the last three amino acids of the signal sequence of STII and the coding region for amino acids 19-138 of proNT3 (Jones et al., Proc Nati Acad Sci 87: 8060-8064 (1990) ) is generated from the pNT3P plasmid after digestion with Mlul and BamHl. Plasmid pNT3P is a plasmid based on pBR322 containing the phoA promoter, the STIIBK version # 131 of the STII signal sequence and the coding region for amino acids 19-138 of proNT3. The synthetic DNA strands listed below provide the sequence for the Shine-Dalgarno sequence of STII and the first 20 amino acids of the STII signal sequence:
'-CTAGAGGTTGAGGTGATTTT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT 3' - TCCAACTCCACTAAAA TAC TTT TTT TTG TAG CGT AAA GAA GAA CGT AGA ATG TTC GTT TTT TCT ATT GCT AA AA - 3 '(SEQ ID NO: 11) TAC AAG CAA AAA AGA TAA CGA TGT TTG CGC - 51 (SEQ ID NO: 12)
These fragments are then ligated together as shown in Figure 15 to construct pNT3PSTH6.
Q. CQM3IBPCCIQ DB pSTI 1 fiphn
The vector for the construction of this plasmid is the identical vector described for the construction of pNT3PST116. The Shine-Dalgarno sequence of STII and the first 20 amino acids of the STII signal sequence are generated
(codons pSTBKphoA # 116) by isolating a fragment of approximately 79 bp from pNT3PST116 subsequent to digestion with Xbal and Mlul. The last three amino acids of the STII signal sequence and the sequence encoding the gene for phoA are isolated from pSTBKphoA # 116 after digestion with Mlul and BamHl (a fragment of approximately 1505 bp). As illustrated in Figure 16, ligation or binding of these three fragments results in the construction of pST116pho.
II. ALKALINE PHOSPHATASE ASSAY
In these experiments, TIR constructs altered using the phoA reporter gene are tested to determine their relative translational forces by a modification of the method of Amemuura et al. (J. Bacteriol., 152: 692-701, 1982). Basically, the method used is as follows. Plasmids that exhibit altered sequences, either in the IRR, the Shine-Dalgarno region, the nucleotide sequence between the Shine-Dalgarno region and the start codon of the signal sequence, or the signal sequence itself, and whether they are variants of the amino acid sequence or variants of the nucleotide sequence, they are used to transform E. coli, strain 27C7 (ATCC 55,244) although any phoA "strain of E. coli can be used." Transforming colonies are inoculated in medium. Luria-Bertani (LB) plus carbenicillin (50 μg / ml, Sigma, Ine) The cultures are grown at 37 ° C with shaking for 4-8 h The equivalent of 1 OD600 of each culture is centrifuged, then resuspended in 1 ml of strict AP medium (0.4% glucose, 20 mM NH4C1, 1.6 mM MgSO4, 50 mM KCl, 20 mM NaCl, 120 mM triethanolamine, pH 7.4) plus carbenicillin (50 μg / ml). The mixtures are then placed immediately at -20 ° C overnight. After reheating, a drop of toluene is added to 1 μl of reheated culture. After vortexing, the mixtures are transferred to 16 x 125 mm test tubes and aerated in a wheel at 37 ° C for 1 hour. Then 40 μl of each culture treated with toluene is added to 1 ml of 1M Tris-HCl, pH 8, plus PNPP (disodium 4-nitrophenylphosphate hexahydrate) 1 mM and left at room temperature for 1 hour. The reactions are stopped by adding 100 ml of 1 M sodium phosphate, pH 6.5. The DO410 is measured in the next 30 minutes. The enzymatic activity is calculated as micromoles of p-nitrophenol released per minute, per one equivalent of DO600 cells. The results are summarized in Table 1.
Table 1. Determination of the Relative Strength of the IRR: Use of phoA as an Indicator Gene
TIR Deviation Activity Relative Strength phoA1 standard pBR322 0.0279 0.0069 pphodl * 0.0858 0.0165 1 pSTBKp oA # 86 0.1125 0.0246 1 pSTBKphoA # 107 0.1510 0.0267 2 pphc-413 0.1986 0.0556 3 pSTBKphoA # 81 0.2796 0.0813 4 ppho214 0.4174 0.1145 7 pSTBKph? A # 116 0.5314 0.1478 9 ppho315 0.5396 0.0869 9 pST116pho 0.7760 0.1272 13
^ icromoles of p-nitrophenol / min / cells DO600 2 same variant of STII as pSTIILys 3 same variant of STII as pSTIIBK # 131 4 same variant of STII as STIIC 5 STII wild type + site Mlul, last codon GCC,
III. SECRETION OF EXAMPLES OF HETEROLLIC POLYPEPTIDES
The plasmids used in these examples are all very similar in design to those described above. Instead of describing each construction in detail, the expression plasmids are described here in general terms. Although different polypeptides of interest are expressed in each example, the only significant variation between these constructs is the nucleotide sequence subsequent to the 3 'end of each coding region. Therefore, for descriptive purposes, these plasmids are generally grouped into the following two categories, based on their 3 'sequence:
Category A: Within about 25 bp, 3 'to the stop codon of each gene of interest begins the sequence encoding the transcriptional terminator described by Scholtissek and Grosse (Nucleic Acid Res. 15 (7): 3185 (1987)) followed by the tetracycline resistance gene of pBR322 (Sutcliffe, Cold Spring Harb Sy p Ouant Biol 43: 77-90 (1978)). Examples in this category include plasmids designed for the secretion of mature NGF (Ullrich et al., Nature 303: 821-825 (1983)),
TGF-ßl MADURO (Derynck et al., Nature 316: 701-705 (1985)) and domains 1 and 2 of ICAM-1 (Staunton et al., Cell 52: 925-933
(1988). A schematic representation of these plasmids is provided in Figure 17.
Category B_ Examples in this category include plasmids designed for the secretion of mature VEGF (Leung et al., Science 246: 1306-1309 (1989)), mature NT3 (Jones et al., Proc. Nati. Acad. Sci. USA 87: 8060-8064 (1990)), RANTES (Schall et al., J. Immunol 141 (3): 1018-1025 (1988)) and phoA. The termination codon of each of these plasmids is followed in the 3 'direction by a segment of untranslated DNA (VEGF: approximately 43 bp, mature NT3, approximately 134 bp, RANTES: approximately 7 bp, phoA: approximately 142 bp) . After this 3 'untranslated region, the pBR322 sequence is restarted starting with the HindIII site (as in the mature NT3 secretion plasmid) or the BamH1 site (phoA secretion plasmids, VEGF, RANTES). A schematic representation of the plasmids included in this category is illustrated in Figure 18. These plasmids are used to transform the host E. coli, strain 27C7. The transformant colonies are inoculated in 3-5 ml of LB + carbenicillin (50 μg / ml). The cultures are grown at 37 ° C with shaking for 3-8 h. The cultures are then diluted 1: 100 in 3 ml of low phosphate medium (Chang et al., Supra) and induced for approximately 20 hours with shaking at 37 ° C. For each growth culture, an aliquot of DO600 0.5 is centrifuged in a microcentrifuge tube. Then each DO6000.5 pellet is prepared for gel analysis as follows. Each pellet is resuspended in 50 μl of TE (10 mM Tris, pH 7.6, 1 mM EDTA). After the addition of 10 μl of 10% SDS, and 5 μl of reducing agent (1 M dithiothreitol or 1 M β-mercaptoethanol), the samples are heated to about 90 ° C for 2 minutes and then vortexed. The samples are allowed to cool to room temperature, after which 500 μl of acetone is added. The mixtures are vortexed and then allowed to stand at room temperature for about 15 minutes. The mixtures are centrifuged for 5 minutes. Supernatants are discarded, and the pellets are resuspended in 20 μl of water, 5 μl of reducing agent, 25 μl of sample buffer NOVEX 2X. The samples are heated to approximately 90 ° C for 3-5 minutes, and then vortexed. After centrifugation for 5 minutes, the supernatants are transferred to clean tubes and the pellets are discarded. 5-10 μl of each sample is loaded into 10 wells, 1.0 mm of NOVEX manufactured gel (San Diego, CA) and electrophoresed for 1.5-2 hours at 120 volts. The gels are stained with Coomassie blue to visualize the polypeptide (Figures 19-21). To provide additional quantification of the results, some gels are analyzed by densitometry. These results are shown in table 2 below. Both results of polypeptide gels and densitometry indicate that the heterologous polypeptides tested are secreted in a consistently more efficient manner when a STII variant of reduced translational force is used to direct the secretion of that polypeptide.
Table 2. Examples of Enhanced Polypeptide Secretion by Modification of the? R Densitometry Tests of the Polypeptide Gels
Polypeptide R (Relative Strength) Secreted Quantity (Total% of the host polypeptide VEGF 9 0.6 3 5.9 NGF 9 1.6 7 1.8 4 5.7 1 5.5 RANTES 9 0.3 9 0.2 7 0.4 4 3.9 3 3.6 2 3.5 1 * 1.6 1 1.7 GF-ßl 7 1.7 3 9.2 signal awareness * pSTBKphoA # il? SEQUENCE LIST t
(1. GENERAL INFORMATION:
(i) APPLICANT: Genentech, Inc.
(ii) TITLE OF THE INVENTION: Methods and compositions for the secretion of heterologous polypeptides
(iii) NUMBER OF SEQUENCES: 23
. { iv) CORRESPONDENCE ADDRESS: (A) RECIPIENT: Genentech, Inc. (B) STREET: 460 Point San Bruno Blvd (C) CITY: South San Francisco (D) STATE: California (E) COUNTRY: USA (F) AREA POSTCARD: 94080
(v) READABLE FORM OF THE COMPUTER: (A) TYPE OF MEDIUM: 3.5 inches, 1.44 Mb flexible disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS / MS-DOS (D) SOFTWARE: WinPatin (Genentech)
(vi) CURRENT APPLICATION DATA: (A) APPLICATION NUMBER: (B) SUBMISSION DATE: (C) CLASSIFICATION:
(vii) PREVIOUS APPLICATION DATA: (A) NUMBER OF APPLICATION: 08/398615 (B) DATE OF SUBMISSION: 01-MARCH-1995
(viii) ATTORNEY / AGENT INFORMATION: (A) NAME: Lee, Wendy M. (B) REGISTRATION NUMBER: 00,000 (C) REFERENCE NUMBER / FILE: P0889
(ix) TELECOMMUNICATION INFORMATION: (A) TELEPHONE: 415 / 225-1994 (B) TELEFAX: 415 / 952-9881 (C) TÉLEX: 910 / 371-7168
(2) INFORMATION FOR SEC. FROM IDENT. NO: l: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 88 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: l:
GCATGTCTAG AATTATGAAR AARAAYATHG CNTTYCTNCT NGCNTCNATG 50
TTYGTNTTYT CNATHGCTAC AAACGCGTAT GCCACTCT 88
(2) INFORMATION FOR SEC. FROM IDENT. NO: 2:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 36 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 2:
TTCAGCACCG CACAGAGTGG CATACGCGTT TGTAGC 36 (2) INFORMATION FOR SEC. FROM IDENT. NO: 3:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 82 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 3:
CTAGAATTAT GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT 50
TTTTCTATTG CTACAAACGC GTATGCCACT CT 82
(2) INFORMATION FOR SEC. FROM IDENT. NO: 4:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 75 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 4:
GTGGCATACG CGTTTGTAGC AATAGAAAAA ACGAACATAG ATGCAAGAAG 50
AAATGCGATA TTCTTTTTCA TAATT 75
(2) INFORMATION FOR SEC. FROM IDENT. NO: 5:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 67 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 5:
CTAGAATTAT GAAGAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT 50
TTTTCTATTG CTACAAA 67
(2) INFORMATION FOR SEC. FROM IDENT. NO 6:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 67 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO 6:
CGCGTTTGTA GCAATAGAAA AAACGAACAT AGATGCAAGA AGAAATGCGA 50
TATTCTTCTT CATAATT 67
(2) INFORMATION FOR SEC. FROM IDENT. NO: 7:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 79 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 7:
CGCGTATGCC CGGACACCAG AAATGCCTGT TCTGGAAAAC CGGGCTGCTC 50
AGGGCGATAT TACTGCACCC GGCGGTGCT 79 (2) INFORMATION FOR SEC. FROM IDENT. NO: 8:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 71 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 8:
CCGCCGGGTG CAGTAATATC GCCCTGAGCA GCCCGGTTTT CCAGAACAGG 50
CATTTCTGGT GTCCGGGCAT A 71
(2) INFORMATION FOR SEC. FROM IDENT. NO: 9:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 83 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 9:
GCATGTCTAG AATTATGAAR AARAAYATHG CNTTTCTTCT TGCATCTATG 50
TTCGTTTTTT CTATTGCTAC AAACGCGTAT GCC 83
(2) INFORMATION FOR SEC. FROM IDENT. NO: 10:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 27 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 10:
AGTGGCATAC GCGTTTGTAG CAATAGA 27
(2) INFORMATION FOR SEC. FROM IDENT. NO: 11:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 79 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear (xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 11:
CTAGAGGTTG AGGTGATTTT ATGAAAAAAA ACATCGCATT TCTTCTTGCA 50 TCTATGTTCG TTTTTTCTAT TGCTACAAA 79
(2) INFORMATION FOR SEC. FROM IDENT. NO: 12:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 79 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Simple (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 12:
CGCGTTTGTA GCAATAGAAA AAACGAACAT AGATGCAAGA AGAAATGCGA 50
TGTTTTTTTTT CATAAAATCA CCTCAACCT 79
(2) INFORMATION FOR SEC. FROM IDENT. NO: 13:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 506 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 13:
GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100
GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 150
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 250
GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 350
ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GTACGCAAGT 400
TCACGTAAAA AGGGTATCTA GAGGTTGAGG TGATTTTATG AAAAAGAATA 450 TCGCATTTCT TCTTGCATCT ATGTTCGTTT TTTCTATTGC TACAAATGCC 500
TATGCA 506
(2) INFORMATION FOR SEC. FROM IDENT. NO: 14:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 23 amino acids (B) TYPE: Nucleic acid (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 14:
Met Lys Lys Asn lie Wing Phe Leu Leu Wing Being Met Phe Val Phe 1 5 10 15
Ser lie Ala Thr Asn Ala Tyr Ala 20 23
(2) INFORMATION FOR SEC. FROM IDENT. NO: 15:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 90 base pairs (B> TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 15:
TCTAGAGGTT GAGGTGATTT TATGAAAAAG AATATCGCAT TTCTTCTTGC 50
ATCTATGTTC GTTTTTTTCTA TTGCTACAAA YGCSTATGCM 90
(2) INFORMATION FOR SEC. FROM IDENT. NO: 16:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 16:
TCTAGAATTA TGAAAAAGAA TATCGCATTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78 (2) INFORMATION FOR SEC. FROM IDENT. NO: 17:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 17:
TCTAGAATTA TGAAGAAGAA TATTGCGTTC CTACTTGCCT CTATGTTTGT 50
CTTTTCTATA GCTACAAACG CGTATGCM 78
(2) INFORMATION FOR SEC. FROM IDENT. NO: 18:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 18:
TCTAGAATTA TGAAGAAGAA TATCGCATTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78
(2) INFORMATION FOR SEC. FROM IDENT. NO: 19:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (OR TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 19:
TCTAGAATTA TGAAAAAAAA CATCGCATTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78
(2) INFORMATION FOR SEC. FROM IDENT. NO: 20:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 20:
TCTAGAATTA TGAAAAAAAA CATTGCCTTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78
(2) INFORMATION FOR SEC. FROM IDENT. NO: 21:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 21:
TCTAGAATTA TGAAGAAAAA CATCGCTTTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78 (2) INFORMATION FOR SEC. FROM IDENT. NO: 22:
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 78 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT. NO: 22:
TCTAGAATTA TGAAAAAGAA CATAGCGTTT CTTCTTGCAT CTATGTTCGT 50
TTTTTCTATT GCTACAAACG CGTATGCM 78
(2) INFORMATION FOR SEC. FROM IDENT. NO: 23
(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 90 base pairs (B) TYPE: Nucleic acid (C) TYPE OF HEBRA: Double (D) TOPOLOGY: Linear
(xi) DESCRIPTION OF THE SEQUENCE: SEC. FROM IDENT.
TCTAGAGGTT GAGGTGATTT TATGAAAAAA AACATCGCAT TTCTTCTTGC 50
ATCTATGTTC GTTTTTTTCTA TTGCTACAAA CGCGTATGCM 90
It is noted that in relation to this date, the best method known to the applicant to carry out the aforementioned invention, is that which is clear from the present description of the invention. Having described the invention as above, property is claimed as contained in the following:
Claims (21)
1. A method for optimizing the secretion of a heterologous polypeptide of interest in a cell, characterized in that it comprises comparing the expression levels of the polypeptide under the control of a set of nucleic acid variants of a translation initiation region, wherein the of variants represents a range of translational forces, and determining the optimal translation force for the production of a mature polypeptide, in which the optimal translational force is less than the translational force of the wild-type translation start region.
2. A method for secreting a heterologous polypeptide of interest in a cell, characterized in that it comprises using a variant of the translation initiation region operably linked to nucleic acid encoding the heterologous polypeptide, to secrete the heterologous polypeptide, wherein the The translational strength of the variant translation initiation region is less than the translational strength of the wild type translational initiation region.
3. The method according to claim 1 or 2, characterized in that the variant does not alter the amino acid sequence of the translational start region.
4. The method according to claim 1 or 2, characterized in that the amount of polypeptide secreted when the nucleic acid is operably linked to the variant is greater than the amount of secreted polypeptide when the nucleic acid is operably linked to the region start of wild type translation.
5. The method according to claim 1 or 2, characterized in that the variants comprise nucleic acid variants of a secretion signal sequence.
6. The method according to claim 5, characterized in that the secretion signal sequence is a prokaryotic secretion signal sequence.
7. The method according to claim 6, characterized in that the variant secretion signal sequences are variants of STII, OmpA, PhoE, LamB, MBP or PhoA.
8. The method according to claim 7, characterized in that the variant signal sequence is a variant of STII.
9. The method according to claim 8, characterized in that the variant is selected from the group consisting of: 5 'TCTAGAGGTTGAGGTGATTTT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ATA GCS TAT GCM 3' (SEQ ID NO: 15); 5 * TCTAGAATT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT HERE AAC GCG TAT GCM 3 '(SEQ ID NO: 16); 5 'TCTAGAATT ATG AAG AAG AAT ATT GCG TTC CTA CTT GCC TCT ATG TTT GTC TTT TCT ATA GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 17); 5 'TCTAGAATT ATG AAG AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 18); 5 'TCTAGAATT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 19); 5 • TCTAGAATT ATG AAA AAA AAC ATT GCC TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 31 (SEQ ID NO: 20); 5 'TCTAGAATT ATG AAG AAA AAC ATC GCT TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3 * (SEQ ID NO: 21); 5 'TCTAGAATT ATG AAA AAG AAC ATA GCG TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3"(SEQ ID NO: 22); 5 'TCTAGAGGTTGAGGTGATTTT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 23).
10. A nucleic acid encoding a variant of translational start region, characterized in that the translational force of the variational translational start region is less than the translational strength of the wild-type translational start region.
11. A nucleic acid encoding a polypeptide operably linked to a translational start region variant, characterized in that the translational force of the variational translational start region is less than the translational strength of the wild-type translational start region.
12. The nucleic acid according to claim 10 or 11, characterized in that the variant does not alter the amino acid sequence of the translational start region.
13. The nucleic acid according to claim 10 or 11, characterized in that the amount of polypeptide secreted when the nucleic acid is operably linked to the variant is greater than the amount of secreted polypeptide when the nucleic acid is operably linked to the nucleic acid. wild type translation start region.
14. The nucleic acid according to claim 10 or 11, characterized in that the variants comprise nucleic acid variants of a secretion signal sequence.
15. The nucleic acid according to claim 10 or 11, characterized in that the secretion signal sequence is a prokaryotic secretion signal sequence.
16. The nucleic acid according to claim 15, characterized in that the variant secretion signal sequences are variants of STII, OmpA, PhoE, LamB, MBP or PhoA.
17. The nucleic acid according to claim 16, characterized in that the variant secretion signal sequence is a variant of STII.
18. The nucleic acid according to claim 17, characterized in that the variant is selected from the group consisting of: 5 'TCTAGAGGTTGAGGTGATTTT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ATA GCS TAT GCM 3 * (SEQ ID NO: 15); 5 'TCTAGAATT ATG AAA AAG AAT ATC GCA TTT CTT CTT OCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 16); 5 'TCTAGAATT ATG AAG AAG AAT ATT GCG TTC CTA CTT GCC TCT ATG TTT GTC TTT TCT ATA GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 17); 5 'TCTAGAATT ATG AAG AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 18); 5 'TCTAGAATT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 19); 5 'TCTAGAATT ATG AAA AAA AAC ATT GCC TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 20); 5 • TCTAGAATT ATG AAG AAA AAC ATC GCT TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3 '(SEQ ID NO: 21); 5 'TCTAGAATT ATG AAA AAG AAC ATA GCG TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 22); Y 5"TCTAGAGGTTGAGGTGATTTT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT HERE AAC GCG TAT GCM 3 • (SEQ ID NO: 23).
19. An expression vector characterized in that it comprises the nucleic acid according to any of claims 10 to 18, operably linked to additional elements for expression for a gene of interest.
20. A host cell, characterized in that it comprises the nucleic acid according to any of claims 10 to 19.
21. A variant of the STII secretion signal sequence characterized in that it has the nucleic acid sequence consisting of: 5 'TCTAGAGGTTGAGGTGATTTT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ATA GCS TAT GCM 3' (SEQ ID NO: 15); 5 'TCTAGAATT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 16); 5 'TCTAGAATT ATG AAG AAG AAT ATT GCG TTC CTA CTT GCC TCT ATG TTT GTC TTT TCT ATA GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 17); 5 'TCTAGAATT ATG AAG AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 18); 5 'TCTAGAATT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 19); 5 'TCTAGAATT ATG AAA AAA AAC ATT GCC TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 20); 5 'TCTAGAATT ATG AAG AAA AAC ATC GCT TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 21); 5 »TCTAGAATT ATG AAA AAG AAC ATA GCG TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3 '(SEQ ID NO: 22); Y 5 'TCTAGAGGTTGAGGTGATTTT ATG AAA AAA AAC ATC GCA TTT CTT CTT GCA TCT ATG TTC GTT TTT TCT ATT GCT ACA AAC GCG TAT GCM 3' (SEQ ID NO: 23).
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/398,615 US5840523A (en) | 1995-03-01 | 1995-03-01 | Methods and compositions for secretion of heterologous polypeptides |
US08398615 | 1995-03-01 | ||
PCT/US1996/002858 WO1996027016A1 (en) | 1995-03-01 | 1996-02-27 | Methods and compositions for secretion of heterologous polypeptides |
Publications (2)
Publication Number | Publication Date |
---|---|
MX9706607A MX9706607A (en) | 1997-11-29 |
MXPA97006607A true MXPA97006607A (en) | 1998-07-03 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU701476B2 (en) | Methods and compositions for secretion of heterologous polypeptides | |
US5747662A (en) | Methods and compositions for secretion of heterologous polypeptides | |
EP0205475B1 (en) | Recombinant methods for production of serine protease inhibitors and dna sequences useful for same | |
Nahlik et al. | Nucleotide sequence and transcriptional organization of the Escherichia coli enterobactin biosynthesis cistrons entB and entA | |
Bergès et al. | Combined effects of the signal sequence and the major chaperone proteins on the export of human cytokines in Escherichia coli | |
WO2002055717A2 (en) | Enhanced secretion of a polypeptide by a microorganism | |
EP0750634A1 (en) | Enhanced secretion of polypeptides | |
JP4669181B2 (en) | Signal sequence for producing Leu-hirudin by secretion by E. coli into the culture medium | |
Maclntyre et al. | Export incompatibility of N-terminal basic residues in a mature polypeptide of Eschericbia coli can be alleviated by optimising the signal peptide | |
Krebs et al. | Use of a Tn5 derivative that creates lacZ translational fusions to obtain a transposition mutant | |
US6242177B1 (en) | Methods and compositions for secretion of heterologous polypeptides | |
MXPA97006607A (en) | Methods and compositions for secretion of heterolo polypeptides | |
Kooi et al. | Mutational analysis of the C-terminal region of Saccharomyces cerevisiae ribosomal protein L25 in vitro and in vivo demonstrates the presence of two distinct functional elements | |
KR100963302B1 (en) | Recombinant Vector Containing ptsL Promoter and Method for Producing Exogeneous Proteins Using the Same | |
MOFFAT et al. | Functional domains in the Escherichia coli release factors: Activities of hybrids between RF‐1 and RF‐2 | |
Fujimoto et al. | Expression and secretion of human epidermal growth factor by Escherichia coli using enterotoxin signal sequences | |
Kiani et al. | Construction of recombinant plasmids for periplasmic expression of human growth hormone in Escherichia coli under T7 and lac promoters | |
Rockenbach et al. | Secretion of active truncated CD4 into Escherichia coli periplasm | |
EP1022339B1 (en) | Plasmids, their construction and their use in the manufacture of interleukin-4 and interleukin-4 muteins | |
Jappelli et al. | Interaction between cAMP-dependent protein kinase catalytic subunit and peptide inhibitors analyzed with λ repressor fusions | |
Hubl | Structure-function studies of the DNA binding domain of heat shock transcription factor | |
MXPA96004047A (en) | Increased secretion of polipepti |