CA2034385A1 - Vector to produce biologically important peptides - Google Patents
Vector to produce biologically important peptidesInfo
- Publication number
- CA2034385A1 CA2034385A1 CA002034385A CA2034385A CA2034385A1 CA 2034385 A1 CA2034385 A1 CA 2034385A1 CA 002034385 A CA002034385 A CA 002034385A CA 2034385 A CA2034385 A CA 2034385A CA 2034385 A1 CA2034385 A1 CA 2034385A1
- Authority
- CA
- Canada
- Prior art keywords
- coli
- construct
- parc
- vector
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 66
- 239000013598 vector Substances 0.000 title claims abstract description 36
- 102000004196 processed proteins & peptides Human genes 0.000 title abstract description 12
- 241000588724 Escherichia coli Species 0.000 claims abstract description 55
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 52
- 108091026890 Coding region Proteins 0.000 claims abstract description 23
- 230000028327 secretion Effects 0.000 claims abstract description 21
- 101150077543 st gene Proteins 0.000 claims abstract description 21
- 238000010276 construction Methods 0.000 claims abstract description 10
- 230000004927 fusion Effects 0.000 claims abstract description 7
- 238000000034 method Methods 0.000 claims description 33
- 102000004169 proteins and genes Human genes 0.000 claims description 32
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical group N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 claims description 28
- 108090001061 Insulin Proteins 0.000 claims description 24
- 102000004877 Insulin Human genes 0.000 claims description 23
- 230000008569 process Effects 0.000 claims description 21
- 108020004414 DNA Proteins 0.000 claims description 20
- 102400000344 Angiotensin-1 Human genes 0.000 claims description 17
- 101800000734 Angiotensin-1 Proteins 0.000 claims description 17
- ORWYRWWVDCYOMK-HBZPZAIKSA-N angiotensin I Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O)C(C)C)C1=CC=C(O)C=C1 ORWYRWWVDCYOMK-HBZPZAIKSA-N 0.000 claims description 17
- 229940125396 insulin Drugs 0.000 claims description 14
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 claims description 12
- 108020004705 Codon Proteins 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 230000014509 gene expression Effects 0.000 claims description 5
- 238000002360 preparation method Methods 0.000 claims description 3
- 239000007858 starting material Substances 0.000 claims description 3
- 238000011161 development Methods 0.000 claims description 2
- 239000013604 expression vector Substances 0.000 claims description 2
- 108700012359 toxins Proteins 0.000 abstract description 23
- 239000003053 toxin Substances 0.000 abstract description 20
- 231100000765 toxin Toxicity 0.000 abstract description 20
- 231100000350 mutagenesis Toxicity 0.000 abstract description 14
- 238000002703 mutagenesis Methods 0.000 abstract description 14
- 239000002773 nucleotide Substances 0.000 abstract description 13
- 125000003729 nucleotide group Chemical group 0.000 abstract description 13
- 238000000746 purification Methods 0.000 abstract description 11
- 238000000338 in vitro Methods 0.000 abstract description 8
- 239000013606 secretion vector Substances 0.000 abstract description 8
- 238000007429 general method Methods 0.000 abstract description 5
- 230000018448 secretion by cell Effects 0.000 abstract description 4
- 101710146739 Enterotoxin Proteins 0.000 abstract description 3
- 239000000147 enterotoxin Substances 0.000 abstract description 3
- 231100000655 enterotoxin Toxicity 0.000 abstract description 3
- 108020004511 Recombinant DNA Proteins 0.000 abstract description 2
- 238000005516 engineering process Methods 0.000 abstract description 2
- 239000013612 plasmid Substances 0.000 description 27
- 239000012634 fragment Substances 0.000 description 16
- 108010076504 Protein Sorting Signals Proteins 0.000 description 14
- 239000000047 product Substances 0.000 description 10
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 9
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- 238000002965 ELISA Methods 0.000 description 8
- 150000001413 amino acids Chemical group 0.000 description 8
- 210000004027 cell Anatomy 0.000 description 8
- 239000012228 culture supernatant Substances 0.000 description 8
- 231100000219 mutagenic Toxicity 0.000 description 8
- 230000003505 mutagenic effect Effects 0.000 description 8
- 101000976075 Homo sapiens Insulin Proteins 0.000 description 7
- PBGKTOXHQIOBKM-FHFVDXKLSA-N insulin (human) Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 PBGKTOXHQIOBKM-FHFVDXKLSA-N 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000000644 propagated effect Effects 0.000 description 5
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 102000002265 Human Growth Hormone Human genes 0.000 description 4
- 108010000521 Human Growth Hormone Proteins 0.000 description 4
- 239000000854 Human Growth Hormone Substances 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- CUKWUWBLQQDQAC-VEQWQPCFSA-N (3s)-3-amino-4-[[(2s)-1-[[(2s)-1-[[(2s)-1-[[(2s,3s)-1-[[(2s)-1-[(2s)-2-[[(1s)-1-carboxyethyl]carbamoyl]pyrrolidin-1-yl]-3-(1h-imidazol-5-yl)-1-oxopropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-3-(4-hydroxyphenyl)-1-oxopropan-2-yl]amino]-3-methyl-1-ox Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O)C(C)C)C1=CC=C(O)C=C1 CUKWUWBLQQDQAC-VEQWQPCFSA-N 0.000 description 2
- 102400000345 Angiotensin-2 Human genes 0.000 description 2
- 101800000733 Angiotensin-2 Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 2
- 229950006323 angiotensin ii Drugs 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- -1 (bFGF) Proteins 0.000 description 1
- UUUHXMGGBIUAPW-UHFFFAOYSA-N 1-[1-[2-[[5-amino-2-[[1-[5-(diaminomethylideneamino)-2-[[1-[3-(1h-indol-3-yl)-2-[(5-oxopyrrolidine-2-carbonyl)amino]propanoyl]pyrrolidine-2-carbonyl]amino]pentanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-3-methylpentanoyl]pyrrolidine-2-carbon Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(C(C)CC)NC(=O)C(CCC(N)=O)NC(=O)C1CCCN1C(=O)C(CCCN=C(N)N)NC(=O)C1CCCN1C(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)C1CCC(=O)N1 UUUHXMGGBIUAPW-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 102000004881 Angiotensinogen Human genes 0.000 description 1
- 108090001067 Angiotensinogen Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 101100123358 Caenorhabditis elegans his-54 gene Proteins 0.000 description 1
- 101100129088 Caenorhabditis elegans lys-2 gene Proteins 0.000 description 1
- 101100455752 Caenorhabditis elegans lys-3 gene Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000518994 Conta Species 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 1
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 1
- 101150111020 GLUL gene Proteins 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101150017040 I gene Proteins 0.000 description 1
- 238000012218 Kunkel's method Methods 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 231100000757 Microbial toxin Toxicity 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 102000004270 Peptidyl-Dipeptidase A Human genes 0.000 description 1
- 108090000882 Peptidyl-Dipeptidase A Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108090000783 Renin Proteins 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 102000005157 Somatostatin Human genes 0.000 description 1
- 108010056088 Somatostatin Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 229920001429 chelating resin Polymers 0.000 description 1
- RKLXDNHNLPUQRB-TVJUEJKUSA-N chembl564271 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]2C(C)SC[C@H](N[C@@H](CC(N)=O)C(=O)NC(=O)[C@@H](NC2=O)CSC1C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NC(=C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NC(=O)C(=C\C)/NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]2NC(=O)CNC(=O)[C@@H]3CCCN3C(=O)[C@@H](NC(=O)[C@H]3N[C@@H](CC(C)C)C(=O)NC(=O)C(=C)NC(=O)CC[C@H](NC(=O)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC=4C5=CC=CC=C5NC=4)CSC3)C(O)=O)C(C)SC2)C(C)C)C(C)SC1)C1=CC=CC=C1 RKLXDNHNLPUQRB-TVJUEJKUSA-N 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000000688 enterotoxigenic effect Effects 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 229940126864 fibroblast growth factor Drugs 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 1
- 229960000553 somatostatin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 108010082567 subtilin Proteins 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 239000005526 vasoconstrictor agent Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/14—Angiotensins: Related peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Vascular Medicine (AREA)
- Physics & Mathematics (AREA)
- Diabetes (AREA)
- Plant Pathology (AREA)
- Endocrinology (AREA)
- Microbiology (AREA)
- Toxicology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Abstract In this patent application we have described the construction of a novel secretion vector based on E.coli enterotoxin coding sequence. We have shown categorically that pre and pro region of toxin gene are absolutely necessary for extra cellular secretion of the stable toxin. We have also shown with specific examples that when the nucleotide coding sequence of a heterologous peptide is fused in frame to the end of the pro region in the st gene, the resultant vector in an E.coli host secretes extracellularly correctly processed heterologous peptide.
This application also includes construction of suitable vectors where this gene fusion can be achieved. General methods to create such fusions involving a) recombinant DNA technology and b) the use of site directed in vitro mutagenesis, have also been described. A general method of purification of heterologous peptides is also described in this application. This novel vector system can be used for hyperproduction and extracellular secretion of peptides of biological importance.
This application also includes construction of suitable vectors where this gene fusion can be achieved. General methods to create such fusions involving a) recombinant DNA technology and b) the use of site directed in vitro mutagenesis, have also been described. A general method of purification of heterologous peptides is also described in this application. This novel vector system can be used for hyperproduction and extracellular secretion of peptides of biological importance.
Description
Field of the invention: 2~343~5 In this patent application we have described the construction of a novel secretion vector based on E.coli enterotoxin codin~ sequence. We have shown categorically that pre and pro region of toxin gene are absolutely necessary for extra cellular secretion of the stable toxin. We have also shown with specific examples that when the nucleotide coding sequence of a he~erologous peptide is fused in frame to the end of the pro region in the st gene, the resultant vector in an E.coli host secretes extracellularly correctly processed heterologous peptide.
This application also includes construction of suitable vectors where this gene fusion can be achieved. General methods to create such fusions invoiving a) recombinant DNA technology and b) the use of site directed in vitro mutagenesis, have also been described. A general method of purification of heterologous peptides is also described in this application. This novel vector system can be used for hyperproduction and extracellular secretion of peptides of biological importance.
,, " , .
This application also includes construction of suitable vectors where this gene fusion can be achieved. General methods to create such fusions invoiving a) recombinant DNA technology and b) the use of site directed in vitro mutagenesis, have also been described. A general method of purification of heterologous peptides is also described in this application. This novel vector system can be used for hyperproduction and extracellular secretion of peptides of biological importance.
,, " , .
2 ,'~03~1385 Background of the invention ~nd prior art:
Secretory as well as many membrane proteins are initially synthesized as nascent intracellular preproteins with a signal peptide attached to the N-terminus. The signal peptide enables the protein to cross the inner membrane ~arrier. In this process the protein gets cleaved and released as a mature protein which normally resides in the periplasmic space. Some exceptions to this mechanism are certain membrane proteins whose signal peptides remain uncleaved (S.Letenhardt, et al. in Protein Engineering, Application in Science, Nedicine and Industry edt. by M, Inouye and R. Sharma, 1986 Academic Press, Inc., 157-171).
A few prokaryotic proteins or peptides are synthesized as large precursors possess~ng signal peptides (pre-region) as well as pro-regions. ~oth segments are cleaved to yield mature proteins or peptides. Examples include subtilin of Bacillus subtilis (C. Nishio, et al, 1983, Biochem.Biophys.Res.Commun. 116, 751-758) or stable toxin of Escherichia coli (P.Dwarakanath, et al, 1989, Gene 81, 219-226). Enterotoxigenic E.coli (ETEC) strains cause diarrhoea in humans by the elaboration of extra cellular toxins classified as heat labile (LT) and heat stable (ST) family of toxins (R.N.~reenberg and R.L. Guerrant (1981).
Pharmacol.Ther.II, 5~7-5~7; M.D. Gill and M.Woolkalis 1985 in Microbial toxins and diarrhoeal diseses, Ciba Foundation symposium 112, Pitman, London, 57-73). ST
toxins are of two types: methanol soluble toxins (STI) and methanol insoluble toxins (STII). STI is further classified into 3 groups depending upon the origin i.e.
STh (human), STp (porcir.e) and STb (bovine). The genes for both the LT and ST are pl~simid encoded. The nucleotide sequence of the st gene is shown in Fig. 1. (P.Dwarakanath et al, 1989, Gene 81, 219-226). From the nucleotide sequence as well as from ~he translated amino acid sequence it is concluded that the 72 amino acid peptide is a precursor of ST which is processed post translationally to release 19 amino acid n~ptide from the carboxy terminal ..
. ". . , . ;. . : .
- ... . ,- .. . ......... .
. . .. . .
.
Secretory as well as many membrane proteins are initially synthesized as nascent intracellular preproteins with a signal peptide attached to the N-terminus. The signal peptide enables the protein to cross the inner membrane ~arrier. In this process the protein gets cleaved and released as a mature protein which normally resides in the periplasmic space. Some exceptions to this mechanism are certain membrane proteins whose signal peptides remain uncleaved (S.Letenhardt, et al. in Protein Engineering, Application in Science, Nedicine and Industry edt. by M, Inouye and R. Sharma, 1986 Academic Press, Inc., 157-171).
A few prokaryotic proteins or peptides are synthesized as large precursors possess~ng signal peptides (pre-region) as well as pro-regions. ~oth segments are cleaved to yield mature proteins or peptides. Examples include subtilin of Bacillus subtilis (C. Nishio, et al, 1983, Biochem.Biophys.Res.Commun. 116, 751-758) or stable toxin of Escherichia coli (P.Dwarakanath, et al, 1989, Gene 81, 219-226). Enterotoxigenic E.coli (ETEC) strains cause diarrhoea in humans by the elaboration of extra cellular toxins classified as heat labile (LT) and heat stable (ST) family of toxins (R.N.~reenberg and R.L. Guerrant (1981).
Pharmacol.Ther.II, 5~7-5~7; M.D. Gill and M.Woolkalis 1985 in Microbial toxins and diarrhoeal diseses, Ciba Foundation symposium 112, Pitman, London, 57-73). ST
toxins are of two types: methanol soluble toxins (STI) and methanol insoluble toxins (STII). STI is further classified into 3 groups depending upon the origin i.e.
STh (human), STp (porcir.e) and STb (bovine). The genes for both the LT and ST are pl~simid encoded. The nucleotide sequence of the st gene is shown in Fig. 1. (P.Dwarakanath et al, 1989, Gene 81, 219-226). From the nucleotide sequence as well as from ~he translated amino acid sequence it is concluded that the 72 amino acid peptide is a precursor of ST which is processed post translationally to release 19 amino acid n~ptide from the carboxy terminal ..
. ". . , . ;. . : .
- ... . ,- .. . ......... .
. . .. . .
.
3 ~03~385 as the biologically active toxin. (P.Dwarakanath et al, 1989, Gene, 81, 219-226).
Attempts have been made by many groups to utilize the "signal sequence" portion of naturally occuring secretory proteins to construct recombinant vectors that will secrete heterologous proteins. In many cases synthetic signal sequences are also used. In all these cases the recombinant products are localized within the subcellular compartmen~s. Specific examples are: A patent by Gray et al (US patent 4,755, 465 July 5, 1988) where the inventors have claimed that they have constructed a vector which promotes the secretion of correctly processed human growth hormone (hGH) in E.coli and Pseudomonas. The "signal sequence" which also comes under the patent claim is very diffe~ent from E.coli stable toxin "signal sequence".
II. A ~econd patent (US patent No. 4, 704, 362 dated Nov 3, 1987) by Itakura _ al. describes a recor~binant cloning vehicle for microbial polypeptide expression where a fusion product of ~-gal and somatostatin is produced and then pro~essed in vitro to get the final product.
An example where specifically an enterotoxin signal sequence is employed, is by Gray et al. (US patent 4, 680, 262 lQ85~ where the inventors linked methanol insoluble stable toxin (STII) "signal sequence" with human growth hormone (hGH) gene and localized the product in the periplasmic region of the host cell. It is interesting to note that inventors were specifically looking for an expressiGri vehicle which localized the expressed recombinant protein intracellularly. The st II signal sequence bea.s no similarity to the st I signal sequence and therefore it is considered a different structure.
In this patent application the inventors have taken advantage of ~oth the pre and the pro region of STI to create a recombinant vehicle in which the nucleotide ,, ~ .
,. : , , 4 203~ 3s sequence coding for a peptide is fused inframe at the terminus of the pro region. Expression of the whole gene, resulted in the secretion of the recombinant peptide extracellularly and it was correctly processed. A
schematic diagram of the principle is shown below:
Signal seq. ¦ ¦ Mature Pre region ¦ Pro region ¦ ST peptide Vector - Signal seq. ¦ ¦ ¦ Mature Pre region ¦ Pro region ¦ heterologous peptide Vector The advantage of this system is that the purfication of the recombinant product becomes much simpler as the cells harboring such a recombinant vector can be grown in a synthetic medium and therefore the secreted peptide constitutes the major peptide present in the culture supernatant. A general method of purification of such a recombinant heterologous peptide (AngiotensinI) is decribed here.
SummarY:
Utilizing the pre and the pro region of E.coli st I gene of the human variety a novel recombinant vector has been constructed which can properly process and extracellularly secrete any peptide. This process is achieved by fusing a ~; coding sequence to the terminus of the pro region. A
generalized method of purification of such a recombinant i peptide, is described in this application.
The present invention is summarized as follows in the - following clauses:
.
, ::: ~:
, ;:
1. The use of E.coli st pre-pro sequence coding for (single letter code) M K K S I L M I F L S V L S F S
P F A Q D A K P V E S S K E K I T K E S K K C N I A K
K S N K S G P E S M in the development of a vector for secretion of heterologous proteins.
2. The vector pARC 0101, deposit no. NCIB 40115.
3. An E. coli host containing the vector pARC 0101 according to clause 2.
4. The use of pARC 0101 according to clause 2 in the construction of an expression vector for heterologous proteins.
Attempts have been made by many groups to utilize the "signal sequence" portion of naturally occuring secretory proteins to construct recombinant vectors that will secrete heterologous proteins. In many cases synthetic signal sequences are also used. In all these cases the recombinant products are localized within the subcellular compartmen~s. Specific examples are: A patent by Gray et al (US patent 4,755, 465 July 5, 1988) where the inventors have claimed that they have constructed a vector which promotes the secretion of correctly processed human growth hormone (hGH) in E.coli and Pseudomonas. The "signal sequence" which also comes under the patent claim is very diffe~ent from E.coli stable toxin "signal sequence".
II. A ~econd patent (US patent No. 4, 704, 362 dated Nov 3, 1987) by Itakura _ al. describes a recor~binant cloning vehicle for microbial polypeptide expression where a fusion product of ~-gal and somatostatin is produced and then pro~essed in vitro to get the final product.
An example where specifically an enterotoxin signal sequence is employed, is by Gray et al. (US patent 4, 680, 262 lQ85~ where the inventors linked methanol insoluble stable toxin (STII) "signal sequence" with human growth hormone (hGH) gene and localized the product in the periplasmic region of the host cell. It is interesting to note that inventors were specifically looking for an expressiGri vehicle which localized the expressed recombinant protein intracellularly. The st II signal sequence bea.s no similarity to the st I signal sequence and therefore it is considered a different structure.
In this patent application the inventors have taken advantage of ~oth the pre and the pro region of STI to create a recombinant vehicle in which the nucleotide ,, ~ .
,. : , , 4 203~ 3s sequence coding for a peptide is fused inframe at the terminus of the pro region. Expression of the whole gene, resulted in the secretion of the recombinant peptide extracellularly and it was correctly processed. A
schematic diagram of the principle is shown below:
Signal seq. ¦ ¦ Mature Pre region ¦ Pro region ¦ ST peptide Vector - Signal seq. ¦ ¦ ¦ Mature Pre region ¦ Pro region ¦ heterologous peptide Vector The advantage of this system is that the purfication of the recombinant product becomes much simpler as the cells harboring such a recombinant vector can be grown in a synthetic medium and therefore the secreted peptide constitutes the major peptide present in the culture supernatant. A general method of purification of such a recombinant heterologous peptide (AngiotensinI) is decribed here.
SummarY:
Utilizing the pre and the pro region of E.coli st I gene of the human variety a novel recombinant vector has been constructed which can properly process and extracellularly secrete any peptide. This process is achieved by fusing a ~; coding sequence to the terminus of the pro region. A
generalized method of purification of such a recombinant i peptide, is described in this application.
The present invention is summarized as follows in the - following clauses:
.
, ::: ~:
, ;:
1. The use of E.coli st pre-pro sequence coding for (single letter code) M K K S I L M I F L S V L S F S
P F A Q D A K P V E S S K E K I T K E S K K C N I A K
K S N K S G P E S M in the development of a vector for secretion of heterologous proteins.
2. The vector pARC 0101, deposit no. NCIB 40115.
3. An E. coli host containing the vector pARC 0101 according to clause 2.
4. The use of pARC 0101 according to clause 2 in the construction of an expression vector for heterologous proteins.
5. A vector containing the E. coli st pre-pro sequence as defined in clause 1.
6. A vector according to clause 5, containing the Bam HI-Hind III sequence of the E. coli st pre-pro sequence as defined in clause 1.
7. A E. coli host containing a vector as defined in any of clauses 5 and 6. ;;
8. A process for the production of heterologous proteins in E. coli, by growing, by standard methods, an E.
coli host as defined in clause 7 and isolating, by standard methods, the desired protein product.
coli host as defined in clause 7 and isolating, by standard methods, the desired protein product.
9. A process according to clause 8 wherein the protein product is secreted extra-cellularly.
10. The construct pARC 0726 containing the E. coli st ; 35 pre-pro sequence fused with the Angiotensin I coding sequence in frame.
11. The use of the construct pARC 0726 according to , : -,: :
clause 11 in the expression of Angiotensin I~
clause 11 in the expression of Angiotensin I~
12. An E. coli host containing the construct pARC 0726 according to clause 10.
13. A process for the production of Angiotensin I by growing an E. coli host according to claim 12 and isolating the desired protein product.
14. A process according to clause 13 wherein the protein product is secreted extra-cellularly.
15. The construct pARC 0801 containing an internal Eco RI
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST
peptide.
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST
peptide.
16. The use of a first construct pARC 0801 according to clause 15 as a starting material in the preparation of a second construct by fusion of a heterologous or . homologous gene sequence at the Eco RI site of the 1 pARC 0801 construct.
; 17. A vector, capable of facilitating the secretion of heterologous protein expressed in a host cell, said vector including DNA encoding the E. Coli st pre-pro sequence M K K S I L M I F L S V L S F S P F A Q D A
,` K P V E S S K E K I T K E S K K C N I A K K S N K S
G P E S M.
18. A construct comprising a vector according to clause ~:.
,~
,~ . - . . , ::
.. . ..
- :
- 6a - 23940-679 1 fused in reading frame to DNA encoding a desired protein.
19. A construct according to clause 18 wherein the DNA
encodes Angiotensin I.
20. A construct according to clause 18 wherein the DNA
encodes insulin A chain.
21. A construct according to clause 18 wherein the DNA
encodes insulin B chain.
22. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain A coding sequence in frame.
23. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain B
coding sequence in frame.
24. A process according to clause 8 or 9 wherein Angio-tensin I is isolated.
:, 25. A process according to clause 8 or 9 wherein insulin chain A is isolated.
26. A process according to clause 8 or 9 wherein insulin chain B is isolated.
27. A process for facilitating the secretion from the host cell of a protein expressed in a host cell which - . ~ -:
. ~- . : : .
20;~385 - 6b - 23940-679 comprises fusing in reading frame DNA encoding the protein to be expressed to an E. coli st pre-pro sequence as defined in clause 17 or 18.
28. The construct pARC 0801 containing the complete E.
coli _ gene sequence and containing an internal Eco RI
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST
peptide.
29. A process for forming a construct which comprises fusing a desired sequence at the Eco RI site of the pARC 0801 construct.
Detailed description of the invention:
Identification and cloning of the human variety of E. coli stI gene was described in detail by P. Dwarakanath et al (1989, Gene, 81, 219-226). In brief, a plasmid of ca. 100 MDa was identified in E. coli strain 86 cal which contained the st gene. A BamHI library of the 100 MDa plasmid of E. Coli 86 cal was constructed in pBR 322 and a st gene containing clone was identified by DNA probe hybridization. This st gene was further subcloned in M13mpl9 as a BamHI - HindIII fragment and the complete 0 fragment was sequenced by Sanger's method.
A part of the sequence containing the open reading frame (ORF) of the st gene is shown in Figure 1. The carboxy : `
X03't385 : 7 terminal 19 aa of the ORF coded peptide corresponds to the sequence of the STh peptide (Aimoto et al 1982 Eur.J.Biochem. 129, 257-263). Four nucleotides preceding the start codon, a sequence suggesting a putative ribosome 5 binding site can be identified. The coding sequence is followed by a pair of stop codon (TAA) and a 15 nucleotide stretch with dyad symmetry presumably indicating a transcription termination signal. The ORF codes for a 72 amino acid peptide in which the 19 amino acid carboxy 10 terminal is the biologically active peptide. The 19 or 20 amino acid stretch at the N-terminus constitutes the signal peptide, having two basic residues [Lys2, Lys3~
following the initiator methionine, a hydropho~ic stretch of amino acids and a consensus sequence for signal 15 cleavage junction. The exact cleavage junction of the signal peptide from the pro-ST region is not known. The pro ST region spans upto Met 53 where it is finally cleaved to yield a biologically active peptide secreted extracellularly with Asn54 as the N-terminal of the 20 mature peptide. r Estimation of the STh concentration in culture supernatants was obtained using the competitive ELISA
technique. This ELISA was routinely used throughout this investigation to estimate the level of STh in culture supernatant. Hyperexpression of the st gene was achieved by subcloning the gene fragment in T7 promoter containing vector. In such a hyperexpression system the induction of hyperexpression was achieved by the addition of the inducer (isopropylthiogalactoside) IPTG. Purification ~; scheme for the ST peptide (wild type/mutant) was accordirig to the method described by P.Dwarakanath et al. (1989;
Gene 81 219-226). Amino acid sequence analysis of the peptide was done by an Applied Biosystem Amino Acid Sequence Analyser (Model 477A). The in vitro site direct~d mutagenesis method was used to generate mutants in the pro region as well as in the mature part of the toxin peptide coding sequence following standard methods (Das et al .
: , .
203~13~35 1989, Proc.Natl.Acad.Sci. U.S.A. 86, 496-499). Similarly, by in vitro site directed mutagenesis method the complete mature ST peptide coding region was replaced by Angiotensin I (Ang I) coding sequence and secretion of the Ang I was monitored by radioimmuno assay (RIA). The results are described under Experimental Data Section.
Esperimental Data I. Construction of secretion vectors:
STh was detected in the culture supernatant of an ETEC E.coli isolate 86 cal which was shown to harbor a plasmid of 100 MDa. A BamHI library of the 100 MDa plasmid of E.coli 86 cal was constructed in pBR
322 and the recombinant clones were checked for st gene. One of the recombinant clones wa~ identified as carrying the st gene in a 1.9 Kb BamHI fragment (pARC
074). A 1.1 Kb BamHI - HindIII fragment obtained from pARC 074 was cloned in pBR 32~ to obtain plasmid pARC 0101. The plasmid pARC 0101 is the ~tarting material for all further experiment~. This plasmid is deposited in National Collection of Industrial Bacteria, Aberdeen, Scotland (strain NCIB 40115 deposi-ted under the Budapest Treaty on February 15, 1989).
Culture supernatants of E.coli B lC1 harboring pARC
0101 elicit a positive biological resp~nse in suckling mice for ST.
II. Construction of M13mpl9ss DNA ~hage containinq st insert.
A 1.1 Kb BamHI - HindIII fragment conta~ning st gene was isolated from pARC0101. This was s~~bcloned in a M13mpl9RF digested with BamHI and H~n~lII. The recombinant replicative form (RF) was transformed in JM101 and plated in presence of X-Gal and IPTG. The transformant white plaques were screer.ed for st gene .. . . .
- ~ ., ~;
: ~ .
.
.
203~38S
g and one such phage clone was identified as 0192 (Fig.
2) which when propagated in JMlOl releases stable toxin in the culture supernatant. 0192 is the starting material for all the in vitro mutagenesis S experiments described.
III. Construction of the ny~rexpression vector for hyperexpression of the wild type or mutant STh 0192 or its mutants werP propagated in JMlOl and the RF was isolated following an overnight growth of the bacteria following a standard protocol of plasmid purification [T. Maniatis et al. 1982 Molecular i-Cloning. A Laboratory Manual. Cold Spring Harbour Laboratory, Code Spring Harbour, NY]. A BamHI-HindIII
fragment of 1.1 Kb was isolated and subcloned in a T7 promoter containing vector as a BamHI-HindIII
fragment such that the promoter was oriented in the same direction as th~t of the transcription of the st gene. The hyperexpression plasmid containing the wild type st gene was designated as pARC 0601 (Fig. 3).
IV. Exampleisi to demonstrate the essent_alit~ of the pro region for extra cellular secretion.
To demonstrate the essentiality of the pro region for extracellular secretion of stable toxin, two mutants of the st gene were made where the mutation was located in the pro region. In the first example a mutation was made at the processing site where the Met53 was altered to Ile 53 using ~192 grown in CJ236 (dut- ung-] strain as the template. The mutagenic primer used for creating this alteration had the sequence 5' C C T G A A A G C A T T A A T A G
T A G C 3' (ATC--->ATT3 This primer (GD21) was annealed to the template and extended in the presence ;~Q3~3~3S
of Sequenase and the four dN$Ps. The extended chain was ligated by T4 DNA ligase and this in vitro synthesized double stranded DNA was used to transform JM101. The transformant plaques were screened by DNA
sequencing and the mutant clones were identified. One such mutant clone (0GD21) was plaque purified and subcloned into the hyperexpression vector by the method as outlined in experimental data Section III.
The resulting plasmid pARC 0701 was used to transform E.coli HB1(~1~ Production of extra cellular ST
(M53->I53~ in an overnight culture was compared with that of pARC 0601 grown in E.coli HB101. The result is shown below:
________________________________________________ Plasmid in HB101 ST production in ug/ml _____ __________________________________________ pARC0501 7 pARC0701 _______~________________________________________ This experiment showed that a conservative change (Met53 ->Ile53) in the pro region reduced the level of secretion of stable toxin by more than 85%.
., The attenuation of STh secretion was more evident when a deletion mutant of st was constructed where the deletion spanned from Ser48 to Ser52 in the pro region~ ~xperimental protocol was the same as that described ln the previous section except for the ; mutagenic primer used which had the sequence 5' G C A
A A A A A A A G T A A T A A A A T G A A T A G T A G C
A A T T A C 3 ' ( GD-ll ~ . Hyperexpression plasmid containing 'his mutant st (Del. Ser 48- SerS2) was designated pARC 0702. When this plasmid was propagated in HB101, the resultant clone did not produce ar.y detectable extracellular STh. These examples (Met53 ->Ile53 and Del. Ser48 -> Ser52) - therefore clearly showed that the presence of intact pro regio-, was necessary for extra cellular STh . . . .
20343~35 secretion.
V. Exam~les demonstrating non-specificit~ of the N-terminus residue of the mature pePtide for correct processinq and extra cellular secretion of the ST
h peptide.
To find the effect of the N-terminal residue of the ST peptide on the processing and extracellular secretion of the peptide two mutants of the st were made. In the first example, Asn54 of the STh was .
changed to Gln54 exactly following the method as described section III of Experimental Data. The mutagenic primer had the sequence of 5' G A A A G C A
T G C A G A G T A G C A A T 3'(ATT-->CAG). The hyper-expression plasmid containing the mutant st Asn54 ->
Gln54 was designated pARC 0732. This plasmid when propagated in E.coli B101 or in the hyperexpression strain BL21-DE3 produced equivalent amount of extracellular toxin compared to that produced by strains HB101 or BL21-DE3 harboring pARC 0601. In order to determine the processing site of the extracellularly secreted mutant ST (Asn54-> GlnS~), the mutant gene was overexpressed and the peptide was purified essentially following the method described by P. Dwarakanath et al (1989, Gene 81, 219-226). The HPLC purification profile is shown in Fig.4. The N-terminal sequence of this peptide revealed a se~uence Gln - Ser - Ser - Asn - Tyr.
Another similar experiment was conducted in which Asn54 of STh was altered to His54. The resulting plasmid containing the mutant st was designated pARC 0716. Following overexpression of the st AsnS4->Ile54 and purification of peptide, the N-terminal sequence of the peptide was determined. The sequence data showed presence of two peptides with N-'erminal sequences ~1) His-Ser-Ser-Asn-Tyr ...and (2) ' ~0~3~3S
Ser-Ser-Asn-Tyr.... A HPLC purification profile of this peptide is shown in Fiy. 5.
These examples clearly demonstrate that the N-terminus of the STh peptide (Asn54) is not crucial for post translational processing and secretion of the peptide.
V~. Example to demonstrate generality of the stable toxin based secretion vector To test the generality of the st ~ased secretion vector, the coding region corresponding to mature ST
peptide (ie. Asn54 to Tyr72) was replaced in the st gene by Angiotensin I coding sequence. Angiotensin I
is a decamer peptide which is converted from a larger precursor Angiotensinogen by Renin protease.
Angiotensin I is further modified to yield Angiotensin II by 'Angiotensin converting enzyme' which deletes the last two carboxyl terminal residues (His-Leu) of Angiotensin I, Angiotensin II is known to be a potential vaso constrictor. The substitution of AngI coding sequence in place of ST coding sequence in the st gene was accomplished by Kunkel's method (T.A.Kunkel, 1985 Proc.Natl. Acad.Sci. USA 82, 48~-492) with some modifications. A mutagenic primer ( GD9 ) was synthesized having the sequence 5' G T G G
T C C T G A A A G C A T G G A C C G G G T G T A C A T
A C A C C C C T T C C A C C T C T T A A T A A T A T A
A A G G G 3'. This 62 mer primer was annealed to ss 0192 DNA template grown in CJ 236. The temperature for annealing reaction was 55 C. The amount of template used was 3ug per reaction while the primer used was lOng. Following annealing, the extension reaction was carried out at 37C for 4 hrs in the presence of 8 units of Sequenase and four dNTPs (final conc. of each dNTP was lmM). The extension mix also contained 4 units of T4 DNA ligase for the ligation of the extended primer to take place. This complex was used to transform E.coli JM101 and the putative clones were identified by DNA sequencing.
The Ang I coding sequence linked st was subcloned into the hyperexpression plasmid and the plasmid was designated as pARC 0726. This plasmid when propagated into B 101 or hyperexpression strain BL21-DE3, the plasmid bearing strains produced AngI peptide extra ~--cellularly as detected by RIA. Following hyperexpression of pARC 0726 in BL21-DE3 in M9 medium the peptide was purified for N-terminal sequencing.
The purification scheme is described below. We believe this purification scheme can be applied generally for other peptides.
BL21-DE3 cells harboring pARC 0726 were grown in M9 medium (250 ml x 4). When the culture reached an A600nm = 0.86, the cells were induced by adding IPTG (final conc. 0.5 mm). After 2.5 hrs following induction, cells were harvested and 470 ml of culture supernatant was mixed with 30 g of Amberlite XAD-4 and allowed to stand at room temperature overnight. After washing the resin thoroughly with water, the bound peptide was eluted with 99% Ethanol/1% acetic acid followed by 80%
Ethanol/1% acetic acid. The eluate was concentrated by flash evaporation and the concentrate was loaded on SP
Sephadex C-25 column (10 ml) previously equilibrated with 20 mM phosphate buffer, pH 6.4. Prior to elution, the column was rinsed with water. Elution was carried out with 50 mM triethylamine. The pH of the eluate was brought to 6.0 with acetic acid. The eluate was lyophilised and the dry powder was reconstituted in 1 ml water. The preparation was subject to HPLC on RP-8 column using an acetonitrile gradient [Solvent A:0.1%TFA, Solvent ~:0.1% TFA/95% Acetonitrile: Flow rate 1 ml/min. Gradient was 10-50%B in 40 mins]. The An~ I peak was detected at 29% B (Fig.6). The purified peptide was sequenced and the sequence was con~irmed .
.: :
.
. .
Z03~3~3S
with the native sequence.
This example clearly demonstrates that a completely heterologous peptide coding sequence can be linked with st pro region and the correctly processed peptide S can be detected extracellularly following propagation of such construct in a suitable E.coli host.
VII. Construction of a general secretion vector_with suitable cloning site.
Introduction of a suitable cloning site in the mature peptide coding region can be accomplished as follows:
To introduce a restriction site within or at very close proximity to the mature peptide coding region, the DNA
stretch 5' A T G A A T A G T 3' representing the amino acid sequence from Met53 to Ser55 is chosen. By altering the nucleotide sequence of Ser55 codon ~AGT~
to the alternate codon TCT, an EcoRI site could be generated without altering the amino acid residue. A
schematic diagram is shown below:
Met53 Asn54 SerS5 A T G A A T A G T
A T GA A T T C T
.
; EcoRI site The single stranded (ss) 0192 DNA template is mutagenized by the method described earlier. The mutagenic primer (GD13) had a nucleotide sequence, 5'A
i G C A T G A A T T C T A G C A A T T A C 3'. A general hyperexpression vector system can be constucted utilizing this EcoRI site as the suitable site for insertion o~ heterologous protein and peptide coding sequence. The hyperexpression plasmid pET7 is digested with EcoRI and BamHI and the large fragment isolated.
0192 (GD13) RF is isolated and digested with BamHI and . - ^ -. ~ : . .; , .:
. . ~ .
EcoRI. Following CIP (calf intestinal phosphatase) treatment of the digestion product, a 620 bp fragment is isolated.
This fragment is ligated with the large fragment of pET7. The resultant recombinant plasmid is named pARC 0801.
(Figure 7) This plasmid can be used as a general secretion vector. Any heterologous peptide or protein coding sequence can be inserted at the EcoRI site (insertion site), for example, peptides such as Angiotensin I, bovine fibroblast growth factor, (bFGF), insulin, and others.
Expression of this plasmid in a E. coli host, results in the secretion of the peptide or protein into the medium which can be purified. It should be noted that peptide produced from this recombinant plasmid will have one additional amino acid residue at its amino terminal (Ser).
The possibility to produce Angiotensin I and, especially, insulin, represent an important aspect of the present invention. Purified insulin A chain and insulin B chain can be used to produce insulin.
VII. EXAMPLES TO DEMONSTRATE SECRETION OF INSULIN A AND B CHAINS
FROM THE STABLE TOXIN BASED SECRETION VECTOR.
The present example demonstrates the generality of the stable toxin based secretion vector. Human insulin consists of two polypeptide chains A (21 amino acid residues) and B (30 amino acid residues) which are linked through SH bonds. These chains can be separated in vitro by reduction and pure forms of A and B chains can be obtained. Under appropriate conditions ' : ' '~ -, : : ~ : ' ~ ' ,"' ' ,- .
. ., ~ .
- 15a - 23940-679 purified A and B chains can be reoxidised to form immunoreactive and biologically active human insulin. The gene sequences of human insulin A and B chains were fused separately at the end of the ST pre-pro sequence. The recombinant plasmid harboured in actively growing E. coli hosts, secreted into the culture medium substances which were immunoreactive against human insulin anti-body. Incubation of the human insulin antibody with authentic insulin A and B chains inhibited in a competitive manner in res-pective cases where recombinant A and B chains were expected to be produced, indicating that the expected product was produced in each case. The recombinant plasmids were constructed using the strategy similar to that described in Experimental Data VI. A Ml3 mp l9 based recombinant phage DNA (0GD24) was used as the tem-plate for the first round of mutagenesis. 0GD24 had a ST gene inserted as a BamHI-HindIII fragment in M 13 mp l9 multiple cloning site. This particular ST insert had a missense mutation at the N-terminal residue of the mature toxin peptide replacing the Asn 54 by Gly 54. The mutagenic primer used for introducing the first round of mutagenesis had the r.ucleotide sequence 5' C C T G A A A G C A T G G G T A T C G T G G A G C A G T G C
T G T A C A T C T A T C T G C T C A C T G T A T T A A T A A T
A T A A A 3' This mutagenesis yielded the phage 0GD26 which had the DN~ sequence corresponding to the N-terminal 1-14 residues of insulin A chain fused in frame with the ST pre-pro region.
0 GD26 DNA was used as the template for the second round of .
.. - .. : , ~- ... :
:: :::
.
~: .. : .: ! ' : ~
203~385 - 15b - 23940-679 ~,.
mutagenesis. The mutagenic primer had the nucleotide sequence 5' T G C T C A C T G T A T C A G C T A G A G A A C T A C
T G C A A C T A A T A A T A T A A A 3' Following mutagenesis, 0GD28 was identified which had the complete A chain gene fused at the C terminal section of the ST pre-pro region. In summary, with 2 rounds of mutagenesis the complete nucleotide sequence of insulin A chain gene was fused in frame at the 3' end of ST pre-pro sequence and thereby com-pletely replacing the mature toxin gene sequence. However, rest of the 3' end sequence of the ST gene was retained intact in the ; final construct. The recombinant phage 0GD28 was grown in JMlOl and the RF was isolated. The RF was digested with BamHI-HindIII
and the 1.1 Kb fragment was cloned into pET7 as a BamHI-HindIII
fragment to yield pARC0750. The hyperexpression E. coli strain BL21-DE3 was transformed with pARC0750. In a similar manner insulin B chain was also synthesised and fused at the 3' end of the ST pre-pro region. The initial template used was 0GD23 which had a ST insert N54F. The nucleotide sequence of the first muta-genesis primer was 5' G G T C C T G A A A G C A T G T T T G T G A A T C A G C A T C
T T T G C G G A A G T C A T C T G G T T G A G G C T C T T T A
T T A A T A A T A T A A A 3' Following mutagenesis a mutant clone was identified (0GD32) which had the DNA sequence corresponding to the insulin B chain 1-16 residues fused in frame with ST pre-pro sequence.
0GD32 ss DNA was used as the template for the 2nd round of :' : , . ~
20343~3S
- 15c - 23940-679 mutagenesis. The nucleotide sequence of the mutagenic primer was 5' G T T G A G G C T C T T T A T C T T G T A T G T G G T G A A C
G T G G T T T C T T C T A T A C A C C T A A G A C A T A A T A
A T A T A A A G G G 3' Following mutagenesis, the mutant phage 0GD33 was isolated which had the complete B chain gene sequence fused at the 3' end of the ST pre-pro sequence. This fused gene insert was isolated from 0 GD33 as the BamHI-HindIII fragment and subcloned in pET7 to yield pARC0759. E. coli hyperexpression strain BL21-DE3 was transformed with pARC0759 and the transformed clones were screened for the presence of the plasmid. To check whether the clones pARC0750 and pARC0759 were secreting A and B chains respectively ELISA procedures were developed where culture super-natants could be directedly tested. Essential features of the ELISA procedure are described below:
Human insulin was purchased from Novo Industries. It was in a highly pure form which was confirmed on reverse phase HPLC analysis. An aliquot of this pure insulin was reduced quantitatively with DTT and carboxyamidated using iodoacetamide to form stable A & B chains. A direct ELISA as well as an inhibi-tion ELISA for A & B chain detection were developed using antibody raised against human insulin. pARC0750 and pARC0759 in BL21-DE3 host were grown separately in M9 medium and induced with IPTG as described in the previous examples. Following induction the cultures were centrifuged and the supernatants were checked for the presence of A & B chains. The yield of A and B chains as estimated : :~
::
. ~
203'138S
- 15d - 23940-679 by direct ELISA were approximately 16 ug/ml and 30 ug/ml respectively. Inhibition ELISA studies also confirmed the quantitative estimation of the secretion level of the A & B
chains respectively.
. : ~ ., ~ .,, : : : .
; 17. A vector, capable of facilitating the secretion of heterologous protein expressed in a host cell, said vector including DNA encoding the E. Coli st pre-pro sequence M K K S I L M I F L S V L S F S P F A Q D A
,` K P V E S S K E K I T K E S K K C N I A K K S N K S
G P E S M.
18. A construct comprising a vector according to clause ~:.
,~
,~ . - . . , ::
.. . ..
- :
- 6a - 23940-679 1 fused in reading frame to DNA encoding a desired protein.
19. A construct according to clause 18 wherein the DNA
encodes Angiotensin I.
20. A construct according to clause 18 wherein the DNA
encodes insulin A chain.
21. A construct according to clause 18 wherein the DNA
encodes insulin B chain.
22. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain A coding sequence in frame.
23. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain B
coding sequence in frame.
24. A process according to clause 8 or 9 wherein Angio-tensin I is isolated.
:, 25. A process according to clause 8 or 9 wherein insulin chain A is isolated.
26. A process according to clause 8 or 9 wherein insulin chain B is isolated.
27. A process for facilitating the secretion from the host cell of a protein expressed in a host cell which - . ~ -:
. ~- . : : .
20;~385 - 6b - 23940-679 comprises fusing in reading frame DNA encoding the protein to be expressed to an E. coli st pre-pro sequence as defined in clause 17 or 18.
28. The construct pARC 0801 containing the complete E.
coli _ gene sequence and containing an internal Eco RI
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST
peptide.
29. A process for forming a construct which comprises fusing a desired sequence at the Eco RI site of the pARC 0801 construct.
Detailed description of the invention:
Identification and cloning of the human variety of E. coli stI gene was described in detail by P. Dwarakanath et al (1989, Gene, 81, 219-226). In brief, a plasmid of ca. 100 MDa was identified in E. coli strain 86 cal which contained the st gene. A BamHI library of the 100 MDa plasmid of E. Coli 86 cal was constructed in pBR 322 and a st gene containing clone was identified by DNA probe hybridization. This st gene was further subcloned in M13mpl9 as a BamHI - HindIII fragment and the complete 0 fragment was sequenced by Sanger's method.
A part of the sequence containing the open reading frame (ORF) of the st gene is shown in Figure 1. The carboxy : `
X03't385 : 7 terminal 19 aa of the ORF coded peptide corresponds to the sequence of the STh peptide (Aimoto et al 1982 Eur.J.Biochem. 129, 257-263). Four nucleotides preceding the start codon, a sequence suggesting a putative ribosome 5 binding site can be identified. The coding sequence is followed by a pair of stop codon (TAA) and a 15 nucleotide stretch with dyad symmetry presumably indicating a transcription termination signal. The ORF codes for a 72 amino acid peptide in which the 19 amino acid carboxy 10 terminal is the biologically active peptide. The 19 or 20 amino acid stretch at the N-terminus constitutes the signal peptide, having two basic residues [Lys2, Lys3~
following the initiator methionine, a hydropho~ic stretch of amino acids and a consensus sequence for signal 15 cleavage junction. The exact cleavage junction of the signal peptide from the pro-ST region is not known. The pro ST region spans upto Met 53 where it is finally cleaved to yield a biologically active peptide secreted extracellularly with Asn54 as the N-terminal of the 20 mature peptide. r Estimation of the STh concentration in culture supernatants was obtained using the competitive ELISA
technique. This ELISA was routinely used throughout this investigation to estimate the level of STh in culture supernatant. Hyperexpression of the st gene was achieved by subcloning the gene fragment in T7 promoter containing vector. In such a hyperexpression system the induction of hyperexpression was achieved by the addition of the inducer (isopropylthiogalactoside) IPTG. Purification ~; scheme for the ST peptide (wild type/mutant) was accordirig to the method described by P.Dwarakanath et al. (1989;
Gene 81 219-226). Amino acid sequence analysis of the peptide was done by an Applied Biosystem Amino Acid Sequence Analyser (Model 477A). The in vitro site direct~d mutagenesis method was used to generate mutants in the pro region as well as in the mature part of the toxin peptide coding sequence following standard methods (Das et al .
: , .
203~13~35 1989, Proc.Natl.Acad.Sci. U.S.A. 86, 496-499). Similarly, by in vitro site directed mutagenesis method the complete mature ST peptide coding region was replaced by Angiotensin I (Ang I) coding sequence and secretion of the Ang I was monitored by radioimmuno assay (RIA). The results are described under Experimental Data Section.
Esperimental Data I. Construction of secretion vectors:
STh was detected in the culture supernatant of an ETEC E.coli isolate 86 cal which was shown to harbor a plasmid of 100 MDa. A BamHI library of the 100 MDa plasmid of E.coli 86 cal was constructed in pBR
322 and the recombinant clones were checked for st gene. One of the recombinant clones wa~ identified as carrying the st gene in a 1.9 Kb BamHI fragment (pARC
074). A 1.1 Kb BamHI - HindIII fragment obtained from pARC 074 was cloned in pBR 32~ to obtain plasmid pARC 0101. The plasmid pARC 0101 is the ~tarting material for all further experiment~. This plasmid is deposited in National Collection of Industrial Bacteria, Aberdeen, Scotland (strain NCIB 40115 deposi-ted under the Budapest Treaty on February 15, 1989).
Culture supernatants of E.coli B lC1 harboring pARC
0101 elicit a positive biological resp~nse in suckling mice for ST.
II. Construction of M13mpl9ss DNA ~hage containinq st insert.
A 1.1 Kb BamHI - HindIII fragment conta~ning st gene was isolated from pARC0101. This was s~~bcloned in a M13mpl9RF digested with BamHI and H~n~lII. The recombinant replicative form (RF) was transformed in JM101 and plated in presence of X-Gal and IPTG. The transformant white plaques were screer.ed for st gene .. . . .
- ~ ., ~;
: ~ .
.
.
203~38S
g and one such phage clone was identified as 0192 (Fig.
2) which when propagated in JMlOl releases stable toxin in the culture supernatant. 0192 is the starting material for all the in vitro mutagenesis S experiments described.
III. Construction of the ny~rexpression vector for hyperexpression of the wild type or mutant STh 0192 or its mutants werP propagated in JMlOl and the RF was isolated following an overnight growth of the bacteria following a standard protocol of plasmid purification [T. Maniatis et al. 1982 Molecular i-Cloning. A Laboratory Manual. Cold Spring Harbour Laboratory, Code Spring Harbour, NY]. A BamHI-HindIII
fragment of 1.1 Kb was isolated and subcloned in a T7 promoter containing vector as a BamHI-HindIII
fragment such that the promoter was oriented in the same direction as th~t of the transcription of the st gene. The hyperexpression plasmid containing the wild type st gene was designated as pARC 0601 (Fig. 3).
IV. Exampleisi to demonstrate the essent_alit~ of the pro region for extra cellular secretion.
To demonstrate the essentiality of the pro region for extracellular secretion of stable toxin, two mutants of the st gene were made where the mutation was located in the pro region. In the first example a mutation was made at the processing site where the Met53 was altered to Ile 53 using ~192 grown in CJ236 (dut- ung-] strain as the template. The mutagenic primer used for creating this alteration had the sequence 5' C C T G A A A G C A T T A A T A G
T A G C 3' (ATC--->ATT3 This primer (GD21) was annealed to the template and extended in the presence ;~Q3~3~3S
of Sequenase and the four dN$Ps. The extended chain was ligated by T4 DNA ligase and this in vitro synthesized double stranded DNA was used to transform JM101. The transformant plaques were screened by DNA
sequencing and the mutant clones were identified. One such mutant clone (0GD21) was plaque purified and subcloned into the hyperexpression vector by the method as outlined in experimental data Section III.
The resulting plasmid pARC 0701 was used to transform E.coli HB1(~1~ Production of extra cellular ST
(M53->I53~ in an overnight culture was compared with that of pARC 0601 grown in E.coli HB101. The result is shown below:
________________________________________________ Plasmid in HB101 ST production in ug/ml _____ __________________________________________ pARC0501 7 pARC0701 _______~________________________________________ This experiment showed that a conservative change (Met53 ->Ile53) in the pro region reduced the level of secretion of stable toxin by more than 85%.
., The attenuation of STh secretion was more evident when a deletion mutant of st was constructed where the deletion spanned from Ser48 to Ser52 in the pro region~ ~xperimental protocol was the same as that described ln the previous section except for the ; mutagenic primer used which had the sequence 5' G C A
A A A A A A A G T A A T A A A A T G A A T A G T A G C
A A T T A C 3 ' ( GD-ll ~ . Hyperexpression plasmid containing 'his mutant st (Del. Ser 48- SerS2) was designated pARC 0702. When this plasmid was propagated in HB101, the resultant clone did not produce ar.y detectable extracellular STh. These examples (Met53 ->Ile53 and Del. Ser48 -> Ser52) - therefore clearly showed that the presence of intact pro regio-, was necessary for extra cellular STh . . . .
20343~35 secretion.
V. Exam~les demonstrating non-specificit~ of the N-terminus residue of the mature pePtide for correct processinq and extra cellular secretion of the ST
h peptide.
To find the effect of the N-terminal residue of the ST peptide on the processing and extracellular secretion of the peptide two mutants of the st were made. In the first example, Asn54 of the STh was .
changed to Gln54 exactly following the method as described section III of Experimental Data. The mutagenic primer had the sequence of 5' G A A A G C A
T G C A G A G T A G C A A T 3'(ATT-->CAG). The hyper-expression plasmid containing the mutant st Asn54 ->
Gln54 was designated pARC 0732. This plasmid when propagated in E.coli B101 or in the hyperexpression strain BL21-DE3 produced equivalent amount of extracellular toxin compared to that produced by strains HB101 or BL21-DE3 harboring pARC 0601. In order to determine the processing site of the extracellularly secreted mutant ST (Asn54-> GlnS~), the mutant gene was overexpressed and the peptide was purified essentially following the method described by P. Dwarakanath et al (1989, Gene 81, 219-226). The HPLC purification profile is shown in Fig.4. The N-terminal sequence of this peptide revealed a se~uence Gln - Ser - Ser - Asn - Tyr.
Another similar experiment was conducted in which Asn54 of STh was altered to His54. The resulting plasmid containing the mutant st was designated pARC 0716. Following overexpression of the st AsnS4->Ile54 and purification of peptide, the N-terminal sequence of the peptide was determined. The sequence data showed presence of two peptides with N-'erminal sequences ~1) His-Ser-Ser-Asn-Tyr ...and (2) ' ~0~3~3S
Ser-Ser-Asn-Tyr.... A HPLC purification profile of this peptide is shown in Fiy. 5.
These examples clearly demonstrate that the N-terminus of the STh peptide (Asn54) is not crucial for post translational processing and secretion of the peptide.
V~. Example to demonstrate generality of the stable toxin based secretion vector To test the generality of the st ~ased secretion vector, the coding region corresponding to mature ST
peptide (ie. Asn54 to Tyr72) was replaced in the st gene by Angiotensin I coding sequence. Angiotensin I
is a decamer peptide which is converted from a larger precursor Angiotensinogen by Renin protease.
Angiotensin I is further modified to yield Angiotensin II by 'Angiotensin converting enzyme' which deletes the last two carboxyl terminal residues (His-Leu) of Angiotensin I, Angiotensin II is known to be a potential vaso constrictor. The substitution of AngI coding sequence in place of ST coding sequence in the st gene was accomplished by Kunkel's method (T.A.Kunkel, 1985 Proc.Natl. Acad.Sci. USA 82, 48~-492) with some modifications. A mutagenic primer ( GD9 ) was synthesized having the sequence 5' G T G G
T C C T G A A A G C A T G G A C C G G G T G T A C A T
A C A C C C C T T C C A C C T C T T A A T A A T A T A
A A G G G 3'. This 62 mer primer was annealed to ss 0192 DNA template grown in CJ 236. The temperature for annealing reaction was 55 C. The amount of template used was 3ug per reaction while the primer used was lOng. Following annealing, the extension reaction was carried out at 37C for 4 hrs in the presence of 8 units of Sequenase and four dNTPs (final conc. of each dNTP was lmM). The extension mix also contained 4 units of T4 DNA ligase for the ligation of the extended primer to take place. This complex was used to transform E.coli JM101 and the putative clones were identified by DNA sequencing.
The Ang I coding sequence linked st was subcloned into the hyperexpression plasmid and the plasmid was designated as pARC 0726. This plasmid when propagated into B 101 or hyperexpression strain BL21-DE3, the plasmid bearing strains produced AngI peptide extra ~--cellularly as detected by RIA. Following hyperexpression of pARC 0726 in BL21-DE3 in M9 medium the peptide was purified for N-terminal sequencing.
The purification scheme is described below. We believe this purification scheme can be applied generally for other peptides.
BL21-DE3 cells harboring pARC 0726 were grown in M9 medium (250 ml x 4). When the culture reached an A600nm = 0.86, the cells were induced by adding IPTG (final conc. 0.5 mm). After 2.5 hrs following induction, cells were harvested and 470 ml of culture supernatant was mixed with 30 g of Amberlite XAD-4 and allowed to stand at room temperature overnight. After washing the resin thoroughly with water, the bound peptide was eluted with 99% Ethanol/1% acetic acid followed by 80%
Ethanol/1% acetic acid. The eluate was concentrated by flash evaporation and the concentrate was loaded on SP
Sephadex C-25 column (10 ml) previously equilibrated with 20 mM phosphate buffer, pH 6.4. Prior to elution, the column was rinsed with water. Elution was carried out with 50 mM triethylamine. The pH of the eluate was brought to 6.0 with acetic acid. The eluate was lyophilised and the dry powder was reconstituted in 1 ml water. The preparation was subject to HPLC on RP-8 column using an acetonitrile gradient [Solvent A:0.1%TFA, Solvent ~:0.1% TFA/95% Acetonitrile: Flow rate 1 ml/min. Gradient was 10-50%B in 40 mins]. The An~ I peak was detected at 29% B (Fig.6). The purified peptide was sequenced and the sequence was con~irmed .
.: :
.
. .
Z03~3~3S
with the native sequence.
This example clearly demonstrates that a completely heterologous peptide coding sequence can be linked with st pro region and the correctly processed peptide S can be detected extracellularly following propagation of such construct in a suitable E.coli host.
VII. Construction of a general secretion vector_with suitable cloning site.
Introduction of a suitable cloning site in the mature peptide coding region can be accomplished as follows:
To introduce a restriction site within or at very close proximity to the mature peptide coding region, the DNA
stretch 5' A T G A A T A G T 3' representing the amino acid sequence from Met53 to Ser55 is chosen. By altering the nucleotide sequence of Ser55 codon ~AGT~
to the alternate codon TCT, an EcoRI site could be generated without altering the amino acid residue. A
schematic diagram is shown below:
Met53 Asn54 SerS5 A T G A A T A G T
A T GA A T T C T
.
; EcoRI site The single stranded (ss) 0192 DNA template is mutagenized by the method described earlier. The mutagenic primer (GD13) had a nucleotide sequence, 5'A
i G C A T G A A T T C T A G C A A T T A C 3'. A general hyperexpression vector system can be constucted utilizing this EcoRI site as the suitable site for insertion o~ heterologous protein and peptide coding sequence. The hyperexpression plasmid pET7 is digested with EcoRI and BamHI and the large fragment isolated.
0192 (GD13) RF is isolated and digested with BamHI and . - ^ -. ~ : . .; , .:
. . ~ .
EcoRI. Following CIP (calf intestinal phosphatase) treatment of the digestion product, a 620 bp fragment is isolated.
This fragment is ligated with the large fragment of pET7. The resultant recombinant plasmid is named pARC 0801.
(Figure 7) This plasmid can be used as a general secretion vector. Any heterologous peptide or protein coding sequence can be inserted at the EcoRI site (insertion site), for example, peptides such as Angiotensin I, bovine fibroblast growth factor, (bFGF), insulin, and others.
Expression of this plasmid in a E. coli host, results in the secretion of the peptide or protein into the medium which can be purified. It should be noted that peptide produced from this recombinant plasmid will have one additional amino acid residue at its amino terminal (Ser).
The possibility to produce Angiotensin I and, especially, insulin, represent an important aspect of the present invention. Purified insulin A chain and insulin B chain can be used to produce insulin.
VII. EXAMPLES TO DEMONSTRATE SECRETION OF INSULIN A AND B CHAINS
FROM THE STABLE TOXIN BASED SECRETION VECTOR.
The present example demonstrates the generality of the stable toxin based secretion vector. Human insulin consists of two polypeptide chains A (21 amino acid residues) and B (30 amino acid residues) which are linked through SH bonds. These chains can be separated in vitro by reduction and pure forms of A and B chains can be obtained. Under appropriate conditions ' : ' '~ -, : : ~ : ' ~ ' ,"' ' ,- .
. ., ~ .
- 15a - 23940-679 purified A and B chains can be reoxidised to form immunoreactive and biologically active human insulin. The gene sequences of human insulin A and B chains were fused separately at the end of the ST pre-pro sequence. The recombinant plasmid harboured in actively growing E. coli hosts, secreted into the culture medium substances which were immunoreactive against human insulin anti-body. Incubation of the human insulin antibody with authentic insulin A and B chains inhibited in a competitive manner in res-pective cases where recombinant A and B chains were expected to be produced, indicating that the expected product was produced in each case. The recombinant plasmids were constructed using the strategy similar to that described in Experimental Data VI. A Ml3 mp l9 based recombinant phage DNA (0GD24) was used as the tem-plate for the first round of mutagenesis. 0GD24 had a ST gene inserted as a BamHI-HindIII fragment in M 13 mp l9 multiple cloning site. This particular ST insert had a missense mutation at the N-terminal residue of the mature toxin peptide replacing the Asn 54 by Gly 54. The mutagenic primer used for introducing the first round of mutagenesis had the r.ucleotide sequence 5' C C T G A A A G C A T G G G T A T C G T G G A G C A G T G C
T G T A C A T C T A T C T G C T C A C T G T A T T A A T A A T
A T A A A 3' This mutagenesis yielded the phage 0GD26 which had the DN~ sequence corresponding to the N-terminal 1-14 residues of insulin A chain fused in frame with the ST pre-pro region.
0 GD26 DNA was used as the template for the second round of .
.. - .. : , ~- ... :
:: :::
.
~: .. : .: ! ' : ~
203~385 - 15b - 23940-679 ~,.
mutagenesis. The mutagenic primer had the nucleotide sequence 5' T G C T C A C T G T A T C A G C T A G A G A A C T A C
T G C A A C T A A T A A T A T A A A 3' Following mutagenesis, 0GD28 was identified which had the complete A chain gene fused at the C terminal section of the ST pre-pro region. In summary, with 2 rounds of mutagenesis the complete nucleotide sequence of insulin A chain gene was fused in frame at the 3' end of ST pre-pro sequence and thereby com-pletely replacing the mature toxin gene sequence. However, rest of the 3' end sequence of the ST gene was retained intact in the ; final construct. The recombinant phage 0GD28 was grown in JMlOl and the RF was isolated. The RF was digested with BamHI-HindIII
and the 1.1 Kb fragment was cloned into pET7 as a BamHI-HindIII
fragment to yield pARC0750. The hyperexpression E. coli strain BL21-DE3 was transformed with pARC0750. In a similar manner insulin B chain was also synthesised and fused at the 3' end of the ST pre-pro region. The initial template used was 0GD23 which had a ST insert N54F. The nucleotide sequence of the first muta-genesis primer was 5' G G T C C T G A A A G C A T G T T T G T G A A T C A G C A T C
T T T G C G G A A G T C A T C T G G T T G A G G C T C T T T A
T T A A T A A T A T A A A 3' Following mutagenesis a mutant clone was identified (0GD32) which had the DNA sequence corresponding to the insulin B chain 1-16 residues fused in frame with ST pre-pro sequence.
0GD32 ss DNA was used as the template for the 2nd round of :' : , . ~
20343~3S
- 15c - 23940-679 mutagenesis. The nucleotide sequence of the mutagenic primer was 5' G T T G A G G C T C T T T A T C T T G T A T G T G G T G A A C
G T G G T T T C T T C T A T A C A C C T A A G A C A T A A T A
A T A T A A A G G G 3' Following mutagenesis, the mutant phage 0GD33 was isolated which had the complete B chain gene sequence fused at the 3' end of the ST pre-pro sequence. This fused gene insert was isolated from 0 GD33 as the BamHI-HindIII fragment and subcloned in pET7 to yield pARC0759. E. coli hyperexpression strain BL21-DE3 was transformed with pARC0759 and the transformed clones were screened for the presence of the plasmid. To check whether the clones pARC0750 and pARC0759 were secreting A and B chains respectively ELISA procedures were developed where culture super-natants could be directedly tested. Essential features of the ELISA procedure are described below:
Human insulin was purchased from Novo Industries. It was in a highly pure form which was confirmed on reverse phase HPLC analysis. An aliquot of this pure insulin was reduced quantitatively with DTT and carboxyamidated using iodoacetamide to form stable A & B chains. A direct ELISA as well as an inhibi-tion ELISA for A & B chain detection were developed using antibody raised against human insulin. pARC0750 and pARC0759 in BL21-DE3 host were grown separately in M9 medium and induced with IPTG as described in the previous examples. Following induction the cultures were centrifuged and the supernatants were checked for the presence of A & B chains. The yield of A and B chains as estimated : :~
::
. ~
203'138S
- 15d - 23940-679 by direct ELISA were approximately 16 ug/ml and 30 ug/ml respectively. Inhibition ELISA studies also confirmed the quantitative estimation of the secretion level of the A & B
chains respectively.
. : ~ ., ~ .,, : : : .
Claims (29)
1. The use of E.coli st pre-pro sequence coding for (single letter code) M K K S I L M I F L S V L S F S P
F A Q D A K P V E S S K E K I T K E S K K C N I A K K S
N K S G P E S M in the development of a vector for secretion of heterologous proteins.
F A Q D A K P V E S S K E K I T K E S K K C N I A K K S
N K S G P E S M in the development of a vector for secretion of heterologous proteins.
2. The vector pARC 0101, deposit no. NC1B 40115.
3. An E.coli host containing the vector pARC 0101 according to claim 2.
4. The use of pARC 0101 according to claim 2 in the construction of an expression vector for heterologous proteins.
5. A vector containing the E.coli st pre-pro sequence as defined in claim 1.
6. A vector according to claim 5, containing the BamHI-HindIII segment of the E.coli st pre-pro sequence as defined in claim 1.
7. A E.coli host containing a vector as defined in any of claims 5 and 6.
8. A process for the production of heterologous proteins in E.coli, by growing an E.coli host as defined in claim 7 and isolating the desired protein product.
9. A process according to claim 8 wherein the protein product is secreted extra-cellularly.
10. The constract pARC 0726 containing the E.coli st pre-pro sequence fused with the Angiotensin I coding sequence in frame.
11. The use of the construct pARC 0726 according to claim 7 in the expression of Angiotensin I.
12. An E.coli host containing the construct pARC 0726 according to claim 10.
13. A process for the production of Angiotensin I by growing an E.coli host according to claim 12 and isolating the desired protein product.
14. A process according to claim 13 wherein the protein product is secreted extra-cellularly.
15. The construct pARC 0801 containing the complete E.coli st gene sequence and containing an internal Eco RI
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST peptide.
site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST peptide.
16. The use of a first construct pARC 0801 according to claim 15 as a starting material in the preparation of a second construct by fusion of a heterologous or homologous gene sequence at the Eco RI site of the pARC
0801 construct.
0801 construct.
17. A vector, capable of facilitating the secretion of heterologous protein expressed in a host cell, said vector in-cluding DNA encoding the E. coli st pre-pro sequence M K K S I L
M I F L S V L S F S P F A Q D A K P V E S S K E K I T K E S K K
C N I A K K S N K S G P E S M.
M I F L S V L S F S P F A Q D A K P V E S S K E K I T K E S K K
C N I A K K S N K S G P E S M.
18. A construct comprising a vector according to claim 1 fused in reading frame to DNA encoding a desired protein.
19. A construct according to claim 18 wherein the DNA
encodes Angiotensin I.
encodes Angiotensin I.
20. A construct according to claim 18 wherein the DNA
encodes insulin A chain.
encodes insulin A chain.
21. A construct according to claim 18 wherein the DNA
encodes insulin B chain.
encodes insulin B chain.
22. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain A coding sequence in frame.
23. The construct pARC 0726 containing the E. coli st pre-pro sequence fused with the insulin chain B coding sequence in frame.
24. A process according to claim 8 or 9 wherein Angio-tensin I is isolated.
25. A process according to claim 8 or 9 wherein insulin chain A is isolated.
26. A process according to claim 8 or 9 wherein insulin chain B is isolated.
27. A process for facilitating the secretion from the host cell of a protein expressed in a host cell which comprises fusing in reading frame DNA encoding the protein to be expressed to an E. coli st pre-pro sequence as defined in claim 17 or 18.
28. The construct pARC 0801 containing the complete E. coli st gene sequence and containing an internal Eco RI site, said Eco RI site having been created by using the alternate codon for Ser 55 of the mature ST peptide.
29. A process for forming a construct which comprises fusing a desired sequence at the Eco RI site of the pARC 0801 construct.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN52/MAS/90 | 1990-01-18 | ||
IN52MA1990 IN173163B (en) | 1990-01-18 | 1990-01-18 | |
US53892790A | 1990-06-15 | 1990-06-15 | |
US538,927 | 1990-06-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2034385A1 true CA2034385A1 (en) | 1991-07-19 |
Family
ID=26324738
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002034385A Abandoned CA2034385A1 (en) | 1990-01-18 | 1991-01-17 | Vector to produce biologically important peptides |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP0535119A1 (en) |
CA (1) | CA2034385A1 (en) |
GB (1) | GB2240106B (en) |
IE (1) | IE910147A1 (en) |
NZ (1) | NZ236805A (en) |
WO (1) | WO1991019805A1 (en) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4680262A (en) * | 1984-10-05 | 1987-07-14 | Genentech, Inc. | Periplasmic protein recovery |
ATE78515T1 (en) * | 1984-10-05 | 1992-08-15 | Genentech Inc | DNA, CELL CULTURES AND METHODS FOR HETEROLOGOUS PROTEIN SECRETION AND PERIPLASMIC PROTEIN RECOVERY. |
-
1991
- 1991-01-17 CA CA002034385A patent/CA2034385A1/en not_active Abandoned
- 1991-01-17 NZ NZ236805A patent/NZ236805A/en unknown
- 1991-01-17 IE IE014791A patent/IE910147A1/en unknown
- 1991-01-17 GB GB9101057A patent/GB2240106B/en not_active Expired - Fee Related
- 1991-06-12 EP EP91912043A patent/EP0535119A1/en not_active Withdrawn
- 1991-06-12 WO PCT/SE1991/000424 patent/WO1991019805A1/en not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
GB9101057D0 (en) | 1991-02-27 |
GB2240106A (en) | 1991-07-24 |
IE910147A1 (en) | 1991-07-31 |
EP0535119A1 (en) | 1993-04-07 |
GB2240106B (en) | 1994-07-20 |
NZ236805A (en) | 1992-12-23 |
WO1991019805A1 (en) | 1991-12-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101363299B1 (en) | Method for amidating a polypeptide having a basic amino acid C-terminus using a specific endoprotease | |
KR100316347B1 (en) | Recombinant microorganisms expressing a fusion protein of Escherichia coli enterotoxin II signal peptide and fusion protein of human growth hormone and a method of producing human growth hormone using the same | |
US5399490A (en) | Vector to produce biologically important peptides | |
JP4857279B2 (en) | Method for producing peptide having amidated carboxy terminus | |
FI97139B (en) | Process for Preparation of Variants of Pancreatic Secretory Trypsin Inhibitor and DNA, Used Vector and Host Cell | |
JP2989853B2 (en) | Polypeptide, method for producing the same, DNA base sequence, and transformed cell host | |
RU2001110120A (en) | MODIFIED SIGNAL PEPTIDE ENTEROTOXIN-II E. COLI AND MICROORGANISM EXPRESSING THE Fusion PROTEIN OF THE MENTIONED PEPTIDE WITH A HETEROLOGICAL PROTEIN | |
EP0205475B2 (en) | Recombinant methods for production of serine protease inhibitors and dna sequences useful for same | |
PT97093B (en) | PROCESS FOR OBTAINING HYDRUDINE DERIVATIVES | |
RU2144957C1 (en) | Recombinant plasmid dna ppins07 encoding fused polypeptide containing human proinsulin and strain of bacterium escherichia coli - producer of fused polypeptide containing human proinsulin | |
WO1991018101A1 (en) | Process for producing protein | |
EP0510018A1 (en) | Lipoprotein signal peptide fused to antigenic polypeptides | |
JPH10511845A (en) | Methods for producing extracellular proteins in bacteria | |
AU676019B2 (en) | Novel peptide having esterase inhibitory activity and methodof producing same | |
KR100659671B1 (en) | Method for preparing recombinant insulin from novel fusion proteins | |
WO2001068884A1 (en) | Process for producing polypeptide having disulfide bond | |
CA2034385A1 (en) | Vector to produce biologically important peptides | |
AU8076491A (en) | A novel vector to produce biologically important peptides | |
Wardhan et al. | Molecular analysis of the recA gene of Agrobacterium tumefaciens C58 | |
JP5757555B2 (en) | Novel acidic protease and its use | |
Ghasemi et al. | Using L-arabinose for production of human growth hormone in Escherichia coli, studying the processing of gIII:: hGH precursor | |
STRAUSS et al. | Chemical synthesis of a gene for human stefin A and its expression in E. coli | |
US7098018B2 (en) | Aminopeptidase derived from bacillus licheniformis, gene encoding the aminopeptidase, expression vector containing the gene, transformant and method for preparation thereof | |
KR100477062B1 (en) | Novel aminopeptidase derived from Bacilus licheniformis, gene coding the aminopeptidase, expression vector containing the gene, transformant transfected with the expression vector and manufacturing method thereof | |
JP4696915B2 (en) | Protein secretory production system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |