CA2043953C - Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganisms - Google Patents
Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganismsInfo
- Publication number
- CA2043953C CA2043953C CA 2043953 CA2043953A CA2043953C CA 2043953 C CA2043953 C CA 2043953C CA 2043953 CA2043953 CA 2043953 CA 2043953 A CA2043953 A CA 2043953A CA 2043953 C CA2043953 C CA 2043953C
- Authority
- CA
- Canada
- Prior art keywords
- gene
- streptokinase
- skc
- expression
- expression vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 87
- 108010023197 Streptokinase Proteins 0.000 title claims abstract description 45
- 229960005202 streptokinase Drugs 0.000 title claims abstract description 36
- 230000014509 gene expression Effects 0.000 title claims abstract description 31
- 239000002773 nucleotide Substances 0.000 title claims abstract description 24
- 125000003729 nucleotide group Chemical group 0.000 title claims abstract description 24
- 238000000034 method Methods 0.000 title claims description 28
- 238000002955 isolation Methods 0.000 title abstract description 6
- 108020004511 Recombinant DNA Proteins 0.000 title abstract description 5
- 244000005700 microbiome Species 0.000 title abstract description 4
- 239000013598 vector Substances 0.000 claims abstract description 12
- 241000588724 Escherichia coli Species 0.000 claims abstract description 9
- 241000235058 Komagataella pastoris Species 0.000 claims abstract description 9
- 241000264435 Streptococcus dysgalactiae subsp. equisimilis Species 0.000 claims abstract description 3
- 239000013604 expression vector Substances 0.000 claims description 20
- 239000013612 plasmid Substances 0.000 claims description 19
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 16
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 16
- 241000894006 Bacteria Species 0.000 claims description 12
- 108010025188 Alcohol oxidase Proteins 0.000 claims description 6
- 238000013518 transcription Methods 0.000 claims description 6
- 230000035897 transcription Effects 0.000 claims description 6
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 5
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 5
- 241000235648 Pichia Species 0.000 claims description 4
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 claims description 4
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 claims description 4
- 239000003550 marker Substances 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 2
- 101150009006 HIS3 gene Proteins 0.000 claims 2
- 241000588722 Escherichia Species 0.000 claims 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 claims 1
- 108020004707 nucleic acids Proteins 0.000 claims 1
- 150000007523 nucleic acids Chemical class 0.000 claims 1
- 102000039446 nucleic acids Human genes 0.000 claims 1
- 230000001131 transforming effect Effects 0.000 claims 1
- 102000004169 proteins and genes Human genes 0.000 abstract description 23
- 238000010367 cloning Methods 0.000 abstract description 11
- 239000003814 drug Substances 0.000 abstract description 4
- 238000011282 treatment Methods 0.000 abstract description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 3
- 208000035475 disorder Diseases 0.000 abstract description 3
- 230000002068 genetic effect Effects 0.000 abstract description 3
- 230000009466 transformation Effects 0.000 abstract description 3
- 239000003527 fibrinolytic agent Substances 0.000 abstract description 2
- 238000010353 genetic engineering Methods 0.000 abstract description 2
- 229960000103 thrombolytic agent Drugs 0.000 abstract description 2
- 206010011091 Coronary artery thrombosis Diseases 0.000 abstract 1
- 208000002528 coronary thrombosis Diseases 0.000 abstract 1
- 238000010188 recombinant method Methods 0.000 abstract 1
- 229940124597 therapeutic agent Drugs 0.000 abstract 1
- 230000009424 thromboembolic effect Effects 0.000 abstract 1
- 239000000047 product Substances 0.000 description 18
- 210000004027 cell Anatomy 0.000 description 10
- 108010076504 Protein Sorting Signals Proteins 0.000 description 8
- 108020004414 DNA Proteins 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- 150000001413 amino acids Chemical class 0.000 description 7
- 238000003752 polymerase chain reaction Methods 0.000 description 7
- 241000194017 Streptococcus Species 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 230000003834 intracellular effect Effects 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 101000895926 Streptomyces plicatus Endo-beta-N-acetylglucosaminidase H Proteins 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 101150061985 ska gene Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 241000701533 Escherichia virus T4 Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 229950003499 fibrin Drugs 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 229940037201 oris Drugs 0.000 description 3
- 101150086266 skc gene Proteins 0.000 description 3
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 102000009123 Fibrin Human genes 0.000 description 2
- 108010073385 Fibrin Proteins 0.000 description 2
- BWGVNKXGVNDBDI-UHFFFAOYSA-N Fibrin monomer Chemical compound CNC(=O)CNC(=O)CN BWGVNKXGVNDBDI-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 101000605403 Homo sapiens Plasminogen Proteins 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102000013566 Plasminogen Human genes 0.000 description 2
- 108010051456 Plasminogen Proteins 0.000 description 2
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 208000007536 Thrombosis Diseases 0.000 description 2
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 2
- 101150031623 aox gene Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000003480 fibrinolytic effect Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 239000001573 invertase Substances 0.000 description 2
- 235000011073 invertase Nutrition 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000007918 pathogenicity Effects 0.000 description 2
- 210000001322 periplasm Anatomy 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000035899 viability Effects 0.000 description 2
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- OGRXKBUCZFFSTL-UHFFFAOYSA-N 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanol Chemical compound O=NN(C)CCCC(O)C1=CC=CN=C1 OGRXKBUCZFFSTL-UHFFFAOYSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- XTWSWDJMIKUJDQ-RYUDHWBXSA-N Arg-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XTWSWDJMIKUJDQ-RYUDHWBXSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- 101100290837 Bacillus subtilis (strain 168) metAA gene Proteins 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 102220489870 Cofilin-1_H46A_mutation Human genes 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 206010014513 Embolism arterial Diseases 0.000 description 1
- 206010014522 Embolism venous Diseases 0.000 description 1
- 241000392944 Erwinia soli Species 0.000 description 1
- 101100390711 Escherichia coli (strain K12) fhuA gene Proteins 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 239000001828 Gelatine Substances 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- 108010051815 Glutamyl endopeptidase Proteins 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 1
- VLDVBZICYBVQHB-IUCAKERBSA-N His-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 VLDVBZICYBVQHB-IUCAKERBSA-N 0.000 description 1
- 108010003272 Hyaluronate lyase Proteins 0.000 description 1
- 102000009066 Hyaluronoglucosaminidase Human genes 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 206010067482 No adverse event Diseases 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010030544 Peptidyl-Lys metalloendopeptidase Proteins 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- 241001674048 Phthiraptera Species 0.000 description 1
- 108010001014 Plasminogen Activators Proteins 0.000 description 1
- 102000001938 Plasminogen Activators Human genes 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- JQOHKCDMINQZRV-WDSKDSINSA-N Pro-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 JQOHKCDMINQZRV-WDSKDSINSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- 108010059712 Pronase Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 208000010378 Pulmonary Embolism Diseases 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 101150060587 SK gene Proteins 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- 108010011834 Streptolysins Proteins 0.000 description 1
- 101100016403 Takifugu rubripes hars1 gene Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 description 1
- 102100033571 Tissue-type plasminogen activator Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 102000003990 Urokinase-type plasminogen activator Human genes 0.000 description 1
- 108090000435 Urokinase-type plasminogen activator Proteins 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 1
- 239000003593 chromogenic compound Substances 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 210000004351 coronary vessel Anatomy 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 229940119679 deoxyribonucleases Drugs 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 230000002900 effect on cell Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 210000001105 femoral artery Anatomy 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 102000035122 glycosylated proteins Human genes 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000002949 hemolytic effect Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 229960002773 hyaluronidase Drugs 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 101150003180 metB gene Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 229940012957 plasmin Drugs 0.000 description 1
- 229940127126 plasminogen activator Drugs 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 101150059159 proA2 gene Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 231100000820 toxicity test Toxicity 0.000 description 1
- 238000009602 toxicology test Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 229960005356 urokinase Drugs 0.000 description 1
- 208000004043 venous thromboembolism Diseases 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The present invention relates to the field of biotechnology and genetic engineering and in particular a novel nucleotide sequence which codes for a streptokinase, as well as the recombinant DNA obtained therefrom which is used for the transformation of various host organisms. The technical object which is pursued with the proposed solution is to produce the protein in commercially useful quantities, by the recombinant method, with the possibility of alternatively using different hosts to obtain a high-quality streptokinase capable of being used as a thrombolytic agent in the treatment of different disorders. The present invention is based on the isolation of a new gene which codes for streptokinase from Streptococcus equisimilis of type C (strain ATCC-9542) and the cloning and expression thereof in prokaryotic (E. coli) and eukaryotic (Pichia pastoris) hosts, for which is includes the vehicles of expression which contain the genetic sequences of said gene, as well as the microorganisms transformed with these vectors capable of producing streptokinase. The protein obtained thereby can be used in clinical medicine as a therapeutic agent, in the treatment of disorders such as thromboembolic obstructions including coronary thrombosis.
Description
~.-:. 2043953 Title: Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant DNA and transformed microorganisms.
The present invention relates to the field of biotechnology and genetic engineering techniques and in particular a method for the isolation and cloning of a novel nucleotide sequence which codes for a streptokinase, as well as the recombinant DNA obtained therefrom which is used for the transformation of different host organisms.
Streptokinases are plasminogen activators from prokaryotic. cells, which are usually secreted into the culture medium by a large number of haemolytic Strepto oc ~ of different serotypes. Being proteins of bacterial origin, some antigenic responses to them have been detected (Dykewies, M.S. et al., 1985 and McGroth, K.G. et al., 1985).
Their molecular weight is approximately 97,000 dalton.
The function in the pathogenicity of tr p~~oCGi is not known exactly, although potentially it must contribute to elimination or avoidance of the formation of fibrin barriers around the infection.
During interaction of streptokinase with the plasminogen which is in human plasma, it was found that the protein is capable of converting the latter to plasmin, which displays proteolytic activity and can degrade fibrin clots into soluble products.
Streptokinase, urokinase and tissue-type plasminogen activator are at present used~as thrombolytic agents in the treatment of disorders which collectively represent one of the greatest causes of death in the world, such as myocardial infarct, pulmonary, arterial or venous thromboembolism, surgical complications and other cases of thrombosis.
There are physicochemical and immunological differences anal differences in rESpect. of .~~ubstrate specificity which beat witness to molecular heterogeneity of streptokinases of A
. . . _ . .. .. . .. .. ~04~~~~ ..
The present invention relates to the field of biotechnology and genetic engineering techniques and in particular a method for the isolation and cloning of a novel nucleotide sequence which codes for a streptokinase, as well as the recombinant DNA obtained therefrom which is used for the transformation of different host organisms.
Streptokinases are plasminogen activators from prokaryotic. cells, which are usually secreted into the culture medium by a large number of haemolytic Strepto oc ~ of different serotypes. Being proteins of bacterial origin, some antigenic responses to them have been detected (Dykewies, M.S. et al., 1985 and McGroth, K.G. et al., 1985).
Their molecular weight is approximately 97,000 dalton.
The function in the pathogenicity of tr p~~oCGi is not known exactly, although potentially it must contribute to elimination or avoidance of the formation of fibrin barriers around the infection.
During interaction of streptokinase with the plasminogen which is in human plasma, it was found that the protein is capable of converting the latter to plasmin, which displays proteolytic activity and can degrade fibrin clots into soluble products.
Streptokinase, urokinase and tissue-type plasminogen activator are at present used~as thrombolytic agents in the treatment of disorders which collectively represent one of the greatest causes of death in the world, such as myocardial infarct, pulmonary, arterial or venous thromboembolism, surgical complications and other cases of thrombosis.
There are physicochemical and immunological differences anal differences in rESpect. of .~~ubstrate specificity which beat witness to molecular heterogeneity of streptokinases of A
. . . _ . .. .. . .. .. ~04~~~~ ..
different origins, although they are all closely related in respect of function.
Combined with the low yields obtained in production of the protein and the pathogenicity of its natural host, the ,tre~tQcoccus strains used for the industrial production of streptokinase secrete other products into the culture medium, such as deoxyribonucleases, streptolysin br hyaluronidase and proteases, which makes the process of purifying the desired protein difficult. On the other hand, it has not yet been possible to obtain genetically improved strains from these hosts due to the lack of a developed methodology for the gene transfer.
It is due to these drawbacks that the cloning of different isolated genes which code for these proteins has been attempted, using prokaryotic and eukaryotic hosts.
The German Democratic Republic patent no. 249493, IPC: C12 N 15/00, describes the cloning and expression of a gene which codes for streptokinase from strain H46A of Streptococcus ~cuisimilis, belonging to Streptococcus type C, using ~. coli bacteria as the host, wherein levels of excretion into the medium from 0.1 to 1.8 mg/1 are obtained, depending on the age of the culture.
The coding sequence for this gene, which was called SKC, was subsequently determined, including its signal peptide, as well as the identification of the primary structure of adjacent regions involved in the control of transcription and translation of said gene (Malke, H. et al., 1985).
Other nucleotide sequences of genes which code for streptokinases from Streptococcus types G and A have been characterised (Water, F. et al., 1989), In particular, the gene which determines streptokinase type A (gene SKA) from strain 49 type M of Streptococcus pio,genes was cloned and expressed in strain JM109 of ~, coli and in Streptococcus sanauis Challis 57E (Huan , T.T, et al., 1989). In both cases,",;
.. . .,. ..~.. ... ... . , .. _. , , , ... ..,~.,.,,:»~.~~~ . .., .... ... ,.
.. ,, .. , . . . .. . . . . .
protein levels of 0.64 mg/1 and 40 ~.g/1 respectively were produced. In the case of ~. coii, 94$ of the'protein recovered ,,, . .
.204393 _ was in the periplasmic space and 6~ in the cytosol, whereas in pan uis all the enzyme was found extracellularly. Moreover, many clones producing streptokinase in ~. coli were very unstable, as in some cases the SKA gene was deleted or the host cells died owing to some lethal activity of the gene product in question.
In the case of ~. s~"uis, the protein molecule obtained was approximately 3,000 dalton less than native streptokinase and the absence of 32 amino acids from the C-terminal end was detected; however, biological activity was not affected (Huang, T.T. et al., 1989).
Subsequently, conclusive results were obtained with respect to the difficulty of cloning and expressing the isolated SKC gene in ~. coli, it being found that the gene product interferes with the normal physiology of the host, which is shown by the mucosity of the cells which carry this gene, by the incomplete export of streptokinase into the periplasmic space, by the structural instability of the .plasmids which carry the SKC gene and which are designated for the expression of high levels of the protein, as well as by the drawbacks encountered in cloning streptokinase genes from additional serotypes of Streptococcus in plasmids of ~. _roli and unsuccessful attempts to express heterologous genes under the control of expression-excretion signals of the SKC gene itself (Muller, J. et al., 1989).
More recently the company Phillips Petroleum (patent DD257646 IPC: C 12 N 15/00, and Ha , ggenson, M.J. et al., 1989) has obtained the expression of~~streptokinase in the methylotrophic yeast Pichia pastoris, under the control of gene regulatory sequences of alcohol oxidase, wherein yields of the desired product using continuous fermentation of the order of 77-250 mg/1 culture medium were obtained, with an . intermediary cell density of 46 g/1. This system uses the ~KG~ge~ne,- contained i~n the vpla'si~~ti ~~MFS "Which is'''lice~rised' to this company by Dr. J.J Ferretti of Oklahoma University, USA, in the expression vectors.
The controllability of the system makes it attractive, taking into account that it can easily be repressed by using glucose or glycerol and induced with methanol; nevertheless, its application is limited to this host.
It is the object of the present invention to obtain high levels of streptokinase yield in different host systems, by the use of expression vectors which carry a novel nucleotide sequence which represents a genetic variant not described before and which codes for a bioactive streptokinase, which contains the active portion corresponding to streptokinase from ~t~rP.~gt-ococcus ~gllssims>>s of type C, strain ATCC-9592.
The novel isolated gene is called SKC-2, and codes for a protein of the same molecular weight as the one encoded by SKC; it has the fundamental characteristic of stability in vectors of E_. soli and yeasts, adverse effects on cells growth or viability not being found in a single case, which makes it possible to obtain yields greater than those reported up to now in both hosts, in respect of the product obtained, which indicates a streptokinase different to those known up to nova, with the desired biological activity.
A
s , 4a Drawings: The invention will now be described with reference to the following drawings:
Figure 1. Plasmid pEKG-3, which carries the sequence of the SKC-2 gene under the tryptophan promoter of E. coli and displays at the 3~ end of the gene the signal for termination of bacteriophage T4 to give greater stability to expression.
Figure 2. Comparison between the amino acid sequences derived from the base sequences of the genes SKC-2, SKC, SKG and SKA.
Figure 3. Plasmids pPESKC-4 and pPISKC-6, for extracellular and intracellular expression respectively in P.
pastoris.
Figure 4. Relative positions of DNA fragments SK1, SK2 and SK3.
JJ:lcd The present invention relates to a method for the isolation and expression of a novel nucleotide sequence corresponding to the SKC-2 gene, the product of which is a protein of approximately 97,000 dalton. Said protein belongs to the streptokinases, which are distinguished by their fibrinolytic activity.
The present invention also relates to this gene, l.he sequence of which corresponds to seq. id. No. 1 in the Sequence T~isting.
It was obtained from the genome of the ~ re~tocsccus eauis~.rnilis type C strain (ATCC-9592) by gene amplificat.ian using the polymerase chain reaction (PCR) from three synthetic oligonucleot.ides called SK1, SK2 and SK3 having the seyuer~c S..K1 5' . ... TGG1AATTCAT.G11AA1~~'~'1~~',TTj~T,CT . . . 3_' sect . id . Ne .
SK2 5'...'TGGATCCTTATTTGTCGTTAGGGTTATC...3' seq. id. Nm. i SK3 5'...GGnATTCATGATTGCTGGACCTGAGTGGCTG...3' seq. id. No. 4 JJ:lcd a and constitute a novel aspect for this method. The relative positions of the DNA fragments SK1, SK2 and SK3 are depicted in Fig. 4. These primers further carry restriction sites which are not found in the gene and which allow direct cloning 5 in an expression vector. On the other hand, the SK3 which hybridises at the 5~ end carries an ATG which acts as a site for initiating translation and removes the signal peptide of SKC-2. The three oligonucleotides were synthesised from the SKC sequences published by Malke et al., 1985; and with them were marked the boundaries of the exact fragment of the gene which codes for the mature protein or the complete gene including the signal peptide.
Another novel aspect of this method is the possibility of expressing the isolated gene in both bacteria and yeasts, high levels of expression being achieved in both cases.
The present invention also relates to recombinant DNA which includes the SKC-2 gene, such as vectors for the expression of this gene, in bacteria pEKG3 (Fig. 1), and in yeasts pPESKC-4 and pPISKC-6 (Fig. 3). In particular for expression in ~. coli, the SKC-2 gene with or without its signal peptide is cloned under the tryptophan promoter and with the transcription termination signal of phage T4. For yeasts there was used an expression vector in which the SKC-2 gene is located behind the signal peptide of sucrose invertase (SUC2) controlled by the alcohol oxidase gene promoter (AOX1) of ich' a pastoris and which carries the termination signal of the glyceraldehyde-3-phosphate dehydrogenase (GAPt) gene of Saccharomyces cerevisiae for the extracellular expression variant, this vector being called pPESKC-4. For the intracellular expression of SKC-2, the vector pPISKC-6 is used which does not contain the signal peptide of SUC-2 behind the JJ:lcd a 2043953 _ 5a AOX1 promoter, and this is obtained by inserting the SKC-2 gene in the expression vector pNAO (kindly provided by Muzio, OIGB, Havana, Cuba) (Fig. 3). The HISS gene of S. cerevisiae is used as a selection marker in both vectors, and the expression cassette referred to above JJ:lcd Bt is flanked by 5' and 3' sequences of the AOX gene of ~ hia gast-oris for integration.
The present invention also relates to the microorganisms resulting from transformation of ~.. ,~oli strain 4V3110 with vector pEKG3, and to mutant MP-36 of P.ichia ~asroris with expression vectors pPESKC-4 and pPISKC-6, which axe characterised as expressing high levels of streptokinase, and having good viability (the product has no adverse effects on cell growth) and high stability of the strains transformed.
The transformed ~.. coli clone was called HSK-M and exhibits levels of expression of the product of the SKC-2 gene greater than 350 mg/1 culture medium..
The transformed Pichia has oris strains MSK-M9 and MSK-M6 produce streptokinase levels intracellularly and extracellularly respectively which vary between 1.0 and 1.2 g/1 culture medium.
The method described in the present invention, given the levels of expression never before reported for this product, makes it possible to reach optimum purity thereof for its administration to human beings and animals, without the need to develop a complex and costly process for purification.
Examples The following examples are intended to illustrate but not limit the invention. ~. ~oli and ~i~hia ~~storis are used as host systems in these examples; nevertheless, other eukaryot.i.c and prokaryotic cells can be~used for the method described in the present invention.
Fxamg,~e 1 For the isolation of genomic DNA of ~trPp~~occus ~,L~imilis of type C, strain ATCC-9542 was used as a source thereof for cloning the SK gene. The cells of ~~rPnrococcus aauis.;mills were =grown -in Br..aHin Heart .Infusion .Medi..um .(GLBCO) at 240 r.p.m. for 12 hours at 37°C in 5-ml pre-cultures acting as an inoculum which was grown for 12 hours in a 300-ml *Trade-mark ,~ 7 203953 Erlenmeyer flask. The cells were collected by centrifugation at 3,000 r.p.m. and resuspended in 8 ml lyse (4.5 g glucose, 1.$6 g EDTn, 1.51 g tris-HC1, in 500 ml sterile water, pH=8), 80 ~1 of lysozyme were added at a concentration of 10 mg/ml, and the suspension was incubated for 30 minutes at 37°C. Next, to obtain efficient cell rupture, 500 ~1 pronase (Boehringer), 1 ml SDS at 10~ and 200 ail EDTA, 0,5 M, pH=8, were added and the suspension was incubated for 2 hours at 50°C with smooth agitation. Successive treatments with phenol, phenol-chloroform and chloroform were carried out, and the genomic DNA was precipitated with absolute ethanol and NHqAc" 7.5 M.
The yield obtained was 100 Hg per 300-ml Erlenmeyer culture flask. The presence of the gene which codes for streptokinase was verified by the Southern blot technique (Maniatis et al., 1982).
For subcloning in bacteria, 1 ~g genomic DNA of the ~trPntococcus Pau~s~m>>;s type C strain (ATCC-9592) was taken and the gene which codes for SKC-2 was amplified by a PCR
(Randall et al., 1988) using the oligonucleotides SK1 and SK2 for cloning the gene with its signal peptide and SK2-SK3 for cloning without it.
In each reaction 100 pmol of each oligonucleotide, 2 units of Taq polymerase (Perkin Elmer, USA) and 200 )unol of each dNTP were used and the reactions were performed in 10 mM MgCl~, 100 mM dTT, 10 mM NaCl and 100 ~g/ml gelatine.
Thirty amplification cyc~.es were performed, wherein in each one the reaction was incubated at 95°C for 1 minute for denaturisation, at 52°C for 95 seconds for hybridisation of the oligonucleotides and at 70°C for 80 seconds for extension.
An efficiency greater than 5~ of amplification was obtained.
For cloning in bacteria (F. co i), a new genetic construct was used, wherein the tryptophan promoter of E_. coli and the termination signal of bacteriophage T4 are used.
A
The fragments amplified in the PCR were digested with BamHI
and ligated with the vector ptrip-N_coI-S1-CHI
(Estrada et al., 1988). This construct was transformed into a preparation of competent cells prepared according to Dagert et al. (1979) and Hanahan et al. (1983), of ~. coli strain HB101 ((rg-mg-), sunEq4, ara-19, aalK-2, cYl, proA2, rnsL20, (StrR) , ~cyl-5, ~1-5, ~1-1, recAl3 } , which had a frequency greater than 10~ transformants per g DNA.
The colonies obtained were applied to plates of LB medium (10 g/1 trypton, 5 g/1 yeast extract, 10 g/1 sodium chloride) and 50 ~tg/ml ampicillin, and hybridised according to Maniatis et al. (1982), using as a probe the fragment resulting from the amplification in the PCR, which was marked using dATP32 (Amersham, UK) and the Klenow fragment of DNA-polymerase I of ~. ~oli (Maniatis et al., 1982) in whatman 541 filters for 30 minutes at 37°C, the reaction being terminated by EDTA and heat. 4~ of the colonies were positive clones, which were examined by restriction analysis and had the same pattern of digestion with more than 10 restriction enzymes;
moreover the positive clones were checked by double chain DNA
sequencing (Sanger et al., 1977), using therefor an oligo-nucleotide of 17 bases (5'...ATCATCGAACTAGTTAA...3', seq. id.
No. 5) which hybridises at the 3' end of the promoter, with which it was corroborated that joining of the latter to the SKC-2 gene was as desired.
The selected clone was called pEKG-3 (E'ig. 1), which was subjected to fermentation to realise the characterisation of the product.
The plasmid of clone pEKG3 was purified using a CsCl gradient and the sequences of the SKC-2 gene were established, each time using 2 g of plasmid, and using the oligonucleotides which appear below as primers:
SSK-O1 5'...GAATCAAGACATTAGTC...3' seq. id. No. 6 SSK-02 5'...GTGGCGCGATGCCAC..,3' seq. ld. NO. 7 SSK-03 5'...GCAACCATTACTGATCG...3' seq. id. No. 8 SSK-04 5'...CCAGTACAAAATCAAGC...3' seq> id. No. 9 *Trade-mark a ,~~ 204395 SSK-05 5'...CTAGCTATCGGTGACAC...3' seq. id. No. 10 SSK-06 5'...CAGAGATCAGGTCAG...3' seq.~id. No. ll SSK-07 5'...GTTAAGAGCTGCTCGC...3' seq. id. No. 12 SSK-08 5'...CCAGTTAAGGTATAGTC...3' seq. id. No. 13 SSK-09 5'...TCTCGTTCTTCTTCGG...3' seq. id. No. 14 The protocol followed was basically according to Sanger et al. (1977), and dATP32 and S35dATP (Amersham, UK) were used.
Fig. 2 shows a comparison between the amino acid sequence derived from the base sequence of SKC-2, and those of the genes SKC (Malke et al., 1986), SKA and SKG (Water, F. et al., 1989) .
The plasmid pEKG3 was transformed in several E,, coli strains such as W-3110, JM-101, LE392 and MC-1061 and the expression of streptokinase was compared between them. The best results were obtained with strain W3110 (F- supF suoE
~R g~",],K ~gR metB ~Y tonA), owing to which it was selected to be subjected to fermentation, wherein stable expression levels greater than 20~ of the total protein content of the cells were obtained, and 350-400 mg streptokinase per litre of culture medium were obtained.
ExamBle 3 For subcloning of the SKC-2 gene in yeast, strain MP36 of Piehia nastori~ was used as the host, and variants were made for intracellular and extracellular expression from the plasmids pNAO and pPS7 (Fig. 3), using the signal peptide of sucrose invertase for the extY~acellular construct, and in both cases subcloning the gene under the control of the alcohol oxidase (AOX) promoter, using as the terminator at the 3' end the termination signal of the glyceraldehyde-3-phosphate '. dehydrogenase gene of ~. e-erevisiae and a non-coding 3' region of the AOX gene for integration by homology in the genome of the ~'e.ast,.... fuxt,~e~;, ,~e.~'~,n,~ ~ ~ ;k~~", ~e ~nco, i ... 35~....,..which w 1~ ' .."~, ..k~ ,9.~ . .." ,.. ~
,~.9.:~~;~,~~~~~.~?~,"~~... ..,....",... , .
as used for selection in strain MP36 his'.
204395"3 The vector pPESKC4 (plasmid for extracellular expression of the protein) was obtained from the vector construct pPS7-NcoI-S1 nuclease-phosphatase ligated with the SKC-2 gene amplified by PCR from pEKG3 with the 5 oligonucleotide SK2 and a new primer which hybridises with the 5' end of the gene and eliminates the ATG which had been inserted for expression in bacteria. This new primer is a fragment of SK-3 which does not contain the ATG. In the case of intracellular expression, pPISKC6 was obtained (plasmid for 10 intracellular expression of the protein) from the vector pNAO-NcoI-EcoRI-S1 nuclease-phosphatase ligated with the band of SKC-2 amplified by PCR, with the primers SK2 and SK3 with which is obtained the exact gene which codes for streptokinase with an ATG at its 5' end (Fig. 3). Both plasmids were transformed in strain MP36 his-, using the protocol according to Cregg, J. et al. (1985).
The positive clones were studies by the Southern blot method (Maniatis et al., 1982), and out of those which had the correct integration were selected the most productive in each case for characterisation of the product.
The expression of recombinant streptokinase obtained in P_. pastoris extracellularly was 1-1.2 g/1 culture supernatant, and in case of the intracellular construct more than 1.0 g/1 culture.
In the construct for extracellular expression, the glycosylated protein was obtained with a molecular weight greater than 67,000 dalton, it being corroborated by Western blotting that it decreases to the molecular weight of native streptokinase when it is digested with endoglycosidase H.
This was carried out by taking a portion having a concentration equal to 1 mg/ml in a sodium citrate solution, 0.05 molar, pH=5.5, and denaturing it by the addition of SDS
JJ:1 cd 10a (final concentration 0.02%) and heating at 100°C for 10 minutes, then leaving it to cool to ambient temperature and adding 20 milli-units (mU) of endoglycosidase H (endo H) and leaving it for 16 hours at 37°C, at the end of which it is subjected to subsequent heating at 100°C for 5 minutes and 10 mU of endo H are added, followed by 12 hours incubation at 37°C, and it is JJ:Icd °', ~,'"'""""
. . . . . . . _ . . . 204~:9..~~ ...
11 ' applied to a 12.5% polyacrylamide ge,l and compared with an undeglycosylated sample.
The streptokinase produced in .fichia gastoris in both constructs maintains biological activity, not varying its affinity for plasminogen and being in fact another variant for the use of this protein in clinical medicine.
Example 4 To verify the biological activity of the product of the SKC-2 gene, the pure recombinant streptokinase obtained from both bacteria and yeast was used for acute and subacute toxicology tests on rats, wherein satisfactory and acceptable results were obtained to allow its use in human and animal therapeutics. Its in vivo fibrinolytic activity was verified in clinical tests on animals, wherein there was success in dissolving clots in the coronary and femoral arteries of dogs, blood parameters being maintained similar to those reported in the literature with this type of product.
The product of the SKC-2 gene showed a specific activity of 50,000-100,000 IU/mg, which was measured on plates of agarose-fibrin (Astrup et al.., 1952), chromogenic substrate (Friberger et al., 1982) and in vitro clot lysis according to Westlund et al. (1985).
Exam 1~ a 5 To verify the amino acid sequence derived from the base sequence of the SKC-2 gene, an analysis was made of the pure product by high-performance Liquid chromatography in reverse phase (HPLC-RP), using therefor a C8 4.6 x 250 mm column (Baker, USA), wherein there was used the gradient 5 minutes at 0% buffer B and up to 90% B in 55 minutes, with buffer A
(trifluoroacetic acid (TFA, Pierce, USA) at 0.1% in distilled water) and buffer 8 (TFA at 0,.,5% ~.n acetonitrile .
,, , ~ ,(~L.ichrosolv, ,Merck,T, FRG) ),,..,. ~a~"~,~air)in~ :a, flow , r,ate",off ,, M , "" , ,. , ""
0.8 ml/min.
."
. _ . _ _ .. 12 ~0 ~3~ j~ .
With the protein with a high degree of purity, the amino acid sequence derived from the base sequence obtained from the SKC-2 gene was verified by sequencing it by mass spectrometry.
For this the protein was digested with different enzymes and with combinations of them. The enzymes used were chymotrypsin, endoproteinase Glu-C, endoproteinase Lys-C and trypsin .
From the analysis of the mass spectra of the°peptides obtained in each of the digestions with the different enzymes, the map of the amino acid sequence of the protein was constructed by superposition, which made it possible to verify that there is in this case 100 correspondence between the sequence of the SKC-2 gene and the amino acid sequence of the protein obtained.
Strain deposits The ~. coli HSK-M [pEKG3] strain, based on the ~. coli strain W3110 and containing the plasmid pEKG3, was deposited on June 11, 1990, with the Centraalbureau voor Schimmelcultures (CBS), Baarn, The Netherlands, and obtained deposit number CBS 243.90.
Likewise, the Pichia pas oris MSK-M4 [pPESKC-4] strain, based on the Pir,~.zia gastoris strain MP-36 and containing the plasmid pPESKC-4, was deposited on June 11, 1990, with the Centraalbureau voor Schimmelcultures (CBS), Baarn, The , Netherlands, and obtained deposit number CBS 244.90.
_ .:. W . . ...~ . , . , ,. ,. . , . . . . . . t ~E~~ ~ : . . .. .., . .. . .
. . . . .. . . .. . .
' . ,.
',: ' R1~ FERENCES
Astrup, 'T., and Mullertz, S., 1952, Arch. Biochem. Biophys.
40: 346-351 Burnett, N.N., 1981, nnal. Biochemistry 112: 195-203 Cregg, J.M., Barringer, K.J., Iiessler, A.Y., and Madden, K.R., 1985, Mol. Cell. Biol. 5: 3376-3385 Dagert, M., and Ehrlich, S.D., 1974, Gene 6: 23-28 Dykewicz, M.S., McGrath, K.G., Harris, K.E., and Patterson, R., 1985, Int. Arch. Allergy Appl. Immun. 78: 386-390 Estrada, M.P., Hernandez, O., and de la Fuente, J., Interferon y Biotecnolog~.a 5: 152-156 Fiberger, P., 1982, J. Clin. Lab. Invest. 92, Suppl. 162:
Hanaha~~, D., 1983, J. Mol. Biol. 166: 557-580 Huang, T.T., Malke, Ii., and Ferrett_i, J.J., 1989, Molec.
Microb . 3 (2 ) : 197-205 Malke, ti., and Ferretti, J.J., 1984, Proc. Natl. Acad. Sci.
USA, 81: 3557-3561 Malke, H., Roe, B., and Ferretti, J.J., 1985, Gene 34: 357-362 Maniatis, T., Frisch, E.F., and Sambrook, J., 1982, Cold Spring Harbor Laboratory, USA
McGrath, K.G., Zeffren, B., Alexander, J., Kaplan, K., and Patterson, R.J., 1985, Allergy Clin. Immunol. 76: 453-457 Muller, J . , Reinert, fi . , and Malke, H . , 1989, Journal of Bacteriology, Apr.: 2202-2208 Randall, K., Gelfond, D.H., Stoffel, S., Scharf, S., t~iguchi, R., Horn, G.T., Mullis, K.B., and Erlich, H.A., 1988, Science 239: 987-491 Sanger, F., Nickler, S., and Coulson, A.K., 1977, Proc. Natl.
Acad. Sci. USA 74: 5463-5467 Tombin, H., Stahelin, T., and Gordon, J., 1979, Proc. Natl.
Acad. Sci. USA 76: 4350-4354 Walter, F., Siegel, M., and Malke, H., 1989, Nucl. Acids Res.
17 (3) : 1262 Westtund, L.E., and Anderson, L.O., 1985, Thrombosis Research 37: 213-223 Haggenson M.J. et al., Enzyme Microb. and Technol. 11, 650-656 (1989) ~4 20,4,3953 SEQUENCE LISTING
SEQ ID NO:1 SEQUENCE TYPE: Nucleotide with corresponding protein SEQUENCE LENGTH: 1245 base pairs STRANDEDNESS: single TOPOLOGY: linear MOLECULE TYPE: genomic DNA
ORIGINAL SOURCE ORGANISM: Streptococcus equisimilis from group C
of Lanfield definition IMMEDIATE EXPERIMENTAL SOURCE: ATCC-9542 strain FEATURES: from 1 to 1245 by mature peptide PROPERTIES: Streptokinase gene The gene product binds to human plasminogen The gene product is an activator of human plasminogen GCT CTA
CCT CGT
GAG CCA
TGG TCT
Ile GTC
Ala AAC
Gly AAC
Pro AGC
Glu Leu Trp Leu 1 Asp Arg CAA Pro TTA Ser GTT Val GTT Asn AGC Asn GTT Ser Gln 10 Leu 15 Val GCT
Val GGT
Ser ACT
Val GTT
GGG
ACG
AAT
CAA
GAC
Ala Gly Thr Val Glu Gly Thr Asn Gln Asp IleSer Leu Lys PhePhe GluIle Asp LeuThr Ser Arg ProAla His GlyGly Lys Thr GluGln GlyLeu Ser ProLys Ser Lys ProPhe Ala ThrAsp,Ser Gly AlaMet ProHis Lys LeuGlu Lys Ala AspLeu Leu LysAla Ile Gln GluGln LeuIle Ala AsnVal His Ser AsnAsp Asp TyrPhe Glu Val IleAsp PheAla Ser AspAla Thr Ile ThrAsp Arg GGC Lys Val TTT Asp Lys AspGly Ser Val ThrLeu Pro Asn 115 GCT
Gly Tyr 120 125 Phe Ala t~.
~5 2043953 ThrGlnPro Val Gln GluPhe LeuLeu Ser Gly HisVal ArgVal Arg 130 135 ~ 140 ProTyrLys Glu Lys ProIle GlnAsn Gln Ala LysSer ValAsp Val GluTyrThr Val Gln PheThr ProLeu Asn Pro AspAsp AspPhe Arg ProGlyLeu Lys Asp ThrLys LeuLeu Lys Thr LeuAla IleGly Asp ThrIleThr Ser Gln GluLeu LeuAla Gln Ala GlnSer IleLeu Asn AAAACC,CAC CCA GGC TATACG ATTTAT GAA CGT GACTCC TCAATC GTC 672 LysThrIsisPro Gly TyrThr IleTyr Glu Arg AspSer SerIle Val 210-~ 215 220 ThrHisAsp Asn Asp IlePhe ArgThr Ile Leu ProMet AspGln Glu PheThrTyr His Val LysAsn ArgGlu Gln Ala TyrGlu IleAsn Lys LysSerGly Leu Asn GluGlu IleAsn Asn Thr AspLeu IleSer Glu AAATAT,TAC GTC CTT AAAAAA GGGGAA AAG CCG TATGAT CCCTTT GAT 864 LysTyrTyr Val Leu LysLys'GlyGlu Lys Pro TyrAsp ProPhe Asp ArgSerHis Leu Lys LeuPhe ThrIle Lys Tyr ValAsp ValAsn Thr AsnGluLeu Leu Lys SerGlu GlnLeu Leu Thr AlaSer GluArg Asn TTAGACTTC AGA GAT TTATAC GAT,~ CGT GAT AAGGCT AAACTA CTC 1008 CT
LeuAspPhe Arg Asp "LeuTyr Asp'1 Arg Asp LysAla LysLeu Leu ~~ro TyrAsnAsn Leu Asp AlaPhe GlyIle Met Asp TyrThr LeuThr Gly a 2p~3953 LysVal GluAsp Asn HisAsp AspThr Asn ArgIle Ile Thr ValTyr MetGly LysArg Pro GluGly GluAsn Ala SerTyr.His Leu AlaTyr AspLys AspArg Tyr ThrGlu GluGlu Arg GluVal Tyr Ser TyrLeu ArgTyr ThrGly Thr ProIle ProAsp Asn ProAsn Asp .Lys A
2043953 _.
SEQ ID N0:2 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 26 bases TGGAATTCAT GAAAAATTAC TTATCT
SEQ ID N0:3 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 28 bases TGGATCCTTA TTTGTCGTTA GGGTTATC
SEQ ID N0:9 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 31 bases GGAATTCATG ATTGCTGGAC CTGAGTGGCT G
SEQ ID N0:5 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases ATCATCGAAC TAGTTAA
SEQ ID N0:6 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases GAATCAAGAC ATTAGTC
SEQ ID N0:7 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 15 bases GTGGCGCGAT GCCAC
SEQ ID N0:8 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases GCAACCATTA CTGATCG
.,, 18 204r395~3 SEQ ID N0:9 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases SEQ ID NO:10 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases CTAGCTATCG GTGACAC
SEQ ID NO:11 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 15 bases CAGAGATCAG GTCAG
SEQ ID N0:12 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 16 bases GTTAAGAGCT GCTCGC
SEQ ID N0:13 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases CCAGTTAAGG TATAGTC
SEQ ID N0:14 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 16 bases TCTCGTTCTT CTTCGG
A
Combined with the low yields obtained in production of the protein and the pathogenicity of its natural host, the ,tre~tQcoccus strains used for the industrial production of streptokinase secrete other products into the culture medium, such as deoxyribonucleases, streptolysin br hyaluronidase and proteases, which makes the process of purifying the desired protein difficult. On the other hand, it has not yet been possible to obtain genetically improved strains from these hosts due to the lack of a developed methodology for the gene transfer.
It is due to these drawbacks that the cloning of different isolated genes which code for these proteins has been attempted, using prokaryotic and eukaryotic hosts.
The German Democratic Republic patent no. 249493, IPC: C12 N 15/00, describes the cloning and expression of a gene which codes for streptokinase from strain H46A of Streptococcus ~cuisimilis, belonging to Streptococcus type C, using ~. coli bacteria as the host, wherein levels of excretion into the medium from 0.1 to 1.8 mg/1 are obtained, depending on the age of the culture.
The coding sequence for this gene, which was called SKC, was subsequently determined, including its signal peptide, as well as the identification of the primary structure of adjacent regions involved in the control of transcription and translation of said gene (Malke, H. et al., 1985).
Other nucleotide sequences of genes which code for streptokinases from Streptococcus types G and A have been characterised (Water, F. et al., 1989), In particular, the gene which determines streptokinase type A (gene SKA) from strain 49 type M of Streptococcus pio,genes was cloned and expressed in strain JM109 of ~, coli and in Streptococcus sanauis Challis 57E (Huan , T.T, et al., 1989). In both cases,",;
.. . .,. ..~.. ... ... . , .. _. , , , ... ..,~.,.,,:»~.~~~ . .., .... ... ,.
.. ,, .. , . . . .. . . . . .
protein levels of 0.64 mg/1 and 40 ~.g/1 respectively were produced. In the case of ~. coii, 94$ of the'protein recovered ,,, . .
.204393 _ was in the periplasmic space and 6~ in the cytosol, whereas in pan uis all the enzyme was found extracellularly. Moreover, many clones producing streptokinase in ~. coli were very unstable, as in some cases the SKA gene was deleted or the host cells died owing to some lethal activity of the gene product in question.
In the case of ~. s~"uis, the protein molecule obtained was approximately 3,000 dalton less than native streptokinase and the absence of 32 amino acids from the C-terminal end was detected; however, biological activity was not affected (Huang, T.T. et al., 1989).
Subsequently, conclusive results were obtained with respect to the difficulty of cloning and expressing the isolated SKC gene in ~. coli, it being found that the gene product interferes with the normal physiology of the host, which is shown by the mucosity of the cells which carry this gene, by the incomplete export of streptokinase into the periplasmic space, by the structural instability of the .plasmids which carry the SKC gene and which are designated for the expression of high levels of the protein, as well as by the drawbacks encountered in cloning streptokinase genes from additional serotypes of Streptococcus in plasmids of ~. _roli and unsuccessful attempts to express heterologous genes under the control of expression-excretion signals of the SKC gene itself (Muller, J. et al., 1989).
More recently the company Phillips Petroleum (patent DD257646 IPC: C 12 N 15/00, and Ha , ggenson, M.J. et al., 1989) has obtained the expression of~~streptokinase in the methylotrophic yeast Pichia pastoris, under the control of gene regulatory sequences of alcohol oxidase, wherein yields of the desired product using continuous fermentation of the order of 77-250 mg/1 culture medium were obtained, with an . intermediary cell density of 46 g/1. This system uses the ~KG~ge~ne,- contained i~n the vpla'si~~ti ~~MFS "Which is'''lice~rised' to this company by Dr. J.J Ferretti of Oklahoma University, USA, in the expression vectors.
The controllability of the system makes it attractive, taking into account that it can easily be repressed by using glucose or glycerol and induced with methanol; nevertheless, its application is limited to this host.
It is the object of the present invention to obtain high levels of streptokinase yield in different host systems, by the use of expression vectors which carry a novel nucleotide sequence which represents a genetic variant not described before and which codes for a bioactive streptokinase, which contains the active portion corresponding to streptokinase from ~t~rP.~gt-ococcus ~gllssims>>s of type C, strain ATCC-9592.
The novel isolated gene is called SKC-2, and codes for a protein of the same molecular weight as the one encoded by SKC; it has the fundamental characteristic of stability in vectors of E_. soli and yeasts, adverse effects on cells growth or viability not being found in a single case, which makes it possible to obtain yields greater than those reported up to now in both hosts, in respect of the product obtained, which indicates a streptokinase different to those known up to nova, with the desired biological activity.
A
s , 4a Drawings: The invention will now be described with reference to the following drawings:
Figure 1. Plasmid pEKG-3, which carries the sequence of the SKC-2 gene under the tryptophan promoter of E. coli and displays at the 3~ end of the gene the signal for termination of bacteriophage T4 to give greater stability to expression.
Figure 2. Comparison between the amino acid sequences derived from the base sequences of the genes SKC-2, SKC, SKG and SKA.
Figure 3. Plasmids pPESKC-4 and pPISKC-6, for extracellular and intracellular expression respectively in P.
pastoris.
Figure 4. Relative positions of DNA fragments SK1, SK2 and SK3.
JJ:lcd The present invention relates to a method for the isolation and expression of a novel nucleotide sequence corresponding to the SKC-2 gene, the product of which is a protein of approximately 97,000 dalton. Said protein belongs to the streptokinases, which are distinguished by their fibrinolytic activity.
The present invention also relates to this gene, l.he sequence of which corresponds to seq. id. No. 1 in the Sequence T~isting.
It was obtained from the genome of the ~ re~tocsccus eauis~.rnilis type C strain (ATCC-9592) by gene amplificat.ian using the polymerase chain reaction (PCR) from three synthetic oligonucleot.ides called SK1, SK2 and SK3 having the seyuer~c S..K1 5' . ... TGG1AATTCAT.G11AA1~~'~'1~~',TTj~T,CT . . . 3_' sect . id . Ne .
SK2 5'...'TGGATCCTTATTTGTCGTTAGGGTTATC...3' seq. id. Nm. i SK3 5'...GGnATTCATGATTGCTGGACCTGAGTGGCTG...3' seq. id. No. 4 JJ:lcd a and constitute a novel aspect for this method. The relative positions of the DNA fragments SK1, SK2 and SK3 are depicted in Fig. 4. These primers further carry restriction sites which are not found in the gene and which allow direct cloning 5 in an expression vector. On the other hand, the SK3 which hybridises at the 5~ end carries an ATG which acts as a site for initiating translation and removes the signal peptide of SKC-2. The three oligonucleotides were synthesised from the SKC sequences published by Malke et al., 1985; and with them were marked the boundaries of the exact fragment of the gene which codes for the mature protein or the complete gene including the signal peptide.
Another novel aspect of this method is the possibility of expressing the isolated gene in both bacteria and yeasts, high levels of expression being achieved in both cases.
The present invention also relates to recombinant DNA which includes the SKC-2 gene, such as vectors for the expression of this gene, in bacteria pEKG3 (Fig. 1), and in yeasts pPESKC-4 and pPISKC-6 (Fig. 3). In particular for expression in ~. coli, the SKC-2 gene with or without its signal peptide is cloned under the tryptophan promoter and with the transcription termination signal of phage T4. For yeasts there was used an expression vector in which the SKC-2 gene is located behind the signal peptide of sucrose invertase (SUC2) controlled by the alcohol oxidase gene promoter (AOX1) of ich' a pastoris and which carries the termination signal of the glyceraldehyde-3-phosphate dehydrogenase (GAPt) gene of Saccharomyces cerevisiae for the extracellular expression variant, this vector being called pPESKC-4. For the intracellular expression of SKC-2, the vector pPISKC-6 is used which does not contain the signal peptide of SUC-2 behind the JJ:lcd a 2043953 _ 5a AOX1 promoter, and this is obtained by inserting the SKC-2 gene in the expression vector pNAO (kindly provided by Muzio, OIGB, Havana, Cuba) (Fig. 3). The HISS gene of S. cerevisiae is used as a selection marker in both vectors, and the expression cassette referred to above JJ:lcd Bt is flanked by 5' and 3' sequences of the AOX gene of ~ hia gast-oris for integration.
The present invention also relates to the microorganisms resulting from transformation of ~.. ,~oli strain 4V3110 with vector pEKG3, and to mutant MP-36 of P.ichia ~asroris with expression vectors pPESKC-4 and pPISKC-6, which axe characterised as expressing high levels of streptokinase, and having good viability (the product has no adverse effects on cell growth) and high stability of the strains transformed.
The transformed ~.. coli clone was called HSK-M and exhibits levels of expression of the product of the SKC-2 gene greater than 350 mg/1 culture medium..
The transformed Pichia has oris strains MSK-M9 and MSK-M6 produce streptokinase levels intracellularly and extracellularly respectively which vary between 1.0 and 1.2 g/1 culture medium.
The method described in the present invention, given the levels of expression never before reported for this product, makes it possible to reach optimum purity thereof for its administration to human beings and animals, without the need to develop a complex and costly process for purification.
Examples The following examples are intended to illustrate but not limit the invention. ~. ~oli and ~i~hia ~~storis are used as host systems in these examples; nevertheless, other eukaryot.i.c and prokaryotic cells can be~used for the method described in the present invention.
Fxamg,~e 1 For the isolation of genomic DNA of ~trPp~~occus ~,L~imilis of type C, strain ATCC-9542 was used as a source thereof for cloning the SK gene. The cells of ~~rPnrococcus aauis.;mills were =grown -in Br..aHin Heart .Infusion .Medi..um .(GLBCO) at 240 r.p.m. for 12 hours at 37°C in 5-ml pre-cultures acting as an inoculum which was grown for 12 hours in a 300-ml *Trade-mark ,~ 7 203953 Erlenmeyer flask. The cells were collected by centrifugation at 3,000 r.p.m. and resuspended in 8 ml lyse (4.5 g glucose, 1.$6 g EDTn, 1.51 g tris-HC1, in 500 ml sterile water, pH=8), 80 ~1 of lysozyme were added at a concentration of 10 mg/ml, and the suspension was incubated for 30 minutes at 37°C. Next, to obtain efficient cell rupture, 500 ~1 pronase (Boehringer), 1 ml SDS at 10~ and 200 ail EDTA, 0,5 M, pH=8, were added and the suspension was incubated for 2 hours at 50°C with smooth agitation. Successive treatments with phenol, phenol-chloroform and chloroform were carried out, and the genomic DNA was precipitated with absolute ethanol and NHqAc" 7.5 M.
The yield obtained was 100 Hg per 300-ml Erlenmeyer culture flask. The presence of the gene which codes for streptokinase was verified by the Southern blot technique (Maniatis et al., 1982).
For subcloning in bacteria, 1 ~g genomic DNA of the ~trPntococcus Pau~s~m>>;s type C strain (ATCC-9592) was taken and the gene which codes for SKC-2 was amplified by a PCR
(Randall et al., 1988) using the oligonucleotides SK1 and SK2 for cloning the gene with its signal peptide and SK2-SK3 for cloning without it.
In each reaction 100 pmol of each oligonucleotide, 2 units of Taq polymerase (Perkin Elmer, USA) and 200 )unol of each dNTP were used and the reactions were performed in 10 mM MgCl~, 100 mM dTT, 10 mM NaCl and 100 ~g/ml gelatine.
Thirty amplification cyc~.es were performed, wherein in each one the reaction was incubated at 95°C for 1 minute for denaturisation, at 52°C for 95 seconds for hybridisation of the oligonucleotides and at 70°C for 80 seconds for extension.
An efficiency greater than 5~ of amplification was obtained.
For cloning in bacteria (F. co i), a new genetic construct was used, wherein the tryptophan promoter of E_. coli and the termination signal of bacteriophage T4 are used.
A
The fragments amplified in the PCR were digested with BamHI
and ligated with the vector ptrip-N_coI-S1-CHI
(Estrada et al., 1988). This construct was transformed into a preparation of competent cells prepared according to Dagert et al. (1979) and Hanahan et al. (1983), of ~. coli strain HB101 ((rg-mg-), sunEq4, ara-19, aalK-2, cYl, proA2, rnsL20, (StrR) , ~cyl-5, ~1-5, ~1-1, recAl3 } , which had a frequency greater than 10~ transformants per g DNA.
The colonies obtained were applied to plates of LB medium (10 g/1 trypton, 5 g/1 yeast extract, 10 g/1 sodium chloride) and 50 ~tg/ml ampicillin, and hybridised according to Maniatis et al. (1982), using as a probe the fragment resulting from the amplification in the PCR, which was marked using dATP32 (Amersham, UK) and the Klenow fragment of DNA-polymerase I of ~. ~oli (Maniatis et al., 1982) in whatman 541 filters for 30 minutes at 37°C, the reaction being terminated by EDTA and heat. 4~ of the colonies were positive clones, which were examined by restriction analysis and had the same pattern of digestion with more than 10 restriction enzymes;
moreover the positive clones were checked by double chain DNA
sequencing (Sanger et al., 1977), using therefor an oligo-nucleotide of 17 bases (5'...ATCATCGAACTAGTTAA...3', seq. id.
No. 5) which hybridises at the 3' end of the promoter, with which it was corroborated that joining of the latter to the SKC-2 gene was as desired.
The selected clone was called pEKG-3 (E'ig. 1), which was subjected to fermentation to realise the characterisation of the product.
The plasmid of clone pEKG3 was purified using a CsCl gradient and the sequences of the SKC-2 gene were established, each time using 2 g of plasmid, and using the oligonucleotides which appear below as primers:
SSK-O1 5'...GAATCAAGACATTAGTC...3' seq. id. No. 6 SSK-02 5'...GTGGCGCGATGCCAC..,3' seq. ld. NO. 7 SSK-03 5'...GCAACCATTACTGATCG...3' seq. id. No. 8 SSK-04 5'...CCAGTACAAAATCAAGC...3' seq> id. No. 9 *Trade-mark a ,~~ 204395 SSK-05 5'...CTAGCTATCGGTGACAC...3' seq. id. No. 10 SSK-06 5'...CAGAGATCAGGTCAG...3' seq.~id. No. ll SSK-07 5'...GTTAAGAGCTGCTCGC...3' seq. id. No. 12 SSK-08 5'...CCAGTTAAGGTATAGTC...3' seq. id. No. 13 SSK-09 5'...TCTCGTTCTTCTTCGG...3' seq. id. No. 14 The protocol followed was basically according to Sanger et al. (1977), and dATP32 and S35dATP (Amersham, UK) were used.
Fig. 2 shows a comparison between the amino acid sequence derived from the base sequence of SKC-2, and those of the genes SKC (Malke et al., 1986), SKA and SKG (Water, F. et al., 1989) .
The plasmid pEKG3 was transformed in several E,, coli strains such as W-3110, JM-101, LE392 and MC-1061 and the expression of streptokinase was compared between them. The best results were obtained with strain W3110 (F- supF suoE
~R g~",],K ~gR metB ~Y tonA), owing to which it was selected to be subjected to fermentation, wherein stable expression levels greater than 20~ of the total protein content of the cells were obtained, and 350-400 mg streptokinase per litre of culture medium were obtained.
ExamBle 3 For subcloning of the SKC-2 gene in yeast, strain MP36 of Piehia nastori~ was used as the host, and variants were made for intracellular and extracellular expression from the plasmids pNAO and pPS7 (Fig. 3), using the signal peptide of sucrose invertase for the extY~acellular construct, and in both cases subcloning the gene under the control of the alcohol oxidase (AOX) promoter, using as the terminator at the 3' end the termination signal of the glyceraldehyde-3-phosphate '. dehydrogenase gene of ~. e-erevisiae and a non-coding 3' region of the AOX gene for integration by homology in the genome of the ~'e.ast,.... fuxt,~e~;, ,~e.~'~,n,~ ~ ~ ;k~~", ~e ~nco, i ... 35~....,..which w 1~ ' .."~, ..k~ ,9.~ . .." ,.. ~
,~.9.:~~;~,~~~~~.~?~,"~~... ..,....",... , .
as used for selection in strain MP36 his'.
204395"3 The vector pPESKC4 (plasmid for extracellular expression of the protein) was obtained from the vector construct pPS7-NcoI-S1 nuclease-phosphatase ligated with the SKC-2 gene amplified by PCR from pEKG3 with the 5 oligonucleotide SK2 and a new primer which hybridises with the 5' end of the gene and eliminates the ATG which had been inserted for expression in bacteria. This new primer is a fragment of SK-3 which does not contain the ATG. In the case of intracellular expression, pPISKC6 was obtained (plasmid for 10 intracellular expression of the protein) from the vector pNAO-NcoI-EcoRI-S1 nuclease-phosphatase ligated with the band of SKC-2 amplified by PCR, with the primers SK2 and SK3 with which is obtained the exact gene which codes for streptokinase with an ATG at its 5' end (Fig. 3). Both plasmids were transformed in strain MP36 his-, using the protocol according to Cregg, J. et al. (1985).
The positive clones were studies by the Southern blot method (Maniatis et al., 1982), and out of those which had the correct integration were selected the most productive in each case for characterisation of the product.
The expression of recombinant streptokinase obtained in P_. pastoris extracellularly was 1-1.2 g/1 culture supernatant, and in case of the intracellular construct more than 1.0 g/1 culture.
In the construct for extracellular expression, the glycosylated protein was obtained with a molecular weight greater than 67,000 dalton, it being corroborated by Western blotting that it decreases to the molecular weight of native streptokinase when it is digested with endoglycosidase H.
This was carried out by taking a portion having a concentration equal to 1 mg/ml in a sodium citrate solution, 0.05 molar, pH=5.5, and denaturing it by the addition of SDS
JJ:1 cd 10a (final concentration 0.02%) and heating at 100°C for 10 minutes, then leaving it to cool to ambient temperature and adding 20 milli-units (mU) of endoglycosidase H (endo H) and leaving it for 16 hours at 37°C, at the end of which it is subjected to subsequent heating at 100°C for 5 minutes and 10 mU of endo H are added, followed by 12 hours incubation at 37°C, and it is JJ:Icd °', ~,'"'""""
. . . . . . . _ . . . 204~:9..~~ ...
11 ' applied to a 12.5% polyacrylamide ge,l and compared with an undeglycosylated sample.
The streptokinase produced in .fichia gastoris in both constructs maintains biological activity, not varying its affinity for plasminogen and being in fact another variant for the use of this protein in clinical medicine.
Example 4 To verify the biological activity of the product of the SKC-2 gene, the pure recombinant streptokinase obtained from both bacteria and yeast was used for acute and subacute toxicology tests on rats, wherein satisfactory and acceptable results were obtained to allow its use in human and animal therapeutics. Its in vivo fibrinolytic activity was verified in clinical tests on animals, wherein there was success in dissolving clots in the coronary and femoral arteries of dogs, blood parameters being maintained similar to those reported in the literature with this type of product.
The product of the SKC-2 gene showed a specific activity of 50,000-100,000 IU/mg, which was measured on plates of agarose-fibrin (Astrup et al.., 1952), chromogenic substrate (Friberger et al., 1982) and in vitro clot lysis according to Westlund et al. (1985).
Exam 1~ a 5 To verify the amino acid sequence derived from the base sequence of the SKC-2 gene, an analysis was made of the pure product by high-performance Liquid chromatography in reverse phase (HPLC-RP), using therefor a C8 4.6 x 250 mm column (Baker, USA), wherein there was used the gradient 5 minutes at 0% buffer B and up to 90% B in 55 minutes, with buffer A
(trifluoroacetic acid (TFA, Pierce, USA) at 0.1% in distilled water) and buffer 8 (TFA at 0,.,5% ~.n acetonitrile .
,, , ~ ,(~L.ichrosolv, ,Merck,T, FRG) ),,..,. ~a~"~,~air)in~ :a, flow , r,ate",off ,, M , "" , ,. , ""
0.8 ml/min.
."
. _ . _ _ .. 12 ~0 ~3~ j~ .
With the protein with a high degree of purity, the amino acid sequence derived from the base sequence obtained from the SKC-2 gene was verified by sequencing it by mass spectrometry.
For this the protein was digested with different enzymes and with combinations of them. The enzymes used were chymotrypsin, endoproteinase Glu-C, endoproteinase Lys-C and trypsin .
From the analysis of the mass spectra of the°peptides obtained in each of the digestions with the different enzymes, the map of the amino acid sequence of the protein was constructed by superposition, which made it possible to verify that there is in this case 100 correspondence between the sequence of the SKC-2 gene and the amino acid sequence of the protein obtained.
Strain deposits The ~. coli HSK-M [pEKG3] strain, based on the ~. coli strain W3110 and containing the plasmid pEKG3, was deposited on June 11, 1990, with the Centraalbureau voor Schimmelcultures (CBS), Baarn, The Netherlands, and obtained deposit number CBS 243.90.
Likewise, the Pichia pas oris MSK-M4 [pPESKC-4] strain, based on the Pir,~.zia gastoris strain MP-36 and containing the plasmid pPESKC-4, was deposited on June 11, 1990, with the Centraalbureau voor Schimmelcultures (CBS), Baarn, The , Netherlands, and obtained deposit number CBS 244.90.
_ .:. W . . ...~ . , . , ,. ,. . , . . . . . . t ~E~~ ~ : . . .. .., . .. . .
. . . . .. . . .. . .
' . ,.
',: ' R1~ FERENCES
Astrup, 'T., and Mullertz, S., 1952, Arch. Biochem. Biophys.
40: 346-351 Burnett, N.N., 1981, nnal. Biochemistry 112: 195-203 Cregg, J.M., Barringer, K.J., Iiessler, A.Y., and Madden, K.R., 1985, Mol. Cell. Biol. 5: 3376-3385 Dagert, M., and Ehrlich, S.D., 1974, Gene 6: 23-28 Dykewicz, M.S., McGrath, K.G., Harris, K.E., and Patterson, R., 1985, Int. Arch. Allergy Appl. Immun. 78: 386-390 Estrada, M.P., Hernandez, O., and de la Fuente, J., Interferon y Biotecnolog~.a 5: 152-156 Fiberger, P., 1982, J. Clin. Lab. Invest. 92, Suppl. 162:
Hanaha~~, D., 1983, J. Mol. Biol. 166: 557-580 Huang, T.T., Malke, Ii., and Ferrett_i, J.J., 1989, Molec.
Microb . 3 (2 ) : 197-205 Malke, ti., and Ferretti, J.J., 1984, Proc. Natl. Acad. Sci.
USA, 81: 3557-3561 Malke, H., Roe, B., and Ferretti, J.J., 1985, Gene 34: 357-362 Maniatis, T., Frisch, E.F., and Sambrook, J., 1982, Cold Spring Harbor Laboratory, USA
McGrath, K.G., Zeffren, B., Alexander, J., Kaplan, K., and Patterson, R.J., 1985, Allergy Clin. Immunol. 76: 453-457 Muller, J . , Reinert, fi . , and Malke, H . , 1989, Journal of Bacteriology, Apr.: 2202-2208 Randall, K., Gelfond, D.H., Stoffel, S., Scharf, S., t~iguchi, R., Horn, G.T., Mullis, K.B., and Erlich, H.A., 1988, Science 239: 987-491 Sanger, F., Nickler, S., and Coulson, A.K., 1977, Proc. Natl.
Acad. Sci. USA 74: 5463-5467 Tombin, H., Stahelin, T., and Gordon, J., 1979, Proc. Natl.
Acad. Sci. USA 76: 4350-4354 Walter, F., Siegel, M., and Malke, H., 1989, Nucl. Acids Res.
17 (3) : 1262 Westtund, L.E., and Anderson, L.O., 1985, Thrombosis Research 37: 213-223 Haggenson M.J. et al., Enzyme Microb. and Technol. 11, 650-656 (1989) ~4 20,4,3953 SEQUENCE LISTING
SEQ ID NO:1 SEQUENCE TYPE: Nucleotide with corresponding protein SEQUENCE LENGTH: 1245 base pairs STRANDEDNESS: single TOPOLOGY: linear MOLECULE TYPE: genomic DNA
ORIGINAL SOURCE ORGANISM: Streptococcus equisimilis from group C
of Lanfield definition IMMEDIATE EXPERIMENTAL SOURCE: ATCC-9542 strain FEATURES: from 1 to 1245 by mature peptide PROPERTIES: Streptokinase gene The gene product binds to human plasminogen The gene product is an activator of human plasminogen GCT CTA
CCT CGT
GAG CCA
TGG TCT
Ile GTC
Ala AAC
Gly AAC
Pro AGC
Glu Leu Trp Leu 1 Asp Arg CAA Pro TTA Ser GTT Val GTT Asn AGC Asn GTT Ser Gln 10 Leu 15 Val GCT
Val GGT
Ser ACT
Val GTT
GGG
ACG
AAT
CAA
GAC
Ala Gly Thr Val Glu Gly Thr Asn Gln Asp IleSer Leu Lys PhePhe GluIle Asp LeuThr Ser Arg ProAla His GlyGly Lys Thr GluGln GlyLeu Ser ProLys Ser Lys ProPhe Ala ThrAsp,Ser Gly AlaMet ProHis Lys LeuGlu Lys Ala AspLeu Leu LysAla Ile Gln GluGln LeuIle Ala AsnVal His Ser AsnAsp Asp TyrPhe Glu Val IleAsp PheAla Ser AspAla Thr Ile ThrAsp Arg GGC Lys Val TTT Asp Lys AspGly Ser Val ThrLeu Pro Asn 115 GCT
Gly Tyr 120 125 Phe Ala t~.
~5 2043953 ThrGlnPro Val Gln GluPhe LeuLeu Ser Gly HisVal ArgVal Arg 130 135 ~ 140 ProTyrLys Glu Lys ProIle GlnAsn Gln Ala LysSer ValAsp Val GluTyrThr Val Gln PheThr ProLeu Asn Pro AspAsp AspPhe Arg ProGlyLeu Lys Asp ThrLys LeuLeu Lys Thr LeuAla IleGly Asp ThrIleThr Ser Gln GluLeu LeuAla Gln Ala GlnSer IleLeu Asn AAAACC,CAC CCA GGC TATACG ATTTAT GAA CGT GACTCC TCAATC GTC 672 LysThrIsisPro Gly TyrThr IleTyr Glu Arg AspSer SerIle Val 210-~ 215 220 ThrHisAsp Asn Asp IlePhe ArgThr Ile Leu ProMet AspGln Glu PheThrTyr His Val LysAsn ArgGlu Gln Ala TyrGlu IleAsn Lys LysSerGly Leu Asn GluGlu IleAsn Asn Thr AspLeu IleSer Glu AAATAT,TAC GTC CTT AAAAAA GGGGAA AAG CCG TATGAT CCCTTT GAT 864 LysTyrTyr Val Leu LysLys'GlyGlu Lys Pro TyrAsp ProPhe Asp ArgSerHis Leu Lys LeuPhe ThrIle Lys Tyr ValAsp ValAsn Thr AsnGluLeu Leu Lys SerGlu GlnLeu Leu Thr AlaSer GluArg Asn TTAGACTTC AGA GAT TTATAC GAT,~ CGT GAT AAGGCT AAACTA CTC 1008 CT
LeuAspPhe Arg Asp "LeuTyr Asp'1 Arg Asp LysAla LysLeu Leu ~~ro TyrAsnAsn Leu Asp AlaPhe GlyIle Met Asp TyrThr LeuThr Gly a 2p~3953 LysVal GluAsp Asn HisAsp AspThr Asn ArgIle Ile Thr ValTyr MetGly LysArg Pro GluGly GluAsn Ala SerTyr.His Leu AlaTyr AspLys AspArg Tyr ThrGlu GluGlu Arg GluVal Tyr Ser TyrLeu ArgTyr ThrGly Thr ProIle ProAsp Asn ProAsn Asp .Lys A
2043953 _.
SEQ ID N0:2 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 26 bases TGGAATTCAT GAAAAATTAC TTATCT
SEQ ID N0:3 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 28 bases TGGATCCTTA TTTGTCGTTA GGGTTATC
SEQ ID N0:9 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 31 bases GGAATTCATG ATTGCTGGAC CTGAGTGGCT G
SEQ ID N0:5 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases ATCATCGAAC TAGTTAA
SEQ ID N0:6 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases GAATCAAGAC ATTAGTC
SEQ ID N0:7 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 15 bases GTGGCGCGAT GCCAC
SEQ ID N0:8 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases GCAACCATTA CTGATCG
.,, 18 204r395~3 SEQ ID N0:9 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases SEQ ID NO:10 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases CTAGCTATCG GTGACAC
SEQ ID NO:11 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 15 bases CAGAGATCAG GTCAG
SEQ ID N0:12 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 16 bases GTTAAGAGCT GCTCGC
SEQ ID N0:13 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 17 bases CCAGTTAAGG TATAGTC
SEQ ID N0:14 SEQUENCE TYPE: nucleotide SEQUENCE LENGTH: 16 bases TCTCGTTCTT CTTCGG
A
Claims (30)
1. A method for producing streptokinase by expression of a gene encoding said streptokinase which comprises the steps of transforming a host cell with an expression vector containing a streptokinase gene (SKC-2) comprising nucleotide sequence:
operably linked to an effective promoter and transcription terminator, growing said transformed host cell and isolating the streptokinase so produced.
operably linked to an effective promoter and transcription terminator, growing said transformed host cell and isolating the streptokinase so produced.
2. The method of Claim 1 wherein said SKC-2 gene is isolated from Streptococcus equisimilis type C strain ATCC-9542.
3. The method of Claim 1 wherein said SKC-2 gene is used without its signal peptide-encoding region.
4. The method of Claim 1 wherein said SKC-2 gene is used with its signal peptide-encoding region.
5. The method of Claim 1 wherein said host cell is a bacterium.
6. The method of Claim 5 wherein said bacterium is Escherichia coli.
7. The method of Claim 5 wherein said bacterium is Escherichia cold strain W3110.
8. The method of Claim 5 wherein said expression vector is a plasmid containing the SKC-2 gene operably linked to the Escherichia coli tryptophan promoter and the T4 transcription terminator.
9. The method of Claim 8 wherein said expression vector is the plasmid pEKG3 contained in Escherichia coli strain CBS 243.90.
10. The method of Claim 1 wherein said host cell is a yeast.
11. The method of Claim 10 wherein said yeast is Pichia pastoris.
12. The method of Claim 10 wherein said yeast is Pichia pastors HIS- strain MP-36 (CBS 244.90).
13. The method of Claim 10 wherein said expression vector is a plasmid containing the SKC-2 gene operably linked to the Pichia pastoris alcohol oxidase promoter (AOX1) and the Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase termination sequence (GAPt).
14. The method of Claim 13 wherein said expression vector contains the Saccharamyces cerevisiae HIS3 gene as a selection marker.
15. The method of Claim 14 wherein said expression vector is the plasmid pPESKC-4 contained in Pichia pastoris strain CBS
244.90.
244.90.
16. The method of Claim 14 wherein said expression vector contains the SKC-2 gene, without a signal peptide-encoding region.
17. An isolated and purified nucleic acid consisting essentially of the nucleotide sequence shown in claim 1.
18. An expression vector for the expression of streptokinase in a host cell, said vector containing a streptokinase gene (SKC-2) consisting essentially of the nucleotide sequence shown in Claim 1 operably linked to an effective promoter and transcription terminator.
19. The expression vector of Claim 18 containing the SKC-2 gene operably linked to the Escherichia coli tryptophan promoter and the T4 transcription terminator for expression of said SKC-2 gene in bacteria.
20. The expression vector of Claim 19 which is the plasmid pEKG3 contained in Escherichia coli strain CBS 243.90.
21. The expression vector of Claim 18 containing the SKC-2 gene operably linked to the Pichia pastoris alcohol oxidase promoter (AOX1) and the Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase termination sequence (GAPt) for expression of said SKC-2 gene in yeast.
22. The expression vector of Claim 21 containing the Saccharomyces cerevisiae HIS3 gene as a selection marker.
23. The expression vector of Claim 22 which is the plasmid pPESKC-4.
24. A host cell for producing streptokinase by expression of a gene encoding said streptokinase, said host cell being transformed with an expression vector containing a streptokinase gene (SKC-2)comprising the nucleotide sequence shown in claim 1 operably linked to an effective promoter and transcription terminator.
25. The host cell of Claim 24 which is a bacterium.
26. The host cell of Claim 24 which is Escherichia coli.
27. The host cell of Claim 24 which is Escherichia coli strain W3110.
28. The host cell of Claim 24 which is a yeast.
29. The host cell of Claim 24 which is Pichia pastoris.
30. The host cell of Claim 24 which is Pichia Qastoris HIS-strain MP-36 (CBS 244.90).
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| HU1770/91A HU216073B (en) | 1991-05-27 | 1991-05-27 | Process for producing streptokinase |
| CA 2043953 CA2043953C (en) | 1991-06-05 | 1991-06-05 | Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganisms |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA 2043953 CA2043953C (en) | 1991-06-05 | 1991-06-05 | Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganisms |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2043953A1 CA2043953A1 (en) | 1992-12-06 |
| CA2043953C true CA2043953C (en) | 2001-06-12 |
Family
ID=4147743
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA 2043953 Expired - Fee Related CA2043953C (en) | 1991-05-27 | 1991-06-05 | Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganisms |
Country Status (1)
| Country | Link |
|---|---|
| CA (1) | CA2043953C (en) |
-
1991
- 1991-06-05 CA CA 2043953 patent/CA2043953C/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| CA2043953A1 (en) | 1992-12-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5612196A (en) | Human serun albumin, preparation and use | |
| Engel et al. | Murein-metabolizing enzymes from Escherichia coli: sequence analysis and controlled overexpression of the slt gene, which encodes the soluble lytic transglycosylase | |
| JP2733602B2 (en) | Expression cassette, plasmid and transformed yeast encoding hirudin | |
| CA2185343A1 (en) | Enhanced secretion of polypeptides | |
| JPS62272976A (en) | Mutant tissue plasminogen activator and its production | |
| EP0802983A2 (en) | Expression of urokinase plasminogen activator inhibitors | |
| US6248715B1 (en) | Method of treating a urokinase-type plasminogen activator-mediated disorder | |
| Van Poelje et al. | Cloning, sequencing, expression, and site-directed mutagenesis of the gene from Clostridium perfringens encoding pyruvoyl-dependent histidine decarboxylase | |
| JP3550409B2 (en) | Dipeptidyl aminopeptidase of dictyosterium | |
| PL166199B1 (en) | Method for manufacturing new plasmides and for manufacturing a plasminogen activator | |
| US5541288A (en) | Peptide having elastase inhibitory activity and producing method thereof | |
| EP0489201B1 (en) | Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant DNA and transformed microorganisms | |
| CA2043953C (en) | Method for the isolation and expression of a gene which codes for streptokinase, nucleotide sequence obtained, recombinant dna and transformed microorganisms | |
| JPH01501996A (en) | De-epidermal cell growth factor plasminogen activator | |
| JPH06169770A (en) | New polypeptide, plasmid with code relating thereto and method of preparing and using the same | |
| WO1989007146A1 (en) | Rearranged tissue plasminogen activators and method for producing same | |
| RU2107726C1 (en) | Dna fragment corresponding to gene skc-2, method for expressing dna fragment corresponding to gene skc-2, plasmid vector pekg3 providing for expressing dna fragment corresponding to gene skc-2, plasmid vector pes kc-4 providing for expressing dna fragment corresponding to gene skc-2, escherichia coli bacteria strain having plasmid vector pekg3 usable for expressing dna fragment corresponding to gene skc-2, and streptokinase produced in utilizing escherichia coli bacteria strain hsk-m | |
| JPH07507439A (en) | Artificial promoter for protein expression in yeast | |
| SK279873B6 (en) | Method for the preparation of a streptokinase, streptokinase, pharmaceutical composition, nucleotide acid, expression vector, and host cell | |
| JP2001321179A (en) | New method for producing raffinose | |
| AU619803B2 (en) | Rearranged tissue plasminogen activators and method for producing same | |
| JPH04258294A (en) | Secretory vector, bacterium transformed with the same vector and preparation of product from the same bacterium | |
| HU216073B (en) | Process for producing streptokinase | |
| JPH09187280A (en) | Use of dna having gene information on pyruvate oxidase | |
| JPH09168389A (en) | Dna coding for ascorbate oxidase |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| MKLA | Lapsed |