OA20681A - N-terminal extension sequence for expression of recombinant therapeutic peptides. - Google Patents
N-terminal extension sequence for expression of recombinant therapeutic peptides. Download PDFInfo
- Publication number
- OA20681A OA20681A OA1202200086 OA20681A OA 20681 A OA20681 A OA 20681A OA 1202200086 OA1202200086 OA 1202200086 OA 20681 A OA20681 A OA 20681A
- Authority
- OA
- OAPI
- Prior art keywords
- seq
- lirapeptide
- sequence
- vector
- sequence encoding
- Prior art date
Links
- 230000014509 gene expression Effects 0.000 title claims abstract description 106
- 102000004196 processed proteins & peptides Human genes 0.000 title abstract description 17
- 108090000765 processed proteins & peptides Proteins 0.000 title abstract description 17
- 230000001225 therapeutic Effects 0.000 title abstract description 14
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 28
- 238000000034 method Methods 0.000 claims abstract description 18
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 16
- 241000723792 Tobacco etch virus Species 0.000 claims description 53
- 238000003776 cleavage reaction Methods 0.000 claims description 41
- 241000235058 Komagataella pastoris Species 0.000 claims description 18
- 241000186226 Corynebacterium glutamicum Species 0.000 claims description 17
- 241000588724 Escherichia coli Species 0.000 claims description 15
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 14
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 14
- 108010019598 Liraglutide Proteins 0.000 claims description 13
- 229960002701 liraglutide Drugs 0.000 claims description 13
- YSDQQAXHVYUZIW-QCIJIYAXSA-N liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 claims description 13
- 238000000855 fermentation Methods 0.000 claims description 8
- 230000004151 fermentation Effects 0.000 claims description 8
- 240000008371 Bacillus subtilis Species 0.000 claims description 7
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 7
- 229940075615 Bacillus subtilis Drugs 0.000 claims description 7
- 239000002609 media Substances 0.000 claims description 6
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 claims description 5
- 229940081969 Saccharomyces cerevisiae Drugs 0.000 claims description 5
- 230000021615 conjugation Effects 0.000 claims description 5
- GRSIQCSSMDOXBZ-IBGZPJMESA-M N[C@@H](CCC(=O)[O-])C(=O)OCCCCCCCCCCCCCCCC Chemical compound N[C@@H](CCC(=O)[O-])C(=O)OCCCCCCCCCCCCCCCC GRSIQCSSMDOXBZ-IBGZPJMESA-M 0.000 claims description 4
- 238000003306 harvesting Methods 0.000 claims description 4
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 3
- 238000005119 centrifugation Methods 0.000 claims description 3
- 239000012228 culture supernatant Substances 0.000 claims description 3
- 239000001963 growth media Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 238000003259 recombinant expression Methods 0.000 claims 1
- 102000004169 proteins and genes Human genes 0.000 abstract description 13
- 108090000623 proteins and genes Proteins 0.000 abstract description 13
- 230000002708 enhancing Effects 0.000 abstract description 6
- 238000004519 manufacturing process Methods 0.000 abstract description 6
- 210000004027 cells Anatomy 0.000 description 31
- 150000001413 amino acids Chemical class 0.000 description 20
- OGBMKVWORPGQRR-UMXFMPSGSA-N 52232-67-4 Chemical compound C([C@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)[C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 OGBMKVWORPGQRR-UMXFMPSGSA-N 0.000 description 15
- 229920001850 Nucleic acid sequence Polymers 0.000 description 15
- 108010049264 Teriparatide Proteins 0.000 description 14
- 229960005460 teriparatide Drugs 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 13
- 235000018102 proteins Nutrition 0.000 description 12
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 10
- DTHNMHAUYICORS-KTKZVXAJSA-N 107444-51-9 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 8
- 239000000203 mixture Substances 0.000 description 8
- 229920001184 polypeptide Polymers 0.000 description 8
- 230000001131 transforming Effects 0.000 description 8
- 101710042131 GCG Proteins 0.000 description 7
- 102100003818 GCG Human genes 0.000 description 7
- 238000010276 construction Methods 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 102000004877 Insulin Human genes 0.000 description 5
- 108090001061 Insulin Proteins 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- OZAIFHULBGXAKX-UHFFFAOYSA-N precursor Substances N#CC(C)(C)N=NC(C)(C)C#N OZAIFHULBGXAKX-UHFFFAOYSA-N 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 108020004999 Messenger RNA Proteins 0.000 description 4
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 4
- 229920002106 messenger RNA Polymers 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 229940088597 Hormone Drugs 0.000 description 3
- 229920000272 Oligonucleotide Polymers 0.000 description 3
- IPCSVZSSVZVIGE-UHFFFAOYSA-N Palmitic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 229920001405 Coding region Polymers 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 108090000445 Parathyroid hormone Proteins 0.000 description 2
- 102000003982 Parathyroid hormone Human genes 0.000 description 2
- 102000014961 Protein Precursors Human genes 0.000 description 2
- 108010078762 Protein Precursors Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 230000000975 bioactive Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037240 fusion proteins Human genes 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000003877 glucagon like peptide 1 receptor agonist Substances 0.000 description 2
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 2
- 230000002503 metabolic Effects 0.000 description 2
- 239000000199 parathyroid hormone Substances 0.000 description 2
- 229960001319 parathyroid hormone Drugs 0.000 description 2
- 230000001323 posttranslational Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 210000001519 tissues Anatomy 0.000 description 2
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 1
- -1 2-methyl-prop-2-yl (tert-butyl) Chemical group 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 210000000133 Brain Stem Anatomy 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108010011459 Exenatide Proteins 0.000 description 1
- HTQBXNHDCUEHJF-XWLPCZSASA-N Exendin-4 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)NCC(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 HTQBXNHDCUEHJF-XWLPCZSASA-N 0.000 description 1
- 102400000326 Glucagon-like peptide 2 Human genes 0.000 description 1
- 229960002989 Glutamic Acid Drugs 0.000 description 1
- 229960002591 Hydroxyproline Drugs 0.000 description 1
- 229920002459 Intron Polymers 0.000 description 1
- 108020004391 Introns Proteins 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- 101710013993 Mcpt10 Proteins 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 208000001132 Osteoporosis Diseases 0.000 description 1
- 101000032483 PTH Proteins 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 101800001442 Peptide pr Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 102000035853 Proglucagon Human genes 0.000 description 1
- 108010058003 Proglucagon Proteins 0.000 description 1
- 108010033725 Recombinant Proteins Proteins 0.000 description 1
- 102000007312 Recombinant Proteins Human genes 0.000 description 1
- 101710012186 SLC7A1 Proteins 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102400000757 Ubiquitin Human genes 0.000 description 1
- VBJHPXDIVMXHJU-UHFFFAOYSA-N Zeocin Chemical compound N=1C(C=2SC=C(N=2)C(=O)NCCCCN=C(N)N)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1[N]C=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C VBJHPXDIVMXHJU-UHFFFAOYSA-N 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- UQYZFNUUOSSNKT-UHFFFAOYSA-N [benzotriazol-1-yloxy(dimethylamino)methylidene]-dimethylazanium;hexafluorophosphate Chemical compound F[P-](F)(F)(F)(F)F.C1=CC=C2N(OC(N(C)C)=[N+](C)C)N=NC2=C1 UQYZFNUUOSSNKT-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000001195 anabolic Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000001580 bacterial Effects 0.000 description 1
- 230000001851 biosynthetic Effects 0.000 description 1
- 125000000484 butyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000001143 conditioned Effects 0.000 description 1
- 230000000875 corresponding Effects 0.000 description 1
- 230000001808 coupling Effects 0.000 description 1
- 239000007822 coupling agent Substances 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 229940079593 drugs Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 229960001519 exenatide Drugs 0.000 description 1
- 235000012631 food intake Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000004051 hexyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 101500013969 human Glucagon-like peptide 1 Proteins 0.000 description 1
- 102000036364 human PTH protein Human genes 0.000 description 1
- 125000001841 imino group Chemical group [H]N=* 0.000 description 1
- 239000000859 incretin Substances 0.000 description 1
- 230000000968 intestinal Effects 0.000 description 1
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 102000005614 monoclonal antibodies Human genes 0.000 description 1
- 108010045030 monoclonal antibodies Proteins 0.000 description 1
- 210000002569 neurons Anatomy 0.000 description 1
- 230000011164 ossification Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 239000000813 peptide hormone Substances 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000001737 promoting Effects 0.000 description 1
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000001105 regulatory Effects 0.000 description 1
- 108091007521 restriction endonucleases Proteins 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 230000002194 synthesizing Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N trans-L-hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 102000003995 transcription factors Human genes 0.000 description 1
- 108090000464 transcription factors Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
Abstract
The invention relates to an N-terminal extension sequences which are employed to enhance the expression of recombinant therapeutic peptides. The invention also relates to a process for the high-level expression of recombinant therapeutic peptides using the said N-terminal extension sequence. The invention also provides nucleic acids, vectors and recombinant host cells for efficient production of biologically active proteins such as lirapeptide.
Description
N-TERMINAL EXTENSION SEQUENCE FOR EXPRESSION OF RECOMBINANT THERAPEUTIC PEPTIDES
FIELD OF THE INVENTION
The invention relates to an N-terminal extension sequence for high-level expression of recombinant therapeutic peptides. The invention also relates to a process for high-level expression of recombinant therapeutic peptides using the said N-terminal extension sequence.
BACKGROUND OF THE INVENTION
Peptide therapeutics hâve played a notable rôle in medical practice since the advent of insulin therapy in the 1920s. Currently, there are more than 60 approved peptide drugs in the market, and the numbers are expected to grow significantly.
Glucagon-like peptide-1 (GLP-1) is a 31 amino acid long peptide hormone deriving from the tissue-specific post-translational processing of the proglucagon peptide. It is produced and secreted by intestinal enteroendocrine L-cells and certain neurons within the nucléus of the solitary tract in the brainstem upon food consumption. Liraglutide is a dérivative of a human incretin (metabolic hormone), glucagon-like peptide-1 (GLP-1) that is used as a long-acting glucagon-like peptide-1 receptor agonist, binding to the same receptors as the endogenous metabolic hormone GLP-1 that stimulâtes insulin sécrétion.
Teriparatide is a recombinant protein form of parathyroid hormone consisting of the first (N-terminus) 34 amino acids, which is the bioactive portion of the hormone. It is an effective anabolic (promoting bone formation) agent used in the treatment of some forms of osteoporosis.
An expression plasmid is engineered to contain regulatory sequences that act as enhancer and promoter régions and lead to efficient transcription of the gene carried on the expression vector. The goal of a well-designed expression vector is the efficient production of protein, and this may be achieved by synthesizing a significant amount of stable messenger RNA.
It is possible to design expression vectors that exert a tight control ofthe expression, and the protein is only produced in high quantity when necessary through the use of suitable expression condition. In absence of the tight control of the gene expression, the protein may also be expressed constitutively.
U.S. Patent No. 4,916,212 discloses DNA-sequence encoding biosynthetic insulin precursors and the process for preparing the insulin precursors and human insulin in a yeast cell.
U.S. Patent No. 7,572,884 discloses a method for preparing recombinant Lirapeptide, a precursor of Liraglutide in Saccharomyces cerevisiae.
IN 201741024763 A discloses a process for the préparation of Liraglutide by expression of synthetic oligonucleotide encoding Lirapeptide which is operably connected to an oligonucleotide sequence of a signal peptide in a yeast cell.
WO 1998/008871 A1 discloses dérivatives of GLP-1 and analogues thereof prepared using recombinant DNA technique.
WO 1998/008872 A1 discloses dérivatives of GLP-2 prepared using recombinant DNA technique.
WO 1999/043708 A1 discloses dérivatives exendin and of GLP-1 (7-C), prepared using recombinant DNA technique.
WO 2017/021819 A1 discloses a process for the préparation of peptides or proteins or dérivatives thereof by expression of synthetic oligonucleotide encoding desired protein or peptide in a prokaryotic cell as ubiquitin fusion construct.
Avicenna J Med Biotech 2017; 9(1): 19-22 discloses overexpression of teriparatide (1-34), a recombinant bioactive part of human parathyroid hormone (PTH) in Eschenchia coli.
The inventors of the present invention, in their endeavour to enhance the expression of the recombinant therapeutic peptides by several folds, hâve corne up with the use of a short N-terminal extension sequence which is not disclosed in the above mentioned prior art.
OBJECTIVE OF THE INVENTION
It is an objective of the present invention to provide high-level expression of the therapeutic peptides by several folds.
SUMMARY OF THE INVENTION
The present invention provides N-terminal extensions, nucleic acids, vectors and recombinant host cells for efficient production of biologically active peptides such as lirapeptide.
The invention contemplâtes a multidimensional approach for achieving a high yield of peptides such as lirapeptide in a host cell by providing an expression construct in which the nucleic acid encoding lirapeptide is operably fused to a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site and an N-terminal extension (NE-3).
The present invention provides an N-terminal extension sequence as set forth in SEQ ID NO: 1 (NE-3) to enhance the expression of a therapeutic peptide in bacteria or yeast.
The present invention also provides expression vectors and recombinant host cells for high-level expression of Lirapeptide, wherein the expression vector comprises a modified gene sequence encoding the N-terminat extension sequence NE-3, a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site, and a modified gene sequence encoding Lirapeptide.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 A: Schematic diagram of Expression cassettes without N-terminal extension.
Figure 1 B: Schematic diagram of Expression cassettes with N-terminal extension (NE1).
Figure 1 C: Schematic diagram of Expression cassettes with N-terminal extension (NE3).
Figure 2A : Expression plasmid without N-terminal extension.
Figure 2B: Expression plasmid with N-terminal extension-1 (NE1).
Figure 2C: Expression plasmid with N-terminal extension-3 (NE3).
Figure 3: Lirapeptide expression by ELISA.
Figure 4: Dry cell weight of Lirapeptide.
DESCRIPTION OF SEQUENCE LISTING
SEQ ID NO: 1 (amino acid sequence of N-terminal extension sequence NE-3)
EEQAE
SEQ ID NO: 2 (amino acid sequence of the modified TEV cleavage site)
ENLYFQ
SEQ ID NO: 3 (amino acid sequence of Lirapeptide)
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG
SEQ ID NO: 4 (nucleic acid sequence encoding the N-terminal extension NE-3 sequence for Pichia pastoris) gaagaacaagccgaa
SEQ ID NO: 5 (nucleic acid sequence encoding the modified TEV cleavage site for Pichia pastoris) gagaacttgtacttccaa
SEQ ID NO: 6 (nucleic acid seguence encoding the lirapeptide for Pichia pastoris) cacgctgagggtacttttacctctgacgtgtcctcttacttggagggtcaagctgccaaagagttcattgcctggttggttagagg tagaggttag
SEQ ID NO: 7 (nucleic acid seguence encoding the N-terminal extension NE-3 sequence for Corynebacterium glutamicum) gaagaacaggcagaa
SEQ ID NO: 8 (nucleic acid sequence encoding the modified TEV cleavage site for Corynebacterium glutamicum) g aaaacctgtacttccag
SEQ ID NO: 9 (nucleic acid sequence encodinq the lira peptide for Corynebacterium glutamicum) cacgcagaaggcacctttacctccgatgtgtcctcctacctggaaggccaggcagcaaaagaattcattgcatggctggt tcgcggtcgcggttag
SEQ ID NO: 10 (nucleic acid sequence encodinq the N-terminal extension NE-3 sequence for Escherichia coli) gaagaacaggcagaa
SEQ ID NO: 11 (nucleic acid sequence encodinq the modified TEV cleavaqe site for
Escherichia coli) gaaaacctgtacttccag
SEQ ID NO: 12 (nucleic acid sequence encodinq the lirapeptide for Escherichia coli) catgcggaaggcaccttcaccagcgatgttagcagctacctggagggtcaggcggcgaaggaatttatcgcgtggctggtt cgtggccgtggttaa
SEQ ID NO: 13 (nucleic acid sequence encodinq the N-terminal extension NE-3 sequence for Baciiius subtilis) gaagaacaagccgaa
SEQ ID NO: 14 (nucleic acid sequence encodinq the modified TEV cleavaqe site for
Baciiius subtilis) gagaacttgtacttccaa
SEQ ID NO: 15 (nucleic acid sequence encodinq the lirapeptide for Baciiius subtilis) cacgctgagggtacttttacctctgacgtgtcctcttacttggagggtcaagctgccaaagagttcattgcctggttggttagagg tagaggttag
SEQ ID NO: 16 (amino acid sequence of teriparatide)
SVSEIQLMHNLGKHLNSMERVEWLRKKLQDVHNF
SEQ ID NO: 17 (amino acid sequence of N-terminal extension sequence NE-1)
EEA
SEQ ID NO: 18 (nucleic acid sequence encodinq the N-terminal extension NE-1 sequence!
gaggaagcg
SEQ ID NO: 19 (fusion protein comprising lirapeptide operably fused to N-terminal extension sequence NE-3 and TEV cleavaqe site)
EEQAEENLYFQHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG
SEQ ID NO: 20 (fusion protein comprising lirapeptide operably fused to N-terminal extension sequence NE-1 and TEV cleavaqe site)
EEAENLYFQHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG
DEFINITIONS
Unless defined otherwise, ail technical and scientific terms used herein hâve the same meaning as commonly understood by one of ordinary skill in the art to which the methods belong. Although any vectors, host cells, methods, and compositions similar or équivalent to those described herein can also be used in the practice or testing of the vectors, host cells, methods, and compositions, représentative illustrations are now described.
Where a range of values is provided, it is understood that each intervening value between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within by the methods and compositions. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges and are also encompassed within by the methods and compositions, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the methods and compositions.
It is appreciated that certain features of the methods, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the methods and compositions, which are, for brevîty, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. It is noted that, as used herein and in the appended daims, the singular forms a, an, and the include plural referents unless the context clearly dictâtes otherwise. It is further noted that the daims may be drafted to exdude any optional element. As such, this statement is intended to serve as antécédent basis for use of such exclusive terminology as solely, only and the like in connection with the recitation of claim éléments or use of a négative limitation.
As will be apparent to those of skill in the art upon reading this disclosure, each of the individual embodiments described and itlustrated herein has discrète components and features which may be readily separaied from or combined with the features of any of the other embodiments without departing from the scope or spirit of the présent methods. Any recited method can be carried ouf in the order of events recited or in any other order that is logically possible.
The term “host cell” includes an individual cell or cell culture which can be, or has been, a récipient for the subject of expression constructs. Host cells include progeny of a single host cell. The host cell for the purposes of this invention refers to any strain of Pichia pastoris, Saccharomyces cerevisiae, Corynebacterium glutamicum, Escherichia coli, and Bacillus subtilis which can be suitably used for the purposes of the invention.
The term “recombinant strain” or “recombinant hast cell” refers to a host cell which has been transfected or transformed with the expression constructs or vectors of this invention.
The term “expression vector” or “expression construct” refers to any vector, plasmid or vehicle designed to enable the expression of an inserted nucleic acid sequence following transformation into the host.
The term “promoter” refers a DNA sequences that define where transcription of a gene begins. Promoter sequences are typically located directly upstream or at the 5' end of the transcription initiation site. RNA polymerase and the necessary transcription factors bind to the promoter sequence and initiate transcription. Promoters can either be constitutive or inducible promoters. Constitutive promoters are the promoter which allows continuai transcription of its associated genes as their expression is normally not conditioned by environmental and developmental factors. Constitutive promoters are very useful tools in genetic engineering because constitutive promoters drive gene expression under inducer-free conditions and often show better characteristics than commonly used inducible promoters. Inducible promoters are the promoters that are induced by the presence or absence of biotic or abiotic and Chemical or physical factors. Inducible promoters are a very powerful tool in genetic engineering because the expression of genes operably linked to them can be turned on or off at certain stages of development or growth of an organism or in a particular tissue or cells.
The term “expression refers to the biological production of a product encoded by a coding sequence. In most cases, a DNA sequence, includîng the coding sequence, is transcribed to form a messenger-RNA (mRNA). The messenger-RNA is then translated to form a polypeptide product that has a relevant biological activity. Also, the process of expression may involve further Processing steps to the RNA product of transcription, such as splicing to remove introns, and/or post-translational Processing of a polypeptide product.
The term “modified nucleic acid” as used herein is used to refer to a nucleic acid encoding modified iirapeptide as represented by SEQ ID NO: 19 or 20 or a functionally équivalent variant thereof. Functional variant includes any nucleic acid having substantial or significant sequence identity or similarity to SEQ ID NO: 19 or 20, and which retains the biological activity of the protein.
The terms polypeptide, peptide and protein are used interchangeably herein to refer to two or more amino acid residues joined to each other by peptide bonds or modified peptide bonds. The ternis apply to amino acid polymers in which one or more amino acid residue is an artificial Chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those containing modifiée! residues, and non-naturally occurring amino acid polymer. Polypeptide refers to both short chains, commonly referred to as peptides, oligopeptides or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may contain amino acids other than the 20 gene-encoded amino acids. Likewise, protein refers to at least two covalently attached amino acids, which includes proteins, polypeptides, oligopeptides and peptides. A protein may be made up of naturally occurring amino acids and peptide bonds, or synthetic peptidomimetic structures. Thus amino acid, or peptide residue, as used herein means both naturally occurring and synthetic amino acids. Amino acid includes imino acid residues such as proline and hydroxyproline. The side chains may be in either the (R) or the (S) configuration.
The term “N-terminal extension refers to a peptide or polypeptide sequence that is removably linked to the N-terminal amino acid of a desired polypeptide. In a preferred embodiment, the N-terminal extension comprises the amino acid sequence of SEQ ID NO: 1.
DETAILED DESCRIPTION OF THE INVENTION
The présent invention discloses N-terminal extensions, nucleic acids, vectors, and recombinant host cells for the efficient production of biologically active peptides such as lirapeptide.
The invention contemplâtes a multidimensional approach for achieving a high yield of recombinant lirapeptide in a host cell by providing an expression construct in which the nucleic acid encoding lirapeptide is operably fused to a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site and an N-terminal extension (NE-3).
In one embodiment, the invention relates to the N-terminal extension sequence set forth în SEQ ID NO: 1 (NE-3). The invention relates to a process for the enhanced expression of recombinant therapeutic peptides by several folds using a short N-terminal extension sequence as set forth in SEQ ID NO: 1, in bacteria or yeast.
In another embodiment, nucleic acids encoding the N-terminal extension sequence set forth in SEQ ID NO: 1 (NE-3) are also covered within the scope of the invention.
Suitable host cell for the expression of a recombinant therapeutic peptide is selected from eukaryotic hosts, such as, but not lîmited to yeast which includes Pichia pastoris and Saccharomyces cerevisiae. Bacterial hosts, such as, but not lîmited to Corynebacterium glutamicum, Escherichia coli and Bacillus subtilis can also be used.
The term therapeutic peptide includes peptides, such as, but not lîmited to Lirapeptide, Teriparatide, Exenatide and the like.
Constitutive or inducible promoters known to a person skilled in the art can be used in the expression cassettes in one or more embodiments of this invention.
In another embodiment, the présent invention provides expression cassettes comprising promoter, signal sequence, N-terminal extension (NE-3), gene encoding for Lirapeptide or Teriparatide, TEV cleavage site and terminator.
In an embodiment, the présent invention provides N-terminal extension sequence as set forth in SEQ !D NO: 1 to enhance the expression of therapeutic peptide in yeast wherein the yeast is Pichia pastoris, Saccharomyces cerevisiae, Corynebacterium glutamicum, Escherichia coli and Bacilles subtiiis.
The présent invention also provides TEV cleavage site having the amino acid sequence set forth in SEQ ID NO: 2.
In an embodiment, the présent invention provides enhanced expression of Lirapeptide as set forth in SEQ ID NO: 3 and Teriparatide as set forth in SEQ ID NO: 16 in bacteria or yeast using the N-terminal extension sequence as set forth in SEQ ID NO: 1.
Expression constructs known to person skilled in the art for expression of prokaryotic or eukaryotic proteins can be used in one or more embodiments of this invention.
In one embodiment, the présent invention also provides an expression constructforthe high-level expression of Lirapeptide which comprises of:
1. a modified gene sequence encoding the N-terminal extension sequence (NE3),
2. a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site, and
3. a modified gene sequence encoding Lirapeptide.
In another embodiment, the présent invention also provides an expression construct for the high-level expression of Lirapeptide which comprises of:
1. a gene sequence encoding the N-terminal extension sequence (NE-3) as set forth in SEQ ID NO: 4,
2. a gene sequence encoding TEV (Tobacco Etch Virus) cleavage site as set forth in SEQ ID NO: 5, and
3. a gene sequence encoding Lirapeptide as set forth in SEQ ID NO: 6.
In another embodiment, the présent invention also provides a method for high level expression of Lirapeptide as set forth in SEQ ID NO: 3 in Pichia Pastoris which comprises:
1. construction of a recombinant vector (expression construct) comprising the gene sequences as set forth in SEQ ID NO: 4, 5 and 6,
2. transformation of the expression construct into Pichia Pastoris,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Lirapeptide, and
6, cleavage ofthe N-terminal extension sequencefrom the purified Lirapeptide.
In another embodiment, the présent invention also provides an expression construct for the high-level expression of Lirapeptide which comprises of:
1. a gene sequence encoding the N-terminal extension sequence as set forth in SEQ ID NO: 7,
2. a gene sequence encoding TEVcleavage site as set forth în SEQ ID NO: 8, and
3. a gene sequence encoding Lirapeptide as set forth in SEQ ID NO: 9.
In an embodiment, the présent invention also provides a method for high level expression of Lirapeptide as set forth in SEQ ID NO: 3 in Corynebacterium glutamicum which comprises:
1. construction of a recombinant vector (expression construct) comprising the gene sequences as set forth in SEQ ID NO: 7, 8 and 9,
2. transformation of the expression construct into Corynebacterium glutamicum,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Lirapeptide, and
6. cleavage ofthe N-terminal extension sequence from the purified Lirapeptide.
The présent invention provides an expression construct for the high-level expression of Lirapeptide which comprises of;
1. a gene sequence encoding the N-terminal extension sequence as set forth in SEQ IDNO: 10,
2. a gene sequence encoding TEV cleavage site as set forth in SEQ ID NO: 11, and
3. a gene sequence encoding Lirapeptide as set forth in SEQ ID NO: 12.
In an embodiment, the présent invention also provides a method for high-level expression of Lirapeptide as set forth in SEQ ID NO: 3 in Escherichia coli which comprises:
1. construction of a recombinant vector (expression construct) comprising the gene sequences as set forth in SEQ ID NO: 10, 11 and 12,
2. transformation of the expression construct into Escherichia coli,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Lirapeptide, and
6. cleavage of the N-terminal extension sequence from the purified Lirapeptide.
The présent invention provides an expression construct for the high-level expression of Lirapeptide which comprises of:
1. a gene sequence encoding the N-terminal extension sequence as set forth in SEQ ID NO: 13,
2. a gene sequence encoding TEV cleavage site as set forth in SEQ ID NO: 14, and
3. a gene sequence encoding Lirapeptide as set forth in SEQ ID NO: 15.
In an embodiment, the présent invention also provides a method for high-level expression of Lirapeptide as set forth in SEQ ID NO: 3 in Bacillus subtilis which comprises:
1. construction of a recombinant vector (expression construct) comprising the gene sequences as set forth in SEQ ID NO: 13, 14 and 15,
2. transformation of the expression construct into Bacillus subtilis,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Lirapeptide, and
6. cleavage of the N-terminal extension sequence from the purified Lirapeptide.
The présent invention provides high level expression of Teriparatide as set forth in SEQ ID NO: 16 in Corynebacterium glutamicum using the expression construct comprising Nterminal extension sequence.
The présent invention provides high level expression of Teriparatide as set forth in SEQ ID NO: 16 in Pichia pastoris using the expression construct comprising N-terminal extension sequence.
The présent invention provides high level expression of Teriparatide as set forth in SEQ ID NO: 16 in Corynebacterium glutamicum which comprises the foilowing steps:
1. construction of a recombinant vector (expression construct)
2. transformation of the expression construct into Corynebacterium glutamicum,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Teriparatide, and
6. cleavage of the N-terminal extension sequence from the purified Teriparatide, The present invention also provides N-terminal extension sequence as set forth in SEQ ID NO: 1 to enhance expression of Teriparatide in Pichia pastoris which comprises the following steps:
1. construction of a recombinant vector (expression construct),
2. transformation of the constructed vector into Pichia pastoris,
3. évaluation of the clone and sélection thereof,
4. subjecting the selected clones to fermentation process,
5. isolation and purification of Teriparatide, and
6. cleavage of the N-terminal extension sequence from the purified Teriparatide.
In another embodiment, the invention provides a modified lirapeptide, wherein the lirapeptide is operably fused to TEV (Tobacco Etch Virus) cleavage site and an N-terminal extension sequence (NE-3), and wherein the modified lirapeptide is as set forth in SEQ ID NO: 19.
in another embodiment, the invention provides a method for expressing lirapeptide using recombinant host cells of the présent invention, wherein the fermentation process comprises:
a. culturing the recombinant host cells in BMGY media for about 24 hrs;
b. harvesting the recombinant host cells by centrifugation;
c. resuspending the recombinant host cells to an ODsœnm of about 10 in BMMY medium;
d. incubating the host cells in a shaker incubator for about 24 hrs at 30°C;
e. harvesting and purifying the culture supernatants to obtain lirapeptide.
Liraglutide, an analog of human GLP-1 and acts as a GLP-1 receptor agonist. Liraglutide is made by attaching a C-16 fatty acid (palmitic acid) with a glutamic acid spacer on the remaining lysine residue at position 26 of the peptide precursor (lirapeptide as set forth in SEQ ID NO: 3).
In another embodiment, the invention provides préparation of Liraglutide which comprises conjugation of lirapeptide produced as per the invention with palmityl glutamate dérivative such as 1 -methyl palmityl glutamic acid, using methods known in the art.
In another embodiment, the invention provides préparation of Liraglutide which comprises conjugation of lirapeptide produced as per the invention with palmityl glutamate dérivatives, wherein dérivatives are such as methyl (1 -methyl palmityl glutamic acid), ethyl, propyl, prop-2-yl, butyl, but-2-yl, 2-methylprop-1-yl, 2-methyl-prop-2-yl (tert-butyl), hexyl and the like, using methods known in the art. This conjugation reaction is carried out in the presence of a coupling reagent. The coupling agent may be selected from the group of DIC/6-CI-HOBt, DIC/HOBt, HBTU/HOBVDIEA or DIC/Oxyma.
In another embodiment, the invention provides a method for préparation of liraglutide, said method comprising the steps of:
a. culturing the recombinant host cell of the présent invention in a suitable culture medium to obtain lirapeptide;
b. converting lirapeptide to liraglutide, wherein the method comprises conjugation of lirapeptide obtained in step (a) with a palmityl glutamate dérivative.
The above disclosure generally describes the présent invention. A more complété understanding can be obtained by reference to the following spécifie examples. This example is described solely for the purposes of illustration and are not intended to limit the scope of the invention. Although spécifie terms hâve been employed herein, such terms are intended in a descriptive sense and not for purposes of limitation.
EXAMPLES
Example 1: Modified nucleic acid for expression of lirapeptide
Expression cassettes encoding for liraglutide precursor peptide was modified for optimum expression in Pichia pastons, Corynebacterium glutamicum, Escherichia coli and Bacillus subtîlis. The modified open reading frame comprising the nucléotide sequence encoding lirapeptide fused to a sequence encoding a TEV (Tobacco Etch Virus) cleavage site and a sequence encoding an N-terminal extension (NE-3 or NE-1). The preferred codons for expression in Pichia pastoris, Corynebacterium glutamicum, Escherichia coli and Bacillus subtîlis hâve been used in place of rare codons.
As a control, an open reading frame comprising the nucléotide sequence encoding lirapeptide without any N-terminal extension was prepared.
For expression in Pichia pastoris
For expression in Pichia Pastoris, the nucléotide sequence encoding lirapeptide, the nucléotide sequence encoding TEV cleavage site and the nucléotide sequence encoding the N-terminal extension was modified.
The nucléotide sequence encoding lirapeptide is represented by SEQ ID NO: 6. The nucléotide sequence TEV cleavage site is represented by SEQ ID NO: 5. The nucléotide sequence encoding N-terminal extension (NE-3) is represented by SEQ ID NO: 4.
This modified open reading frame has been artificially synthesized using the sequence for lirapeptide, sequence for TEV cleavage site and the sequence for N-terminal extension.
The modified open reading frame comprising DNA encoding Lirapeptide without Nterminal extension (Figure 1A) and with N-terminal extension-1 (NE1) (GAGGAAGCGFigure IB), N-terminal extension-3 (NE3) (GAAGAACAAGCCGAA-Figure 10), along with TEV (Tobacco Etch Virus) cleavage sequence (GAGAACTTGTACTTCCAA) and signal sequence cassettes are represented in Figure 1.
The modified sequence encoding for the recombinant lirapeptide was cloned into pD912 expression vector (Atum, USA). The recombinant plasmid contains the open reading frame and a promoter.
The vector map of pD912 is represented in Figure 1.
For expression in Corynebacterium glutamicum
For expression in Corynebacterium glutamicum, the nucléotide sequence encoding lirapeptide, the nucléotide sequence encoding TEV cleavage site and the nucléotide sequence encoding the N-terminal extension was modified.
The nucléotide sequence encoding lirapeptide is represented by SEQ ID NO: 9. The nucléotide sequence TEV cleavage site is represented by SEQ ID NO: 8. The nucléotide sequence encoding N-terminal extension (NE-3) is represented by SEQ ID NO: 7.
This modified open reading frame has been artificially synthesized using using Thermo Fisher Scientific technique utilizing the sequence for lirapeptide, sequence for TEV cleavage site and the sequence for N-terminal extension.
The modified sequence encoding for the recombinant lirapeptide was cloned into pD912 expression vector (Atum, USA). The recombinant plasmid contains the open reading frame and a promoter.
For expression in Escherichia coli
For expression in Escherichia coli, the nucléotide sequence encoding lirapeptide, the nucléotide sequence encoding TEV cleavage site and the nucléotide sequence encoding the N-terminal extension was modified.
The nucléotide sequence encoding lirapeptide is represented by SEQ ID NO: 12. The nucléotide sequence TEV cleavage site is represented by SEQ ID NO: 11. The nucléotide sequence encoding N-terminal extension (NE-3) is represented by SEQ ID NO: 10.
This modified open reading frame has been artificially synthesized using the sequence for lirapeptide, sequence for TEV cleavage site and the sequence for N-terminal extension.
The modified sequence encoding for the recombinant lirapeptide was cloned into pD912 expression vector (Atum, USA). The recombinant plasmid contains the open reading frame and a promoter.
For expression in Bacillus subtilis
For expression in Bacillus subtilis, the nucléotide sequence encoding lirapeptide, the nucléotide sequence encoding TEV cleavage site and the nucléotide sequence encoding the N-terminal extension was modified.
The nucléotide sequence encoding lirapeptide is represented by SEQ ID NO: 15. The nucléotide sequence TEV cleavage site is represented by SEQ ID NO: 14. The nucléotide sequence encoding N-terminal extension (NE-3) is represented by SEQ ID NO: 13.
This modified open reading frame has been artificially synthesized using the sequence for lirapeptide, sequence for TEV cleavage site and the sequence for N-terminal extension.
The modified sequence encoding for the recombinant lirapeptide was cloned into pD912 expression vector (Atum, USA). The recombinant plasmid contains the open reading frame and a promoter.
Confirmation of linearization of plasmid DNA
The synthetic DNA encoding Lirapeptide without N-terminal extension, N-terminal exten$ion-1, N-terminal extension-3 and plasmid pD912 were digested with EcoR! and Bglll restriction enzymes. The restriction digested fragments were ligated and transformed into Escherichia coli strain. The résultant plasmids, containing Lirapeptide expression cassettes, without N-terminal extension (Figure 2A), N-terminal extension-1(Figure-2B), Nterminal extension-3 (Figure-2C) were sequenced to confirm Lirapeptide, N-Terminal extension, and TEV cleavage sequence.The sequence confirmed plasmid DNA’s were linearized with Sac I enzyme.
Example 2: Development of recombinant host cell by transformation with recombinant plasmids
Recombinant pD912 plasmids as described in foregoing example carrying the gene for liraglutide precursor peptide fused to signal peptides were used for development of recombinant hosts.
Pichia pastoris host cells (obtained from Atum, USA) were transformed using the plasmids by electroporation method.
The transformed cells were plated on YPD agar (Yeast Peptone Dextrose) plates containing 100pg/ml zeocin .The transformed Pichia pastoris cells were grown in 20 ml BMGY media for 24hrs. Cells were harvested by centrifugation and re-suspended to an ODsœrm of 10 in 20ml BMMY medium. The cell suspension was incubated in shaker incubator for 24 hrs at 30°C.
Example 3: Analysis and évaluation of lirapeptide expression
After 24 hours, the culture supernatants were harvested, purified and analysed for Lirapeptide expression by ELISA (Figure 3) using monoclonal antibody spécifie to Lirapeptide. Further, the dry cell weight was measured using moisture analyser (Figure 4).
Table 1 and Table 2 provides a comparison establishing the efficacy of the Nterminal extensions in improving the yield of lirapeptide.
Table 1: Different N-terminal extensions and Lirapeptide expression compared to a control with no N-terminal extension
Extension | Clones | LP expression (Percentage of control) | OD at 450 nm |
None (control) | 100 | 0.52 | |
EEA | 1 | 103 | 0.561 |
EEA | 2 | 75 | 0.409 |
EEA | 3 | 87 | 0.474 |
EEQAE | 1 | 459 | 2.488 |
EEQAE | 2 | 479 | 2.598 |
EEQAE | 3 | 443 | 2.4 |
Table 2: Different N-terminal extensions and fold change in Lirapeptide expression compared to control with no N-terminal extension
Extension | Clones | Fold différence |
EEA | 1 | 1.0 |
EEA | 2 | 0.8 |
EEA | 3 | 0.9 |
EEQAE | 1 | 4.6 |
EEQAE | 2 | 4.8 |
EEQAE | 3 | 4.4 |
The above data clearly shows that the N-terminal extension is able to improve the expression of lirapeptide by about 5-folds as compared to control and known N-terminal extensions.
Claims (16)
- CLAIMS:1. An N-terminal extension consîsting of the amino acid sequence of SEQ ID NO: 1.
- 2. A nucleic acid encoding the N-terminal extension as claimed in claim 1.
- 3. The nucleic acid as claimed in claim 2, wherein the nucleic acid is selected from a group comprising SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10 and SEQ ID NO: 13.
- 4. A vector comprising the nucleic acid as claimed in 3.
- 5. The vector as claimed in claim 4, wherein the vector is pD912.
- 6. The vector as claimed in claim 4, wherein the vector comprises a modified TEV cleavage site selected from a group comprising SEQ ID NO: 5, SEQ ID NO: 8, SEQ ID NO: 11 and SEQ ID NO: 14.
- 7. A vector for recombinant expression of lirapeptide comprising:a. a modified gene sequence encoding the N-terminal extension sequence (NE3) selected from a group comprising SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10 and SEQ ID NO: 13;b. a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site selected from a group comprising SEQ ID NO: 5, SEQ ID NO: 8, SEQ ID NO: 11 and SEQ ID NO: 14; andc. a modified gene sequence encoding lirapeptide selected from a group comprising SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 12 and SEQ ID NO: 15.
- 8. The vector as claimed in claim 7, wherein the vector is modified for high level expression of lirapeptide in Pichia pastoris comprising:a. a modified gene sequence encoding the N-terminal extension sequence (NE3) comprising the nucléotide sequence of SEQ ID NO: 4;b. a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site comprising the nucléotide sequence of SEQ ID NO: 5; andc. a modified gene sequence encoding lirapeptide comprising the nucléotide sequence of SEQ ID NO: 6.
- 9. The vector as claimed in claim 7, wherein the vector is modified for high level expression of lirapeptide in Corynebacterium glutamicum comprising:a. a modified gene sequence encoding the N-terminal extension sequence (NE3) comprising the nucléotide sequence of SEQ ID NO:7;b. a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site comprising the nucléotide sequence of SEQ ID NO: 8; andc. a modified gene sequence encoding lirapeptide comprising the nucléotide sequence of SEQ ID NO: 9.
- 10. The vector as claimed in claim 7, wherein the vector is modified for high level expression of lirapeptide in Escherichia coli comprising:a. a modifiée! gene sequence encoding the N-terminal extension sequence (NE3) comprising the nucléotide sequence of SEQ ID NO: 10;b. a modifîed gene sequence encoding TEV (Tobacco Etch Virus) cleavage site comprising the nucléotide sequence of SEQ ID NO: 11 ; andc. a modified gene sequence encoding lirapeptide comprising the nucléotide sequence of SEQ ID NO: 12.
- 11. The vector as claimed in claim 7, wherein the vector is modified for high level expression of lirapeptide in Bacillus subti/is comprising:a. a modified gene sequence encoding the N-terminal extension sequence (NE3) comprising the nucléotide sequence of SEQ ID NO: 13;b. a modified gene sequence encoding TEV (Tobacco Etch Virus) cleavage site comprising the nucléotide sequence of SEQ ID NO: 14; andc. a modified gene sequence encoding lirapeptide comprising the nucléotide sequence of SEQ ID NO: 15.
- 12. A recombinant host cell comprising the vector as claimed in claim 7.
- 13. The recombinant host cell as claimed in claim 12, wherein the recombinant host cell is selected from a group comprising Pichia pastoris, Saccharomyces cerevisiae, Corynebacterium glutamicum, Escherichia coli and Bacillus subtilis.
- 14. A modified lirapeptide comprising the amino acid sequence of SEQ ID NO: 19, wherein the lirapeptide is operably fused to TEV (Tobacco Etch Virus) cleavage site and an Nterminal extension sequence (NE-3).
- 15. A method for expressing lirapeptide using recombinant host cells as claimed in claim 12, wherein a fermentation process of the method comprises:a. culturing the recombinant host cells in BMGY media for about 24 hrs;b. harvesting the recombinant host cells by centrifugation;c. resuspending the recombinant host cells to an ODeoonm of about 10 in BMMY medium;d. incubating the host cells in a shaker incubator for about 24 hrs at 30°C; ande. harvesting and purifying the culture supernatants to obtain lirapeptide.
- 16. A method for préparation of liraglutide, said method comprising the steps of:a. culturing the recombinant host cell as claimed in claim 12 in a suitable culture medium to obtain lirapeptide; andb. converting lirapeptide to liraglutide, wherein the method comprises conjugation of lirapeptide obtained in step (a) with a palmityl glutamate dérivative.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN201941009728 | 2019-09-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
OA20681A true OA20681A (en) | 2022-12-30 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8962811B2 (en) | Growth hormone and interferon-alpha 2 glycoproteins produced in plants | |
KR102699074B1 (en) | Carbon-source regulated protein production in a recombinant host cell | |
WO2007148345A2 (en) | A method of producing biologically active polypeptide having insulinotropic activity | |
CA2279836A1 (en) | Polynucleotides encoding proexendin, and methods and uses thereof | |
SK116896A3 (en) | Enhanced secretion of polypeptides | |
JPS6236183A (en) | Ugenoyobibunpi | |
US20160168226A1 (en) | Process for production of insulin and insulin analogues | |
US8304210B2 (en) | Method for the secretory production of heterologous protein in Escherichia coli | |
US20230002468A1 (en) | N-terminal extension sequence for expression of recombinant therapeutic peptides | |
WO2005100388A1 (en) | Production of insulinotropic peptides | |
OA20681A (en) | N-terminal extension sequence for expression of recombinant therapeutic peptides. | |
AU2019218315A1 (en) | Codon optimized precursor gene and signal peptide gene of human insulin analogue | |
EA044776B1 (en) | N-TERMINAL EXTENDATION SEQUENCE FOR THE EXPRESSION OF RECOMBINANT THERAPEUTIC PEPTIDES | |
EP3711772A1 (en) | Recombinant proteins and fusion proteins | |
WO2020200414A1 (en) | Protein production in mut-methylotrophic yeast | |
US20090035815A1 (en) | Synthetic Gene for Enhanced Expression in E. Coli | |
CA2529282C (en) | Recombinant igf expression systems | |
EP4163376A1 (en) | Insulin aspart derivative, and preparation method therefor and use thereof | |
Rao et al. | Expression, purification, and characterisation of nesiritide using an E. coli expression system | |
WO2000020610A1 (en) | Disruption of the kex1 gene in pichia and methods of full length protein expression | |
CN104672334B (en) | Recombinate the preparation method of Long IGF-1 R3-I | |
KR20220009943A (en) | MUT- methylotrophic yeast | |
KR20200082618A (en) | Ramp Tag for Overexpressing Insulin and Method for Producing Insulin Using the Same | |
KR20040007892A (en) | Recombinant Human Ferritin Protein And A Process for Producing the Same | |
KR20070035483A (en) | Chimeric human growth hormone derived from the placenta and pituitary isoform and processes for obtaining said chimera |