CN114686504B - Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 - Google Patents
Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 Download PDFInfo
- Publication number
- CN114686504B CN114686504B CN202011620414.6A CN202011620414A CN114686504B CN 114686504 B CN114686504 B CN 114686504B CN 202011620414 A CN202011620414 A CN 202011620414A CN 114686504 B CN114686504 B CN 114686504B
- Authority
- CN
- China
- Prior art keywords
- lpp
- fusion protein
- escherichia coli
- glp
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 241000588724 Escherichia coli Species 0.000 title claims abstract description 51
- 108010006519 Molecular Chaperones Proteins 0.000 title claims abstract description 31
- 230000003248 secreting effect Effects 0.000 title claims abstract description 27
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 title claims abstract description 21
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 title claims abstract description 21
- 102000005431 Molecular Chaperones Human genes 0.000 title abstract description 22
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 72
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 71
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 48
- 238000000855 fermentation Methods 0.000 claims abstract description 40
- 230000004151 fermentation Effects 0.000 claims abstract description 40
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 37
- 238000000034 method Methods 0.000 claims abstract description 23
- 239000013598 vector Substances 0.000 claims description 21
- 108091005804 Peptidases Proteins 0.000 claims description 17
- 108010019598 Liraglutide Proteins 0.000 claims description 15
- 239000004365 Protease Substances 0.000 claims description 15
- 229960002701 liraglutide Drugs 0.000 claims description 15
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 14
- YSDQQAXHVYUZIW-QCIJIYAXSA-N Liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 claims description 13
- 150000004665 fatty acids Chemical group 0.000 claims description 13
- 239000013604 expression vector Substances 0.000 claims description 12
- 230000004048 modification Effects 0.000 claims description 12
- 238000012986 modification Methods 0.000 claims description 12
- 108010075254 C-Peptide Proteins 0.000 claims description 10
- 230000004927 fusion Effects 0.000 claims description 10
- 241000894006 Bacteria Species 0.000 claims description 8
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 8
- 239000000194 fatty acid Substances 0.000 claims description 8
- 229930195729 fatty acid Natural products 0.000 claims description 8
- 239000003795 chemical substances by application Substances 0.000 claims description 7
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 6
- 102100031780 Endonuclease Human genes 0.000 claims description 4
- 108010042407 Endonucleases Proteins 0.000 claims description 4
- 108010013369 Enteropeptidase Proteins 0.000 claims description 4
- 102100029727 Enteropeptidase Human genes 0.000 claims description 4
- 239000000411 inducer Substances 0.000 claims description 4
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 4
- 108090000631 Trypsin Proteins 0.000 claims description 3
- 102000004142 Trypsin Human genes 0.000 claims description 3
- 239000012634 fragment Substances 0.000 claims description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical group CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims description 3
- 239000012588 trypsin Substances 0.000 claims description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims description 2
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 claims description 2
- 238000013461 design Methods 0.000 claims description 2
- 229920001184 polypeptide Polymers 0.000 claims description 2
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 2
- 238000001308 synthesis method Methods 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 4
- 230000012010 growth Effects 0.000 abstract description 7
- 238000005336 cracking Methods 0.000 abstract description 5
- 230000028327 secretion Effects 0.000 abstract description 5
- 238000009776 industrial production Methods 0.000 abstract description 3
- 230000002349 favourable effect Effects 0.000 abstract description 2
- 239000006228 supernatant Substances 0.000 description 30
- 210000004027 cell Anatomy 0.000 description 25
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 19
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 description 15
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 description 15
- 235000001014 amino acid Nutrition 0.000 description 13
- 239000002609 medium Substances 0.000 description 11
- 108010076504 Protein Sorting Signals Proteins 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 150000007523 nucleic acids Chemical group 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 7
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 7
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 7
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 7
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 7
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 230000003834 intracellular effect Effects 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 235000018102 proteins Nutrition 0.000 description 7
- 238000003259 recombinant expression Methods 0.000 description 7
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 6
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 6
- 108020004414 DNA Proteins 0.000 description 6
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 6
- 108010079246 OMPA outer membrane proteins Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 4
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 4
- 229940089838 Glucagon-like peptide 1 receptor agonist Drugs 0.000 description 4
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 4
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 4
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 238000001819 mass spectrum Methods 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 239000002243 precursor Substances 0.000 description 4
- GCYXWQUSHADNBF-AAEALURTSA-N preproglucagon 78-108 Chemical class C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 GCYXWQUSHADNBF-AAEALURTSA-N 0.000 description 4
- 239000013558 reference substance Substances 0.000 description 4
- 238000011218 seed culture Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 4
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 3
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 3
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 101800004266 Glucagon-like peptide 1(7-37) Proteins 0.000 description 3
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 3
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 3
- -1 Liraglutide fatty acid Chemical class 0.000 description 3
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 3
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 3
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 3
- 101710116435 Outer membrane protein Proteins 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 3
- 241000219784 Sophora Species 0.000 description 3
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 3
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 3
- 230000010933 acylation Effects 0.000 description 3
- 238000005917 acylation reaction Methods 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 235000011187 glycerol Nutrition 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 239000012474 protein marker Substances 0.000 description 3
- 108010060175 trypsinogen activation peptide Proteins 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 2
- 125000004042 4-aminobutyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])N([H])[H] 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 108010016626 Dipeptides Proteins 0.000 description 2
- 108010067722 Dipeptidyl Peptidase 4 Proteins 0.000 description 2
- 102100025012 Dipeptidyl peptidase 4 Human genes 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- 102000008100 Human Serum Albumin Human genes 0.000 description 2
- 108091006905 Human Serum Albumin Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 2
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000003287 bathing Methods 0.000 description 2
- LLSDKQJKOVVTOJ-UHFFFAOYSA-L calcium chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Ca+2] LLSDKQJKOVVTOJ-UHFFFAOYSA-L 0.000 description 2
- 229940052299 calcium chloride dihydrate Drugs 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 108010005400 cutinase Proteins 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 239000003877 glucagon like peptide 1 receptor agonist Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 238000004896 high resolution mass spectrometry Methods 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 210000001322 periplasm Anatomy 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 229940044601 receptor agonist Drugs 0.000 description 2
- 239000000018 receptor agonist Substances 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 2
- JIAARYAFYJHUJI-UHFFFAOYSA-L zinc dichloride Chemical compound [Cl-].[Cl-].[Zn+2] JIAARYAFYJHUJI-UHFFFAOYSA-L 0.000 description 2
- AYMLQYFMYHISQO-QMMMGPOBSA-N (2s)-3-(1h-imidazol-3-ium-5-yl)-2-[(2-methylpropan-2-yl)oxycarbonylamino]propanoate Chemical compound CC(C)(C)OC(=O)N[C@H](C(O)=O)CC1=CN=CN1 AYMLQYFMYHISQO-QMMMGPOBSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 101100136076 Aspergillus oryzae (strain ATCC 42149 / RIB 40) pel1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 235000019687 Lamb Nutrition 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108700006385 OmpF Proteins 0.000 description 1
- 101710203389 Outer membrane porin F Proteins 0.000 description 1
- 101710160104 Outer membrane protein F Proteins 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- 101710195090 Protein kil Proteins 0.000 description 1
- 101100084022 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) lapA gene Proteins 0.000 description 1
- 241000588746 Raoultella planticola Species 0.000 description 1
- DLSWIYLPEUIQAV-UHFFFAOYSA-N Semaglutide Chemical compound CCC(C)C(NC(=O)C(Cc1ccccc1)NC(=O)C(CCC(O)=O)NC(=O)C(CCCCNC(=O)COCCOCCNC(=O)COCCOCCNC(=O)CCC(NC(=O)CCCCCCCCCCCCCCCCC(O)=O)C(O)=O)NC(=O)C(C)NC(=O)C(C)NC(=O)C(CCC(N)=O)NC(=O)CNC(=O)C(CCC(O)=O)NC(=O)C(CC(C)C)NC(=O)C(Cc1ccc(O)cc1)NC(=O)C(CO)NC(=O)C(CO)NC(=O)C(NC(=O)C(CC(O)=O)NC(=O)C(CO)NC(=O)C(NC(=O)C(Cc1ccccc1)NC(=O)C(NC(=O)CNC(=O)C(CCC(O)=O)NC(=O)C(C)(C)NC(=O)C(N)Cc1cnc[nH]1)C(C)O)C(C)O)C(C)C)C(=O)NC(C)C(=O)NC(Cc1c[nH]c2ccccc12)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CCCNC(N)=N)C(=O)NCC(O)=O DLSWIYLPEUIQAV-UHFFFAOYSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 241000203780 Thermobifida fusca Species 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- 101100398653 Yersinia pestis lamB1 gene Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 239000004327 boric acid Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229960002713 calcium chloride Drugs 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 210000000748 cardiovascular system Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- GFHNAMRJFCEERV-UHFFFAOYSA-L cobalt chloride hexahydrate Chemical compound O.O.O.O.O.O.[Cl-].[Cl-].[Co+2] GFHNAMRJFCEERV-UHFFFAOYSA-L 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- MPTQRFCYZCXJFQ-UHFFFAOYSA-L copper(II) chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Cu+2] MPTQRFCYZCXJFQ-UHFFFAOYSA-L 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- YBZZSZQZDODUAA-UHFFFAOYSA-N digitoxigenin alpha-L-cymaroside Natural products O1C(C)C(O)C(OC)CC1OC1CC(CCC2C3(CCC(C3(C)CCC32)C=2COC(=O)C=2)O)C3(C)CC1 YBZZSZQZDODUAA-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 239000012526 feed medium Substances 0.000 description 1
- 239000006052 feed supplement Substances 0.000 description 1
- 229960002089 ferrous chloride Drugs 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000004026 insulin derivative Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- NMCUIPGRVMDVDB-UHFFFAOYSA-L iron dichloride Chemical compound Cl[Fe]Cl NMCUIPGRVMDVDB-UHFFFAOYSA-L 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 101150012518 lamB gene Proteins 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- ISPYRSDWRDQNSW-UHFFFAOYSA-L manganese(II) sulfate monohydrate Chemical compound O.[Mn+2].[O-]S([O-])(=O)=O ISPYRSDWRDQNSW-UHFFFAOYSA-L 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 101150073640 ompF gene Proteins 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 101150040383 pel2 gene Proteins 0.000 description 1
- 101150050446 pelB gene Proteins 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 101150009573 phoA gene Proteins 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 108010060325 semaglutide Proteins 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- RWVGQQGBQSJDQV-UHFFFAOYSA-M sodium;3-[[4-[(e)-[4-(4-ethoxyanilino)phenyl]-[4-[ethyl-[(3-sulfonatophenyl)methyl]azaniumylidene]-2-methylcyclohexa-2,5-dien-1-ylidene]methyl]-n-ethyl-3-methylanilino]methyl]benzenesulfonate Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C(=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C)C=2C(=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C)C=C1 RWVGQQGBQSJDQV-UHFFFAOYSA-M 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 239000013585 weight reducing agent Substances 0.000 description 1
- 239000011592 zinc chloride Substances 0.000 description 1
- 235000005074 zinc chloride Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/35—Fusion polypeptide containing a fusion for enhanced stability/folding during expression, e.g. fusions with chaperones or thioredoxin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Gastroenterology & Hepatology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Endocrinology (AREA)
- Toxicology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明公开了Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用。本发明发明人意外发现Lpp作为目的蛋白的分子伴侣,有利于包含分子伴侣和目的蛋白的重组蛋白的分泌表达,产量高,且对大肠杆菌菌体生长的影响小,无显著的菌体裂解情况,易于实现高密度发酵,便于工业化生产。通过该应用,本发明还提供了一种用于大肠杆菌分泌表达的融合蛋白以及一种大肠杆菌分泌表达融合蛋白的方法。
Description
技术领域
本发明属于生物医药领域,特别涉及Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用。
背景技术
诺和诺德专利US20090156478介绍了一种GLP-1受体激动剂(利拉鲁肽,Victoza)的制备方法,34位赖氨酸突变成精氨酸避免修饰,在第26位赖氨酸位点酰化连接脂肪酸链,便可以与人血清白蛋白(HSA)非共价结合,可以延长半衰期至1天左右,每天只需注射一次,同时具有心血管保护和减肥效果。
诺和诺德专利US9732137介绍了另一种新的GLP-1受体激动剂(索玛鲁肽,Ozempic)的制备方法,34位赖氨酸突变成精氨酸避免修饰,DPP-IV的识别位点第八位氨基酸由丙氨酸突变成非天然氨基酸α-氨基异丁酸减少DPP-Ⅳ降解,同时在26位赖氨酸通过酰化连接脂肪酸链,可以将半衰期延长到1周左右,每周注射一次,同时具有保护心血管和减肥效果。
利拉鲁肽和索玛鲁肽表达宿主为诺和诺德专利酿酒酵母(S.cerevisiae ME1719)(MATa/αleu2/1eu2pep-4-3/pep-4-3Δtpi::LEU2/Δtpi::LEU2Δura3/Δura3Δyps 1::URA3/Δyps1::ura3 Cir+),经过多年改造,多个蛋白酶被敲除,减少蛋白酶降解,但这类复杂的基因改造在其他酵母上难以实现。同时还使用了专利小片段前导肽,形成GLP-1融合蛋白,可以进一步减少蛋白酶降解,并减少错误修饰,但这类小分子前导肽设计难度大,有极大概率会被降解,无法保护易被蛋白酶降解的GLP-1受体激动剂。因此,诺和诺德也是世界上唯一一家以酵母为宿主菌表达GLP-1受体激动剂的公司。
除了酵母外,大肠杆菌也是常用的宿主菌,常见的表达方式为胞内表达。由于GLP-1及其类似物分子量小,极易被胞内蛋白酶降解,因此无法使用常规信号肽如OmpA(CN109825488A)、PelB、PhoA、OmpF、LamB等(Secretory and extracellular productionof recombinant proteins using Escherichia coli)直接表达,需要使用大分子量的分子伴侣形成融合蛋白。常见分子伴侣有TrxA(CN106434717A)、KSI(CN107881187A)、GST(Astrategy for fusion expression and preparation of functional glucagon-likepeptide-1(GLP-1)analogue by introducing an enterokinase cleavage site)等,主要表达方式为胞内表达。
但大肠杆菌胞内表达涉及到细胞破碎,破碎后细胞内源蛋白释放出来,大大增加了纯化压力。为了获得纯度较高的目的蛋白需要多步纯化工艺,同时为了减少胞内蛋白酶降解目的蛋白,需要设计大分子量的分子伴侣进行保护,造成目的蛋白在融合蛋白占比低,产量低。
胞外分泌表达可以直接分泌到胞外培养基,无需破碎,纯化更方便。纤维素酶蛋白(CN201210264387)、细胞素释放蛋白Kil(High-level expression of a recombinantprotein in Klebsiella planticola owing to induced secretion into the culturemedium)、角质酶(重组Thermobifida fusca角质酶的高效胞外表达及其分子机制)、外膜蛋白OmpF(Excretion of Human-Endorphin into Culture Medium by Using OuterMembrane Protein F as a Fusion Partner in Recombinant Escherichia coli)等通过共表达或融合蛋白可促进外源蛋白分泌至发酵液上清。Lpp'-OmpA'(Display of beta-lactamase on the Escherichia coli surface:outer membrane phenotypes conferredby Lpp'-OmpA'-beta-lactamase fusions)可将融合蛋白定位到外膜表面,少量可以渗漏到胞外。
胞外分泌使用的分子伴侣多数为促裂解蛋白或外膜蛋白,促裂解蛋白大量表达加速细菌裂解死亡,从而释放融合蛋白到胞外,但细胞密度难以积累。外膜蛋白定位到细胞外膜,大量表达会有部分分泌到胞外。尽管对大肠杆菌生长影响小,但分子量较大(多数在20KDa以上),融合蛋白中目的蛋白占比低,造成产量偏低;且大分子量的分子伴侣由于序列更长,酶切分离目的基因时错切概率增加,会产生更多杂质;另外更长的序列也会形成更复杂的空间结构,容易与目的蛋白发生共价结合,封闭酶切位点,增加分离难度。
发明内容
本发明的首要目的在于克服现有技术的缺点与不足,提供Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用。
本发明的另一目的在于提供一种用于大肠杆菌分泌表达的融合蛋白,所述融合蛋白由分子伴侣、连接肽、目的蛋白依次连接组成,所述分子伴侣为Lpp或其突变体。
本发明的再一目的在于提供一种大肠杆菌分泌表达融合蛋白的方法,该方法使用大肠杆菌Lpp蛋白或其突变体作为分子伴侣,在大肠杆菌中分泌表达外源蛋白。相比传统的全固相合成,步骤简单,成本更低,环保压力更小。相比目前国内常用的大肠杆菌胞内可溶表达和包涵体表达方式,本发明表达的融合蛋白可直接分泌到胞外培养基,下游纯化步骤少,产量高。相比诺和诺德酿酒酵母表达方式,本发明使用的大肠杆菌生长快,发酵周期短,生产成本更低。
本发明的目的通过下述技术方案实现:Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用,是基于本发明发明人意外发现Lpp作为目的蛋白的分子伴侣,有利于包含分子伴侣和目的蛋白的重组蛋白的分泌表达,产量高,且对大肠杆菌菌体生长的影响小,无显著的菌体裂解情况,易于实现高密度发酵,便于工业化生产。
所述的Lpp为脂质蛋白,其氨基酸序列如下所示:
MKATKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYRK。
所述的重组蛋白的结构如下:A-B-C;其中,
A为分子伴侣,主要作用是减少目的蛋白降解,协助目的蛋白转运到周质空间,进而分泌到胞外;选自Lpp或mLpp,m表示突变体;
B为缺失或用于蛋白酶识别的连接肽;缺失表示不存在,即重组蛋白的结构为A-C;连接肽的主要作用是提供蛋白酶识别位点,以便分子伴侣和目的基因的分离,优选DDDDK;
C为目的蛋白,优选长度为20~50个氨基酸的目的蛋白;更优选长度为27~31个氨基酸的目的蛋白。
一种用于大肠杆菌分泌表达的融合蛋白,是基于上述应用设计得到,其结构式如下:A-B-C;其中,
A为分子伴侣,选自Lpp或mLpp,m表示突变体;
B为缺失或用于蛋白酶识别的连接肽,优选为DDDDK;
C为目的蛋白,优选长度为20~50个氨基酸的目的蛋白;更优选长度为27~31个氨基酸的目的蛋白;包括但不限于GLP-1或其类似物、胰岛素类、蛋白酶类,优选为GLP-1或其类似物,包括但不限于GLP-1(7-37)、GLP-1(9-37)或GLP-1(11-37)。
一种大肠杆菌分泌表达融合蛋白的方法,包括以下步骤:
(1)获得编码上述用于大肠杆菌分泌表达的融合蛋白的融合基因,将上述融合基因构建到表达载体,得到重组载体;
(2)将重组载体转化宿主细胞;
(3)将含有重组载体的宿主细胞发酵、纯化,获得融合蛋白。
所述的大肠杆菌分泌表达融合蛋白的方法,还包括以下步骤:
(4)对得到的融合蛋白进行脂肪酸侧链修饰;
(5)蛋白酶酶切脂肪酸侧链修饰后的融合蛋白的连接肽,获得侧链修饰的融合蛋白;
(6)如有必要,对侧链修饰的融合蛋白进行转肽,连接另外一段多肽。
步骤(1)中所述的融合基因可通过直接合成法获得,或是通过片段拼接得到。
步骤(1)中所述的表达载体可选自大肠杆菌常用载体或将tac启动子替换pET载体中的T7启动子得到的载体。
所述的pET载体是pET系列表达载体;优选为pET-28a(+)载体。
步骤(2)中所述的宿主细胞为野生型或改造型大肠杆菌,如大肠杆菌BL21(DE3)或其改造菌、大肠杆菌W3110或其改造菌。
步骤(3)中所述的发酵优选为在发酵后期添加诱导剂,诱导表达。
所述的诱导剂优选为IPTG。
步骤(4)中所述的脂肪酸侧链修饰优选为使用利拉鲁肽或索玛鲁肽脂肪酸酰化剂进行修饰。
步骤(5)中所述的蛋白酶优选为肠激酶、胰蛋白酶和赖氨酰内切酶中的至少一种。
本发明相对于现有技术具有如下的优点及效果:
1、本发明使用商业化大肠杆菌作为宿主菌,生长快速,产量高,发酵密度高。重组蛋白的表达形式为分泌到胞外培养基,无需破碎细胞,直接纯化发酵液上清即可,纯度更高,工艺更简单。相比全固相合成,更加经济环保;相比大肠杆菌胞内可溶或包涵体表达,无需破碎细胞。对宿主菌要求低,直接使用商业化菌株即可,免去繁琐的菌种突变及筛选操作。
2、本发明使用Lpp作为分子伴侣,利用其高度可溶性和高转录表达,可以增加GLP-1及其类似物受体激动剂对蛋白酶的抵抗能力,从而增加产量;Lpp可以定位到周质空间,协助目的蛋白转运;Lpp等电点和GLP-1及其类似物相差较大,空间构象上与Lpp可以紧密结合;Lpp分子量小,GLP-1及其类似物受体激动剂占比高,进一步增加GLP-1及其类似物的产量,Lpp融合蛋白对大肠杆菌菌体生长的影响小,无显著的菌体裂解情况。Lpp内部碱性氨基酸少,可以使用常规的胰蛋白酶,赖氨酰内切酶,肠激酶等切割分离分子伴侣和目的蛋白,产生的杂质更少,便于纯化。重组蛋白为分泌性表达,对宿主菌体生长影响较小,易于实现高密度发酵,便于工业化生产。
附图说明
图1是pETFLAG-CTC-Lpp-GLP-1(7-37)质粒图谱图。
图2是Lpp-ARG34-GLP-l(7-37)融合蛋白的SDS-PAGE图;其中,泳道M为蛋白marker,至上而下的分子量(KDa)为40、25、15、10、4.6、1.7;泳道1为W3110/pETFLAG-CTC-Lpp-GLP-1诱导0h的发酵液上清;泳道2为W3110/pETFLAG-CTC-Lpp-GLP-1诱导0h的细胞破碎上清;泳道3为W3110/pETFLAG-CTC-Lpp-GLP-1诱导0h的细胞破碎沉淀;泳道4为W3110/pETFLAG-CTC-Lpp-GLP-1诱导72h的发酵液上清;泳道5为W3110/pETFLAG-CTC-Lpp-GLP-1诱导72h的细胞破碎上清;泳道6为W3110/pETFLAG-CTC-Lpp-GLP-1诱导72h的细胞破碎沉淀;泳道7为BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1诱导72h的发酵液上清;泳道8是BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1诱导72h的细胞破碎上清;泳道9是BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1诱导72h的细胞破碎沉淀。
图3是利拉鲁肽质谱图。
图4是索玛鲁肽质谱图。
图5是检测Lpp信号肽-DDDDK-ARG34-GLP-l(7-37)融合蛋白的SDS-PAGE图;其中,泳道M为蛋白Marker,至上而下的分子量(KDa)为40、25、15、10、4.6、1.7;泳道1为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导0h的发酵液上清;泳道2为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导24h的发酵液上清;泳道3为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导48h的发酵液上清;泳道4为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导72h的发酵液上清;泳道5为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导72h的细胞破碎上清;泳道6为BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1诱导72h的细胞破碎沉淀。
图6是检测Lpp'-OmpA(46-66)-DDDDK-ARG34-GLP-1(11-37)和Lpp'-OmpA(46-159)-DDDDK-ARG34-GLP-1(11-37)的SDS-PAGE图;泳道1为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导0h的发酵液上清;泳道2为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导17h的发酵液上清;泳道3为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导24h的发酵液上清;泳道4为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导40h的发酵液上清;泳道5为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导64h的发酵液上清;泳道6为W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1诱导64h的细胞破碎上清;泳道M为蛋白Marker,至上而下的分子量(KDa)为40、25、15、10、4.6、1.7;泳道7为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导0h的发酵液上清;泳道8为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导17h的发酵液上清;泳道9为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导24h的发酵液上清;泳道10为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导40h的发酵液上清;泳道11为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导64h的发酵液上清;泳道12为W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1诱导64h的细胞破碎上清。
具体实施方式
下面结合实施例及附图对本发明作进一步详细的描述,但本发明的实施方式不限于此。
实施例1
技术服务公司合成基因序列,其翻译氨基酸序列具有如表1所示的以下特征:
表1
pET-28a(+)载体改造:PCR扩增pFLAG–CTC载体(Sigma)的tac启动子区域,引物两端加上BlpI和SphI酶切位点,BlpI和SphI双酶切PCR产物和pET-28a(+)载体,T4 DNA连接酶连接后化转大肠杆菌top10感受态细胞,PCR鉴定阳性克隆。提取质粒后获得启动子更换的pET-28a(+)载体,该载体命名为pETFLAG-CTC。
参照大肠杆菌密码子偏好性合成编码上述氨基酸序列的基因序列(如下所示,由技术服务公司合成)。以SEQ ID NO.4为例,引物两端加上BamHI和NdeI酶切位点,PCR扩增,BamHI和NdeI双酶切PCR产物和pETFLAG-CTC,T4 DNA连接酶连接后化转大肠杆菌top10感受态细胞,PCR鉴定阳性克隆,最后测序获得目的克隆。提取质粒后获得带有融合蛋白基因的pETFLAG-CTC载体,该重组表达载体命名为pETFLAG-CTC-Lpp-GLP-1(7-37),质粒图谱如图1所示。含有如下其他基因序列的重组表达载体的构建如前所述。
编码Lpp-DDDDK-ARG34-GLP-1(7-37)融合蛋白的核酸序列(SEQ ID NO.4):
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCTGCAGCAGCAACGCGAAAATTGATCAGCTGAGCAGCGATGTGCAGACCCTGAACGCGAAAGTGGATCAGCTGAGCAACGATGTGAACGCGATGCGCAGCGATGTGCAGGCGGCGAAAGATGATGCGGCGCGCGCGAACCAGCGCCTGGATAACATGGCGACCAAATATCGCAAAGATGATGATGATAAACATGCGGAAGGCACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC;
编码Lpp-DDDDK-ARG34-GLP-1(9-37)融合蛋白的核酸序列(SEQ ID NO.5):
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCTGCAGCAGCAACGCGAAAATTGATCAGCTGAGCAGCGATGTGCAGACCCTGAACGCGAAAGTGGATCAGCTGAGCAACGATGTGAACGCGATGCGCAGCGATGTGCAGGCGGCGAAAGATGATGCGGCGCGCGCGAACCAGCGCCTGGATAACATGGCGACCAAATATCGCAAAGATGATGATGATAAAGAAGGCACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC;
编码Lpp-DDDDK-ARG34-GLP-1(11-37)融合蛋白的核酸序列(SEQ ID NO.6):
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCTGCAGCAGCAACGCGAAAATTGATCAGCTGAGCAGCGATGTGCAGACCCTGAACGCGAAAGTGGATCAGCTGAGCAACGATGTGAACGCGATGCGCAGCGATGTGCAGGCGGCGAAAGATGATGCGGCGCGCGCGAACCAGCGCCTGGATAACATGGCGACCAAATATCGCAAAGATGATGATGATAAAACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC。
实施例2大肠杆菌转化及筛选
将实施例1得到的重组表达载体转化到大肠杆菌复制扩增,具体过程为:按照氯化钙法(参照《分子克隆实验指南》第三版)制备大肠杆菌top10感受态,取1μL重组表达载体加入到top10感受态中,冰浴30min,42℃热激90s,冰浴5min,加入1ml液体SOC培养基(2%w/v胰蛋白胨、0.5%w/v酵母提取物、0.05%w/v NaCl、2.5mM KCl、10mM MgCl2、20mM葡萄糖),在37℃摇床中振荡培养1h后涂布到LB固体培养基(含有50mg/L卡那霉素kan),37℃培养箱培养过夜,直至出现肉眼可见菌落。挑菌到LB液体培养基(蛋白胨10g/L,酵母提取物5g/L,氯化钠5g/L,pH7.0~7.5,含有50mg/L卡那霉素kan),37℃摇床中振荡培养至OD600=1-3,使用omega质粒提取试剂盒,按照说明书提取质粒。
按照相同的方法制备大肠杆菌BL21(DE3)和W3110化转感受态,转入重组表达载体。涂布到LB固体培养基(含有50mg/L kan),37℃培养箱培养过夜,直至出现肉眼可见菌落,得到的工程菌命名为BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1(X)和W3110/pETFLAG-CTC-Lpp-GLP-1(X),X表示分别含有如SEQ ID NO.4~6序列的三种重组表达载体。挑菌到LB液体培养基(蛋白胨10g/L,酵母提取物5g/L,氯化钠5g/L,pH7.0~7.5,含有50mg/L kan),37℃摇床中振荡培养至OD600=1-3,加入甘油至终浓度15%v/v,保存于-70℃冰箱。
实施例3、大肠杆菌发酵及纯化
实施例2保存的菌种制备种子培养物和20L发酵罐发酵工艺:
①制备种子培养物
取20μL在-70℃冷冻保存的菌种BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1(X)和W3110/pETFLAG-CTC-Lpp-GLP-1(X),分别接种至50mL添加了卡那霉素(Kanamycin,终浓度为50μg/mL)的LB液体培养基,在28℃,250rpm的摇床中培养16小时,从而活化菌种。再将50mL活化过的菌种接种到400mL添加了50μg/mL卡那霉素的LB液体培养基中,在28℃、250rpm的条件下继续培养3h,获得种子培养物,控制它的菌体浓度OD600在0.8~1.2之间。
②20L发酵罐中的发酵培养
使用20L搅拌式发酵罐(南京华龙公司),按照发酵培养基配方进行投料,投料体积为8L。严格控制发酵条件:温度控制在28℃~32℃之间,pH控制在6.5~7.0之间,发酵转速控制在150rpm~700rpm(根据DO的变化调控)之间,空气流速控制在200L/h~1600L/h之间(根据DO的变化调控),溶解氧(DO)控制在5~50%的最大氧饱和度之间。培养至碳源耗尽时开始补料,采用匀速补料(补料速率控制在0.6mL·min-1·L-1),补料的用量为4L。当培养至菌体浓度OD600≈30时,开始添加IPTG至终浓度0.3mM,开始诱导,诱导时间为72h。发酵培养基的配方如下:每升含有酵母粉2~5g、蛋白胨3~8g、氯化钠1~2g、磷酸二氢钾2~5g、磷酸氢二钠2~5g、二水合氯化钙0.01~0.02g、硫酸镁1~2g、甘油4~7g、硫酸铵5~7g、微量元素0.875mL;用水定容至1L,pH6.5~7.0;微量元素的组成如下:每升含有四水合氯化亚铁20~30g、氯化锌1~3g、六水合氯化钴2~4g、二水合钼酸钠2~4g、二水合氯化钙1~2g、二水合氯化铜1~2g、硼酸0.4~0.6g、一水合硫酸锰2~3g、浓度为质量百分比37%的浓盐酸10mL,用水定容至1L。补料培养基每L含有500g甘油、25g酵母粉、40g蛋白胨,水定容至1L。
取W3110/pETFLAG-CTC-Lpp-GLP-1(7-37)和BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1(7-37)发酵液上清稀释5倍后进行SDS-PAGE电泳,电泳结果见图2,其中泳道4的融合蛋白浓度为300mg/L,对应的发酵液上清融合蛋白浓度为1.5g/L,根据目的蛋白在融合蛋白中的占比可计算出目的蛋白ARG34-GLP-l(7-37)浓度为0.49g/L;泳道7的融合蛋白浓度为250mg/L,对应的发酵液上清融合蛋白浓度为1.25g/L,根据目的蛋白在融合蛋白中的占比可计算出目的蛋白ARG34-GLP-l(7-37)浓度为0.41g/L。从图2可知,Lpp作为分子伴侣融合目的蛋白得到的重组蛋白多数分泌到胞外培养基。发酵液离心后收集上清,使用离子交换层析纯化。纯化方法参考诺和诺德专利US6444788B1实施例1。根据在线网站(https://web.expasy.org/compute_pi/)计算等电点,成熟的Lpp-GLP-l融合蛋白具有和ARG34-GLP-l(7-37)接近的等电点,分别为4.98和5.53。
同样,对于W3110/pETFLAG-CTC-Lpp-GLP-1(9-37)、BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1(9-37)、W3110/pETFLAG-CTC-Lpp-GLP-1(11-37)、BL21(DE3)/pETFLAG-CTC-Lpp-GLP-1(11-37)的发酵上清进行检测和纯化,目的蛋白ARG34-GLP-l(9-37)和ARG34-GLP-l(11-37)亦为分泌性表达,可以得到目的蛋白ARG34-GLP-l(9-37)浓度为0.40g/L,目的蛋白ARG34-GLP-l(11-37)浓度为0.38g/L。可见,Lpp作为分子伴侣有利于重组蛋白进行分泌性表达。
实施例4、脂肪酸酰化剂合成及修饰
利拉鲁肽脂肪酸酰化剂合成参照专利CN97198413.1中例35,索玛鲁肽脂肪酸酰化剂合成参照专利CN201510459093.9实施例6。
利拉鲁肽脂肪酸酰化剂修饰实施例3获得的ARG34-GLP-1(7-37)融合蛋白参照专利CN97198413.1中例37,得到赖氨酸侧链酰化修饰的ARG34-GLP-1(7-37)融合蛋白。
索玛鲁肽脂肪酸酰化剂修饰实施3获得的ARG34-GLP-1(9-37)融合蛋白参照专利CN201510459093.9实施例9,得到赖氨酸侧链酰化修饰的ARG34-GLP-1(9-37)融合蛋白。
实施例5、修饰后融合蛋白酶切工艺及纯化
将实施例4中获得的经脂肪酸酰化剂修饰的融合蛋白收集液稀释后,用重组赖氨酰内切酶(购自wako日本和光纯药株式会社)进行酶切(按照说明书操作),25℃反应2h后调酸终止反应,经酶切即可得到利拉鲁肽单体,索玛鲁肽前体ARG34-GLP-1(9-37)。
将100mL含利拉鲁肽单体的酶切样品,含索玛鲁肽前体ARG34-GLP-1(9-37)的酶切样品上样到装有20mL Uni ps30-500填料(购自苏州纳微科技有限公司)的层析柱上(其预先用缓冲液1平衡,配方为含0.1%v/v TFA的20%v/v异丙醇的水溶液,上样结束后用平衡缓冲液2(含0.1%v/v TFA的20%v/v异丙醇的水溶液)冲洗至基线平稳。再以100ml的20~80%异丙醇线性梯度(含0.1%v/v TFA缓冲液)洗脱,收集洗脱峰,用等电点法调沉,冷冻干燥即得到利拉鲁肽成品,索玛鲁肽前体ARG34-GLP-1(9-37)。
实施例6保护二肽合成
Boc-His(Boc)-Aib-OH合成按照专利CN201510459093.9实施例26的方法合成。
实施例7转肽连接获得索玛鲁肽
实施例5获得的侧链修饰的索玛鲁肽前体ARG34-GLP-1(9-37)与实施例6获得的保护二肽转肽连接方法和纯化方法参照专利CN201510459093.9实施例10,制备得到索玛鲁肽成品。
实施例8利拉鲁肽,索玛鲁肽成品的鉴定
以利拉鲁肽注射液(购自丹麦诺和诺德公司)为对照品,用超高效液相色谱进行分析,发现本发明的利拉鲁肽成品与诺和诺德公司生产的利拉鲁肽对照品出峰时间一致。经赛默飞高分辨液质联用仪取主峰进行高分辨质谱分析,使用Thermo Biopharma Finder2.0解卷积,测得单同位素分子量为3748.9,质谱分析结果与利拉鲁肽对照一致,质谱图见图3。
以索玛鲁肽注射液(购自丹麦诺和诺德公司)为对照品,用超高效液相色谱进行分析,发现本发明的索玛鲁肽成品与诺和诺德公司生产的索玛鲁肽对照品出峰时间一致。经赛默飞高分辨液质联用仪取主峰进行高分辨质谱分析,使用Thermo Biopharma Finder2.0解卷积,测得单同位素分子量为4111.1,质谱分析结果与索玛鲁肽对照一致,质谱图见图4。
对比例1
使用Lpp信号肽或是Lpp'-OmpA(46-66)、Lpp'-OmpA(46-159)作为分子伴侣,目的蛋白为ARG34-GLP-l(7-37),连接肽为DDDDK,参照大肠杆菌密码子偏好性合成如下序列:
Lpp信号肽-DDDDK-ARG34-GLP-l(7-37)融合蛋白的氨基酸序列如下:
编码Lpp信号肽-DDDDK-ARG34-GLP-l(7-37)融合蛋白的核酸序列如下:
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCGATGATGATGATAAACATGCGGAAGGCACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC。
Lpp'-OmpA(46-66)-DDDDK-ARG34-GLP-1(11-37)(Lpp信号肽+Lpp蛋白1-9氨基酸+OmpA蛋白的第46-66位氨基酸)融合蛋白的氨基酸序列如下:
其中,Lpp信号肽为第1~20位,Lpp蛋白1-9氨基酸为第21~29位,连接Lpp’和OmpA的连接肽为第30~31位。
编码Lpp'-OmpA(46-66)-DDDDK-ARG34-GLP-1(11-37)融合蛋白的核酸序列如下:
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCTGCAGCAGCAACGCGAAAATTGATCAGGGCATTAACCCGTATGTGGGCTTTGAAATGGGCTATGATTGGCTGGGCCGCATGCCGTATAAAGGCAGCGATGATGATGATAAAACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC。
Lpp'-OmpA(46-159)-DDDDK-ARG34-GLP-1(11-37)(Lpp信号肽+Lpp蛋白1-9氨基酸+OmpA蛋白的第46-159位氨基酸)融合蛋白的序列如下:
其中,Lpp信号肽为第1~20位,Lpp蛋白1-9氨基酸为第21~29位,连接Lpp’和OmpA的连接肽为第30~31位。
编码Lpp'-OmpA(46-159)-DDDDK-ARG34-GLP-1(11-37)融合蛋白的核酸序列如下:
ATGAAAGCGACCAAACTGGTGCTGGGCGCGGTGATTCTGGGCAGCACCCTGCTGGCGGGCTGCAGCAGCAACGCGAAAATTGATCAGGGCATTAACCCGTATGTGGGCTTTGAAATGGGCTATGATTGGCTGGGCCGCATGCCGTATAAAGGCAGCGTGGAAAACGGCGCGTATAAAGCGCAGGGCGTGCAGCTGACCGCGAAACTGGGCTATCCGATTACCGATGATCTGGATATTTATACCCGCCTGGGCGGCATGGTGTGGCGCGCGGATACCAAAAGCAACGTGTATGGCAAAAACCATGATACCGGCGTGAGCCCGGTGTTTGCGGGCGGCGTGGAATATGCGATTACCCCGGAAATTGCGACCCGCCTGGAATATCAGTGGACCAACAACATTGGCGATGCGCATACCATTGGCACCCGCCCGGATAACGATGATGATGATAAAACCTTTACCAGCGATGTGAGCAGCTATCTGGAAGGCCAGGCGGCGAAAGAATTTATTGCGTGGCTGGTGCGCGGCCGCGGC。
依据实施例1的步骤得到重组表达载体,依据实施例2得到表达菌种BL21(DE3)/pETFLAG-CTC-Lpp信号肽-GLP-1、W3110/pETFLAG-CTC-Lpp'-OmpA(46-66)-GLP-1、W3110/pETFLAG-CTC-Lpp'-OmpA(46-159)-GLP-1,依据实施例3进行表达,结果如图5和图6所示:图5的结果表明Lpp信号肽无法表达GLP-1;图6的结果表明Lpp信号肽+部分Lpp蛋白+OmpA部分蛋白作为分子伴侣无法表达GLP-1。
上述实施例为本发明较佳的实施方式,但本发明的实施方式并不受上述实施例的限制,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。
序列表
<110> 珠海联邦制药股份有限公司
<120> Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用
<160> 13
<170> SIPOSequenceListing 1.0
<210> 1
<211> 114
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp – DDDDK-ARG34-GLP-l(7-37)融合蛋白
<400> 1
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Leu Ser Ser
20 25 30
Asp Val Gln Thr Leu Asn Ala Lys Val Asp Gln Leu Ser Asn Asp Val
35 40 45
Asn Ala Met Arg Ser Asp Val Gln Ala Ala Lys Asp Asp Ala Ala Arg
50 55 60
Ala Asn Gln Arg Leu Asp Asn Met Ala Thr Lys Tyr Arg Lys Asp Asp
65 70 75 80
Asp Asp Lys His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr
85 90 95
Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly
100 105 110
Arg Gly
<210> 2
<211> 112
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp – DDDDK-ARG34-GLP-l(9-37)融合蛋白
<400> 2
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Leu Ser Ser
20 25 30
Asp Val Gln Thr Leu Asn Ala Lys Val Asp Gln Leu Ser Asn Asp Val
35 40 45
Asn Ala Met Arg Ser Asp Val Gln Ala Ala Lys Asp Asp Ala Ala Arg
50 55 60
Ala Asn Gln Arg Leu Asp Asn Met Ala Thr Lys Tyr Arg Lys Asp Asp
65 70 75 80
Asp Asp Lys Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu
85 90 95
Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly
100 105 110
<210> 3
<211> 110
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp – DDDDK-ARG34-GLP-l(11-37)融合蛋白
<400> 3
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Leu Ser Ser
20 25 30
Asp Val Gln Thr Leu Asn Ala Lys Val Asp Gln Leu Ser Asn Asp Val
35 40 45
Asn Ala Met Arg Ser Asp Val Gln Ala Ala Lys Asp Asp Ala Ala Arg
50 55 60
Ala Asn Gln Arg Leu Asp Asn Met Ala Thr Lys Tyr Arg Lys Asp Asp
65 70 75 80
Asp Asp Lys Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln
85 90 95
Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly
100 105 110
<210> 4
<211> 342
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp- DDDDK-ARG34-GLP-1(7-37)融合蛋白的核酸序列
<400> 4
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
tgcagcagca acgcgaaaat tgatcagctg agcagcgatg tgcagaccct gaacgcgaaa 120
gtggatcagc tgagcaacga tgtgaacgcg atgcgcagcg atgtgcaggc ggcgaaagat 180
gatgcggcgc gcgcgaacca gcgcctggat aacatggcga ccaaatatcg caaagatgat 240
gatgataaac atgcggaagg cacctttacc agcgatgtga gcagctatct ggaaggccag 300
gcggcgaaag aatttattgc gtggctggtg cgcggccgcg gc 342
<210> 5
<211> 336
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp- DDDDK- ARG34-GLP-1(9-37)融合蛋白的核酸序列
<400> 5
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
tgcagcagca acgcgaaaat tgatcagctg agcagcgatg tgcagaccct gaacgcgaaa 120
gtggatcagc tgagcaacga tgtgaacgcg atgcgcagcg atgtgcaggc ggcgaaagat 180
gatgcggcgc gcgcgaacca gcgcctggat aacatggcga ccaaatatcg caaagatgat 240
gatgataaag aaggcacctt taccagcgat gtgagcagct atctggaagg ccaggcggcg 300
aaagaattta ttgcgtggct ggtgcgcggc cgcggc 336
<210> 6
<211> 330
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp- DDDDK- ARG34-GLP-1(11-37)融合蛋白的核酸序列
<400> 6
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
tgcagcagca acgcgaaaat tgatcagctg agcagcgatg tgcagaccct gaacgcgaaa 120
gtggatcagc tgagcaacga tgtgaacgcg atgcgcagcg atgtgcaggc ggcgaaagat 180
gatgcggcgc gcgcgaacca gcgcctggat aacatggcga ccaaatatcg caaagatgat 240
gatgataaaa cctttaccag cgatgtgagc agctatctgg aaggccaggc ggcgaaagaa 300
tttattgcgt ggctggtgcg cggccgcggc 330
<210> 7
<211> 78
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp的氨基酸序列
<400> 7
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Leu Ser Ser
20 25 30
Asp Val Gln Thr Leu Asn Ala Lys Val Asp Gln Leu Ser Asn Asp Val
35 40 45
Asn Ala Met Arg Ser Asp Val Gln Ala Ala Lys Asp Asp Ala Ala Arg
50 55 60
Ala Asn Gln Arg Leu Asp Asn Met Ala Thr Lys Tyr Arg Lys
65 70 75
<210> 8
<211> 56
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp信号肽- DDDDK- ARG34-GLP-l(7-37)融合蛋白
<400> 8
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Asp Asp Asp Asp Lys His Ala Glu Gly Thr Phe Thr
20 25 30
Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile
35 40 45
Ala Trp Leu Val Arg Gly Arg Gly
50 55
<210> 9
<211> 168
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp信号肽- DDDDK- ARG34-GLP-l(7-37)融合蛋白的核酸序列
<400> 9
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
gatgatgatg ataaacatgc ggaaggcacc tttaccagcg atgtgagcag ctatctggaa 120
ggccaggcgg cgaaagaatt tattgcgtgg ctggtgcgcg gccgcggc 168
<210> 10
<211> 84
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp'-OmpA(46-66)- DDDDK-ARG34-GLP-1(11-37)融合蛋白
<400> 10
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Gly Ile Asn
20 25 30
Pro Tyr Val Gly Phe Glu Met Gly Tyr Asp Trp Leu Gly Arg Met Pro
35 40 45
Tyr Lys Gly Ser Asp Asp Asp Asp Lys Thr Phe Thr Ser Asp Val Ser
50 55 60
Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val
65 70 75 80
Arg Gly Arg Gly
<210> 11
<211> 252
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp'-OmpA(46-66)- DDDDK-ARG34-GLP-1(11-37)融合蛋白的核酸序列
<400> 11
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
tgcagcagca acgcgaaaat tgatcagggc attaacccgt atgtgggctt tgaaatgggc 120
tatgattggc tgggccgcat gccgtataaa ggcagcgatg atgatgataa aacctttacc 180
agcgatgtga gcagctatct ggaaggccag gcggcgaaag aatttattgc gtggctggtg 240
cgcggccgcg gc 252
<210> 12
<211> 177
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Lpp'-OmpA(46-159)- DDDDK-ARG34-GLP-1(11-37)融合蛋白
<400> 12
Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr
1 5 10 15
Leu Leu Ala Gly Cys Ser Ser Asn Ala Lys Ile Asp Gln Gly Ile Asn
20 25 30
Pro Tyr Val Gly Phe Glu Met Gly Tyr Asp Trp Leu Gly Arg Met Pro
35 40 45
Tyr Lys Gly Ser Val Glu Asn Gly Ala Tyr Lys Ala Gln Gly Val Gln
50 55 60
Leu Thr Ala Lys Leu Gly Tyr Pro Ile Thr Asp Asp Leu Asp Ile Tyr
65 70 75 80
Thr Arg Leu Gly Gly Met Val Trp Arg Ala Asp Thr Lys Ser Asn Val
85 90 95
Tyr Gly Lys Asn His Asp Thr Gly Val Ser Pro Val Phe Ala Gly Gly
100 105 110
Val Glu Tyr Ala Ile Thr Pro Glu Ile Ala Thr Arg Leu Glu Tyr Gln
115 120 125
Trp Thr Asn Asn Ile Gly Asp Ala His Thr Ile Gly Thr Arg Pro Asp
130 135 140
Asn Asp Asp Asp Asp Lys Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu
145 150 155 160
Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg
165 170 175
Gly
<210> 13
<211> 531
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 编码Lpp'-OmpA(46-159)- DDDDK-ARG34-GLP-1(11-37) 融合蛋白的核酸序列
<400> 13
atgaaagcga ccaaactggt gctgggcgcg gtgattctgg gcagcaccct gctggcgggc 60
tgcagcagca acgcgaaaat tgatcagggc attaacccgt atgtgggctt tgaaatgggc 120
tatgattggc tgggccgcat gccgtataaa ggcagcgtgg aaaacggcgc gtataaagcg 180
cagggcgtgc agctgaccgc gaaactgggc tatccgatta ccgatgatct ggatatttat 240
acccgcctgg gcggcatggt gtggcgcgcg gataccaaaa gcaacgtgta tggcaaaaac 300
catgataccg gcgtgagccc ggtgtttgcg ggcggcgtgg aatatgcgat taccccggaa 360
attgcgaccc gcctggaata tcagtggacc aacaacattg gcgatgcgca taccattggc 420
acccgcccgg ataacgatga tgatgataaa acctttacca gcgatgtgag cagctatctg 480
gaaggccagg cggcgaaaga atttattgcg tggctggtgc gcggccgcgg c 531
Claims (7)
1.Lpp作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用,其特征在于:
所述的重组蛋白的结构如下: A-B-C;其中,
A为分子伴侣,选自Lpp;
B为用于蛋白酶识别的连接肽DDDDK;
C为目的蛋白;
所述的Lpp的氨基酸序列如下所示:
MKATKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYRK;
所述的重组蛋白的氨基酸序列如SEQ ID NO.1、SEQ ID NO.2或SEQ ID NO.3所示。
2.一种用于大肠杆菌分泌表达的融合蛋白,其特征在于:是基于权利要求1所述的应用设计得到;所述的融合蛋白的结构式如下:A-B-C;其中,
A为分子伴侣,选自Lpp;
B为用于蛋白酶识别的连接肽DDDDK;
C为目的蛋白;
所述的Lpp的氨基酸序列如下所示:
MKATKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYRK;
所述的重组蛋白的氨基酸序列如SEQ ID NO.1、SEQ ID NO.2或SEQ ID NO.3所示。
3.一种大肠杆菌分泌表达融合蛋白的方法,其特征在于包括以下步骤:
(1)获得编码权利要求2所述用于大肠杆菌分泌表达的融合蛋白的融合基因,将融合基因构建到表达载体,得到重组载体;
(2)将重组载体转化宿主细胞;
(3)将含有重组载体的宿主细胞发酵、纯化,获得融合蛋白。
4.根据权利要求3所述的大肠杆菌分泌表达融合蛋白的方法,其特征在于还包括以下步骤:
(4)对得到的融合蛋白进行脂肪酸侧链修饰;
(5)蛋白酶酶切脂肪酸侧链修饰后的融合蛋白的连接肽,获得侧链修饰的融合蛋白;
(6)如有必要,对侧链修饰的融合蛋白进行转肽,连接另外一段多肽。
5.根据权利要求3或4所述的大肠杆菌分泌表达融合蛋白的方法,其特征在于:
步骤(1)中所述的融合基因通过直接合成法获得,或是通过片段拼接得到;
步骤(1)中所述的表达载体选自大肠杆菌常用载体或将tac启动子替换pET载体中的T7启动子得到的载体;
步骤(2)中所述的宿主细胞为野生型或改造型大肠杆菌;
步骤(3)中所述的发酵是在发酵后期添加诱导剂,诱导表达。
6.根据权利要求5所述的大肠杆菌分泌表达融合蛋白的方法,其特征在于:
所述的pET载体为pET-28a(+)载体;
所述的宿主细胞为大肠杆菌BL21(DE3)或其改造菌、大肠杆菌W3110或其改造菌;
所述的诱导剂为IPTG。
7.根据权利要求4所述的大肠杆菌分泌表达融合蛋白的方法,其特征在于:
步骤(4)中所述的脂肪酸侧链修饰为使用利拉鲁肽或索玛鲁肽脂肪酸酰化剂进行修饰;
步骤(5)中所述的蛋白酶为肠激酶、胰蛋白酶和赖氨酰内切酶中的至少一种。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011620414.6A CN114686504B (zh) | 2020-12-30 | 2020-12-30 | Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011620414.6A CN114686504B (zh) | 2020-12-30 | 2020-12-30 | Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114686504A CN114686504A (zh) | 2022-07-01 |
CN114686504B true CN114686504B (zh) | 2023-11-17 |
Family
ID=82134281
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011620414.6A Active CN114686504B (zh) | 2020-12-30 | 2020-12-30 | Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114686504B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115029404B (zh) * | 2021-03-04 | 2024-05-14 | 珠海联邦制药股份有限公司 | 用于lpp单基因敲除或突变的大肠杆菌分泌表达短肽类蛋白的发酵培养基及应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110724187A (zh) * | 2018-07-16 | 2020-01-24 | 甘李药业股份有限公司 | 一种高效表达利拉鲁肽前体的重组工程菌及其应用 |
WO2020020438A1 (de) * | 2018-07-24 | 2020-01-30 | Wacker Chemie Ag | Neue bakterielle lpp-mutante und deren verwendung zur sekretorischen herstellung von rekombinanten proteinen |
-
2020
- 2020-12-30 CN CN202011620414.6A patent/CN114686504B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110724187A (zh) * | 2018-07-16 | 2020-01-24 | 甘李药业股份有限公司 | 一种高效表达利拉鲁肽前体的重组工程菌及其应用 |
WO2020020438A1 (de) * | 2018-07-24 | 2020-01-30 | Wacker Chemie Ag | Neue bakterielle lpp-mutante und deren verwendung zur sekretorischen herstellung von rekombinanten proteinen |
Non-Patent Citations (2)
Title |
---|
Meisam Jeiranikhameneh 等.Designing novel construction for cell surface display of protein E on Escherichia coli using non‑classical pathway based on Lpp‑OmpA.AMB Expr.2017,第7卷全文. * |
李朋彦 等.利拉鲁肽前体肽GLP-1(7-37)K34R在大肠埃希菌中的表达与优化.医药导报.2020,全文. * |
Also Published As
Publication number | Publication date |
---|---|
CN114686504A (zh) | 2022-07-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115975047B (zh) | 一种重组融合蛋白生产多肽的方法及其应用 | |
CN111072783B (zh) | 一种采用大肠杆菌表达串联序列制备glp-1或其类似物多肽的方法 | |
US8298789B2 (en) | Orthogonal process for purification of recombinant human parathyroid hormone (rhPTH) (1-34) | |
US8877895B2 (en) | Prokaryotic expression construct | |
CN110724187B (zh) | 一种高效表达利拉鲁肽前体的重组工程菌及其应用 | |
US10000544B2 (en) | Process for production of insulin and insulin analogues | |
US6391585B1 (en) | Process for preparing recombinant proteins using highly efficient expression vector from saccharomyces cerevisiae | |
CN110257347B (zh) | 硫氧还蛋白突变体、其制备方法及其在重组融合蛋白生产中的应用 | |
CN114686504B (zh) | Lpp或其突变体作为分子伴侣在大肠杆菌中分泌表达重组蛋白的应用 | |
KR20080039879A (ko) | Pdi를 융합파트너로 이용한 수용성 및 활성형 재조합단백질의 제조방법 | |
JP7266325B2 (ja) | 蛍光タンパク質フラグメントを含む融合タンパク質およびその用途 | |
AU2019218315A1 (en) | Codon optimized precursor gene and signal peptide gene of human insulin analogue | |
CN114380903B (zh) | 一种胰岛素或其类似物前体 | |
JP2021511785A (ja) | 組換えポリペプチド生産用n末端融合パートナーおよびこれを用いた組換えポリペプチドの生産方法 | |
CN110358770B (zh) | 一种酵母生物合成芋螺毒素的方法 | |
JP2022548598A (ja) | 組換え治療用ペプチドの発現のためのn末端伸長配列 | |
Rao et al. | Expression, purification, and characterisation of nesiritide using an E. coli expression system | |
CN117964708A (zh) | 一种内含肽及其突变体的应用 | |
CN115029404A (zh) | 用于lpp单基因敲除或突变的大肠杆菌高效分泌表达短肽类蛋白的发酵培养基及应用 | |
EP2867250A2 (en) | Proinsulin with enhanced helper sequence | |
US20060234351A1 (en) | Yeast protein expression secretion system | |
CN116656516A (zh) | 基于两拷贝基因筛选高表达眼镜王蛇联合肽damp4-oh30重组菌株的方法 | |
CN114381471A (zh) | 一种辅助蛋白在重组蛋白生产中的应用及融合表达系统 | |
JP2000078989A (ja) | ヒト成長ホルモンの分泌生産方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |