CN114350587B - 一种基因重组串联表达利那洛肽的工程菌 - Google Patents
一种基因重组串联表达利那洛肽的工程菌 Download PDFInfo
- Publication number
- CN114350587B CN114350587B CN202210082694.2A CN202210082694A CN114350587B CN 114350587 B CN114350587 B CN 114350587B CN 202210082694 A CN202210082694 A CN 202210082694A CN 114350587 B CN114350587 B CN 114350587B
- Authority
- CN
- China
- Prior art keywords
- cys
- lys
- ala
- gly
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- KXGCNMMJRFDFNR-WDRJZQOASA-N linaclotide Chemical compound C([C@H](NC(=O)[C@@H]1CSSC[C@H]2C(=O)N[C@H]3CSSC[C@H](N)C(=O)N[C@H](C(N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N2)=O)CSSC[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]2CCCN2C(=O)[C@H](CC(N)=O)NC3=O)C(=O)N[C@H](C(NCC(=O)N1)=O)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 KXGCNMMJRFDFNR-WDRJZQOASA-N 0.000 title claims abstract description 88
- 108010024409 linaclotide Proteins 0.000 title claims abstract description 84
- 229960000812 linaclotide Drugs 0.000 title claims abstract description 84
- 241000894006 Bacteria Species 0.000 title claims abstract description 63
- 108090000623 proteins and genes Proteins 0.000 title claims description 45
- 238000005215 recombination Methods 0.000 title claims description 14
- 230000006798 recombination Effects 0.000 title claims description 14
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 27
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 27
- 238000000855 fermentation Methods 0.000 claims description 40
- 230000004151 fermentation Effects 0.000 claims description 40
- 239000001963 growth medium Substances 0.000 claims description 19
- 229930027917 kanamycin Natural products 0.000 claims description 18
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 claims description 18
- 229960000318 kanamycin Drugs 0.000 claims description 18
- 229930182823 kanamycin A Natural products 0.000 claims description 18
- 238000007363 ring formation reaction Methods 0.000 claims description 14
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims description 12
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 7
- 239000008103 glucose Substances 0.000 claims description 7
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 claims description 6
- 230000015572 biosynthetic process Effects 0.000 claims description 6
- 230000006698 induction Effects 0.000 claims description 6
- 229910052760 oxygen Inorganic materials 0.000 claims description 6
- 239000001301 oxygen Substances 0.000 claims description 6
- 238000003786 synthesis reaction Methods 0.000 claims description 6
- 238000004321 preservation Methods 0.000 claims description 5
- 102000004190 Enzymes Human genes 0.000 claims description 4
- 108090000790 Enzymes Proteins 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 4
- 238000009629 microbiological culture Methods 0.000 claims description 2
- 238000012807 shake-flask culturing Methods 0.000 claims 3
- 241000588722 Escherichia Species 0.000 claims 1
- 230000003321 amplification Effects 0.000 claims 1
- 238000011081 inoculation Methods 0.000 claims 1
- 238000003199 nucleic acid amplification method Methods 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 abstract description 10
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 69
- 108010004073 cysteinylcysteine Proteins 0.000 description 45
- 229920001184 polypeptide Polymers 0.000 description 42
- 108090000765 processed proteins & peptides Proteins 0.000 description 42
- 102000004196 processed proteins & peptides Human genes 0.000 description 42
- 230000004927 fusion Effects 0.000 description 41
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 30
- 238000000034 method Methods 0.000 description 28
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 27
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 26
- LLUXQOVDMQZMPJ-KKUMJFAQSA-N Cys-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CC1=CC=C(O)C=C1 LLUXQOVDMQZMPJ-KKUMJFAQSA-N 0.000 description 25
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 24
- 239000002773 nucleotide Substances 0.000 description 24
- 125000003729 nucleotide group Chemical group 0.000 description 24
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 23
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 23
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 23
- 150000001413 amino acids Chemical class 0.000 description 21
- 210000004027 cell Anatomy 0.000 description 21
- 108010069495 cysteinyltyrosine Proteins 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 20
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 20
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 20
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 20
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 19
- CFVQPNSCQMKDPB-CIUDSAMLSA-N Lys-Cys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N CFVQPNSCQMKDPB-CIUDSAMLSA-N 0.000 description 19
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 18
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 18
- 238000001976 enzyme digestion Methods 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 16
- 238000010828 elution Methods 0.000 description 16
- 239000012071 phase Substances 0.000 description 16
- 239000000047 product Substances 0.000 description 15
- 102000004169 proteins and genes Human genes 0.000 description 15
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 12
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 12
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 12
- 239000004202 carbamide Substances 0.000 description 12
- 238000013461 design Methods 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 11
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 10
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 10
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 10
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 10
- 108010092854 aspartyllysine Proteins 0.000 description 10
- 238000001514 detection method Methods 0.000 description 10
- 108010078144 glutaminyl-glycine Proteins 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 239000000872 buffer Substances 0.000 description 9
- 238000003776 cleavage reaction Methods 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- 239000002054 inoculum Substances 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 230000007017 scission Effects 0.000 description 9
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 9
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 8
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 8
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 8
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 8
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 8
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 8
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 8
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 8
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 8
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 8
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 8
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 8
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 8
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 8
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 8
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 8
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 8
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 8
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 8
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 8
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 8
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 8
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 8
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 8
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 8
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 8
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 8
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 8
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 8
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 8
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 8
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 8
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 8
- 108010038633 aspartylglutamate Proteins 0.000 description 8
- 238000005119 centrifugation Methods 0.000 description 8
- 230000001276 controlling effect Effects 0.000 description 8
- 238000012258 culturing Methods 0.000 description 8
- 230000029087 digestion Effects 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 238000012795 verification Methods 0.000 description 7
- 108010053070 Glutathione Disulfide Proteins 0.000 description 6
- 102000035195 Peptidases Human genes 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 229960000583 acetic acid Drugs 0.000 description 6
- 239000012295 chemical reaction liquid Substances 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 239000012149 elution buffer Substances 0.000 description 6
- 239000006167 equilibration buffer Substances 0.000 description 6
- 239000012362 glacial acetic acid Substances 0.000 description 6
- YPZRWBKMTBYPTK-BJDJZHNGSA-N glutathione disulfide Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@H](C(=O)NCC(O)=O)CSSC[C@@H](C(=O)NCC(O)=O)NC(=O)CC[C@H](N)C(O)=O YPZRWBKMTBYPTK-BJDJZHNGSA-N 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 239000000843 powder Substances 0.000 description 6
- 230000001502 supplementing effect Effects 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- ZOOGRGPOEVQQDX-UUOKFMHZSA-N 3',5'-cyclic GMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-UUOKFMHZSA-N 0.000 description 5
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 5
- 239000012564 Q sepharose fast flow resin Substances 0.000 description 5
- 229920002684 Sepharose Polymers 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 125000004122 cyclic group Chemical group 0.000 description 5
- 229960003180 glutathione Drugs 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000035939 shock Effects 0.000 description 5
- 238000009423 ventilation Methods 0.000 description 5
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 4
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 4
- 102000003670 Carboxypeptidase B Human genes 0.000 description 4
- 108090000087 Carboxypeptidase B Proteins 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 4
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 4
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 4
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- 108010053229 Lysyl endopeptidase Proteins 0.000 description 4
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 4
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 235000011114 ammonium hydroxide Nutrition 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 238000004108 freeze drying Methods 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000013028 medium composition Substances 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-diisopropylethylamine Substances CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 238000001742 protein purification Methods 0.000 description 3
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 2
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 2
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 2
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 206010010774 Constipation Diseases 0.000 description 2
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 2
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 2
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 2
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 2
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- 244000097724 Mesua ferrea Species 0.000 description 2
- 235000010931 Mesua ferrea Nutrition 0.000 description 2
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 2
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 2
- PTYVBBNIAQWUFV-DCAQKATOSA-N Met-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N PTYVBBNIAQWUFV-DCAQKATOSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 2
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 235000005704 Olneya tesota Nutrition 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 2
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 2
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 2
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 2
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 2
- 235000008198 Prosopis juliflora Nutrition 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 108010081447 cytochrophin-4 Proteins 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 208000002551 irritable bowel syndrome Diseases 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 238000000074 matrix-assisted laser desorption--ionisation tandem time-of-flight detection Methods 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- 238000001308 synthesis method Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- BDNKZNFMNDZQMI-UHFFFAOYSA-N 1,3-diisopropylcarbodiimide Chemical compound CC(C)N=C=NC(C)C BDNKZNFMNDZQMI-UHFFFAOYSA-N 0.000 description 1
- 102100034605 Atrial natriuretic peptide receptor 3 Human genes 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- NZNMSOFKMUBTKW-UHFFFAOYSA-N Cyclohexanecarboxylic acid Natural products OC(=O)C1CCCCC1 NZNMSOFKMUBTKW-UHFFFAOYSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 102000000820 Enterotoxin Receptors Human genes 0.000 description 1
- 108010001687 Enterotoxin Receptors Proteins 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- 101000924488 Homo sapiens Atrial natriuretic peptide receptor 3 Proteins 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241001614181 Phera Species 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000008484 agonism Effects 0.000 description 1
- AFVLVVWMAFSXCK-VMPITWQZSA-N alpha-cyano-4-hydroxycinnamic acid Chemical compound OC(=O)C(\C#N)=C\C1=CC=C(O)C=C1 AFVLVVWMAFSXCK-VMPITWQZSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 230000013872 defecation Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000013401 experimental design Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 125000001475 halogen functional group Chemical group 0.000 description 1
- 229940042040 innovative drug Drugs 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000033116 oxidation-reduction process Effects 0.000 description 1
- YPZRWBKMTBYPTK-UHFFFAOYSA-N oxidized gamma-L-glutamyl-L-cysteinylglycine Natural products OC(=O)C(N)CCC(=O)NC(C(=O)NCC(O)=O)CSSCC(C(=O)NCC(O)=O)NC(=O)CCC(N)C(O)=O YPZRWBKMTBYPTK-UHFFFAOYSA-N 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000002994 raw material Chemical class 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000001028 reflection method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Chemical class 0.000 description 1
- 229920005989 resin Chemical class 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 208000009935 visceral pain Diseases 0.000 description 1
- 239000012224 working solution Substances 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明涉及了用于高表达利那洛肽的工程菌,通过特定数量的利那洛肽与重组标签串联获得融合蛋白,并用于利那洛肽的表达生产,极大地提高了利那洛肽的产率。
Description
技术领域
本发明属于药物化学领域,涉及一种重组工程菌,具体涉及一种基因重组串联表达利那洛肽的融合蛋白及其制备方法。
背景技术
利那洛肽(linaclotide)是Ironwood公司开发的治疗肠易激综合征(IBS-C)及成人慢性特发性便秘(CIC)的创新药物,全球年销售额在10亿美元以上,属于“重磅炸弹”药物,2019年在中国获批上市(产品名:令则舒)。该药物可以与肠道细胞表面的鸟苷酸环化酶C结合,促进胞内和胞外cGMP浓度增加,从而刺激肠液分泌,促进肠活动导致排便次数增多,同时兼具缓解内脏疼痛的作用。
原研和国内目前已报道的制备方法全都采用多肽固相合成,中国专利CN103626849A公开了一种合成方法,虽然其总收率据记载最高可达69.60%,但该方法在后期环化形成二硫键时需要分三步进行,操作非常复杂,产业化意义不大;中国专利CN104163853A、CN104231051A、CN102875655A、CN104844693A分别公开了利那洛肽的合成方法,总收率据记载最高为27%-43.5%,合成时需要使用昂贵的修饰氨基酸、树脂等原料,成本依然较高,而且生产过程需要使用N,N-二甲基甲酰胺(DMF)、N,N-二异丙基乙胺(DIPEA)、N,N-二异丙基碳二亚胺(DIC)、三氟乙酸(TFA)、二甲基亚砜(DMSO)、无水乙醚、乙腈等大量有机溶剂,生产成本和环保成本都很高。为了克服现有技术的不足,本案设计了一种基于生物法的制备工艺,即采用基因重组串联表达生产,过程只需要葡萄糖和几种无机盐以及少量乙腈,大大降低了生产成本,且生产过程绿色环保。
发明内容
本发明要解决的技术问题是提供一种生产成本低、生产过程绿色环保的串联表达利那洛肽的基因工程菌,以及其制备方法。
本发明所要解决的技术问题是通过以下技术方案来实现的:
一种基因重组串联表达利那洛肽的融合蛋白,其特征在于,
由TrxA融合标签和利那洛肽串联表达制备;
其中,所述TrxA融合标签包含来自SEQ ID NO:1所述核苷酸序列:
SEQ ID NO:1:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcat
进一步地,所述TrxA融合标签包含来自SEQ ID NO:2所述氨基酸序列:
SEQ ID NO:2:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHH
其中,利那洛肽包含来自SEQ ID NO:3所述核苷酸序列:
SEQ ID NO:3:
tgttgcgagtactgctgcaacccggcctgcaccggttgttat
进一步地,所述利那洛肽包含来自SEQ ID NO:4所述氨基酸序列:
SEQ ID NO:4:
CCEYCCNPACTGCY。
所述利那洛肽结合在TrxA融合标签的N末端或C末端,优选地,融合在TrxA融合标签的C末端。
优选地,所述利那洛肽串联数为1-10个,优选地,串联数为3-8个。
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:5所述核苷酸序列。
SEQ ID NO:5:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:6所述氨基酸序列。
SEQ ID NO:6:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:7
所述核苷酸序列。
SEQ ID NO:7:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:8所述氨基酸序列。
SEQ ID NO:8:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:9
所述核苷酸序列。
SEQ ID NO:9:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatactgctgcaacccggcatgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:10所述氨基酸序列。
SEQ ID NO:10:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:11所述核苷酸序列。
SEQ ID NO:11:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcacaggttgttataaatgttgtgaatactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaatccggcatgtaccggttgttataaatgttgtgaatactgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:12所述氨基酸序列。
SEQ ID NO:12:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:13所述核苷酸序列。
SEQ ID NO:13:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcacaggttgttataaatgttgtgaatactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaatccggcatgtaccggttgttataaatgttgtgaatactgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:14所述氨基酸序列。
SEQ ID NO:14:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:15所述核苷酸序列。
SEQ ID NO:15:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgcgagtactgctgcaacccggcctgcacaggttgttataaatgttgtgaatactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaatccggcatgtaccggttgttataaatgttgtgaatactgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:16所述氨基酸序列。
SEQ ID NO:16:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:17所述核苷酸序列。
SEQ ID NO:17:
atggcagacaaaatcatccacctgaccgacgactctttcgacaccgacgttctgaaagcggacggtgcgatcctggttgacttctgggcggaatggtgcggtccgtgcaaaatgatcgcgccgatcctggacgaaatcgcggacgaataccagggtaaactgaccgttgcgaaactgaacatcgaccagaacccgggtaccgcgccgaaatacggtatccgtggtatcccgaccctgctgctgttcaaaaacggtgaagttgcggcgaccaaagttggtgcgctgtctaaaggtcagctgaaagaattcctggacgcgaacctggcgggttctggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcacaggttgttataaatgttgtgaatactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaatccggcatgtaccggttgttataaatgttgtgaatactgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatactgctgcaacccggcatgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:18所述氨基酸序列。
SEQ ID NO:18:
MADKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽融合蛋白由SUMO融合标签和利那洛肽串联表达制备;
其中,所述SUMO融合标签包含来自SEQ ID NO:19所述核苷酸序列:
SEQ ID NO:19:
atggggtcgagccaccatcatcatcaccacagctcaggacttgtgccgcgcggtagtcacatgtcggattctgaagtcaaccaggaagctaagcctgaagtcaagcctgaggttaaacccgaaacacacatcaacctgaaagtttcagacggcagcagcgagattttcttcaagattaaaaaaacaacaccgcttcgtcgccttatggaggcgtttgcgaagcgccaaggaaaggagatggacagtcttcgcttcttgtatgatggtatccgtattcaggcggaccaaacaccagaggaccttgatatggaggacaacgatattattgaggcgcaccgcgaacaaattggggga
进一步地,所述SUMO融合标签包含来自SEQ ID NO:20所述氨基酸序列:
SEQ ID NO:20:
MGSSHHHHHHSSGLVPRGSHMSDSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYDGIRIQADQTPEDLDMEDNDIIEAHREQIGG其中,利那洛肽包含来自SEQ IDNO:3所述核苷酸序列:
SEQ ID NO:3:
tgttgcgagtactgctgcaacccggcctgcaccggttgttat
进一步地,所述利那洛肽包含来自SEQ ID NO:4所述氨基酸序列:
SEQ ID NO:4:
CCEYCCNPACTGCY。
所述利那洛肽结合在SUMO融合标签的N末端或C末端,优选地,融合在SUMO融合标签的C末端。
优选地,所述利那洛肽串联数为3-8个,优选地,串联数为6个。
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:21所述核苷酸序列。
SEQ ID NO:21:
atggggtcgagccaccatcatcatcaccacagctcaggacttgtgccgcgcggtagtcacatgtcggattctgaagtcaaccaggaagctaagcctgaagtcaagcctgaggttaaacccgaaacacacatcaacctgaaagtttcagacggcagcagcgagattttcttcaagattaaaaaaacaacaccgcttcgtcgccttatggaggcgtttgcgaagcgccaaggaaaggagatggacagtcttcgcttcttgtatgatggtatccgtattcaggcggaccaaacaccagaggaccttgatatggaggacaacgatattattgaggcgcaccgcgaacaaattgggggaaaatgctgcgagtattgctgtaatcccgcttgtacaggatgctataaatgttgtgagtattgttgtaacccggcgtgtacaggctgctacaagtgctgtgaatattgctgcaacccagcttgtactggctgctataaatgttgtgagtattgttgtaacccggcgtgtacaggctgctacaaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:22所述氨基酸序列。
SEQ ID NO:22:
MGSSHHHHHHSSGLVPRGSHMSDSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYDGIRIQADQTPEDLDMEDNDIIEAHREQIGGKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌由GST融合标签和利那洛肽串联表达制备;
其中,所述GST融合标签包含来自SEQ ID NO:23所述核苷酸序列:
SEQ ID NO:23:
atggctcctatactaggttattggaaaattaagggccttgtgcaacccactcgacttcttttggaatatcttgaagaaaaatatgaagagcatttgtatgagcgcgatgaaggtgataaatggcgaaacaaaaagtttgaattgggtttggagtttcccaatcttccttattatattgatggtgatgttaaattaacacagtctatggccatcatacgttatatagctgacaagcacaacatgttgggtggttgtccaaaagagcgtgcagagatttcaatgcttgaaggagcggttttggatattagatacggtgtttcgagaattgcatatagtaaagactttgaaactctcaaagttgattttcttagcaagctacctgaaatgctgaaaatgttcgaagatcgtttatgtcataaaacatatttaaatggtgatcatgtaacccatcctgacttcatgttgtatgacgctcttgatgttgttttatacatggacccaatgtgcctggatgcgttcccaaaattagtttgttttaaaaaacgtattgaagctatcccacaaattgataagtacttgaaatccagcaagtatatagcatggcctttgcagggctggcaagccacgtttggtggtggcgaccatcctccaaaatcggatggttcaggtcatcatcatcatcatcat
进一步地,所述GST融合标签包含来自SEQ ID NO:24所述氨基酸序列:
SEQ ID NO:24:
MGPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGLEFPNLPYYIDGDVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKKRIEAIPQIDKYLKSSKYIAWPLQGWQATFGGGDHPPKSDGSGHHHHHH
其中,利那洛肽包含来自SEQ ID NO:3所述核苷酸序列:
SEQ ID NO:3:
tgttgcgagtactgctgcaacccggcctgcaccggttgttat
进一步地,所述利那洛肽包含来自SEQ ID NO:4所述氨基酸序列:
SEQ ID NO:4:
CCEYCCNPACTGCY。
所述利那洛肽结合在GST融合标签的N末端或C末端,优选地,融合在GST融合标签的C末端。
优选地,所述利那洛肽串联数为3-8个,优选地,串联数为6个。
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:25所述核苷酸序列。
SEQ ID NO:25:
atggctcctatactaggttattggaaaattaagggccttgtgcaacccactcgacttcttttggaatatcttgaagaaaaatatgaagagcatttgtatgagcgcgatgaaggtgataaatggcgaaacaaaaagtttgaattgggtttggagtttcccaatcttccttattatattgatggtgatgttaaattaacacagtctatggccatcatacgttatatagctgacaagcacaacatgttgggtggttgtccaaaagagcgtgcagagatttcaatgcttgaaggagcggttttggatattagatacggtgtttcgagaattgcatatagtaaagactttgaaactctcaaagttgattttcttagcaagctacctgaaatgctgaaaatgttcgaagatcgtttatgtcataaaacatatttaaatggtgatcatgtaacccatcctgacttcatgttgtatgacgctcttgatgttgttttatacatggacccaatgtgcctggatgcgttcccaaaattagtttgttttaaaaaacgtattgaagctatcccacaaattgataagtacttgaaatccagcaagtatatagcatggcctttgcagggctggcaagccacgtttggtggtggcgaccatcctccaaaatcggatggttcaggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatactgctgcaacccggcatgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:26所述氨基酸序列。
SEQ ID NO:26:
MGPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGLEFPNLPYYIDGDVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKKRIEAIPQIDKYLKSSKYIAWPLQGWQATFGGGDHPPKSDGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽融合蛋白由MBP融合标签和利那洛肽串联表达制备;
其中,所述MBP融合标签包含来自SEQ ID NO:27所述核苷酸序列:
SEQ ID NO:27:
atgggtaaaatcgaagaaggtaaactggtaatctggattaacggcgataaaggctataacggtctcgctgaagtcggtaagaaattcgagaaagataccggaattaaagtcaccgttgagcatccggataaactggaagagaaattcccacaggttgcggcaactggcgatggccctgacattatcttctgggcacacgaccgctttggtggctacgctcaatctggcctgttggctgaaatcaccccggacaaagcgttccaggacaagctgtatccgtttacctgggatgccgtacgttacaacggcaagctgattgcttacccgatcgctgttgaagcgttatcgctgatttataacaaagatctgctgccgaacccgccaaaaacctgggaagagatcccggcgctggataaagaactgaaagcgaaaggtaagagcgcgctgatgttcaacctgcaagaaccgtacttcacctggccgctgattgctgctgacgggggttatgcgttcaagtatgaaaacggcaagtacgacattaaagacgtgggcgtggataacgctggcgcgaaagcgggtctgaccttcctggttgacctgattaaaaacaaacacatgaatgcagacaccgattactccatcgcagaagctgcctttaataaaggcgaaacagcgatgaccatcaacggcccgtgggcatggtccaacatcgacaccagcaaagtgaattatggtgtaacggtactgccgaccttcaagggtcaaccatccaaaccgttcgttggcgtgctgagcgcaggtattaacgccgccagtccgaacaaagagctggcaaaagagttcctcgaaaactatctgctgactgatgaaggtctggaagcggttaataaagacaaaccgctgggtgccgtagcgctgaagtcttacgaggaagagttggcgaaagatccacgtattgccgccactatggaaaacgcccagaaaggtgaaatcatgccgaacatcccgcagatgtccgctttctggtatgccgtgcgtactgcggtgatcaacgccgccagcggtcgtcagactgtcgatgaagccctgaaagacgcgcagactccgggtagcggtcatcatcatcatcatcat
进一步地,所述MBP融合标签包含来自SEQ ID NO:28所述氨基酸序列:
SEQ ID NO:28:
MGKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTPGSGHHHHHH
其中,利那洛肽包含来自SEQ ID NO:3所述核苷酸序列:
SEQ ID NO:3:
tgttgcgagtactgctgcaacccggcctgcaccggttgttat
进一步地,所述利那洛肽包含来自SEQ ID NO:4所述氨基酸序列:
SEQ ID NO:4:
CCEYCCNPACTGCY。
所述利那洛肽结合在MBP融合标签的N末端或C末端,优选地,融合在MBP
融合标签的C末端。
优选地,所述利那洛肽串联数为3-8个,优选地,串联数为6个。
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:29所述核苷酸序列。
SEQ ID NO:29:
atgggtaaaatcgaagaaggtaaactggtaatctggattaacggcgataaaggctataacggtctcgctgaagtcggtaagaaattcgagaaagataccggaattaaagtcaccgttgagcatccggataaactggaagagaaattcccacaggttgcggcaactggcgatggccctgacattatcttctgggcacacgaccgctttggtggctacgctcaatctggcctgttggctgaaatcaccccggacaaagcgttccaggacaagctgtatccgtttacctgggatgccgtacgttacaacggcaagctgattgcttacccgatcgctgttgaagcgttatcgctgatttataacaaagatctgctgccgaacccgccaaaaacctgggaagagatcccggcgctggataaagaactgaaagcgaaaggtaagagcgcgctgatgttcaacctgcaagaaccgtacttcacctggccgctgattgctgctgacgggggttatgcgttcaagtatgaaaacggcaagtacgacattaaagacgtgggcgtggataacgctggcgcgaaagcgggtctgaccttcctggttgacctgattaaaaacaaacacatgaatgcagacaccgattactccatcgcagaagctgcctttaataaaggcgaaacagcgatgaccatcaacggcccgtgggcatggtccaacatcgacaccagcaaagtgaattatggtgtaacggtactgccgaccttcaagggtcaaccatccaaaccgttcgttggcgtgctgagcgcaggtattaacgccgccagtccgaacaaagagctggcaaaagagttcctcgaaaactatctgctgactgatgaaggtctggaagcggttaataaagacaaaccgctgggtgccgtagcgctgaagtcttacgaggaagagttggcgaaagatccacgtattgccgccactatggaaaacgcccagaaaggtgaaatcatgccgaacatcccgcagatgtccgctttctggtatgccgtgcgtactgcggtgatcaacgccgccagcggtcgtcagactgtcgatgaagccctgaaagacgcgcagactccgggtagcggtcatcatcatcatcatcataaatgttgcgagtactgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgcaccggttgttataaatgttgtgaatattgctgcaacccggcctgtaccggttgttataaatgttgtgaatactgctgcaacccggcatgtaccggttgttataaataa
进一步地,所述基因重组串联表达利那洛肽工程菌包含SEQ ID NO:30所述氨基酸序列。
SEQ ID NO:30:
MGKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTPGSGHHHHHHKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌不含融合标签,包含SEQ ID NO:31所述核苷酸序列。
SEQ ID NO:31:
atgggttctaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaataa
进一步地,所述工程菌包含SEQ ID NO:32所述氨基酸序列。
SEQ ID NO:32:
MGSKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
在另一实施方式中,所述基因重组串联表达利那洛肽工程菌不含融合标签,包含SEQ ID NO:33所述核苷酸序列。
SEQ ID NO:33:
atgggttctaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaatgctgcgaatactgctgcaacccggcgtgcaccggttgctacaaataa
进一步地,所述工程菌包含SEQ ID NO:34所述氨基酸序列。
SEQ ID NO:34:
MGSKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYKCCEYCCNPACTGCYK
本发明筛选出的基因重组串联表达利那洛肽工程菌为TrxA标签串联6个利那洛肽后经转化获得,已保藏于中国微生物菌种保藏管理委员会普通微生物中心,保藏日期是2021年11月15日,分类命名为大肠埃希氏菌Escherichia coli,保藏地址北京市朝阳区北辰西路1号院3号中国科学院微生物研究所,保藏编号为CGMCC No.23800。
本发明的另一技术方案涉及利用所述基因重组串联表达利那洛肽工程菌表达利那洛肽的方法。具体地,
采用大肠杆菌融合表达获得融合蛋白,通过酶切、纯化、环化得到高纯度的目标多肽。
进一步地,包含如下步骤:
1)构建利那洛肽融合串联表达基因,根据大肠杆菌密码子偏好性,优化相关基因序列后人工合成基因片段,插入质粒,转化大肠杆菌BL21(DE3)感受态细胞;优选地,所述质粒为pET9d、pET28a、pET33b
优选的,融合基因选自TrxA,SUMO,GST,MBP,FLAG,Avi,Halo,SNAP,更优选的,为TrxA或SUMO。
优选地,所述感受态细胞预先经过CaCl2处理;
优选地,所述转化采用热击法、电穿孔法;
2)进一步地,所述步骤1)制得的重组工程菌接种至培养基发酵后收菌,重悬后超声破壁,离心上清进行亲和柱层析,得到融合蛋白;
进一步地,发酵步骤包含摇瓶发酵及罐发酵,
优选地,培养基为LB培养基;
优选地,所述培养基含卡那霉素;
优选地,所述发酵温度为37℃;
优选地,所述接种量为1%;
进一步地,发酵至OD600为0.6-1.0时加入IPTG;
优选地,OD600选择0.8;
进一步地,加入IPTG后进一步诱导并收集,优选地,诱导温度为25-37℃,诱导时间为4-12h,最优的,诱导温度为30℃,诱导时间为8h。
进一步地,所述亲和层析使用Ni-NTA Sepharose FF;
优选地,所述罐发酵还包含通气、补糖工序,其中,所述补糖优选为70%葡萄糖;
优选地,所述罐发酵控制溶氧20%-50%,优选为30%-40%;
优选地,所述罐发酵控制pH为6-8,优选为7.0;
优选地,所述发酵培养基为:
优选地,所述破壁采用超声或高压均质法,所述超声为500W-800W,20-60min,所述均质条件为4℃,80-150MPa,次数2-4次。
3)步骤2)得到的融合蛋白,加入蛋白酶酶切,加入DTT还原;
优选地,步骤3)选用的蛋白酶是胰蛋白酶,更优选的,所述蛋白酶是赖氨酰内肽酶;
优选地,所述酶切条件为15-35℃酶切4-8h,最优选地,25℃酶切6小时;
优选地,DTT浓度为20mM。
4)步骤3)还原产物过Q Sepharose FF柱,得到线性多肽,通过C18反相硅胶纯化,冷冻干燥得到线性多肽纯品;
5)步骤4)得到的线性多肽纯品通过环化工艺,得到环状多肽,然后通过酶切将末端赖氨酸切除,C18反相硅胶纯化后,冷冻干燥得到利那洛肽纯品。
优选地,所述环化工艺采用GSH/GSSG氧化还原体系,GSH为还原型谷胱甘肽,浓度为0.1-10mmol/L,GSSG为氧化型谷胱甘肽,浓度范围为0.01-1mmol/L;进一步的,所述酶切工艺采用羧肽酶B;
优选地,所述酶切条件为20-35℃酶切4-12h,最优选地,30℃酶切10小时。
应当理解的是,上述反应条件取决于原料类型的选取等,所有能够实现反应进行的条件均应视为落入本发明的保护范围。
与现有技术相比,本发明的有益效果是:能够低成本快速获取高纯度利那洛肽,按照如上工艺,利那洛肽发酵产量最高达到0.5g/L,纯品收率最高达到0.2g/L,纯度最高达到99%。
附图说明:
图1不同构建方法下蛋白表达情况。图a-l分别对应设计1-12的蛋白表达情况。
图2设计4融合蛋白Ni-NTA Sepharose FF纯化图
图3设计4线性多肽Q Sepharose FF纯化图
图4设计4多肽的HPLC检测图谱。图a为线性多肽的HPLC检测图谱;图b为环状多肽的检测图谱;图c为Ironwood公司对照品HPLC检测图谱。
图5设计4线性多肽和环状多肽的分子量检测。图a为线性多肽的分子量检测,结果显示单同位素分子量为1660.49([M+H]+),与线性多肽理论值分子量1659.53一致;图b为环状多肽分子量检测,结果显示单同位素分子量为1526.36([M+H]+),1548.35([M+Na]+),与环状多肽理论值分子量1525.44一致。
图6设计4利那洛肽的活性检测。检测结果表明,本案方法制备的利那洛肽对人结肠癌细胞株T84促进cGMP产生的EC50为17.90nM,与阳性对照药19.22nM一致。
具体实施方式
为了进一步阐明本发明,现在将描述本发明的优选实施例。还应理解,提供实施方案是出于说明的目的,而不限制本发明的范围。
实施例1
一种基因重组串联表达利那洛肽工程菌表达利那洛肽的方法。
1.重组蛋白设计
设计3:TrxA融合标签,串联4个利那洛肽序列,其核苷酸和蛋白序列分别如SEQ IDNO:9,SEQ ID NO:10所示。
2.工程菌构建
委托华大基因合成基因片段。合成的基因片段经限制性内切酶Nco I和Xho I双酶切,连接到同样双酶切处理的pET28a质粒,测序验证;CaCl2处理法制备大肠杆菌BL21(DE3)感受态细胞,42℃热击将重组质粒转入细胞,37℃培养过夜,挑取单克隆菌株进行产物表达验证。
3.工程菌摇瓶发酵
验证后的菌种接种于20ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于1L LB培养基(含卡那霉素50μg/ml)发酵(1%接种量),首先220rpm,37℃培养至OD600达到0.8,调温至30℃,加入IPTG(终浓度0.2mM)诱导蛋白表达8h,8000rpm离心收菌。
4.工程菌罐发酵
验证后的菌种接种于100ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于10L发酵培养基(含卡那霉素50μg/ml,1%接种量),37℃培养,浓氨水控pH7.0,转速、通气和补糖(70%葡糖糖)控溶氧30%-40%;OD600达到40时,调温至30℃,加入IPTG(终浓度0.5mM)开始诱导蛋白表达,10h后发酵结束,8000rpm离心收菌。
发酵培养基组分:
/>
5.工程菌破壁
湿菌按1:8用破壁缓冲液溶解(破壁缓冲液:50mM Tris-Cl,2M尿素,pH8.5),搅拌至无明显颗粒,超声或高压均质破壁(超声条件:10号变幅杆,700W,35min;均质条件:4℃,110MPa,均质4次),破壁后调节pH至8.0。4℃,12000rpm离心30min,收集上清,补加20mMβ-巯基乙醇。
6.融合蛋白纯化
按1:1使用柱料,将上清上样至预平衡后的Ni-NTA Sepharose FF亲和柱,用平衡缓冲液冲洗至基线平衡,然后用洗脱缓冲液将融合蛋白洗脱,收集洗脱峰。
平衡缓冲液:50mM Tris-Cl,2M尿素,20mMβ-巯基乙醇,pH8.0
洗脱缓冲液:50mM Tris-Cl,500mM咪唑,2M尿素,20mMβ-巯基乙醇,pH8.0
7.融合蛋白酶切
纯化后融合蛋白调节pH 9.0,加入赖氨酰内肽酶,5-30AU/g融合蛋白加量,置于25℃静置酶切过夜。酶切结束补加20mM DTT,静置2h还原线性多肽。
8.线性多肽纯化
按1:1使用柱料,将还原产物上样至预平衡的Q Sepharose FF柱,用缓冲液A冲洗至基线平衡,然后0-100%线性洗脱,将线性多肽洗脱,洗脱体积20CV,收集洗脱峰。
缓冲液A:50mM Tris-Cl,2M尿素,20mM DTT,pH8.7
缓冲液B:50mM Tris-Cl,500mM NaCl,2M尿素,20mM DTT,pH8.7
将洗脱峰进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得线性多肽冻干粉。
流动相A:0.1%TFA的纯水;
流动相B:0.1%TFA的乙腈;
梯度方法:
9.多肽环化
将线性多肽冻干粉用环化反应液溶解,浓度0.1-2mg/ml,25℃静置反应30h。
反应液体系:50mM Tris-Cl,1mM GSH,0.1mM GSSG,pH 8.0-9.0
10.多肽酶切
在环化反应体系加入羧肽酶B酶切,1.0-10mg/g加酶量,30℃静置酶切过夜。
11.多肽纯化
酶切产物进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得纯品醋酸利那洛肽冻干粉。
流动相A:1%冰醋酸纯水;
流动相B:1%冰醋酸乙腈;
梯度方法:
时间(min) | A(%) | B(%) |
0 | 90 | 15 |
5 | 90 | 15 |
35 | 60 | 40 |
45 | 60 | 40 |
按照该制备工艺,多肽发酵产量为0.503g/L,纯品收率达到0.201g/L。
实施例2
一种基因重组串联表达利那洛肽工程菌表达利那洛肽的方法。
1.重组蛋白设计
设计4:TrxA融合标签,串联6个利那洛肽序列,其核苷酸和蛋白序列分别如SEQ IDNO:11,SEQ ID NO:12所示。
2.工程菌构建
委托华大基因合成基因片段。合成的基因片段经限制性内切酶Nco I和Xho I双酶切,连接到同样双酶切处理的pET28a质粒,测序验证;CaCl2处理法制备大肠杆菌BL21(DE3)感受态细胞,42℃热击将重组质粒转入细胞,37℃培养过夜,挑取单克隆菌株进行产物表达验证。
将工程菌进行保藏,保藏日期是2021年11月15日,保藏地址北京市朝阳区北辰西路1号院3号中国科学院微生物研究所,保藏编号为CGMCC No.23800。
3.工程菌摇瓶发酵
验证后的菌种接种于20ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于1L LB培养基(含卡那霉素50μg/ml)发酵(1%接种量),首先220rpm,37℃培养至OD600达到0.8,调温至30℃,加入IPTG(终浓度0.2mM)诱导蛋白表达8h,8000rpm离心收菌。
4.工程菌罐发酵
验证后的菌种接种于100ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于10L发酵培养基(含卡那霉素50μg/ml,1%接种量),37℃培养,浓氨水控pH7.0,转速、通气和补糖(70%葡糖糖)控溶氧30%-40%;OD600达到40时,调温至30℃,加入IPTG(终浓度0.5mM)开始诱导蛋白表达,10h后发酵结束,8000rpm离心收菌。发酵培养基组分同实施例1。
5.工程菌破壁
湿菌按1:8用破壁缓冲液溶解(破壁缓冲液:50mM Tris-Cl,2M尿素,pH8.5),搅拌至无明显颗粒,超声或高压均质破壁(超声条件:10号变幅杆,700W,35min;均质条件:4℃,110MPa,均质4次),破壁后调节pH至8.0。4℃,12000rpm离心30min,收集上清,补加20mMβ-巯基乙醇。
6.融合蛋白纯化
按1:1使用柱料,将上清上样至预平衡后的Ni-NTA Sepharose FF亲和柱,用平衡缓冲液冲洗至基线平衡,然后用洗脱缓冲液将融合蛋白洗脱,收集洗脱峰。平衡缓冲液:50mM Tris-Cl,2M尿素,20mMβ-巯基乙醇,pH8.0
洗脱缓冲液:50mM Tris-Cl,500mM咪唑,2M尿素,20mMβ-巯基乙醇,pH8.0
7.融合蛋白酶切
纯化后融合蛋白调节pH 9.0,加入赖氨酰内肽酶,5-30AU/g融合蛋白加量,置于25℃静置酶切过夜。酶切结束补加20mM DTT,静置2h还原线性多肽。
8.线性多肽纯化
按1:1使用柱料,将还原产物上样至预平衡的Q Sepharose FF柱,用缓冲液A冲洗至基线平衡,然后0-100%线性洗脱,将线性多肽洗脱,洗脱体积20CV,收集洗脱峰。
缓冲液A:50mM Tris-Cl,2M尿素,20mM DTT,pH8.7
缓冲液B:50mM Tris-Cl,500mM NaCl,2M尿素,20mM DTT,pH8.7
将洗脱峰进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得线性多肽冻干粉。
流动相A:0.1%TFA的纯水;
流动相B:0.1%TFA的乙腈;
梯度同实施例1
9.多肽环化
将线性多肽冻干粉用环化反应液溶解,浓度0.1-2mg/ml,25℃静置反应30h。
反应液体系:50mM Tris-Cl,1mM GSH,0.1mM GSSG,pH 8.0-9.0
10.多肽酶切
在环化反应体系加入羧肽酶B酶切,1.0-10mg/g加酶量,30℃静置酶切过夜。
11.多肽纯化
酶切产物进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得纯品醋酸利那洛肽冻干粉。
流动相A:1%冰醋酸纯水;
流动相B:1%冰醋酸乙腈;
梯度方法同实施例1
按照该制备工艺,多肽发酵产量为0.518g/L,纯品收率达到0.209g/L。
实施例3
一种基因重组串联表达利那洛肽工程菌表达利那洛肽的方法。
1.重组蛋白设计
设计6:TrxA融合标签,串联10个利那洛肽序列,其核苷酸和蛋白序列分别如SEQID NO:15,SEQ ID NO:16所示。
2.工程菌构建
委托华大基因合成基因片段。合成的基因片段经限制性内切酶Nco I和Xho I双酶切,连接到同样双酶切处理的pET28a质粒,测序验证;CaCl2处理法制备大肠杆菌BL21(DE3)感受态细胞,42℃热击将重组质粒转入细胞,37℃培养过夜,挑取单克隆菌株进行产物表达验证。
3.工程菌摇瓶发酵
验证后的菌种接种于20ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于1L LB培养基(含卡那霉素50μg/ml)发酵(1%接种量),首先220rpm,37℃培养至OD600达到0.8,调温至30℃,加入IPTG(终浓度0.2mM)诱导蛋白表达8h,8000rpm离心收菌。
4.工程菌罐发酵
验证后的菌种接种于100ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于10L发酵培养基(含卡那霉素50μg/ml,1%接种量),37℃培养,浓氨水控pH7.0,转速、通气和补糖(70%葡糖糖)控溶氧30%-40%;OD600达到40时,调温至30℃,加入IPTG(终浓度0.5mM)开始诱导蛋白表达,10h后发酵结束,8000rpm离心收菌。发酵培养基组分同实施例1。
5.工程菌破壁
湿菌按1:8用破壁缓冲液溶解(破壁缓冲液:50mM Tris-Cl,2M尿素,pH8.5),搅拌至无明显颗粒,超声或高压均质破壁(超声条件:10号变幅杆,700W,35min;均质条件:4℃,110MPa,均质4次),破壁后调节pH至8.0。4℃,12000rpm离心30min,收集上清,补加20mMβ-巯基乙醇。
6.融合蛋白纯化
按1:1使用柱料,将上清上样至预平衡后的Ni-NTA Sepharose FF亲和柱,用平衡缓冲液冲洗至基线平衡,然后用洗脱缓冲液将融合蛋白洗脱,收集洗脱峰。
平衡缓冲液:50mM Tris-Cl,2M尿素,20mMβ-巯基乙醇,pH8.0
洗脱缓冲液:50mM Tris-Cl,500mM咪唑,2M尿素,20mMβ-巯基乙醇,pH8.0
7.融合蛋白酶切
纯化后融合蛋白调节pH 9.0,加入赖氨酰内肽酶,5-30AU/g融合蛋白加量,置于25℃静置酶切过夜。酶切结束补加20mM DTT,静置2h还原线性多肽。
8.线性多肽纯化
按1:1使用柱料,将还原产物上样至预平衡的Q Sepharose FF柱,用缓冲液A冲洗至基线平衡,然后0-100%线性洗脱,将线性多肽洗脱,洗脱体积20CV,收集洗脱峰。
缓冲液A:50mM Tris-Cl,2M尿素,20mM DTT,pH8.7
缓冲液B:50mM Tris-Cl,500mM NaCl,2M尿素,20mM DTT,pH8.7
将洗脱峰进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得线性多肽冻干粉。
流动相A:0.1%TFA的纯水;
流动相B:0.1%TFA的乙腈;
梯度同实施例1
9.多肽环化
将线性多肽冻干粉用环化反应液溶解,浓度0.1-2mg/ml,25℃静置反应30h。反应液体系:50mM Tris-Cl,1mM GSH,0.1mM GSSG,pH 8.0-9.0
10.多肽酶切
在环化反应体系加入羧肽酶B酶切,1.0-10mg/g加酶量,30℃静置酶切过夜。
11.多肽纯化
酶切产物进行反相色谱纯化(色谱柱:C18,10μm,),线性洗脱,收集洗脱峰,收集波长280nm。冷冻干燥,得纯品醋酸利那洛肽冻干粉。
流动相A:1%冰醋酸纯水;
流动相B:1%冰醋酸乙腈;
梯度方法同实施例1
按照该制备工艺,多肽发酵产量为0.177g/L,纯品收率达到0.069g/L。
实施例4
一种基因重组串联表达利那洛肽工程菌表达利那洛肽的方法。
1.重组蛋白设计
设计7:TrxA融合标签,串联12个利那洛肽序列,其核苷酸和蛋白序列分别如SEQID NO:17,SEQ ID NO:18所示。
2.工程菌构建
委托华大基因合成基因片段。合成的基因片段经限制性内切酶Nco I和Xho I双酶切,连接到同样双酶切处理的pET28a质粒,测序验证;CaCl2处理法制备大肠杆菌BL21(DE3)感受态细胞,42℃热击将重组质粒转入细胞,37℃培养过夜,挑取单克隆菌株进行产物表达验证。
3.工程菌摇瓶发酵
验证后的菌种接种于20ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于1L LB培养基(含卡那霉素50μg/ml)发酵(1%接种量),首先220rpm,37℃培养至OD600达到0.8,调温至30℃,加入IPTG(终浓度0.2mM)诱导蛋白表达8h,8000rpm离心收菌。
4.工程菌罐发酵
验证后的菌种接种于100ml LB培养基(含卡那霉素50μg/ml),37℃培养过夜作为种子,接种于10L发酵培养基(含卡那霉素50μg/ml,1%接种量),37℃培养,浓氨水控pH7.0,转速、通气和补糖(70%葡糖糖)控溶氧30%-40%;OD600达到40时,调温至30℃,加入IPTG(终浓度0.5mM)开始诱导蛋白表达,10h后发酵结束,8000rpm离心收菌。发酵培养基组分同实施例1。
按照该制备工艺,融合蛋白不表达,不能得到利那洛肽。
实施例5
设计实验组1-12,其中实验组1-7分别采用TrxA为融合标签,串联数分别为1,3,4,6,8,10,12,实验组8采用SUMO为融合标签,串联数为4,实验组9采用GST融合标签,串联数为4,实验组10采用MBP融合标签,串联数为4,实验组11-12不设置融合标签,串联数为6,8。
整体制备过程同实施例1。
具体实验设计及实验结果如下表所示:
其中,设计4融合蛋白经工程菌构建步骤获得工程菌CGMCC No.23800。
实施例6
多肽HPLC检测。
采用HPLC法检测线性多肽和环状多肽,进行纯度和含量测定。色谱柱采用YMC-Pack Pro C18柱,3.0*150mm,3μm,柱温40℃,流速0.6ml/min,检测波长220nm。
流动相A:10%乙腈,0.1%TFA;
流动相B:80%乙腈,0.1%TFA;
梯度方法:
时间(min) | A(%) | B(%) |
0 | 100 | 0 |
5 | 100 | 0 |
35 | 47 | 53 |
40 | 0 | 100 |
40.1 | 100 | 0 |
50 | 100 | 0 |
实施例7
多肽分子量检测。
采用ABSciex 5800 MALDI-TOF/TOF对蛋白质相对分子质量进行测试,准确可靠的获得蛋白质相对分子质量信息。将样品点至样品靶上,自然干燥后,再取CHCA基质溶液点至对应靶位上并自然干燥,在正离子模式下选择反射方法测试样品分子量。5800 MALDI-TOF/TOF产生的原始数据及图谱由4000 Series Explorer V3.5软件导出。
实施例8
多肽活性检测。
体外研究中,利那洛肽与人结肠癌细胞株T84上GC-C受体结合可以促进cGMP的产生和积累。通过测定人T84细胞内cGMP的量,评价待测样品的体外激动作用。
使用T84细胞株作为筛选模型,当细胞汇合度达到80%-85%时,进行消化处理,将收集到的细胞悬液,以适宜密度接种到96孔板,然后放入37℃/5%CO2培养箱中继续培养48小时后用于实验。48小时后取出细胞培养板,用DMEM培养基(含有1mM/L IBMX,PH=7.0)清洗并在37℃孵育10分钟。孵育结束后,加入待测样品工作液,然后将细胞板放到37℃/5%CO2培养箱孵育30分钟。孵育结束后,离心收集上清,加入cGMP检测试剂,用酶标仪(PheraStar)读取并记录数据。
通过PHERA star获得原始数据,分别将665nm和620nm波长处的信号检测值之比乘以10000后得到R值,按照下列方式处理后作为作图数据,数据采集和分析使用Excel和GraphPad Prism 6软件程序。
%激活率=100%-(RCompound-RAgonist 100)/(RBackground-RAgonist 100)x100%
计量反应曲线使用GraphPad Prism 6用四参数方程对数据进行分析。
序列表
<110> 修实生物医药(南通)有限公司
<120> 一种基因重组串联表达利那洛肽的工程菌
<141> 2022-01-24
<160> 34
<170> SIPOSequenceListing 1.0
<210> 1
<211> 354
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcat 354
<210> 2
<211> 118
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 2
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His
115
<210> 3
<211> 42
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
tgttgcgagt actgctgcaa cccggcctgc accggttgtt at 42
<210> 4
<211> 14
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 4
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
1 5 10
<210> 5
<211> 405
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aataa 405
<210> 6
<211> 134
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 6
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys
130
<210> 7
<211> 495
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 420
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgtacc 480
ggttgttata aataa 495
<210> 8
<211> 164
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 8
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys
<210> 9
<211> 540
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 420
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgtacc 480
ggttgttata aatgttgtga atactgctgc aacccggcat gtaccggttg ttataaataa 540
<210> 10
<211> 179
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 10
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys
<210> 11
<211> 630
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcaca ggttgttata aatgttgtga atactgctgc 420
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaatcc ggcatgtacc 480
ggttgttata aatgttgtga atactgctgc aacccggcct gtaccggttg ttataaatgt 540
tgtgaatatt gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 600
aacccggcct gtaccggttg ttataaataa 630
<210> 12
<211> 209
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 12
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
180 185 190
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
195 200 205
Lys
<210> 13
<211> 720
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcaca ggttgttata aatgttgtga atactgctgc 420
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaatcc ggcatgtacc 480
ggttgttata aatgttgtga atactgctgc aacccggcct gtaccggttg ttataaatgt 540
tgtgaatatt gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 600
aacccggcct gtaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgcacc 660
ggttgttata aatgttgtga atattgctgc aacccggcct gcaccggttg ttataaataa 720
<210> 14
<211> 239
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 14
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
180 185 190
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
195 200 205
Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
210 215 220
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
225 230 235
<210> 15
<211> 810
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 420
aacccggcct gcaccggttg ttataaatgt tgcgagtact gctgcaaccc ggcctgcaca 480
ggttgttata aatgttgtga atactgctgc aacccggcct gcaccggttg ttataaatgt 540
tgtgaatatt gctgcaatcc ggcatgtacc ggttgttata aatgttgtga atactgctgc 600
aacccggcct gtaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgcacc 660
ggttgttata aatgttgtga atattgctgc aacccggcct gtaccggttg ttataaatgt 720
tgtgaatatt gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 780
aacccggcct gcaccggttg ttataaataa 810
<210> 16
<211> 269
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 16
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
180 185 190
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
195 200 205
Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
210 215 220
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys
225 230 235 240
Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys
245 250 255
Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
260 265
<210> 17
<211> 900
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
atggcagaca aaatcatcca cctgaccgac gactctttcg acaccgacgt tctgaaagcg 60
gacggtgcga tcctggttga cttctgggcg gaatggtgcg gtccgtgcaa aatgatcgcg 120
ccgatcctgg acgaaatcgc ggacgaatac cagggtaaac tgaccgttgc gaaactgaac 180
atcgaccaga acccgggtac cgcgccgaaa tacggtatcc gtggtatccc gaccctgctg 240
ctgttcaaaa acggtgaagt tgcggcgacc aaagttggtg cgctgtctaa aggtcagctg 300
aaagaattcc tggacgcgaa cctggcgggt tctggtcatc atcatcatca tcataaatgt 360
tgcgagtact gctgcaaccc ggcctgcaca ggttgttata aatgttgtga atactgctgc 420
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaatcc ggcatgtacc 480
ggttgttata aatgttgtga atactgctgc aacccggcct gtaccggttg ttataaatgt 540
tgtgaatatt gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 600
aacccggcct gtaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgcacc 660
ggttgttata aatgttgtga atattgctgc aacccggcct gcaccggttg ttataaatgt 720
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 780
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgtacc 840
ggttgttata aatgttgtga atactgctgc aacccggcat gtaccggttg ttataaataa 900
<210> 18
<211> 299
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 18
Met Ala Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp
1 5 10 15
Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala Glu Trp
20 25 30
Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp
35 40 45
Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp Gln Asn
50 55 60
Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu
65 70 75 80
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser
85 90 95
Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly
100 105 110
His His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
180 185 190
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
195 200 205
Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
210 215 220
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys
225 230 235 240
Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys
245 250 255
Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu
260 265 270
Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr
275 280 285
Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
290 295
<210> 19
<211> 354
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
atggggtcga gccaccatca tcatcaccac agctcaggac ttgtgccgcg cggtagtcac 60
atgtcggatt ctgaagtcaa ccaggaagct aagcctgaag tcaagcctga ggttaaaccc 120
gaaacacaca tcaacctgaa agtttcagac ggcagcagcg agattttctt caagattaaa 180
aaaacaacac cgcttcgtcg ccttatggag gcgtttgcga agcgccaagg aaaggagatg 240
gacagtcttc gcttcttgta tgatggtatc cgtattcagg cggaccaaac accagaggac 300
cttgatatgg aggacaacga tattattgag gcgcaccgcg aacaaattgg ggga 354
<210> 20
<211> 118
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 20
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro
20 25 30
Glu Val Lys Pro Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val
35 40 45
Ser Asp Gly Ser Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro
50 55 60
Leu Arg Arg Leu Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met
65 70 75 80
Asp Ser Leu Arg Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln
85 90 95
Thr Pro Glu Asp Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His
100 105 110
Arg Glu Gln Ile Gly Gly
115
<210> 21
<211> 540
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
atggggtcga gccaccatca tcatcaccac agctcaggac ttgtgccgcg cggtagtcac 60
atgtcggatt ctgaagtcaa ccaggaagct aagcctgaag tcaagcctga ggttaaaccc 120
gaaacacaca tcaacctgaa agtttcagac ggcagcagcg agattttctt caagattaaa 180
aaaacaacac cgcttcgtcg ccttatggag gcgtttgcga agcgccaagg aaaggagatg 240
gacagtcttc gcttcttgta tgatggtatc cgtattcagg cggaccaaac accagaggac 300
cttgatatgg aggacaacga tattattgag gcgcaccgcg aacaaattgg gggaaaatgc 360
tgcgagtatt gctgtaatcc cgcttgtaca ggatgctata aatgttgtga gtattgttgt 420
aacccggcgt gtacaggctg ctacaagtgc tgtgaatatt gctgcaaccc agcttgtact 480
ggctgctata aatgttgtga gtattgttgt aacccggcgt gtacaggctg ctacaaataa 540
<210> 22
<211> 179
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 22
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro
20 25 30
Glu Val Lys Pro Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val
35 40 45
Ser Asp Gly Ser Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro
50 55 60
Leu Arg Arg Leu Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met
65 70 75 80
Asp Ser Leu Arg Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln
85 90 95
Thr Pro Glu Asp Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His
100 105 110
Arg Glu Gln Ile Gly Gly Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala
115 120 125
Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
130 135 140
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
145 150 155 160
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
165 170 175
Cys Tyr Lys
<210> 23
<211> 687
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
atggctccta tactaggtta ttggaaaatt aagggccttg tgcaacccac tcgacttctt 60
ttggaatatc ttgaagaaaa atatgaagag catttgtatg agcgcgatga aggtgataaa 120
tggcgaaaca aaaagtttga attgggtttg gagtttccca atcttcctta ttatattgat 180
ggtgatgtta aattaacaca gtctatggcc atcatacgtt atatagctga caagcacaac 240
atgttgggtg gttgtccaaa agagcgtgca gagatttcaa tgcttgaagg agcggttttg 300
gatattagat acggtgtttc gagaattgca tatagtaaag actttgaaac tctcaaagtt 360
gattttctta gcaagctacc tgaaatgctg aaaatgttcg aagatcgttt atgtcataaa 420
acatatttaa atggtgatca tgtaacccat cctgacttca tgttgtatga cgctcttgat 480
gttgttttat acatggaccc aatgtgcctg gatgcgttcc caaaattagt ttgttttaaa 540
aaacgtattg aagctatccc acaaattgat aagtacttga aatccagcaa gtatatagca 600
tggcctttgc agggctggca agccacgttt ggtggtggcg accatcctcc aaaatcggat 660
ggttcaggtc atcatcatca tcatcat 687
<210> 24
<211> 229
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 24
Met Gly Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15
Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30
Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60
Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80
Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95
Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110
Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125
Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140
Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175
Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190
Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205
Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Gly Ser Gly His
210 215 220
His His His His His
225
<210> 25
<211> 873
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
atggctccta tactaggtta ttggaaaatt aagggccttg tgcaacccac tcgacttctt 60
ttggaatatc ttgaagaaaa atatgaagag catttgtatg agcgcgatga aggtgataaa 120
tggcgaaaca aaaagtttga attgggtttg gagtttccca atcttcctta ttatattgat 180
ggtgatgtta aattaacaca gtctatggcc atcatacgtt atatagctga caagcacaac 240
atgttgggtg gttgtccaaa agagcgtgca gagatttcaa tgcttgaagg agcggttttg 300
gatattagat acggtgtttc gagaattgca tatagtaaag actttgaaac tctcaaagtt 360
gattttctta gcaagctacc tgaaatgctg aaaatgttcg aagatcgttt atgtcataaa 420
acatatttaa atggtgatca tgtaacccat cctgacttca tgttgtatga cgctcttgat 480
gttgttttat acatggaccc aatgtgcctg gatgcgttcc caaaattagt ttgttttaaa 540
aaacgtattg aagctatccc acaaattgat aagtacttga aatccagcaa gtatatagca 600
tggcctttgc agggctggca agccacgttt ggtggtggcg accatcctcc aaaatcggat 660
ggttcaggtc atcatcatca tcatcataaa tgttgcgagt actgctgcaa cccggcctgc 720
accggttgtt ataaatgttg tgaatattgc tgcaacccgg cctgcaccgg ttgttataaa 780
tgttgtgaat attgctgcaa cccggcctgt accggttgtt ataaatgttg tgaatactgc 840
tgcaacccgg catgtaccgg ttgttataaa taa 873
<210> 26
<211> 290
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 26
Met Gly Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15
Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30
Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60
Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80
Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95
Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110
Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125
Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140
Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175
Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190
Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205
Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Gly Ser Gly His
210 215 220
His His His His His Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys
225 230 235 240
Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr
245 250 255
Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
260 265 270
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
275 280 285
Tyr Lys
290
<210> 27
<211> 1134
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
atgggtaaaa tcgaagaagg taaactggta atctggatta acggcgataa aggctataac 60
ggtctcgctg aagtcggtaa gaaattcgag aaagataccg gaattaaagt caccgttgag 120
catccggata aactggaaga gaaattccca caggttgcgg caactggcga tggccctgac 180
attatcttct gggcacacga ccgctttggt ggctacgctc aatctggcct gttggctgaa 240
atcaccccgg acaaagcgtt ccaggacaag ctgtatccgt ttacctggga tgccgtacgt 300
tacaacggca agctgattgc ttacccgatc gctgttgaag cgttatcgct gatttataac 360
aaagatctgc tgccgaaccc gccaaaaacc tgggaagaga tcccggcgct ggataaagaa 420
ctgaaagcga aaggtaagag cgcgctgatg ttcaacctgc aagaaccgta cttcacctgg 480
ccgctgattg ctgctgacgg gggttatgcg ttcaagtatg aaaacggcaa gtacgacatt 540
aaagacgtgg gcgtggataa cgctggcgcg aaagcgggtc tgaccttcct ggttgacctg 600
attaaaaaca aacacatgaa tgcagacacc gattactcca tcgcagaagc tgcctttaat 660
aaaggcgaaa cagcgatgac catcaacggc ccgtgggcat ggtccaacat cgacaccagc 720
aaagtgaatt atggtgtaac ggtactgccg accttcaagg gtcaaccatc caaaccgttc 780
gttggcgtgc tgagcgcagg tattaacgcc gccagtccga acaaagagct ggcaaaagag 840
ttcctcgaaa actatctgct gactgatgaa ggtctggaag cggttaataa agacaaaccg 900
ctgggtgccg tagcgctgaa gtcttacgag gaagagttgg cgaaagatcc acgtattgcc 960
gccactatgg aaaacgccca gaaaggtgaa atcatgccga acatcccgca gatgtccgct 1020
ttctggtatg ccgtgcgtac tgcggtgatc aacgccgcca gcggtcgtca gactgtcgat 1080
gaagccctga aagacgcgca gactccgggt agcggtcatc atcatcatca tcat 1134
<210> 28
<211> 378
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 28
Met Gly Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp
1 5 10 15
Lys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp
20 25 30
Thr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys
35 40 45
Phe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp
50 55 60
Ala His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu
65 70 75 80
Ile Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp
85 90 95
Asp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val
100 105 110
Glu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro
115 120 125
Lys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys
130 135 140
Gly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp
145 150 155 160
Pro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly
165 170 175
Lys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala
180 185 190
Gly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala
195 200 205
Asp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr
210 215 220
Ala Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser
225 230 235 240
Lys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro
245 250 255
Ser Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser
260 265 270
Pro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr
275 280 285
Asp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val
290 295 300
Ala Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala
305 310 315 320
Ala Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro
325 330 335
Gln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala
340 345 350
Ala Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr
355 360 365
Pro Gly Ser Gly His His His His His His
370 375
<210> 29
<211> 1320
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
atgggtaaaa tcgaagaagg taaactggta atctggatta acggcgataa aggctataac 60
ggtctcgctg aagtcggtaa gaaattcgag aaagataccg gaattaaagt caccgttgag 120
catccggata aactggaaga gaaattccca caggttgcgg caactggcga tggccctgac 180
attatcttct gggcacacga ccgctttggt ggctacgctc aatctggcct gttggctgaa 240
atcaccccgg acaaagcgtt ccaggacaag ctgtatccgt ttacctggga tgccgtacgt 300
tacaacggca agctgattgc ttacccgatc gctgttgaag cgttatcgct gatttataac 360
aaagatctgc tgccgaaccc gccaaaaacc tgggaagaga tcccggcgct ggataaagaa 420
ctgaaagcga aaggtaagag cgcgctgatg ttcaacctgc aagaaccgta cttcacctgg 480
ccgctgattg ctgctgacgg gggttatgcg ttcaagtatg aaaacggcaa gtacgacatt 540
aaagacgtgg gcgtggataa cgctggcgcg aaagcgggtc tgaccttcct ggttgacctg 600
attaaaaaca aacacatgaa tgcagacacc gattactcca tcgcagaagc tgcctttaat 660
aaaggcgaaa cagcgatgac catcaacggc ccgtgggcat ggtccaacat cgacaccagc 720
aaagtgaatt atggtgtaac ggtactgccg accttcaagg gtcaaccatc caaaccgttc 780
gttggcgtgc tgagcgcagg tattaacgcc gccagtccga acaaagagct ggcaaaagag 840
ttcctcgaaa actatctgct gactgatgaa ggtctggaag cggttaataa agacaaaccg 900
ctgggtgccg tagcgctgaa gtcttacgag gaagagttgg cgaaagatcc acgtattgcc 960
gccactatgg aaaacgccca gaaaggtgaa atcatgccga acatcccgca gatgtccgct 1020
ttctggtatg ccgtgcgtac tgcggtgatc aacgccgcca gcggtcgtca gactgtcgat 1080
gaagccctga aagacgcgca gactccgggt agcggtcatc atcatcatca tcataaatgt 1140
tgcgagtact gctgcaaccc ggcctgcacc ggttgttata aatgttgtga atattgctgc 1200
aacccggcct gcaccggttg ttataaatgt tgtgaatatt gctgcaaccc ggcctgtacc 1260
ggttgttata aatgttgtga atactgctgc aacccggcat gtaccggttg ttataaataa 1320
<210> 30
<211> 439
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 30
Met Gly Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp
1 5 10 15
Lys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp
20 25 30
Thr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys
35 40 45
Phe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp
50 55 60
Ala His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu
65 70 75 80
Ile Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp
85 90 95
Asp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val
100 105 110
Glu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro
115 120 125
Lys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys
130 135 140
Gly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp
145 150 155 160
Pro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly
165 170 175
Lys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala
180 185 190
Gly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala
195 200 205
Asp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr
210 215 220
Ala Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser
225 230 235 240
Lys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro
245 250 255
Ser Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser
260 265 270
Pro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr
275 280 285
Asp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val
290 295 300
Ala Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala
305 310 315 320
Ala Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro
325 330 335
Gln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala
340 345 350
Ala Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr
355 360 365
Pro Gly Ser Gly His His His His His His Lys Cys Cys Glu Tyr Cys
370 375 380
Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys
385 390 395 400
Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn
405 410 415
Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro
420 425 430
Ala Cys Thr Gly Cys Tyr Lys
435
<210> 31
<211> 285
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
atgggttcta aatgctgcga atactgctgc aacccggcgt gcaccggttg ctacaaatgc 60
tgcgaatact gctgcaaccc ggcgtgcacc ggttgctaca aatgctgcga atactgctgc 120
aacccggcgt gcaccggttg ctacaaatgc tgcgaatact gctgcaaccc ggcgtgcacc 180
ggttgctaca aatgctgcga atactgctgc aacccggcgt gcaccggttg ctacaaatgc 240
tgcgaatact gctgcaaccc ggcgtgcacc ggttgctaca aataa 285
<210> 32
<211> 94
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 32
Met Gly Ser Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
1 5 10 15
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
20 25 30
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
35 40 45
Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
50 55 60
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys
65 70 75 80
Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
85 90
<210> 33
<211> 375
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 33
atgggttcta aatgctgcga atactgctgc aacccggcgt gcaccggttg ctacaaatgc 60
tgcgaatact gctgcaaccc ggcgtgcacc ggttgctaca aatgctgcga atactgctgc 120
aacccggcgt gcaccggttg ctacaaatgc tgcgaatact gctgcaaccc ggcgtgcacc 180
ggttgctaca aatgctgcga atactgctgc aacccggcgt gcaccggttg ctacaaatgc 240
tgcgaatact gctgcaaccc ggcgtgcacc ggttgctaca aatgctgcga atactgctgc 300
aacccggcgt gcaccggttg ctacaaatgc tgcgaatact gctgcaaccc ggcgtgcacc 360
ggttgctaca aataa 375
<210> 34
<211> 124
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 34
Met Gly Ser Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly
1 5 10 15
Cys Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys
20 25 30
Tyr Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr
35 40 45
Lys Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
50 55 60
Cys Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys
65 70 75 80
Cys Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys
85 90 95
Glu Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys Cys Cys Glu
100 105 110
Tyr Cys Cys Asn Pro Ala Cys Thr Gly Cys Tyr Lys
115 120
Claims (1)
1.一种基因重组串联表达利那洛肽的工程菌在合成利那洛肽中应用,其特征在于,所述利那洛肽表达步骤包括:
(1)工程菌构建;
(2)摇瓶培养及发酵罐放大;
(3)融合蛋白提取;
(4)酶切及环化;
(5)纯化及检测;
所述基因重组串联表达利那洛肽的工程菌保藏编号为CGMCC No.23800,分类命名为大肠埃希氏菌Escherichia coli,保藏于中国微生物菌种保藏管理委员会普通微生物中心;
所述发酵参数为:溶氧30%-40%,pH=7.0,补料为70%葡萄糖,且发酵过程添加IPTG,并在加入IPTG后进一步诱导并收集,所述诱导温度为30℃,诱导时间为8h;
所述摇瓶培养的培养基为LB培养基,且含卡那霉素;
所述摇瓶培养的接种量为1%。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210082694.2A CN114350587B (zh) | 2022-01-24 | 2022-01-24 | 一种基因重组串联表达利那洛肽的工程菌 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210082694.2A CN114350587B (zh) | 2022-01-24 | 2022-01-24 | 一种基因重组串联表达利那洛肽的工程菌 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114350587A CN114350587A (zh) | 2022-04-15 |
CN114350587B true CN114350587B (zh) | 2023-10-31 |
Family
ID=81093462
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210082694.2A Active CN114350587B (zh) | 2022-01-24 | 2022-01-24 | 一种基因重组串联表达利那洛肽的工程菌 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114350587B (zh) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101319222A (zh) * | 2007-06-06 | 2008-12-10 | 中国农业科学院饲料研究所 | 重组融合表达串联抗菌肽基因的方法 |
CN103626849A (zh) * | 2013-11-27 | 2014-03-12 | 深圳翰宇药业股份有限公司 | 一种利那洛肽的制备方法 |
WO2016161983A1 (zh) * | 2015-04-10 | 2016-10-13 | 中国医学科学院药物研究所 | 一种融合载体蛋白及其在促进目的蛋白或多肽表达中的应用 |
CN106167514A (zh) * | 2016-08-29 | 2016-11-30 | 杭州湃肽生化科技有限公司 | 一种利那洛肽的合成和纯化方法 |
CN107532190A (zh) * | 2014-12-01 | 2018-01-02 | 菲尼克斯公司 | 用于肽生产的融合伴侣 |
CN110724187A (zh) * | 2018-07-16 | 2020-01-24 | 甘李药业股份有限公司 | 一种高效表达利拉鲁肽前体的重组工程菌及其应用 |
CN112876536A (zh) * | 2019-11-30 | 2021-06-01 | 康码(上海)生物科技有限公司 | 一种多肽标签及其在体外蛋白合成中的应用 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010005515A2 (en) * | 2008-06-30 | 2010-01-14 | Ironwood Pharmaceuticals Incorporated | Protein expression methods |
-
2022
- 2022-01-24 CN CN202210082694.2A patent/CN114350587B/zh active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101319222A (zh) * | 2007-06-06 | 2008-12-10 | 中国农业科学院饲料研究所 | 重组融合表达串联抗菌肽基因的方法 |
CN103626849A (zh) * | 2013-11-27 | 2014-03-12 | 深圳翰宇药业股份有限公司 | 一种利那洛肽的制备方法 |
CN107532190A (zh) * | 2014-12-01 | 2018-01-02 | 菲尼克斯公司 | 用于肽生产的融合伴侣 |
WO2016161983A1 (zh) * | 2015-04-10 | 2016-10-13 | 中国医学科学院药物研究所 | 一种融合载体蛋白及其在促进目的蛋白或多肽表达中的应用 |
CN106167514A (zh) * | 2016-08-29 | 2016-11-30 | 杭州湃肽生化科技有限公司 | 一种利那洛肽的合成和纯化方法 |
CN110724187A (zh) * | 2018-07-16 | 2020-01-24 | 甘李药业股份有限公司 | 一种高效表达利拉鲁肽前体的重组工程菌及其应用 |
CN112876536A (zh) * | 2019-11-30 | 2021-06-01 | 康码(上海)生物科技有限公司 | 一种多肽标签及其在体外蛋白合成中的应用 |
Non-Patent Citations (1)
Title |
---|
Insulin chains as efficient fusion tags for prokaryotic expression of short peptides;Ligang Deng et al.;《Protein Expression and Purification》;第138卷;第46-55页 * |
Also Published As
Publication number | Publication date |
---|---|
CN114350587A (zh) | 2022-04-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113105536B (zh) | 一种新甘精胰岛素原及其制备甘精胰岛素的方法 | |
CN111117977A (zh) | 一种重组多肽连接酶原及其制备、激活方法与应用 | |
WO2024087784A1 (zh) | 酵母重组xvii型人源化胶原蛋白及其制备方法 | |
CN113087804A (zh) | 一种二价植物免疫融合蛋白及其生产方法和应用 | |
CN114149954B (zh) | 利用谷氨酸棒状杆菌高效分泌生产类蛛丝、类弹性蛋白并快速纯化的方法 | |
CN113354745B (zh) | 一种组合物及规模化生产成纤维细胞生长因子的方法 | |
US20220411764A1 (en) | Thioredoxin mutant, preparation method thereof, and application thereof in production of recombinant fusion protein | |
EP4328316A1 (en) | Preparation method for polypeptide | |
CN114350587B (zh) | 一种基因重组串联表达利那洛肽的工程菌 | |
CN110938151B (zh) | 用于表达甲状旁腺素pth的融合蛋白及重组质粒、重组工程菌 | |
CN114507293B (zh) | 一种基因重组串联表达利那洛肽的融合蛋白及表达利那洛肽的方法 | |
CN110776569A (zh) | 一种具有粘附-抗冻双功能的二嵌段融合蛋白及合成方法和应用 | |
CN102898512B (zh) | 一种重组菌丝霉素及其制备方法和用途 | |
CN111019927B (zh) | 用于表达tev蛋白的重组质粒、重组工程菌,以及制备和纯化tev蛋白的方法 | |
CN110093394B (zh) | 一种蛋白包涵体及重组人β-神经生长因子的制备方法 | |
CN109971776A (zh) | 基于光切割基序的蛋白纯化方法 | |
CN113801239B (zh) | 多肽标签、高度可溶性的重组腈水解酶及其在医药化学品合成中的应用 | |
CN112029697B (zh) | 一株重组枯草芽孢杆菌及其应用 | |
CN113151343A (zh) | 一种酿酒酵母表达长效重组人egf-hsa融合蛋白及其标准品的制备方法 | |
CN102277327B (zh) | 过表达RimL的大肠杆菌及其在制备N-乙酰化胸腺素α中的应用 | |
CN111575314A (zh) | 尿激酶受体稳定突变体suPARcc在真核胞外蛋白表达中的应用 | |
CN113773392A (zh) | 一种甘精胰岛素的制备方法 | |
CN113801235A (zh) | 一种赖脯胰岛素衍生物及其应用 | |
US20230312668A1 (en) | Insulin aspart derivative, and preparation method therefor and use thereof | |
CN113773391B (zh) | 一种门冬胰岛素的制备方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |