CN110092835A - A kind of GLP-1 analog-COL3A1 fusion protein - Google Patents
- Publication number
- CN110092835A (application number CN201810089639.XA)
- Authority
- CN
- China
- Prior art keywords
- gly
- pro
- ala
- glu
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 94
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 92
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical class C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 title claims abstract description 75
- 101710198884 GATA-type zinc finger protein 1 Proteins 0.000 title abstract 2
- 102100025101 GATA-type zinc finger protein 1 Human genes 0.000 title 1
- 241000282414 Homo sapiens Species 0.000 claims abstract description 18
- 102000008186 Collagen Human genes 0.000 claims abstract description 12
- 108010035532 Collagen Proteins 0.000 claims abstract description 12
- 229920001436 collagen Polymers 0.000 claims abstract description 12
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 59
- 210000004027 cell Anatomy 0.000 claims description 52
- 229920001184 polypeptide Polymers 0.000 claims description 34
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 34
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 21
- 108091033319 polynucleotide Proteins 0.000 claims description 20
- 102000040430 polynucleotide Human genes 0.000 claims description 20
- 239000002157 polynucleotide Substances 0.000 claims description 20
- 206010012601 diabetes mellitus Diseases 0.000 claims description 16
- 239000003814 drug Substances 0.000 claims description 16
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 claims description 14
- 229940079593 drug Drugs 0.000 claims description 12
- 102000002734 Collagen Type VI Human genes 0.000 claims description 8
- 108010043741 Collagen Type VI Proteins 0.000 claims description 8
- 239000011717 all-trans-retinol Substances 0.000 claims description 7
- 235000019169 all-trans-retinol Nutrition 0.000 claims description 7
- 239000008194 pharmaceutical composition Substances 0.000 claims description 7
- 230000008901 benefit Effects 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 3
- 210000000349 chromosome Anatomy 0.000 claims description 2
- 239000003937 drug carrier Substances 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 230000002265 prevention Effects 0.000 claims description 2
- 229930182470 glycoside Natural products 0.000 claims 1
- 150000002338 glycosides Chemical class 0.000 claims 1
- 102100040918 Pro-glucagon Human genes 0.000 abstract description 33
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 abstract description 32
- 238000001727 in vivo Methods 0.000 abstract description 12
- 210000004369 blood Anatomy 0.000 abstract description 9
- 239000008280 blood Substances 0.000 abstract description 9
- 230000012666 negative regulation of transcription by glucose Effects 0.000 abstract description 2
- 230000009467 reduction Effects 0.000 abstract description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 56
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 50
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 39
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 37
- 108010029020 prolylglycine Proteins 0.000 description 37
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 30
- 102100031611 Collagen alpha-1(III) chain Human genes 0.000 description 29
- 101000993285 Homo sapiens Collagen alpha-1(III) chain Proteins 0.000 description 29
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 29
- 238000000034 method Methods 0.000 description 26
- 108090000623 proteins and genes Proteins 0.000 description 26
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 24
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 23
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 22
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 21
- 108010047495 alanylglycine Proteins 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 19
- 108010077515 glycylproline Proteins 0.000 description 18
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 16
- 108010079364 N-glycylalanine Proteins 0.000 description 15
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 15
- 235000001014 amino acid Nutrition 0.000 description 15
- 108010026333 seryl-proline Proteins 0.000 description 15
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 14
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 14
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 13
- 150000001413 amino acids Chemical class 0.000 description 13
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 13
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 12
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 12
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 11
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 11
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 11
- 108010092854 aspartyllysine Proteins 0.000 description 11
- 201000010099 disease Diseases 0.000 description 11
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 10
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 10
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 10
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 9
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 9
- 102000016622 Dipeptidyl Peptidase 4 Human genes 0.000 description 9
- 108010067722 Dipeptidyl Peptidase 4 Proteins 0.000 description 9
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 9
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 9
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 9
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 8
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 8
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 8
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 8
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 8
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 8
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 8
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 7
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 7
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 7
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 7
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 6
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 6
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 6
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 6
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 6
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 6
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 6
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 6
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 6
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 6
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 6
- 241000235058 Komagataella pastoris Species 0.000 description 6
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 6
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 6
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 108010004073 cysteinylcysteine Proteins 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- GCYXWQUSHADNBF-AAEALURTSA-N preproglucagon 78-108 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 GCYXWQUSHADNBF-AAEALURTSA-N 0.000 description 6
- 238000003259 recombinant expression Methods 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 5
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 5
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 5
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 5
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 5
- 108010086246 Glucagon-Like Peptide-1 Receptor Proteins 0.000 description 5
- 101800004266 Glucagon-like peptide 1(7-37) Proteins 0.000 description 5
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 5
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 5
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 5
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 5
- 239000004471 Glycine Substances 0.000 description 5
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 5
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 5
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 5
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 5
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 5
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 5
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 5
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 5
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 5
- 238000007796 conventional method Methods 0.000 description 5
- 108010020688 glycylhistidine Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 4
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 4
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 4
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 4
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 4
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 4
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 4
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 4
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 4
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 4
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 4
- 230000003914 insulin secretion Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 3
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 3
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 3
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 3
- 108010072062 GEKG peptide Proteins 0.000 description 3
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 3
- 102100032882 Glucagon-like peptide 1 receptor Human genes 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 3
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 3
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 3
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- 102000002068 Glycopeptides Human genes 0.000 description 3
- 108010015899 Glycopeptides Proteins 0.000 description 3
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- 108010019598 Liraglutide Proteins 0.000 description 3
- YSDQQAXHVYUZIW-QCIJIYAXSA-N Liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 229960002701 liraglutide Drugs 0.000 description 3
- 150000007523 nucleic acids Chemical group 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 229940124597 therapeutic agent Drugs 0.000 description 3
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 208000017667 Chronic Disease Diseases 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 102000002322 Egg Proteins Human genes 0.000 description 2
- 108010000912 Egg Proteins Proteins 0.000 description 2
- 108010011459 Exenatide Proteins 0.000 description 2
- HTQBXNHDCUEHJF-XWLPCZSASA-N Exenatide Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)NCC(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 HTQBXNHDCUEHJF-XWLPCZSASA-N 0.000 description 2
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 2
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- 102000007446 Glucagon-Like Peptide-1 Receptor Human genes 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- 206010018473 Glycosuria Diseases 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- 241001506991 Komagataella phaffii GS115 Species 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 2
- 208000008589 Obesity Diseases 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 231100000433 cytotoxic Toxicity 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 235000014103 egg white Nutrition 0.000 description 2
- 210000000969 egg white Anatomy 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 229960001519 exenatide Drugs 0.000 description 2
- 210000002744 extracellular matrix Anatomy 0.000 description 2
- 210000001035 gastrointestinal tract Anatomy 0.000 description 2
- 239000003292 glue Substances 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002218 hypoglycaemic effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 235000020824 obesity Nutrition 0.000 description 2
- 101710135378 pH 6 antigen Proteins 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 238000007086 side reaction Methods 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- BOVGTQGAOIONJV-BETUJISGSA-N 1-[(3ar,6as)-3,3a,4,5,6,6a-hexahydro-1h-cyclopenta[c]pyrrol-2-yl]-3-(4-methylphenyl)sulfonylurea Chemical compound C1=CC(C)=CC=C1S(=O)(=O)NC(=O)NN1C[C@H]2CCC[C@H]2C1 BOVGTQGAOIONJV-BETUJISGSA-N 0.000 description 1
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- 235000010894 Artemisia argyi Nutrition 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- DBLPNHGKMDHWNZ-UHFFFAOYSA-N Asp Gly Arg Asn Chemical compound OC(=O)CC(N)C(=O)NCC(=O)NC(CCCN=C(N)N)C(=O)NC(CC(N)=O)C(O)=O DBLPNHGKMDHWNZ-UHFFFAOYSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- 210000002237 B-cell of pancreatic islet Anatomy 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 102000001189 Cyclic Peptides Human genes 0.000 description 1
- 108010069514 Cyclic Peptides Proteins 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- RFHGRMMADHHQSA-KBIXCLLPSA-N Cys-Gln-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RFHGRMMADHHQSA-KBIXCLLPSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010042546 GCGGCCGC-specific type II deoxyribonucleases Proteins 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- 102000051325 Glucagon Human genes 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical group NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 208000013016 Hypoglycemia Diseases 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- 101710129616 Protein glp-1 Proteins 0.000 description 1
- 101100221606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS7 gene Proteins 0.000 description 1
- 102100037505 Secretin Human genes 0.000 description 1
- 108010086019 Secretin Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- 241000218636 Thuja Species 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- GQYPNFIFJRNDPY-ONUFPDRFSA-N Trp-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 GQYPNFIFJRNDPY-ONUFPDRFSA-N 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000002269 analeptic agent Substances 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 230000000636 anti-proteolytic effect Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 244000030166 artemisia Species 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000007640 basal medium Substances 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 230000007211 cardiovascular event Effects 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 210000000695 crystalline len Anatomy 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000002086 displacement chromatography Methods 0.000 description 1
- 238000001647 drug administration Methods 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000012631 food intake Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000030136 gastric emptying Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 208000004104 gestational diabetes Diseases 0.000 description 1
- 229960000346 gliclazide Drugs 0.000 description 1
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 210000003016 hypothalamus Anatomy 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 239000013028 medium composition Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229940057059 monascus purpureus Drugs 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000003538 oral antidiabetic agent Substances 0.000 description 1
- 229940127209 oral hypoglycaemic agent Drugs 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 210000000578 peripheral nerve Anatomy 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical group OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- OTYBMLCTZGSZBG-UHFFFAOYSA-L potassium sulfate Chemical compound [K+].[K+].[O-]S([O-])(=O)=O OTYBMLCTZGSZBG-UHFFFAOYSA-L 0.000 description 1
- 229910052939 potassium sulfate Inorganic materials 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 239000013558 reference substance Substances 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 102220117237 rs142486394 Human genes 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 229960002101 secretin Drugs 0.000 description 1
- OWMZNFCDEHGFEP-NFBCVYDUSA-N secretin human Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(N)=O)[C@@H](C)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)C1=CC=CC=C1 OWMZNFCDEHGFEP-NFBCVYDUSA-N 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 210000004885 white matter Anatomy 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Toxicology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Endocrinology (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention relates to a GLP-1 analog-COL3A1 fusion protein. Specifically, the fusion protein comprises a glucagon-like peptide 1 (GLP-1) analog fused to the human type III collagen α1 chain; it retains the blood-glucose-lowering effect of glucagon-like peptide 1 and has an extended in vivo half-life.
Description
Technical field
The present invention relates to the field of medicine, and more particularly to a GLP-1 analog-COL3A1 fusion protein.
Background art
Diabetes is a serious chronic disease characterized mainly by hyperglycemia together with insufficient secretion or loss of function of endogenous insulin. It can cause multiple complications affecting, for example, the vascular system, kidneys, retina, lens, peripheral nerves and skin, and thereby reduces life expectancy and quality of life. With socioeconomic development and rising living standards worldwide, the incidence and prevalence of diabetes are increasing year by year, and diabetes has become the third most serious chronic disease threatening human health, after cancer and cardiovascular and cerebrovascular disease. Diabetes is broadly divided into four classes: type I (insulin-dependent), type II (non-insulin-dependent), other types, and gestational diabetes. Type II diabetes is currently the most common form, accounting for about 90% of diabetic patients. It is a metabolic disease of complex etiology characterized by elevated blood glucose, and the number of type II diabetes patients in China has grown explosively over the past 20-plus years.
At present, type II diabetes is treated mainly with oral hypoglycemic agents, including sulfonylureas such as glibenclamide and gliclazide, as well as metformin and insulin. However, long-term use of these drugs can lead to tolerance, and they cannot control blood glucose or the disorder of cell function over the long term. It is therefore particularly important to develop safer and more effective new drugs that target the pathogenesis of diabetes.
Glucagon-like peptide 1 (GLP-1) is an incretin secreted by L cells of the gastrointestinal mucosa. Its action is blood-glucose-dependent: when the blood glucose concentration is above normal, GLP-1 promotes insulin secretion, and when the blood glucose concentration is normal this insulinotropic effect weakens, so exogenous GLP-1 treatment does not increase the risk of hypoglycemia. In addition, by binding to its receptor on pancreatic islet cells, GLP-1 increases insulin secretion and biosynthesis and effectively lowers blood glucose; it promotes islet β-cell proliferation, inhibits β-cell apoptosis and increases glucose-dependent insulin secretion; it slows gastrointestinal motility, delays gastric emptying and reduces food intake; and it acts on the hypothalamus to reduce appetite and thereby body weight. Because of these properties, GLP-1 has become a focus of development for new type II diabetes therapeutics.
Native GLP-1 is extremely unstable in vivo: after release it is rapidly degraded by dipeptidyl peptidase 4 (DPP-4), its in vivo half-life is only 1-2 min, and it is therefore not druggable. Exenatide, the world's first GLP-1 receptor agonist drug, came to market in 2005; it is a GLP-1 analog derived from lizard saliva and has 50% homology with human GLP-1. Liraglutide, marketed later, is a synthetic GLP-1 analog with 97% homology to human GLP-1, so its efficacy is improved and its side effects are substantially reduced compared with exenatide. Like exenatide, however, liraglutide requires daily subcutaneous injection, which remains a drawback in terms of ease of use. Structurally modifying GLP-1 to extend its in vivo half-life while retaining its biological effect has therefore become the main direction of GLP-1 drug development. Dulaglutide, a modified human GLP-1 joined to the Fc portion of human immunoglobulin IgG, exploits the long circulating half-life of IgG; its efficacy is not inferior to that of liraglutide and it can be administered once weekly, making it the best of the current products in its class. However, the IgG Fc segment in dulaglutide may have an ADCC (antibody-dependent cell-mediated cytotoxicity) effect, with potential immunogenicity and adverse reactions.
Therefore, there is a need in the art to develop a novel, safe drug that can treat diabetes effectively over the long term.
Summary of the invention
The object of the present invention is to provide a novel, safe drug that can treat diabetes effectively over the long term.
A first aspect of the present invention provides a fusion protein having the structure shown in Formula I:
A-L-B (I)
wherein A is a GLP-1 analog, B is the human type III collagen α1 chain, L is absent or a linker peptide, and each "-" is independently a linker peptide or a peptide bond; and
the GLP-1 analog is a polypeptide having the amino acid sequence shown in SEQ ID NO.:1:
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
wherein Xaa8 is Gly or Ala, Xaa22 is Glu or Gly, and Xaa37 is Gly or absent.
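The template above leaves three positions open, so it describes a small family of sequences rather than a single one. As a minimal illustrative sketch (the helper name `enumerate_variants` and the one-letter conversion table are assumptions introduced here, not part of the patent), the allowed combinations can be enumerated as follows:

```python
from itertools import product

# One-letter codes for the residues that occur in the SEQ ID NO.:1 template.
ONE_LETTER = {
    "Ala": "A", "Arg": "R", "Asp": "D", "Gln": "Q", "Glu": "E",
    "Gly": "G", "His": "H", "Ile": "I", "Leu": "L", "Lys": "K",
    "Phe": "F", "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y",
    "Val": "V",
}

# Template copied from the text; Xaa8, Xaa22 and Xaa37 are the variable positions.
TEMPLATE = (
    "His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-"
    "Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37"
)

# Allowed residues per variable position; None stands for "absent".
CHOICES = {
    "Xaa8": ("Gly", "Ala"),
    "Xaa22": ("Glu", "Gly"),
    "Xaa37": ("Gly", None),
}


def enumerate_variants(template=TEMPLATE, choices=CHOICES):
    """Return every one-letter sequence permitted by the template (hypothetical helper)."""
    tokens = template.split("-")
    # Fixed positions get a single-element tuple so product() treats them uniformly.
    slots = [choices.get(tok, (tok,)) for tok in tokens]
    return [
        "".join(ONE_LETTER[res] for res in combo if res is not None)
        for combo in product(*slots)
    ]


VARIANTS = enumerate_variants()

if __name__ == "__main__":
    for seq in VARIANTS:
        print(len(seq), seq)
```

Two choices at each of three positions give eight sequences, of 30 or 31 residues depending on whether Xaa37 is present.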
In another preferred embodiment, the GLP-1 analog has, at positions corresponding to the sequence shown in SEQ ID NO.:6, a mutation at an amino acid selected from the group consisting of: position 2 (Ala), position 16 (Gly), position 31 (Gly), or a combination thereof.
In another preferred embodiment, the GLP-1 analog has, at positions corresponding to the sequence shown in SEQ ID NO.:6, a mutation selected from: A2G, G16E, deletion of G31, or a combination thereof.
In another preferred embodiment, apart from said mutations (e.g. at amino acid positions 2, 16 and 31), the remaining amino acids of the GLP-1 analog are identical or substantially identical to the sequence shown in SEQ ID NO.:6.
In another preferred embodiment, the GLP-1 analog has the amino acid sequence shown in SEQ ID NO.:6 or 12.
In another preferred embodiment, the human type III collagen α1 chain (i.e. COL3A1) is the full-length COL3A1 protein or a fragment thereof, wherein the fragment is selected from the group consisting of: the COL3A1(598-896) fragment, the COL3A1(733-896) fragment, or a combination thereof.
In another preferred embodiment, the full-length COL3A1 protein has the amino acid sequence shown in SEQ ID NO.:2.
In another preferred embodiment, the COL3A1(598-896) fragment is the sequence of amino acids 598-896 of the COL3A1 protein.
In another preferred embodiment, the COL3A1(598-896) fragment has the amino acid sequence shown in SEQ ID NO.:3.
In another preferred embodiment, the COL3A1(733-896) fragment is the sequence of amino acids 733-896 of the COL3A1 protein.
In another preferred embodiment, the COL3A1(733-896) fragment has the amino acid sequence shown in SEQ ID NO.:4.
In another preferred embodiment, the human type III collagen α1 chain has the amino acid sequence shown in SEQ ID NO.:2, 3 or 4.
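The two fragments are defined by residue ranges within the full-length protein. The sketch below shows how such ranges translate into slices, assuming the positions are 1-based and inclusive, which is the usual convention but is not stated explicitly in the text. The function name `fragment` is hypothetical, and the input string is a synthetic placeholder, not the real COL3A1 sequence of SEQ ID NO.:2.

```python
def fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a protein, using 1-based inclusive positions."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"range {start}-{end} is outside a sequence of length {len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# Synthetic stand-in for the full-length chain (assumption: NOT the real SEQ ID NO.:2).
PLACEHOLDER_COL3A1 = "GPP" * 400

FRAG_598_896 = fragment(PLACEHOLDER_COL3A1, 598, 896)
FRAG_733_896 = fragment(PLACEHOLDER_COL3A1, 733, 896)

if __name__ == "__main__":
    print(len(FRAG_598_896), len(FRAG_733_896))
```

Under this convention the 598-896 range covers 299 residues and the 733-896 range covers 164, and the shorter fragment is the C-terminal part of the longer one.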
In another preferred embodiment, the linker peptide is a polypeptide consisting of n repeats of the sequence shown in SEQ ID NO.:5 (Gly-Gly-Gly-Gly-Ser) with one alanine (Ala) attached at its C-terminus, wherein n is 2-6, preferably 2.
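The linker rule and the A-L-B layout of Formula I can be sketched as simple string operations. This is only an illustration under stated assumptions: `build_linker` and `assemble_fusion` are hypothetical names, and the output is not claimed to reproduce any of the numbered sequences in the sequence listing (such as SEQ ID NO.:7, 8, 9 or 13).

```python
def build_linker(n: int = 2, unit: str = "GGGGS", c_terminal: str = "A") -> str:
    """Build n repeats of the Gly-Gly-Gly-Gly-Ser unit followed by one C-terminal Ala."""
    if not 2 <= n <= 6:
        raise ValueError("the text gives n as 2-6")
    return unit * n + c_terminal


def assemble_fusion(a: str, b: str, linker: str = "") -> str:
    """Concatenate A, optional linker L and B in the N-to-C order of Formula I."""
    return a + linker + b


if __name__ == "__main__":
    linker = build_linker(2)
    # "HG" and "GP" are dummy stand-ins for the GLP-1 analog and the collagen part.
    print(linker, assemble_fusion("HG", "GP", linker))
```

With the preferred n of 2 the rule yields an 11-residue linker; passing an empty linker corresponds to the case in which L is absent and A and B are joined directly by a peptide bond.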
In another preferred embodiment, the linker peptide has the amino acid sequence shown in SEQ ID NO.:7.
In another preferred embodiment, the fusion protein has the amino acid sequence shown in SEQ ID NO.:8, 9 or 13.
A second aspect of the present invention provides an oligomer comprising the fusion protein of the first aspect of the present invention.
In another preferred embodiment, the oligomer is a dimer, trimer, tetramer or pentamer.
In another preferred embodiment, the oligomer is a dimer of the fusion protein of the first aspect of the present invention.
A third aspect of the present invention provides an isolated polynucleotide encoding the fusion protein of the first aspect of the present invention.
In another preferred embodiment, the coding sequence of the GLP-1 analog is as shown at positions 1-90 of SEQ ID NO.:10.
In another preferred embodiment, the coding sequence of the COL3A1(598-896) fragment is as shown at positions 139-1035 of SEQ ID NO.:10.
In another preferred embodiment, the coding sequence of the COL3A1(733-896) fragment is as shown at positions 139-630 of SEQ ID NO.:11.
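The nucleotide ranges quoted above can be checked for consistency with the residue ranges of the fragments. The sketch assumes that all coordinates are 1-based and inclusive and that each range contains whole codons only; `span_length` and `codon_count` are hypothetical helper names.

```python
def span_length(start: int, end: int) -> int:
    """Number of positions in a 1-based inclusive range."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Number of complete codons in a nucleotide range; rejects partial codons."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a whole number of codons")
    return length // 3


# Nucleotide ranges as quoted in the text, paired with the residue ranges they should encode.
GLP1_CODONS = codon_count(1, 90)
COL_598_896_CODONS = codon_count(139, 1035)
COL_733_896_CODONS = codon_count(139, 630)

if __name__ == "__main__":
    print(GLP1_CODONS, COL_598_896_CODONS, span_length(598, 896))
    print(COL_733_896_CODONS, span_length(733, 896))
```

Under these assumptions the figures agree: nucleotides 139-1035 span as many codons as there are residues in 598-896, nucleotides 139-630 match residues 733-896, and the 30 codons in positions 1-90 would fit the form of SEQ ID NO.:1 in which Xaa37 is absent.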
In another preferred embodiment, the polynucleotide has the sequence shown in SEQ ID NO.:10 or 11.
A fourth aspect of the present invention provides a vector comprising the polynucleotide of the third aspect of the present invention.
In another preferred embodiment, the vector is selected from the group consisting of: DNA, RNA, a plasmid, a lentiviral vector, an adenoviral vector, a retroviral vector, a transposon, or a combination thereof.
In another preferred embodiment, the vector is a plasmid, preferably the pUC57 plasmid.
A fifth aspect of the present invention provides a host cell that contains the vector of the fourth aspect of the present invention, or has the exogenous polynucleotide of the third aspect of the present invention integrated into its chromosome, or expresses the fusion protein of the first aspect of the present invention or the oligomer of the second aspect of the present invention.
In another preferred embodiment, the host cell is a yeast, preferably Pichia pastoris, more preferably Pichia pastoris GS115 cells.
A sixth aspect of the present invention provides a pharmaceutical composition comprising the fusion protein of the first aspect of the present invention or the oligomer of the second aspect of the present invention, and a pharmaceutically acceptable carrier or excipient.
In another preferred embodiment, the pharmaceutical composition is for treating non-insulin-dependent diabetes mellitus or a related disease.
A seventh aspect of the present invention provides use of the fusion protein of the first aspect of the present invention, the oligomer of the second aspect, the polynucleotide of the third aspect, the vector of the fourth aspect or the host cell of the fifth aspect for preparing a drug or formulation for preventing and/or treating diabetes.
In another preferred embodiment, the diabetes is non-insulin-dependent diabetes mellitus or a related disease.
An eighth aspect of the present invention provides use of the fusion protein of the first aspect of the present invention, the oligomer of the second aspect or the pharmaceutical composition of the sixth aspect for preventing and/or treating diabetes, preferably non-insulin-dependent diabetes mellitus or a related disease.
A ninth aspect of the present invention provides a method for treating a disease, comprising administering to a subject in need of treatment an appropriate amount of the fusion protein of the first aspect of the present invention, the oligomer of the second aspect or the pharmaceutical composition of the sixth aspect.
In another preferred embodiment, the disease is non-insulin-dependent diabetes mellitus or a related disease.
It should be understood that, within the scope of the present invention, the technical features described above and the technical features specifically described below (e.g. in the examples) may be combined with one another to form new or preferred technical solutions. For reasons of space, they are not described one by one here.
Description of the drawings
Fig. 1 shows the construction map of the pPic9m-GLP-COL-1 expression plasmid.
Fig. 2 shows the construction map of the pPic9m-GLP-COL-2 plasmid.
Fig. 3 shows the electrophoretic purity analysis of the purified GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896 fusion proteins.
Fig. 4 shows the GLP-1 receptor (GLP-1R) activation assay of the GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896 fusion proteins.
Fig. 5 shows the in vivo half-life analysis of the GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896 fusion proteins.
Detailed description of the embodiments
After extensive and in-depth research, the present inventors unexpectedly obtained a safe and long-acting drug for treating non-insulin-dependent diabetes mellitus. The drug is a GLP-1 analog-COL3A1 fusion protein, i.e. a fusion protein comprising a glucagon-like peptide 1 analog and the human type III collagen α1 chain. The fusion protein retains the blood-glucose-lowering effect of glucagon-like peptide 1 and has an extended in vivo half-life. The present invention also mutates the glucagon-like peptide 1 analog, which reduces its sensitivity to hydrolysis by DPP-4, improves its activity, and reduces its immunogenicity. The present invention further screens out two human COL3A1 fragments (shown in SEQ ID NO.:3 and SEQ ID NO.:4) that retain the ability of human COL3A1 to form homotrimers, while facilitating recombinant expression of the heterologous fusion protein and avoiding the expression difficulties caused by an overly long amino acid sequence. The fusion protein of the present invention can be used for treating type II diabetes and various related conditions. The present invention has been completed on this basis.
Fusion protein
As used herein, "fusion protein of the present invention", "recombinant fusion protein" and "polypeptide" all refer to the fusion protein of the first aspect of the present invention. The structure of the fusion protein of the present invention is shown in Formula I:
A-L-B (I)
In the formula, A is a GLP-1 analog, B is the human type III collagen α1 chain, L is absent or a linker peptide, and each "-" is independently a linker peptide or a peptide bond; and
the GLP-1 analog is a polypeptide having the amino acid sequence shown in SEQ ID NO.:1:
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
wherein Xaa8 is Gly or Ala, Xaa22 is Glu or Gly, and Xaa37 is Gly or absent.
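As an illustration only, the eight sequences covered by SEQ ID NO.:1 can be enumerated with a short Python sketch. The template string and the allowed residues are taken from the text above; the function and variable names are hypothetical and not part of the disclosure.

```python
from itertools import product

# Three-letter to one-letter codes for the residues that occur in SEQ ID NO.:1.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asp": "D", "Gln": "Q", "Glu": "E", "Gly": "G",
    "His": "H", "Ile": "I", "Leu": "L", "Lys": "K", "Phe": "F", "Ser": "S",
    "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID NO.:1 as written in the text, with its three variable positions.
TEMPLATE = (
    "His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-"
    "Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37"
)

# Allowed residues per variable position; None means the residue is absent.
OPTIONS = {"Xaa8": ("Gly", "Ala"), "Xaa22": ("Glu", "Gly"), "Xaa37": ("Gly", None)}


def glp1_analogs():
    """Yield (choices, one-letter sequence) for every allowed combination."""
    names = list(OPTIONS)
    for combo in product(*(OPTIONS[name] for name in names)):
        choice = dict(zip(names, combo))
        residues = []
        for token in TEMPLATE.split("-"):
            residue = choice.get(token, token)  # fixed residues map to themselves
            if residue is not None:             # skip an absent Xaa37
                residues.append(THREE_TO_ONE[residue])
        yield choice, "".join(residues)


# Map each one-letter sequence to the choices that produced it.
variants = {seq: choice for choice, seq in glp1_analogs()}
```

With Xaa8 = Gly, Xaa22 = Glu and Xaa37 absent, the sketch yields the 30-residue analog that also appears at the start of the one-letter fusion protein sequences given later in the description.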
As used herein, the term "fusion protein" also includes variant forms of the sequences SEQ ID NO.:8, 9 or 13 that have the above activity. These variant forms include (but are not limited to): deletion, insertion and/or substitution of 1-3 (usually 1-2, more preferably 1) amino acids, and addition or deletion of one or several (usually up to 3, preferably up to 2, more preferably 1) amino acids at the C-terminus and/or N-terminus. For example, in the art, substitution with amino acids of similar properties usually does not change the function of a protein. As another example, adding or deleting one or several amino acids at the C-terminus and/or N-terminus generally does not change the structure or function of a protein. In addition, the term includes the polypeptide of the present invention in monomeric and multimeric forms, as well as linear and non-linear polypeptides (such as cyclic peptides).
The present invention also includes active fragments, derivatives and analogs of the above fusion protein. As used herein, the terms "fragment", "derivative" and "analog" refer to polypeptides that substantially retain the function or activity of the fusion protein of the present invention. A polypeptide fragment, derivative or analog of the present invention may be (i) a polypeptide in which one or several conservative or non-conservative amino acid residues (preferably conservative amino acid residues) are substituted, or (ii) a polypeptide having a substituent group on one or more amino acid residues, or (iii) a polypeptide formed by fusing the antigenic peptide with another compound (for example a compound that extends the half-life of the polypeptide, such as polyethylene glycol), or (iv) a polypeptide formed by fusing an additional amino acid sequence to the polypeptide sequence (a fusion protein formed with a leader sequence, a secretory sequence or a tag sequence such as 6His). In light of the teachings herein, these fragments, derivatives and analogs are within the knowledge of those skilled in the art.
A preferred active derivative is a polypeptide in which, compared with the amino acid sequence of Formula I, at most 3, preferably at most 2, more preferably at most 1 amino acid is replaced by an amino acid with similar properties. These conservative variant polypeptides are preferably produced by amino acid substitutions according to Table A.
Table A
Initial residue | Representative substitution | Preferred substitution |
Ala(A) | Val;Leu;Ile | Val |
Arg(R) | Lys;Gln;Asn | Lys |
Asn(N) | Gln;His;Lys;Arg | Gln |
Asp(D) | Glu | Glu |
Cys(C) | Ser | Ser |
Gln(Q) | Asn | Asn |
Glu(E) | Asp | Asp |
Gly(G) | Pro;Ala | Ala |
His(H) | Asn;Gln;Lys;Arg | Arg |
Ile(I) | Leu;Val;Met;Ala;Phe | Leu |
Leu(L) | Ile;Val;Met;Ala;Phe | Ile |
Lys(K) | Arg;Gln;Asn | Arg |
Met(M) | Leu;Phe;Ile | Leu |
Phe(F) | Leu;Val;Ile;Ala;Tyr | Leu |
Pro(P) | Ala | Ala |
Ser(S) | Thr | Thr |
Thr(T) | Ser | Ser |
Trp(W) | Tyr;Phe | Tyr |
Tyr(Y) | Trp;Phe;Thr;Ser | Phe |
Val(V) | Ile;Leu;Met;Phe;Ala | Leu |
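The "preferred substitution" column of Table A can be read as a lookup table. The following Python sketch, given for illustration only, encodes that column in one-letter codes and checks whether a candidate sequence differs from a parent sequence by at most three such substitutions. The function names are hypothetical, and the check deliberately ignores the wider "representative substitution" column.

```python
# Preferred substitutions from Table A, written in one-letter codes.
PREFERRED = {
    "A": "V", "R": "K", "N": "Q", "D": "E", "C": "S", "Q": "N", "E": "D",
    "G": "A", "H": "R", "I": "L", "L": "I", "K": "R", "M": "L", "F": "L",
    "P": "A", "S": "T", "T": "S", "W": "Y", "Y": "F", "V": "L",
}


def single_preferred_variants(seq):
    """Return every sequence that differs from seq by one preferred substitution."""
    return [seq[:i] + PREFERRED[aa] + seq[i + 1:] for i, aa in enumerate(seq)]


def is_conservative_variant(parent, candidate, max_subs=3):
    """True if candidate differs from parent only by at most max_subs
    preferred substitutions of Table A (no insertions or deletions)."""
    if len(parent) != len(candidate):
        return False
    diffs = [(p, c) for p, c in zip(parent, candidate) if p != c]
    return len(diffs) <= max_subs and all(PREFERRED[p] == c for p, c in diffs)
```

For example, `single_preferred_variants("GS")` returns `["AS", "GT"]`, following the Gly to Ala and Ser to Thr rows of the table.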
The present invention also provides analogs of the fusion protein of the present invention. These analogs may differ from the polypeptide shown in SEQ ID NO.:8, 9 or 13 in amino acid sequence, in modifications that do not affect the sequence, or in both. Analogs also include those having residues different from natural L-amino acids (such as D-amino acids) and those having non-naturally occurring or synthetic amino acids (such as β- and γ-amino acids). It should be understood that the polypeptides of the present invention are not limited to the representative polypeptides listed above.
Modified forms (usually without changes to the primary structure) include chemically derivatized forms of the polypeptide in vivo or in vitro, such as acetylation or carboxylation. Modifications also include glycosylation, such as polypeptides produced by glycosylation during synthesis and processing of the polypeptide or during further processing steps. Such modification can be accomplished by exposing the polypeptide to an enzyme that performs glycosylation (such as a mammalian glycosylase or deglycosylase). Modified forms also include sequences having phosphorylated amino acid residues (such as phosphotyrosine, phosphoserine, phosphothreonine). Also included are polypeptides modified to improve their resistance to proteolysis or to optimize their solubility.
The compound of the present invention comprises a heterologous fusion protein, wherein the first polypeptide is a GLP-1 analog whose sequence is selected from SEQ ID NO.:1:
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
wherein Xaa8 is Gly or Ala;
wherein Xaa22 is Glu or Gly;
wherein Xaa37 is Gly or absent.
The second polypeptide is the full-length human type III collagen α1 chain (i.e. COL3A1) or a fragment thereof, whose sequence is selected from:
(a) full-length COL3A1 (SEQ ID NO.:2)
Met-Met-Ser-Phe-Val-Gln-Lys-Gly-Ser-Trp-Leu-Leu-Leu-Ala-Leu-Leu-His-
Pro-Thr-Ile-Ile-Leu-Ala-Gln-Gln-Glu-Ala-Val-Glu-Gly-Gly-Cys-Ser-His-Leu-Gly-
Gln-Ser-Tyr-Ala-Asp-Arg-Asp-Val-Trp-Lys-Pro-Glu-Pro-Cys-Gln-Ile-Cys-Val-
Cys-Asp-Ser-Gly-Ser-Val-Leu-Cys-Asp-Asp-Ile-Ile-Cys-Asp-Asp-Gln-Glu-Leu-Asp-
Cys-Pro-Asn-Pro-Glu-Ile-Pro-Phe-Gly-Glu-Cys-Cys-Ala-Val-Cys-Pro-Gln-Pro-Pro-
Thr-Ala-Pro-Thr-Arg-Pro-Pro-Asn-Gly-Gln-Gly-Pro-Gln-Gly-Pro-Lys-Gly-Asp-Pro-
Gly-Pro-Pro-Gly-Ile-Pro-Gly-Arg-Asn-Gly-Asp-Pro-Gly-Ile-Pro-Gly-Gln-Pro-Gly-
Ser-Pro-Gly-Ser-Pro-Gly-Pro-Pro-Gly-Ile-Cys-Glu-Ser-Cys-Pro-Thr-Gly-Pro-Gln-
Asn-Tyr-Ser-Pro-Gln-Tyr-Asp-Ser-Tyr-Asp-Val-Lys-Ser-Gly-Val-Ala-Val-Gly-
Gly-Leu-Ala-Gly-Tyr-Pro-Gly-Pro-Ala-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Pro-Pro-
Gly-Thr-Ser-Gly-His-Pro-Gly-Ser-Pro-Gly-Ser-Pro-Gly-Tyr-Gln-Gly-Pro-Pro-
Gly-Glu-Pro-Gly-Gln-Ala-Gly-Pro-Ser-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Ile-Gly-
Pro-Ser-Gly-Pro-Ala-Gly-Lys-Asp-Gly-Glu-Ser-Gly-Arg-Pro-Gly-Arg-Pro-Gly-Glu-
Arg-Gly-Leu-Pro-Gly-Pro-Pro-Gly-Ile-Lys-Gly-Pro-Ala-Gly-Ile-Pro-Gly-Phe-Pro-
Gly-Met-Lys-Gly-His-Arg-Gly-Phe-Asp-Gly-Arg-Asn-Gly-Glu-Lys-Gly-Glu-Thr-Gly-
Ala-Pro-Gly-Leu-Lys-Gly-Glu-Asn-Gly-Leu-Pro-Gly-Glu-Asn-Gly-Ala-Pro-Gly-Pro-
Met-Gly-Pro-Arg-Gly-Ala-Pro-Gly-Glu-Arg-Gly-Arg-Pro-Gly-Leu-Pro-Gly-Ala-Ala-
Gly-Ala-Arg-Gly-Asn-Asp-Gly-Ala-Arg-Gly-Ser-Asp-Gly-Gln-Pro-Gly-Pro-Pro-Gly-
Pro-Pro-Gly-Thr-Ala-Gly-Phe-Pro-Gly-Ser-Pro-Gly-Ala-Lys-Gly-Glu-Val-Gly-Pro-
Ala-Gly-Ser-Pro-Gly-Ser-Asn-Gly-Ala-Pro-Gly-Gln-Arg-Gly-Glu-Pro-Gly-Pro-Gln-
Gly-His-Ala-Gly-Ala-Gln-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ile-Asn-Gly-Ser-Pro-Gly-
Gly-Lys-Gly-Glu-Met-Gly-Pro-Ala-Gly-Ile-Pro-Gly-Ala-Pro-Gly-Leu-Met-Gly-Ala-
Arg-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Ala-Asn-Gly-Ala-Pro-Gly-Leu-Arg-Gly-Gly-Ala-
Gly-Glu-Pro-Gly-Lys-Asn-Gly-Ala-Lys-Gly-Glu-Pro-Gly-Pro-Arg-Gly-Glu-Arg-Gly-
Glu-Ala-Gly-Ile-Pro-Gly-Val-Pro-Gly-Ala-Lys-Gly-Glu-Asp-Gly-Lys-Asp-Gly-Ser-
Pro-Gly-Glu-Pro-Gly-Ala-Asn-Gly-Leu-Pro-Gly-Ala-Ala-Gly-Glu-Arg-Gly-Ala-Pro-
Gly-Phe-Arg-Gly-Pro-Ala-Gly-Pro-Asn-Gly-Ile-Pro-Gly-Glu-Lys-Gly-Pro-Ala-Gly-
Glu-Arg-Gly-Ala-Pro-Gly-Pro-Ala-Gly-Pro-Arg-Gly-Ala-Ala-Gly-Glu-Pro-Gly-Arg-
Asp-Gly-Val-Pro-Gly-Gly-Pro-Gly-Met-Arg-Gly-Met-Pro-Gly-Ser-Pro-Gly-Gly-Pro-
Gly-Ser-Asp-Gly-Lys-Pro-Gly-Pro-Pro-Gly-Ser-Gln-Gly-Glu-Ser-Gly-Arg-Pro-Gly-
Pro-Pro-Gly-Pro-Ser-Gly-Pro-Arg-Gly-Gln-Pro-Gly-Val-Met-Gly-Phe-Pro-Gly-Pro-
Lys-Gly-Asn-Asp-Gly-Ala-Pro-Gly-Lys-Asn-Gly-Glu-Arg-Gly-Gly-Pro-Gly-Gly-Pro-
Gly-Pro-Gln-Gly-Pro-Pro-Gly-Lys-Asn-Gly-Glu-Thr-Gly-Pro-Gln-Gly-Pro-Pro-Gly-
Pro-Thr-Gly-Pro-Gly-Gly-Asp-Lys-Gly-Asp-Thr-Gly-Pro-Pro-Gly-Pro-Gln-Gly-
Leu-Gln-Gly-Leu-Pro-Gly-Thr-Gly-Gly-Pro-Pro-Gly-Glu-Asn-Gly-Lys-Pro-Gly-Glu-
Pro-Gly-Pro-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Ala-Pro-Gly-Gly-Lys-Gly-Asp-Ala-
Gly-Ala-Pro-Gly-Glu-Arg-Gly-Pro-Pro-Gly-Leu-Ala-Gly-Ala-Pro-Gly-Leu-Arg-Gly-
Gly-Ala-Gly-Pro-Pro-Gly-Pro-Glu-Gly-Gly-Lys-Gly-Ala-Ala-Gly-Pro-Pro-Gly-Pro-
Pro-Gly-Ala-Ala-Gly-Thr-Pro-Gly-Leu-Gln-Gly-Met-Pro-Gly-Glu-Arg-Gly-Gly-Leu-
Gly-Ser-Pro-Gly-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-Gly-Ala-Asp-Gly-
Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro-Thr-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-
Ala-Gly-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-Pro-Gly-Ile-Ala-
Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu-Thr-Gly-Pro-Pro-Gly-Pro-Ala-Gly-
Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-Glu-Arg-Gly-Ala-
Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-Pro-Gly-Lys-Asp-
Gly-Thr-Ser-Gly-His-Pro-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Arg-Gly-Asn-Arg-Gly-
Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-Pro-Pro-Gly-Pro-
Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly-Val-Gly-Ala-Ala-Ala-Ile-Ala-Gly-Ile-
Gly-Gly-Glu-Lys-Ala-Gly-Gly-Phe-Ala-Pro-Tyr-Tyr-Gly-Asp-Glu-Pro-Met-Asp-
Phe-Lys-Ile-Asn-Thr-Asp-Glu-Ile-Met-Thr-Ser-Leu-Lys-Ser-Val-Asn-Gly-Gln-
Ile-Glu-Ser-Leu-Ile-Ser-Pro-Asp-Gly-Ser-Arg-Lys-Asn-Pro-Ala-Arg-Asn-Cys-Arg-
Asp-Leu-Lys-Phe-Cys-His-Pro-Glu-Leu-Lys-Ser-Gly-Glu-Tyr-Trp-Val-Asp-Pro-
Asn-Gln-Gly-Cys-Lys-Leu-Asp-Ala-Ile-Lys-Val-Phe-Cys-Asn-Met-Glu-Thr-Gly-
Glu-Thr-Cys-Ile-Ser-Ala-Asn-Pro-Leu-Asn-Val-Pro-Arg-Lys-His-Trp-Trp-Thr-
Asp-Ser-Ser-Ala-Glu-Lys-Lys-His-Val-Trp-Phe-Gly-Glu-Ser-Met-Asp-Gly-Gly-Phe-
Gln-Phe-Ser-Tyr-Gly-Asn-Pro-Glu-Leu-Pro-Glu-Asp-Val-Leu-Asp-Val-Gln-Leu-Ala-
Phe-Leu-Arg-Leu-Leu-Ser-Ser-Arg-Ala-Ser-Gln-Asn-Ile-Thr-Tyr-His-Cys-Lys-
Asn-Ser-Ile-Ala-Tyr-Met-Asp-Gln-Ala-Ser-Gly-Asn-Val-Lys-Lys-Ala-Leu-Lys-Leu-
Met-Gly-Ser-Asn-Glu-Gly-Glu-Phe-Lys-Ala-Glu-Gly-Asn-Ser-Lys-Phe-Thr-Tyr-
Thr-Val-Leu-Glu-Asp-Gly-Cys-Thr-Lys-His-Thr-Gly-Glu-Trp-Ser-Lys-Thr-Val-
Phe-Glu-Tyr-Arg-Thr-Arg-Lys-Ala-Val-Arg-Leu-Pro-Ile-Val-Asp-Ile-Ala-Pro-
Tyr-Asp-Ile-Gly-Gly-Pro-Asp-Gln-Glu-Phe-Gly-Val-Asp-Val-Gly-Pro-Val-Cys-Phe-
Leu
(b) COL3A1 (residues 598-896) (SEQ ID NO.:3)
Gly-Pro-Gly-Gly-Pro-Gly-Pro-Gln-Gly-Pro-Pro-Gly-Lys-Asn-Gly-Glu-Thr-
Gly-Pro-Gln-Gly-Pro-Pro-Gly-Pro-Thr-Gly-Pro-Gly-Gly-Asp-Lys-Gly-Asp-Thr-Gly-
Pro-Pro-Gly-Pro-Gln-Gly-Leu-Gln-Gly-Leu-Pro-Gly-Thr-Gly-Gly-Pro-Pro-Gly-Glu-
Asn-Gly-Lys-Pro-Gly-Glu-Pro-Gly-Pro-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Ala-Pro-
Gly-Gly-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Glu-Arg-Gly-Pro-Pro-Gly-Leu-Ala-Gly-
Ala-Pro-Gly-Leu-Arg-Gly-Gly-Ala-Gly-Pro-Pro-Gly-Pro-Glu-Gly-Gly-Lys-Gly-Ala-
Ala-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Ala-Gly-Thr-Pro-Gly-Leu-Gln-Gly-Met-Pro-
Gly-Glu-Arg-Gly-Gly-Leu-Gly-Ser-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-
Gly-Ala-Asp-Gly-Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro-Thr-Gly-Pro-Ile-Gly-
Pro-Pro-Gly-Pro-Ala-Gly-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-
Pro-Gly-Ile-Ala-Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu-Thr-Gly-Pro-Pro-
Gly-Pro-Ala-Gly-Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-
Glu-Arg-Gly-Ala-Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-
Pro-Gly-Lys-Asp-Gly-Thr-Ser-Gly-His-Pro-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Arg-
Gly-Asn-Arg-Gly-Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-
Pro-Pro-Gly-Pro-Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly
(c) COL3A1 (residues 733-896) (SEQ ID NO.:4)
Gly-Leu-Gly-Ser-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-Gly-Ala-
Asp-Gly-Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro-Thr-Gly-Pro-Ile-Gly-Pro-Pro-
Gly-Pro-Ala-Gly-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-Pro-Gly-
Ile-Ala-Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu-Thr-Gly-Pro-Pro-Gly-Pro-
Ala-Gly-Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-Glu-Arg-
Gly-Ala-Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-Pro-Gly-
Lys-Asp-Gly-Thr-Ser-Gly-His-Pro-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Arg-Gly-Asn-
Arg-Gly-Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-Pro-Pro-
Gly-Pro-Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly
The C-terminus of the GLP-1 analog polypeptide of the heterologous fusion protein of the present invention is preferably fused to the N-terminus of the human COL3A1 fragment through a G-rich peptide linker (i.e. linker peptide), wherein the peptide linker has the sequence [Gly-Gly-Gly-Gly-Ser (SEQ ID NO.:5)]n-Ala, wherein n is 2-6, preferably 2.
The heterologous fusion protein of the present invention comprises a GLP-1 analog portion and a human COL3A1 fragment portion. Through partial substitution of the native GLP-1 sequence and fusion with a human COL3A1 fragment, the fusion protein retains native GLP-1 activity while forming stable oligomers through the human COL3A1 region, which increases the in vivo stability of the fusion protein.
Native GLP-1 is processed in vivo into the active fragment spanning amino acids 7-37. Therefore, following the convention in the art, the amino terminus of GLP-1 is designated as position 7 and the carboxyl terminus as position 37. The other amino acids in the polypeptide are numbered consecutively, as shown in SEQ ID NO.:6.
7His-Ala-Glu-10Gly-Thr-Phe-Thr-Ser-15Asp-Val-Ser-Ser-Tyr-20Leu-Glu-Gly-
Gln-Ala-25Ala-Lys-Glu-Phe-Ile-30Ala-Trp-Leu-Val-Lys-35Gly-Arg-37Gly
(SEQ ID NO:6)
Relative to native GLP-1 (7-37), the GLP-1 analog portion of the heterologous fusion protein includes three primary modifications, at positions 8, 22 and 37. Endogenous dipeptidyl peptidase 4 (DPP-4) cleaves native GLP-1 between Ala at position 8 and Glu at position 9, producing the inactive GLP-1 (9-37) fragment; replacing position 8 with Gly reduces the sensitivity of the GLP-1 analog to DPP-4 hydrolysis. The substitution at position 22 improves the activity of the GLP-1 analog. Removal of position 37 reduces the immunogenicity of the fusion protein. The mutated sequence is shown in SEQ ID NO.:12.
7His-Gly-Glu-10Gly-Thr-Phe-Thr-Ser-15Asp-Val-Ser-Ser-Tyr-20Leu-Glu-Glu-
Gln-Ala-25Ala-Lys-Glu-Phe-Ile-30Ala-Trp-Leu-Val-Lys-35Gly-Arg(SEQ ID NO.:12)
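The position-7 numbering convention is easy to get wrong by an offset, so the following Python sketch, given for illustration only, applies the three changes to native GLP-1 (7-37) using that numbering. The native sequence is SEQ ID NO.:6 in one-letter code; the helper name `apply_changes` is hypothetical.

```python
NATIVE_GLP1_7_37 = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG"  # SEQ ID NO.:6, one-letter code
FIRST_POSITION = 7  # by convention the N-terminal His is residue 7


def apply_changes(seq, substitutions, deletions=()):
    """Apply changes expressed in GLP-1 numbering (first residue = position 7).

    substitutions maps position -> (expected_old_residue, new_residue);
    deletions lists positions whose residues are removed.
    """
    residues = {FIRST_POSITION + i: aa for i, aa in enumerate(seq)}
    for pos, (old, new) in substitutions.items():
        if residues[pos] != old:
            raise ValueError(f"position {pos} is {residues[pos]}, not {old}")
        residues[pos] = new
    for pos in deletions:
        del residues[pos]
    return "".join(residues[pos] for pos in sorted(residues))


# Ala8 -> Gly, Gly22 -> Glu, Gly37 removed.
analog = apply_changes(NATIVE_GLP1_7_37, {8: ("A", "G"), 22: ("G", "E")}, deletions=(37,))

# DPP-4 cleaves between positions 8 and 9, leaving GLP-1 (9-37).
dpp4_product = NATIVE_GLP1_7_37[9 - FIRST_POSITION:]
```

The resulting `analog` is the one-letter form of SEQ ID NO.:12, and `dpp4_product` is the 29-residue fragment that starts at Glu9.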
The heterologous fusion protein of the present invention contains human COL3A1 or a fragment thereof. In terms of molecular structure, type III collagen is composed of parallel linear chains; each is a very strong right-handed triple helix formed by three left-handed twisted α1 chains held tightly together by interchain interactions. Each type III collagen α1 chain consists of more than 300 Gly-X-Y triplet repeats, and this triplet structure is the key to homotrimer formation by the type III collagen α1 chain. Therefore, on the basis of full-length human COL3A1 (SEQ ID NO.:2), and in order to avoid the recombinant expression difficulties caused by an overly long amino acid sequence, the present invention prefers two human COL3A1 fragments, whose sequences are shown in SEQ ID NO.:3 and SEQ ID NO.:4 respectively. The two fragments contain Gly-X-Y triplet domains of different lengths, retaining the ability of human COL3A1 to form homotrimers while facilitating recombinant expression of the heterologous fusion protein of the present invention.
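As a rough illustration of what a "Gly-X-Y triplet domain" means at the sequence level, the following Python sketch finds the longest stretch of consecutive triplets that each begin with Gly in a one-letter sequence. It is a simple heuristic written for this description, not a method disclosed in the patent, and the function name is hypothetical.

```python
def longest_gxy_run(seq):
    """Return (start_index, triplet_count) of the longest run of consecutive
    Gly-X-Y triplets, i.e. triplets whose first residue is 'G'.

    Every start index is tried so that the best reading frame is found;
    an incomplete triplet at the end of the sequence is not counted.
    """
    best_start, best_count = 0, 0
    for start in range(len(seq)):
        count = 0
        pos = start
        while pos + 3 <= len(seq) and seq[pos] == "G":
            count += 1
            pos += 3
        if count > best_count:
            best_start, best_count = start, count
    return best_start, best_count


# A short stretch taken from the collagen portion of the fusion protein sequences.
example = "GPRGPTGPIGPPGPAGQPGDKGEGGAPGLPG"
start, triplets = longest_gxy_run(example)
```

Trying every start index matters when X or Y is itself Gly: in `"AGGPPGPP"` the longest run begins at index 2, not at the first Gly.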
The C-terminal amino acid of the GLP-1 analog portion in the present invention is preferably fused to the N-terminus of the human COL3A1 fragment through a glycine-rich linker peptide [Gly-Gly-Gly-Gly-Ser (SEQ ID NO.:5)]n-Ala. Adding the peptide linker can prevent potential mutual interference between the two domains and improve the stability of the heterologous fusion protein. In addition, the glycine-rich linker provides some structural flexibility, allowing the GLP-1 analog portion to interact effectively with GLP-1 receptor molecules on target cells such as pancreatic β cells, which favors its biological activity. In the linker peptide [Gly-Gly-Gly-Gly-Ser (SEQ ID NO.:5)]n-Ala of the present invention, the repeat number n ≥ 2, but an overly long linker peptide is detrimental to the stability of the fusion protein and may increase potential immunogenicity. Therefore, the preferred linker peptide comprises the sequence:
Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Ala(SEQ ID NO.:7)
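The linker formula [Gly-Gly-Gly-Gly-Ser]n-Ala can be written as a small helper. The Python sketch below is illustrative only: the range check uses the n = 2-6 range stated for the linker, and the function names and the toy arguments in the usage note are hypothetical.

```python
LINKER_UNIT = "GGGGS"  # Gly-Gly-Gly-Gly-Ser, SEQ ID NO.:5 in one-letter code


def build_linker(n):
    """Return [GGGGS]n followed by a single Ala, for n in the stated range 2-6."""
    if not 2 <= n <= 6:
        raise ValueError("n is expected to be between 2 and 6")
    return LINKER_UNIT * n + "A"


def fuse(glp1_analog, col_fragment, n=2):
    """Concatenate A-L-B as in Formula I: GLP-1 analog, linker, COL3A1 part."""
    return glp1_analog + build_linker(n) + col_fragment
```

`build_linker(2)` gives `"GGGGSGGGGSA"`, the one-letter form of SEQ ID NO.:7; `fuse("HG", "GP")` shows the A-L-B order on toy inputs.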
Therefore, preferred GLP-1-COL3A1 heterologous fusion proteins of the present invention include the following proteins:
(a)GLP-1-2L-COL598-896(SEQ ID NO.:8)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAGPGGPGPQGPPGKNGETGPQGPP
GPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGG
AGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAG
QPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGKDGTS
GHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGG
(b)GLP-1-2L-COL733-896(SEQ ID NO.:9)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAGLGSPGPKGDKGEPGGPGADGVP
GKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGE
KGEGGPPGVAGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGG
(c)GLP-1-2L-COL(SEQ ID NO.:13)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAAMMSFVQKGSWLLLALLHPTI
ILAQQEAVEGGCSHLGQSYADRDVWKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTR
PPNGQGPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPG
PAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIK
GPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGS
DGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPG
APGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAA
GERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGE
SGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPG
PQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPP
GPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGI
AGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGKDGTSGHPGPIGPPGPRGNRG
ERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIES
LISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKK
HVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFK
AEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL
The nomenclature used herein for specific heterologous fusion proteins is defined as follows. The GLP-1 portion of the fusion protein refers specifically to an analog of mature GLP-1 (7-37) in which Ala at position 8 is mutated to Gly, Gly at position 22 is mutated to Glu, and Gly at position 37 is removed. L refers to a linker having the sequence [Gly-Gly-Gly-Gly-Ser (SEQ ID NO.:5)]n-Ala. The number immediately preceding L is the repeat number n in the linker peptide. The linker peptide designated 2L has the sequence Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Ala (SEQ ID NO.:7). The COL3A1 fragment of the fusion protein is abbreviated as COL, and the start and end positions of its amino acid sequence are indicated by residue numbers. COL598-896 indicates that the COL3A1 portion of the mature fusion protein begins with Gly at position 598 and ends with Gly at position 896; COL733-896 indicates that the COL3A1 portion of the mature fusion protein begins with Gly at position 733 and ends with Gly at position 896.
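The naming scheme is regular enough to parse mechanically. The Python sketch below is illustrative only and assumes names are written in the flattened form used in this text (for example `GLP-1-2L-COL598-896`); the class and function names are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FusionName(NamedTuple):
    linker_repeats: int        # the number in front of "L"
    col_start: Optional[int]   # first COL3A1 residue, None for full length
    col_end: Optional[int]     # last COL3A1 residue, None for full length


_NAME = re.compile(r"GLP-1-(\d+)L-COL(?:(\d+)-(\d+))?")


def parse_fusion_name(name):
    """Split a name such as 'GLP-1-2L-COL598-896' into its components."""
    match = _NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognised fusion protein name: {name!r}")
    repeats, start, end = match.groups()
    return FusionName(
        int(repeats),
        int(start) if start else None,
        int(end) if end else None,
    )


def numbered_span(parsed):
    """Number of residues implied by the residue numbering, if a range is given."""
    if parsed.col_start is None:
        return None
    return parsed.col_end - parsed.col_start + 1
```

Under this reading, the numbering 598-896 spans 299 residue positions and 733-896 spans 164; these figures follow from the numbering alone and are not counted from the sequence listings.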
The present invention relates to a fusion protein in which a human GLP-1 analog is combined with a human type III collagen α1 chain fragment. This fusion protein differs from dulaglutide in that the Fc segment of IgG used in dulaglutide may have an ADCC (antibody-dependent cell-mediated cytotoxicity) effect, with potential immunogenicity and side reactions. In contrast, collagen is the most abundant protein in the body, accounting for about 25%-33% of total protein. It is widely present in human bone, tendon, cartilage, skin and other connective tissues, is the main component of the extracellular matrix (ECM), and has good biocompatibility and bioabsorbability. Type III collagen accounts for only about 10% of total collagen and is mainly found in blood vessels. In terms of molecular structure, type III collagen is composed of parallel linear chains; each is a very strong right-handed triple helix formed by three left-handed twisted α1 chains held tightly together by interchain interactions. Each type III collagen α1 chain consists of more than 300 Gly-X-Y triplet repeats, and this structure is the key to homotrimer formation by the type III collagen α1 chain.
Nucleic acid coding sequence
The present invention also includes polynucleotides encoding the heterologous fusion protein of the present invention, as well as vectors and host cells comprising these polynucleotides. The present invention also includes a method of treating patients with non-insulin-dependent diabetes mellitus, obesity and various other diseases and conditions, comprising administering the heterologous fusion protein discussed herein.
The present invention further relates to polynucleotides encoding the fusion protein according to the present invention.
In a preferred embodiment of the present invention, the nucleotide sequence is shown in SEQ ID NO.:10 or 11.
The polynucleotides of the present invention may be in DNA or RNA form. DNA forms include cDNA, genomic DNA and synthetic DNA. The DNA may be single-stranded or double-stranded, and may be the coding strand or the non-coding strand. The coding region sequence encoding the mature polypeptide may be identical to the sequence encoding the polypeptide shown in SEQ ID NO.:8, 9 or 13, or may be a degenerate variant. As used herein, a "degenerate variant" refers to a nucleic acid sequence that encodes the polypeptide shown in SEQ ID NO.:8, 9 or 13 but differs from the corresponding coding region sequence.
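To make the notion of a degenerate variant concrete, the following Python sketch translates the first 90 nucleotides of SEQ ID NO.:10 (the part encoding the GLP-1 analog) with the standard genetic code, and compares it with an alternative coding in which the first two codons are swapped for synonymous ones. The alternative sequence is a hypothetical example constructed here, not a sequence from the patent.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 90 nt of SEQ ID NO.:10 as given in this description.
SEQ10_PREFIX = (
    "CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAA"
    "CAAGCTGCTAAGGAATTCATTGCCTGGCTGGTCAAAGGCAGA"
)

# Hypothetical degenerate variant: CAC -> CAT (His) and GGT -> GGA (Gly).
ALT_PREFIX = "CATGGA" + SEQ10_PREFIX[6:]

same_protein = translate(SEQ10_PREFIX) == translate(ALT_PREFIX)
```

Both coding sequences translate to the same 30-residue GLP-1 analog even though the nucleotide sequences differ, which is the sense in which one is a degenerate variant of the other.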
The full-length nucleotide sequence encoding the polypeptide of the present invention, or a fragment thereof, can usually be obtained by PCR amplification, recombinant methods or artificial synthesis. For PCR amplification, primers can be designed according to published nucleotide sequences, especially the open reading frame sequence, and the relevant sequence can be amplified using a commercially available cDNA library, or a cDNA library prepared by conventional methods known to those skilled in the art, as the template. When the sequence is long, two or more rounds of PCR amplification are often needed, after which the amplified fragments are spliced together in the correct order. At present, the DNA sequence encoding the polypeptide of the present invention (or a fragment or derivative thereof) can be obtained entirely by chemical synthesis. The DNA sequence can then be introduced into various existing DNA molecules (such as vectors) and cells known in the art.
The present invention also relates to vectors comprising the polynucleotide of the present invention, and host cells genetically engineered with the vector or polypeptide coding sequence of the present invention. The above polynucleotides, vectors or host cells may be isolated.
As used herein, "isolated" means that a substance has been separated from its original environment (if it is a natural substance, the original environment is the natural environment). For example, polynucleotides and polypeptides in their natural state in living cells are not isolated or purified, but the same polynucleotides or polypeptides are isolated and purified if they are separated from other substances that accompany them in the natural state.
Once the relevant sequence has been obtained, it can be produced in large quantities by recombinant methods. This is usually done by cloning it into a vector, transferring the vector into cells, and then isolating the relevant sequence from the propagated host cells by conventional methods.
In addition, the relevant sequence can also be synthesized artificially, especially when the fragment is short. Generally, a long sequence can be obtained by first synthesizing multiple small fragments and then ligating them.
The method of amplifying DNA/RNA by PCR is preferably used to obtain the gene of the present invention. Primers for PCR can be appropriately selected according to the sequence information disclosed herein and synthesized by conventional methods. The amplified DNA/RNA fragments can be separated and purified by conventional methods such as gel electrophoresis.
The present invention also relates to vectors comprising the polynucleotide of the present invention, host cells genetically engineered with the vector or protein coding sequence of the present invention, and methods for expressing the fusion protein of the present invention in such host cells by recombinant techniques.
Host cells expressing the fusion protein of the present invention are obtained from the polynucleotide sequence of the present invention by conventional recombinant DNA techniques. This generally comprises the step of introducing the polynucleotide of the third aspect of the present invention or the vector of the fourth aspect of the present invention into a host cell.
Methods well known to those skilled in the art can be used to construct expression vectors containing the DNA sequence encoding the protein of the present invention and suitable transcription/translation control signals. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombination techniques and the like. The DNA sequence can be operably linked to an appropriate promoter in the expression vector to direct mRNA synthesis. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
In addition, the expression vector preferably contains one or more selectable marker genes to provide phenotypic traits for selecting transformed host cells, such as dihydrofolate reductase, neomycin resistance and green fluorescent protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for Escherichia coli.
A vector comprising the appropriate DNA sequence described above and an appropriate promoter or control sequence can be used to transform a suitable host cell so that it can express the protein.
The host cell may be a prokaryotic cell, such as a bacterial cell; a lower eukaryotic cell, such as a yeast cell; or a higher eukaryotic cell, such as a mammalian cell. Representative examples include: bacterial cells of Escherichia coli, Bacillus subtilis and Streptomyces; fungal cells such as Pichia pastoris and Saccharomyces cerevisiae cells; plant cells; insect cells such as Drosophila S2 or Sf9; and animal cells such as CHO, NS0, COS7 or 293 cells. In another preferred embodiment, the host cell is Pichia pastoris.
Transformation of host cells with recombinant DNA can be carried out by conventional techniques well known to those skilled in the art. When the host is a prokaryote such as Escherichia coli, competent cells capable of taking up DNA can be harvested after the exponential growth phase and treated by the CaCl2 method, using procedures well known in the art. Another method uses MgCl2. If desired, transformation can also be performed by electroporation. When the host is a eukaryote, the following DNA transfection methods can be used: calcium phosphate co-precipitation, conventional mechanical methods such as microinjection, electroporation, liposome packaging and the like.
The transformants obtained can be cultured by conventional methods to express the protein encoded by the gene of the present invention. Depending on the host cell used, the culture medium can be selected from various conventional media. Culture is carried out under conditions suitable for host cell growth. After the host cells have grown to an appropriate cell density, the selected promoter is induced by a suitable method (such as a temperature shift or chemical induction), and the cells are cultured for a further period.
The protein in the above methods can be expressed inside the cell, on the cell membrane, or secreted outside the cell. If desired, the protein can be separated and purified by various separation methods that exploit its physical, chemical and other properties. These methods are well known to those skilled in the art. Examples include, but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (salting out), centrifugation, osmotic lysis, sonication, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC), various other liquid chromatography techniques, and combinations of these methods.
The DNA encoding the GLP-1 analog of the invention can be generated by a variety of methods. Primers can
be designed based on the native sequence to generate the DNA encoding the GLP-1 analog described herein,
and the DNA encoding wild-type GLP-1 can be mutated either before ligation or within the cDNA encoding the
entire fusion protein. A full-length wild-type sequence cloned from a specific library can generally serve
as the template for generating the COL3A1 fragment of the invention, and the DNA encoding the COL3A1
fragment described herein can be generated by designing primers. Using PCR and primer design, the gene
encoding the GLP-1 analog and the gene encoding the COL3A1 analog protein can be joined in frame,
including through DNA encoding a G-rich linker peptide. Chemical synthesis of the complete sequence is
also feasible. PCR can be used to generate the fragment with primers designed to hybridize to the
sequences corresponding to the desired ends of the COL3A1 fragment. PCR primers can also be designed to
introduce restriction sites that facilitate cloning into an expression vector.
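As a rough illustration of the last point, the sketch below appends restriction sites to a primer pair. It assumes the XhoI (CTCGAG) and NotI (GCGGCCGC) sites and the TAA stop codon used in Embodiment 1. The 20-nt annealing length, the `ATAT` 5' clamp and the function names are hypothetical choices made only for this example, and the toy template is just the first 90 nt of SEQ ID NO.:10, not a full COL3A1 template.

```python
XHOI_SITE = "CTCGAG"    # XhoI recognition sequence
NOTI_SITE = "GCGGCCGC"  # NotI recognition sequence
STOP_CODON = "TAA"

_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def design_primers(orf, anneal_len=20, clamp="ATAT"):
    """Hypothetical helper: build a forward primer carrying an XhoI site and
    a reverse primer carrying a stop codon followed by a NotI site."""
    orf = orf.upper()
    # Forward primer: clamp + XhoI site + first bases of the coding strand.
    forward = clamp + XHOI_SITE + orf[:anneal_len]
    # Reverse primer: clamp + NotI site + reverse complement of
    # (last bases of the coding strand + stop codon).
    reverse = clamp + NOTI_SITE + reverse_complement(orf[-anneal_len:] + STOP_CODON)
    return forward, reverse


# First 90 nt of SEQ ID NO.:10 (GLP-1 analog coding region), used as a toy template.
TEMPLATE = (
    "CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAG"
    "GAATTCATTGCCTGGCTGGTCAAAGGCAGA"
)

forward_primer, reverse_primer = design_primers(TEMPLATE)
print("forward:", forward_primer)
print("reverse:", reverse_primer)
```

A real design would also check melting temperatures, the reading frame relative to the vector's signal sequence, and the absence of internal XhoI or NotI sites in the insert; none of that is attempted here.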
SEQ ID NO.:10 provides a preferred DNA sequence encoding one of the preferred heterologous fusion
proteins of the invention, GLP-1-2L-COL598-896:
CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAGGAATTCATTGCCTGGCT
GGTCAAAGGCAGAGGAGGTGGCGGATCCGGTGGCGGTGGGTCCGGAGGAGGTGGTTCAGCTGGTCCAGGTGGTCCAG
GTCCTCAAGGTCCTCCAGGTAAGAATGGTGAAACTGGTCCTCAGGGACCTCCAGGCCCAACCGGTCCTGGAGGTGAT
AAGGGTGATACCGGACCACCTGGCCCACAAGGCTTGCAGGGTCTGCCAGGTACAGGGGGTCCACCCGGTGAAAACGG
CAAGCCTGGTGAACCAGGCCCAAAAGGTGACGCTGGAGCTCCAGGAGCCCCAGGAGGTAAGGGTGATGCTGGTGCCC
CCGGTGAGAGAGGCCCACCAGGTTTGGCCGGTGCTCCCGGTCTGAGAGGGGGAGCTGGTCCACCAGGACCTGAAGGC
GGAAAAGGTGCTGCTGGTCCACCTGGACCACCTGGTGCTGCCGGAACTCCAGGACTGCAGGGAATGCCTGGTGAAAG
AGGCGGATTGGGATCTCCTGGCCCAAAAGGAGACAAGGGAGAGCCTGGTGGACCAGGGGCAGATGGAGTTCCTGGAA
AAGATGGTCCTCGTGGTCCAACAGGACCTATCGGTCCCCCAGGACCTGCTGGTCAACCTGGAGATAAAGGTGAAGGC
GGGGCTCCAGGATTGCCTGGTATTGCCGGCCCTAGAGGTTCTCCCGGTGAAAGAGGTGAGACCGGCCCACCTGGTCC
AGCTGGCTTCCCTGGAGCACCAGGTCAGAATGGTGAGCCAGGTGGTAAGGGTGAGAGAGGAGCTCCAGGTGAGAAGG
GGGAAGGTGGTCCACCTGGTGTTGCTGGTCCACCAGGTAAGGATGGTACATCCGGTCATCCTGGACCAATTGGACCT
CCAGGGCCTAGAGGTAACAGGGGTGAAAGGGGATCTGAAGGATCTCCTGGACATCCAGGTCAGCCCGGTCCTCCTGG
TCCACCCGGAGCTCCTGGGCCATGCTGTGGTGGC(SEQ ID NO.:10)
SEQ ID NO.:11 provides a preferred DNA sequence encoding one of the preferred heterologous fusion
proteins of the invention, GLP-1-2L-COL733-896:
CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAGGAATTCATTGCCTGGCT
GGTCAAAGGCAGAGGAGGTGGCGGATCCGGTGGCGGTGGGTCCGGAGGAGGTGGTTCAGCTGGTTTGGGATCTCCTG
GCCCAAAAGGAGACAAGGGAGAGCCTGGTGGACCAGGGGCAGATGGAGTTCCTGGAAAAGATGGTCCTCGTGGTCCA
ACAGGACCTATCGGTCCCCCAGGACCTGCTGGTCAACCTGGAGATAAAGGTGAAGGCGGGGCTCCAGGATTGCCTGG
TATTGCCGGCCCTAGAGGTTCTCCCGGTGAAAGAGGTGAGACCGGCCCACCTGGTCCAGCTGGCTTCCCTGGAGCAC
CAGGTCAGAATGGTGAGCCAGGTGGTAAGGGTGAGAGAGGAGCTCCAGGTGAGAAGGGGGAAGGTGGTCCACCTGGT
GTTGCTGGTCCACCAGGTAAGGATGGTACATCCGGTCATCCTGGACCAATTGGACCTCCAGGGCCTAGAGGTAACAG
GGGTGAAAGGGGATCTGAAGGATCTCCTGGACATCCAGGTCAGCCCGGTCCTCCTGGTCCACCCGGAGCTCCTGGGC
CATGCTGTGGTGGC (SEQ ID NO.:11)
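The relationship between these DNA sequences and the fusion proteins can be checked by translation. The following is a minimal sketch that assumes the standard genetic code and translates only the first 138 nt shared by SEQ ID NO.:10 and SEQ ID NO.:11, split here into the GLP-1 analog region (nt 1-90) and the linker region (nt 91-138). The variable and function names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence in frame 1 into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# nt 1-90 of SEQ ID NO.:10 / SEQ ID NO.:11
GLP1_ANALOG_DNA = (
    "CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAG"
    "GAATTCATTGCCTGGCTGGTCAAAGGCAGA"
)
# nt 91-138 of SEQ ID NO.:10 / SEQ ID NO.:11
LINKER_DNA = "GGAGGTGGCGGATCCGGTGGCGGTGGGTCCGGAGGAGGTGGTTCAGCT"

glp1_analog_protein = translate(GLP1_ANALOG_DNA)
linker_protein = translate(LINKER_DNA)
print(glp1_analog_protein)  # should match SEQ ID NO.:12
print(linker_protein)       # should match residues 31-46 of SEQ ID NO.:8
```

The same `translate` function can be applied to the full 1035-nt and 630-nt sequences to compare them with SEQ ID NO.:8 and SEQ ID NO.:9.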
Expression vector and host cell
The present invention also provides an expression vector for the fusion protein of the invention.
The host cell used to clone or express the nucleic acid of the invention can be a prokaryotic cell; more
preferred host cells include yeast or higher eukaryotic cells. After the fusion protein gene is expressed
in the host cell, the fusion protein is isolated and purified from the expression product and can be used
to prepare therapeutic agents for diabetes and related diseases. The related diseases include type II
diabetes, type I diabetes, obesity, major cardiovascular events in patients with type II diabetes, and
other serious complications.
Compared with the prior art, the main advantages of the present invention are as follows:
With the aims of retaining the hypoglycemic activity of GLP-1 and significantly extending its in vivo
half-life, the GLP-1-COL3A1 fusion protein of the invention was designed as a novel molecule distinct
from other GLP-1 analog drugs. It has GLP-1R agonist activity, forms stable multimers, and has an
extended in vivo half-life.
(a) Relative to native GLP-1(7-37), the GLP-1 analog portion of the fusion protein of the invention
contains three modifications, at positions 8, 22 and 36. Endogenous dipeptidyl peptidase 4 (DPP-4)
cleaves native GLP-1 between Ala at position 8 and Glu at position 9, producing the inactive GLP-1(9-37)
fragment; replacing position 8 with Gly reduces the sensitivity of the GLP-1 analog to DPP-4 hydrolysis.
The substitution at position 22 improves the activity of the GLP-1 analog. Removal of position 37 reduces
the immunogenicity of the fusion protein. The fusion protein of the invention therefore has low
sensitivity to DPP-4 hydrolysis, high activity, and low immunogenicity.
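The modifications can be read directly from the sequence listing. The sketch below compares SEQ ID NO.:6, whose sequence is that of native GLP-1(7-37), with SEQ ID NO.:12, the GLP-1 analog portion of the fusion proteins. It assumes a simple position-by-position comparison with no internal insertions or deletions and uses the conventional numbering in which the first residue is position 7. The function name is illustrative.

```python
NATIVE_GLP1_7_37 = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG"  # SEQ ID NO.:6 (31 residues)
GLP1_ANALOG = "HGEGTFTSDVSSYLEEQAAKEFIAWLVKGR"       # SEQ ID NO.:12 (30 residues)
FIRST_POSITION = 7  # GLP-1(7-37) numbering starts at His7


def list_modifications(native, analog, first_position=FIRST_POSITION):
    """Return (position, native residue, analog residue) for each difference.

    A residue missing from the C-terminus of the analog is reported as "-".
    """
    changes = []
    for index, native_residue in enumerate(native):
        analog_residue = analog[index] if index < len(analog) else "-"
        if analog_residue != native_residue:
            changes.append((first_position + index, native_residue, analog_residue))
    return changes


modifications = list_modifications(NATIVE_GLP1_7_37, GLP1_ANALOG)
for position, old, new in modifications:
    print(f"position {position}: {old} -> {new}")
```

For these two sequences the comparison reports Ala to Gly at position 8, Gly to Glu at position 22, and loss of the C-terminal Gly at position 37.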
(b) The fusion protein of the invention contains the human type III collagen α1 chain. For the first
time, a COL3A1 fragment is combined with a GLP-1 analog to construct a long-acting GLP-1 analog that has
good biocompatibility, can form oligomers, and is easy to express. Because the preferred COL3A1 fragments
favor expression, the constructed fusion protein is suitable for the Pichia pastoris expression system,
whose production cost is lower than that of the higher eukaryotic cell systems used for other GLP-1
drugs. With further optimization and scale-up of the preparation process, a less expensive long-acting
therapeutic agent for type 2 diabetes is expected. Moreover, while retaining the biological activity of
GLP-1, the fusion protein gains a significantly extended in vivo half-life through the ability of the α1
chain to form trimers.
(c) To avoid the difficulty in recombinant expression caused by an overly long amino acid sequence, the
invention prefers two human COL3A1 fragments (COL3A1(598-896) and COL3A1(733-896)), whose sequences are
shown in SEQ ID NO.:3 and SEQ ID NO.:4, respectively. The two fragments contain Gly-X-Y triplet domains
of different lengths. They retain the ability of human COL3A1 to form homotrimers while favoring
recombinant expression of the heterologous fusion protein of the invention, and the fusion protein of the
invention is highly stable.
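The Gly-X-Y repeat mentioned in (c) can be illustrated with a short check. The sketch below takes the first 34 residues of the COL3A1 733-896 portion as it appears in SEQ ID NO.:9 (residues 47-80) and, for each of the three possible frames, computes the fraction of every-third positions occupied by Gly. The function name and the choice of this short window are assumptions made for illustration only.

```python
# Residues 47-80 of SEQ ID NO.:9 (start of the COL3A1 733-896 portion).
COLLAGEN_WINDOW = "GLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGP"


def gly_fraction_by_frame(seq):
    """For frames 0, 1 and 2, return the fraction of every-third residues that are Gly."""
    fractions = []
    for frame in range(3):
        every_third = seq[frame::3]
        fractions.append(every_third.count("G") / len(every_third))
    return fractions


fractions = gly_fraction_by_frame(COLLAGEN_WINDOW)
best_frame = fractions.index(max(fractions))
print("Gly fraction per frame:", fractions)
print("frame with Gly at every third residue:", best_frame)
```

In this window one frame has Gly at every third position, which is the signature of a collagen-like Gly-X-Y domain; the other two frames show Gly only occasionally.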
The present invention is further described below with reference to specific embodiments. It should be
understood that these embodiments are intended only to illustrate the invention and not to limit its
scope. Experimental methods in the following embodiments for which no specific conditions are given
generally follow conventional conditions, such as those described in Sambrook et al., Molecular Cloning:
A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or the conditions recommended
by the manufacturer. Unless otherwise stated, percentages and parts are by weight.
Embodiment 1: Construction of DNA encoding the GLP-1-COL3A1 fusion protein
The gene encoding the fusion protein GLP-1-2L-COL598-896 of the invention (SEQ ID NO:8) was synthesized
by Nanjing GenScript Biotech Co., Ltd. and cloned into the pUC57 plasmid. The 5' end of the fusion gene
contains an XhoI restriction site, and the 3' end contains a TAA stop codon and a NotI restriction site.
This pUC57 plasmid was named pUC57-GLP-COL-1.
pUC57-GLP-COL-1 was double-digested with the restriction endonucleases XhoI and NotI (purchased from
Fermentas) according to the manufacturer's instructions. The resulting gene fragment of about 1050 bp
encoding the GLP-1-2L-COL598-896 fusion protein was recovered from the gel (gel recovery kit purchased
from Axygen). In parallel, pPic9m was double-digested with XhoI and NotI, and the plasmid band of 9000 bp
was recovered from the gel.
The fusion protein gene fragment and the pPic9m plasmid fragment obtained from the above digestions were
ligated with T4 DNA ligase (purchased from Fermentas). The ligation product was transformed into
competent Escherichia coli DH5α cells by heat shock, and the transformation product was spread on LB
solid medium with kanamycin/chloramphenicol resistance. Single colonies were picked for gene sequencing
to confirm that the inserted gene sequence was correct, and the resulting plasmid was named
pPic9m-GLP-COL-1 (as shown in Fig. 1).
Similarly, an expression vector containing the gene encoding the GLP-1-2L-COL733-896 fusion protein was
constructed and named pPic9m-GLP-COL-2 (Fig. 2).
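The sizes of the two constructs can be cross-checked against the sequence listing with simple arithmetic. This sketch assumes a 30-residue GLP-1 analog (SEQ ID NO.:12), the 16-residue linker stretch found at residues 31-46 of SEQ ID NO.:8 and SEQ ID NO.:9, and COL3A1 fragments defined by inclusive residue ranges. The function names are illustrative.

```python
GLP1_ANALOG_LEN = 30  # residues in SEQ ID NO.:12
LINKER_LEN = 16       # residues 31-46 of SEQ ID NO.:8 and SEQ ID NO.:9


def fragment_length(start, end):
    """Number of residues in an inclusive residue range such as 598-896."""
    return end - start + 1


def fusion_sizes(start, end):
    """Return (protein length in residues, coding length in nucleotides, no stop codon)."""
    residues = GLP1_ANALOG_LEN + LINKER_LEN + fragment_length(start, end)
    return residues, residues * 3


sizes_598_896 = fusion_sizes(598, 896)
sizes_733_896 = fusion_sizes(733, 896)
print("GLP-1-2L-COL598-896:", sizes_598_896)
print("GLP-1-2L-COL733-896:", sizes_733_896)
```

The results agree with the lengths declared for SEQ ID NO.:8 and SEQ ID NO.:10 (345 residues, 1035 nt) and for SEQ ID NO.:9 and SEQ ID NO.:11 (210 residues, 630 nt). Adding the TAA stop codon and the flanking restriction sites is consistent with the roughly 1050 bp fragment recovered for the longer construct.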
Embodiment 2: Expression of the heterologous fusion proteins
The vector carrying the fusion protein gene from Embodiment 1 was extracted and introduced into Pichia
pastoris GS115 competent cells by electroporation. After recombinants were screened on nutrient-deficient
medium, high-copy recombinants were screened by G418 resistance. Recombinant Pichia pastoris cells
carrying the heterologous fusion protein gene and suitable for recombinant expression were finally
obtained.
After seed expansion, the recombinant Pichia pastoris cells were inoculated into a 5 L fermenter for
small-scale preparation. Fermentation lasted 5 days, with methanol used to induce expression of the
heterologous fusion protein for 36 h. After fermentation, the fermentation broth was collected for
protein purification.
Table 1: Composition of the Pichia pastoris GS115 seed expansion medium
Component | Content |
Yeast extract | 10.0 g/L |
Peptone | 20.0 g/L |
KH2PO4 | 11.8 g/L |
K2HPO4 | 3.0 g/L |
Glycerol | 10.0 mL/L |
Table 2: Composition of the Pichia pastoris GS115 fermentation medium
Component | Content |
YNB | 0.67 g/L |
CaCl2 | 0.4 g/L |
K2SO4 | 10.0 g/L |
MgSO4·7H2O | 8.0 g/L |
(NH4)2SO4 | 8.0 g/L |
Citric acid | 5.0 g/L |
K2HPO4·3H2O | 18.0 g/L |
Glycerol | 40.0 mL/L |
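For bench use, the per-litre recipe in Table 2 can be scaled to a batch volume. The sketch below is a simple helper, with the 4 L volume taken from the purification step in Embodiment 3. The dictionary layout and function name are assumptions made for illustration.

```python
# Fermentation medium from Table 2: amount per litre and unit.
FERMENTATION_MEDIUM = {
    "YNB": (0.67, "g"),
    "CaCl2": (0.4, "g"),
    "K2SO4": (10.0, "g"),
    "MgSO4.7H2O": (8.0, "g"),
    "(NH4)2SO4": (8.0, "g"),
    "Citric acid": (5.0, "g"),
    "K2HPO4.3H2O": (18.0, "g"),
    "Glycerol": (40.0, "mL"),
}


def batch_amounts(recipe, volume_l):
    """Scale a per-litre recipe to the given volume in litres."""
    return {
        name: (round(per_litre * volume_l, 3), unit)
        for name, (per_litre, unit) in recipe.items()
    }


batch = batch_amounts(FERMENTATION_MEDIUM, 4.0)
for name, (amount, unit) in batch.items():
    print(f"{name}: {amount} {unit}")
```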
Embodiment 3: Purification of the heterologous fusion proteins
The two preferred heterologous fusion proteins, GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896, were
purified by similar steps.
The supernatant of 4 L of fermentation broth was collected with a PALL 0.2 μm hollow-fiber filtration
system, yielding about 4 L of supernatant. The supernatant was then ultrafiltered with a PALL
ultrafiltration system fitted with a 50 kDa membrane to remove some of the contaminating proteins. The
ultrafiltered sample was finally purified by Source 30Q anion-exchange chromatography, eluting with a
linear gradient of 0-500 mM NaCl, to obtain the purified heterologous fusion protein. The purity and
molecular weight of the heterologous fusion protein were determined by SDS-PAGE; purity was > 90% and the
molecular weight was consistent with expectation (Fig. 3).
Embodiment 4: Bioactivity and pharmacokinetics of the heterologous fusion proteins
Embodiment 4a: Bioactivity study of the fusion proteins
The activity of the heterologous fusion proteins was determined using human osteosarcoma U2OS cells
stably transfected with GLP-1R. Binding of the GLP-1 analog portion of the heterologous fusion protein to
the GLP-1R receptor on U2OS cells stimulates the cells to produce cAMP, so the GLP-1 activity of the
fusion protein was characterized by measuring cAMP with a cAMP enzyme-linked assay. U2OS cells were
seeded into 96-well plates at 1.2 × 10^5 cells/well and cultured in DMEM containing 10% FBS at 37 °C,
5% CO2 for 24 h. The medium was removed and basal medium was added overnight. The basal medium was then
removed and test samples at various concentrations were added, including the two fusion proteins
GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896 and a synthetic GLP-1(7-37) reference, followed by incubation
at 37 °C, 5% CO2 for 0.5 h. Cellular cAMP content was measured with a cAMP kit (purchased from R&D); the
results are shown in Fig. 4.
The results in Fig. 4 show that the cAMP produced by U2OS cells stimulated with GLP-1-2L-COL598-896 or
GLP-1-2L-COL733-896 exhibits a clear dose dependence and is comparable to that produced with GLP-1,
indicating that the GLP-1-2L-COL598-896 and GLP-1-2L-COL733-896 fusion proteins have GLP-1R activation
activity similar to that of GLP-1.
Embodiment 4b: Pharmacokinetic study of the fusion proteins
Male SD rats were used in the pharmacokinetic study, with GLP-1-2L-COL598-896, GLP-1-2L-COL733-896 and
chemically synthesized GLP-1(7-37) control groups, 8 rats per group. Each animal received an intravenous
injection at a dose of 1 mg/kg, and blood samples were collected before injection and at the following
time points after injection: 0 h, 0.5 h, 1 h, 2 h, 4 h, 6 h, 10 h, 24 h, 2 d, 4 d, 6 d, 8 d, 10 d, 14 d
and 21 d. The collected serum was stored at -80 °C. The amount of fusion protein in the serum was
measured with a GLP-1 kit (Fig. 5).
The results in Fig. 5 show that the fusion proteins of the invention significantly extend the in vivo
circulating half-life of the GLP-1 analog.
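A circulating half-life of this kind is commonly estimated from the slope of log concentration against time. The sketch below shows one such calculation under a mono-exponential (single-compartment) assumption, which the text does not state. The text reports no numerical concentrations, so the sampling times are those of Embodiment 4b, while the concentration values are a synthetic curve generated in the code from an arbitrary 48 h half-life, purely to show that the fit recovers it.

```python
import math


def estimate_half_life(times_h, concentrations):
    """Estimate half-life (hours) from a least-squares fit of ln(C) against time."""
    log_conc = [math.log(c) for c in concentrations]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(log_conc) / n
    s_tt = sum((t - mean_t) ** 2 for t in times_h)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, log_conc))
    slope = s_ty / s_tt  # equals minus the elimination rate constant
    return math.log(2) / -slope


# Post-injection sampling times from Embodiment 4b, converted to hours.
SAMPLING_TIMES_H = [0, 0.5, 1, 2, 4, 6, 10, 24, 48, 96, 144, 192, 240, 336, 504]

# Synthetic, hypothetical concentrations (arbitrary units), NOT measured data.
ASSUMED_HALF_LIFE_H = 48.0
synthetic_conc = [
    100.0 * math.exp(-math.log(2) * t / ASSUMED_HALF_LIFE_H) for t in SAMPLING_TIMES_H
]

estimated = estimate_half_life(SAMPLING_TIMES_H, synthetic_conc)
print(f"estimated half-life: {estimated:.2f} h")
```

With real serum data the terminal phase would normally be selected before fitting, and values below the assay's limit of quantification would be excluded.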
The above are only preferred embodiments of the invention and do not limit the invention in any form. It
should be noted that improvements and additions made to the invention by those of ordinary skill in the
art should also be regarded as falling within the scope of protection of the invention.
All documents mentioned in the present invention are incorporated herein by reference, as if each
document were individually incorporated by reference. In addition, it should be understood that, after
reading the above teachings of the invention, those skilled in the art may make various changes or
modifications to the invention, and such equivalent forms likewise fall within the scope defined by the
claims appended to this application.
Sequence listing
<110>Shanghai Hui Dun Bioisystech Co., Ltd
<120>a kind of GLP-1 analog-COL3A1 fusion protein
<130> P2018-0197
<160> 13
<170> PatentIn version 3.5
<210> 1
<211> 31
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 1
His Xaa Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Xaa
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Xaa
20 25 30
<210> 2
<211> 1163
<212> PRT
<213>homo sapiens (Homo sapiens)
<400> 2
Met Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu
1 5 10 15
His Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys
20 25 30
Ser His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu
35 40 45
Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp
50 55 60
Ile Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro
65 70 75 80
Phe Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr
85 90 95
Arg Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly
100 105 110
Pro Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln
115 120 125
Pro Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys
130 135 140
Pro Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val
145 150 155 160
Lys Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala
165 170 175
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly
180 185 190
Ser Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln
195 200 205
Ala Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser
210 215 220
Gly Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly
225 230 235 240
Glu Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile
245 250 255
Pro Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn
260 265 270
Gly Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly
275 280 285
Leu Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala
290 295 300
Pro Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg
305 310 315 320
Gly Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly
325 330 335
Pro Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu
340 345 350
Val Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg
355 360 365
Gly Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly
370 375 380
Pro Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro
385 390 395 400
Ala Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro
405 410 415
Gly Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly
420 425 430
Glu Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu
435 440 445
Arg Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp
450 455 460
Gly Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly
465 470 475 480
Ala Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro
485 490 495
Asn Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro
500 505 510
Gly Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly
515 520 525
Val Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly
530 535 540
Pro Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser
545 550 555 560
Gly Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly
565 570 575
Val Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys
580 585 590
Asn Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro
595 600 605
Gly Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly
610 615 620
Pro Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu
625 630 635 640
Gln Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro
645 650 655
Gly Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly
660 665 670
Gly Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu
675 680 685
Ala Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu
690 695 700
Gly Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly
705 710 715 720
Thr Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser
725 730 735
Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp
740 745 750
Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly
755 760 765
Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala
770 775 780
Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg
785 790 795 800
Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly
805 810 815
Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu
820 825 830
Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Lys Asp
835 840 845
Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly
850 855 860
Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro Gly Gln
865 870 875 880
Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly
885 890 895
Val Gly Ala Ala Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly
900 905 910
Phe Ala Pro Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr
915 920 925
Asp Glu Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser
930 935 940
Leu Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg
945 950 955 960
Asp Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp Val
965 970 975
Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe Cys Asn
980 985 990
Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu Asn Val Pro
995 1000 1005
Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys Lys His Val
1010 1015 1020
Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe Ser Tyr Gly
1025 1030 1035
Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val Gln Leu Ala Phe
1040 1045 1050
Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile Thr Tyr His
1055 1060 1065
Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser Gly Asn Val
1070 1075 1080
Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly Glu Phe Lys
1085 1090 1095
Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu Glu Asp Gly
1100 1105 1110
Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val Phe Glu Tyr
1115 1120 1125
Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp Ile Ala Pro
1130 1135 1140
Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val Asp Val Gly
1145 1150 1155
Pro Val Cys Phe Leu
1160
<210> 3
<211> 297
<212> PRT
<213>homo sapiens (Homo sapiens)
<400> 3
Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly Lys Asn Gly Glu
1 5 10 15
Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro Gly Gly Asp Lys
20 25 30
Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln Gly Leu Pro Gly
35 40 45
Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly Glu Pro Gly Pro
50 55 60
Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly Lys Gly Asp Ala
65 70 75 80
Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala Gly Ala Pro Gly
85 90 95
Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly Gly Lys Gly Ala
100 105 110
Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr Pro Gly Leu Gln
115 120 125
Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro Lys Gly Asp Lys
130 135 140
Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly Val Pro Gly Lys Asp Gly
145 150 155 160
Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Gln
165 170 175
Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro Gly Leu Pro Gly Ile Ala
180 185 190
Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Thr Gly Pro Pro Gly
195 200 205
Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln Asn Gly Glu Pro Gly Gly
210 215 220
Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys Gly Glu Gly Gly Pro Pro
225 230 235 240
Gly Val Ala Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly
245 250 255
Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly Ser
260 265 270
Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly Pro Pro
275 280 285
Gly Ala Pro Gly Pro Cys Cys Gly Gly
290 295
<210> 4
<211> 162
<212> PRT
<213>homo sapiens (Homo sapiens)
<400> 4
Gly Leu Gly Ser Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly
1 5 10 15
Ala Asp Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro
20 25 30
Ile Gly Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly
35 40 45
Gly Ala Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly
50 55 60
Glu Arg Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala
65 70 75 80
Pro Gly Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro
85 90 95
Gly Glu Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly
100 105 110
Lys Asp Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro
115 120 125
Arg Gly Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro
130 135 140
Gly Gln Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys
145 150 155 160
Gly Gly
<210> 5
<211> 5
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 5
Gly Gly Gly Gly Ser
1 5
<210> 6
<211> 31
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 6
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly
20 25 30
<210> 7
<211> 11
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 7
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala
1 5 10
<210> 8
<211> 345
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 8
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Pro
35 40 45
Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly Lys Asn Gly Glu Thr Gly
50 55 60
Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro Gly Gly Asp Lys Gly Asp
65 70 75 80
Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln Gly Leu Pro Gly Thr Gly
85 90 95
Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly Glu Pro Gly Pro Lys Gly
100 105 110
Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly Lys Gly Asp Ala Gly Ala
115 120 125
Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala Gly Ala Pro Gly Leu Arg
130 135 140
Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly Gly Lys Gly Ala Ala Gly
145 150 155 160
Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr Pro Gly Leu Gln Gly Met
165 170 175
Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro Gly Pro Lys Gly Asp Lys
180 185 190
Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly Val Pro Gly Lys Asp Gly
195 200 205
Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Gln
210 215 220
Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro Gly Leu Pro Gly Ile Ala
225 230 235 240
Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Thr Gly Pro Pro Gly
245 250 255
Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln Asn Gly Glu Pro Gly Gly
260 265 270
Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys Gly Glu Gly Gly Pro Pro
275 280 285
Gly Val Ala Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly
290 295 300
Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly Ser
305 310 315 320
Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly Pro Pro
325 330 335
Gly Ala Pro Gly Pro Cys Cys Gly Gly
340 345
<210> 9
<211> 210
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 9
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Leu
35 40 45
Gly Ser Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly
50 55 60
Ala Asp Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro
65 70 75 80
Ile Gly Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly
85 90 95
Gly Ala Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly
100 105 110
Glu Arg Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala
115 120 125
Pro Gly Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro
130 135 140
Gly Glu Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly
145 150 155 160
Lys Asp Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro
165 170 175
Arg Gly Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro
180 185 190
Gly Gln Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys
195 200 205
Gly Gly
210
<210> 10
<211> 1035
<212> DNA
<213>artificial sequence (artificial sequence)
<400> 10
cacggtgagg gtacttttac ctctgatgtt tcctcatact tggaagaaca agctgctaag 60
gaattcattg cctggctggt caaaggcaga ggaggtggcg gatccggtgg cggtgggtcc 120
ggaggaggtg gttcagctgg tccaggtggt ccaggtcctc aaggtcctcc aggtaagaat 180
ggtgaaactg gtcctcaggg acctccaggc ccaaccggtc ctggaggtga taagggtgat 240
accggaccac ctggcccaca aggcttgcag ggtctgccag gtacaggggg tccacccggt 300
gaaaacggca agcctggtga accaggccca aaaggtgacg ctggagctcc aggagcccca 360
ggaggtaagg gtgatgctgg tgcccccggt gagagaggcc caccaggttt ggccggtgct 420
cccggtctga gagggggagc tggtccacca ggacctgaag gcggaaaagg tgctgctggt 480
ccacctggac cacctggtgc tgccggaact ccaggactgc agggaatgcc tggtgaaaga 540
ggcggattgg gatctcctgg cccaaaagga gacaagggag agcctggtgg accaggggca 600
gatggagttc ctggaaaaga tggtcctcgt ggtccaacag gacctatcgg tcccccagga 660
cctgctggtc aacctggaga taaaggtgaa ggcggggctc caggattgcc tggtattgcc 720
ggccctagag gttctcccgg tgaaagaggt gagaccggcc cacctggtcc agctggcttc 780
cctggagcac caggtcagaa tggtgagcca ggtggtaagg gtgagagagg agctccaggt 840
gagaaggggg aaggtggtcc acctggtgtt gctggtccac caggtaagga tggtacatcc 900
ggtcatcctg gaccaattgg acctccaggg cctagaggta acaggggtga aaggggatct 960
gaaggatctc ctggacatcc aggtcagccc ggtcctcctg gtccacccgg agctcctggg 1020
ccatgctgtg gtggc 1035
<210> 11
<211> 630
<212> DNA
<213>artificial sequence (artificial sequence)
<400> 11
cacggtgagg gtacttttac ctctgatgtt tcctcatact tggaagaaca agctgctaag 60
gaattcattg cctggctggt caaaggcaga ggaggtggcg gatccggtgg cggtgggtcc 120
ggaggaggtg gttcagctgg tttgggatct cctggcccaa aaggagacaa gggagagcct 180
ggtggaccag gggcagatgg agttcctgga aaagatggtc ctcgtggtcc aacaggacct 240
atcggtcccc caggacctgc tggtcaacct ggagataaag gtgaaggcgg ggctccagga 300
ttgcctggta ttgccggccc tagaggttct cccggtgaaa gaggtgagac cggcccacct 360
ggtccagctg gcttccctgg agcaccaggt cagaatggtg agccaggtgg taagggtgag 420
agaggagctc caggtgagaa gggggaaggt ggtccacctg gtgttgctgg tccaccaggt 480
aaggatggta catccggtca tcctggacca attggacctc cagggcctag aggtaacagg 540
ggtgaaaggg gatctgaagg atctcctgga catccaggtc agcccggtcc tcctggtcca 600
cccggagctc ctgggccatg ctgtggtggc 630
<210> 12
<211> 30
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 12
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg
20 25 30
<210> 13
<211> 1210
<212> PRT
<213>artificial sequence (artificial sequence)
<400> 13
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ala Met
35 40 45
Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu His
50 55 60
Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys Ser
65 70 75 80
His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu Pro
85 90 95
Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp Ile
100 105 110
Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro Phe
115 120 125
Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr Arg
130 135 140
Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly Pro
145 150 155 160
Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln Pro
165 170 175
Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys Pro
180 185 190
Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val Lys
195 200 205
Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala Gly
210 215 220
Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly Ser
225 230 235 240
Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln Ala
245 250 255
Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser Gly
260 265 270
Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly Glu
275 280 285
Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile Pro
290 295 300
Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn Gly
305 310 315 320
Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly Leu
325 330 335
Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala Pro
340 345 350
Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg Gly
355 360 365
Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly Pro
370 375 380
Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu Val
385 390 395 400
Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg Gly
405 410 415
Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly Pro
420 425 430
Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro Ala
435 440 445
Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro Gly
450 455 460
Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Glu
465 470 475 480
Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu Arg
485 490 495
Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp Gly
500 505 510
Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly Ala
515 520 525
Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro Asn
530 535 540
Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly
545 550 555 560
Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly Val
565 570 575
Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly Pro
580 585 590
Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser Gly
595 600 605
Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly Val
610 615 620
Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys Asn
625 630 635 640
Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly
645 650 655
Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro
660 665 670
Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln
675 680 685
Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly
690 695 700
Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly
705 710 715 720
Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala
725 730 735
Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly
740 745 750
Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr
755 760 765
Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro
770 775 780
Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly
785 790 795 800
Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro
805 810 815
Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro
820 825 830
Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly
835 840 845
Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln
850 855 860
Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys
865 870 875 880
Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Lys Asp Gly
885 890 895
Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn
900 905 910
Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro Gly Gln Pro
915 920 925
Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly Val
930 935 940
Gly Ala Ala Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly Phe
945 950 955 960
Ala Pro Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr Asp
965 970 975
Glu Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser Leu
980 985 990
Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg Asp
995 1000 1005
Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp Val
1010 1015 1020
Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe Cys
1025 1030 1035
Asn Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu Asn
1040 1045 1050
Val Pro Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys Lys
1055 1060 1065
His Val Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe Ser
1070 1075 1080
Tyr Gly Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val Gln Leu
1085 1090 1095
Ala Phe Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile Thr
1100 1105 1110
Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser Gly
1115 1120 1125
Asn Val Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly Glu
1130 1135 1140
Phe Lys Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu Glu
1145 1150 1155
Asp Gly Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val Phe
1160 1165 1170
Glu Tyr Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp Ile
1175 1180 1185
Ala Pro Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val Asp
1190 1195 1200
Val Gly Pro Val Cys Phe Leu
1205 1210
Claims (10)
1. A fusion protein, characterized in that the fusion protein has the structure shown in Formula I:
A-L-B (I)
wherein A is a GLP-1 analog, B is a human type III collagen α1 chain, L is absent or a linker peptide, and each "-" is independently a linker peptide or a peptide bond; and
the GLP-1 analog is a polypeptide having the amino acid sequence shown in SEQ ID NO.: 1,
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
wherein Xaa8 is Gly or Ala, Xaa22 is Glu or Gly, and Xaa37 is Gly or absent.
2. The fusion protein of claim 1, characterized in that the GLP-1 analog has the amino acid sequence shown in SEQ ID NO.: 6 or 12.
3. The fusion protein of claim 1, characterized in that the human type III collagen α1 chain has the amino acid sequence shown in SEQ ID NO.: 2, 3 or 4.
4. The fusion protein of claim 1, characterized in that the fusion protein has the amino acid sequence shown in SEQ ID NO.: 8, 9 or 13.
5. An oligomer, characterized in that the oligomer comprises the fusion protein of claim 1.
6. An isolated polynucleotide, characterized in that the polynucleotide encodes the fusion protein of claim 1.
7. A vector, characterized in that the vector comprises the polynucleotide of claim 6.
8. A host cell, characterized in that the host cell contains the vector of claim 7, or has the exogenous polynucleotide of claim 6 integrated into its chromosome, or expresses the fusion protein of claim 1, or expresses the oligomer of claim 5.
9. A pharmaceutical composition, characterized in that the pharmaceutical composition comprises the fusion protein of claim 1 or the oligomer of claim 5, and a pharmaceutically acceptable carrier or excipient.
10. Use of the fusion protein of claim 1, the oligomer of claim 5, the polynucleotide of claim 6, the vector of claim 7, or the host cell of claim 8, characterized in that it is used for preparing a drug or preparation for preventing and/or treating diabetes.
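The variable positions in SEQ ID NO.: 1 (claim 1) define a small family of GLP-1 analogs, and Formula I joins one of them to the collagen chain through an optional linker. The following Python sketch enumerates that family and shows the A-L-B concatenation. It is an illustration only: the function names `enumerate_glp1_analogs` and `fuse`, and any linker or B sequence passed to `fuse`, are hypothetical and are not taken from the patent.

```python
from itertools import product

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID NO.: 1 as written in claim 1, with its three variable positions.
TEMPLATE = (
    "His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-"
    "Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37"
).split("-")

# Allowed residues per variable position; None means the position is absent.
OPTIONS = {
    "Xaa8": ("Gly", "Ala"),
    "Xaa22": ("Glu", "Gly"),
    "Xaa37": ("Gly", None),
}


def enumerate_glp1_analogs():
    """Return every one-letter sequence permitted by SEQ ID NO.: 1."""
    slots = [name for name in TEMPLATE if name in OPTIONS]
    variants = []
    for choice in product(*(OPTIONS[name] for name in slots)):
        picked = dict(zip(slots, choice))
        # Fixed residues map to themselves; variable ones take the chosen value.
        residues = [picked.get(name, name) for name in TEMPLATE]
        variants.append(
            "".join(THREE_TO_ONE[r] for r in residues if r is not None)
        )
    return variants


def fuse(a, b, linker=""):
    """Concatenate A-L-B as in Formula I; an empty linker means a direct peptide bond."""
    return a + linker + b


if __name__ == "__main__":
    for seq in enumerate_glp1_analogs():
        print(len(seq), seq)
```

Treating an absent Xaa37 as `None` keeps the template a single list, so the 30-residue and 31-residue analogs come out of the same loop.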
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810089639.XA CN110092835A (en) | 2018-01-30 | 2018-01-30 | A kind of GLP-1 analog-COL3A1 fusion protein |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110092835A (en) | 2019-08-06 |
Family
ID=67442488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810089639.XA Withdrawn CN110092835A (en) | 2018-01-30 | 2018-01-30 | A kind of GLP-1 analog-COL3A1 fusion protein |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110092835A (en) |
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030064436A1 (en) * | 1996-10-29 | 2003-04-03 | Vaughan Paul Richard | Method for producing, in yeast, a hydroxylated triple helical protein, and yeast host cells useful in said method |
CN1483041A (en) * | 2000-12-07 | 2004-03-17 | GLP-1 fusion protein | |
CN102164949A (en) * | 2009-11-19 | 2011-08-24 | 浙江大学 | Novel nonnatural protein |
US20130203959A1 (en) * | 2009-11-19 | 2013-08-08 | Zhejiang University | Nonnatural collagen-like protein and use thereof |
CN103641896A (en) * | 2009-11-19 | 2014-03-19 | 浙江大学 | Use of gelatin-like unit |
CN104870478A (en) * | 2012-12-24 | 2015-08-26 | 北京安信怀德生物技术有限公司 | Fusion protein of therapeutic polypeptide with improved pharmacokinetic profile and use thereof |
US20160194370A1 (en) * | 2012-12-24 | 2016-07-07 | Beijing Anxinhuaide Biotech. Co., Ltd. | Fusion protein of therapeutic polypeptide with improved pharmacokinetic profile and use therof |
Non-Patent Citations (2)
Title |
---|
"COL3A1 protein [Homo sapiens]" * |
LEENA ALA-KOKKO et al.: "Structure of cDNA clones coding for the entire preproα1(III) chain of human type III procollagen. Differences in protein structure from type I procollagen and conservation of codon preferences" * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114539390A (en) * | 2022-03-02 | 2022-05-27 | 广州美神生物科技有限公司 | Recombinant III-type humanized collagen C3, and expression vector, expression strain, expression method and application thereof |
CN114805551A (en) * | 2022-06-28 | 2022-07-29 | 华熙生物科技股份有限公司 | Recombinant type III collagen and preparation method thereof |
CN116284340A (en) * | 2023-02-01 | 2023-06-23 | 美尔健(深圳)生物科技有限公司 | Chaperone peptide-based transdermal enhanced recombinant human-derived three-type collagen and application thereof |
CN116284340B (en) * | 2023-02-01 | 2024-06-25 | 美尔健(深圳)生物科技有限公司 | Chaperone peptide-based transdermal enhanced recombinant human-derived three-type collagen and application thereof |
CN116874590A (en) * | 2023-08-16 | 2023-10-13 | 医械妆(广州)技术服务有限公司 | Recombinant III type collagen and preparation method thereof |
CN116874590B (en) * | 2023-08-16 | 2024-06-07 | 医械妆(广州)技术服务有限公司 | Recombinant III type collagen and preparation method thereof |
CN117384276A (en) * | 2023-12-11 | 2024-01-12 | 上海昱菘医药科技有限公司 | Recombinant collagen and preparation method and application thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110092835A (en) | A kind of GLP-1 analog-COL3A1 fusion protein | |
CN106459222A (en) | MIC-1 fusion proteins and uses thereof | |
CN113502296B (en) | Recombinant engineering bacterium for expressing semaglutide precursor and construction method thereof | |
WO2012062078A1 (en) | N-terminal deletion variant of human fibroblast growth factor 21 and conjugate thereof | |
CA2565300A1 (en) | Fgf-21 fusion proteins | |
CN107266579A (en) | Fusion protein for treating metabolic disease | |
CN113265007B (en) | Fusion protein for treating metabolic diseases and preparation method and application thereof | |
CN109414471A (en) | MIC-1 compound and application thereof | |
US20210253662A1 (en) | Long-Acting Recombinant GLP1-Fc-CD47 Protein and Preparation and Use Thereof | |
CN101875700B (en) | Method for improving bioactivity of exendin fusion protein | |
CN106397607A (en) | Recombinant human fibroblast growth factor 21 fusion protein and application thereof in preparation of medicine for treating metabolic diseases | |
CN113683679A (en) | Recombinant I-type humanized collagen C1L6T and preparation method and application thereof | |
CN113683680A (en) | Recombinant I-type humanized collagen C1L1T, and preparation method and application thereof | |
CN107108754A (en) | The antitrypsins of α 1 (A1AT) fusion protein and application thereof | |
CN113105561B (en) | Preparation method and application of double-target fusion protein | |
CN112851791B (en) | Novel FGF analogue for resisting metabolic disorder and application thereof | |
JP6612360B2 (en) | Fusion protein complex and fusion protein having medicinal action | |
CN113583142A (en) | Double-target fusion protein, coding gene, vector or host cell and application and expression and purification method thereof | |
CN110172103B (en) | GLP-1 analogue-Fc fusion protein, and preparation method and application thereof | |
CN108794634A (en) | The long-acting human growth hormone (HGH) fusion protein and its preparation and use of recombination | |
CN105884901B (en) | Tool persistently controls recombination human serum albumin/glicentin class peptide fusion protein of blood-sugar content function | |
CN113292646B (en) | GLP-1/glucagon dual agonist fusion proteins | |
CN115991793A (en) | Fusion proteins with multiple activities and uses thereof | |
CN101062948B (en) | Monomer quick-effective insulin and preparation method and usage thereof | |
CN114651063A (en) | N-terminal extension sequences for expression of recombinant therapeutic peptides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | Effective date of registration: 20210618. Address after: Room 205, West District, 2nd floor, No. 707 Zhangyang Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai, 200120. Applicant after: Shanghai Huidun Yintai Biotechnology Co.,Ltd. Address before: 200433 Rooms 309, 311, 313, 316, No. 135, Guowei Road, Yangpu District, Shanghai. Applicant before: SHANGHAI HUIDUN BIOTECHNOLOGY Co.,Ltd. |
WW01 | Invention patent application withdrawn after publication | Application publication date: 20190806 |