CN115873127A - Recombinant long-acting human growth hormone fusion protein and preparation method and application thereof - Google Patents
Recombinant long-acting human growth hormone fusion protein and preparation method and application thereof Download PDFInfo
- Publication number
- CN115873127A CN115873127A CN202210395423.2A CN202210395423A CN115873127A CN 115873127 A CN115873127 A CN 115873127A CN 202210395423 A CN202210395423 A CN 202210395423A CN 115873127 A CN115873127 A CN 115873127A
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- val
- lys
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 172
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 172
- 108010000521 Human Growth Hormone Proteins 0.000 title claims description 33
- 102000002265 Human Growth Hormone Human genes 0.000 title claims description 33
- 239000000854 Human Growth Hormone Substances 0.000 title claims description 33
- 238000002360 preparation method Methods 0.000 title claims description 11
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 68
- 239000013604 expression vector Substances 0.000 claims abstract description 41
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 39
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 36
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 36
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 28
- 206010056438 Growth hormone deficiency Diseases 0.000 claims abstract description 20
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 9
- 210000004027 cell Anatomy 0.000 claims description 113
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 30
- 201000010099 disease Diseases 0.000 claims description 28
- 230000035772 mutation Effects 0.000 claims description 27
- 102000018997 Growth Hormone Human genes 0.000 claims description 26
- 108010051696 Growth Hormone Proteins 0.000 claims description 26
- 239000000122 growth hormone Substances 0.000 claims description 26
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 claims description 20
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 claims description 20
- 239000012634 fragment Substances 0.000 claims description 17
- 102000007079 Peptide Fragments Human genes 0.000 claims description 15
- 108010033276 Peptide Fragments Proteins 0.000 claims description 15
- 108010075254 C-Peptide Proteins 0.000 claims description 14
- 230000005856 abnormality Effects 0.000 claims description 14
- 125000001151 peptidyl group Chemical group 0.000 claims description 10
- 238000002347 injection Methods 0.000 claims description 9
- 239000007924 injection Substances 0.000 claims description 9
- 239000003814 drug Substances 0.000 claims description 7
- 239000013598 vector Substances 0.000 claims description 7
- 206010053759 Growth retardation Diseases 0.000 claims description 6
- 208000026928 Turner syndrome Diseases 0.000 claims description 5
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 5
- 238000010254 subcutaneous injection Methods 0.000 claims description 5
- 239000007929 subcutaneous injection Substances 0.000 claims description 5
- 208000030507 AIDS Diseases 0.000 claims description 4
- 208000001362 Fetal Growth Retardation Diseases 0.000 claims description 4
- 206010070531 Foetal growth restriction Diseases 0.000 claims description 4
- 201000010769 Prader-Willi syndrome Diseases 0.000 claims description 4
- 208000001647 Renal Insufficiency Diseases 0.000 claims description 4
- 238000002512 chemotherapy Methods 0.000 claims description 4
- 208000030941 fetal growth restriction Diseases 0.000 claims description 4
- 201000006370 kidney failure Diseases 0.000 claims description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 3
- 238000010253 intravenous injection Methods 0.000 claims description 3
- 210000004962 mammalian cell Anatomy 0.000 claims description 2
- 230000002265 prevention Effects 0.000 claims description 2
- 239000003111 growth hormone derivative Substances 0.000 claims 2
- 230000002159 abnormal effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 65
- 238000000338 in vitro Methods 0.000 abstract description 25
- 238000001727 in vivo Methods 0.000 abstract description 21
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 abstract description 18
- 239000000178 monomer Substances 0.000 abstract description 18
- 230000000975 bioactive effect Effects 0.000 abstract description 7
- 150000001413 amino acids Chemical group 0.000 description 32
- 108020004414 DNA Proteins 0.000 description 29
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 26
- 238000002474 experimental method Methods 0.000 description 24
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 23
- 238000000034 method Methods 0.000 description 23
- 108090000623 proteins and genes Proteins 0.000 description 22
- 241000880493 Leptailurus serval Species 0.000 description 19
- 230000027455 binding Effects 0.000 description 19
- 239000002773 nucleotide Substances 0.000 description 19
- 125000003729 nucleotide group Chemical group 0.000 description 19
- 239000002609 medium Substances 0.000 description 18
- 102000004169 proteins and genes Human genes 0.000 description 17
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 16
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 16
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 16
- 108010015792 glycyllysine Proteins 0.000 description 16
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 15
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 15
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 15
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 15
- 108010033310 GX-H9 Proteins 0.000 description 14
- 230000004663 cell proliferation Effects 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 14
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 14
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 13
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 13
- 238000000746 purification Methods 0.000 description 13
- 239000000523 sample Substances 0.000 description 13
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 12
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 12
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 12
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 11
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 11
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 11
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 11
- 230000014509 gene expression Effects 0.000 description 11
- 108010051242 phenylalanylserine Proteins 0.000 description 11
- 108010031719 prolyl-serine Proteins 0.000 description 11
- 108010033419 somatotropin-binding protein Proteins 0.000 description 11
- 108010051110 tyrosyl-lysine Proteins 0.000 description 11
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 10
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 10
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 10
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 10
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 10
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 10
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 10
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 10
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 10
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 10
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 10
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 10
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 10
- 230000008901 benefit Effects 0.000 description 10
- 230000004071 biological effect Effects 0.000 description 10
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 10
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 10
- 108010048818 seryl-histidine Proteins 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 10
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 9
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 9
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 9
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 9
- 102100020948 Growth hormone receptor Human genes 0.000 description 9
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 9
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 9
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 9
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 9
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 9
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 9
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 9
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 9
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 9
- 108010068542 Somatotropin Receptors Proteins 0.000 description 9
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 9
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 9
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 9
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 9
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 9
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 9
- 239000012091 fetal bovine serum Substances 0.000 description 9
- 108010009298 lysylglutamic acid Proteins 0.000 description 9
- 108010080629 tryptophan-leucine Proteins 0.000 description 9
- NVKAWKQGWWIWPM-ABEVXSGRSA-N 17-β-hydroxy-5-α-Androstan-3-one Chemical compound C1C(=O)CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CC[C@H]21 NVKAWKQGWWIWPM-ABEVXSGRSA-N 0.000 description 8
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 8
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 8
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 8
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 8
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 8
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 8
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 8
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 8
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 8
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 8
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 8
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 8
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 8
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 8
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 8
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 8
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 8
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 8
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 8
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 8
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 8
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 8
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 8
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 8
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 8
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 8
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 8
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 8
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 8
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 8
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 8
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 8
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 8
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 8
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 8
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 8
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 8
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 8
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 8
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 8
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 8
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 8
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 8
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 8
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 8
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 8
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 8
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 8
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 8
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 8
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 8
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 8
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 8
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 8
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 8
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 8
- 108010003201 RGH 0205 Proteins 0.000 description 8
- 241000700159 Rattus Species 0.000 description 8
- 108700008625 Reporter Genes Proteins 0.000 description 8
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 8
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 8
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 8
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 8
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 8
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 8
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 8
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 8
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 8
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 8
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 8
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 8
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 8
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 8
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 8
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 8
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 8
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 8
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 8
- 108010000761 leucylarginine Proteins 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 229920001184 polypeptide Polymers 0.000 description 8
- 102000004196 processed proteins & peptides Human genes 0.000 description 8
- 108010070643 prolylglutamic acid Proteins 0.000 description 8
- 108010044292 tryptophyltyrosine Proteins 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 7
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 7
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 7
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 7
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 7
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 7
- 108010005905 delta-hGHR Proteins 0.000 description 7
- 108010053037 kyotorphin Proteins 0.000 description 7
- 238000002156 mixing Methods 0.000 description 7
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 6
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 6
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 6
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 6
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 6
- 108060001084 Luciferase Proteins 0.000 description 6
- 239000005089 Luciferase Substances 0.000 description 6
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 6
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 6
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 6
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 6
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 6
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 6
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 6
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 6
- 239000006285 cell suspension Substances 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 239000013642 negative control Substances 0.000 description 6
- 239000013641 positive control Substances 0.000 description 6
- 230000001737 promoting effect Effects 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 5
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 5
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 5
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 5
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 5
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 5
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 5
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 5
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 5
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 5
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 5
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 5
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 5
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 5
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 5
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 5
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 5
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 5
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 5
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 5
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 5
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 230000009286 beneficial effect Effects 0.000 description 5
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 108010091871 leucylmethionine Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 208000024891 symptom Diseases 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 4
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- PJWIPBIMSKJTIE-DCAQKATOSA-N Cys-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N PJWIPBIMSKJTIE-DCAQKATOSA-N 0.000 description 4
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 4
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 4
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 4
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 4
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 4
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 4
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 229910019142 PO4 Inorganic materials 0.000 description 4
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 4
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 4
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 4
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 4
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 4
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 4
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 4
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 229950002916 avelumab Drugs 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 239000000539 dimer Substances 0.000 description 4
- 239000012636 effector Substances 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 4
- 239000010452 phosphate Substances 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 235000019786 weight gain Nutrition 0.000 description 4
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 3
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 3
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 3
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 3
- 102000008096 B7-H1 Antigen Human genes 0.000 description 3
- 108010074708 B7-H1 Antigen Proteins 0.000 description 3
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 3
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 3
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 3
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 3
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 3
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 3
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 3
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 3
- 206010062767 Hypophysitis Diseases 0.000 description 3
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 3
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 3
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 3
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 3
- 239000012980 RPMI-1640 medium Substances 0.000 description 3
- UEJYSALTSUZXFV-SRVKXCTJSA-N Rigin Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UEJYSALTSUZXFV-SRVKXCTJSA-N 0.000 description 3
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 3
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 3
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 3
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 3
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 3
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 3
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 3
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 3
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 3
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 3
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 3
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 3
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 238000007664 blowing Methods 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 239000011248 coating agent Substances 0.000 description 3
- 238000000576 coating method Methods 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 3
- 230000004069 differentiation Effects 0.000 description 3
- 238000007865 diluting Methods 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- 239000002552 dosage form Substances 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 3
- 229940097277 hygromycin b Drugs 0.000 description 3
- 238000001990 intravenous administration Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 230000001817 pituitary effect Effects 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 230000035755 proliferation Effects 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 2
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102100037792 Interleukin-6 receptor subunit alpha Human genes 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 239000012911 assay medium Substances 0.000 description 2
- 229960000074 biopharmaceutical Drugs 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 230000000857 drug effect Effects 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 108040006858 interleukin-6 receptor activity proteins Proteins 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000004020 luminiscence type Methods 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000006371 metabolic abnormality Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000007857 nested PCR Methods 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 229920000136 polysorbate Polymers 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 230000004584 weight gain Effects 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 208000019901 Anxiety disease Diseases 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 208000032170 Congenital Abnormalities Diseases 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 206010021143 Hypoxia Diseases 0.000 description 1
- 206010022489 Insulin Resistance Diseases 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 238000003725 ONE-Glo Luciferase Assay System Methods 0.000 description 1
- 239000012124 Opti-MEM Substances 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 208000020221 Short stature Diseases 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- 208000022971 Tuberculous meningitis Diseases 0.000 description 1
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 230000036506 anxiety Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 210000005013 brain tissue Anatomy 0.000 description 1
- 210000000692 cap cell Anatomy 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 238000003570 cell viability assay Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 208000035850 clinical syndrome Diseases 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000001054 cortical effect Effects 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 238000002224 dissection Methods 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 239000012527 feed solution Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 239000010419 fine particle Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 231100000001 growth retardation Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000007954 hypoxia Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007919 intrasynovial administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000037356 lipid metabolism Effects 0.000 description 1
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 208000001223 meningeal tuberculosis Diseases 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000036314 physical performance Effects 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000012439 solid excipient Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000000451 tissue damage Effects 0.000 description 1
- 231100000827 tissue damage Toxicity 0.000 description 1
- 238000006257 total synthesis reaction Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/61—Growth hormone [GH], i.e. somatotropin
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P5/00—Drugs for disorders of the endocrine system
- A61P5/06—Drugs for disorders of the endocrine system of the anterior pituitary hormones, e.g. TSH, ACTH, FSH, LH, PRL, GH
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/52—Constant or Fc region; Isotype
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/70—Immunoglobulins specific features characterized by effect upon binding to a cell or to an antigen
- C07K2317/73—Inducing cell death, e.g. apoptosis, necrosis or inhibition of cell proliferation
- C07K2317/732—Antibody-dependent cellular cytotoxicity [ADCC]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/30—Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Endocrinology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Diabetes (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention relates to an Fc mutant, a fusion protein, a nucleic acid molecule, an expression vector, a recombinant cell, a pharmaceutical composition and application thereof. The Fc mutant has an amino acid sequence shown as any one of SEQ ID NO 3-7; the fusion protein comprises: a second peptide segment which is a bioactive molecule functional region, a third peptide segment which comprises an amino acid sequence shown in SEQ ID NO. 2 or the Fc mutant, and the second peptide segment is connected with the third peptide segment. The Fc mutant is in a monomer form, and the fusion protein has higher in vivo and in vitro activity and lower ADCC and CDC effects, and can effectively treat growth hormone deficiency.
Description
Technical Field
The invention relates to the fields of molecular biology and medicines, in particular to a recombinant long-acting human growth hormone fusion protein and a preparation method and application thereof, and more particularly relates to an Fc mutant, a fusion protein, a nucleic acid molecule, an expression vector, a recombinant cell, a pharmaceutical composition and application thereof.
Background
Growth hormone deficiency is a recognized clinical syndrome associated with a number of metabolic abnormalities, including abnormalities in body composition, decreased physical performance, altered lipid metabolism, decreased bone mass, increased insulin resistance, and decreased quality of life. Most of the metabolic abnormalities associated with growth hormone deficiency can be reversed by recombinant human growth hormone (rhGH) substitutes. Conventional treatment of growth hormone deficiency includes daily subcutaneous injection of rhGH, however, the treatment is cumbersome, requires daily injections, is inconvenient for many patients, raises concerns about poor compliance with the treatment, and may result in reduced efficacy. The long-acting rhGH preparation or the long-acting growth hormone preparation not only can reduce the injection times and improve the compliance, but also is helpful to improve the curative effect of GH treatment.
Therefore, there is still a need to develop a new fusion protein effective in treating growth hormone deficiency.
Disclosure of Invention
The present invention aims to solve at least to some extent at least one of the technical problems of the prior art. To this end, the invention provides an Fc mutant, a fusion protein, a nucleic acid molecule, an expression vector, a recombinant cell, a pharmaceutical composition, and uses thereof. The Fc mutants of the present invention are susceptible to formation of monomeric forms. The fusion protein has strong stability and can effectively treat growth hormone deficiency.
The present invention has been completed based on the following findings of the inventors:
the inventor finds out through experiments that mutation of specific sites of the Fc fragment of wild-type IgG1, such as at least one of P228Delete, C229Delete, P230 Delete, T366R, L368H, P395K and K409D/K409T, and L351S, M428Y and K447A, or P228Delete, C229Delete, T366R, L368H, P395K and K409D or K409T, can make the Fc fragment in a natural dimer form expressed to form an Fc mutant in a monomer form. Moreover, experiments further show that the Fc mutant is connected with growth hormone to form the fusion protein in a monomer form, so that the fusion protein can be purified by ProteinA, the purification process is simplified, and the subsequent purification process is simple. Meanwhile, more importantly, the fusion protein obtains stronger human growth hormone receptor binding activity, promotes cell proliferation activity and promotes reporter gene cell to express luciferase activity, and has longer half-life and better in vivo biological activity.
To this end, in a first aspect of the invention, the invention proposes an Fc mutant. According to an embodiment of the invention, comprising: a first peptidyl fragment having mutations compared to the Fc fragment of wild-type IgG1 at the following positions: at least one of 228 th bit, 229 th bit, 230 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit, and 351 th bit, 428 th bit and 447 th bit; or 228 th bit, 229 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit. The inventor finds through a large number of experiments that the mutation is beneficial to enabling the Fc fragment in the form of natural dimer to be expressed and then form the Fc mutant in the form of monomer, the Fc mutant in the form of monomer is adopted to be fused with bioactive molecules, and the subsequent purification can be realized through ProteinA, so that the purification process is simplified; more importantly, the fusion protein prepared by the Fc mutant has higher in vivo and in vitro activity and lower ADCC and CDC effects.
It should be noted that the Fc fragment of the antibody IgG referred to herein refers to the CH2 region and the CH3 region of the antibody IgG.
For example, the amino acid sequence (including CH2-CH 3) of the Fc fragment of wild-type IgG1 is as follows:
PCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:40)。
the amino acid sequence of the Hinge region (Hinge) of wild-type IgG1 is shown below:
DKTHTG(SEQ ID NO:41)。
the nucleotide sequence of the Hinge region (Hinge) of wild-type IgG1 is shown below:
GACAAGACACACACCGGA(SEQ ID NO:42)。
in a second aspect of the invention, a fusion protein is provided. According to an embodiment of the invention, the fusion protein comprises: a second peptide segment comprising a functional region of a biologically active molecule; a third peptide stretch comprising an Fc mutant having the amino acid sequence shown in SEQ ID NO 2 or as defined in the first aspect, the second peptide stretch being linked to the third peptide stretch. The fusion protein provided by the embodiment of the invention has higher in vitro and in vivo activity and lower ADCC and CDC effects, and can be purified by ProteinA subsequently, so that the purification process is simplified.
In a third aspect of the invention, a fusion protein is provided. According to an embodiment of the invention, the general structural formula of the fusion protein is represented as X-L-Y or Y-L-X, wherein X is a first bioactive molecule; l is absent or is a linker peptide; y is a second biologically active molecule; -is a peptide bond; and the active molecule X or Y is selected from a protein or protein domain, a polypeptide, an antibody or an antibody fragment.
It should be noted that "X-L-Y" means that the C-terminal of X is connected to the N-terminal of L, and the C-terminal of L is connected to the N-terminal of Y; the term "Y-L-X" refers to the condition that the C end of Y is connected with the N end of L, and the C end of L is connected with the N end of X.
According to an embodiment of the present invention, the fusion protein according to the second or third aspect may further comprise at least one of the following additional technical features:
according to an embodiment of the invention, the fusion protein is in monomeric form.
According to an embodiment of the invention, said X is human growth hormone. In some embodiments, the amino acid sequence of human growth hormone comprises at least a portion of an amino acid sequence selected from SEQ ID No. 1 or an amino acid sequence having at least 80% to 99% identity thereto, or a sequence thereof.
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF(SEQ ID NO:1)。
According to an embodiment of the invention, the nucleotide sequence of human growth hormone comprises a codon optimized sequence selected from SEQ ID NO 11 or a nucleotide sequence having at least 80% -99% identity thereto or at least a part of the sequence thereof or the sequence thereof.
TTTCCCACCATTCCTCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTTGACACATACCAGGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCCAGCAATAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCCGTTCAATTTCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGATTCTAATGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATCCAGACCCTGATGGGCAGACTGGAGGACGGCTCTCCTAGAACCGGACAAATTTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACATTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGCGGATTC(SEQ ID NO:11)。
According to an embodiment of the invention, said Y is an IgG Fc mutant (abbreviated as "Fc mutant"). In some embodiments, the Y is an Ig G1 Fc mutant. In some embodiments, the Ig G1 Fc mutant is at least one of SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, or an amino acid sequence having at least 80% -99% identity to any one thereof, or at least a portion of any one thereof.
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:2)。
PAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:3)。
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGA(SEQ ID NO:4)。
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTSPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSDLTVDKSRWQQGNVFSCSVYHEALHNHYTQKSLSLSPGK(SEQ ID NO:5)。
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTSPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSDLTVDKSRWQQGNVFSCSVYHEALHNHYTQKSLSLSPGA(SEQ ID NO:6)。
DKTHTGPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:7)。
The presence of a mutation in the first peptidyl fragment of SEQ ID NO:2 compared to the Fc fragment of wild type IgG1 may comprise: p228Delete, C229Delete, P230 Delete, T366R, L368H, P395K, and K409T.
The presence of mutations in the first peptidyl fragments of SEQ ID NOs 3 and 7 compared to the Fc fragment of wild-type IgG1 may include: p228Delete, C229Delete, T366R, L368H, P395K, and K409T.
The presence of a mutation in the first peptidyl fragment of SEQ ID NO:4 compared to the Fc fragment of wild type IgG1 may comprise: p228Delete, C229Delete, P230 Delete, T366R, L368H, P395K, K409T, and K447A.
The presence of a mutation in the first peptidyl fragment of SEQ ID NO:5 compared to the Fc fragment of wild type IgG1 may comprise: p228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, K409D and M428Y.
The presence of a mutation in the first peptidyl fragment of SEQ ID NO:6 compared to the Fc fragment of wild type IgG1 may comprise: p228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, K409D, M428Y and K447A.
The mutation is beneficial to the formation of a monomer form after the Fc mutant in a natural dimer form is expressed, and is beneficial to the purification of the fusion protein through ProteinA, the purification process is simplified, and the in vivo and in vitro activity is improved.
According to an embodiment of the invention, the linker peptide comprises 1 or more amino acids selected from the group consisting of glycine, serine, alanine and threonine.
According to an embodiment of the invention, the amino acid sequence of the linker peptide comprises a sequence selected from (GGGGS) n Or an amino acid sequence having at least 80% to 99% identity to any one of the sequences or at least a portion of any one of the sequences, wherein n is an integer of 1 or more or n is 1, 2, 3, 4, 5 or 6.
According to an embodiment of the invention, the amino acid sequence of the linker peptide comprises at least one selected from the group consisting of SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10 or an amino acid sequence having at least 80% to 99% identity to any one of them or at least a part of any one of them.
GGGGSGGGGSGGGGS(SEQ ID NO:8)。
GGGGSGGGGS(SEQ ID NO:9)。
GGGGS(SEQ ID NO:10)
According to an embodiment of the present invention, the nucleotide sequence of the linker peptide may comprise at least one selected from the group consisting of SEQ ID NO 21, SEQ ID NO 22, SEQ ID NO 23 or a nucleotide sequence having at least 80% to 99% identity to any one of them or a codon optimized sequence of at least a part of any one of them or any one of them.
GGAGGAGGAGGAAGCGGAGGCGGAGGATCTGGAGGAGGAGGAAGC(SEQ ID NO:21)。
GGCGGCGGAGGATCTGGCGGAGGTGGAAGT(SEQ ID NO:22)。
GGCGGCGGAGGATCT(SEQ ID NO:23)。
According to an embodiment of the invention, the amino acid sequence of the fusion protein comprises at least one selected from the group consisting of SEQ ID NO 24, SEQ ID NO 25, SEQ ID NO 26, SEQ ID NO 27, SEQ ID NO 28, SEQ ID NO 29, SEQ ID NO 30, SEQ ID NO 31 or an amino acid sequence having at least 80% -99% identity to any one of the sequences or at least a part of any one of the sequences.
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQ QKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:24)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:25)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFDKTHTGPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:26)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:27)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK(SEQ ID NO:28)。
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSTLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGAGGGGSGGGGSGGGGSFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF(SEQ ID NO:29)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTSPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSDLTVDKSRWQQGNVFSCSVYHEALHNHYTQKSLSLSPGK(SEQ ID NO:30)。
APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTSPPSRDELTKNQVSLRCHVKGFYPSDIAVEWESNGQPENNYKTTKPVLDSDGSFFLYSDLTVDKSRWQQGNVFSCSVYHEALHNHYTQKSLSLSPGAGGGGSGGGGSGGGGSFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF(SEQ ID NO:31)。
According to an embodiment of the present invention, the first biomolecule, the second biomolecule and the linker peptide in the amino acid sequences shown in SEQ ID NO:24-31 are shown in the following table:
according to an embodiment of the invention, the nucleotide sequence of the fusion protein comprises at least one selected from the group consisting of SEQ ID NO 32, 33, 34, 35, 36, 37, 38, 39 or a nucleotide sequence having at least 80% -99% identity to any one of them or a codon optimized sequence of at least a part of any one of them or any one of them.
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCAGCTGTGGATTTGCCCCTGAACTGCTCGGAGGCCCCTCCGTGTTTCTGTTCCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:32)。
TTTCCCACCATCCCCCTGTCTAGACTGTTCGACAATGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTTGACACATACCAGGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGTTTCAGCGAAAGCATCCCCACCCCCAGCAATAGAGAGGAAACCCAGCAAAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCTGTGCAATTTCTGAGAAGCGTGTTCGCCAATAGCCTGGTGTACGGCGCCAGCGATAGCAACGTGTATGATCTGCTGAAAGACCTGGAGGAGGGCATTCAGACACTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACAGGACAGATTTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACATTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGTGGATTTGGAGGAGGAGGAAGCGGAGGCGGAGGATCTGGAGGAGGAGGAAGCGCTCCTGAACTGCTGGGAGGACCTAGCGTGTTTCTGTTTCCTCCCAAACCTAAAGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTTGATGTGTCTCATGAAGACCCCGAAGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAAACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGCAAGGAATACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAGGCTAAGGGCCAGCCCAGAGAGCCCCAAGTGTATACACTGCCCCCTAGCAGAGATGAACTGACCAAAAACCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATTGCCGTGGAGTGGGAGAGCAATGGCCAACCTGAAAACAATTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTCTTTCTGTATAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAATCACTATACCCAGAAGAGCCTGAGCCTGAGCCCCGGCAAA(SEQ ID NO:33)。
TTCCCTACAATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGTTAGCCTTTGATACCTATCAGGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGTTTCAGCGAAAGCATCCCCACCCCCAGCAACAGAGAGGAGACACAGCAAAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCTGTGCAATTCCTGAGAAGCGTGTTCGCCAATAGCCTGGTGTACGGCGCTAGCGACAGCAACGTGTATGACCTGCTGAAGGACCTGGAGGAGGGCATCCAAACACTGATGGGCAGACTGGAAGATGGCAGCCCCAGAACCGGACAAATTTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTATTGCTTCAGAAAGGACATGGACAAGGTGGAAACCTTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGTGGATTTGACAAGACACACACCGGACCCGCCCCCGAACTGCTGGGAGGACCTTCTGTGTTTCTGTTTCCTCCCAAACCCAAAGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTTGTGGATGTGTCTCATGAGGATCCCGAGGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAGACAAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGAAAGGAATACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACAATTAGCAAGGCCAAGGGCCAGCCCAGAGAACCTCAAGTGTATACCCTGCCTCCCAGCAGAGATGAGCTGACAAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTCTACCCCAGCGACATTGCCGTGGAGTGGGAAAGCAATGGCCAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTTTTTCTGTACAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCATAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGCAAG(SEQ ID NO:34)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGTGCTCCTGAACTGCTCGGAGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTCGAGGTGCACAACGCCAAGACAAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:35)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGCTCCTGAACTGCTCGGAGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:36)。
GCTCCCGAACTGCTGGGAGGACCCAGCGTGTTTCTGTTTCCCCCTAAACCTAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTGGATGTGAGCCATGAGGACCCCGAAGTGAAATTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAGACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGCAAAGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAAGCCAAGGGCCAGCCCAGAGAGCCCCAAGTGTACACATTACCCCCTAGCAGAGATGAACTGACCAAAAACCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGACAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTCTTTCTGTACAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCATAATCACTACACCCAGAAGAGCCTGAGCCTGAGCCCCGGAGCTGGAGGAGGAGGAAGCGGAGGAGGAGGAAGCGGCGGAGGAGGATCTTTTCCCACAATTCCTCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTTGACACATACCAGGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGCTTTAGCGAAAGCATTCCCACCCCCAGCAACAGAGAGGAGACACAACAAAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCCGTGCAATTTCTGAGAAGCGTGTTCGCCAATAGCCTGGTGTACGGCGCTAGCGATAGCAACGTGTATGACCTGCTGAAGGACCTGGAGGAGGGCATCCAAACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACAGGACAGATTTTTAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACATTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGTGGATTT(SEQ ID NO:37)。
TTCCCCACAATCCCCCTGAGCAGACTGTTTGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGATACATACCAGGAGTTTGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGTTTCAGCGAGAGCATTCCCACACCCAGCAACAGAGAGGAAACCCAGCAAAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCTGTGCAATTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCTTCTGATAGCAATGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACACTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACAGGACAAATTTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACATTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGTGGATTTGGAGGCGGAGGAAGCGGAGGAGGAGGAAGCGGAGGAGGAGGAAGCGCTCCTGAACTGCTGGGAGGACCTAGCGTGTTTCTGTTTCCTCCTAAACCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTGGATGTGAGCCATGAAGATCCCGAAGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAAACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGAAAGGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAAGCTAAGGGCCAGCCCAGAGAGCCCCAAGTGTATACAAGCCCTCCCAGCAGGGACGAGCTGACCAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAATGGGAAAGCAATGGCCAACCCGAGAACAACTATAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCCTTTTTTCTGTACAGCGACCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGTATCACGAGGCCCTGCACAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGCAAA(SEQ ID NO:38)。
GCCCCTGAGCTGCTTGGAGGACCTAGCGTTTTTTTATTTCCCCCCAAACCTAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTTGTGGATGTGAGCCATGAAGATCCCGAGGTGAAATTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCCAAAACAAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAGGCCAAGGGCCAGCCCAGAGAACCCCAAGTGTATACCAGCCCCCCCAGCAGAGATGAGCTGACAAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAGTGGGAATCTAATGGACAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGGAGCTTTTTCCTGTATAGCGACCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGTATCACGAGGCCCTGCATAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGAGCTGGAGGAGGAGGAAGCGGAGGAGGAGGAAGCGGAGGAGGAGGATCTTTTCCTACAATTCCTCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTTGATACATACCAGGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCCCAGACCAGCCTGTGTTTTAGCGAAAGCATTCCCACACCCAGCAATAGAGAAGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAGCCTGTGCAATTTCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCTAGCGACTCTAACGTGTATGATCTGCTGAAAGACCTGGAGGAGGGCATCCAGACACTGATGGGCAGACTGGAGGACGGCAGCCCCAGAACAGGACAGATATTTAAGCAAACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACATTCCTGAGAATCGTGCAGTGCAGAAGCGTGGAGGGCAGCTGTGGATTT(SEQ ID NO:39)。
In a fourth aspect of the invention, the invention provides a linker peptide for use in the construction of fusion proteins. According to an embodiment of the invention, the amino acid sequence of the linker peptide comprises at least one selected from the group consisting of SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10 or an amino acid sequence having at least 80% to 99% identity to any one of them or at least a part of any one of them.
In a fifth aspect of the invention, a nucleic acid molecule is presented. According to an embodiment of the invention, the nucleic acid molecule encodes the Fc mutant of the first aspect, the fusion protein of the second aspect or the third aspect or the linker peptide of the fourth aspect. The Fc mutant encoded by the nucleic acid molecule of the embodiment of the invention is in a monomer form, and the fusion protein obtained by fusing the Fc mutant in the monomer form and a bioactive molecule has higher in vitro and in vivo activity and lower ADCC and CDC effects. The fusion protein encoded by the nucleic acid molecule can be purified by ProteinA subsequently, so that the purification process is simplified, and the fusion protein has higher in vitro and in vivo activity and lower ADCC and CDC effects.
According to an embodiment of the invention, the nucleotide sequence comprises a codon optimized sequence selected from at least one of SEQ ID NO 21, SEQ ID NO 22, SEQ ID NO 23 or a nucleotide sequence having at least 80% to 99% identity to any of them or at least a part of any of them or any of them.
The SEQ ID NO. 21 is used for coding SEQ ID NO. 8.
The SEQ ID NO. 22 is used for coding SEQ ID NO. 9.
The SEQ ID NO. 23 is used for coding SEQ ID NO. 10.
According to an embodiment of the invention, the nucleotide sequence comprises a codon optimized sequence selected from at least one of SEQ ID NO 32, 33, 34, 35, 36, 37, 38, 39 or a nucleotide sequence having at least 80% -99% identity to any of them or at least a part of any of them or any of them.
The SEQ ID NO. 32 is used for coding SEQ ID NO. 24.
The SEQ ID NO. 33 is used for coding SEQ ID NO. 25.
The SEQ ID NO. 34 is used for coding SEQ ID NO. 26.
The SEQ ID NO. 35 is used for coding the SEQ ID NO. 27.
The SEQ ID NO. 36 is used for coding SEQ ID NO. 28.
The SEQ ID NO. 37 is used for coding SEQ ID NO. 29.
The SEQ ID NO 38 is used for coding the SEQ ID NO 30.
The SEQ ID NO 39 is used for coding SEQ ID NO 31.
According to an embodiment of the invention, the nucleotide sequence comprises at least one selected from the group consisting of SEQ ID NO 12, 13, 14, 15, 16, 17, 18, 19 and 20 or a nucleotide sequence having at least 80% -99% identity to any one of them or at least a part of any one of them or a codon optimized sequence of any one of them.
The SEQ ID NO 12, 13, 15 and 16 are used for coding SEQ ID NO 2.
The SEQ ID NO. 14 is used for coding SEQ ID NO. 3.
The SEQ ID NO. 17 is used for coding SEQ ID NO. 4.
The SEQ ID NO. 18 is used for coding SEQ ID NO. 5.
The SEQ ID NO. 19 is used for coding SEQ ID NO. 6.
The SEQ ID NO. 20 is used for coding SEQ ID NO. 7.
GCCCCTGAACTGCTCGGAGGCCCCTCCGTGTTTCTGTTCCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:12)。
GCTCCTGAACTGCTGGGAGGACCTAGCGTGTTTCTGTTTCCTCCCAAACCTAAAGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTTGATGTGTCTCATGAAGACCCCGAAGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAAACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGCAAGGAATACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAGGCTAAGGGCCAGCCCAGAGAGCCCCAAGTGTATACACTGCCCCCTAGCAGAGATGAACTGACCAAAAACCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATTGCCGTGGAGTGGGAGAGCAATGGCCAACCTGAAAACAATTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTCTTTCTGTATAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAATCACTATACCCAGAAGAGCCTGAGCCTGAGCCCCGGCAAA(SEQ ID NO:13)。
CCCGCCCCCGAACTGCTGGGAGGACCTTCTGTGTTTCTGTTTCCTCCCAAACCCAAAGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTTGTGGATGTGTCTCATGAGGATCCCGAGGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAGACAAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGAAAGGAATACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACAATTAGCAAGGCCAAGGGCCAGCCCAGAGAACCTCAAGTGTATACCCTGCCTCCCAGCAGAGATGAGCTGACAAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTCTACCCCAGCGACATTGCCGTGGAGTGGGAAAGCAATGGCCAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTTTTTCTGTACAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCATAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGCAAG(SEQ ID NO:14)。
GCTCCTGAACTGCTCGGAGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTCGAGGTGCACAACGCCAAGACAAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:15)。
GCTCCTGAACTGCTCGGAGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGATGTGTCCCACGAGGATCCCGAAGTGAAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTACAACAGCACCTACAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGCCCTGCCTGCTCCTATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCTAGGGAACCCCAGGTTTACACACTGCCTCCAAGCAGGGACGAGCTGACCAAGAATCAGGTGTCCCTGCGGTGTCACGTGAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACCAAGCCAGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCACCCTGACCGTGGACAAGTCCAGATGGCAACAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCCCTGAGCCTGTCTCCTGGCAAA(SEQ ID NO:16)。
GCTCCCGAACTGCTGGGAGGACCCAGCGTGTTTCTGTTTCCCCCTAAACCTAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTGGATGTGAGCCATGAGGACCCCGAAGTGAAATTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAGACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGCAAAGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAAGCCAAGGGCCAGCCCAGAGAGCCCCAAGTGTACACATTACCCCCTAGCAGAGATGAACTGACCAAAAACCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGACAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTCTTTCTGTACAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCATAATCACTACACCCAGAAGAGCCTGAGCCTGAGCCCCGGAGCT(SEQ ID NO:17)。
GCTCCTGAACTGCTGGGAGGACCTAGCGTGTTTCTGTTTCCTCCTAAACCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTGGTGGATGTGAGCCATGAAGATCCCGAAGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAAACCAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGAAAGGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAAGCTAAGGGCCAGCCCAGAGAGCCCCAAGTGTATACAAGCCCTCCCAGCAGGGACGAGCTGACCAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAATGGGAAAGCAATGGCCAACCCGAGAACAACTATAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCCTTTTTTCTGTACAGCGACCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGTATCACGAGGCCCTGCACAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGCAAA(SEQ ID NO:18)。
GCCCCTGAGCTGCTTGGAGGACCTAGCGTTTTTTTATTTCCCCCCAAACCTAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTTGTGGATGTGAGCCATGAAGATCCCGAGGTGAAATTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCCAAAACAAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACCATTAGCAAGGCCAAGGGCCAGCCCAGAGAACCCCAAGTGTATACCAGCCCCCCCAGCAGAGATGAGCTGACAAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTTTACCCCAGCGACATCGCCGTGGAGTGGGAATCTAATGGACAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGGAGCTTTTTCCTGTATAGCGACCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGTATCACGAGGCCCTGCATAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGAGCT(SEQ ID NO:19)。
GACAAGACACACACCGGACCCGCCCCCGAACTGCTGGGAGGACCTTCTGTGTTTCTGTTTCCTCCCAAACCCAAAGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTTGTGGATGTGTCTCATGAGGATCCCGAGGTGAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAATGCTAAGACAAAGCCCAGAGAGGAGCAGTACAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGATTGGCTGAATGGAAAGGAATACAAGTGCAAGGTGAGCAACAAGGCCCTGCCCGCCCCTATTGAAAAAACAATTAGCAAGGCCAAGGGCCAGCCCAGAGAACCTCAAGTGTATACCCTGCCTCCCAGCAGAGATGAGCTGACAAAAAATCAGGTGAGCCTGAGATGCCACGTGAAGGGCTTCTACCCCAGCGACATTGCCGTGGAGTGGGAAAGCAATGGCCAACCTGAGAACAACTACAAGACCACCAAGCCCGTGCTGGACAGCGACGGATCTTTTTTTCTGTACAGCACCCTGACCGTGGACAAGAGCAGATGGCAGCAGGGCAATGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCATAATCACTACACCCAGAAAAGCCTGAGCCTGAGCCCCGGCAAG(SEQ ID NO:20)。
In a sixth aspect of the invention, an expression vector is provided. According to an embodiment of the invention, the expression vector carries a nucleic acid molecule according to the fifth aspect. After the expression vector according to the embodiment of the present invention is introduced into a suitable recipient cell, the expression of the Fc mutant, the fusion protein or the connecting peptide described above can be effectively achieved under the mediation of a regulatory system, so that the Fc mutant, the fusion protein or the connecting peptide can be obtained in a large amount.
In a seventh aspect of the invention, the invention features a recombinant cell. According to an embodiment of the invention, the recombinant cell comprises: carrying a nucleic acid molecule according to the fifth aspect; or, expressing the Fc mutant of the first aspect, the fusion protein of the second aspect or the third aspect, or the connecting peptide of the fourth aspect. The recombinant cells according to embodiments of the present invention can be used for in vitro expression and mass production of the aforementioned Fc mutants, fusion proteins or connecting peptides.
In an eighth aspect of the invention, a pharmaceutical composition is provided. According to an embodiment of the invention, the pharmaceutical composition comprises: the fusion protein of the second or third aspect. The pharmaceutical composition according to the embodiment of the present invention can be used for preventing and treating diseases associated with abnormality of biologically active molecules of the fusion protein.
According to an embodiment of the invention, the pharmaceutical composition is for oral, intravenous administration, such as bolus injection or by continuous infusion for a period of time, subcutaneous, intramuscular, intraarterial, intraperitoneal, intrapulmonary, intracerebrospinal, intraarticular, intrasynovial, intrathecal, intralesional, or inhalation route, such as intranasal, typically intravenous or subcutaneous administration.
According to the embodiment of the invention, the dosage form of the pharmaceutical composition is tablets, capsules, spray, injection, freeze-dried powder injection or pre-filled injection.
In a ninth aspect of the invention, the invention provides a method for preparing the fusion protein of the second or third aspect. According to an embodiment of the present invention, the preparation method comprises: culturing the recombinant cell of the seventh aspect to obtain a culture solution containing the fusion protein; separating the fusion protein from the culture medium. Thereby, the fusion proteins according to the second and third aspects can be efficiently produced.
In a tenth aspect, the present invention provides a use of the fusion protein of the second or third aspect, the nucleic acid molecule of the fifth aspect, the expression vector of the sixth aspect, the recombinant cell of the seventh aspect, or the pharmaceutical composition of the eighth aspect, in the preparation of a medicament for treating or preventing a disease associated with growth hormone abnormality.
Advantageous effects
Compared with the prior art, the invention at least has one of the following beneficial technical effects:
(1) The Fc mutant in the fusion protein used by the invention is mutated at a specific site, the growth hormone fused with the Fc mutant is secreted and expressed in a monomer form, and can be purified by ProteinA affinity chromatography, and the subsequent purification process is simple.
(2) The fusion protein provided by the invention binds to human GHR 50 Low value and strong binding activity with human GHR.
(3) EC of fusion protein provided by the invention for promoting Nb2-11 cell proliferation 50 Low value, and good proliferation effect on Nb2-11 cells.
(4) The fusion protein provided by the invention promotes the reporter gene cell to express the EC of luciferase 50 The value is low, and is equivalent to or lower than that of a control sample (Genexine GX-H9), and the luciferase activity expressed by reporter gene cells is excellent.
(5) Compared with human growth hormone, the fusion protein provided by the invention has longer half-life and better in-vivo biological activity.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a schematic representation of the monomeric form of a fusion protein in one embodiment of the invention;
FIG. 2 is a graph showing the results of J2 or J18 induced ADCC effect in example 7 of the present invention;
FIG. 3 is a graph showing the trend of the results of the weight gain of rats in each group measured for in vivo biological activity in example 9 of the present invention.
Detailed Description
The following describes in detail embodiments of the present invention. The following examples are illustrative only and are not to be construed as limiting the invention.
It should be noted that the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. Further, in the description of the present invention, "a plurality" means two or more unless otherwise specified.
Herein, the term "room temperature" means ambient temperature, which may be from 20 ℃ to 30 ℃; in some embodiments, from 22 ℃ to 28 ℃; in some embodiments, from 24 ℃ to 26 ℃; and in some embodiments, 25 ℃.
Herein, "antibody fragment" means antigen-binding fragments and antibody analogs of antibodies, which typically comprise at least a portion of the antigen-binding or variable region (e.g., one or more CDRs) of a parent antibody (parent antibody).
In this document, the terms "comprise" or "comprise" are open-ended expressions that include the elements indicated in the present invention, but do not exclude other elements.
As used herein, the terms "optionally," "optional," or "optionally" generally mean that the subsequently described event or circumstance may, but need not, occur, and that the description includes instances where the event or circumstance occurs, and instances where it does not.
As used herein, the term "variant" or "mutant" refers generally to any naturally occurring or engineered molecule comprising one or more nucleotide or amino acid mutations.
Herein, the amino acid numbering of the IgG1 Fc moiety is according to EU numbering system, e.g., position 366 refers to position 366 according to EU numbering system numbering; the expression "T366R" means that the threonine at position 366 according to the EU numbering system is replaced by arginine; "L368H" means a substitution of leucine at position 368 by histidine according to the EU numbering system.
In this context, the term "(GGGGS) n "indicates that n GGGGS are connected, e.g.," (GGGGS) 3 "indicates GGGGSGGGGSGGGGS.
As used herein, the term "fusion protein" generally refers to a protein resulting from the fusion of two or more proteins or polypeptides. Genes or nucleic acid molecules encoding the two or more proteins or polypeptides may be linked to each other to form a fusion gene or a fused nucleic acid molecule, which may encode the fusion protein. Translation of the fusion gene results in a single polypeptide having the properties of at least one, and even each, of the two or more proteins or polypeptides prior to fusion. Recombinant fusion proteins are created artificially by recombinant DNA techniques for biological research or therapy. Recombinant fusion proteins are proteins created by genetic engineering of fusion genes. The present invention relates to recombinant fusion proteins, and the terms fusion protein and recombinant fusion protein are used herein with the same meaning. The fusion proteins described herein typically comprise at least two domains (a and C), and optionally a third component, a linker between the two domains. The generation of recombinant fusion proteins is known in the art and typically involves removing a stop codon from a cDNA sequence encoding a first protein or polypeptide and then attaching the cDNA sequence of a second protein in frame by ligation or overlap extension PCR. The DNA sequence will then be expressed by the cell as a single protein. The protein may be engineered to include the entire sequence of both original proteins or polypeptides, or only a portion thereof.
The fusion proteins of the invention are typically prepared by biosynthetic methods. The encoding nucleic acids of the present invention can be readily prepared by a variety of known methods by those skilled in the art based on the nucleotide sequences of the present invention. Such methods are for example but not limited to: PCR, DNA synthesis, etc., and specific methods can be found in sambrook, molecular cloning guidelines. As an embodiment of the present invention, the coding nucleic acid sequence of the present invention can be constructed by a method of synthesizing nucleotide sequences by segmentation and then performing overlap extension PCR.
As used herein, the terms "identity", "homology" or "similarity" are used to describe amino acid sequences or nucleic acid sequences relative to a reference sequence, and the determination of the percentage of identical amino acids or nucleotides between two amino acid or nucleic acid sequences is performed by conventional methods, e.g., see Ausubel et al, eds (1995), current Protocols in Molecular Biology, chapter 19 (Greene Publishing and Wiley-Interscience, new York); and the ALIGN program (Dayhoff (1978), atlas of Protein Sequence and Structure 5 (National biological Research Foundation, washington, D.C.). There are many algorithms for aligning sequences and determining Sequence identity, including the homology alignment algorithm of Needleman et al (1970) J.mol.biol.48: 443; the local homology algorithm of Smith et al (1981) adv.Appl.Math.2: 482; methods for similarity searching by Pearson et al (1988) Proc. Natl.Acad.Sci.85:2444, the Smith-Waterman algorithm (meth. Mol.biol.70:173-187 (1997)), and the BLASTP, BLASTN, and BLASTX algorithms (see Altschul et al (1990) J.mol.biol.215: 403-410) computer programs utilizing these algorithms are also available and include, but are not limited to, ALIGN or Megalign (DNASTAR) software, or WU-BLAST-2 (Altschul et al, meth.Enzym., 266-480 (1996)), or GAP, BESTFIT, BLAST Altschul et al, supra, TA, and TFASTA, provided in Genetics computer Computing Group (GCG) package, 8 th edition, madisin, USA, and FAS, the programs available in U.A., USA, and U.S.A..
As used herein, the term "nucleotide" generally refers to a modified form that can be a ribonucleotide, a deoxynucleotide, or any type of nucleotide, as well as combinations thereof.
Herein, the term "at least 90% identity" refers to at least 90%, and may be 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% identity to each reference sequence.
As used herein, the term "at least 80% -99% sequence identity" refers to at least 80% -99%, at least 81% -99%, at least 82% -99%, at least 83% -99%, at least 84% -99%, at least 85% -99%, at least 86% -99%, at least 87% -99%, at least 88% -99%, at least 89% -99%, at least 90% -99%, at least 91% -99%, at least 92% -99%, at least 93% -99%, at least 94% -99%, at least 95% -99%, at least 96% -99%, at least 97% -99%, at least 98% -99%, or at least 99% sequence identity to each reference sequence.
In this context, the term "expression vector" generally refers to a nucleic acid molecule capable of inserting into a suitable host for self-replication, which transfers the inserted nucleic acid molecule into and/or between host cells. The expression vectors may include vectors primarily for the insertion of DNA or RNA into a cell, vectors primarily for the replication of DNA or RNA, and vectors primarily for the expression of transcription and/or translation of DNA or RNA. The expression vector also includes vectors having a variety of the above-described functions. The expression vector may be a polynucleotide capable of being transcribed and translated into a polypeptide when introduced into a suitable host cell. Typically, the expression vector can produce the desired expression product by culturing a suitable host cell containing the expression vector.
In this context, the term "recombinant cell" generally refers to a host cell whose genetic material has been modified or recombined by genetic engineering techniques or cell fusion techniques to obtain a cell with a unique trait of stable inheritance. The term "host cell" refers to a prokaryotic or eukaryotic cell into which a recombinant expression vector can be introduced. The term "transformed" or "transfected" as used herein refers to the introduction of a nucleic acid (e.g., a vector) into a cell by various techniques known in the art. Suitable host cells can be transformed or transfected with the DNA sequences of the invention and can be used for expression and/or secretion of the target protein. Examples of suitable host cells that can be used in the present invention include immortalized hybridoma cells, NS/0 myeloma cells, 293 cells, chinese Hamster Ovary (CHO) cells, heLa cells, cap cells (human amniotic fluid derived cells), and CoS cells.
The term "pharmaceutical composition" as used herein generally refers to a unit dosage form and may be prepared by any of the methods well known in the pharmaceutical art. All methods include the step of bringing into association the active ingredient with the carrier which constitutes one or more accessory ingredients. Generally, compositions are prepared by uniformly and sufficiently combining the active compound with a liquid carrier, a finely divided solid carrier, or both.
As used herein, the term "pharmaceutically acceptable excipient" can include any solvent, solid excipient, diluent, or other liquid excipient, and the like, suitable for the particular intended dosage form. Except insofar as any conventional adjuvant is incompatible with the compounds of the invention, e.g., any adverse biological effect produced or interaction in a deleterious manner with any other component of a pharmaceutically acceptable composition, their use is contemplated by the present invention.
Reference may be made to a number of other pharmaceutically acceptable adjuvants or processes mentioned herein for this subject matter, see in particular Handbook of Pharmaceutical Excipients, 3 rd edition, edited by Arthur h. Kibbe, american Pharmaceutical Association, washington, USA and Pharmaceutical Press, london; and Lexikon der Hilfsstuffe fur Pharmazie, kosmetik and angrenzene Gebiete, edited by H.P. Fiedler, 4 th edition, cantor, aulentorf and earlier versions.
As used herein, the term "administering" refers to introducing a predetermined amount of a substance into a patient by some suitable means. The fusion protein of the present invention can be administered by any common route as long as it can reach the desired tissue. Various modes of administration are contemplated, including peritoneal, intravenous, intramuscular, subcutaneous, cortical, oral, topical, nasal, pulmonary, rectal, and topical, but the invention is not limited to these exemplified modes of administration. Preferably, the fusion protein of the invention is administered by injection.
Herein, the term "growth hormone abnormality-related disease" generally refers to a disease caused by growth hormone abnormality, such as a related disease caused by growth hormone deficiency or a dissimilatory state, including, but not limited to, childhood growth hormone deficiency, idiopathic short stature, adult growth hormone deficiency, turner's Syndrome (TurnersSyndrome), prader Willi Syndrome, intrauterine growth retardation, idiopathic short stature, renal failure, chemotherapy treatment, and dissimilatory state during AIDS treatment. Growth hormone deficiency may include congenital or acquired deficiency. Regarding the congenital defect, a growth hormone defect may occur when the pituitary does not develop growth hormone secretion disorders. Acquired growth hormone deficiency may occur due to brain tissue damage caused by hypoxia resulting from difficult delivery. Other causes of growth hormone deficiency include pituitary damage caused by radiation used to treat brain tumors or postnatal tubercular meningitis. Growth hormone deficiency exhibits symptoms such as growth retardation and short stature, and innate growth hormone deficiency exhibits low glucose symptoms, beginning with newborns. In addition, children exhibit symptoms such as increased anxiety and decreased vitality.
As used herein, the term "treating" is intended to mean obtaining a desired pharmacological and/or physiological effect. The effect may be prophylactic in terms of complete or partial prevention of the disease or symptoms thereof, and/or may be therapeutic in terms of a partial or complete cure for the disease and/or adverse effects resulting from the disease. As used herein, "treatment" encompasses diseases in mammals, particularly humans, including: (a) Preventing the occurrence of a disease or disorder in an individual susceptible to the disease but not yet diagnosed; (b) inhibiting a disease, e.g., arresting disease progression; or (c) alleviating the disease, e.g., alleviating symptoms associated with the disease. As used herein, "treatment" encompasses any administration of a drug or compound to an individual to treat, cure, alleviate, ameliorate, reduce, or inhibit a disease in the individual, including, but not limited to, administering a drug containing a compound described herein to an individual in need thereof.
It is to be understood by those skilled in the art that the monomeric or dimeric form of an Fc mutant determines the monomeric or dimeric form of the fusion protein it forms.
The present invention provides an Fc mutant, a fusion protein, a nucleic acid molecule, an expression vector, a recombinant cell, a pharmaceutical composition and uses thereof, which will be described in detail below, respectively.
Fc mutants
In one aspect of the invention, the invention provides an Fc mutant. According to an embodiment of the invention, comprising: a first peptidyl fragment having mutations compared to the Fc fragment of wild-type IgG1 at the following positions: at least one of 228 th bit, 229 th bit, 230 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit, and 351 th bit, 428 th bit and 447 th bit; or 228 th bit, 229 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit.
The inventor finds through a large number of experiments that the mutation is beneficial to enabling the Fc fragment in the form of natural dimer to be expressed and then form the Fc mutant in the form of monomer, the Fc mutant in the form of monomer is adopted to be fused with bioactive molecules, and the subsequent purification can be realized through ProteinA, so that the purification process is simplified; in addition, the fusion protein prepared by the Fc mutant has higher in vivo and in vitro activity and lower ADCC and CDC effects.
According to an embodiment of the invention, the first peptide fragment has mutations compared to the Fc fragment of wild-type IgG1 at the following positions: 1) 228, 229, 230, 366, 368, 395, 409, and 447 bits; or, 2) 228 th bit, 229 th bit, 230 th bit, 351 th bit, 366 th bit, 368 th bit, 395 th bit, 409 th bit and 428 th bit; or, 3) 228 th bit, 229 th bit, 230 th bit, 351 th bit, 366 th bit, 368 th bit, 395 th bit, 409 th bit, 428 th bit and 447 th bit. Thus, the above mutations may form Fc mutants in monomeric form.
According to an embodiment of the invention, the first peptide fragment has mutations compared to the Fc fragment of wild-type IgG1 at the following positions: T366R, L368H, P395K and K409D or K409T; and at least one of L351S, M428Y and K447A. Thus, the above mutations may form Fc mutants in monomeric form.
According to an embodiment of the invention, the first peptide fragment has mutations compared to the Fc fragment of wild-type IgG1 at the following positions: 1) P228Delete, C229Delete, P230 Delete, T366R, L368H, P395K, K447A and K409D or K409T; or, 2) P228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, M428Y and K409D or K409T; or, 3) P228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, M428Y, K447A and K409D or K409T; or, 4) P228Delete, C229Delete, T366R, L368H, P395K and K409D or K409T. The inventor carries out a large number of experiments, and obtains the monomer-form Fc mutant, the fusion protein of the Fc mutant and bioactive molecules after fusion is secreted and expressed in a monomer form, and the fusion protein has higher in vivo and in vitro activity and lower ADCC and CDC effects.
Preferably, the first peptide fragment has mutations compared to the Fc fragment of wild-type IgG1 at the following positions: p228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, M428Y, K447A and K409D or K409T.
The inventor finds out through a large number of experiments that the Fc mutant is fused with human growth hormone, and the obtained fusion protein has higher binding force to a human growth hormone receptor, higher cell proliferation activity and higher in vitro activity. Illustratively, the Fc mutants of fusion proteins J16 and J18 according to the embodiments of the present invention have different mutation sites, and the Fc mutant of J18 contains the above site mutations, whereas J16 has no L351S and M428Y mutations compared to J18, and fusion protein J18 has stronger binding force to human growth hormone receptor, cell proliferation activity and in vitro activity. Furthermore, fusion protein J17 increased L351S and M428Y mutations compared to fusion protein J2 (P228 Delete, C229Delete, P230 Delete, T366R, L368H, P395K and K409D) and J2 and J17 had different mutation sites at position 409 (J17 is K409T), but the J17 fusion protein was more active on cell proliferation and in vitro activity. Therefore, the fusion protein prepared by adopting the Fc mutant with L351S and M428Y mutation has higher human growth hormone receptor binding force, cell proliferation activity and in vitro activity.
According to an embodiment of the present invention, the Fc mutant further comprises: a hinge region fragment of wild type IgG1, wherein the C end of the hinge region fragment of wild type IgG1 is connected with the N end of the first peptide segment.
According to an embodiment of the present invention, the hinge region fragment of wild type IgG1 has the amino acid sequence shown in SEQ ID NO: 41.
According to an embodiment of the present invention, the Fc mutant has an amino acid sequence as shown in any one of SEQ ID NOs 3 to 7. The inventor finds that the fusion protein obtained by fusing the Fc mutant and bioactive molecules has high in vivo and in vitro bioactivity.
According to an embodiment of the invention, the Fc mutant is in monomeric form.
Fusion proteins
In yet another aspect of the invention, a fusion protein is provided. According to an embodiment of the invention, the fusion protein comprises: a second peptide segment comprising a biologically active molecular domain; a third peptide stretch comprising an Fc mutant having the amino acid sequence shown in SEQ ID NO 2 or an Fc mutant as defined previously, the second peptide stretch being linked to the third peptide stretch. The inventor finds through a large number of experiments that the fusion protein is in a monomer form, and has higher in vivo and in vitro activity and lower ADCC and CDC effects; moreover, the fusion protein can be purified through ProteinA subsequently, and the purification process is simplified.
It should be noted that "monomer" refers to 1 Fc mutant, and the specific structure is shown in fig. 1.
According to an embodiment of the invention, said second peptide segment comprises growth hormone, a growth hormone analogue, a growth hormone functional region or a growth hormone analogue functional region, preferably human growth hormone or a human growth hormone functional region. The inventor finds that the fusion protein is in a monomer form, has higher in vivo and in vitro activity and lower ADCC and CDC effects, and has the advantages of strong binding force to human growth hormone receptors, good cell proliferation activity, high in vitro activity and the like.
According to an embodiment of the invention, the human growth hormone has the amino acid sequence as SEQ ID No. 1 or an amino acid sequence having at least 90% identity thereto.
According to an embodiment of the invention, the N-terminus of the second peptide stretch is linked to the C-terminus of the third peptide stretch; or the C end of the second peptide segment is connected with the N end of the third peptide segment. Therefore, the fusion protein prepared by adopting the connection mode has higher in vivo and in vitro activity.
According to an embodiment of the invention, the N-terminus of the second peptidyl fragment is linked to the C-terminus of the third peptidyl fragment. The inventors have found through a large number of experiments that the fusion protein obtained by the above connection method can further improve the binding force to the corresponding receptor, the cell proliferation activity and the in vitro activity, such as: the fusion protein prepared by fusing the Fc mutant and the growth hormone has better drug effect in animals.
According to an embodiment of the present invention, the fusion protein further comprises a linker peptide disposed between the second peptide segment and the third peptide segment. Therefore, the second peptide segment and the third peptide segment which can be fused by the connecting peptide are adopted, and the biological activity of the finally obtained fusion protein is improved. Illustratively, the fusion proteins J1, J2, J6 and J7 differ in their connecting peptides, such that J2, J6 and J7 have better binding activity for human GHR and better promotion of cell proliferation than J1 without the connecting peptide.
According to an embodiment of the present invention, the N-terminus of the linker peptide is linked to the C-terminus of the third peptide fragment, and the C-terminus of the linker peptide is linked to the N-terminus of the second peptide fragment; or the N end of the connecting peptide is connected with the C end of the second peptide segment, and the C end of the connecting peptide is connected with the N end of the third peptide segment.
According to an embodiment of the invention, the N-terminus of the linker peptide is linked to the C-terminus of the third peptide stretch, and the C-terminus of the linker peptide is linked to the N-terminus of the second peptide stretch. The inventors have found through a large number of experiments that the fusion protein obtained by the above connection method can further improve the binding force to the corresponding receptor, the cell proliferation activity and the in vitro activity, such as: the fusion protein prepared by fusing the Fc mutant and the growth hormone has better drug effect in animals.
According to an embodiment of the present invention, the amino acid sequence of the linker peptide is (GGGGS) n Wherein n is an integer greater than or equal to 1, preferably 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10. Thus, the use of the second peptide fragment and the third peptide fragment, which are fused by the linker peptide, can improve the biological activity of the finally obtained fusion protein.
According to an embodiment of the invention, the linker peptide has an amino acid sequence as shown in any one of SEQ ID NOs 8 to 10. Thus, the use of the second peptide fragment and the third peptide fragment, which are fused by the linker peptide, can further improve the biological activity of the finally obtained fusion protein.
According to an embodiment of the invention, the fusion protein has an amino acid sequence as shown in any one of SEQ ID NOs 24 to 31. The inventor finds that the fusion protein has strong binding force to human growth hormone receptors, good cell proliferation activity and high in vitro activity through experiments.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants apply equally to the fusion proteins and are not described in detail here.
Nucleic acid molecules
In another aspect of the invention, the invention features a nucleic acid molecule. According to an embodiment of the invention, the nucleic acid molecule encodes the aforementioned Fc mutant or the aforementioned fusion protein. The nucleic acid molecule can be effectively used for expressing the Fc mutant or the fusion protein, particularly can effectively express the fusion protein in a prokaryotic organism or lower eukaryotic organism expression system, and the fusion protein is in a monomer form and has higher in vitro and in vivo activity and lower ADCC and CDC effects.
According to an embodiment of the invention, the nucleic acid molecule is DNA.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants and fusion proteins apply equally to the nucleic acid molecules and will not be described in further detail herein.
Expression vector
In another aspect of the invention, the invention features an expression vector. According to an embodiment of the invention, the expression vector carries the aforementioned nucleic acid molecule. The Fc mutant or the fusion protein can be effectively expressed in cells by using the expression vector, particularly the fusion protein can be effectively expressed in a prokaryotic organism or lower eukaryotic organism expression system, and the fusion protein is in a monomer form and has higher in-vivo and in-vitro activity and lower ADCC and CDC effects.
According to an embodiment of the invention, the expression vector is a eukaryotic expression vector.
According to an embodiment of the invention, the expression vector is a lentiviral vector.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants, fusion proteins and nucleic acid molecules apply equally to the expression vector and are not described in detail here.
Recombinant cell
In another aspect of the invention, the invention features a recombinant cell. According to an embodiment of the invention, the recombinant cell comprises: carrying the aforementioned nucleic acid molecule; or, expressing the aforementioned Fc mutant or the aforementioned fusion protein. The recombinant cell can be used for effectively expressing the Fc mutant or the fusion protein, particularly the fusion protein can be effectively expressed in a prokaryotic organism or lower eukaryotic organism expression system, and the fusion protein is in a monomer form and has higher in vitro and in vivo activity and lower ADCC and CDC effects.
According to an embodiment of the present invention, the recombinant cell is obtained by introducing the aforementioned expression vector into a host cell.
According to an embodiment of the invention, the recombinant cell is a eukaryotic cell.
According to an embodiment of the invention, the recombinant cell is a mammalian cell.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants, fusion proteins, nucleic acid molecules and expression vectors apply equally to the recombinant cells and will not be described in detail here.
Pharmaceutical composition
In another aspect of the invention, a pharmaceutical composition is provided. According to an embodiment of the invention, the pharmaceutical composition comprises: the aforementioned fusion protein. The pharmaceutical composition according to the embodiment of the invention can be used for preventing and treating diseases related to growth hormone abnormality.
According to an embodiment of the invention, the pharmaceutical composition further comprises a pharmaceutically acceptable excipient.
According to an embodiment of the present invention, the pharmaceutical composition is in the form of an injection.
According to an embodiment of the invention, the route of administration of the pharmaceutical composition comprises subcutaneous injection or intravenous injection.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants, fusion proteins, nucleic acid molecules, expression vectors and recombinant cells apply equally to the pharmaceutical composition and will not be described in further detail herein.
Method for preventing and/or treating diseases related to growth hormone abnormality
In another aspect of the present invention, the present invention provides a method for preventing and/or treating diseases associated with growth hormone abnormality. According to an embodiment of the invention, the method comprises: administering to the subject a pharmaceutically acceptable amount of the aforementioned fusion protein or the aforementioned pharmaceutical composition. According to embodiments of the present invention, the method is effective for preventing or treating diseases associated with growth hormone abnormality.
According to an embodiment of the invention, the route of administration of the method comprises subcutaneous injection or intravenous injection.
According to an embodiment of the present invention, the disease associated with growth hormone abnormality includes at least one selected from the group consisting of: childhood growth hormone deficiency, idiopathic short stature, adult growth hormone deficiency, turner's syndrome, prader-willi syndrome, renal failure, diseases caused by dissimilatory states during chemotherapy treatment and AIDS treatment, intrauterine growth retardation.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants, fusion proteins, nucleic acid molecules, expression vectors, recombinant cells and pharmaceutical compositions are equally applicable to the method for preventing and/or treating diseases associated with growth hormone abnormalities and will not be described in detail herein.
Use of
In another aspect of the present invention, the present invention provides a use of the aforementioned fusion protein, the aforementioned nucleic acid molecule, the aforementioned expression vector, the aforementioned recombinant cell, and the aforementioned pharmaceutical composition in the preparation of a medicament for treating or preventing a disease associated with growth hormone abnormality.
According to an embodiment of the present invention, the disease associated with growth hormone abnormality includes at least one selected from the group consisting of: childhood growth hormone deficiency, idiopathic short stature, adult growth hormone deficiency, turner's syndrome, prader-willi syndrome, renal failure, diseases caused by dissimilatory states during chemotherapy treatment and AIDS treatment, intrauterine growth retardation.
It will be appreciated by those skilled in the art that the features and advantages described above for the Fc mutants, fusion proteins, nucleic acid molecules, expression vectors, recombinant cells and pharmaceutical compositions are equally applicable to this use and will not be described in further detail herein.
The scheme of the invention will be explained with reference to the following examples. It will be appreciated by those skilled in the art that the following examples are illustrative of the invention only and should not be taken as limiting the scope of the invention. The examples, where specific techniques or conditions are not indicated, are to be construed according to the techniques or conditions described in the literature in the art or according to the product specifications. The reagents or instruments used are not indicated by the manufacturer, and are all conventional products commercially available.
Example 1: preparation of recombinant long-acting human growth hormone fusion protein expression vector
In the present application, human growth hormone (SEQ ID NO: 1) was linked to Fc mutants (SEQ ID NO:2 and SEQ ID NO: 7) to construct fusion proteins, to obtain J1 and J3, respectively. In the application, human growth hormone (SEQ ID NO: 1) is connected with various Fc mutants (SEQ ID NO:2 and SEQ ID NO: 4-6) through connecting peptides (SEQ ID NO: 8-10) to construct fusion proteins, and J2, J6, J7 and J16-18 are obtained respectively. The specific experimental procedures were as follows:
nucleotide sequences (SEQ ID NO: 11) for coding human growth hormone, nucleotide sequences (SEQ ID NO: 21-23) for coding connecting peptide and nucleotide sequences (SEQ ID NO: 16-20) for coding various mutant Fc mutants are combined, and the nucleotide sequences SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38 and SEQ ID NO:39 are respectively cloned into an expression vector pCDNA3.4 (purchased from Thermo Fisher) by adopting gene total synthesis and molecular cloning technologies, so as to respectively obtain fusion protein expression vectors of the nucleotide sequences.
Example 2: recombinant long-acting human growth hormone fusion protein expression
The host cells were transfected with the plasmids obtained in example 1 using Expi CHO-S (Gibco, A29133) as host cells, and the fusion proteins (i.e., recombinant long-acting human growth hormone fusion proteins) were transiently expressed using a chemical transfection reagent Polyplus-FectoPRO (Polyplus, 116-010), and the corresponding amino acid sequences were SEQ ID NO:24-31. The experimental procedure was as follows:
1. the cells were passaged one day before transient infection with Expi CHO-S cells, and the cell density was adjusted to 3X 10 with the medium 6 cells/mL or so. The cell culture flask was returned to the shaker (37 ℃, 8% CO) 2 ) And (5) continuing culturing.
2. Day of transient infection (1L per molecule transfection): taking Expi CHO-S cell fluid to count cells, adjusting the cell density to 6 x 10 by using a culture medium 6 cells/mL or so.
3. Preparing a transfection complex: taking 16 sterile cell culture bottles, marking 8 marked DNA and 8 marked FectoPRO, and adding a transfection reagent Polyplus-FectoPRO into the FectoPRO bottle; adding 60mL of Opti-MEM solution to 8 DNA bottles and 500. Mu.g of each of the fusion protein expression vectors of the nucleotide sequences obtained in example 1 to each DNA bottle; mixing to obtain fusion protein expression vector diluent, adding the fusion protein expression vector diluent obtained from each nucleotide sequence into a FectoPRO bottle respectively, mixing, incubating at room temperature for 10min, adding into cell liquid, shaking, and placing the cell culture bottle back to the shaker for continuous culture and transfection.
4. After transfection for 18-22h, adding a proper amount of OPM-CHO Profeed, and measuring the biochemical indexes of cell fluid. According to the biochemical index, glucose is supplemented to 6g/L. Titer assay was started 4d after transfection and supplemented with sugar every other day. The CHO cell fermentation liquor is obtained when the cell survival rate is less than 80 percent, and supernatant can be harvested for purification.
Example 3: recombinant long-acting human growth hormone fusion protein purification
The CHO cell fermentation liquid obtained in example 2 was subjected to secondary centrifugation (primary: 3000 Xg, 30min; secondary: 12000 Xg, 20 min), and the supernatant was collected and filtered through a 0.2 μm filter for use.
Protein a affinity chromatography: the chromatography column is equilibrated for at least 3 column volumes using an aqueous solution of pH7.2 containing 20mM phosphate and 150mM sodium chloride, the filtered clear filtrate is loaded, the feed solution is retained on the chromatography column for 5 minutes, after the sample is loaded, the chromatography column is equilibrated for at least 1 column volume using an aqueous solution of pH7.2 containing 20mM phosphate and 150mM sodium chloride, the target protein is eluted using a buffer solution of pH 4.5 containing 50mM acetic acid (HAc), a buffer solution of pH4.0 containing 50mM HAc, and a buffer solution of pH 3.5 containing 50mM HAc, respectively, and when an absorption peak occurs, the peak absorption range is 50mAU-peak-50mAU, a fusion protein having a purity SEC (> 98.0) is obtained, and the fusion protein containing the nucleotide sequence SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, or SEQ ID: 39, SEQ ID NO:39, CHO, SEQ ID NO:16, CHO, 18J, 17J, 18J, 18, 17, 18J, 18, respectively, and the fusion protein is fermented. The specific structure of the fusion protein is shown in FIG. 1, wherein the structures of J1, J2, J3, J6, J7 and J17 are shown in the left picture of FIG. 1, and the structures of J16 and J18 are shown in the right picture of FIG. 1.
Example 4: fusion protein binding assay to human GHR
In this example, the purified fusion protein obtained in example 3 is subjected to in vitro receptor (human growth hormone receptor, abbreviated as human GHR) binding activity detection, and Genexine GX-H9 is used as a control group, and the specific experimental operations are as follows:
diluting human GHR to obtain 0.5 μ g/ml coating solution, adding 100 μ L/well of the coating solution into an ELISA plate, and coating at 2-8 deg.C for more than 12 hr. The plate-coated residual solution was discarded, and 1% BSA-PBST (1% bovine serum albumin in phosphate Tween buffer) was added thereto, at 300. Mu.L per well, and blocked at 37 ℃ for 1 hour. Adding 300 mu L of PBST (phosphate Tween buffer) into each well, washing for 3 times, diluting the recombinant long-acting human growth hormone fusion protein and the Genexine GX-H9 to 5 mu g/ml, diluting the diluted recombinant long-acting human growth hormone fusion protein and the Genexine GX-H9 to 8 gradient concentrations by 5 times, and adding 100 mu L of the fusion protein into an ELISA plate. Incubation at 37 ℃ for 1 hour300 μ L of PBST was added to each well, washed 3 times, and then 1% of Goat anti human IgG Fc-HRP diluted 10000 times with BSA-PBST was added, and 100 μ L/well was added. After incubation for 1 hour at 37 deg.C, 300. Mu.L of PBST was added to each well, washed 3 times, and patted dry. TMB developing solution was added thereto in an amount of 100. Mu.L per well. After 5 minutes reaction at room temperature, 2M H was added 2 SO 4 The reaction was stopped with 100. Mu.L/well of aqueous solution. Placing the enzyme label plate on an enzyme label instrument, and reading the absorbance OD at the wavelength of 450nm 450 The EC50 value of the human GHR binding activity of each fusion protein is calculated, and the result is shown in Table 1, and the result shows that the fusion protein provided by the invention has low EC50 value of the human GHR binding activity and strong binding activity with the human GHR. See the binding activity relative to Genexine GX-H9, which shows that the binding activity to human GHR is similar or better than that of Genexine GX-H9.
Wherein the amino acid sequence of the Genexine GX-H9 is as follows:
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFRNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK(SEQ ID NO:43)。
table 1: detection result of recombinant long-acting human growth hormone fusion protein in human GHR (growth hormone receptor) binding activity test
Example 5: experiment for promoting Nb2-11 cell proliferation by recombinant long-acting human growth hormone fusion protein
The purified fusion proteins obtained in example 3 were tested, and the above Genexine GX-H9 was used as a control, and the following specific test procedures were performed:
cells were collected and cultured at 1X 10 per ml 5 Individual cell concentrations were suspended in culture medium (PRMI)1640 medium, 1% FBS and 50. Mu.M. Beta. -mercaptoethanol). Each 50 μ L cell sample was added to each well of a 96-well cell culture plate. The cells were cultured in 50. Mu.L of assay medium containing various concentrations of recombinant long-acting human growth hormone fusion protein from 0.051ng/mL to 3000ng/mL, and Genexine GX-H9. At 37 ℃ C, 5% CO 2 The Cell plates were cultured in a wet incubator for 96 hours, and then 50. Mu.L of an ellTiter-Glo luminescennt Cell Viability Assay (Promega, G7571) was added to each well. After 10 minutes, the microplate reader detects the chemiluminescent signal. The biological activity of the recombinant long-acting human growth hormone fusion protein can be measured by the obtained dose response curve, and the result is shown in Table 2, and the EC of the fusion protein provided by the invention on promoting the proliferation of Nb2-11 cells 50 Low value, and good proliferation effect on Nb2-11 cells. Compared with Genexine GX-H9, the relative activity of promoting the cell proliferation of Nb2-11 is similar or better.
Table 2: experimental detection result of recombinant long-acting human growth hormone fusion protein for promoting Nb2-11 cell proliferation
Example 6: cell biological activity experiment of recombinant long-acting human growth hormone fusion protein on reporter gene
In this example, experiments were performed on the purified fusion protein obtained in example 3, and the above Genexine GX-H9 was used as a control group, and the specific experimental operations were as follows:
taking reporter gene cells (293-GHR/STAT 5 cell line) in logarithmic growth phase, plating after trypsinization (costar, 3917), 4 × 10 4 cells/well, 50. Mu.L/well, 96-well white plate at 37 ℃ C. And 5% CO 2 Incubate overnight in the incubator. Adding gradient diluted recombinant long-acting human growth hormone fusion protein to the cells, starting at 100nM, 5-fold gradient dilution, 10 gradient concentrations, adding the dilution to the cells, adding the volume of 50 μ L/well, placing 96-well white plate at 37 deg.C, and 5% CO 2 Incubate in incubator for 6 hours. After the incubation was completed, the 96-well white plate and the Nano-Glo Luciferase Assay kit (promega, N112B) were removed and the samples were averagedThe reaction substrate was added to the wells at room temperature, the wells were left at 50. Mu.L/well for 10 minutes at room temperature, the luminescence signal values were recorded using a microplate reader (Promega, GM 2000) with the protein concentration as the X-axis and the luminescence signal values as the Y-axis, and the EC was calculated using GraphPad Prisim 5 as four-parameter fit 50 The results are shown in Table 3. The results show that the fusion protein provided by the invention promotes the reporter gene cell to express the EC of luciferase 50 The value is low, and is equivalent to or lower than that of a control group, and the luciferase activity expressed by reporter gene cells is excellent.
Table 3: experimental detection result of recombinant long-acting human growth hormone fusion protein for promoting reporter gene cell to express luciferase
Fusion protein numbering | EC 50 (ng/mL) | Relative Activity with respect to Genexine GX-H9 (%) |
J1 | 95.47 | 79.67 |
J2 | 70.81 | 101.71 |
J3 | 87.06 | 82.72 |
J6 | 84.14 | 92.58 |
J7 | 89.72 | 95.29 |
J16 | 39.69 | 215.12 |
J17 | 56.46 | 157.46 |
J18 | 33.61 | 302.59 |
Example 7: the test sample induces ADCC Effect of Effector cells Jurkat-CD16a-luc on target cells CHO-K1-PD-L1/GHR
1. Sample information
Sample name | Concentration (mg/mL) |
M7824 | 2.05 |
2(J2) | 0.6 |
5(Genexine GX-H9) | 1.33 |
18(J18) | 2.66 |
|
20 |
2. Cell processing
Target cells CHO-K1-PD-L1/GHR (Shanghai Jiabei biological medicine technology Co., ltd., SBTCL 015) treatment: subculture was carried out 2 days before the experiment, the supernatant was discarded, the cells were washed with 5mL of PBS, 1mL of 0.25% Trypsin-EDTA was added, after digestion in an incubator at 37 ℃ for 1min, 6mL of Medium 1 (450mL F12 Medium +50mL FBS) was added to stop the digestion, and the suspension was resuspended to a single cell suspension, 5mL of the cell suspension was discarded, 8mL of Medium 1 (450mL F12 Medium +50mL Fetal Bovine Serum (FBS)) was added, 120. Mu.L of G418 was added at a concentration of 600. Mu.g/mL, mixed well, and placed in the incubator to continue the culture.
Effector cell Jurkat-CD16a-luc (Shanghai Jiabei Biopharmaceutical technology, inc., SBTCL 001) treatment: subculturing 2 days before experiment, blowing and beating the mixed cell suspension, taking 5mL of cell suspension to a new T75 culture flask, adding 15mL of culture medium 2 (450mL of RPMI 1640 culture medium +50mL of FBS), adding 320 μ L of G418, uniformly mixing the mixture with 60 μ L of Hygromycin B (Hygromycin B) and 150 μ G/mL of Hygromycin B, and putting the mixture into an incubator for continuous culture.
3.3 Experimental procedure
The first day, target cells CHO-K1-PD-L1/GHR density was adjusted to 1.83X 10 with assay Medium 1 (40mL F12 Medium +0.4mL FBS) 5 cells/mL, 50 μ L/well plated overnight; the next day, the test sample (i.e. J2 or J18), the positive control M7824 and the negative control Acterma were first diluted to 0.1mg/mL with the analytical medium 2 (RPMI 1640 medium), second diluted to 24. Mu.g/mL, then diluted 3 times to 9 gradients, 25. Mu.L/well was added to the cell plate, mixed well, incubated in a 37 ℃ incubator for 45min, and the fine particles were removedTaking out the cell plate; effector cell Jurkat-CD16a-luc Density adjusted to 1.47X 10 with analytical Medium 2 (RPMI 1640 Medium) 6 Adding 25 mu L/hole of cells/mL into a cell plate, uniformly mixing, placing the cell plate in an incubator at 37 ℃ for co-incubation for 5.5h, taking the cell plate out, balancing the cell plate at room temperature, adding One-Glo Luciferase Assay System (Promega, E6120), adding 100 mu L/hole of the cell plate, blowing by a pipette for 10 times, and uniformly mixing; and (4) after the mixture is cracked for 10min in a dark place at room temperature, detecting a chemiluminescence signal value by using an enzyme-labeling instrument. As shown in FIG. 2, after adding the positive control PD-L1 antibody M7824 (Shanghai Behcet Biomedicine technology Co., ltd.), the negative control IL-6R antibody Acterma (Roche) and the test sample to the ADCC experiment, the positive control M7824 induces significant ADCC effect of the effector cell Jurkat-CD16a-luc on the target cell CHO-K1-PD-L1/GHR, no ADCC effect of the negative control Acterma and no ADCC effect of the test sample, wherein the concentration unit of the abscissa of FIG. 2 is ng/ml.
Example 8: test samples induce CDC Effect of human serum complement on target cells CHO-K1-PD-L1/GHR
1. Sample information
Sample name | Concentration (mg/mL) |
Avelumab | 22.65 |
J2 | 0.6 |
Genexine GX-H9 | 1.33 |
J18 | 2.66 |
Acterma | 20 |
2. Cell processing
CHO-K1-PD-L1/GHR treatment of target cells: subculture was carried out 2 days before the experiment, the supernatant was discarded, the cells were washed with 5mL of PBS, 1mL of 0.25% Trypsin-EDTA was added, after digestion in an incubator at 37 ℃ for 1min, 6mL of Medium 1 (450mL F12 Medium +50mL FBS) was added to stop the digestion, and the suspension was resuspended to a single cell suspension, 5mL of the cell suspension was discarded, 8mL of Medium 1 (450mL F12 Medium +50mL FBS) was added, 120. Mu.L of G418 was added, and the concentration of G418 was 600. Mu.g/mL, mixed well, and placed in the incubator to continue the culture.
3. Experimental procedure
The first day, the target cells CHO-K1-PD-L1/GHR density was adjusted to 2.0X 10 with differentiation medium (99% F12 medium +1% FBS) 5 cells/mL, 50 μ L/well plated overnight; the next day, the test sample (i.e., J2 or J18), the positive control Avelumab (PD-L1 antibody, shanghai bei bio-pharmaceutical technology limited), and the negative control Acterma were first diluted to 0.1mg/mL with differentiation medium (99% f12 medium +1 fbs), second diluted to 15 μ g/mL, further diluted 3-fold for 7 gradients, 50 μ L/well added to the cell plate, mixed well, placed in a 37 ℃ incubator for 30min, and then the cell plate was removed; adding 30% complement (containing 70% differentiation medium and 30% complement), 50 uL/hole, mixing, placing in an incubator at 37 ℃ for incubation for 6h, taking out the Cell plate, balancing the Cell plate at room temperature, adding One-Glo Luminescent Cell vitality Assay, adding 50 uL/hole into the Cell plate, blowing by a pipette for 10 times, and mixing; after cracking for 10min at room temperature in dark, the chemiluminescence signal value is detected by an enzyme-linked immunosorbent assay. The result shows that after the positive control PD-L1 antibody Avelumab, the negative control IL-6R antibody Acterma and the SBT123 test sample molecules are added in a CDC experiment, the positive control Avelumab induces that human serum complement has obvious CDC effect on the target cell CHO-K1-PD-L1/GHR, the negative control Acterma has no CDC effect, and the test sample has no CDC effect.
Example 9: in vivo biological Activity assay
Pharmacodynamic studies of long-acting in vivo growth promotion of J2 fusion proteins and J18 fusion proteins: selecting SPF male SD rats with the body weight of 60g-80g and the age of 4 weeks. The hypophysis is removed under the clean condition of 2 weeks before the experiment, 2 weeks after the hypophysis removal operation is a recovery period, qualified healthy animals with the weight change of rats less than +/-10% of the weight before the operation are selected before administration, and the hypophysis removal rats are uniformly and randomly divided into 5 groups according to the weight, namely a model group, a J2 fusion protein low-dose group, a J2 fusion protein high-dose group, a J18 fusion protein low-dose group and a J18 fusion protein high-dose group; each group had 8. The administration mode is neck subcutaneous injection, and the J2 fusion protein is administered to the J2 fusion protein low-dose group according to the administration amount of 9.35 nmol/kg; the J18 fusion protein low-dose group is used for administering the J18 fusion protein according to the administration amount of 9.35 nmol/kg; the high-dose group of the J2 fusion protein administers the J2 fusion protein according to the administration amount of 56.1 nmol/kg; the high-dose group of the J18 fusion protein administers the J18 fusion protein according to the administration amount of 56.1 nmol/kg; vehicle administration to the model group; each group was dosed or administered vehicle once a week, co-dosed or administered vehicle 2 times as described above. After dosing, each rat was weighed at the same time each day, daily rat weight changes were calculated, the test was stopped after 14 days, and the rats were weighed. The weight gain (g) on a day after administration was the difference between the daily weight and the weight before administration for each animal. At the end of the experiment, dissection examination was performed to confirm that no pituitary residue remained in the sphenoid saddle region. The results of comparing the change in body weight gain of each group over 14 days are shown in fig. 3, where "2# -low" is the J2 fusion protein low dose group, "2# -high" is the J2 fusion protein high dose group, "18# -low" is the J18 fusion protein low dose group, and "18# -high" is the J18 fusion protein high dose group; 2# -low vs model group, p =0.0083;2# -high vs model group, p < 0.0001;18# -low vs model set, p =0.0030;18# -high vs model group, p =0.0003.
From the results of fig. 3, it can be seen that:
(1) Compared with a model group, the weight of a rat can be obviously improved by the J2 fusion protein and the J18 fusion protein under the conditions of low dose and high dose, and the fusion protein provided by the invention has good in vivo activity.
(2) The high dose group showed a greater magnitude of body weight gain than the low dose group.
(3) The J18 fusion protein high dose group had a more significant effect than the J2 high dose group.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.
SEQUENCE LISTING
<110> Shenzhen Konjing pharmaceutical Limited
<120> recombinant long-acting human growth hormone fusion protein, preparation method and application thereof
<130> PDI220116
<160> 43
<170> PatentIn version 3.3
<210> 1
<211> 191
<212> PRT
<213> Artificial
<220>
<223> 1
<400> 1
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe
180 185 190
<210> 2
<211> 217
<212> PRT
<213> Artificial
<220>
<223> 2
<400> 2
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Lys
210 215
<210> 3
<211> 218
<212> PRT
<213> Artificial
<220>
<223> 3
<400> 3
Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
1 5 10 15
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
20 25 30
Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp
35 40 45
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
50 55 60
Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu
65 70 75 80
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
85 90 95
Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
100 105 110
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu
115 120 125
Leu Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr
130 135 140
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
145 150 155 160
Asn Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
165 170 175
Leu Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn
180 185 190
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
195 200 205
Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
210 215
<210> 4
<211> 217
<212> PRT
<213> Artificial
<220>
<223> 4
<400> 4
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Ala
210 215
<210> 5
<211> 217
<212> PRT
<213> Artificial
<220>
<223> 5
<400> 5
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Ser Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Asp Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Tyr His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Lys
210 215
<210> 6
<211> 217
<212> PRT
<213> Artificial
<220>
<223> 6
<400> 6
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Ser Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Asp Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Tyr His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Ala
210 215
<210> 7
<211> 224
<212> PRT
<213> Artificial
<220>
<223> 7
<400> 7
Asp Lys Thr His Thr Gly Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
1 5 10 15
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
20 25 30
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
35 40 45
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
50 55 60
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
65 70 75 80
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
85 90 95
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
100 105 110
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
115 120 125
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Arg Cys
130 135 140
His Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
145 150 155 160
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Lys Pro Val Leu Asp
165 170 175
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Thr Leu Thr Val Asp Lys Ser
180 185 190
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
195 200 205
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
210 215 220
<210> 8
<211> 15
<212> PRT
<213> Artificial
<220>
<223> 8
<400> 8
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15
<210> 9
<211> 10
<212> PRT
<213> Artificial
<220>
<223> 9
<400> 9
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10
<210> 10
<211> 5
<212> PRT
<213> Artificial
<220>
<223> 10
<400> 10
Gly Gly Gly Gly Ser
1 5
<210> 11
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 11
<400> 11
tttcccacca ttcctctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg cctttgacac ataccaggag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa cccccagacc agcctgtgct tcagcgagag catccccacc 180
cccagcaata gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctgctgatcc agagctggct ggagcccgtt caatttctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga ttctaatgtg tacgacctgc tgaaggacct ggaggagggc 360
atccagaccc tgatgggcag actggaggac ggctctccta gaaccggaca aattttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agacattcct gagaatcgtg 540
cagtgcagaa gcgtggaggg cagctgcgga ttc 573
<210> 12
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 12
<400> 12
gcccctgaac tgctcggagg cccctccgtg tttctgttcc ctccaaagcc taaggacacc 60
ctgatgatca gcagaacccc tgaagtgacc tgcgtggtgg tggatgtgtc ccacgaggat 120
cccgaagtga agttcaattg gtacgtggac ggcgtggaag tgcacaacgc caagaccaag 180
cctagagagg aacagtacaa cagcacctac agagtggtgt ccgtgctgac cgtgctgcac 240
caggattggc tgaacggcaa agagtacaag tgcaaggtgt ccaacaaggc cctgcctgct 300
cctatcgaga aaaccatcag caaggccaag ggccagccta gggaacccca ggtttacaca 360
ctgcctccaa gcagggacga gctgaccaag aatcaggtgt ccctgcggtg tcacgtgaag 420
ggcttctacc cttccgatat cgccgtggaa tgggagagca atggccagcc tgagaacaac 480
tacaagacca ccaagccagt gctggacagc gacggctcat tcttcctgta cagcaccctg 540
accgtggaca agtccagatg gcaacagggc aacgtgttca gctgcagcgt gatgcacgag 600
gccctgcaca accactacac ccagaagtcc ctgagcctgt ctcctggcaa a 651
<210> 13
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 13
<400> 13
gctcctgaac tgctgggagg acctagcgtg tttctgtttc ctcccaaacc taaagacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggtgg ttgatgtgtc tcatgaagac 120
cccgaagtga agttcaactg gtacgtggac ggcgtggagg tgcacaatgc taaaaccaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaatggcaa ggaatacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaggctaag ggccagccca gagagcccca agtgtataca 360
ctgcccccta gcagagatga actgaccaaa aaccaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat tgccgtggag tgggagagca atggccaacc tgaaaacaat 480
tacaagacca ccaagcccgt gctggacagc gacggatctt tctttctgta tagcaccctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gatgcacgag 600
gccctgcaca atcactatac ccagaagagc ctgagcctga gccccggcaa a 651
<210> 14
<211> 654
<212> DNA
<213> Artificial
<220>
<223> 14
<400> 14
cccgcccccg aactgctggg aggaccttct gtgtttctgt ttcctcccaa acccaaagac 60
accctgatga tcagcagaac ccccgaggtg acctgcgtgg ttgtggatgt gtctcatgag 120
gatcccgagg tgaagttcaa ctggtacgtg gacggcgtgg aggtgcacaa tgctaagaca 180
aagcccagag aggagcagta caacagcacc tacagagtgg tgagcgtgct gaccgtgctg 240
caccaagatt ggctgaatgg aaaggaatac aagtgcaagg tgagcaacaa ggccctgccc 300
gcccctattg aaaaaacaat tagcaaggcc aagggccagc ccagagaacc tcaagtgtat 360
accctgcctc ccagcagaga tgagctgaca aaaaatcagg tgagcctgag atgccacgtg 420
aagggcttct accccagcga cattgccgtg gagtgggaaa gcaatggcca acctgagaac 480
aactacaaga ccaccaagcc cgtgctggac agcgacggat ctttttttct gtacagcacc 540
ctgaccgtgg acaagagcag atggcagcag ggcaatgtgt tcagctgcag cgtgatgcac 600
gaggccctgc ataatcacta cacccagaaa agcctgagcc tgagccccgg caag 654
<210> 15
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 15
<400> 15
gctcctgaac tgctcggagg cccttccgtg ttcctgtttc ctccaaagcc taaggacacc 60
ctgatgatca gcagaacccc tgaagtgacc tgcgtggtgg tggatgtgtc ccacgaggat 120
cccgaagtga agttcaattg gtacgtggac ggcgtcgagg tgcacaacgc caagacaaag 180
cctagagagg aacagtacaa cagcacctac agagtggtgt ccgtgctgac cgtgctgcac 240
caggattggc tgaacggcaa agagtacaag tgcaaggtgt ccaacaaggc cctgcctgct 300
cctatcgaga aaaccatcag caaggccaag ggccagccta gggaacccca ggtttacaca 360
ctgcctccaa gcagggacga gctgaccaag aatcaggtgt ccctgcggtg tcacgtgaag 420
ggcttctacc cttccgatat cgccgtggaa tgggagagca atggccagcc tgagaacaac 480
tacaagacca ccaagccagt gctggacagc gacggctcat tcttcctgta cagcaccctg 540
accgtggaca agtccagatg gcaacagggc aacgtgttca gctgcagcgt gatgcacgag 600
gccctgcaca accactacac ccagaagtcc ctgagcctgt ctcctggcaa a 651
<210> 16
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 16
<400> 16
gctcctgaac tgctcggagg cccttccgtg ttcctgtttc ctccaaagcc taaggacacc 60
ctgatgatca gcagaacccc tgaagtgacc tgcgtggtgg tggatgtgtc ccacgaggat 120
cccgaagtga agttcaattg gtacgtggac ggcgtggaag tgcacaacgc caagaccaag 180
cctagagagg aacagtacaa cagcacctac agagtggtgt ccgtgctgac cgtgctgcac 240
caggattggc tgaacggcaa agagtacaag tgcaaggtgt ccaacaaggc cctgcctgct 300
cctatcgaga aaaccatcag caaggccaag ggccagccta gggaacccca ggtttacaca 360
ctgcctccaa gcagggacga gctgaccaag aatcaggtgt ccctgcggtg tcacgtgaag 420
ggcttctacc cttccgatat cgccgtggaa tgggagagca atggccagcc tgagaacaac 480
tacaagacca ccaagccagt gctggacagc gacggctcat tcttcctgta cagcaccctg 540
accgtggaca agtccagatg gcaacagggc aacgtgttca gctgcagcgt gatgcacgag 600
gccctgcaca accactacac ccagaagtcc ctgagcctgt ctcctggcaa a 651
<210> 17
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 17
<400> 17
gctcccgaac tgctgggagg acccagcgtg tttctgtttc cccctaaacc taaggacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggtgg tggatgtgag ccatgaggac 120
cccgaagtga aattcaactg gtacgtggac ggcgtggagg tgcacaatgc taagaccaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaatggcaa agagtacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaagccaag ggccagccca gagagcccca agtgtacaca 360
ttacccccta gcagagatga actgaccaaa aaccaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat cgccgtggag tgggagagca atggacaacc tgagaacaac 480
tacaagacca ccaagcccgt gctggacagc gacggatctt tctttctgta cagcaccctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gatgcacgag 600
gccctgcata atcactacac ccagaagagc ctgagcctga gccccggagc t 651
<210> 18
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 18
<400> 18
gctcctgaac tgctgggagg acctagcgtg tttctgtttc ctcctaaacc caaggacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggtgg tggatgtgag ccatgaagat 120
cccgaagtga agttcaactg gtacgtggac ggcgtggagg tgcacaatgc taaaaccaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaatggaaa ggagtacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaagctaag ggccagccca gagagcccca agtgtataca 360
agccctccca gcagggacga gctgaccaaa aatcaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat cgccgtggaa tgggaaagca atggccaacc cgagaacaac 480
tataagacca ccaagcccgt gctggacagc gacggatcct tttttctgta cagcgacctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gtatcacgag 600
gccctgcaca atcactacac ccagaaaagc ctgagcctga gccccggcaa a 651
<210> 19
<211> 651
<212> DNA
<213> Artificial
<220>
<223> 19
<400> 19
gcccctgagc tgcttggagg acctagcgtt tttttatttc cccccaaacc taaggacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggttg tggatgtgag ccatgaagat 120
cccgaggtga aattcaactg gtacgtggac ggcgtggagg tgcacaatgc caaaacaaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaacggcaa ggagtacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaggccaag ggccagccca gagaacccca agtgtatacc 360
agccccccca gcagagatga gctgacaaaa aatcaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat cgccgtggag tgggaatcta atggacaacc tgagaacaac 480
tacaagacca ccaagcccgt gctggacagc gacgggagct ttttcctgta tagcgacctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gtatcacgag 600
gccctgcata atcactacac ccagaaaagc ctgagcctga gccccggagc t 651
<210> 20
<211> 672
<212> DNA
<213> Artificial
<220>
<223> 20
<400> 20
gacaagacac acaccggacc cgcccccgaa ctgctgggag gaccttctgt gtttctgttt 60
cctcccaaac ccaaagacac cctgatgatc agcagaaccc ccgaggtgac ctgcgtggtt 120
gtggatgtgt ctcatgagga tcccgaggtg aagttcaact ggtacgtgga cggcgtggag 180
gtgcacaatg ctaagacaaa gcccagagag gagcagtaca acagcaccta cagagtggtg 240
agcgtgctga ccgtgctgca ccaagattgg ctgaatggaa aggaatacaa gtgcaaggtg 300
agcaacaagg ccctgcccgc ccctattgaa aaaacaatta gcaaggccaa gggccagccc 360
agagaacctc aagtgtatac cctgcctccc agcagagatg agctgacaaa aaatcaggtg 420
agcctgagat gccacgtgaa gggcttctac cccagcgaca ttgccgtgga gtgggaaagc 480
aatggccaac ctgagaacaa ctacaagacc accaagcccg tgctggacag cgacggatct 540
ttttttctgt acagcaccct gaccgtggac aagagcagat ggcagcaggg caatgtgttc 600
agctgcagcg tgatgcacga ggccctgcat aatcactaca cccagaaaag cctgagcctg 660
agccccggca ag 672
<210> 21
<211> 45
<212> DNA
<213> Artificial
<220>
<223> 21
<400> 21
ggaggaggag gaagcggagg cggaggatct ggaggaggag gaagc 45
<210> 22
<211> 30
<212> DNA
<213> Artificial
<220>
<223> 22
<400> 22
ggcggcggag gatctggcgg aggtggaagt 30
<210> 23
<211> 15
<212> DNA
<213> Artificial
<220>
<223> 23
<400> 23
ggcggcggag gatct 15
<210> 24
<211> 408
<212> PRT
<213> Artificial
<220>
<223> 24
<400> 24
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Ala
180 185 190
Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
195 200 205
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
210 215 220
Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val
225 230 235 240
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
245 250 255
Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
260 265 270
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala
275 280 285
Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
290 295 300
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr
305 310 315 320
Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro Ser
325 330 335
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
340 345 350
Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
355 360 365
Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe
370 375 380
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
385 390 395 400
Ser Leu Ser Leu Ser Pro Gly Lys
405
<210> 25
<211> 423
<212> PRT
<213> Artificial
<220>
<223> 25
<400> 25
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Pro
195 200 205
Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
210 215 220
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val
225 230 235 240
Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
245 250 255
Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
260 265 270
Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp
275 280 285
Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu
290 295 300
Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg
305 310 315 320
Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys
325 330 335
Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro Ser Asp
340 345 350
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys
355 360 365
Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser
370 375 380
Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser
385 390 395 400
Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser
405 410 415
Leu Ser Leu Ser Pro Gly Lys
420
<210> 26
<211> 415
<212> PRT
<213> Artificial
<220>
<223> 26
<400> 26
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Asp
180 185 190
Lys Thr His Thr Gly Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val
195 200 205
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
210 215 220
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
225 230 235 240
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
245 250 255
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
260 265 270
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
275 280 285
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
290 295 300
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
305 310 315 320
Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Arg Cys His
325 330 335
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
340 345 350
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser
355 360 365
Asp Gly Ser Phe Phe Leu Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg
370 375 380
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
385 390 395 400
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
405 410 415
<210> 27
<211> 418
<212> PRT
<213> Artificial
<220>
<223> 27
<400> 27
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Pro Glu Leu Leu Gly Gly
195 200 205
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
210 215 220
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
225 230 235 240
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
245 250 255
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
260 265 270
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
275 280 285
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
290 295 300
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
305 310 315 320
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
325 330 335
Arg Cys His Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
340 345 350
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Lys Pro Val
355 360 365
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Thr Leu Thr Val Asp
370 375 380
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
385 390 395 400
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
405 410 415
Gly Lys
<210> 28
<211> 413
<212> PRT
<213> Artificial
<220>
<223> 28
<400> 28
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu
195 200 205
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
210 215 220
Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys
225 230 235 240
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
245 250 255
Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu
260 265 270
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
275 280 285
Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys
290 295 300
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
305 310 315 320
Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys
325 330 335
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
340 345 350
Pro Glu Asn Asn Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly
355 360 365
Ser Phe Phe Leu Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln
370 375 380
Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
385 390 395 400
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
405 410
<210> 29
<211> 423
<212> PRT
<213> Artificial
<220>
<223> 29
<400> 29
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Thr Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Ala Gly Gly Gly Gly Ser Gly Gly
210 215 220
Gly Gly Ser Gly Gly Gly Gly Ser Phe Pro Thr Ile Pro Leu Ser Arg
225 230 235 240
Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg Leu His Gln Leu Ala
245 250 255
Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr Ile Pro Lys Glu Gln
260 265 270
Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser Leu Cys Phe Ser Glu
275 280 285
Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr Gln Gln Lys Ser Asn
290 295 300
Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile Gln Ser Trp Leu Glu
305 310 315 320
Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn Ser Leu Val Tyr Gly
325 330 335
Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu Glu Glu Gly
340 345 350
Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly Ser Pro Arg Thr Gly
355 360 365
Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp Thr Asn Ser His Asn
370 375 380
Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu Tyr Cys Phe Arg Lys
385 390 395 400
Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile Val Gln Cys Arg Ser
405 410 415
Val Glu Gly Ser Cys Gly Phe
420
<210> 30
<211> 423
<212> PRT
<213> Artificial
<220>
<223> 30
<400> 30
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Pro
195 200 205
Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
210 215 220
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val
225 230 235 240
Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
245 250 255
Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
260 265 270
Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp
275 280 285
Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu
290 295 300
Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg
305 310 315 320
Glu Pro Gln Val Tyr Thr Ser Pro Pro Ser Arg Asp Glu Leu Thr Lys
325 330 335
Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro Ser Asp
340 345 350
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys
355 360 365
Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser
370 375 380
Asp Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser
385 390 395 400
Cys Ser Val Tyr His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser
405 410 415
Leu Ser Leu Ser Pro Gly Lys
420
<210> 31
<211> 423
<212> PRT
<213> Artificial
<220>
<223> 31
<400> 31
Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Ser Pro Pro Ser Arg Asp Glu Leu
115 120 125
Thr Lys Asn Gln Val Ser Leu Arg Cys His Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Lys Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Asp Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Tyr His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Pro Gly Ala Gly Gly Gly Gly Ser Gly Gly
210 215 220
Gly Gly Ser Gly Gly Gly Gly Ser Phe Pro Thr Ile Pro Leu Ser Arg
225 230 235 240
Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg Leu His Gln Leu Ala
245 250 255
Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr Ile Pro Lys Glu Gln
260 265 270
Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser Leu Cys Phe Ser Glu
275 280 285
Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr Gln Gln Lys Ser Asn
290 295 300
Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile Gln Ser Trp Leu Glu
305 310 315 320
Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn Ser Leu Val Tyr Gly
325 330 335
Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu Glu Glu Gly
340 345 350
Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly Ser Pro Arg Thr Gly
355 360 365
Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp Thr Asn Ser His Asn
370 375 380
Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu Tyr Cys Phe Arg Lys
385 390 395 400
Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile Val Gln Cys Arg Ser
405 410 415
Val Glu Gly Ser Cys Gly Phe
420
<210> 32
<211> 1224
<212> DNA
<213> Artificial
<220>
<223> 32
<400> 32
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg cagctgtgga tttgcccctg aactgctcgg aggcccctcc 600
gtgtttctgt tccctccaaa gcctaaggac accctgatga tcagcagaac ccctgaagtg 660
acctgcgtgg tggtggatgt gtcccacgag gatcccgaag tgaagttcaa ttggtacgtg 720
gacggcgtgg aagtgcacaa cgccaagacc aagcctagag aggaacagta caacagcacc 780
tacagagtgg tgtccgtgct gaccgtgctg caccaggatt ggctgaacgg caaagagtac 840
aagtgcaagg tgtccaacaa ggccctgcct gctcctatcg agaaaaccat cagcaaggcc 900
aagggccagc ctagggaacc ccaggtttac acactgcctc caagcaggga cgagctgacc 960
aagaatcagg tgtccctgcg gtgtcacgtg aagggcttct acccttccga tatcgccgtg 1020
gaatgggaga gcaatggcca gcctgagaac aactacaaga ccaccaagcc agtgctggac 1080
agcgacggct cattcttcct gtacagcacc ctgaccgtgg acaagtccag atggcaacag 1140
ggcaacgtgt tcagctgcag cgtgatgcac gaggccctgc acaaccacta cacccagaag 1200
tccctgagcc tgtctcctgg caaa 1224
<210> 33
<211> 1269
<212> DNA
<213> Artificial
<220>
<223> 33
<400> 33
tttcccacca tccccctgtc tagactgttc gacaatgcca tgctgagagc ccacagactg 60
caccagctgg cctttgacac ataccaggag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa cccccagacc agcctgtgtt tcagcgaaag catccccacc 180
cccagcaata gagaggaaac ccagcaaaag agcaacctgg agctgctgag aatcagcctg 240
ctgctgatcc agagctggct ggagcctgtg caatttctga gaagcgtgtt cgccaatagc 300
ctggtgtacg gcgccagcga tagcaacgtg tatgatctgc tgaaagacct ggaggagggc 360
attcagacac tgatgggcag actggaggac ggcagcccta gaacaggaca gattttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agacattcct gagaatcgtg 540
cagtgcagaa gcgtggaggg cagctgtgga tttggaggag gaggaagcgg aggcggagga 600
tctggaggag gaggaagcgc tcctgaactg ctgggaggac ctagcgtgtt tctgtttcct 660
cccaaaccta aagacaccct gatgatcagc agaacccccg aggtgacctg cgtggtggtt 720
gatgtgtctc atgaagaccc cgaagtgaag ttcaactggt acgtggacgg cgtggaggtg 780
cacaatgcta aaaccaagcc cagagaggag cagtacaaca gcacctacag agtggtgagc 840
gtgctgaccg tgctgcacca agattggctg aatggcaagg aatacaagtg caaggtgagc 900
aacaaggccc tgcccgcccc tattgaaaaa accattagca aggctaaggg ccagcccaga 960
gagccccaag tgtatacact gccccctagc agagatgaac tgaccaaaaa ccaggtgagc 1020
ctgagatgcc acgtgaaggg cttttacccc agcgacattg ccgtggagtg ggagagcaat 1080
ggccaacctg aaaacaatta caagaccacc aagcccgtgc tggacagcga cggatctttc 1140
tttctgtata gcaccctgac cgtggacaag agcagatggc agcagggcaa tgtgttcagc 1200
tgcagcgtga tgcacgaggc cctgcacaat cactataccc agaagagcct gagcctgagc 1260
cccggcaaa 1269
<210> 34
<211> 1245
<212> DNA
<213> Artificial
<220>
<223> 34
<400> 34
ttccctacaa tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
caccagttag cctttgatac ctatcaggag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa cccccagacc agcctgtgtt tcagcgaaag catccccacc 180
cccagcaaca gagaggagac acagcaaaag agcaacctgg agctgctgag aatcagcctg 240
ctgctgatcc agagctggct ggagcctgtg caattcctga gaagcgtgtt cgccaatagc 300
ctggtgtacg gcgctagcga cagcaacgtg tatgacctgc tgaaggacct ggaggagggc 360
atccaaacac tgatgggcag actggaagat ggcagcccca gaaccggaca aattttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt attgcttcag aaaggacatg gacaaggtgg aaaccttcct gagaatcgtg 540
cagtgcagaa gcgtggaggg cagctgtgga tttgacaaga cacacaccgg acccgccccc 600
gaactgctgg gaggaccttc tgtgtttctg tttcctccca aacccaaaga caccctgatg 660
atcagcagaa cccccgaggt gacctgcgtg gttgtggatg tgtctcatga ggatcccgag 720
gtgaagttca actggtacgt ggacggcgtg gaggtgcaca atgctaagac aaagcccaga 780
gaggagcagt acaacagcac ctacagagtg gtgagcgtgc tgaccgtgct gcaccaagat 840
tggctgaatg gaaaggaata caagtgcaag gtgagcaaca aggccctgcc cgcccctatt 900
gaaaaaacaa ttagcaaggc caagggccag cccagagaac ctcaagtgta taccctgcct 960
cccagcagag atgagctgac aaaaaatcag gtgagcctga gatgccacgt gaagggcttc 1020
taccccagcg acattgccgt ggagtgggaa agcaatggcc aacctgagaa caactacaag 1080
accaccaagc ccgtgctgga cagcgacgga tctttttttc tgtacagcac cctgaccgtg 1140
gacaagagca gatggcagca gggcaatgtg ttcagctgca gcgtgatgca cgaggccctg 1200
cataatcact acacccagaa aagcctgagc ctgagccccg gcaag 1245
<210> 35
<211> 1254
<212> DNA
<213> Artificial
<220>
<223> 35
<400> 35
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agtgctcctg aactgctcgg aggcccttcc gtgttcctgt ttcctccaaa gcctaaggac 660
accctgatga tcagcagaac ccctgaagtg acctgcgtgg tggtggatgt gtcccacgag 720
gatcccgaag tgaagttcaa ttggtacgtg gacggcgtcg aggtgcacaa cgccaagaca 780
aagcctagag aggaacagta caacagcacc tacagagtgg tgtccgtgct gaccgtgctg 840
caccaggatt ggctgaacgg caaagagtac aagtgcaagg tgtccaacaa ggccctgcct 900
gctcctatcg agaaaaccat cagcaaggcc aagggccagc ctagggaacc ccaggtttac 960
acactgcctc caagcaggga cgagctgacc aagaatcagg tgtccctgcg gtgtcacgtg 1020
aagggcttct acccttccga tatcgccgtg gaatgggaga gcaatggcca gcctgagaac 1080
aactacaaga ccaccaagcc agtgctggac agcgacggct cattcttcct gtacagcacc 1140
ctgaccgtgg acaagtccag atggcaacag ggcaacgtgt tcagctgcag cgtgatgcac 1200
gaggccctgc acaaccacta cacccagaag tccctgagcc tgtctcctgg caaa 1254
<210> 36
<211> 1239
<212> DNA
<213> Artificial
<220>
<223> 36
<400> 36
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgc tcctgaactg 600
ctcggaggcc cttccgtgtt cctgtttcct ccaaagccta aggacaccct gatgatcagc 660
agaacccctg aagtgacctg cgtggtggtg gatgtgtccc acgaggatcc cgaagtgaag 720
ttcaattggt acgtggacgg cgtggaagtg cacaacgcca agaccaagcc tagagaggaa 780
cagtacaaca gcacctacag agtggtgtcc gtgctgaccg tgctgcacca ggattggctg 840
aacggcaaag agtacaagtg caaggtgtcc aacaaggccc tgcctgctcc tatcgagaaa 900
accatcagca aggccaaggg ccagcctagg gaaccccagg tttacacact gcctccaagc 960
agggacgagc tgaccaagaa tcaggtgtcc ctgcggtgtc acgtgaaggg cttctaccct 1020
tccgatatcg ccgtggaatg ggagagcaat ggccagcctg agaacaacta caagaccacc 1080
aagccagtgc tggacagcga cggctcattc ttcctgtaca gcaccctgac cgtggacaag 1140
tccagatggc aacagggcaa cgtgttcagc tgcagcgtga tgcacgaggc cctgcacaac 1200
cactacaccc agaagtccct gagcctgtct cctggcaaa 1239
<210> 37
<211> 1269
<212> DNA
<213> Artificial
<220>
<223> 37
<400> 37
gctcccgaac tgctgggagg acccagcgtg tttctgtttc cccctaaacc taaggacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggtgg tggatgtgag ccatgaggac 120
cccgaagtga aattcaactg gtacgtggac ggcgtggagg tgcacaatgc taagaccaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaatggcaa agagtacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaagccaag ggccagccca gagagcccca agtgtacaca 360
ttacccccta gcagagatga actgaccaaa aaccaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat cgccgtggag tgggagagca atggacaacc tgagaacaac 480
tacaagacca ccaagcccgt gctggacagc gacggatctt tctttctgta cagcaccctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gatgcacgag 600
gccctgcata atcactacac ccagaagagc ctgagcctga gccccggagc tggaggagga 660
ggaagcggag gaggaggaag cggcggagga ggatcttttc ccacaattcc tctgagcaga 720
ctgttcgaca acgccatgct gagagcccac agactgcacc agctggcctt tgacacatac 780
caggagttcg aggaggccta catccccaag gagcagaagt acagcttcct gcagaacccc 840
cagaccagcc tgtgctttag cgaaagcatt cccaccccca gcaacagaga ggagacacaa 900
caaaagagca acctggagct gctgagaatc agcctgctgc tgatccagag ctggctggag 960
cccgtgcaat ttctgagaag cgtgttcgcc aatagcctgg tgtacggcgc tagcgatagc 1020
aacgtgtatg acctgctgaa ggacctggag gagggcatcc aaaccctgat gggcagactg 1080
gaggacggca gccctagaac aggacagatt tttaagcaga cctacagcaa gttcgacacc 1140
aacagccaca acgacgacgc cctgctgaaa aactacggcc tgctgtactg cttcagaaag 1200
gacatggaca aggtggagac attcctgaga atcgtgcagt gcagaagcgt ggagggcagc 1260
tgtggattt 1269
<210> 38
<211> 1269
<212> DNA
<213> Artificial
<220>
<223> 38
<400> 38
ttccccacaa tccccctgag cagactgttt gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgatac ataccaggag tttgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa cccccagacc agcctgtgtt tcagcgagag cattcccaca 180
cccagcaaca gagaggaaac ccagcaaaag agcaacctgg agctgctgag aatcagcctg 240
ctgctgatcc agagctggct ggagcctgtg caattcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcttctga tagcaatgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagacac tgatgggcag actggaggac ggcagcccta gaacaggaca aattttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agacattcct gagaatcgtg 540
cagtgcagaa gcgtggaggg cagctgtgga tttggaggcg gaggaagcgg aggaggagga 600
agcggaggag gaggaagcgc tcctgaactg ctgggaggac ctagcgtgtt tctgtttcct 660
cctaaaccca aggacaccct gatgatcagc agaacccccg aggtgacctg cgtggtggtg 720
gatgtgagcc atgaagatcc cgaagtgaag ttcaactggt acgtggacgg cgtggaggtg 780
cacaatgcta aaaccaagcc cagagaggag cagtacaaca gcacctacag agtggtgagc 840
gtgctgaccg tgctgcacca agattggctg aatggaaagg agtacaagtg caaggtgagc 900
aacaaggccc tgcccgcccc tattgaaaaa accattagca aagctaaggg ccagcccaga 960
gagccccaag tgtatacaag ccctcccagc agggacgagc tgaccaaaaa tcaggtgagc 1020
ctgagatgcc acgtgaaggg cttttacccc agcgacatcg ccgtggaatg ggaaagcaat 1080
ggccaacccg agaacaacta taagaccacc aagcccgtgc tggacagcga cggatccttt 1140
tttctgtaca gcgacctgac cgtggacaag agcagatggc agcagggcaa tgtgttcagc 1200
tgcagcgtgt atcacgaggc cctgcacaat cactacaccc agaaaagcct gagcctgagc 1260
cccggcaaa 1269
<210> 39
<211> 1269
<212> DNA
<213> Artificial
<220>
<223> 39
<400> 39
gcccctgagc tgcttggagg acctagcgtt tttttatttc cccccaaacc taaggacacc 60
ctgatgatca gcagaacccc cgaggtgacc tgcgtggttg tggatgtgag ccatgaagat 120
cccgaggtga aattcaactg gtacgtggac ggcgtggagg tgcacaatgc caaaacaaag 180
cccagagagg agcagtacaa cagcacctac agagtggtga gcgtgctgac cgtgctgcac 240
caagattggc tgaacggcaa ggagtacaag tgcaaggtga gcaacaaggc cctgcccgcc 300
cctattgaaa aaaccattag caaggccaag ggccagccca gagaacccca agtgtatacc 360
agccccccca gcagagatga gctgacaaaa aatcaggtga gcctgagatg ccacgtgaag 420
ggcttttacc ccagcgacat cgccgtggag tgggaatcta atggacaacc tgagaacaac 480
tacaagacca ccaagcccgt gctggacagc gacgggagct ttttcctgta tagcgacctg 540
accgtggaca agagcagatg gcagcagggc aatgtgttca gctgcagcgt gtatcacgag 600
gccctgcata atcactacac ccagaaaagc ctgagcctga gccccggagc tggaggagga 660
ggaagcggag gaggaggaag cggaggagga ggatcttttc ctacaattcc tctgagcaga 720
ctgttcgaca acgccatgct gagagcccac agactgcacc agctggcctt tgatacatac 780
caggagttcg aggaggccta catccccaag gagcagaagt acagcttcct gcagaacccc 840
cagaccagcc tgtgttttag cgaaagcatt cccacaccca gcaatagaga agagacacag 900
cagaagagca acctggagct gctgagaatc agcctgctgc tgatccagag ctggctggag 960
cctgtgcaat ttctgagaag cgtgttcgcc aacagcctgg tgtacggcgc tagcgactct 1020
aacgtgtatg atctgctgaa agacctggag gagggcatcc agacactgat gggcagactg 1080
gaggacggca gccccagaac aggacagata tttaagcaaa cctacagcaa gttcgacacc 1140
aacagccaca acgacgacgc cctgctgaaa aactacggcc tgctgtactg cttcagaaag 1200
gacatggaca aggtggagac attcctgaga atcgtgcagt gcagaagcgt ggagggcagc 1260
tgtggattt 1269
<210> 40
<211> 220
<212> PRT
<213> Artificial
<220>
<223> 40
<400> 40
Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe
1 5 10 15
Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val
20 25 30
Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe
35 40 45
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro
50 55 60
Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr
65 70 75 80
Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val
85 90 95
Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala
100 105 110
Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
115 120 125
Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly
130 135 140
Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
145 150 155 160
Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser
165 170 175
Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln
180 185 190
Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His
195 200 205
Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
210 215 220
<210> 41
<211> 6
<212> PRT
<213> Artificial
<220>
<223> 41
<400> 41
Asp Lys Thr His Thr Gly
1 5
<210> 42
<211> 18
<212> DNA
<213> Artificial
<220>
<223> 42
<400> 42
gacaagacac acaccgga 18
<210> 43
<211> 436
<212> PRT
<213> Artificial
<220>
<223> 43
<400> 43
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Arg
180 185 190
Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu
195 200 205
Glu Gln Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr
210 215 220
Gln Pro Leu Gly Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
225 230 235 240
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
245 250 255
Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu
260 265 270
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr
275 280 285
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
290 295 300
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser
305 310 315 320
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
325 330 335
Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val
340 345 350
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
355 360 365
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
370 375 380
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu Thr
385 390 395 400
Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val
405 410 415
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
420 425 430
Ser Leu Gly Lys
435
Claims (10)
1. An Fc mutant, comprising:
a first peptide stretch having mutations compared to the Fc fragment of wild-type IgG1 at the following positions:
at least one of 228 th bit, 229 th bit, 230 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit, and 351 th bit, 428 th bit and 447 th bit; or
228 th bit, 229 th bit, 366 th bit, 368 th bit, 395 th bit and 409 th bit;
optionally, the first peptidyl fragment has mutations compared to the Fc fragment of wild-type IgG1 at the following positions:
1) 228 th, 229 th, 230 th, 366 th, 368 th, 395 th, 409 th and 447 th bits; or,
2) 228 th, 229 th, 230 th, 351 th, 366 th, 368 th, 395 th, 409 th and 428 th bits; or,
3) 228, 229, 230, 351, 366, 368, 395, 409, 428, and 447 bits;
optionally, the first peptide fragment has the following mutations compared to the Fc fragment of wild-type IgG 1:
1) P228Delete, C229Delete, P230 Delete, T366R, L368H, P395K, K447A and K409D or K409T; or,
2) P228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, M428Y and K409D or K409T; or,
3) P228Delete, C229Delete, P230 Delete, L351S, T366R, L368H, P395K, M428Y, K447A and K409D or K409T; or
4) P228Delete, C229Delete, T366R, L368H, P395K and K409D or K409T;
optionally, further comprising:
a hinge region fragment of wild type IgG1, wherein the C end of the hinge region fragment of wild type IgG1 is connected with the N end of the first peptide segment;
optionally, the hinge region fragment of the wild type IgG1 has an amino acid sequence shown as SEQ ID NO. 41;
optionally, the Fc mutant has an amino acid sequence shown as any one of SEQ ID NO 3-7;
optionally, the Fc mutant is in monomeric form.
2. A fusion protein, comprising:
a second peptide segment comprising a functional region of a biologically active molecule;
a third peptide stretch comprising an Fc mutant having an amino acid sequence as set forth in SEQ ID NO 2 or as defined in claim 1, the second peptide stretch being linked to the third peptide stretch.
3. The fusion protein of claim 2, wherein the second peptide segment comprises a growth hormone, a growth hormone analog, a growth hormone functional region, or a growth hormone analog functional region, preferably a human growth hormone or a human growth hormone functional region;
optionally, the human growth hormone has an amino acid sequence as set forth in SEQ ID NO 1 or at least 90% identity thereto;
optionally, the N-terminus of the second peptide fragment is linked to the C-terminus of the third peptide fragment; or,
and the C end of the second peptide segment is connected with the N end of the third peptide segment.
4. The fusion protein of claim 2 or 3, further comprising a linker peptide disposed between the second and third peptide segments;
optionally, the N-terminus of the linker peptide is linked to the C-terminus of the third peptide fragment, and the C-terminus of the linker peptide is linked to the N-terminus of the second peptide fragment; or,
the N end of the connecting peptide is connected with the C end of the second peptide segment, and the C end of the connecting peptide is connected with the N end of the third peptide segment;
optionally, the amino acid sequence of the linker peptide is (GGGGS) n Wherein n is an integer greater than or equal to 1, preferably 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10;
optionally, the connecting peptide has an amino acid sequence shown in any one of SEQ ID NO 8-10.
5. The fusion protein of claim 2, wherein the fusion protein has an amino acid sequence represented by any one of SEQ ID NOs 24 to 31.
6. A nucleic acid molecule encoding the Fc mutant of claim 1 or the fusion protein of any one of claims 2 to 5;
optionally, the nucleic acid molecule is DNA.
7. An expression vector carrying the nucleic acid molecule of claim 6;
optionally, the expression vector is a eukaryotic expression vector;
preferably, the expression vector is a lentiviral vector.
8. A recombinant cell, wherein the recombinant cell comprises:
carrying the nucleic acid molecule of claim 6; or,
expressing the Fc mutant according to claim 1 or the fusion protein according to any one of claims 2 to 5;
optionally, the recombinant cell is obtained by introducing the expression vector of claim 7 into a host cell;
optionally, the recombinant cell is a eukaryotic cell;
preferably, the recombinant cell is a mammalian cell.
9. A pharmaceutical composition, comprising:
the fusion protein of any one of claims 2 to 5;
optionally, the pharmaceutical composition further comprises pharmaceutically acceptable excipients;
optionally, the pharmaceutical composition is in the form of an injection;
optionally, the route of administration of the pharmaceutical composition comprises subcutaneous injection or intravenous injection.
10. Use of the fusion protein according to any one of claims 2 to 5, the nucleic acid molecule according to claim 6, the expression vector according to claim 7, the recombinant cell according to claim 8, the pharmaceutical composition according to claim 9 for the preparation of a medicament for the treatment and/or prevention of a disease associated with growth hormone abnormalities;
optionally, the disease associated with abnormal growth hormone comprises at least one selected from the group consisting of:
childhood growth hormone deficiency, idiopathic short stature, adult growth hormone deficiency, turner's syndrome, prader-willi syndrome, renal failure, diseases caused by dissimilatory states during chemotherapy treatment and AIDS treatment, intrauterine growth retardation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2022/100114 WO2023093021A1 (en) | 2021-11-26 | 2022-06-21 | Recombinant long-acting human growth hormone fusion protein and its use thereof |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2021114208429 | 2021-11-26 | ||
CN202111420842 | 2021-11-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115873127A true CN115873127A (en) | 2023-03-31 |
Family
ID=85769358
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210395423.2A Pending CN115873127A (en) | 2021-11-26 | 2022-04-14 | Recombinant long-acting human growth hormone fusion protein and preparation method and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115873127A (en) |
WO (1) | WO2023093021A1 (en) |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI687441B (en) * | 2011-06-30 | 2020-03-11 | 中外製藥股份有限公司 | Heterodimerized polypeptide |
JP6433297B2 (en) * | 2012-12-27 | 2018-12-05 | 中外製薬株式会社 | Heterodimerized polypeptide |
MX2016008782A (en) * | 2014-01-15 | 2016-09-08 | Hoffmann La Roche | Fc-region variants with improved protein a-binding. |
EP3583126A1 (en) * | 2017-02-17 | 2019-12-25 | Sanofi | Multispecific binding molecules having specificity to dystroglycan and laminin-2 |
MX2020007389A (en) * | 2018-01-10 | 2020-10-14 | Denali Therapeutics Inc | Transferrin receptor-binding polypeptides and uses thereof. |
CA3106142A1 (en) * | 2018-07-11 | 2020-01-16 | Momenta Pharmaceuticals, Inc. | Compositions and methods related to engineered fc-antigen binding domain constructs targeted to cd38 |
-
2022
- 2022-04-14 CN CN202210395423.2A patent/CN115873127A/en active Pending
- 2022-06-21 WO PCT/CN2022/100114 patent/WO2023093021A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2023093021A1 (en) | 2023-06-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190248872A1 (en) | Fc FUSION PROTEINS COMPRISING NOVEL LINKERS OR ARRANGEMENTS | |
US9714281B2 (en) | Fibronectin based scaffold domain proteins that bind IL-23 | |
EP4122951A1 (en) | Interleukin-2 mutant and use thereof | |
JP4035400B2 (en) | Fusion proteins with enhanced erythropoietin activity in vivo | |
KR20170087486A (en) | Common light chains and methods of use | |
CN112566698A (en) | T cell receptor and engineered cells expressing the same | |
WO2024109611A1 (en) | Mutant respiratory syncytial virus pre-fusion f protein and use thereof | |
EP4122952A1 (en) | Interleukin-2 mutant and use thereof | |
JPS61251700A (en) | Insulin receptor | |
CN101072793A (en) | Il-7 variants with reduced immunogenicity | |
CN109336980B (en) | Muc 1-targeted chimeric antigen receptor modified T cell and application thereof | |
EP4288529A2 (en) | Synthetic degrader system for targeted protein degradation | |
CN114107263A (en) | High-affinity human angiotensin converting enzyme 2(ACE2) mutant and application thereof | |
CN114874333A (en) | Growth hormone fusion protein and application thereof | |
CA3103684A1 (en) | Solubilized apyrases, methods and use | |
CN114591415A (en) | GLP-1/GCG dual-receptor agonist polypeptide and fusion protein thereof | |
WO2024056097A1 (en) | Use of nkg2d-based cell adapter molecule in removal of aging cells | |
CN106008717B (en) | Long-acting recombinant GLP-1 fusion protein and preparation method and application thereof | |
KR20160113268A (en) | Bifunctional fusion protein, preparation method therefor, and use thereof | |
CN113061168A (en) | Truncated fever with thrombocytopenia syndrome virus Gn protein and application thereof | |
CN106957365B (en) | Monoclonal antibody FnAb8 and application thereof | |
CN115873127A (en) | Recombinant long-acting human growth hormone fusion protein and preparation method and application thereof | |
CN113150172B (en) | GLP-1R/GIPR double-target agonist fusion protein and preparation method and application thereof | |
AU2021284697A1 (en) | Modified Semaphorin 3A, compositions comprising the same and uses thereof | |
KR20180003677A (en) | Pharmaceutical compositions comprising mutant proteins of human growth hormone or transferrin fusion proteins thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |