KR20190124665A - 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 - Google Patents
신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 Download PDFInfo
- Publication number
- KR20190124665A KR20190124665A KR1020190049295A KR20190049295A KR20190124665A KR 20190124665 A KR20190124665 A KR 20190124665A KR 1020190049295 A KR1020190049295 A KR 1020190049295A KR 20190049295 A KR20190049295 A KR 20190049295A KR 20190124665 A KR20190124665 A KR 20190124665A
- Authority
- KR
- South Korea
- Prior art keywords
- seq
- leu
- region
- cancer
- immunoglobulin
- Prior art date
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 126
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 121
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 38
- 201000011510 cancer Diseases 0.000 title claims abstract description 33
- 239000008194 pharmaceutical composition Substances 0.000 title claims description 19
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 80
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 78
- 210000004027 cell Anatomy 0.000 claims abstract description 72
- 108060003951 Immunoglobulin Proteins 0.000 claims abstract description 47
- 102000018358 immunoglobulin Human genes 0.000 claims abstract description 47
- 239000007787 solid Substances 0.000 claims abstract description 11
- 102000016844 Immunoglobulin-like domains Human genes 0.000 claims abstract description 7
- 108050006430 Immunoglobulin-like domains Proteins 0.000 claims abstract description 7
- 150000007523 nucleic acids Chemical class 0.000 claims description 35
- 238000000034 method Methods 0.000 claims description 29
- 239000013604 expression vector Substances 0.000 claims description 18
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 17
- 108020004707 nucleic acids Proteins 0.000 claims description 16
- 102000039446 nucleic acids Human genes 0.000 claims description 16
- 229960003697 abatacept Drugs 0.000 claims description 7
- 201000001441 melanoma Diseases 0.000 claims description 7
- 206010009944 Colon cancer Diseases 0.000 claims description 6
- 239000012634 fragment Substances 0.000 claims description 5
- 210000003292 kidney cell Anatomy 0.000 claims description 4
- 206010005003 Bladder cancer Diseases 0.000 claims description 2
- 206010006187 Breast cancer Diseases 0.000 claims description 2
- 208000026310 Breast neoplasm Diseases 0.000 claims description 2
- 206010058467 Lung neoplasm malignant Diseases 0.000 claims description 2
- 206010033128 Ovarian cancer Diseases 0.000 claims description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 claims description 2
- 206010061902 Pancreatic neoplasm Diseases 0.000 claims description 2
- 206010060862 Prostate cancer Diseases 0.000 claims description 2
- 208000000236 Prostatic Neoplasms Diseases 0.000 claims description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 claims description 2
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 claims description 2
- 239000004480 active ingredient Substances 0.000 claims description 2
- 208000029742 colonic neoplasm Diseases 0.000 claims description 2
- 206010017758 gastric cancer Diseases 0.000 claims description 2
- 208000005017 glioblastoma Diseases 0.000 claims description 2
- 201000010536 head and neck cancer Diseases 0.000 claims description 2
- 208000014829 head and neck neoplasm Diseases 0.000 claims description 2
- 201000007270 liver cancer Diseases 0.000 claims description 2
- 208000014018 liver neoplasm Diseases 0.000 claims description 2
- 201000005202 lung cancer Diseases 0.000 claims description 2
- 208000020816 lung neoplasm Diseases 0.000 claims description 2
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 claims description 2
- 208000037819 metastatic cancer Diseases 0.000 claims description 2
- 208000011575 metastatic malignant neoplasm Diseases 0.000 claims description 2
- 201000002528 pancreatic cancer Diseases 0.000 claims description 2
- 208000008443 pancreatic carcinoma Diseases 0.000 claims description 2
- 201000011549 stomach cancer Diseases 0.000 claims description 2
- 201000005112 urinary bladder cancer Diseases 0.000 claims description 2
- 210000003289 regulatory T cell Anatomy 0.000 abstract description 42
- 210000003162 effector t lymphocyte Anatomy 0.000 abstract description 19
- 230000000694 effects Effects 0.000 abstract description 18
- 239000003446 ligand Substances 0.000 abstract description 13
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 abstract description 7
- 230000003993 interaction Effects 0.000 abstract description 7
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 abstract description 6
- 230000012010 growth Effects 0.000 abstract description 5
- 230000004913 activation Effects 0.000 abstract 1
- 241000880493 Leptailurus serval Species 0.000 description 73
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 54
- 108010047857 aspartylglycine Proteins 0.000 description 52
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 51
- 108010057821 leucylproline Proteins 0.000 description 51
- 238000002360 preparation method Methods 0.000 description 50
- 108010050848 glycylleucine Proteins 0.000 description 41
- 108010047495 alanylglycine Proteins 0.000 description 39
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 39
- 108010048818 seryl-histidine Proteins 0.000 description 35
- 108010073969 valyllysine Proteins 0.000 description 35
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 34
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 32
- 108010077245 asparaginyl-proline Proteins 0.000 description 31
- 108010025306 histidylleucine Proteins 0.000 description 31
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 30
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 29
- 241000282414 Homo sapiens Species 0.000 description 29
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 29
- 210000001744 T-lymphocyte Anatomy 0.000 description 29
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 29
- 108010031719 prolyl-serine Proteins 0.000 description 28
- 108010080629 tryptophan-leucine Proteins 0.000 description 28
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 27
- 108010015792 glycyllysine Proteins 0.000 description 27
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 25
- 150000001413 amino acids Chemical class 0.000 description 24
- 108010026333 seryl-proline Proteins 0.000 description 24
- 230000014509 gene expression Effects 0.000 description 23
- 108010070643 prolylglutamic acid Proteins 0.000 description 23
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 22
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 22
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 22
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 22
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 22
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 22
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 22
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 22
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 21
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 20
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 20
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 20
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 20
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 20
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 20
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 20
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 20
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 20
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 20
- 108010003201 RGH 0205 Proteins 0.000 description 20
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 20
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 20
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 20
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 20
- 108010070944 alanylhistidine Proteins 0.000 description 20
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 20
- 108010054813 diprotin B Proteins 0.000 description 20
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 20
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 20
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 20
- 108010025488 pinealon Proteins 0.000 description 20
- 239000013598 vector Substances 0.000 description 20
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 19
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 19
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 19
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 19
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 19
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 19
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 19
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 19
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 19
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 18
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 18
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 18
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 18
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 18
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 18
- 108010005233 alanylglutamic acid Proteins 0.000 description 18
- 108010054155 lysyllysine Proteins 0.000 description 18
- 108010003700 lysyl aspartic acid Proteins 0.000 description 17
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 16
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 16
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 16
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 16
- 108010010147 glycylglutamine Proteins 0.000 description 16
- 108010064235 lysylglycine Proteins 0.000 description 16
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 15
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 15
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 15
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 15
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 15
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 15
- 108010013835 arginine glutamate Proteins 0.000 description 15
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 14
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 14
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 14
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 14
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 14
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 14
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 14
- 108010078144 glutaminyl-glycine Proteins 0.000 description 14
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 13
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 13
- 241000699666 Mus <mouse, genus> Species 0.000 description 13
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 13
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 13
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 13
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 13
- 108010027345 wheylin-1 peptide Proteins 0.000 description 13
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 12
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 12
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 12
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 12
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 12
- 230000002401 inhibitory effect Effects 0.000 description 12
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 12
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 11
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 11
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 11
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 11
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 11
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 11
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 11
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 11
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 11
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 11
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 11
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 11
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 11
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 11
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 11
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 11
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 11
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 11
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 11
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 11
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 11
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 11
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 11
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 11
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 11
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 11
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 11
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 11
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 11
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 11
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 11
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 11
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 11
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 11
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 11
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 11
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 11
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 11
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 11
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 11
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 11
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 11
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 11
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 11
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 11
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 11
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 11
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 11
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 11
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 11
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 11
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 11
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 11
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 11
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 11
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 11
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 11
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 11
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 11
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 11
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 11
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 11
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 11
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 11
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 11
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 11
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 11
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 11
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 11
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 11
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 11
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 11
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 11
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 11
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 11
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 11
- 102100026120 IgG receptor FcRn large subunit p51 Human genes 0.000 description 11
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 11
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 11
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 11
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 11
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 11
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 11
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 11
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 11
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 11
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 11
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 11
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 11
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 11
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 11
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 11
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 11
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 11
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 11
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 11
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 11
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 11
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 11
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 11
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 11
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 11
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 11
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 11
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 11
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 11
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 11
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 11
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 11
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 11
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 11
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 11
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 11
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 11
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 11
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 11
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 11
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 11
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 11
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 11
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 11
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 11
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 11
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 11
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 11
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 11
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 11
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 11
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 11
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 11
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 11
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 11
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 11
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 11
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 11
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 11
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 11
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 11
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 11
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 11
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 11
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 11
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 11
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 11
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 11
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 11
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 11
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 11
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 11
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 11
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 11
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 11
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 11
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 11
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 11
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 11
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 11
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 11
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 11
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 11
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 11
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 11
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 11
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 11
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 11
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 11
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 11
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 11
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 11
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 11
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 11
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 11
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 11
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 11
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 11
- 108010041407 alanylaspartic acid Proteins 0.000 description 11
- 108010087924 alanylproline Proteins 0.000 description 11
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 11
- 108010079547 glutamylmethionine Proteins 0.000 description 11
- 108010089804 glycyl-threonine Proteins 0.000 description 11
- 108010028295 histidylhistidine Proteins 0.000 description 11
- 108010092114 histidylphenylalanine Proteins 0.000 description 11
- 108010012058 leucyltyrosine Proteins 0.000 description 11
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 11
- 238000011282 treatment Methods 0.000 description 11
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 10
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 10
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 10
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 10
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 10
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 10
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 10
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 10
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 10
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 10
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 10
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 10
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 10
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 10
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 10
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 10
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 10
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 10
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 10
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 10
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 10
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 9
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 9
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 9
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 9
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 9
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 9
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 9
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 9
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 9
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 9
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 9
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 9
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 9
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 9
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 9
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 9
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 9
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 9
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 9
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 9
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 9
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 9
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 9
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 9
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 9
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 9
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 9
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 9
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 9
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 9
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 9
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 9
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 9
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 9
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 9
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 9
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 9
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 9
- 102100021277 Beta-secretase 2 Human genes 0.000 description 9
- 101710150190 Beta-secretase 2 Proteins 0.000 description 9
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 9
- OLIYIKRCOZBFCW-ZLUOBGJFSA-N Cys-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)O OLIYIKRCOZBFCW-ZLUOBGJFSA-N 0.000 description 9
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 9
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 9
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 9
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 9
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 9
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 9
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 9
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 9
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 9
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 9
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 9
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 9
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 9
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 9
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 9
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 9
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 9
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 9
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 9
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 9
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 9
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 9
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 9
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 9
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 9
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 9
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 9
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 9
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 9
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 9
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 9
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 9
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 9
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 9
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 9
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 9
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 9
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 9
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 9
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 9
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 9
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 9
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 9
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 9
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 9
- 101710177940 IgG receptor FcRn large subunit p51 Proteins 0.000 description 9
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 9
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 9
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 9
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 9
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 9
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 9
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 9
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 9
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 9
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 9
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 9
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 9
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 9
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 9
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 9
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 9
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 9
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 9
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 9
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 9
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 9
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 9
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 9
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 9
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 9
- 108010065395 Neuropep-1 Proteins 0.000 description 9
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 9
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 9
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 9
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 9
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 9
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 9
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 9
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 9
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 9
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 9
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 9
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 9
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 9
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 9
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 9
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 9
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 9
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 9
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 9
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 9
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 9
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 9
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 9
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 9
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 9
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 9
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 9
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 9
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 9
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 9
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 9
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 9
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 9
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 9
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 9
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 9
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 9
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 9
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 9
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 9
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 9
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 9
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 9
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 9
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 9
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 9
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 9
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 9
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 9
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 9
- JZSLIZLZGWOJBJ-PMVMPFDFSA-N Trp-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N JZSLIZLZGWOJBJ-PMVMPFDFSA-N 0.000 description 9
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 9
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 9
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 9
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 9
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 9
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 9
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 9
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 9
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 9
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 9
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 9
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 108010011559 alanylphenylalanine Proteins 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- 108010060035 arginylproline Proteins 0.000 description 9
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 9
- 108010036413 histidylglycine Proteins 0.000 description 9
- 108010085325 histidylproline Proteins 0.000 description 9
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 9
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 9
- 108010056582 methionylglutamic acid Proteins 0.000 description 9
- 108010012581 phenylalanylglutamate Proteins 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 108010077112 prolyl-proline Proteins 0.000 description 9
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 9
- 230000001105 regulatory effect Effects 0.000 description 9
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 9
- 108010051110 tyrosyl-lysine Proteins 0.000 description 9
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 8
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 8
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 8
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 8
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 8
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 8
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 8
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 8
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 8
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 8
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 8
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 8
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 8
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 8
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 8
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 8
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 8
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 8
- 239000000427 antigen Substances 0.000 description 8
- 102000036639 antigens Human genes 0.000 description 8
- 108091007433 antigens Proteins 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 229940079593 drug Drugs 0.000 description 8
- 108010084389 glycyltryptophan Proteins 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 7
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 7
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 7
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 7
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 7
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 7
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 7
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 7
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 7
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- 108010065920 Insulin Lispro Proteins 0.000 description 7
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 7
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 7
- 241000699670 Mus sp. Species 0.000 description 7
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 7
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 7
- 241000700159 Rattus Species 0.000 description 7
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 7
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 7
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 7
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 7
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 7
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 7
- 108010060199 cysteinylproline Proteins 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 6
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 6
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 6
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 6
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 6
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 6
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 6
- 125000003412 L-alanyl group Chemical group [H]N([H])[C@@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 6
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 6
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 6
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 6
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 6
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 6
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 6
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 6
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 6
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 6
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 6
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 5
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 5
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 5
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 5
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 5
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 5
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 5
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 5
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 5
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 5
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 5
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 5
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 5
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 5
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 5
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 5
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 5
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 5
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 5
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- 101001057504 Homo sapiens Interferon-stimulated gene 20 kDa protein Proteins 0.000 description 5
- 101001055144 Homo sapiens Interleukin-2 receptor subunit alpha Proteins 0.000 description 5
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 5
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 5
- 102100026878 Interleukin-2 receptor subunit alpha Human genes 0.000 description 5
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 5
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 5
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 5
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 5
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 5
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 5
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 5
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 5
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 5
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 5
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 5
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 5
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 5
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 5
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 5
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 5
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 5
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 5
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 5
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 5
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 5
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 5
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 5
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 5
- UEJYSALTSUZXFV-SRVKXCTJSA-N Rigin Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UEJYSALTSUZXFV-SRVKXCTJSA-N 0.000 description 5
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 5
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 5
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 5
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 5
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 5
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 5
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 5
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 5
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 5
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 5
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 5
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 5
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 5
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 5
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 5
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 5
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 5
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 5
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 5
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 5
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 5
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- 239000002775 capsule Substances 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 210000002602 induced regulatory T cell Anatomy 0.000 description 5
- 239000007924 injection Substances 0.000 description 5
- 238000002347 injection Methods 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 4
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 4
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 4
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 4
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 4
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 4
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 4
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 4
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 4
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 4
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 4
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 4
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 4
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 4
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 4
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 4
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 4
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 4
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 4
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 4
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 4
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 4
- 241000699660 Mus musculus Species 0.000 description 4
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 4
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 4
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 4
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 4
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 4
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 4
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 4
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 4
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 4
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 4
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 4
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 4
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 4
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 4
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 4
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 4
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 4
- 238000012790 confirmation Methods 0.000 description 4
- 230000004069 differentiation Effects 0.000 description 4
- 239000012636 effector Substances 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 4
- -1 3,3'-dithiobis (succinimidylpropionate) Imidoesters Chemical class 0.000 description 3
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 3
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 3
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 3
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 3
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 3
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 3
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 3
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 3
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 3
- 241000699800 Cricetinae Species 0.000 description 3
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 3
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 3
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 108091092584 GDNA Proteins 0.000 description 3
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 3
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 3
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 3
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 3
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 3
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 3
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 3
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 3
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 3
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 3
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 3
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 3
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 3
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 3
- 125000000570 L-alpha-aspartyl group Chemical group [H]OC(=O)C([H])([H])[C@]([H])(N([H])[H])C(*)=O 0.000 description 3
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 3
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 description 3
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 3
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 3
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 3
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 3
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 3
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 3
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 3
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 3
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 3
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 3
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 3
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 3
- 210000000447 Th1 cell Anatomy 0.000 description 3
- 210000000068 Th17 cell Anatomy 0.000 description 3
- 210000004241 Th2 cell Anatomy 0.000 description 3
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 3
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 3
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 3
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 3
- HJWLQSFTGDQSRX-BPUTZDHNSA-N Trp-Met-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HJWLQSFTGDQSRX-BPUTZDHNSA-N 0.000 description 3
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 3
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 3
- RHYOAUJXSRWVJT-GVXVVHGQSA-N Val-His-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RHYOAUJXSRWVJT-GVXVVHGQSA-N 0.000 description 3
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 238000007918 intramuscular administration Methods 0.000 description 3
- 210000004901 leucine-rich repeat Anatomy 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 239000003755 preservative agent Substances 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 239000003826 tablet Substances 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 2
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 2
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- GWOVSEVNXNVMMY-BPUTZDHNSA-N Asp-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N GWOVSEVNXNVMMY-BPUTZDHNSA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 241000700198 Cavia Species 0.000 description 2
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108010087819 Fc receptors Proteins 0.000 description 2
- 102000009109 Fc receptors Human genes 0.000 description 2
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 2
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 2
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 2
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- QQFSKBMCAKWHLG-UHFFFAOYSA-N Ile-Phe-Pro-Pro Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(NC(=O)C(N)C(C)CC)CC1=CC=CC=C1 QQFSKBMCAKWHLG-UHFFFAOYSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 2
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 2
- 125000003440 L-leucyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C(C([H])([H])[H])([H])C([H])([H])[H] 0.000 description 2
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 2
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 2
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- 229920002873 Polyethylenimine Polymers 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 239000012980 RPMI-1640 medium Substances 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- 241000282887 Suidae Species 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 2
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 2
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000003431 cross linking reagent Substances 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000003205 fragrance Substances 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 210000002865 immune cell Anatomy 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 239000000314 lubricant Substances 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 201000000050 myeloid neoplasm Diseases 0.000 description 2
- 210000002501 natural regulatory T cell Anatomy 0.000 description 2
- 108010068617 neonatal Fc receptor Proteins 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000002826 placenta Anatomy 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 238000003753 real-time PCR Methods 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000829 suppository Substances 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 239000006188 syrup Substances 0.000 description 2
- 235000020357 syrup Nutrition 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000002054 transplantation Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 150000003923 2,5-pyrrolediones Chemical class 0.000 description 1
- NLPWSMKACWGINL-UHFFFAOYSA-N 4-azido-2-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(N=[N+]=[N-])C=C1O NLPWSMKACWGINL-UHFFFAOYSA-N 0.000 description 1
- FHVDTGUDJYJELY-UHFFFAOYSA-N 6-{[2-carboxy-4,5-dihydroxy-6-(phosphanyloxy)oxan-3-yl]oxy}-4,5-dihydroxy-3-phosphanyloxane-2-carboxylic acid Chemical compound O1C(C(O)=O)C(P)C(O)C(O)C1OC1C(C(O)=O)OC(OP)C(O)C1O FHVDTGUDJYJELY-UHFFFAOYSA-N 0.000 description 1
- 102100030374 Actin, cytoplasmic 2 Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 101001073212 Arabidopsis thaliana Peroxidase 33 Proteins 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101100098985 Caenorhabditis elegans cct-3 gene Proteins 0.000 description 1
- 101100275473 Caenorhabditis elegans ctc-3 gene Proteins 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- PTHCMJGKKRQCBF-UHFFFAOYSA-N Cellulose, microcrystalline Chemical compound OC1C(O)C(OC)OC(CO)C1OC1C(O)C(O)C(OC)C(CO)O1 PTHCMJGKKRQCBF-UHFFFAOYSA-N 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 239000004386 Erythritol Substances 0.000 description 1
- UNXHWFMMPAWVPI-UHFFFAOYSA-N Erythritol Natural products OCC(O)C(O)CO UNXHWFMMPAWVPI-UHFFFAOYSA-N 0.000 description 1
- 102100027581 Forkhead box protein P3 Human genes 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 1
- 101000773237 Homo sapiens Actin, cytoplasmic 2 Proteins 0.000 description 1
- 101000861452 Homo sapiens Forkhead box protein P3 Proteins 0.000 description 1
- 101001038321 Homo sapiens Leucine-rich repeat protein 1 Proteins 0.000 description 1
- 101000619640 Homo sapiens Leucine-rich repeats and immunoglobulin-like domains protein 1 Proteins 0.000 description 1
- 101000619642 Homo sapiens Leucine-rich repeats and immunoglobulin-like domains protein 2 Proteins 0.000 description 1
- 101001017855 Homo sapiens Leucine-rich repeats and immunoglobulin-like domains protein 3 Proteins 0.000 description 1
- 101001123325 Homo sapiens Peroxisome proliferator-activated receptor gamma coactivator 1-beta Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 102000018251 Hypoxanthine Phosphoribosyltransferase Human genes 0.000 description 1
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 1
- 125000000769 L-threonyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](O[H])(C([H])([H])[H])[H] 0.000 description 1
- 125000003580 L-valyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(C([H])([H])[H])(C([H])([H])[H])[H] 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 101150030213 Lag3 gene Proteins 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 102100040249 Leucine-rich repeat protein 1 Human genes 0.000 description 1
- 102100022170 Leucine-rich repeats and immunoglobulin-like domains protein 1 Human genes 0.000 description 1
- 102100022173 Leucine-rich repeats and immunoglobulin-like domains protein 2 Human genes 0.000 description 1
- 102100033284 Leucine-rich repeats and immunoglobulin-like domains protein 3 Human genes 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 101100120552 Mus musculus Foxp3 gene Proteins 0.000 description 1
- PNRBWFSTNLUEKP-UHFFFAOYSA-N N(=[N+]=[N-])C=1C=C(C(CC2C(=O)N(C(C2)=O)O)=CC1)O Chemical class N(=[N+]=[N-])C=1C=C(C(CC2C(=O)N(C(C2)=O)O)=CC1)O PNRBWFSTNLUEKP-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102100028961 Peroxisome proliferator-activated receptor gamma coactivator 1-beta Human genes 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- NRBUKAHTWRCUEQ-XGEHTFHBSA-N Thr-Cys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O NRBUKAHTWRCUEQ-XGEHTFHBSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- WPRVVBVWIUWLOH-UFYCRDLUSA-N Tyr-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPRVVBVWIUWLOH-UFYCRDLUSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 1
- SIDAYYRYPHYJKL-UHFFFAOYSA-N [N+](=[N-])=CC(=O)C(CC1=CC=CC=C1)C(C=[N+]=[N-])=O.[N+](=[N-])=CC(=O)C(CC1=CC=CC=C1)C(C=[N+]=[N-])=O Chemical compound [N+](=[N-])=CC(=O)C(CC1=CC=CC=C1)C(C=[N+]=[N-])=O.[N+](=[N-])=CC(=O)C(CC1=CC=CC=C1)C(C=[N+]=[N-])=O SIDAYYRYPHYJKL-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 229940072056 alginate Drugs 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 239000007900 aqueous suspension Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 239000000378 calcium silicate Substances 0.000 description 1
- 229910052918 calcium silicate Inorganic materials 0.000 description 1
- 235000012241 calcium silicate Nutrition 0.000 description 1
- OYACROKNLOSFPA-UHFFFAOYSA-N calcium;dioxido(oxo)silane Chemical compound [Ca+2].[O-][Si]([O-])=O OYACROKNLOSFPA-UHFFFAOYSA-N 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000006652 catabolic pathway Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000002659 cell therapy Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000004087 circulation Effects 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 230000004540 complement-dependent cytotoxicity Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 210000005220 cytoplasmic tail Anatomy 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000007884 disintegrant Substances 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 239000008298 dragée Substances 0.000 description 1
- 239000000890 drug combination Substances 0.000 description 1
- 229920001971 elastomer Polymers 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- UNXHWFMMPAWVPI-ZXZARUISSA-N erythritol Chemical compound OC[C@H](O)[C@H](O)CO UNXHWFMMPAWVPI-ZXZARUISSA-N 0.000 description 1
- 235000019414 erythritol Nutrition 0.000 description 1
- 229940009714 erythritol Drugs 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000006126 farnesylation Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 230000004727 humoral immunity Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007917 intracranial administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- 239000006186 oral dosage form Substances 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 230000003169 placental effect Effects 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 1
- 229960003415 propylparaben Drugs 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 239000012562 protein A resin Substances 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 208000015347 renal cell adenocarcinoma Diseases 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000002207 retinal effect Effects 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 235000010356 sorbitol Nutrition 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- 238000011200 topical administration Methods 0.000 description 1
- 239000003860 topical agent Substances 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 235000012431 wafers Nutrition 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 239000000811 xylitol Substances 0.000 description 1
- 235000010447 xylitol Nutrition 0.000 description 1
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 1
- 229960002675 xylitol Drugs 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
- C07K14/4703—Inhibitors; Suppressors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6801—Drug-antibody or immunoglobulin conjugates defined by the pharmacologically or therapeutically active agent
- A61K47/6803—Drugs conjugated to an antibody or immunoglobulin, e.g. cisplatin-antibody conjugates
- A61K47/6811—Drugs conjugated to an antibody or immunoglobulin, e.g. cisplatin-antibody conjugates the drug being a protein or peptide, e.g. transferrin or bleomycin
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/52—Constant or Fc region; Isotype
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/52—Constant or Fc region; Isotype
- C07K2317/53—Hinge
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/30—Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Epidemiology (AREA)
- Cell Biology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicinal Preparation (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
본 발명은 Lrig-1(leucine-rich and immunoglobulin-like domains 1) 단백질의 세포 외 도메인(extracellular domain) 및 면역글로불린(immunoglobulin) Fc 영역을 포함하는 융합 단백질에 관한 것이다.
본 발명에서 제공하는 융합 단백질은 이펙터 T 세포(effector T cell)에 존재하는 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포(regulatory T cell, Treg cell)와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있다.
본 발명에서 제공하는 융합 단백질은 이펙터 T 세포(effector T cell)에 존재하는 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포(regulatory T cell, Treg cell)와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있다.
Description
본 발명은 Lrig-1(leucine-rich and immunoglobulin-like domains 1) 단백질의 세포 외 도메인(extracellular domain)과 면역글로불린(immunoglobulin) Fc 영역을 포함하는 신규한 융합 단백질과, 이의 용도에 관한 것이다.
면역글로불린(immunoglobulin)은 4개의 폴리펩타이드 쇄, 즉 쇄간 디술피드 결합을 통해 회합된 2개의 중쇄 및 2개의 경쇄를 포함한다. 각각의 경쇄는 2개의 도메인, 즉 가변 경쇄 도메인 (VL) 및 불변 경쇄 도메인 (CL)을 갖고, 각각의 중쇄는 2개의 영역, 즉 가변 중쇄 영역 (VH) 및 불변 중쇄 영역 (CH)을 갖는다. 불변 중쇄 영역 (CH)은 숫자에 의해 지정된 불변 중쇄 영역 (예를 들어, CH1, CH2, CH3 등)으로 이루어진다 (예를 들어, US 6,086,875 (블룸버그 알. 에스.(Blumberg R. S.) 등), US 5,624,821 (윈터 (Winter G. P.) 등) 및 US 5,116,964 (카폰 디. 제이.(Capon D. J.) 및 라스키 엘. 에이.(Lasky L. A.)) 참조). 면역글로불린은 그의 생물학적 특성, 유기체 내의 위치 및 상이한 항원을 처리하는 능력에 기초하여 상이한 이소형(즉, IgG, IgM, IgA, IgD 및 IgE)으로 분류된다. 이뮤노글로불린 이소형에 따라, 불변 중쇄 영역 (CH)은 3개 또는 4개의 CH 도메인을 가질 수 있다. 또한, 일부 이소형 (IgA, IgD 및 IgG)에서, 중쇄는 분자에 유연성을 부가하는 힌지 영역을 포함한다 (Janeway et al. 2001, Immunobiology, Garland Publishing, N.Y., N.Y.).
인간에는 4개의 IgG 하위클래스 (IgG1, 2, 3, 4)가 있고, 이들은 혈청에 풍부한 순서에 따라 명명된다 (IgG1이 가장 풍부하다). IgG 이소형은 2개의 경쇄 및 2개의 중쇄로 이루어지고, 각각의 중쇄는 3개의 불변 중쇄 도메인 (CH1, CH2, CH3)을 포함한다. IgG의 2개의 중쇄는 서로에 대해 및 경쇄에 각각 디술피드 결합 (-S-S-)에 의해 연결되어 있다. IgG의 항원 결합 부위는 가변 경쇄 (VL) 및 가변 중쇄 (VH) 도메인뿐만 아니라 불변 경쇄 (CL) 및 불변 중쇄 (CH1) 도메인을 포함하는 단편 항원 결합 영역 (Fab 영역)에 위치한다. IgG의 단편 결정화가능 영역 (Fc 영역)은 신생아 Fc 수용체 (FcRn)를 포함하는 특정 세포의 표면에서 발견되는 Fc 수용체에 결합하는 CH2 및 CH3 도메인을 함유하는 중쇄의 일부이다. IgG의 중쇄는 또한 Fab 영역을 Fc 영역으로부터 분리하고 2개의 중쇄를 디술피드 결합을 통해 함께 연결할 때 참여하는, CH1과 CH2 사이의 힌지 영역 (힌지)을 갖는다. 힌지 영역의 구조는 4개의 IgG 하위클래스 각각의 특유한 생물학적 특성에 기여한다.
IgG는 쉽게 조직을 관류할 수 있도록 허용하는 크기가 작은 단량체로 분비된다. 이것은 자궁 내의 태아를 보호하기 위해 인간 태반을 통과하는 것을 촉진하는 수용체 (신생아 Fc 수용체 (FcRn))를 갖는 유일한 이소형이다. 태반을 통해 흡수된 IgG는 그 자체 면역계가 발달하기 전에 체액성 면역을 신생아에게 제공한다.
IgG 신생아 Fc 수용체 (FcRn) 결합 부위는 항체의 Fc 영역에 위치한다. FcRn은 일반적으로 인간의 태반 및 상피 세포에서 발현되고, IgG의 분해를 막는 세포내이입 샐비지 경로에 참여한다. 이 샐비지 경로는 산성 pH에서 FcRn에 대한 IgG의 높은 pH 의존성 결합 친화도에 의해 매개된다. 산성 pH에서 FcRn에 대한 IgG의 높은 친화도는 산성 엔도솜 내로의 흡수 후에 내재화된 IgG의 FcRn에 대한 결합을 유발하는 것으로 생각된다 ([Goebl NA, et al., 2008]; [Junghans RP, et al., 1996]). 대부분의 가용성 단백질은 내재화 후에 리소솜으로 향하지만, 내재화된 FcRn-결합 IgG는 형질 막으로 되돌아가고, 기본 분해 경로로부터 효과적으로 구출된다. 세포외 공간의 중성 pH에 노출시, IgG는 FcRn으로부터 해리되어 순환계로 되돌아갈 수 있다. 따라서, 항체의 연장된 혈청 반감기 특성은 Fc 단편에서 유지된다.
상기 샐비지 경로는 비변형된 단백질 약물에 비해 혈액 순환에서 반감기가 연장된 차세대 단백질 약물의 개발을 위한 하나의 메커니즘을 제공한다. 특히, 비변형된 단백질 약물은 순환 반감기가 짧고, 따라서 필요한 장기간의 치료 기간에 걸쳐 빈번한 투여가 필요하다. PEG화 융합 단백질 기술 (미국 식품의약국 (Food and Drug Administration), [Osborn BL, et al., 2002])을 포함한 많은 방법에 의해 단백질 약물의 반감기를 연장하기 위한 광범위한 노력이 이루어졌지만, 이러한 노력의 결과는 이상적이지 않았다.
본 발명의 일 목적은 Lrig-1(leucine-rich and immunoglobulin-like domains 1) 단백질의 세포 외 도메인(extracellular domain)과 면역글로불린(immunoglobulin) Fc 영역을 포함하는 신규한 융합 단백질을 제공하는 것이다.
본 발명의 다른 목적은 본 발명에 따른 상기 융합 단백질을 코딩하는 핵산 분자에 관한 것이다.
본 발명의 또 다른 목적은 본 발명에 따른 상기 핵산 분자가 삽입된 발현 벡터를 제공하는 것이다.
본 발명의 또 다른 목적은 본 발명에 따른 상기 발현 벡터가 형질 감염된 숙주 세포주를 제공하는 것이다.
본 발명의 또 다른 목적은 본 발명에 따른 융합 단백질을 포함하는 암의 예방 또는 치료용 약학 조성물을 제공하는 것이다.
그러나 본 발명이 이루고자 하는 기술적 과제는 이상에서 언급한 과제에 제한되지 않으며, 언급되지 않은 또 다른 과제들은 아래의 기재로부터 당업계에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.
본 발명의 일 구현 예에 따르면, Lrig-1(leucine-rich and immunoglobulin-like domains 1) 단백질의 세포 외 도메인(extracellular domain) 및 면역글로불린(immunoglobulin) Fc 영역을 포함하는 융합 단백질에 관한 것이다.
본 발명에서, 상기 "Lrig-1 단백질"은 조절 T 세포의 표면에 존재하는 1091개의 아미노산으로 이루어진 막관통 단백질로서, 세포 외 혹은 루멘 쪽의 루신 반복 서열(leucine-rich repeat(LRR))과 세개의 면역체 유사 도메인(immunoglobulin-like domains), 세포막 관통 서열 및 세포질 꼬리부분으로 구성되어 있다. LRIG 유전자 패밀리는 LRIG1, LRIG2와 LRIG3이 존재하며, 이들간의 아미노산들은 매우 보전적으로 구성되어 있다.
본 발명의 일 예시에서 상기 Lrig-1 단백질의 세포 외 도메인은 인간, 원숭이 등의 영장류, 마우스, 래트 등의 설치류, 등을 포함하는 포유류로부터 유래된 Lrig-1 단백질의 세포 외 도메인일 수 있다.
본 발명의 일 예시에서 상기 Lrig-1 단백질의 세포 외 도메인은 인간 유래 Lrig-1 단백질의 35번째 내지 794번째 아미노산 서열에 해당하는 서열번호 1로 표시될 수 있고, 이는 서열번호 2로 표시되는 핵산 서열에 의해 코딩될 수 있으나, 이에 제한되는 것은 아니다(표 1 참조).
서열목록 | 서열정보 |
서열번호 1 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF |
서열번호 2 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc |
본 발명의 다른 예시에서 상기 Lrig-1 단백질의 세포 외 도메인은 마우스 유래 Lrig-1 단백질의 35번째 내지 794번째 아미노산 서열에 해당하는 서열번호 3으로 표시될 수 있고, 이는 서열번호 4로 표시되는 핵산 서열에 의해 코딩될 수 있으나, 이에 제한되는 것은 아니다(표 2 참조).
서열목록 | 서열정보 |
서열번호 3 | AQAGPRAPCA AACTCAGDSL DCSGRGLATL PRDLPSWTRS LNLSYNRLSE IDSAAFEDLT NLQEVYLNSN ELTAIPSLGA ASIGVVSLFL QHNKILSVDG SQLKSYLSLE VLDLSSNNIT EIRSSCFPNG LRIRELNLAS NRISILESGA FDGLSRSLLT LRLSKNRITQ LPVKAFKLPR LTQLDLNRNR IRLIEGLTFQ GLDSLEVLRL QRNNISRLTD GAFWGLSKMH VLHLEYNSLV EVNSGSLYGL TALHQLHLSN NSISRIQRDG WSFCQKLHEL ILSFNNLTRL DEESLAELSS LSILRLSHNA ISHIAEGAFK GLKSLRVLDL DHNEISGTIE DTSGAFTGLD NLSKLTLFGN KIKSVAKRAF SGLESLEHLN LGENAIRSVQ FDAFAKMKNL KELYISSESF LCDCQLKWLP PWLMGRMLQA FVTATCAHPE SLKGQSIFSV LPDSFVCDDF PKPQIITQPE TTMAVVGKDI RFTCSAASSS SSPMTFAWKK DNEVLANADM ENFAHVRAQD GEVMEYTTIL HLRHVTFGHE GRYQCIITNH FGSTYSHKAR LTVNVLPSFT KIPHDIAIRT GTTARLECAA TGHPNPQIAW QKDGGTDFPA ARERRMHVMP DDDVFFITDV KIDDMGVYSC TAQNSAGSVS ANATLTVLET PSLAVPLEDR VVTVGETVAF QCKATGSPTP RITWLKGGRP LSLTERHHFT PGNQLLVVQN VMIDDAGRYT CEMSNPLGTE RAHSQLSILP TPGCRKDGTT |
서열번호 4 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA |
본 발명에서, "면역글로불린 Fc 영역"은, 면역글로불린의 중쇄와 경쇄 가변영역을 제외한, 중쇄 불변영역 2(CH2) 및/또는 중쇄 불변영역 3(CH3) 부분을 포함하는 부위를 의미한다. 상기 면역글로불린 Fc 영역은 본 발명의 단백질 결합체의 모이어티를 이루는 일 구성일 수 있다.
본 발명에서 상기 면역글로불린 Fc 영역은 중쇄 불변영역에 힌지(hinge) 부분을 포함함으로써 최종 제조될 융합 단백질의 구조적 유연성(flexibility)에 영향을 줄 수 있고, 융합 단백질의 생산성 및 안정성을 보다 높일 수 있으나, 이에 제한되는 것은 아니다.
또한 본 발명의 면역글로불린 Fc 영역은 천연형과 실질적으로 동등하거나 향상된 효과를 갖는 한, 면역 글로불린의 중쇄와 경쇄 가변영역만을 제외하고, 일부 또는 전체 중쇄 불변영역 1(CH1) 및/또는 경쇄 불변영역 1(CL1)을 포함하는 확장된 Fc영역일 수 있다. 또한, CH2 및/또는 CH3에 해당하는 상당히 긴 일부 아미노산 서열이 제거된 영역일 수도 있다.
예컨대, 본 발명의 면역글로불린 Fc 영역은 1) CH1 도메인, CH2 도메인, CH3 도메인 및 CH4 도메인, 2) CH1 도메인 및 CH2 도메인, 3) CH1 도메인 및 CH3 도메인, 4) CH2 도메인 및 CH3 도메인, 5) CH1 도메인, CH2 도메인, CH3 도메인 및 CH4 도메인 중 1개 또는 2개의 이상의 도메인과 면역글로불린 힌지 영역(또는 힌지 영역의 일부)와의 조합, 6) 중쇄 불변 영역 각 도메인과 경쇄 불변영역의 이량체일 수 있다. 그러나, 이에 제한되는 것은 아니다.
또한, 본 발명의 면역글로불린 Fc 영역은 천연형 아미노산 서열뿐만 아니라 이의 서열 유도체를 포함한다. 아미노산 서열 유도체란 천연 아미노산 서열 중의 하나 이상의 아미노산 잔기가 결실, 삽입, 비보전적 또는 보전적 치환 또는 이들의 조합에 의하여 상이한 서열을 가지는 것을 의미한다.
예를 들면, IgG Fc의 경우 결합에 중요하다고 알려진 214 내지 238, 297 내지 299, 318 내지 322 또는 327 내지 331번째 아미노산 잔기들이 변형을 위해 적당한 부위로서 이용될 수 있다.
또한, 이황화 결합을 형성할 수 있는 부위가 제거되거나, 천연형 Fc에서 N-말단의 몇몇 아미노산이 제거되거나 또는 천연형 Fc의 N-말단에 메티오닌 잔기가 부가될 수도 있는 등 다양한 종류의 유도체가 가능하다. 또한, 이펙터 기능을 없애기 위해 보체결합부위, 예로 C1q 결합부위가 제거될 수도 있고, ADCC (antibody dependent cell mediated cytotoxicity) 부위가 제거될 수도 있다. 이러한 면역글로불린 Fc 영역의 서열 유도체를 제조하는 기술은 국제특허공개 제WO 97/34631호, 국제특허공개 제96/32478호 등에 개시되어 있다.
분자의 활성을 전체적으로 변경시키지 않는 단백질 및 펩타이드에서의 아미노산 교환은 당해 분야에 공지되어 있다(H.Neurath, R.L.Hill, The Proteins, Academic Press, New York, 1979). 가장 통상적으로 일어나는 교환은 아미노산 잔기 Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Thy/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, Ala/Glu, Asp/Gly 간의 교환이다. 경우에 따라서는 인산화(phosphorylation), 황화(sulfation), 아크릴화(acrylation), 당화(glycosylation), 메틸화(methylation), 파네실화(farnesylation), 아세틸화(acetylation) 및 아미드화(amidation) 등으로 수식(modification)될 수도 있다.
상기 기술한 Fc 유도체는 본 발명의 Fc 영역과 동등한 생물학적 활성을 나타내며 Fc 영역의 열, pH 등에 대한 구조적 안정성을 증대시킨 것일 수 있다.
또한, 이러한 Fc 영역은 인간, 소, 염소, 돼지, 마우스, 래빗, 햄스터, 랫트 또는 기니아 픽 등의 동물의 생체 내에서 분리한 천연형으로부터 얻어질 수도 있고, 형질 전환된 동물세포 또는 미생물로부터 얻어진 재조합형 또는 이의 유도체일 수 있다. 여기서, 천연형으로부터 획득하는 방법은 전체 면역글로불린을 인간 또는 동물의 생체로부터 분리한 후, 단백질 분해효소를 처리하여 획득하는 방법일 수 있다. 파파인으로 처리할 경우에는 Fab 및 Fc로 절단되고, 펩신으로 처리할 경우에는 pF'c 및 F(ab)2로 절단된다. 크기 배제 크로마토그래피(size-exclusion chromatography) 등을 이용하여 Fc 또는 pF'c를 분리할 수 있다. 더 구체적인 실시 형태에서는 인간 또는 마우스 유래의 Fc 영역을 미생물로부터 수득한 재조합형 면역글로불린 Fc 영역이다.
또한, 면역글로불린 Fc 영역은 천연형 당쇄, 천연형에 비해 증가된 당쇄, 천연형에 비해 감소한 당쇄 또는 당쇄가 제거된 형태일 수 있다. 이러한 면역글로불린 Fc 당쇄의 증감 또는 제거에는 화학적 방법, 효소학적 방법 및 미생물을 이용한 유전 공학적 방법과 같은 통상적인 방법이 이용될 수 있다. 여기서, Fc에서 당쇄가 제거된 면역글로불린 Fc 영역은 보체(c1q)와의 결합력이 현저히 저하되고, 항체-의존성 세포독성 또는 보체-의존성 세포 독성이 감소 또는 제거되므로, 생체 내에서 불필요한 면역 반응을 유발하지 않는다. 이런 점에서 약물의 캐리어로서의 본래의 목적에 보다 부합하는 형태는 당쇄가 제거되거나 비당쇄화된 면역글로불린 Fc 영역이라 할 것이다.
본 발명에서 "당쇄의 제거(Deglycosylation)"는 효소로 당을 제거한 Fc 영역을 말하며, 비당쇄화(Aglycosylation)는 원핵동물, 더 구체적인 실시 형태에서는 대장균에서 생산하여 당쇄화되지 않은 Fc 영역을 의미한다.
한편, 면역글로불린 Fc 영역은 인간 또는 소, 염소, 돼지, 마우스, 래빗, 햄스터, 랫트, 기니아 픽 등의 동물 유래일 수 있으며, 바람직한 일 예시로 인간 또는 마우스 유래일 수 있다.
또한, 본 발명의 면역글로불린 Fc 영역은 IgG, IgA, IgD, IgE, IgM 유래 Fc 영역, 중쇄 불변영역 2(CH2), 중쇄 불변영역 3(CH3), 힌지(hinge), 이의 단편, 또는 이들의 조합(combination), 또는 이들의 조합을 포함하는 하이브리드 Fc(hybrid Fc)일 수 있다.
한편, 본 발명에서 "조합(combination)"이란 이량체 또는 다량체를 형성할 때, 동일 기원 단쇄 면역글로불린 Fc 영역, 중쇄 불변영역 2(CH2) 또는 중쇄 불변영역 3(CH3)을 코딩하는 폴리펩타이드가 상이한 기원의 단쇄 폴리펩타이드와 결합을 형성하는 것을 의미한다. 즉, IgG, IgA, IgM, IgD 또는 IgE 유래의 Fc 영역, 중쇄 불변영역 2(CH2) 또는 중쇄 불변영역 3(CH3)으로부터 선택된 2개 이상의 단편으로부터 이량체 또는 다량체의 제조가 가능하다.
본 발명에서 상기 "하이브리드 Fc"는 인간 IgG 서브클래스의 조합 또는 인간 IgD 및 IgG의 조합으로부터 유도될 수 있다. 하나의 실시예에서, 상기 하이브리드 Fc는 예를 들어, IgD 힌지 영역 및 CH2 N-말단 영역 + IgG4 CH2 및 CH3 영역을 포함할 수 있으며, 예를 들어, 한국등록특허 제0897938호에 개시된 하이브리드 Fc 형태를 동일하게 차용하여 사용할 수 있고, 본 명세서에 참조로서 도입된다. 본 발명에서 상기 하이브리드 Fc는 생물학적 활성 분자, 폴리펩타이드 등에 결합하는 경우, 생물학적 활성 분자의 혈청 반감기를 증가시킬 뿐만 아니라 Fc-폴리펩타이드 융합 단백질을 코딩하는 뉴클레오티드가 발현될 때 폴리펩타이드의 발현 수준을 높이는 효과가 있다.
본 발명의 일 예시로 상기 면역글로불린 Fc 영역은 IgG, IgA, IgM, IgD 또는 IgE 유래의 Fc 영역이거나, 혹은 상기 IgG, IgA, IgM, IgD 또는 IgE 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있으나, 이에 제한되는 것은 아니다.
본 발명의 일 예시로 상기 면역글로불린 Fc 영역은 인간 혈액에 가장 풍부한 IgG 또는 IgM 유래 Fc 영역이거나, 혹은 상기 IgG 또는 IgM 유래 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있고, 다른 예시로 리간드 결합 단백질의 반감기를 향상시키는 것으로 공지된 IgG 유래 Fc 영역이거나, 혹은 상기 IgG 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있고, 또 다른 예시로 IgG1, IgG2, IgG3 또는 IgG4 유래 Fc 영역이거나, 혹은 상기 IgG1, IgG2, IgG3 또는 IgG4 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있으며, 또 다른 예시로 IgG1 또는 IgG2 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있다.
본 발명에서 바람직한 예시로서 상기 면역글로불린 Fc 영역은 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하거나, 서열번호 6으로 표시되는 마우스 IgG2 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함할 수 있으나, 이에 제한되는 것은 아니다(하기 표 3 참조).
서열목록 | 서열정보 |
서열번호 5 | LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 6 | IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
본 발명의 일 예시로, 상기 면역글로불린 Fc 영역은 IgG, IgA, IgM, IgD, IgE, 또는 아바타셉트(Abatacept) 유래의 힌지 영역을 포함할 수 있고, 또 다른 예시로 IgG, IgD 또는 아바타셉트(Abatacept) 유래의 힌지 영역을 포함할 수 있으며, 혹은 IgG1, IgG2, IgG3, IgG4, IgD 또는 아바타셉트(Abatacept) 유래의 힌지 영역을 포함할 수 있으나, 이에 제한되는 것은 아니다.
본 발명에서 바람직한 예시로서 상기 면역글로불린 Fc 영역은 서열번호 7로 표시되는 인간 IgG1 유래의 힌지 영역; 서열번호 8로 표시되는 마우스 IgG2 유래의 힌지 영역; 서열번호 9로 표시되는 인간 IgD 유래의 힌지 영역; 및 서열번호 10으로 표시되는 아바타셉트(Abatacept)의 힌지 영역;으로 이루어진 군에서 선택된 1종 이상을 포함함으로써(하기 표 4 참조) 최종 제조될 융합 단백질의 구조적 유연성(flexibility)을 높이고, 융합 단백질의 생산성 및 안정성을 현저히 향상시킬 수 있으나, 이에 제한되는 것은 아니다.
서열목록 | 서열정보 |
서열번호 7 | EPKSSDKTHTSPPCPAPELLGGPSVF |
서열번호 8 | EPRGPTIKPCPPCKCPAPNLLGGPSVF |
서열번호 9 | RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF |
서열번호 10 | EPKSSDKTHTSPPSPAPELLGGSSVF |
본 발명에서 상기 Lrig-1 단백질의 세포 외 도메인과 Fc 영역이 링커를 통해 연결되는 경우, 상기 링커는 Fc 단편의 N-말단, C-말단 또는 유리기 (free radical)에 연결될 수 있고, Lrig-1 단백질의 세포 외 도메인의 N-말단, C-말단 또는 유리기에 연결될 수 있다. 링커가 펩타이드 링커인 경우, 연결은 임의의 부위에서 일어날 수 있다. 예를 들면, 상기 링커는 상기 Lrig-1 단백질의 세포 외 도메인의 C-말단 및 상기 면역글로불린의 Fc 영역의 N-말단에 연결될 수 있고, 혹은 상기 면역글로불린의 Fc 영역의 C-말단 및 상기 Lrig-1 단백질의 세포 외 도메인의 N-말단에 연결될 수 있다.
본 발명에서 상기 "링커(linker)"는 융합 단백질 내 Lrig-1 단백질의 세포 외 도메인과 면역글로불린 Fc 영역 사이의 간섭 효과를 줄여 타겟 세포에서 상기 Lrig-1 단백질의 세포 외 도메인의 목적하는 활성을 높일 수 있다. 또한, 본 발명에서 상기 링커는 목적하는 질환의 조직 또는 세포 내에서 과발현되는 효소에 의해 절단될 수 있는 서열을 포함할 수 있다. 상기와 같이 과발현되는 효소에 의해 절단될 수 있는 경우에는 Fc 부분으로 인하여 폴리펩타이드의 활성이 저하되는 것을 효과적으로 방지할 수 있다.
본 발명의 일 예시로서 상기 링커는 1 내지 100개의 아미노산을 갖는 것이 바람직하나, 이에 제한되지는 않으며, Lrig-1 단백질의 세포 외 도메인과 면역글로불린 Fc 영역을 분리시킬 수 있는 어떠한 펩타이드라도 가능하다. 상기 링커를 구성하는 아미노산 서열에는 특별한 제한은 없으나, 글라이신(G) 및 세린(S)을 포함하거나, 이들을 반복적으로 또는 무작위적 패턴으로 포함하는 것이 바람직하다. 그러한 예로서, 상기 링커는 서열번호 11로 표시되는 펩타이드 링커; 및 서열번호 12로 표시되는 펩타이드 링커; 중 적어도 하나를 포함할 수 있고, 혹은 (GGGGS)N(N은 1 이상의 정수, 바람직하게는 1 내지 20의 정수임)의 아미노산 서열을 포함함으로써(하기 표 5 참조), 세포 내 활성 물질의 안정성을 높이고, 생산성을 보다 향상시킬 수 있다.
서열목록 | 서열정보 |
서열번호 11 | GS |
서열번호 12 | GGG |
또한, 본 발명에서 상기 링커의 일 예시로, 혈액 내에 가장 많이 존재하는 인간 알부민의 282번 내지 314번째 부분에 위치한 33개의 아미노산으로 이루어진 펩타이드 링커, 보다 바람직하게는 292번 내지 304번째 부분에 위치한 13개의 아미노산으로 이루어진 펩타이드 링커일 수 있으며, 이러한 부분은 3차원적인 구조상 대부분 외부에 노출된 부분으로서 체내에서 면역반응을 유도할 가능성이 최소화된 부분이다. 단, 이에 제한되는 것은 아니다.
또한, 본 발명에서 상기 링커 및 Fc 영역이 별개로 발현된 후에 서로 결합될 때, 링커는 당업계에 알려진 가교제일 수 있다. 상기 가교제는, 예를 들어, 1,1-비스(디아조아세틸)-2-페닐에탄 (1,1-bis(diazoacetyl)-2-phenylethane), 글루타르알데하이드 (glutaraldehyde), 4-아지도살리실릭산 (4-azidosalicylic acid)과 같은 N-하이드로옥시석신이미드 에스테르 (N-hydroxysuccinimide ester), 3,3'-디사이오비스(석신이미딜프로피오네이트) (3,3'-dithiobis(succinimidylpropionate))와 같은 디석신이미딜에스테르 (disuccinimidyl esters)를 포함하는 이미도에스테르 (imidoesters), 및 비스-N-말레이미도-1,8-옥테인과 같은 이중 기능적 말레이미드 (bifunctional maleimides)일 수 있으나, 이에 제한되는 것은 아니다.
본 발명에서 제공하는 융합 단백질의 일 예시는 상기 서열번호 1로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 6 참조). 하기 표 6의 서열번호 13 내지 16으로 표시되는 융합 단백질은 하기 표 7의 서열번호 17 내지 20으로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 7 참조)
서열목록 | 서열정보 |
서열번호 13 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 14 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 15 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 16 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 17 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 18 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 19 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 20 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 일 예시는 상기 서열번호 3로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 8 참조). 하기 표 8의 서열번호 21 내지 24로 표시되는 융합 단백질은 각각 하기 표 9의 서열번호 25 내지 28으로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 9 참조)
서열목록 | 서열정보 |
서열번호 21 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 22 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 23 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 24 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 25 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 26 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 27 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 28 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 다른 일 예시는 상기 서열번호 3으로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 6으로 표시되는 마우스 IgG2 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 10 참조). 하기 표 10의 서열번호 29 내지 32로 표시되는 융합 단백질은 각각 하기 표 11의 서열번호 33 내지 36으로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 11 참조).
서열목록 | 서열정보 |
서열번호 29 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPKSSDKTHTSPPCPAPELLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 30 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPRGPTIKPCPPCKCPAPNLLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 31 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 32 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT EPKSSDKTHTSPPSPAPELLGGSSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열목록 | 서열정보 |
서열번호 33 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 34 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 35 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 36 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 1로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 12 참조). 하기 표 12의 서열번호 37 내지 40으로 표시되는 융합 단백질은 각각 하기 표 13의 서열번호 41 내지 44로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 13 참조).
서열목록 | 서열정보 |
서열번호 37 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GS EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 38 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GS EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 39 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 40 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GS EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 41 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 42 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 43 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 44 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 3으로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 14 참조). 하기 표 14의 서열번호 45 내지 48로 표시되는 융합 단백질은 각각 하기 표 15의 서열번호 49 내지 52로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 15 참조).
서열목록 | 서열정보 |
서열번호 45 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 46 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 47 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 48 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 49 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 50 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 51 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 52 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 3으로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 6로 표시되는 마우스 IgG2 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 16 참조). 하기 표 16의 서열번호 53 내지 56으로 표시되는 융합 단백질은 각각 하기 표 17의 서열번호 57 내지 60으로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 17 참조).
서열목록 | 서열정보 |
서열번호 53 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPKSSDKTHTSPPCPAPELLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 54 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPRGPTIKPCPPCKCPAPNLLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 55 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 56 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GS EPKSSDKTHTSPPSPAPELLGGSSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열목록 | 서열정보 |
서열번호 57 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 58 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 59 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 60 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 1로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 12로 표시되는 링커; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 18 참조). 하기 표 18의 서열번호 61 내지 64로 표시되는 융합 단백질은 하기 표 19의 서열번호 65 내지 68로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 19 참조).
서열목록 | 서열정보 |
서열번호 61 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GGG GS EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 62 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GGG GS EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 63 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GGG GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 64 | AGPRAPCAAACTCAGDSLDCGGRGLAALPGDLPSWTRSLNLSYNKLSEIDPAGFEDLPNLQEVYLNNNELTAVPSLGAASSHVVSLFLQHNKIRSVEGSQLKAYLSLEVLDLSLNNITEVRNTCFPHGPPIKELNLAGNRIGTLELGAFDGLSRSLLTLRLSKNRITQLPVRAFKLPRLTQLDLNRNRIRLIEGLTFQGLNSLEVLKLQRNNISKLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSIARIHRKGWSFCQKLHELVLSFNNLTRLDEESLAELSSLSVLRLSHNSISHIAEGAFKGLRSLRVLDLDHNEISGTIEDTSGAFSGLDSLSKLTLFGNKIKSVAKRAFSGLEGLEHLNLGGNAIRSVQFDAFVKMKNLKELHISSDSFLCDCQLKWLPPWLIGRMLQAFVTATCAHPESLKGQSIFSVPPESFVCDDFLKPQIITQPETTMAMVGKDIRFTCSAASSSSSPMTFAWKKDNEVLTNADMENFVHVHAQDGEVMEYTTILHLRQVTFGHEGRYQCVITNHFGSTYSHKARLTVNVLPSFTKTPHDITIRTTTVARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDAGVYSCTAQNSAGSISANATLTVLETPSLVVPLEDRVVSVGETVALQCKATGNPPPRITWFKGDRPLSLTERHHLTPDNQLLVVQNVVAEDAGRYTCEMSNTLGTERAHSQLSVLPAAGCRKDGTTVGIF GGG GS EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 65 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGCGGTGGC GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 66 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGCGGTGGC GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 67 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGCGGTGGC GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 68 | GCTGGGCCTCGGGCTCCTTGTGCTGCCGCCTGCACATGTGCAGGCGATTCCCTGGACTGCGGCGGCAGAGGCCTGGCCGCCCTGCCTGGCGATCTGCCATCCTGGACCCGGAGCCTGAACCTGAGCTACAACAAGCTGAGCGAGATCGATCCCGCCGGCTTTGAGGACCTGCCTAACCTGCAGGAGGTGTATCTGAACAATAACGAGCTGACCGCGGTACCatccctgggcgctgcttcatcacatgtcgtctctctctttctgcagcacaacaagattcgcagcgtggaggggagccagctgaaggcctacctttccttagaagtgttagatctgagtttgaacaacatcacggaagtgcggaacacctgctttccacacggaccgcctataaaggagctcaacctggcaggcaatcggattggcaccctggagttgggagcatttgatggtctgtcacggtcgctgctaactcttcgcctgagcaaaaacaggatcacccagcttcctgtaagagcattcaagctacccaggctgacacaactggacctcaatcggaacaggattcggctgatagagggcctcaccttccaggggctcaacagcttggaggtgctgaagcttcagcgaaacaacatcagcaaactgacagatggggccttctggggactgtccaagatgcatgtgctgcacctggagtacaacagcctggtagaagtgaacagcggctcgctctacggcctcacggccctgcatcagctccacctcagcaacaattccatcgctcgcattcaccgcaagggctggagcttctgccagaagctgcatgagttggtcctgtccttcaacaacctgacacggctggacgaggagagcctggccgagctgagcagcctgagtgtcctgcgtctcagccacaattccatcagccacattgcggagggtgccttcaagggactcaggagcctgcgagtcttggatctggaccataacgagatttcgggcacaatagaggacacgagcggcgccttctcagggctcgacagcctcagcaagctgactctgtttggaaacaagatcaagtctgtggctaagagagcattctcggggctggaaggcctggagcacctgaaccttggagggaatgcgatcagatctgtccagtttgatgcctttgtgaagatgaagaatcttaaagagctccatatcagcagcgacagcttcctgtgtgactgccagctgaagtggctgcccccgtggctaattggcaggatgctgcaggcctttgtgacagccacctgtgcccacccagaatcactgaagggtcagagcattttctctgtgccaccagagagtttcgtgtgcgatgacttcctgaagccacagatcatcacccagccagaaaccaccatggctatggtgggcaaggacatccggtttacatgctcagcagccagcagcagcagctcccccatgacctttgcctggaagaaagacaatgaagtcctgaccaatgcagacatggagaactttgtccacgtccacgcgcaggacggggaagtgatggagtacaccaccatcctgcacctccgtcaggtcactttcgggcacgagggccgctaccaatgtgtcatcaccaaccactttggctccacctattcacataaggccaggctcaccgtgaatgtgttgccatcattcaccaaaacgccccacgacataaccatccggaccaccaccgtggcccgcctcgaatgtgctgccacaggtcacccaaaccctcagattgcctggcagaaggatggaggcacggatttccccgctgcccgtgagcgacgcatgcatgtcatgccggatgacgacgtgtttttcatcactgatgtgaaaatagatgacgcaggggtttacagctgtactgctcagaactcagccggttctatttcagctaatgccaccctgactgtcctagagaccccatccttggtggtccccttggaagaccgtgtggtatctgtgggagaaacagtggccctccaatgcaaagccacggggaaccctccgccccgcatcacctggttcaagggggaccgcccgctgagcctcactgagcggcaccacctgacccctgacaaccagctcctggtggttcagaacgtggtggcagaggatgcgggccgatatacctgtgagatgtccaacaccctgggcacggagcgagctcacagccagctgagcgtcctgcccgcagcaggctgcaggaaggatgggaccacggtaggcatcttc GGCGGTGGC GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 3으로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 12로 표시되는 링커; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 5로 표시되는 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 20 참조). 하기 표 20의 서열번호 69 내지 72로 표시되는 융합 단백질은 각각 하기 표 21의 서열번호 73 내지 76으로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 21 참조).
서열목록 | 서열정보 |
서열번호 69 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPKSSDKTHT SPPCPAPELL GGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 70 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPRGPTIKPCPPCKCPAPNLLGGPSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 71 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열번호 72 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPKSSDKTHTSPPSPAPELLGGSSVF LFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK |
서열목록 | 서열정보 |
서열번호 73 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 74 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 75 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
서열번호 76 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc CTGTTTCCTCCAAAGCCCAAGGACACACTGATGATCTCCAGGACACCAGAGGTGACCTGCGTGGTGGTGGACGTGAGCCACGAGGACCCCGAGGTGAAGTTCAACTGGTACGTGGATGGCGTGGAGGTGCACAATGCCAAGACCAAGCCCAGAGAGGAGCAGTACAACTCTACCTATAGGGTGGTGAGCGTGCTGACAGTGCTGCACCAGGACTGGCTGAACGGCAAGGAGTATAAGTGCAAGGTGAGCAATAAGGCCCTGCCTGCCCCAATCGAGAAGACAATCTCCAAGGCCAAGGGCCAGCCAAGAGAGCCCCAGGTGTACACCCTGCCCCCTAGCAGGGATGAGCTGACAAAGAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTTTATCCCTCCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAGCCTGAGAATAACTACAAGACAACCCCACCCGTGCTGGATTCTGACGGCAGCTTCTTTCTGTATTCTAAGCTGACCGTGGACAAGAGCAGGTGGCAGCAGGGCAACGTGTTCAGCTGCTCCGTGATGCACGAAGCACTGCACAATCACTACACCCAGAAATCACTGTCACTGAGCCCTGGCAAA |
본 발명에서 제공하는 융합 단백질의 또 다른 일 예시는 상기 서열번호 3으로 표시되는 Lrig-1 단백질의 세포 외 도메인; 서열번호 12로 표시되는 링커; 서열번호 11로 표시되는 링커; 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역; 서열번호 6으로 표시되는 마우스 IgG2 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3)을 포함하는 융합 단백질일 수 있다(하기 표 22 참조). 하기 표 22의 서열번호 77 내지 80로 표시되는 융합 단백질은 하기 표 23의 서열번호 81 내지 84로 표시되는 핵산 서열에 의해 코딩될 수 있다(하기 표 23 참조).
서열목록 | 서열정보 |
서열번호 77 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPKSSDKTHTSPPCPAPELLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 78 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPRGPTIKPCPPCKCPAPNLLGGPSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 79 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS RNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열번호 80 | AQAGPRAPCAAACTCAGDSLDCSGRGLATLPRDLPSWTRSLNLSYNRLSEIDSAAFEDLTNLQEVYLNSNELTAIPSLGAASIGVVSLFLQHNKILSVDGSQLKSYLSLEVLDLSSNNITEIRSSCFPNGLRIRELNLASNRISILESGAFDGLSRSLLTLRLSKNRITQLPVKAFKLPRLTQLDLNRNRIRLIEGLTFQGLDSLEVLRLQRNNISRLTDGAFWGLSKMHVLHLEYNSLVEVNSGSLYGLTALHQLHLSNNSISRIQRDGWSFCQKLHELILSFNNLTRLDEESLAELSSLSILRLSHNAISHIAEGAFKGLKSLRVLDLDHNEISGTIEDTSGAFTGLDNLSKLTLFGNKIKSVAKRAFSGLESLEHLNLGENAIRSVQFDAFAKMKNLKELYISSESFLCDCQLKWLPPWLMGRMLQAFVTATCAHPESLKGQSIFSVLPDSFVCDDFPKPQIITQPETTMAVVGKDIRFTCSAASSSSSPMTFAWKKDNEVLANADMENFAHVRAQDGEVMEYTTILHLRHVTFGHEGRYQCIITNHFGSTYSHKARLTVNVLPSFTKIPHDIAIRTGTTARLECAATGHPNPQIAWQKDGGTDFPAARERRMHVMPDDDVFFITDVKIDDMGVYSCTAQNSAGSVSANATLTVLETPSLAVPLEDRVVTVGETVAFQCKATGSPTPRITWLKGGRPLSLTERHHFTPGNQLLVVQNVMIDDAGRYTCEMSNPLGTERAHSQLSILPTPGCRKDGTT GGG GS EPKSSDKTHTSPPSPAPELLGGSSVF IFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK |
서열목록 | 서열정보 |
서열번호 81 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC GAGCCAAAGTCCTCTGATAAGACACACACCTCTCCACCATGCCCAGCACCAGAGCTGCTGGGAGGACCAAGCGTGTTC atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 82 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC Gagcctcggggccctaccatcaagccctgccccccttgcaagtgccctgcccctaatctgctgggcggaccctccgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 83 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC cgcaacaccggccgcggcggcgaggagaagaagaaggagaaggagaaggaggagcaggaggagcgcgagaccaagacccccgagtgccccagccacacccagcccctgggcgtgttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
서열번호 84 | GCTCAGGCTGGACCTAGGGCTCCTTGCGCTGCCGCCTGCACCTGTGCAGGCGATTCTCTGGACTGCAGCGGCCGGGGCCTGGCCACACTGCCCAGGGACCTGCCTTCCTGGACCAGATCTCTGAACCTGAGCTACAATCGGCTGTCCGAGATCGATTCTGCCGCCTTTGAGGACCTGACAAATCTGCAGGAGGTGTATCTGAACAGCAATGAGCTGACCGCAATCCCCTCCCTGGGAGCAGCCTCTATCGGCGTGGTGAGCCTGTTCCTGCAGCACAACAAGATCCTGAGCGTGGATGGCTCCCAGCTGAAGAGCTACCTGTCTCTGGAGGTGCTGGACCTGAGCTCCAACAATATCACCGAGATCAGATCTAGCTGTTTTCCTAATGGCCTGCGGATCAGAGAGCTGAACCTGGCCTCTAATCGGATCAGCATCCTGGAGTCCGGCGCCTTCGATGGCCTGAGCAGATCCCTGCTGACACTGCGCCTGTCCAAGAACCGGATCACCCAGCTGCCCGTGAAGGCCTTTAAGCTGCCTAGGCTGACACAGCTGGACCTGAACCGGAATAGAATCAGGCTGATCGAGGGCCTGACCTTCCAGGGCCTGGATAGCCTGGAGGTGCTGCGCCTGCAGCGGAACAATATCTCCCGCCTGACAGACGGAGCATTTTGGGGCCTGTCTAAGATGCACGTGCTGCACCTGGAGTACAATAGCCTGGTGGAGGTGAACTCTGGCAGCCTGTATGGCCTGACCGCCCTGCACCAGCTGCACCTGTCCAACAATAGCATCAGCAGAATCCAGAGGGATGGCTGGTCCTTCTGCCAGAAGCTGCACGAGCTGATCCTGTCTTTTAACAATCTGACCAGGCTGGACGAGGAGAGCCTGGCAGAGCTGTCCTCTCTGTCCATCCTGCGCCTGTCTCACAATGCCATCAGCCACATCGCCGAGGGCGCCTTTAAGGGCCTGAAGAGCCTGAGGGTGCTGGATCTGGACCACAACGAGATCTCTGGCACCATCGAGGATACAAGCGGCGCCTTCACAGGCCTGGACAATCTGTCCAAGCTGACCCTGTTTGGCAACAAGATCAAGTCTGTGGCCAAGCGGGCCTTCTCTGGCCTGGAGAGCCTGGAGCACCTGAACCTGGGCGAGAATGCCATCAGATCCGTGCAGTTCGATGCCTTTGCCAAGATGAAGAATCTGAAGGAGCTGTACATCAGCTCCGAGAGCTTCCTGTGCGACTGTCAGCTGAAGTGGCTGCCACCTTGGCTGATGGGAAGGATGCTGCAGGCCTTTGTGACCGCCACATGCGCCCACCCAGAGAGCCTGAAGGGCCAGAGCATCTTCTCCGTGCTGCCCGATAGCTTCGTGTGCGACGATTTTCCTAAGCCACAGATCATCACCCAGCCAGAGACAACAATGGCCGTGGTGGGCAAGGACATCCGGTTTACATGTTCCGCCGCCTCTAGCTCCTCTAGCCCCATGACCTTCGCCTGGAAGAAGGATAACGAGGTGCTGGCCAATGCCGACATGGAGAACTTCGCCCACGTGAGAGCCCAGGATGGCGAAGTGATGGAGTATACCACAATCCTGCACCTGCGGCACGTGACCTTTGGCCACGAGGGCAGATACCAGTGCATCATCACAAATCACTTCGGCTCTACCTATAGCCACAAGGCCAGGCTGACAGTGAACGTGCTGCCTAGCTTTACCAAGATCCCACACGACATCGCCATCAGAACAGGCACCACAGCAAGGCTGGAGTGTGCAGCAACCGGACACCCAAACCCTCAGATCGCATGGCAGAAGGATGGAGGCACAGACTTCCCTGCAGCCCGCGAGAGGAGAATGCACGTGATGCCAGACGATGACGTGTTCTTTATCACAGATGTGAAGATCGATGACATGGGCGTGTACTCCTGCACCGCACAGAACAGCGCCGGCAGCGTGTCCGCCAACGCCACCCTGACCGTGCTGGAGACACCATCCCTGGCCGTGCCCCTGGAGGACAGGGTGGTGACCGTGGGCGAGACAGTGGCCTTTCAGTGTAAGGCCACCGGCTCTCCAACACCAAGGATCACCTGGCTGAAGGGCGGCAGGCCCCTGAGCCTGACAGAGCGCCACCACTTCACCCCTGGCAATCAGCTGCTGGTGGTGCAGAACGTGATGATCGATGACGCCGGCAGGTATACATGCGAGATGAGCAATCCTCTGGGCACCGAGAGGGCACACTCCCAGCTGTCTATCCTGCCTACCCCAGGCTGCCGGAAGGATGGCACCACA GGCGGTGGC GGATCC gaaccgaaatcttctgacaaaacccacacctctccgccgtctccggctccggaactgctgggtggttcttctgttttc atcttcccacccaagatcaaggacgtgctgatgatctccctgtcccccatcgtgacctgcgtggtggtggacgtgtccgaggacgaccccgacgtgcagatcagttggttcgtgaacaacgtggaagtgcacaccgcccagacccagacccacagagaggactacaactccaccctgcgggtggtgtccgccctgcccatccagcaccaggactggatgtccggcaaagaattcaagtgcaaagtgaacaacaaggacctgcctgcccccatcgagcggaccatctccaagcccaagggctccgtgcgggctccccaggtgtacgtgctgccccctccagaggaagagatgaccaagaagcaggtcacactgacctgcatggtcaccgacttcatgcccgaggacatctacgtggaatggaccaacaatggcaagaccgagctgaactacaagaacaccgagcctgtgctggactccgacggctcctacttcatgtactccaagctgcgggtggaaaagaagaactgggtcgagcggaactcctactcctgctccgtggtgcacgagggcctgcacaaccaccacaccaccaagtccttctcccggacccccggcaaa |
본 발명에서 제공하는 상기의 융합 단백질은 이펙터 T 세포(effector T cell)에 존재하는 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포(regulatory T cell, Treg cell)와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있다.
본 발명의 다른 구현 예에 따르면, 본 발명에서 제공하는 상기 융합 단백질을 코딩하는 핵산 분자를 제공한다.
본 발명의 핵산 분자는 본 발명에서 제공하는 융합 단백질의 아미노산 서열을 당업자에게 알려진 바와 같이 폴리뉴클레오티드 서열로 번역된 핵산 분자 모두를 포함한다. 그러므로 ORF(open reading frame)에 의한 다양한 폴리뉴클레오티드 서열이 제조될 수 있으며 이 또한 모두 본 발명의 핵산 분자에 포함된다.
본 발명의 바람직한 일 예시로, 상기 핵산 분자는 서열번호 17 내지 20, 25 내지 28, 33 내지 36, 41 내지 44, 49 내지 52, 57 내지 60, 65 내지 68, 73 내지 76 및 81 내지 84 중 어느 하나로 표시될 수 있으나, 이에 제한되는 것은 아니다.
본 발명의 또 다른 구현 예에 따르면, 본 발명에서 제공하는 상기 단리된 핵산 분자가 삽입된 발현 벡터를 제공한다.
본 발명에서 상기 "벡터"는 어떤 핵산 분자가 연결된 또 다른 핵산을 수송할 수 있는 상기 핵산 분자이다. 벡터의 한 가지 유형은, 추가적인 DNA 세그멘트가 결찰될 수 있는 원형 이중가닥 DNA를 가리키는 "플라스미드"이다. 또 다른 유형의 벡터는 파지 벡터이다. 또 다른 유형의 벡터는 바이러스성 벡터로, 추가적인 DNA 세그멘트가 바이러스 게놈에 결찰될 수 있다. 어떤 벡터들은 그들이 유입된 숙주세포에서 자율적인 복제를 할 수 있다(예컨대, 박테리아성 벡터는 박테리아성 복제 기원을 갖는 에피솜 포유류 벡터). 기타 벡터(예컨대, 비-에피솜 포유류 벡터)는 숙주세포에 유입되면서 숙주세포의 게놈에 통합될 수 있고, 그럼으로써, 숙주 게놈과 함께 복제된다. 뿐만 아니라, 어떤 벡터는 이들이 작동차원에서 연결된 유전자의 발현을 지시할 수 있다. 이와 같은 벡터는 본원에서 "재조합 발현 벡터" 또는 단순히 "발현 벡터"라 명명된다. 일반적으로 재조합 DNA 기법에서 유용한 발현 벡터는 종종 플라스미드의 형태로 존재한다. 본 명세서에서, "플라스미드"와 "벡터"는, 플라스미드가 벡터 중 가장 통상적으로 사용되는 형태이기 때문에, 상호 교환하여 사용될 수 있다.
본 발명에서 상기 발현 벡터의 구체적인 예시로는 상업적으로 널리 사용되는 pCDNA 벡터, F, R1, RP1, Col, pBR322, ToL, Ti 벡터; 코스미드; 람다, 람도이드(lambdoid), M13, Mu, p1 P22, Qμ, T-even, T2, T3, T7 등의 파아지; 식물 바이러스로 이루어진 군으로부터 선택될 수 있으나, 이에 제한되는 것은 아니며, 당업자에게 발현 벡터로 알려진 모든 발현 벡터는 본 발명에 사용 가능하고, 발현 벡터를 선택할 때에는 목적으로 하는 숙주 세포의 성질에 따른다. 숙주세포로의 벡터 도입 시 인산칼슘 트랜스펙션, 바이러스 감염, DEAE-덱스트란 조절 트랜스펙션, 리포펙타민 트랜스펙션 또는 전기천공법에 의해 수행될 수 있으나 이에 한정되는 것은 아니며 당업자는 사용하는 발현 벡터 및 숙주 세포에 알맞은 도입 방법을 선택하여 이용할 수 있다. 바람직하게 벡터는 하나 이상의 선별 마커를 함유하나 이에 한정되지 않으며, 선별 마커를 포함하지 않은 벡터를 이용하여 생산물 생산 여부에 따라 선별이 가능하다. 선별 마커의 선택은 목적하는 숙주 세포에 의해 선별되며, 이는 이미 당업자에게 알려진 방법을 이용하므로 본 발명은 이에 제한을 두지 않는다.
본 발명의 핵산 분자를 정제를 용이하게 하기 위하여 태그 서열을 발현 벡터 상에 삽입하여 융합시킬 수 있다. 상기 태그로는 헥사-히스티딘 태그, 헤마글루티닌 태그, myc 태그 또는 flag 태그를 포함하나 이에 한정되는 것은 아니며 당업자에게 알려진 정제를 용이하게 하는 태그는 모두 본 발명에서 이용 가능하다.
본 발명의 또 다른 구현 예에 다르면, 본 발명에서 제공하는 상기 발현 벡터가 형질 감염된 숙주 세포주를 제공한다.
본 발명에서 상기 "숙주 세포"에는 폴리펩타이드 삽입물의 편입을 위한 벡터(들)의 수령자(recipient)일 수 있거나 또는 수령자였던 개별적인 세포 또는 세포배양물이 포함된다. 숙주 세포에는 단일 숙주 세포의 자손이 포함되고, 상기 자손은 자연적인, 우발적인 또는 고의의 돌연변이 때문에 반드시 원래 모세포와 완전히 동일(형태학상 또는 게놈 DNA 보완체에서)하지 않을 수 있다. 숙주 세포에는 본원의 폴리펩타이드(들)로 체내에서 형질주입된 세포가 포함된다.
본 발명에 있어서, 상기 숙주 세포로는 포유동물, 식물, 곤충, 균류 또는 세포성 기원의 세포를 포함할 수 있고, 예를 들면 대장균, 스트렙토미세스, 살모넬라 티피뮤리움 등의 박테리아 세포; 효모 세포, 피치아 파스 토리스 등의 균류세포; 드로조필라, 스포도프테라 Sf9 세포 등의 곤충 세포; CHO(중국 햄스터 난소 세포, Chinese hamster ovary cells), SP2/0(생쥐 골수종), 인간 림프아구(Human lymphoblastoid), COS, NSO(생쥐 골 수종), 293T, 보우 멜라노마 세포, HT-1080, BHK(베이비 햄스터 신장세포, Baby Hamster Kidney cells), HEK(인간 배아신장 세포, Human Embryonic Kidney cells) 또는 PERC.6(인간 망막 세포)의 동물 세포; 또는 식물 세포일 수 있으나, 이에 제한되는 것은 아니며, 당업자에게 알려진 숙주 세포주로 사용 가능한 세포는 모두 이용 가능하다.
본 발명의 또 다른 구현 예에 따르면, 본 발명에서 제공하는 융합 단백질을 유효 성분으로 포함하는 암의 예방 또는 치료용 약학 조성물을 제공한다.
본 발명에서 상기 "암" 포유류에서 전형적으로 조절되지 않는 세포 성장으로 특징 지어진 생리적 상태를 나타내거나 가리킨다. 본 발명에서 예방, 개선 또는 치료의 대상이 되는 암은 고형 장기(solid organ)에서 비정상적으로 세포가 성장하여 발생한 덩어리로 이루어진 고형암(solid tumor)일 수 있고, 고형 장기의 부위에 따라 위암, 간암, 교세포종, 난소암, 대장암, 두경부암, 방광암, 신장세포암, 유방암, 전이암, 전립선암, 췌장암, 흑색종 또는 폐암 등일 수 있으며, 바람직하게는 흑색종 또는 대장암일 수 있으나, 이에 제한되는 것은 아니다.
한편, 본 발명에서, "예방"은 본 발명의 약학 조성물을 이용하여 질환의 증상을 차단하거나, 그 증상을 억제 또는 지연시키는 모든 행위라면 제한없이 포함할 수 있다.
또한, 본 발명에서, "치료"는 본 발명의 약학 조성물을 이용하여 질환의 증상이 호전되거나 이롭게 되는 모든 행위라면 제한없이 포함할 수 있다.
본 발명에서 상기 약학 조성물은 캡슐, 정제, 과립, 주사제, 연고제, 분말 또는 음료 형태임을 특징으로 할 수 있으며, 상기 약학 조성물은 인간을 대상으로 하는 것을 특징으로 할 수 있다.
본 발명에서 상기 약학 조성물은 이들로 한정되는 것은 아니지만, 각각 통상의 방법에 따라 산제, 과립제, 캡슐, 정제, 수성 현탁액 등의 경구형 제형, 외용제, 좌제 및 멸균 주사 용액의 형태로 제형화하여 사용될 수 있다. 본 발명의 약학 조성물은 약학적으로 허용 가능한 담체를 포함할 수 있다. 약학적으로 허용되는 담체는 경구 투여 시에는 결합제, 활탁제, 붕해제, 부형제, 가용화제, 분산제, 안정화제, 현탁화제, 색소, 향료 등을 사용할 수 있으며, 주사제의 경우에는 완충제, 보존제, 무통화제, 가용화제, 등장제, 안정화제 등을 혼합하여 사용할 수 있으며, 국소투여용의 경우에는 기제, 부형제, 윤활제, 보존제 등을 사용할 수 있다. 본 발명의 약학 조성물의 제형은 상술한 바와 같은 약제학적으로 허용되는 담체와 혼합하여 다양하게 제조될 수 있다. 예를 들어, 경구 투여시에는 정제, 트로키, 캡슐, 엘릭서(elixir), 서스펜션, 시럽, 웨이퍼 등의 형태로 제조할 수 있으며, 주사제의 경우에는 단위 투약 앰플 또는 다수회 투약 형태로 제조할 수 있다. 기타, 용액, 현탁액, 정제, 캡슐, 서방형 제제 등으로 제형화할 수 있다.
한편, 제제화에 적합한 담체, 부형제 및 희석제의 예로는, 락토즈, 덱스트로즈, 수크로즈, 솔비톨, 만니톨, 자일리톨, 에리스리톨, 말디톨, 전분, 아카시아 고무, 알지네이트, 젤라틴, 칼슘 포스페이트, 칼슘 실리케이트, 셀룰로즈, 메틸 셀룰로즈, 미정질 셀룰로즈, 폴리비닐피롤리돈, 물, 메틸하이드록시벤조에이트, 프로필하이드록시벤조에이트, 탈크, 마그네슘 스테아레이트 또는 광물유 등이 사용될 수 있다. 또한, 충진제, 항 응집제, 윤활제, 습윤제, 향료, 유화제, 방부제 등을 추가로 포함할 수 있다.
본 발명에 상기 약학 조성물의 투여 경로는 이들로 한정되는 것은 아니지만 구강, 정맥내, 근육내, 동맥내, 골수내, 경막내, 심장내, 경피, 피하, 복강내, 비강내, 장관, 국소, 설하 또는 직장이 포함된다. 경구 또는 비경구 투하가 바람직하다.
본 발명에서 상기 "비경구"란, 피하, 피내, 정맥내, 근육내, 관절내, 활액낭내, 흉골내, 경막내, 병소내 및 두개골내 주사 또는 주입기술을 포함한다. 본 발명의 약학 조성물은 또한 직장 투여를 위한 좌제의 형태로 투여될 수 있다.
본 발명의 상기 약학 조성물은 사용된 특정 화합물의 활성, 연령, 체중, 일반적인 건강, 성별, 정식, 투여 시간, 투여 경로, 배출율, 약물 배합 및 예방 또는 치료될 특정 질환의 중증을 포함한 여러 요인에 따라 다양하게 변할 수 있고, 상기 약학 조성물의 투여량은 환자의 상태, 체중, 질병의 정도, 약무 형태, 투여 경로 및 기간에 따라 다르지만 당업자에 의해 적절하게 선택될 수 있고, 1일 0.0001 내지 50mg/kg 또는 0.001 내지 50mg/kg으로 투여할 수 있다. 투여는 하루에 한번 투여할 수도 있고, 수회 나누어 투여할 수도 있다. 상기 투여량은 어떠한 면으로든 본 발명의 범위를 한정하는 것은 아니다. 본 발명에 따른 의약 조성물은 환제, 당의정, 캡슐, 액제, 겔, 시럽, 슬러리, 현탁제로 제형화될 수 있다.
본 발명의 또 다른 구현 예에 따르면, 본 발명에 따른 융합 단백질 또는 이를 포함하는 조성물을 개체에 투여하는 단계를 포함하는 암의 예방 또는 치료 방법에 관한 것이다.
본 발명의 융합 단백질은 이펙터 T 세포(effector T cell)에 존재하는 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포(Treg cell)와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있다.
본 발명에서 상기 "개체"는 암의 발병이 의심되는 개체로서, 상기 암 발병의 의심 개체는 해당 질환이 발병하였거나 발병할 수 있는 인간을 포함한 쥐, 가축 등을 포함하는 포유동물을 의미하나, 본 발명의 융합 단백질 또는 이를 포함하는 상기 조성물로 치료 가능한 개체는 제한 없이 포함된다.
본 발명의 방법은 융합 단백질 또는 이를 포함하는 조성물을 약학적 유효량으로 투여하는 것을 포함할 수 있다. 적합한 총 1일 사용량은 올바른 의학적 판단범위 내에서 처치의에 의해 결정될 수 있으며, 1회 또는 수회로 나누어 투여할 수 있다. 그러나 본 발명의 목적상, 특정 환자에 대한 구체적인 치료적 유효량은 달성하고자 하는 반응의 종류와 정도, 경우에 따라 다른 제제가 사용되는지의 여부를 비롯한 구체적 조성물, 환자의 연령, 체중, 일반 건강 상태, 성별 및 식이, 투여 시간, 투여 경로 및 조성물의 분비율, 치료기간, 구체적 조성물과 함께 사용되거나 동시 사용되는 약물을 비롯한 다양한 인자와 의약 분야에 잘 알려진 유사 인자에 따라 다르게 적용하는 것이 바람직하다.
한편, 이에 제한되지 않으나, 상기 암의 예방 또는 치료 방법은 하나 이상의 암 질환에 대한 치료적 활성을 가지는 화합물 또는 물질을 투여하는 것을 더 포함하는 병용 요법일 수 있다.
본 발명에서 상기 "병용"은 동시, 개별 또는 순차 투여를 나타내는 것으로 이해되어야 한다. 상기 투여가 순차 또는 개별적인 경우, 2차 성분 투여의 간격은 상기 병용의 이로운 효과를 잃지 않도록 하는 것이어야 한다.
본 발명에서 상기 융합 단백질의 투여 용량은 환자 체중 1 kg당 약 0.0001 μg 내지 500 mg일 수 있으나, 이에 제한되는 것은 아니다.
본 발명에서 제공하는 융합 단백질은 이펙터 T 세포(effector T cell)에 존재하는 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포(Treg cell)와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있다.
도 1은 본 발명의 일 실시예에 따른 Lrig-1 단백질의 구조를 나타낸 것이다.
도 2는 본 발명의 일 실시예에 따른 Lrig-1 단백질의 구조를 나타낸 것이다.
도 3은 본 발명의 일 실시예에 따른 Lrig-1 단백질의 항원 결정기(epitope)를 예측한 결과를 나타낸 것이다.
도 4는 본 발명의 일 실시예에 따른 Lrig-1 단백질의 항원 결정기(epitope)를 예측한 결과를 나타낸 것이다.
도 5는 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 6은 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 7은 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 8은 본 발명의 일 실시예에 따른 Lrig-1, Lrig-2 및 Lrig-3 mRNA의 발현 정도를 나타낸 것이다.
도 9는 본 발명의 일 실시예에 따른 조절 T 세포와 비-조절 T 세포 내 Lrig-1 단백질의 발현량 비교 결과를 나타낸 것이다.
도 10은 본 발명의 일 실시예에 따른 조절 T 세포의 표면에 Lrig-1 단백질의 발현을 나타낸 것이다.
도 11은 본 발명의 일 실시예에 따른 융합 단백질의 이펙터 T 세포에 대한 조절 T 세포의 억제능을 저해하는 효과를 나타낸 것이다.
도 12는 본 발명의 일 실시예에 따른 융합 단백질에 의해 인식되는 리간드가 존재하는 T 세포 서브세트(subset)를 나타낸 것이다.
도 13은 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 실험의 설계도를 나타낸 것이다.
도 14은 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 효과를 분석한 결과를 나타낸 것이다.
도 15는 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 효과를 분석한 결과를 나타낸 것이다.
도 2는 본 발명의 일 실시예에 따른 Lrig-1 단백질의 구조를 나타낸 것이다.
도 3은 본 발명의 일 실시예에 따른 Lrig-1 단백질의 항원 결정기(epitope)를 예측한 결과를 나타낸 것이다.
도 4는 본 발명의 일 실시예에 따른 Lrig-1 단백질의 항원 결정기(epitope)를 예측한 결과를 나타낸 것이다.
도 5는 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 6은 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 7은 본 발명의 일 실시예에 따른 Lrig-1 mRNA의 발현 정도를 나타낸 것이다.
도 8은 본 발명의 일 실시예에 따른 Lrig-1, Lrig-2 및 Lrig-3 mRNA의 발현 정도를 나타낸 것이다.
도 9는 본 발명의 일 실시예에 따른 조절 T 세포와 비-조절 T 세포 내 Lrig-1 단백질의 발현량 비교 결과를 나타낸 것이다.
도 10은 본 발명의 일 실시예에 따른 조절 T 세포의 표면에 Lrig-1 단백질의 발현을 나타낸 것이다.
도 11은 본 발명의 일 실시예에 따른 융합 단백질의 이펙터 T 세포에 대한 조절 T 세포의 억제능을 저해하는 효과를 나타낸 것이다.
도 12는 본 발명의 일 실시예에 따른 융합 단백질에 의해 인식되는 리간드가 존재하는 T 세포 서브세트(subset)를 나타낸 것이다.
도 13은 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 실험의 설계도를 나타낸 것이다.
도 14은 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 효과를 분석한 결과를 나타낸 것이다.
도 15는 본 발명의 일 실시예에 따른 융합 단백질을 이용한 암 치료 효과를 분석한 결과를 나타낸 것이다.
이하, 실시예를 통하여 본 발명을 더욱 상세히 설명하고자 한다. 이들 실시예는 오로지 본 발명을 보다 구체적으로 설명하기 위한 것으로서, 본 발명의 요지에 따라 본 발명의 범위가 이들 실시예에 의해 제한되지 않는다는 것은 당업계에서 통상의 지식을 가진 자에게 있어서 자명할 것이다.
실시예
[준비예 1] T 세포 아형 세포 배양
조절 T 세포(Treg)에서만 Lrig-1 단백질이 발현되는지 확인하기 위하여, T 세포의 아형(subset)인 Th0, Th1, Th2, Th17 및 iTreg을 준비하였다. 상기 iTreg은 자연적으로 분리한 nTreg과는 달리 하기 조성을 포함하는 배지에서 분화를 인공적으로 유도한 세포를 의미한다.
T 세포의 아형은 우선, 쥐의 비장으로부터 얻은 나이브(naive) T 세포를 분리한 뒤, 우태아혈청(FBS; hyclone, logan, UT) 10%를 포함하는 RPMI1640(Invitrogen Gibco, Grand Island, NY) 영양배지에 하기 표 24의 성분을 각각 더 포함하도록 하여, 37℃, 5 % CO2 배양기 내에서 72시간 배양을 통해 각각의 세포로 분화 유도하였다.
분화 세포 | 조성 |
Th0 | anti-CD3, anti-CD28 |
Th1 | IL-12, anti-IL-4 항체 |
Th2 | IL-4, anti-IFNβ |
Th17 | IL-6, TGFβ, anti-IFNβ, anti-IL-4 |
iTreg | IL-2, TGFβ |
[실시예 1] Lrig-1 구조 분석
조절 T 세포의 표면 단백질인 Lrig-1 단백질의 세포 외 도메인을 포함하는 융합 단백질을 제작하기 위하여 Lrig-1 단백질의 세포 외 도메인의 3차원 입체 구조를 예측하였다.
우선, 항원 결정기(Epitope) 염기서열 예측을 위해 Lrig-1 단백질의 세포 외 도메인(Extracellular domain; ECD)의 구조를 확인하기 위하여 Uniprot(http://www.uniprot.org)과 RCSB Protein Data Bank (http://www.rcsb.org/pdb) 툴을 이용하여 3차원 입체 구조를 예측한 뒤, 그 결과를 도 1 및 2에 나타내었다.
도 1에서 보는 바와 같이, Lrig-1 단백질의 세포 외 도메인 중 Lrig-LRR 도메인(아미노산 서열 41 ~ 494번) 내에는 LRR1 내지 LRR15의 총 15개의 류신 리치 부위(Leucine rich region)가 존재하였다. 상기 LRR 도메인 각각은 23 내지 27개의 아미노산으로 구성되고, 류신은 3 내지 5개가 존재하였다.
또한, 도 2에서 보는 바와 같이, Lrig-1 단백질의 세포외 도메인 중 Lrig-1 단백질의 아미노산 서열 494 내지 781번에는 면역글로불린 유사 도메인(Immunoglobulin-like domain)이 3개 존재하였다.
[실시예 2] Lrig-1 항원 결정기(epitope) 아미노산 서열 예측
상기 염기서열의 예측은 Lrig-1 단백질의 구조를 기반으로 하는 항원 결정기 예측 소프트웨어(epitope prediction software)인 Ellipro 서버(http://tools.iedb.org/ellipro/)를 이용하였다. 상기 Ellipro 검색 엔진은 현존하는 항원 결정기를 예측하는 알고리즘 중에서 가장 신뢰도가 높다고 알려진 검색엔진에 해당하여 이를 이용하였다.
항원 결정기 예측 소프트웨어에 상기 실시예 1에서 분석된 세포외 도메인을 입력한 뒤, 예측된 항원 결정기의 예측된 연속 또는 불연속 아미노산 서열을 도 3 및 4에 나타내었다.
도 3 및 4에서 보는 바와 같이, 연속된 항원 결정기 아미노산 서열은 총 22개가 예측되었고, 불연속된 항원 결정기 아미노산 서열은 총 8개가 예측되었다.
[실시예 3] Lrig-1 mRNA의 조절 T 세포에서의 특이적 발현 확인
Lrig-1 단백질이 조절 T 세포에 특이적인 바이오마커(biomarker)로 작용할 수 있는지 검증하였다.
상기 검증을 위하여, 쥐의 비장으로부터 CD4 비드를 통해 자석 활성 세포 분류기(magnet-activated cell sorting; MACS)를 이용하여 CD4+ T 세포를 분리하였다. 이후, CD25 항체를 이용하여 형광 활성 세포 분류기(FACS)를 이용해 조절 T (CD4+CD25+ T) 세포 및 비 조절 T (CD4+CD25- T) 세포를 분리하였다. 각각의 세포 및 상기 준비예 1에서 분화된 세포는 트리졸(Trizol)을 이용하여 mRNA를 추출한 뒤, 게노믹 RNA는 gDNA 추출 키트(Qiagen)를 이용하여 업체에서 제공한 프로토콜에 의해 gDNA를 제거하였다. gDNA가 제거된 mRNA는 BDsprint cDNA 합성 키트 (Clonetech)를 통해 cDNA로 합성하였다.
상기 cDNA에서 Lrig-1 mRNA의 발현량을 정량적으로 확인하기 위하여 실시간 중합효소연쇄반응(real time PCR)을 수행하였다.
상기 실시간 중합효소연쇄반응은 SYBR Green (Molecular Probes)을 이용하여 업체에서 제공한 프로토콜에 의해 95℃에서 3분, 61℃에서 15초, 72℃에서 30초씩 40 사이클의 조건으로, 하기 표 25의 프라이머를 이용하여 수행하였고, 상대적인 유전자 발현량은 ΔCT 방법을 이용하여 계산하였으며, HPRT를 이용하여 일반화(normalization) 하여, 그 결과를 도 5 내지 8에 나타내었다.
프라이머 | 서열 |
쥐 Lrig-1 | Forward 5' - GAC GGA ATT CAG TGA GGA GAA CCT - 3' |
Reverse 5' - CAA CTG GTA GTG GCA GCT TGT AGG - 3' | |
쥐 Lrig-2 | forward 5' - TCA CAA GGA ACA TTG TCT GAA CCA- 3' |
reverse 5' - GCC TGA TCT AAC ACA TCC TCC TCA- 3' | |
쥐 Lrig-3 | forward 5' - CAG CAC CTT GAG CTG AAC AGA AAC - 3' |
reverse 5' - CCA GCC TTT GGT AAT CTC GGT TAG - 3' | |
쥐 FOXP3 | forward 5' - CTT TCA CCT ATC CCA CCC TTA TCC - 3' |
reverse 5' - ATT CAT CTA CGG TCC ACA CTG CTC - 3' | |
ACTG1 | forward 5' - GGC GTC ATG GTG GGC ATG GG - 3' |
reverse 5' - ATG GCG TGG GGA AGG GCG TA - 3' |
도 5에서 보는 바와 같이, 비-조절 T 세포(CD4+CD25- T cell)에 비하여 조절 T 세포(CD4+CD25+ T cell)에서 Lrig-1의 발현이 18.1배 높은 것을 알 수 있다. 이는, 기존에 알려져 있는 조절 T 세포의 마커인 Lag3 및 Ikzf4와 비교하였을 때 약 10배 정도 발현양이 높은 수준이었다.
또한, 도 6 및 7에서 보는 바와 같이, 다른 종류의 면역 세포에 비하여 조절 T 세포, 특히 유도된 조절 T 세포(iTreg)에 비해 자연적으로 분리한 조절 T 세포(nTreg)에서 Lrig-1 mRNA의 발현이 현저하게 높았다.
또한, 도 8에서 보는 바와 같이, Lrig 패밀리에 해당하는 Lrig-1, Lrig-2 및 Lrig-3 중에서 Lrig-1의 발현이 가장 높았다.
상기 결과를 통해 본 발명에 따른 Lrig-1 단백질은 조절 T 세포, 특히 자연적으로 존재하는 조절 T 세포에서 특이적으로 발현하는 것을 알 수 있다.
[실시예 4] Lrig-1 단백질의 조절 T 세포에서의 특이적 발현 확인
Lrig-1 mRNA로부터 발현된 Lrig-1 단백질이 조절 T 세포에서만 특이적으로 발현되는지 확인하였다.
조절 T 세포 특이적인 전사 인자인 FOXP3 프로모터에 RFP(Red fluorescence protein)이 결합된 FOXP3-RFP 주입(Knock-in) 쥐를 이용하여, 상기 쥐의 비장으로부터 CD4 비드로 자석 활성 세포 분류기(magnet-activated cell sorting; MACS)를 이용하여 CD4+ T 세포를 분리하였다. 이후, RFP 단백질을 이용하여, 형광 활성 세포 분류기(FACS)를 통해 조절 T 세포(CD4+RFP+ T cell) 및 비-조절 T 세포(CD4+RFP- T cell)를 분리하여 얻었다. 각각의 상기 세포는 구입한 Lrig-1 항체 및 음성 대조군은 아이소타입(isotype)을 통해 염색하여 형광 활성 세포 분류기로 Lrig-1의 발현량을 측정하여, 그 결과를 도 9에 나타내었다.
도 9에서 보는 바와 같이, 점선으로 표시되는 비-조절 T 세포의 경우 음성 대조군과 거의 동일한 Lrig-1의 발현 수준을 보였지만, 조절 T 세포의 경우 Lrig-1의 발현 수준이 높은 세포가 다수 존재하였다.
상기 결과를 통해 본 발명에 따른 Lrig-1 단백질은 조절 T 세포에서 특이적으로 발현하는 것을 알 수 있다.
[실시예 5] 조절 T 세포 표면에서의 Lrig-1 단백질 특이적 발현 확인
Lrig-1 단백질이 세포 치료의 타겟이 되기 위해서는 조절 T 세포의 표면에서 발현되어야 더욱 효과적으로 타겟 치료를 할 수 있으므로, Lrig-1 단백질이 표면에서의 발현 여부를 확인하였다.
상기 준비예 1의 각각의 분화된 T 세포 아형을 항-CD4-APC 및 항 Lrig-1-PE 항체로 염색하고, 형광 이용 세포 분류기(Fluorescence-Activated Cell Sorter; FACS)를 이용하여 각각의 세포 표면에서 Lrig-1의 발현량을 측정하여, 그 결과를 도 10에 나타내었다.
도 10에서 보는 바와 같이, 활성화된 T 세포(activated T cell), Th1 세포, Th2 세포, Th17 세포 및 나이브(Naive) T 세포에서는 Lrig-1의 발현이 0.77 내지 15.3의 양으로 발현되는 반면, 분화가 유도된 T 세포(iTreg)에서는 83.9로 높게 발현되었다.
상기 결과를 통해 본 발명에 따른 Lrig-1 단백질은 조절 T 세포(Treg) 세포에서 특이적으로 발현될 뿐만 아니라, 특히 상기 조절 T 세포의 표면에서 더욱 높게 발현되는 것을 알 수 있다.
[제조예] 융합 단백질의 제작
1. 발현 벡터의 제작
본 발명에 따른 융합 단백질을 제조하기 위하여, 하기 표 26의 각각의 융합 단백질을 코딩하는 각 핵산 서열을 합성하였다. 핵산 서열의 5' 말단과 3' 말단에 각각 NheI과 EcoRI 제한효소 서열을 첨가하였고, 5' 말단의 제한효소 서열 뒤에 코작 서열(Kozak's sequence)(GCCACC)과, 단백질 번역을 위한 시작 코돈과 발현된 단백질을 세포 밖으로 분비하게 하는 마우스 IgG 카파 경쇄 시그널 펩타이드(Mouse kappa light chain signal peptide)(ATGGAAACCGATACTCTGCTGCTGTGGGTGCTGCTGCTGTGGGTGCCAGGCTCTACCGGG)를 삽입하였다. 이후 하기 표 26의 각각의 융합 단백질을 코딩하는 핵산 서열로, 인간 유래 Lrig-1 단백질의 세포 외 도메인; 선택적으로 링커; 힌지 영역; 인간 IgG1 유래의 중쇄 불변영역 2(CH2) 및 중쇄 불변영역 3(CH3);을 코딩화하는 핵산 서열 뒤에는 종결 코돈을 삽입하였다. NheI 과 EcoRI 두 제한효소 서열을 이용하여 본 발명의 융합 단백질을 코딩하는 핵산 서열을 pcDNA3.1(+) 발현 벡터에 클로닝하였다.
연번 | 융합 단백질의 아미노산 서열 | 융합 단백질의 핵산 서열 |
제조예 1 | 서열번호 13 | 서열번호 17 |
제조예 2 | 서열번호 14 | 서열번호 18 |
제조예 3 | 서열번호 15 | 서열번호 19 |
제조예 4 | 서열번호 16 | 서열번호 20 |
제조예 5 | 서열번호 21 | 서열번호 25 |
제조예 6 | 서열번호 22 | 서열번호 26 |
제조예 7 | 서열번호 23 | 서열번호 27 |
제조예 8 | 서열번호 24 | 서열번호 28 |
제조예 9 | 서열번호 29 | 서열번호 33 |
제조예 10 | 서열번호 30 | 서열번호 34 |
제조예 11 | 서열번호 31 | 서열번호 35 |
제조예 12 | 서열번호 32 | 서열번호 36 |
제조예 13 | 서열번호 37 | 서열번호 41 |
제조예 14 | 서열번호 38 | 서열번호 42 |
제조예 15 | 서열번호 39 | 서열번호 43 |
제조예 16 | 서열번호 40 | 서열번호 44 |
제조예 17 | 서열번호 45 | 서열번호 49 |
제조예 18 | 서열번호 46 | 서열번호 50 |
제조예 19 | 서열번호 47 | 서열번호 51 |
제조예 20 | 서열번호 48 | 서열번호 52 |
제조예 21 | 서열번호 53 | 서열번호 57 |
제조예 22 | 서열번호 54 | 서열번호 58 |
제조예 23 | 서열번호 55 | 서열번호 59 |
제조예 24 | 서열번호 56 | 서열번호 60 |
제조예 25 | 서열번호 61 | 서열번호 65 |
제조예 26 | 서열번호 62 | 서열번호 66 |
제조예 27 | 서열번호 63 | 서열번호 67 |
제조예 28 | 서열번호 64 | 서열번호 68 |
제조예 29 | 서열번호 69 | 서열번호 73 |
제조예 30 | 서열번호 70 | 서열번호 74 |
제조예 31 | 서열번호 71 | 서열번호 75 |
제조예 32 | 서열번호 72 | 서열번호 76 |
제조예 33 | 서열번호 77 | 서열번호 81 |
제조예 34 | 서열번호 78 | 서열번호 82 |
제조예 35 | 서열번호 79 | 서열번호 83 |
제조예 36 | 서열번호 80 | 서열번호 84 |
2. 융합 단백질의 정제
293F 세포에 폴리에틸렌이민(Polyethylenimine)을 이용하여 상기 1.에서 제작한 발현 벡터를 형질 전환시킨 뒤 37 ℃ 및 8% CO2 조건 하에서 6일 동안 배양하였다. 배양 상청액을 여과한 뒤 단백질 A 레진(Protein A resin)에 흘려준 후 1X PBS 로 세척하였다. pH 3.5의 0.1M 글리신(glycine) 용액을 이용하여 용출한 뒤 얻어진 용액에 pH 9.0의 1M TRIS 용액을 첨가하여 pH 중화하였다. 이후 용액을 투석하여 PBS 용액 상태로 만든 후 농축하여 사용하였다.
[실시예 6] 본 발명에 따른 융합 단백질에 의한, 조절 T 세포의 이펙터 T 세포에 대한 억제능(suppression activity) 저해 효과
상기 제조예 5에서 제작된 본 발명에 따른 서열번호 21로 표시되는 융합 단백질(서열번호 25로 표시되는 핵산 서열을 이용한 제작)이 Lrig-1 단백질에 대한 리간드(ligand)와 결합하여, 상기 리간드와 조절 T 세포 사이의 상호 작용을 저해함에 따라 최종적으로 조절 T 세포의 이펙터 T 세포에 대한 증식 억제능을 감소시키는지 확인하기 위하여 하기의 실험을 수행하였다. 구체적으로는, 조절 T 세포와 이펙터 T 세포를 공동 배양(coculture)하는 조건에서 상기 제조예 5의 융합 단백질을 첨가하여, 조절 T 세포의 이펙터 T 세포에 대한 증식 억제능의 변화를 확인하였다. 그 결과는 도 11에 나타내었다.
도 11에서 보는 바와 같이, 본 발명에 따른 융합 단백질을 처리하자 이펙터 T 세포에 대한 조절 T 세포의 증식 억제능이 저하된 것을 확인할 수 있었다.
이를 통하여 본 발명에 따른 융합 단백질은 Lrig-1 단백질에 대한 리간드와 상호 작용하여, Lrig-1 단백질을 표면에 포함하고 있는 조절 T 세포와 상기 리간드 사이의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시키는 것을 알 수 있었다.
[실시예 7]
본 발명에 따른 융합 단백질에 의해 인식되는 Lrig-1 리간드의 분포 확인
상기 실시예 6에서 보는 바와 같이, 상기 제조예 5에서 제작된 본 발명에 따른 서열번호 21로 표시되는 융합 단백질(서열번호 25로 표시되는 핵산 서열을 이용한 제작)은 Lrig-1 리간드를 인식할 수 있는 것을 알 수 있으므로, 상기 Lrig-1의 리간드가 존재하는 면역 세포들을 발굴하기 위하여, 조절 T 세포의 타겟으로 생각되는 나이브 T 세포(naive T cell)로부터, 활성화된 T 세포, Th1 세포, Th2 세포 또는 Th17 세포로 분화를 유도시킨 뒤 이들 세포를 상기 제조예 5의 융합 단백질(1차 항체) 및 항-인간-PE 항체(2차 항체)로 염색(staining) 하였다. 형광 이용 세포 분류기(Fluorescence-Activated Cell Sorter; FACS)를 이용하여 각각의 세포 표면에서 상기 융합 단백질의 발현량을 측정하여, 그 결과를 도 12에 나타내었다.
도 12에서 보는 바와 같이, 본 발명에 따른 융합 단백질은 나이브 T 세포의 표면에 있는 항원(antigen)은 거의 염색시키지 않았고(0.71%), 활성화된 T 세포, Th1 세포, Th2 세포 및 Th17 세포의 표면에 있는 항원은 96% 이상을 염색시켰다.
이를 통하여 Lrig-1의 리간드는 T 세포 리셉터(receptor)의 자극에 의하여 유도되는 표면 단백질인 것을 알 수 있었다.
[실시예 8] 본 발명에 따른 융합 단백질의 암 치료 효과
상기 제조예 5에서 제작된 본 발명에 따른 서열번호 21로 표시되는 융합 단백질(서열번호 25로 표시되는 핵산 서열을 이용한 제작)의 고형암에 대한 치료 효과를 확인하기 위하여, 도 13에서 보는 바와 같이, B16F10 흑색종 세포(melanoma cell)를 마우스 등에 3X105 세포의 양으로 피하 주사(subcutaneous injection)한 뒤, 4일, 8일, 12일 째에 상기 제조예 5의 융합 단백질을 200 ug의 양으로 복강 내 주사하였다. 상기 멜라노마 세포 이식 후 시간의 경과에 따른 종양의 부피 변화를 측정하여 그 결과를 도 14에 나타내었다.
도 14에서 보는 바와 같이, 본 발명에 따른 융합 단백질을 처리한 경우 아무런 처리하지 않은 음성 대조군에 비하여 흑색종 종양의 크기가 현저히 감소한 것을 확인할 수 있었다.
[실시예 9] 본 발명에 따른 융합 단백질의 암 치료 효과
상기 제조예 34에서 제작된 본 발명에 따른 서열번호 78로 표시되는 융합 단백질(서열번호 82로 표시되는 핵산 서열을 이용한 제작)의 고형암에 대한 치료 효과를 확인하기 위하여, CT-26 대장암 세포를 마우스 등에 3X105 세포의 양으로 피하 주사(subcutaneous injection)한 뒤, 4일, 8일, 12일 째에 상기 제조예 34의 융합 단백질을 200 ug의 양으로 복강 내 주사하였다. 상기 대장암 세포 이식 후 시간의 경과에 따른 종양의 부피 변화를 측정하여 그 결과를 도 15에 나타내었다.
도 15에서 보는 바와 같이, 본 발명에 따른 융합 단백질을 처리한 경우 아무런 처리하지 않은 음성 대조군에 비하여 대장암 종양의 크기가 현저히 감소한 것을 확인할 수 있었다.
이를 통하여, 본 발명에 따른 Lrig-1 단백질의 세포 외 도메인 및 면역글로불린 Fc 영역을 포함하는 융합 단백질은 조절 T 세포와 이펙터 T 세포간의 상호 작용을 저해함에 따라 조절 T 세포의 활성을 억제하고 이펙터 T 세포의 활성은 유지 혹은 상승시켜 암 세포 중에서도 특히 고형암 세포의 성장을 효과적으로 억제할 수 있는 것을 알 수 있다.
이상에서 본 발명에 대하여 상세하게 설명하였지만 본 발명의 권리범위는 이에 한정되는 것은 아니고, 청구범위에 기재된 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 다양한 수정 및 변형이 가능하다는 것은 당 기술분야의 통상의 지식을 가진 자에게는 자명할 것이다.
<110> Good T Cells, Inc.
<120> NOVEL FUSION PROTEIN AND PHARMACEUTICAL COMPOSITION FOR
PREVENTING OR TREATING CANCER COMPRISING THE SAME
<130> PDPB187151k01
<150> KR 10-2018-0048343
<151> 2018-04-26
<160> 84
<170> KoPatentIn 3.0
<210> 1
<211> 762
<212> PRT
<213> Homo sapiens
<400> 1
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe
755 760
<210> 2
<211> 2286
<212> DNA
<213> Homo sapiens
<400> 2
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttc 2286
<210> 3
<211> 760
<212> PRT
<213> Mus musculus
<400> 3
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr
755 760
<210> 4
<211> 2280
<212> DNA
<213> Mus musculus
<400> 4
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
2280
<210> 5
<211> 206
<212> PRT
<213> Homo sapiens
<400> 5
Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro
1 5 10 15
Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val
20 25 30
Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr
35 40 45
Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val
50 55 60
Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys
65 70 75 80
Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser
85 90 95
Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro
100 105 110
Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val
115 120 125
Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly
130 135 140
Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp
145 150 155 160
Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp
165 170 175
Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His
180 185 190
Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
195 200 205
<210> 6
<211> 206
<212> PRT
<213> Mus musculus
<400> 6
Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser Leu Ser Pro
1 5 10 15
Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp Pro Asp Val
20 25 30
Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr Ala Gln Thr
35 40 45
Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val Val Ser Ala
50 55 60
Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu Phe Lys Cys
65 70 75 80
Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg Thr Ile Ser
85 90 95
Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val Leu Pro Pro
100 105 110
Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr Cys Met Val
115 120 125
Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr Asn Asn Gly
130 135 140
Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu Asp Ser Asp
145 150 155 160
Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys Lys Asn Trp
165 170 175
Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu Gly Leu His
180 185 190
Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly Lys
195 200 205
<210> 7
<211> 26
<212> PRT
<213> Homo sapiens
<400> 7
Glu Pro Lys Ser Ser Asp Lys Thr His Thr Ser Pro Pro Cys Pro Ala
1 5 10 15
Pro Glu Leu Leu Gly Gly Pro Ser Val Phe
20 25
<210> 8
<211> 27
<212> PRT
<213> Mus musculus
<400> 8
Glu Pro Arg Gly Pro Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro
1 5 10 15
Ala Pro Asn Leu Leu Gly Gly Pro Ser Val Phe
20 25
<210> 9
<211> 39
<212> PRT
<213> Homo sapiens
<400> 9
Arg Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys
1 5 10 15
Glu Glu Gln Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His
20 25 30
Thr Gln Pro Leu Gly Val Phe
35
<210> 10
<211> 26
<212> PRT
<213> Artificial Sequence
<220>
<223> Abatacept hinge
<400> 10
Glu Pro Lys Ser Ser Asp Lys Thr His Thr Ser Pro Pro Ser Pro Ala
1 5 10 15
Pro Glu Leu Leu Gly Gly Ser Ser Val Phe
20 25
<210> 11
<211> 2
<212> PRT
<213> Artificial Sequence
<220>
<223> linker
<400> 11
Gly Ser
1
<210> 12
<211> 3
<212> PRT
<213> Artificial Sequence
<220>
<223> linker
<400> 12
Gly Gly Gly
1
<210> 13
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 13
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
785 790 795 800
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
805 810 815
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
820 825 830
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
835 840 845
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
850 855 860
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
865 870 875 880
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
885 890 895
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
900 905 910
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
915 920 925
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
945 950 955 960
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
965 970 975
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
980 985 990
Gly Lys
<210> 14
<211> 995
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 14
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Glu Pro Arg Gly Pro Thr
755 760 765
Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu Gly
770 775 780
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
785 790 795 800
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
805 810 815
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
820 825 830
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
835 840 845
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
850 855 860
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
865 870 875 880
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
885 890 895
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
900 905 910
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
915 920 925
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
930 935 940
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
945 950 955 960
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
965 970 975
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
980 985 990
Pro Gly Lys
995
<210> 15
<211> 1007
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 15
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Arg Asn Thr Gly Arg Gly
755 760 765
Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu Glu Arg
770 775 780
Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu Gly Val
785 790 795 800
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
805 810 815
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
820 825 830
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
835 840 845
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
850 855 860
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
865 870 875 880
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
885 890 895
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
900 905 910
Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu
915 920 925
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
930 935 940
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
945 950 955 960
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
965 970 975
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
980 985 990
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
995 1000 1005
<210> 16
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 16
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Ser Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
785 790 795 800
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
805 810 815
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
820 825 830
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
835 840 845
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
850 855 860
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
865 870 875 880
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
885 890 895
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
900 905 910
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
915 920 925
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
945 950 955 960
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
965 970 975
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
980 985 990
Gly Lys
<210> 17
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 17
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcgagc caaagtcctc tgataagaca cacacctctc caccatgccc agcaccagag 2340
ctgctgggag gaccaagcgt gttcctgttt cctccaaagc ccaaggacac actgatgatc 2400
tccaggacac cagaggtgac ctgcgtggtg gtggacgtga gccacgagga ccccgaggtg 2460
aagttcaact ggtacgtgga tggcgtggag gtgcacaatg ccaagaccaa gcccagagag 2520
gagcagtaca actctaccta tagggtggtg agcgtgctga cagtgctgca ccaggactgg 2580
ctgaacggca aggagtataa gtgcaaggtg agcaataagg ccctgcctgc cccaatcgag 2640
aagacaatct ccaaggccaa gggccagcca agagagcccc aggtgtacac cctgccccct 2700
agcagggatg agctgacaaa gaaccaggtg tccctgacct gtctggtgaa gggcttttat 2760
ccctccgaca tcgccgtgga gtgggagtct aatggccagc ctgagaataa ctacaagaca 2820
accccacccg tgctggattc tgacggcagc ttctttctgt attctaagct gaccgtggac 2880
aagagcaggt ggcagcaggg caacgtgttc agctgctccg tgatgcacga agcactgcac 2940
aatcactaca cccagaaatc actgtcactg agccctggca aa 2982
<210> 18
<211> 2985
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 18
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcgagc ctcggggccc taccatcaag ccctgccccc cttgcaagtg ccctgcccct 2340
aatctgctgg gcggaccctc cgtgttcctg tttcctccaa agcccaagga cacactgatg 2400
atctccagga caccagaggt gacctgcgtg gtggtggacg tgagccacga ggaccccgag 2460
gtgaagttca actggtacgt ggatggcgtg gaggtgcaca atgccaagac caagcccaga 2520
gaggagcagt acaactctac ctatagggtg gtgagcgtgc tgacagtgct gcaccaggac 2580
tggctgaacg gcaaggagta taagtgcaag gtgagcaata aggccctgcc tgccccaatc 2640
gagaagacaa tctccaaggc caagggccag ccaagagagc cccaggtgta caccctgccc 2700
cctagcaggg atgagctgac aaagaaccag gtgtccctga cctgtctggt gaagggcttt 2760
tatccctccg acatcgccgt ggagtgggag tctaatggcc agcctgagaa taactacaag 2820
acaaccccac ccgtgctgga ttctgacggc agcttctttc tgtattctaa gctgaccgtg 2880
gacaagagca ggtggcagca gggcaacgtg ttcagctgct ccgtgatgca cgaagcactg 2940
cacaatcact acacccagaa atcactgtca ctgagccctg gcaaa 2985
<210> 19
<211> 3021
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 19
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttccgca acaccggccg cggcggcgag gagaagaaga aggagaagga gaaggaggag 2340
caggaggagc gcgagaccaa gacccccgag tgccccagcc acacccagcc cctgggcgtg 2400
ttcctgtttc ctccaaagcc caaggacaca ctgatgatct ccaggacacc agaggtgacc 2460
tgcgtggtgg tggacgtgag ccacgaggac cccgaggtga agttcaactg gtacgtggat 2520
ggcgtggagg tgcacaatgc caagaccaag cccagagagg agcagtacaa ctctacctat 2580
agggtggtga gcgtgctgac agtgctgcac caggactggc tgaacggcaa ggagtataag 2640
tgcaaggtga gcaataaggc cctgcctgcc ccaatcgaga agacaatctc caaggccaag 2700
ggccagccaa gagagcccca ggtgtacacc ctgcccccta gcagggatga gctgacaaag 2760
aaccaggtgt ccctgacctg tctggtgaag ggcttttatc cctccgacat cgccgtggag 2820
tgggagtcta atggccagcc tgagaataac tacaagacaa ccccacccgt gctggattct 2880
gacggcagct tctttctgta ttctaagctg accgtggaca agagcaggtg gcagcagggc 2940
aacgtgttca gctgctccgt gatgcacgaa gcactgcaca atcactacac ccagaaatca 3000
ctgtcactga gccctggcaa a 3021
<210> 20
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 20
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcgaac cgaaatcttc tgacaaaacc cacacctctc cgccgtctcc ggctccggaa 2340
ctgctgggtg gttcttctgt tttcctgttt cctccaaagc ccaaggacac actgatgatc 2400
tccaggacac cagaggtgac ctgcgtggtg gtggacgtga gccacgagga ccccgaggtg 2460
aagttcaact ggtacgtgga tggcgtggag gtgcacaatg ccaagaccaa gcccagagag 2520
gagcagtaca actctaccta tagggtggtg agcgtgctga cagtgctgca ccaggactgg 2580
ctgaacggca aggagtataa gtgcaaggtg agcaataagg ccctgcctgc cccaatcgag 2640
aagacaatct ccaaggccaa gggccagcca agagagcccc aggtgtacac cctgccccct 2700
agcagggatg agctgacaaa gaaccaggtg tccctgacct gtctggtgaa gggcttttat 2760
ccctccgaca tcgccgtgga gtgggagtct aatggccagc ctgagaataa ctacaagaca 2820
accccacccg tgctggattc tgacggcagc ttctttctgt attctaagct gaccgtggac 2880
aagagcaggt ggcagcaggg caacgtgttc agctgctccg tgatgcacga agcactgcac 2940
aatcactaca cccagaaatc actgtcactg agccctggca aa 2982
<210> 21
<211> 992
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 21
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Lys Ser Ser Asp Lys Thr
755 760 765
His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
770 775 780
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
785 790 795 800
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
805 810 815
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
820 825 830
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
835 840 845
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
850 855 860
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
865 870 875 880
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
885 890 895
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
900 905 910
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
915 920 925
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
930 935 940
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
945 950 955 960
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
965 970 975
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
980 985 990
<210> 22
<211> 993
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 22
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Arg Gly Pro Thr Ile Lys
755 760 765
Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu Gly Gly Pro
770 775 780
Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
785 790 795 800
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
805 810 815
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
820 825 830
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
835 840 845
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
850 855 860
Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys
865 870 875 880
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
885 890 895
Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr
900 905 910
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
915 920 925
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
930 935 940
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
945 950 955 960
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
965 970 975
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
980 985 990
Lys
<210> 23
<211> 1005
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 23
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Arg Asn Thr Gly Arg Gly Gly Glu
755 760 765
Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu Glu Arg Glu Thr
770 775 780
Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu Gly Val Phe Leu
785 790 795 800
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
805 810 815
Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys
820 825 830
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
835 840 845
Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu
850 855 860
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
865 870 875 880
Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys
885 890 895
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
900 905 910
Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
915 920 925
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
930 935 940
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
945 950 955 960
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
965 970 975
Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
980 985 990
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
995 1000 1005
<210> 24
<211> 992
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 24
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Lys Ser Ser Asp Lys Thr
755 760 765
His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu Gly Gly Ser Ser
770 775 780
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
785 790 795 800
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
805 810 815
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
820 825 830
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
835 840 845
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
850 855 860
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
865 870 875 880
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
885 890 895
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
900 905 910
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
915 920 925
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
930 935 940
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
945 950 955 960
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
965 970 975
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
980 985 990
<210> 25
<211> 2976
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 25
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gagccaaagt cctctgataa gacacacacc tctccaccat gcccagcacc agagctgctg 2340
ggaggaccaa gcgtgttcct gtttcctcca aagcccaagg acacactgat gatctccagg 2400
acaccagagg tgacctgcgt ggtggtggac gtgagccacg aggaccccga ggtgaagttc 2460
aactggtacg tggatggcgt ggaggtgcac aatgccaaga ccaagcccag agaggagcag 2520
tacaactcta cctatagggt ggtgagcgtg ctgacagtgc tgcaccagga ctggctgaac 2580
ggcaaggagt ataagtgcaa ggtgagcaat aaggccctgc ctgccccaat cgagaagaca 2640
atctccaagg ccaagggcca gccaagagag ccccaggtgt acaccctgcc ccctagcagg 2700
gatgagctga caaagaacca ggtgtccctg acctgtctgg tgaagggctt ttatccctcc 2760
gacatcgccg tggagtggga gtctaatggc cagcctgaga ataactacaa gacaacccca 2820
cccgtgctgg attctgacgg cagcttcttt ctgtattcta agctgaccgt ggacaagagc 2880
aggtggcagc agggcaacgt gttcagctgc tccgtgatgc acgaagcact gcacaatcac 2940
tacacccaga aatcactgtc actgagccct ggcaaa 2976
<210> 26
<211> 2979
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 26
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gagcctcggg gccctaccat caagccctgc cccccttgca agtgccctgc ccctaatctg 2340
ctgggcggac cctccgtgtt cctgtttcct ccaaagccca aggacacact gatgatctcc 2400
aggacaccag aggtgacctg cgtggtggtg gacgtgagcc acgaggaccc cgaggtgaag 2460
ttcaactggt acgtggatgg cgtggaggtg cacaatgcca agaccaagcc cagagaggag 2520
cagtacaact ctacctatag ggtggtgagc gtgctgacag tgctgcacca ggactggctg 2580
aacggcaagg agtataagtg caaggtgagc aataaggccc tgcctgcccc aatcgagaag 2640
acaatctcca aggccaaggg ccagccaaga gagccccagg tgtacaccct gccccctagc 2700
agggatgagc tgacaaagaa ccaggtgtcc ctgacctgtc tggtgaaggg cttttatccc 2760
tccgacatcg ccgtggagtg ggagtctaat ggccagcctg agaataacta caagacaacc 2820
ccacccgtgc tggattctga cggcagcttc tttctgtatt ctaagctgac cgtggacaag 2880
agcaggtggc agcagggcaa cgtgttcagc tgctccgtga tgcacgaagc actgcacaat 2940
cactacaccc agaaatcact gtcactgagc cctggcaaa 2979
<210> 27
<211> 3015
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 27
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
cgcaacaccg gccgcggcgg cgaggagaag aagaaggaga aggagaagga ggagcaggag 2340
gagcgcgaga ccaagacccc cgagtgcccc agccacaccc agcccctggg cgtgttcctg 2400
tttcctccaa agcccaagga cacactgatg atctccagga caccagaggt gacctgcgtg 2460
gtggtggacg tgagccacga ggaccccgag gtgaagttca actggtacgt ggatggcgtg 2520
gaggtgcaca atgccaagac caagcccaga gaggagcagt acaactctac ctatagggtg 2580
gtgagcgtgc tgacagtgct gcaccaggac tggctgaacg gcaaggagta taagtgcaag 2640
gtgagcaata aggccctgcc tgccccaatc gagaagacaa tctccaaggc caagggccag 2700
ccaagagagc cccaggtgta caccctgccc cctagcaggg atgagctgac aaagaaccag 2760
gtgtccctga cctgtctggt gaagggcttt tatccctccg acatcgccgt ggagtgggag 2820
tctaatggcc agcctgagaa taactacaag acaaccccac ccgtgctgga ttctgacggc 2880
agcttctttc tgtattctaa gctgaccgtg gacaagagca ggtggcagca gggcaacgtg 2940
ttcagctgct ccgtgatgca cgaagcactg cacaatcact acacccagaa atcactgtca 3000
ctgagccctg gcaaa 3015
<210> 28
<211> 2976
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 28
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gaaccgaaat cttctgacaa aacccacacc tctccgccgt ctccggctcc ggaactgctg 2340
ggtggttctt ctgttttcct gtttcctcca aagcccaagg acacactgat gatctccagg 2400
acaccagagg tgacctgcgt ggtggtggac gtgagccacg aggaccccga ggtgaagttc 2460
aactggtacg tggatggcgt ggaggtgcac aatgccaaga ccaagcccag agaggagcag 2520
tacaactcta cctatagggt ggtgagcgtg ctgacagtgc tgcaccagga ctggctgaac 2580
ggcaaggagt ataagtgcaa ggtgagcaat aaggccctgc ctgccccaat cgagaagaca 2640
atctccaagg ccaagggcca gccaagagag ccccaggtgt acaccctgcc ccctagcagg 2700
gatgagctga caaagaacca ggtgtccctg acctgtctgg tgaagggctt ttatccctcc 2760
gacatcgccg tggagtggga gtctaatggc cagcctgaga ataactacaa gacaacccca 2820
cccgtgctgg attctgacgg cagcttcttt ctgtattcta agctgaccgt ggacaagagc 2880
aggtggcagc agggcaacgt gttcagctgc tccgtgatgc acgaagcact gcacaatcac 2940
tacacccaga aatcactgtc actgagccct ggcaaa 2976
<210> 29
<211> 992
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 29
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Lys Ser Ser Asp Lys Thr
755 760 765
His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
770 775 780
Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser Leu
785 790 795 800
Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp Pro
805 810 815
Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr Ala
820 825 830
Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val Val
835 840 845
Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu Phe
850 855 860
Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg Thr
865 870 875 880
Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val Leu
885 890 895
Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr Cys
900 905 910
Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr Asn
915 920 925
Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu Asp
930 935 940
Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys Lys
945 950 955 960
Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu Gly
965 970 975
Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly Lys
980 985 990
<210> 30
<211> 993
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 30
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Arg Gly Pro Thr Ile Lys
755 760 765
Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu Gly Gly Pro
770 775 780
Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser
785 790 795 800
Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp
805 810 815
Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr
820 825 830
Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val
835 840 845
Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu
850 855 860
Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg
865 870 875 880
Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val
885 890 895
Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr
900 905 910
Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr
915 920 925
Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu
930 935 940
Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys
945 950 955 960
Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu
965 970 975
Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly
980 985 990
Lys
<210> 31
<211> 1005
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 31
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Arg Asn Thr Gly Arg Gly Gly Glu
755 760 765
Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu Glu Arg Glu Thr
770 775 780
Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu Gly Val Phe Ile
785 790 795 800
Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser Leu Ser Pro Ile
805 810 815
Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp Pro Asp Val Gln
820 825 830
Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr Ala Gln Thr Gln
835 840 845
Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val Val Ser Ala Leu
850 855 860
Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu Phe Lys Cys Lys
865 870 875 880
Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg Thr Ile Ser Lys
885 890 895
Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val Leu Pro Pro Pro
900 905 910
Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr Cys Met Val Thr
915 920 925
Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr Asn Asn Gly Lys
930 935 940
Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu Asp Ser Asp Gly
945 950 955 960
Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys Lys Asn Trp Val
965 970 975
Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu Gly Leu His Asn
980 985 990
His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly Lys
995 1000 1005
<210> 32
<211> 992
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 32
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Glu Pro Lys Ser Ser Asp Lys Thr
755 760 765
His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu Gly Gly Ser Ser
770 775 780
Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser Leu
785 790 795 800
Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp Pro
805 810 815
Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr Ala
820 825 830
Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val Val
835 840 845
Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu Phe
850 855 860
Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg Thr
865 870 875 880
Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val Leu
885 890 895
Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr Cys
900 905 910
Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr Asn
915 920 925
Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu Asp
930 935 940
Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys Lys
945 950 955 960
Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu Gly
965 970 975
Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly Lys
980 985 990
<210> 33
<211> 2976
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 33
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gagccaaagt cctctgataa gacacacacc tctccaccat gcccagcacc agagctgctg 2340
ggaggaccaa gcgtgttcat cttcccaccc aagatcaagg acgtgctgat gatctccctg 2400
tcccccatcg tgacctgcgt ggtggtggac gtgtccgagg acgaccccga cgtgcagatc 2460
agttggttcg tgaacaacgt ggaagtgcac accgcccaga cccagaccca cagagaggac 2520
tacaactcca ccctgcgggt ggtgtccgcc ctgcccatcc agcaccagga ctggatgtcc 2580
ggcaaagaat tcaagtgcaa agtgaacaac aaggacctgc ctgcccccat cgagcggacc 2640
atctccaagc ccaagggctc cgtgcgggct ccccaggtgt acgtgctgcc ccctccagag 2700
gaagagatga ccaagaagca ggtcacactg acctgcatgg tcaccgactt catgcccgag 2760
gacatctacg tggaatggac caacaatggc aagaccgagc tgaactacaa gaacaccgag 2820
cctgtgctgg actccgacgg ctcctacttc atgtactcca agctgcgggt ggaaaagaag 2880
aactgggtcg agcggaactc ctactcctgc tccgtggtgc acgagggcct gcacaaccac 2940
cacaccacca agtccttctc ccggaccccc ggcaaa 2976
<210> 34
<211> 2979
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 34
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gagcctcggg gccctaccat caagccctgc cccccttgca agtgccctgc ccctaatctg 2340
ctgggcggac cctccgtgtt catcttccca cccaagatca aggacgtgct gatgatctcc 2400
ctgtccccca tcgtgacctg cgtggtggtg gacgtgtccg aggacgaccc cgacgtgcag 2460
atcagttggt tcgtgaacaa cgtggaagtg cacaccgccc agacccagac ccacagagag 2520
gactacaact ccaccctgcg ggtggtgtcc gccctgccca tccagcacca ggactggatg 2580
tccggcaaag aattcaagtg caaagtgaac aacaaggacc tgcctgcccc catcgagcgg 2640
accatctcca agcccaaggg ctccgtgcgg gctccccagg tgtacgtgct gccccctcca 2700
gaggaagaga tgaccaagaa gcaggtcaca ctgacctgca tggtcaccga cttcatgccc 2760
gaggacatct acgtggaatg gaccaacaat ggcaagaccg agctgaacta caagaacacc 2820
gagcctgtgc tggactccga cggctcctac ttcatgtact ccaagctgcg ggtggaaaag 2880
aagaactggg tcgagcggaa ctcctactcc tgctccgtgg tgcacgaggg cctgcacaac 2940
caccacacca ccaagtcctt ctcccggacc cccggcaaa 2979
<210> 35
<211> 3015
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 35
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
cgcaacaccg gccgcggcgg cgaggagaag aagaaggaga aggagaagga ggagcaggag 2340
gagcgcgaga ccaagacccc cgagtgcccc agccacaccc agcccctggg cgtgttcatc 2400
ttcccaccca agatcaagga cgtgctgatg atctccctgt cccccatcgt gacctgcgtg 2460
gtggtggacg tgtccgagga cgaccccgac gtgcagatca gttggttcgt gaacaacgtg 2520
gaagtgcaca ccgcccagac ccagacccac agagaggact acaactccac cctgcgggtg 2580
gtgtccgccc tgcccatcca gcaccaggac tggatgtccg gcaaagaatt caagtgcaaa 2640
gtgaacaaca aggacctgcc tgcccccatc gagcggacca tctccaagcc caagggctcc 2700
gtgcgggctc cccaggtgta cgtgctgccc cctccagagg aagagatgac caagaagcag 2760
gtcacactga cctgcatggt caccgacttc atgcccgagg acatctacgt ggaatggacc 2820
aacaatggca agaccgagct gaactacaag aacaccgagc ctgtgctgga ctccgacggc 2880
tcctacttca tgtactccaa gctgcgggtg gaaaagaaga actgggtcga gcggaactcc 2940
tactcctgct ccgtggtgca cgagggcctg cacaaccacc acaccaccaa gtccttctcc 3000
cggacccccg gcaaa 3015
<210> 36
<211> 2976
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 36
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
gaaccgaaat cttctgacaa aacccacacc tctccgccgt ctccggctcc ggaactgctg 2340
ggtggttctt ctgttttcat cttcccaccc aagatcaagg acgtgctgat gatctccctg 2400
tcccccatcg tgacctgcgt ggtggtggac gtgtccgagg acgaccccga cgtgcagatc 2460
agttggttcg tgaacaacgt ggaagtgcac accgcccaga cccagaccca cagagaggac 2520
tacaactcca ccctgcgggt ggtgtccgcc ctgcccatcc agcaccagga ctggatgtcc 2580
ggcaaagaat tcaagtgcaa agtgaacaac aaggacctgc ctgcccccat cgagcggacc 2640
atctccaagc ccaagggctc cgtgcgggct ccccaggtgt acgtgctgcc ccctccagag 2700
gaagagatga ccaagaagca ggtcacactg acctgcatgg tcaccgactt catgcccgag 2760
gacatctacg tggaatggac caacaatggc aagaccgagc tgaactacaa gaacaccgag 2820
cctgtgctgg actccgacgg ctcctacttc atgtactcca agctgcgggt ggaaaagaag 2880
aactgggtcg agcggaactc ctactcctgc tccgtggtgc acgagggcct gcacaaccac 2940
cacaccacca agtccttctc ccggaccccc ggcaaa 2976
<210> 37
<211> 996
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 37
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Ser Glu Pro Lys Ser
755 760 765
Ser Asp Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu
770 775 780
Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
785 790 795 800
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
805 810 815
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
820 825 830
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
835 840 845
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
850 855 860
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
865 870 875 880
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
885 890 895
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
900 905 910
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
915 920 925
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
930 935 940
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
945 950 955 960
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
965 970 975
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
980 985 990
Ser Pro Gly Lys
995
<210> 38
<211> 997
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 38
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Ser Glu Pro Arg Gly
755 760 765
Pro Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu
770 775 780
Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
785 790 795 800
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val
805 810 815
Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val
820 825 830
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser
835 840 845
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
850 855 860
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala
865 870 875 880
Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
885 890 895
Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln
900 905 910
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
915 920 925
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
930 935 940
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
945 950 955 960
Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser
965 970 975
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
980 985 990
Leu Ser Pro Gly Lys
995
<210> 39
<211> 1009
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 39
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Ser Arg Asn Thr Gly
755 760 765
Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu
770 775 780
Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu
785 790 795 800
Gly Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
805 810 815
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
820 825 830
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
835 840 845
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
850 855 860
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
865 870 875 880
Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys
885 890 895
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
900 905 910
Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr
915 920 925
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
930 935 940
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
945 950 955 960
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
965 970 975
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
980 985 990
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
995 1000 1005
Lys
<210> 40
<211> 996
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 40
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Ser Glu Pro Lys Ser
755 760 765
Ser Asp Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu
770 775 780
Gly Gly Ser Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
785 790 795 800
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
805 810 815
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
820 825 830
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
835 840 845
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
850 855 860
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
865 870 875 880
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
885 890 895
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
900 905 910
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
915 920 925
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
930 935 940
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
945 950 955 960
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
965 970 975
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
980 985 990
Ser Pro Gly Lys
995
<210> 41
<211> 2988
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 41
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggat ccgagccaaa gtcctctgat aagacacaca cctctccacc atgcccagca 2340
ccagagctgc tgggaggacc aagcgtgttc ctgtttcctc caaagcccaa ggacacactg 2400
atgatctcca ggacaccaga ggtgacctgc gtggtggtgg acgtgagcca cgaggacccc 2460
gaggtgaagt tcaactggta cgtggatggc gtggaggtgc acaatgccaa gaccaagccc 2520
agagaggagc agtacaactc tacctatagg gtggtgagcg tgctgacagt gctgcaccag 2580
gactggctga acggcaagga gtataagtgc aaggtgagca ataaggccct gcctgcccca 2640
atcgagaaga caatctccaa ggccaagggc cagccaagag agccccaggt gtacaccctg 2700
ccccctagca gggatgagct gacaaagaac caggtgtccc tgacctgtct ggtgaagggc 2760
ttttatccct ccgacatcgc cgtggagtgg gagtctaatg gccagcctga gaataactac 2820
aagacaaccc cacccgtgct ggattctgac ggcagcttct ttctgtattc taagctgacc 2880
gtggacaaga gcaggtggca gcagggcaac gtgttcagct gctccgtgat gcacgaagca 2940
ctgcacaatc actacaccca gaaatcactg tcactgagcc ctggcaaa 2988
<210> 42
<211> 2991
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 42
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggat ccgagcctcg gggccctacc atcaagccct gccccccttg caagtgccct 2340
gcccctaatc tgctgggcgg accctccgtg ttcctgtttc ctccaaagcc caaggacaca 2400
ctgatgatct ccaggacacc agaggtgacc tgcgtggtgg tggacgtgag ccacgaggac 2460
cccgaggtga agttcaactg gtacgtggat ggcgtggagg tgcacaatgc caagaccaag 2520
cccagagagg agcagtacaa ctctacctat agggtggtga gcgtgctgac agtgctgcac 2580
caggactggc tgaacggcaa ggagtataag tgcaaggtga gcaataaggc cctgcctgcc 2640
ccaatcgaga agacaatctc caaggccaag ggccagccaa gagagcccca ggtgtacacc 2700
ctgcccccta gcagggatga gctgacaaag aaccaggtgt ccctgacctg tctggtgaag 2760
ggcttttatc cctccgacat cgccgtggag tgggagtcta atggccagcc tgagaataac 2820
tacaagacaa ccccacccgt gctggattct gacggcagct tctttctgta ttctaagctg 2880
accgtggaca agagcaggtg gcagcagggc aacgtgttca gctgctccgt gatgcacgaa 2940
gcactgcaca atcactacac ccagaaatca ctgtcactga gccctggcaa a 2991
<210> 43
<211> 3027
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 43
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggat cccgcaacac cggccgcggc ggcgaggaga agaagaagga gaaggagaag 2340
gaggagcagg aggagcgcga gaccaagacc cccgagtgcc ccagccacac ccagcccctg 2400
ggcgtgttcc tgtttcctcc aaagcccaag gacacactga tgatctccag gacaccagag 2460
gtgacctgcg tggtggtgga cgtgagccac gaggaccccg aggtgaagtt caactggtac 2520
gtggatggcg tggaggtgca caatgccaag accaagccca gagaggagca gtacaactct 2580
acctataggg tggtgagcgt gctgacagtg ctgcaccagg actggctgaa cggcaaggag 2640
tataagtgca aggtgagcaa taaggccctg cctgccccaa tcgagaagac aatctccaag 2700
gccaagggcc agccaagaga gccccaggtg tacaccctgc cccctagcag ggatgagctg 2760
acaaagaacc aggtgtccct gacctgtctg gtgaagggct tttatccctc cgacatcgcc 2820
gtggagtggg agtctaatgg ccagcctgag aataactaca agacaacccc acccgtgctg 2880
gattctgacg gcagcttctt tctgtattct aagctgaccg tggacaagag caggtggcag 2940
cagggcaacg tgttcagctg ctccgtgatg cacgaagcac tgcacaatca ctacacccag 3000
aaatcactgt cactgagccc tggcaaa 3027
<210> 44
<211> 2988
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 44
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggat ccgaaccgaa atcttctgac aaaacccaca cctctccgcc gtctccggct 2340
ccggaactgc tgggtggttc ttctgttttc ctgtttcctc caaagcccaa ggacacactg 2400
atgatctcca ggacaccaga ggtgacctgc gtggtggtgg acgtgagcca cgaggacccc 2460
gaggtgaagt tcaactggta cgtggatggc gtggaggtgc acaatgccaa gaccaagccc 2520
agagaggagc agtacaactc tacctatagg gtggtgagcg tgctgacagt gctgcaccag 2580
gactggctga acggcaagga gtataagtgc aaggtgagca ataaggccct gcctgcccca 2640
atcgagaaga caatctccaa ggccaagggc cagccaagag agccccaggt gtacaccctg 2700
ccccctagca gggatgagct gacaaagaac caggtgtccc tgacctgtct ggtgaagggc 2760
ttttatccct ccgacatcgc cgtggagtgg gagtctaatg gccagcctga gaataactac 2820
aagacaaccc cacccgtgct ggattctgac ggcagcttct ttctgtattc taagctgacc 2880
gtggacaaga gcaggtggca gcagggcaac gtgttcagct gctccgtgat gcacgaagca 2940
ctgcacaatc actacaccca gaaatcactg tcactgagcc ctggcaaa 2988
<210> 45
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 45
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
785 790 795 800
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
805 810 815
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
820 825 830
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
835 840 845
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
850 855 860
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
865 870 875 880
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
885 890 895
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
900 905 910
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
915 920 925
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
945 950 955 960
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
965 970 975
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
980 985 990
Gly Lys
<210> 46
<211> 995
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 46
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Arg Gly Pro Thr
755 760 765
Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu Gly
770 775 780
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
785 790 795 800
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
805 810 815
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
820 825 830
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
835 840 845
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
850 855 860
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
865 870 875 880
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
885 890 895
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
900 905 910
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
915 920 925
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
930 935 940
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
945 950 955 960
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
965 970 975
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
980 985 990
Pro Gly Lys
995
<210> 47
<211> 1007
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 47
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Arg Asn Thr Gly Arg Gly
755 760 765
Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu Glu Arg
770 775 780
Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu Gly Val
785 790 795 800
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
805 810 815
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
820 825 830
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
835 840 845
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
850 855 860
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
865 870 875 880
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
885 890 895
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
900 905 910
Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu
915 920 925
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
930 935 940
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
945 950 955 960
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
965 970 975
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
980 985 990
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
995 1000 1005
<210> 48
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 48
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Ser Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
785 790 795 800
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
805 810 815
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
820 825 830
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
835 840 845
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
850 855 860
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
865 870 875 880
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
885 890 895
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
900 905 910
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
915 920 925
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
945 950 955 960
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
965 970 975
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
980 985 990
Gly Lys
<210> 49
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 49
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgagc caaagtcctc tgataagaca cacacctctc caccatgccc agcaccagag 2340
ctgctgggag gaccaagcgt gttcctgttt cctccaaagc ccaaggacac actgatgatc 2400
tccaggacac cagaggtgac ctgcgtggtg gtggacgtga gccacgagga ccccgaggtg 2460
aagttcaact ggtacgtgga tggcgtggag gtgcacaatg ccaagaccaa gcccagagag 2520
gagcagtaca actctaccta tagggtggtg agcgtgctga cagtgctgca ccaggactgg 2580
ctgaacggca aggagtataa gtgcaaggtg agcaataagg ccctgcctgc cccaatcgag 2640
aagacaatct ccaaggccaa gggccagcca agagagcccc aggtgtacac cctgccccct 2700
agcagggatg agctgacaaa gaaccaggtg tccctgacct gtctggtgaa gggcttttat 2760
ccctccgaca tcgccgtgga gtgggagtct aatggccagc ctgagaataa ctacaagaca 2820
accccacccg tgctggattc tgacggcagc ttctttctgt attctaagct gaccgtggac 2880
aagagcaggt ggcagcaggg caacgtgttc agctgctccg tgatgcacga agcactgcac 2940
aatcactaca cccagaaatc actgtcactg agccctggca aa 2982
<210> 50
<211> 2985
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 50
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgagc ctcggggccc taccatcaag ccctgccccc cttgcaagtg ccctgcccct 2340
aatctgctgg gcggaccctc cgtgttcctg tttcctccaa agcccaagga cacactgatg 2400
atctccagga caccagaggt gacctgcgtg gtggtggacg tgagccacga ggaccccgag 2460
gtgaagttca actggtacgt ggatggcgtg gaggtgcaca atgccaagac caagcccaga 2520
gaggagcagt acaactctac ctatagggtg gtgagcgtgc tgacagtgct gcaccaggac 2580
tggctgaacg gcaaggagta taagtgcaag gtgagcaata aggccctgcc tgccccaatc 2640
gagaagacaa tctccaaggc caagggccag ccaagagagc cccaggtgta caccctgccc 2700
cctagcaggg atgagctgac aaagaaccag gtgtccctga cctgtctggt gaagggcttt 2760
tatccctccg acatcgccgt ggagtgggag tctaatggcc agcctgagaa taactacaag 2820
acaaccccac ccgtgctgga ttctgacggc agcttctttc tgtattctaa gctgaccgtg 2880
gacaagagca ggtggcagca gggcaacgtg ttcagctgct ccgtgatgca cgaagcactg 2940
cacaatcact acacccagaa atcactgtca ctgagccctg gcaaa 2985
<210> 51
<211> 3021
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 51
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatcccgca acaccggccg cggcggcgag gagaagaaga aggagaagga gaaggaggag 2340
caggaggagc gcgagaccaa gacccccgag tgccccagcc acacccagcc cctgggcgtg 2400
ttcctgtttc ctccaaagcc caaggacaca ctgatgatct ccaggacacc agaggtgacc 2460
tgcgtggtgg tggacgtgag ccacgaggac cccgaggtga agttcaactg gtacgtggat 2520
ggcgtggagg tgcacaatgc caagaccaag cccagagagg agcagtacaa ctctacctat 2580
agggtggtga gcgtgctgac agtgctgcac caggactggc tgaacggcaa ggagtataag 2640
tgcaaggtga gcaataaggc cctgcctgcc ccaatcgaga agacaatctc caaggccaag 2700
ggccagccaa gagagcccca ggtgtacacc ctgcccccta gcagggatga gctgacaaag 2760
aaccaggtgt ccctgacctg tctggtgaag ggcttttatc cctccgacat cgccgtggag 2820
tgggagtcta atggccagcc tgagaataac tacaagacaa ccccacccgt gctggattct 2880
gacggcagct tctttctgta ttctaagctg accgtggaca agagcaggtg gcagcagggc 2940
aacgtgttca gctgctccgt gatgcacgaa gcactgcaca atcactacac ccagaaatca 3000
ctgtcactga gccctggcaa a 3021
<210> 52
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 52
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgaac cgaaatcttc tgacaaaacc cacacctctc cgccgtctcc ggctccggaa 2340
ctgctgggtg gttcttctgt tttcctgttt cctccaaagc ccaaggacac actgatgatc 2400
tccaggacac cagaggtgac ctgcgtggtg gtggacgtga gccacgagga ccccgaggtg 2460
aagttcaact ggtacgtgga tggcgtggag gtgcacaatg ccaagaccaa gcccagagag 2520
gagcagtaca actctaccta tagggtggtg agcgtgctga cagtgctgca ccaggactgg 2580
ctgaacggca aggagtataa gtgcaaggtg agcaataagg ccctgcctgc cccaatcgag 2640
aagacaatct ccaaggccaa gggccagcca agagagcccc aggtgtacac cctgccccct 2700
agcagggatg agctgacaaa gaaccaggtg tccctgacct gtctggtgaa gggcttttat 2760
ccctccgaca tcgccgtgga gtgggagtct aatggccagc ctgagaataa ctacaagaca 2820
accccacccg tgctggattc tgacggcagc ttctttctgt attctaagct gaccgtggac 2880
aagagcaggt ggcagcaggg caacgtgttc agctgctccg tgatgcacga agcactgcac 2940
aatcactaca cccagaaatc actgtcactg agccctggca aa 2982
<210> 53
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 53
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Pro Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile
785 790 795 800
Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp
805 810 815
Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His
820 825 830
Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg
835 840 845
Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys
850 855 860
Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu
865 870 875 880
Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr
885 890 895
Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu
900 905 910
Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp
915 920 925
Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu
945 950 955 960
Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His
965 970 975
Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro
980 985 990
Gly Lys
<210> 54
<211> 995
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 54
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Arg Gly Pro Thr
755 760 765
Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu Gly
770 775 780
Gly Pro Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met
785 790 795 800
Ile Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu
805 810 815
Asp Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val
820 825 830
His Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu
835 840 845
Arg Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly
850 855 860
Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile
865 870 875 880
Glu Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val
885 890 895
Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr
900 905 910
Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu
915 920 925
Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro
930 935 940
Val Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val
945 950 955 960
Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val
965 970 975
His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr
980 985 990
Pro Gly Lys
995
<210> 55
<211> 1007
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 55
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Arg Asn Thr Gly Arg Gly
755 760 765
Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln Glu Glu Arg
770 775 780
Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro Leu Gly Val
785 790 795 800
Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile Ser Leu Ser
805 810 815
Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp Asp Pro Asp
820 825 830
Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His Thr Ala Gln
835 840 845
Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg Val Val Ser
850 855 860
Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys Glu Phe Lys
865 870 875 880
Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu Arg Thr Ile
885 890 895
Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr Val Leu Pro
900 905 910
Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu Thr Cys Met
915 920 925
Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp Thr Asn Asn
930 935 940
Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val Leu Asp Ser
945 950 955 960
Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu Lys Lys Asn
965 970 975
Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His Glu Gly Leu
980 985 990
His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro Gly Lys
995 1000 1005
<210> 56
<211> 994
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 56
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Ser Glu Pro Lys Ser Ser Asp
755 760 765
Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu Leu Gly Gly
770 775 780
Ser Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile
785 790 795 800
Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp
805 810 815
Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His
820 825 830
Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg
835 840 845
Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys
850 855 860
Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu
865 870 875 880
Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr
885 890 895
Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu
900 905 910
Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp
915 920 925
Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val
930 935 940
Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu
945 950 955 960
Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His
965 970 975
Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro
980 985 990
Gly Lys
<210> 57
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 57
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgagc caaagtcctc tgataagaca cacacctctc caccatgccc agcaccagag 2340
ctgctgggag gaccaagcgt gttcatcttc ccacccaaga tcaaggacgt gctgatgatc 2400
tccctgtccc ccatcgtgac ctgcgtggtg gtggacgtgt ccgaggacga ccccgacgtg 2460
cagatcagtt ggttcgtgaa caacgtggaa gtgcacaccg cccagaccca gacccacaga 2520
gaggactaca actccaccct gcgggtggtg tccgccctgc ccatccagca ccaggactgg 2580
atgtccggca aagaattcaa gtgcaaagtg aacaacaagg acctgcctgc ccccatcgag 2640
cggaccatct ccaagcccaa gggctccgtg cgggctcccc aggtgtacgt gctgccccct 2700
ccagaggaag agatgaccaa gaagcaggtc acactgacct gcatggtcac cgacttcatg 2760
cccgaggaca tctacgtgga atggaccaac aatggcaaga ccgagctgaa ctacaagaac 2820
accgagcctg tgctggactc cgacggctcc tacttcatgt actccaagct gcgggtggaa 2880
aagaagaact gggtcgagcg gaactcctac tcctgctccg tggtgcacga gggcctgcac 2940
aaccaccaca ccaccaagtc cttctcccgg acccccggca aa 2982
<210> 58
<211> 2985
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 58
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgagc ctcggggccc taccatcaag ccctgccccc cttgcaagtg ccctgcccct 2340
aatctgctgg gcggaccctc cgtgttcatc ttcccaccca agatcaagga cgtgctgatg 2400
atctccctgt cccccatcgt gacctgcgtg gtggtggacg tgtccgagga cgaccccgac 2460
gtgcagatca gttggttcgt gaacaacgtg gaagtgcaca ccgcccagac ccagacccac 2520
agagaggact acaactccac cctgcgggtg gtgtccgccc tgcccatcca gcaccaggac 2580
tggatgtccg gcaaagaatt caagtgcaaa gtgaacaaca aggacctgcc tgcccccatc 2640
gagcggacca tctccaagcc caagggctcc gtgcgggctc cccaggtgta cgtgctgccc 2700
cctccagagg aagagatgac caagaagcag gtcacactga cctgcatggt caccgacttc 2760
atgcccgagg acatctacgt ggaatggacc aacaatggca agaccgagct gaactacaag 2820
aacaccgagc ctgtgctgga ctccgacggc tcctacttca tgtactccaa gctgcgggtg 2880
gaaaagaaga actgggtcga gcggaactcc tactcctgct ccgtggtgca cgagggcctg 2940
cacaaccacc acaccaccaa gtccttctcc cggacccccg gcaaa 2985
<210> 59
<211> 3021
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 59
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatcccgca acaccggccg cggcggcgag gagaagaaga aggagaagga gaaggaggag 2340
caggaggagc gcgagaccaa gacccccgag tgccccagcc acacccagcc cctgggcgtg 2400
ttcatcttcc cacccaagat caaggacgtg ctgatgatct ccctgtcccc catcgtgacc 2460
tgcgtggtgg tggacgtgtc cgaggacgac cccgacgtgc agatcagttg gttcgtgaac 2520
aacgtggaag tgcacaccgc ccagacccag acccacagag aggactacaa ctccaccctg 2580
cgggtggtgt ccgccctgcc catccagcac caggactgga tgtccggcaa agaattcaag 2640
tgcaaagtga acaacaagga cctgcctgcc cccatcgagc ggaccatctc caagcccaag 2700
ggctccgtgc gggctcccca ggtgtacgtg ctgccccctc cagaggaaga gatgaccaag 2760
aagcaggtca cactgacctg catggtcacc gacttcatgc ccgaggacat ctacgtggaa 2820
tggaccaaca atggcaagac cgagctgaac tacaagaaca ccgagcctgt gctggactcc 2880
gacggctcct acttcatgta ctccaagctg cgggtggaaa agaagaactg ggtcgagcgg 2940
aactcctact cctgctccgt ggtgcacgag ggcctgcaca accaccacac caccaagtcc 3000
ttctcccgga cccccggcaa a 3021
<210> 60
<211> 2982
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 60
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggatccgaac cgaaatcttc tgacaaaacc cacacctctc cgccgtctcc ggctccggaa 2340
ctgctgggtg gttcttctgt tttcatcttc ccacccaaga tcaaggacgt gctgatgatc 2400
tccctgtccc ccatcgtgac ctgcgtggtg gtggacgtgt ccgaggacga ccccgacgtg 2460
cagatcagtt ggttcgtgaa caacgtggaa gtgcacaccg cccagaccca gacccacaga 2520
gaggactaca actccaccct gcgggtggtg tccgccctgc ccatccagca ccaggactgg 2580
atgtccggca aagaattcaa gtgcaaagtg aacaacaagg acctgcctgc ccccatcgag 2640
cggaccatct ccaagcccaa gggctccgtg cgggctcccc aggtgtacgt gctgccccct 2700
ccagaggaag agatgaccaa gaagcaggtc acactgacct gcatggtcac cgacttcatg 2760
cccgaggaca tctacgtgga atggaccaac aatggcaaga ccgagctgaa ctacaagaac 2820
accgagcctg tgctggactc cgacggctcc tacttcatgt actccaagct gcgggtggaa 2880
aagaagaact gggtcgagcg gaactcctac tcctgctccg tggtgcacga gggcctgcac 2940
aaccaccaca ccaccaagtc cttctcccgg acccccggca aa 2982
<210> 61
<211> 999
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 61
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Gly Gly Gly Ser Glu
755 760 765
Pro Lys Ser Ser Asp Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro
770 775 780
Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
785 790 795 800
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val
805 810 815
Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
820 825 830
Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
835 840 845
Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp
850 855 860
Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu
865 870 875 880
Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg
885 890 895
Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys
900 905 910
Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp
915 920 925
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys
930 935 940
Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser
945 950 955 960
Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser
965 970 975
Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser
980 985 990
Leu Ser Leu Ser Pro Gly Lys
995
<210> 62
<211> 1000
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 62
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Gly Gly Gly Ser Glu
755 760 765
Pro Arg Gly Pro Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala
770 775 780
Pro Asn Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
785 790 795 800
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
805 810 815
Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val
820 825 830
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
835 840 845
Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
850 855 860
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala
865 870 875 880
Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
885 890 895
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr
900 905 910
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser
915 920 925
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
930 935 940
Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
945 950 955 960
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe
965 970 975
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
980 985 990
Ser Leu Ser Leu Ser Pro Gly Lys
995 1000
<210> 63
<211> 1012
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 63
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Gly Gly Gly Ser Arg
755 760 765
Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu
770 775 780
Glu Gln Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr
785 790 795 800
Gln Pro Leu Gly Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
805 810 815
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
820 825 830
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
835 840 845
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
850 855 860
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
865 870 875 880
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
885 890 895
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
900 905 910
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
915 920 925
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
930 935 940
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
945 950 955 960
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
965 970 975
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
980 985 990
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
995 1000 1005
Ser Pro Gly Lys
1010
<210> 64
<211> 999
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 64
Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala Gly Asp
1 5 10 15
Ser Leu Asp Cys Gly Gly Arg Gly Leu Ala Ala Leu Pro Gly Asp Leu
20 25 30
Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Lys Leu Ser Glu
35 40 45
Ile Asp Pro Ala Gly Phe Glu Asp Leu Pro Asn Leu Gln Glu Val Tyr
50 55 60
Leu Asn Asn Asn Glu Leu Thr Ala Val Pro Ser Leu Gly Ala Ala Ser
65 70 75 80
Ser His Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Arg Ser Val
85 90 95
Glu Gly Ser Gln Leu Lys Ala Tyr Leu Ser Leu Glu Val Leu Asp Leu
100 105 110
Ser Leu Asn Asn Ile Thr Glu Val Arg Asn Thr Cys Phe Pro His Gly
115 120 125
Pro Pro Ile Lys Glu Leu Asn Leu Ala Gly Asn Arg Ile Gly Thr Leu
130 135 140
Glu Leu Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr Leu Arg
145 150 155 160
Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Arg Ala Phe Lys Leu
165 170 175
Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg Leu Ile
180 185 190
Glu Gly Leu Thr Phe Gln Gly Leu Asn Ser Leu Glu Val Leu Lys Leu
195 200 205
Gln Arg Asn Asn Ile Ser Lys Leu Thr Asp Gly Ala Phe Trp Gly Leu
210 215 220
Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val Glu Val
225 230 235 240
Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu His Leu
245 250 255
Ser Asn Asn Ser Ile Ala Arg Ile His Arg Lys Gly Trp Ser Phe Cys
260 265 270
Gln Lys Leu His Glu Leu Val Leu Ser Phe Asn Asn Leu Thr Arg Leu
275 280 285
Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Val Leu Arg Leu
290 295 300
Ser His Asn Ser Ile Ser His Ile Ala Glu Gly Ala Phe Lys Gly Leu
305 310 315 320
Arg Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser Gly Thr
325 330 335
Ile Glu Asp Thr Ser Gly Ala Phe Ser Gly Leu Asp Ser Leu Ser Lys
340 345 350
Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg Ala Phe
355 360 365
Ser Gly Leu Glu Gly Leu Glu His Leu Asn Leu Gly Gly Asn Ala Ile
370 375 380
Arg Ser Val Gln Phe Asp Ala Phe Val Lys Met Lys Asn Leu Lys Glu
385 390 395 400
Leu His Ile Ser Ser Asp Ser Phe Leu Cys Asp Cys Gln Leu Lys Trp
405 410 415
Leu Pro Pro Trp Leu Ile Gly Arg Met Leu Gln Ala Phe Val Thr Ala
420 425 430
Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe Ser Val
435 440 445
Pro Pro Glu Ser Phe Val Cys Asp Asp Phe Leu Lys Pro Gln Ile Ile
450 455 460
Thr Gln Pro Glu Thr Thr Met Ala Met Val Gly Lys Asp Ile Arg Phe
465 470 475 480
Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe Ala Trp
485 490 495
Lys Lys Asp Asn Glu Val Leu Thr Asn Ala Asp Met Glu Asn Phe Val
500 505 510
His Val His Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr Ile Leu
515 520 525
His Leu Arg Gln Val Thr Phe Gly His Glu Gly Arg Tyr Gln Cys Val
530 535 540
Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg Leu Thr
545 550 555 560
Val Asn Val Leu Pro Ser Phe Thr Lys Thr Pro His Asp Ile Thr Ile
565 570 575
Arg Thr Thr Thr Val Ala Arg Leu Glu Cys Ala Ala Thr Gly His Pro
580 585 590
Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe Pro Ala
595 600 605
Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val Phe Phe
610 615 620
Ile Thr Asp Val Lys Ile Asp Asp Ala Gly Val Tyr Ser Cys Thr Ala
625 630 635 640
Gln Asn Ser Ala Gly Ser Ile Ser Ala Asn Ala Thr Leu Thr Val Leu
645 650 655
Glu Thr Pro Ser Leu Val Val Pro Leu Glu Asp Arg Val Val Ser Val
660 665 670
Gly Glu Thr Val Ala Leu Gln Cys Lys Ala Thr Gly Asn Pro Pro Pro
675 680 685
Arg Ile Thr Trp Phe Lys Gly Asp Arg Pro Leu Ser Leu Thr Glu Arg
690 695 700
His His Leu Thr Pro Asp Asn Gln Leu Leu Val Val Gln Asn Val Val
705 710 715 720
Ala Glu Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Thr Leu Gly
725 730 735
Thr Glu Arg Ala His Ser Gln Leu Ser Val Leu Pro Ala Ala Gly Cys
740 745 750
Arg Lys Asp Gly Thr Thr Val Gly Ile Phe Gly Gly Gly Gly Ser Glu
755 760 765
Pro Lys Ser Ser Asp Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro
770 775 780
Glu Leu Leu Gly Gly Ser Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
785 790 795 800
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val
805 810 815
Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
820 825 830
Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
835 840 845
Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp
850 855 860
Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu
865 870 875 880
Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg
885 890 895
Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys
900 905 910
Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp
915 920 925
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys
930 935 940
Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser
945 950 955 960
Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser
965 970 975
Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser
980 985 990
Leu Ser Leu Ser Pro Gly Lys
995
<210> 65
<211> 2997
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 65
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggcg gtggcggatc cgagccaaag tcctctgata agacacacac ctctccacca 2340
tgcccagcac cagagctgct gggaggacca agcgtgttcc tgtttcctcc aaagcccaag 2400
gacacactga tgatctccag gacaccagag gtgacctgcg tggtggtgga cgtgagccac 2460
gaggaccccg aggtgaagtt caactggtac gtggatggcg tggaggtgca caatgccaag 2520
accaagccca gagaggagca gtacaactct acctataggg tggtgagcgt gctgacagtg 2580
ctgcaccagg actggctgaa cggcaaggag tataagtgca aggtgagcaa taaggccctg 2640
cctgccccaa tcgagaagac aatctccaag gccaagggcc agccaagaga gccccaggtg 2700
tacaccctgc cccctagcag ggatgagctg acaaagaacc aggtgtccct gacctgtctg 2760
gtgaagggct tttatccctc cgacatcgcc gtggagtggg agtctaatgg ccagcctgag 2820
aataactaca agacaacccc acccgtgctg gattctgacg gcagcttctt tctgtattct 2880
aagctgaccg tggacaagag caggtggcag cagggcaacg tgttcagctg ctccgtgatg 2940
cacgaagcac tgcacaatca ctacacccag aaatcactgt cactgagccc tggcaaa 2997
<210> 66
<211> 3000
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 66
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggcg gtggcggatc cgagcctcgg ggccctacca tcaagccctg ccccccttgc 2340
aagtgccctg cccctaatct gctgggcgga ccctccgtgt tcctgtttcc tccaaagccc 2400
aaggacacac tgatgatctc caggacacca gaggtgacct gcgtggtggt ggacgtgagc 2460
cacgaggacc ccgaggtgaa gttcaactgg tacgtggatg gcgtggaggt gcacaatgcc 2520
aagaccaagc ccagagagga gcagtacaac tctacctata gggtggtgag cgtgctgaca 2580
gtgctgcacc aggactggct gaacggcaag gagtataagt gcaaggtgag caataaggcc 2640
ctgcctgccc caatcgagaa gacaatctcc aaggccaagg gccagccaag agagccccag 2700
gtgtacaccc tgccccctag cagggatgag ctgacaaaga accaggtgtc cctgacctgt 2760
ctggtgaagg gcttttatcc ctccgacatc gccgtggagt gggagtctaa tggccagcct 2820
gagaataact acaagacaac cccacccgtg ctggattctg acggcagctt ctttctgtat 2880
tctaagctga ccgtggacaa gagcaggtgg cagcagggca acgtgttcag ctgctccgtg 2940
atgcacgaag cactgcacaa tcactacacc cagaaatcac tgtcactgag ccctggcaaa 3000
3000
<210> 67
<211> 3036
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 67
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggcg gtggcggatc ccgcaacacc ggccgcggcg gcgaggagaa gaagaaggag 2340
aaggagaagg aggagcagga ggagcgcgag accaagaccc ccgagtgccc cagccacacc 2400
cagcccctgg gcgtgttcct gtttcctcca aagcccaagg acacactgat gatctccagg 2460
acaccagagg tgacctgcgt ggtggtggac gtgagccacg aggaccccga ggtgaagttc 2520
aactggtacg tggatggcgt ggaggtgcac aatgccaaga ccaagcccag agaggagcag 2580
tacaactcta cctatagggt ggtgagcgtg ctgacagtgc tgcaccagga ctggctgaac 2640
ggcaaggagt ataagtgcaa ggtgagcaat aaggccctgc ctgccccaat cgagaagaca 2700
atctccaagg ccaagggcca gccaagagag ccccaggtgt acaccctgcc ccctagcagg 2760
gatgagctga caaagaacca ggtgtccctg acctgtctgg tgaagggctt ttatccctcc 2820
gacatcgccg tggagtggga gtctaatggc cagcctgaga ataactacaa gacaacccca 2880
cccgtgctgg attctgacgg cagcttcttt ctgtattcta agctgaccgt ggacaagagc 2940
aggtggcagc agggcaacgt gttcagctgc tccgtgatgc acgaagcact gcacaatcac 3000
tacacccaga aatcactgtc actgagccct ggcaaa 3036
<210> 68
<211> 2997
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 68
gctgggcctc gggctccttg tgctgccgcc tgcacatgtg caggcgattc cctggactgc 60
ggcggcagag gcctggccgc cctgcctggc gatctgccat cctggacccg gagcctgaac 120
ctgagctaca acaagctgag cgagatcgat cccgccggct ttgaggacct gcctaacctg 180
caggaggtgt atctgaacaa taacgagctg accgcggtac catccctggg cgctgcttca 240
tcacatgtcg tctctctctt tctgcagcac aacaagattc gcagcgtgga ggggagccag 300
ctgaaggcct acctttcctt agaagtgtta gatctgagtt tgaacaacat cacggaagtg 360
cggaacacct gctttccaca cggaccgcct ataaaggagc tcaacctggc aggcaatcgg 420
attggcaccc tggagttggg agcatttgat ggtctgtcac ggtcgctgct aactcttcgc 480
ctgagcaaaa acaggatcac ccagcttcct gtaagagcat tcaagctacc caggctgaca 540
caactggacc tcaatcggaa caggattcgg ctgatagagg gcctcacctt ccaggggctc 600
aacagcttgg aggtgctgaa gcttcagcga aacaacatca gcaaactgac agatggggcc 660
ttctggggac tgtccaagat gcatgtgctg cacctggagt acaacagcct ggtagaagtg 720
aacagcggct cgctctacgg cctcacggcc ctgcatcagc tccacctcag caacaattcc 780
atcgctcgca ttcaccgcaa gggctggagc ttctgccaga agctgcatga gttggtcctg 840
tccttcaaca acctgacacg gctggacgag gagagcctgg ccgagctgag cagcctgagt 900
gtcctgcgtc tcagccacaa ttccatcagc cacattgcgg agggtgcctt caagggactc 960
aggagcctgc gagtcttgga tctggaccat aacgagattt cgggcacaat agaggacacg 1020
agcggcgcct tctcagggct cgacagcctc agcaagctga ctctgtttgg aaacaagatc 1080
aagtctgtgg ctaagagagc attctcgggg ctggaaggcc tggagcacct gaaccttgga 1140
gggaatgcga tcagatctgt ccagtttgat gcctttgtga agatgaagaa tcttaaagag 1200
ctccatatca gcagcgacag cttcctgtgt gactgccagc tgaagtggct gcccccgtgg 1260
ctaattggca ggatgctgca ggcctttgtg acagccacct gtgcccaccc agaatcactg 1320
aagggtcaga gcattttctc tgtgccacca gagagtttcg tgtgcgatga cttcctgaag 1380
ccacagatca tcacccagcc agaaaccacc atggctatgg tgggcaagga catccggttt 1440
acatgctcag cagccagcag cagcagctcc cccatgacct ttgcctggaa gaaagacaat 1500
gaagtcctga ccaatgcaga catggagaac tttgtccacg tccacgcgca ggacggggaa 1560
gtgatggagt acaccaccat cctgcacctc cgtcaggtca ctttcgggca cgagggccgc 1620
taccaatgtg tcatcaccaa ccactttggc tccacctatt cacataaggc caggctcacc 1680
gtgaatgtgt tgccatcatt caccaaaacg ccccacgaca taaccatccg gaccaccacc 1740
gtggcccgcc tcgaatgtgc tgccacaggt cacccaaacc ctcagattgc ctggcagaag 1800
gatggaggca cggatttccc cgctgcccgt gagcgacgca tgcatgtcat gccggatgac 1860
gacgtgtttt tcatcactga tgtgaaaata gatgacgcag gggtttacag ctgtactgct 1920
cagaactcag ccggttctat ttcagctaat gccaccctga ctgtcctaga gaccccatcc 1980
ttggtggtcc ccttggaaga ccgtgtggta tctgtgggag aaacagtggc cctccaatgc 2040
aaagccacgg ggaaccctcc gccccgcatc acctggttca agggggaccg cccgctgagc 2100
ctcactgagc ggcaccacct gacccctgac aaccagctcc tggtggttca gaacgtggtg 2160
gcagaggatg cgggccgata tacctgtgag atgtccaaca ccctgggcac ggagcgagct 2220
cacagccagc tgagcgtcct gcccgcagca ggctgcagga aggatgggac cacggtaggc 2280
atcttcggcg gtggcggatc cgaaccgaaa tcttctgaca aaacccacac ctctccgccg 2340
tctccggctc cggaactgct gggtggttct tctgttttcc tgtttcctcc aaagcccaag 2400
gacacactga tgatctccag gacaccagag gtgacctgcg tggtggtgga cgtgagccac 2460
gaggaccccg aggtgaagtt caactggtac gtggatggcg tggaggtgca caatgccaag 2520
accaagccca gagaggagca gtacaactct acctataggg tggtgagcgt gctgacagtg 2580
ctgcaccagg actggctgaa cggcaaggag tataagtgca aggtgagcaa taaggccctg 2640
cctgccccaa tcgagaagac aatctccaag gccaagggcc agccaagaga gccccaggtg 2700
tacaccctgc cccctagcag ggatgagctg acaaagaacc aggtgtccct gacctgtctg 2760
gtgaagggct tttatccctc cgacatcgcc gtggagtggg agtctaatgg ccagcctgag 2820
aataactaca agacaacccc acccgtgctg gattctgacg gcagcttctt tctgtattct 2880
aagctgaccg tggacaagag caggtggcag cagggcaacg tgttcagctg ctccgtgatg 2940
cacgaagcac tgcacaatca ctacacccag aaatcactgt cactgagccc tggcaaa 2997
<210> 69
<211> 997
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 69
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Lys
755 760 765
Ser Ser Asp Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu
770 775 780
Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
785 790 795 800
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val
805 810 815
Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val
820 825 830
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser
835 840 845
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
850 855 860
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala
865 870 875 880
Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
885 890 895
Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln
900 905 910
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
915 920 925
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
930 935 940
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
945 950 955 960
Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser
965 970 975
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
980 985 990
Leu Ser Pro Gly Lys
995
<210> 70
<211> 998
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 70
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Arg
755 760 765
Gly Pro Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn
770 775 780
Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp
785 790 795 800
Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
805 810 815
Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly
820 825 830
Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn
835 840 845
Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp
850 855 860
Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro
865 870 875 880
Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu
885 890 895
Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn
900 905 910
Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile
915 920 925
Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr
930 935 940
Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys
945 950 955 960
Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys
965 970 975
Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu
980 985 990
Ser Leu Ser Pro Gly Lys
995
<210> 71
<211> 1010
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 71
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Arg Asn Thr
755 760 765
Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln
770 775 780
Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro
785 790 795 800
Leu Gly Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
805 810 815
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
820 825 830
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
835 840 845
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
850 855 860
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
865 870 875 880
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
885 890 895
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
900 905 910
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
915 920 925
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
930 935 940
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
945 950 955 960
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
965 970 975
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
980 985 990
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
995 1000 1005
Gly Lys
1010
<210> 72
<211> 997
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 72
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Lys
755 760 765
Ser Ser Asp Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu
770 775 780
Leu Gly Gly Ser Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
785 790 795 800
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val
805 810 815
Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val
820 825 830
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser
835 840 845
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
850 855 860
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala
865 870 875 880
Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
885 890 895
Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln
900 905 910
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
915 920 925
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
930 935 940
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
945 950 955 960
Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser
965 970 975
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
980 985 990
Leu Ser Pro Gly Lys
995
<210> 73
<211> 2991
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 73
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgagcc aaagtcctct gataagacac acacctctcc accatgccca 2340
gcaccagagc tgctgggagg accaagcgtg ttcctgtttc ctccaaagcc caaggacaca 2400
ctgatgatct ccaggacacc agaggtgacc tgcgtggtgg tggacgtgag ccacgaggac 2460
cccgaggtga agttcaactg gtacgtggat ggcgtggagg tgcacaatgc caagaccaag 2520
cccagagagg agcagtacaa ctctacctat agggtggtga gcgtgctgac agtgctgcac 2580
caggactggc tgaacggcaa ggagtataag tgcaaggtga gcaataaggc cctgcctgcc 2640
ccaatcgaga agacaatctc caaggccaag ggccagccaa gagagcccca ggtgtacacc 2700
ctgcccccta gcagggatga gctgacaaag aaccaggtgt ccctgacctg tctggtgaag 2760
ggcttttatc cctccgacat cgccgtggag tgggagtcta atggccagcc tgagaataac 2820
tacaagacaa ccccacccgt gctggattct gacggcagct tctttctgta ttctaagctg 2880
accgtggaca agagcaggtg gcagcagggc aacgtgttca gctgctccgt gatgcacgaa 2940
gcactgcaca atcactacac ccagaaatca ctgtcactga gccctggcaa a 2991
<210> 74
<211> 2994
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 74
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgagcc tcggggccct accatcaagc cctgcccccc ttgcaagtgc 2340
cctgccccta atctgctggg cggaccctcc gtgttcctgt ttcctccaaa gcccaaggac 2400
acactgatga tctccaggac accagaggtg acctgcgtgg tggtggacgt gagccacgag 2460
gaccccgagg tgaagttcaa ctggtacgtg gatggcgtgg aggtgcacaa tgccaagacc 2520
aagcccagag aggagcagta caactctacc tatagggtgg tgagcgtgct gacagtgctg 2580
caccaggact ggctgaacgg caaggagtat aagtgcaagg tgagcaataa ggccctgcct 2640
gccccaatcg agaagacaat ctccaaggcc aagggccagc caagagagcc ccaggtgtac 2700
accctgcccc ctagcaggga tgagctgaca aagaaccagg tgtccctgac ctgtctggtg 2760
aagggctttt atccctccga catcgccgtg gagtgggagt ctaatggcca gcctgagaat 2820
aactacaaga caaccccacc cgtgctggat tctgacggca gcttctttct gtattctaag 2880
ctgaccgtgg acaagagcag gtggcagcag ggcaacgtgt tcagctgctc cgtgatgcac 2940
gaagcactgc acaatcacta cacccagaaa tcactgtcac tgagccctgg caaa 2994
<210> 75
<211> 3030
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 75
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatcccgcaa caccggccgc ggcggcgagg agaagaagaa ggagaaggag 2340
aaggaggagc aggaggagcg cgagaccaag acccccgagt gccccagcca cacccagccc 2400
ctgggcgtgt tcctgtttcc tccaaagccc aaggacacac tgatgatctc caggacacca 2460
gaggtgacct gcgtggtggt ggacgtgagc cacgaggacc ccgaggtgaa gttcaactgg 2520
tacgtggatg gcgtggaggt gcacaatgcc aagaccaagc ccagagagga gcagtacaac 2580
tctacctata gggtggtgag cgtgctgaca gtgctgcacc aggactggct gaacggcaag 2640
gagtataagt gcaaggtgag caataaggcc ctgcctgccc caatcgagaa gacaatctcc 2700
aaggccaagg gccagccaag agagccccag gtgtacaccc tgccccctag cagggatgag 2760
ctgacaaaga accaggtgtc cctgacctgt ctggtgaagg gcttttatcc ctccgacatc 2820
gccgtggagt gggagtctaa tggccagcct gagaataact acaagacaac cccacccgtg 2880
ctggattctg acggcagctt ctttctgtat tctaagctga ccgtggacaa gagcaggtgg 2940
cagcagggca acgtgttcag ctgctccgtg atgcacgaag cactgcacaa tcactacacc 3000
cagaaatcac tgtcactgag ccctggcaaa 3030
<210> 76
<211> 2991
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 76
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgaacc gaaatcttct gacaaaaccc acacctctcc gccgtctccg 2340
gctccggaac tgctgggtgg ttcttctgtt ttcctgtttc ctccaaagcc caaggacaca 2400
ctgatgatct ccaggacacc agaggtgacc tgcgtggtgg tggacgtgag ccacgaggac 2460
cccgaggtga agttcaactg gtacgtggat ggcgtggagg tgcacaatgc caagaccaag 2520
cccagagagg agcagtacaa ctctacctat agggtggtga gcgtgctgac agtgctgcac 2580
caggactggc tgaacggcaa ggagtataag tgcaaggtga gcaataaggc cctgcctgcc 2640
ccaatcgaga agacaatctc caaggccaag ggccagccaa gagagcccca ggtgtacacc 2700
ctgcccccta gcagggatga gctgacaaag aaccaggtgt ccctgacctg tctggtgaag 2760
ggcttttatc cctccgacat cgccgtggag tgggagtcta atggccagcc tgagaataac 2820
tacaagacaa ccccacccgt gctggattct gacggcagct tctttctgta ttctaagctg 2880
accgtggaca agagcaggtg gcagcagggc aacgtgttca gctgctccgt gatgcacgaa 2940
gcactgcaca atcactacac ccagaaatca ctgtcactga gccctggcaa a 2991
<210> 77
<211> 997
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 77
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Lys
755 760 765
Ser Ser Asp Lys Thr His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu
770 775 780
Leu Gly Gly Pro Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val
785 790 795 800
Leu Met Ile Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val
805 810 815
Ser Glu Asp Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val
820 825 830
Glu Val His Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser
835 840 845
Thr Leu Arg Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met
850 855 860
Ser Gly Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala
865 870 875 880
Pro Ile Glu Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro
885 890 895
Gln Val Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln
900 905 910
Val Thr Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr
915 920 925
Val Glu Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr
930 935 940
Glu Pro Val Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu
945 950 955 960
Arg Val Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser
965 970 975
Val Val His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser
980 985 990
Arg Thr Pro Gly Lys
995
<210> 78
<211> 998
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 78
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Arg
755 760 765
Gly Pro Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn
770 775 780
Leu Leu Gly Gly Pro Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp
785 790 795 800
Val Leu Met Ile Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp
805 810 815
Val Ser Glu Asp Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn
820 825 830
Val Glu Val His Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn
835 840 845
Ser Thr Leu Arg Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp
850 855 860
Met Ser Gly Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro
865 870 875 880
Ala Pro Ile Glu Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala
885 890 895
Pro Gln Val Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys
900 905 910
Gln Val Thr Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile
915 920 925
Tyr Val Glu Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn
930 935 940
Thr Glu Pro Val Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys
945 950 955 960
Leu Arg Val Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys
965 970 975
Ser Val Val His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe
980 985 990
Ser Arg Thr Pro Gly Lys
995
<210> 79
<211> 1010
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 79
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Arg Asn Thr
755 760 765
Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu Glu Gln
770 775 780
Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr Gln Pro
785 790 795 800
Leu Gly Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu Met Ile
805 810 815
Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser Glu Asp
820 825 830
Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu Val His
835 840 845
Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr Leu Arg
850 855 860
Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser Gly Lys
865 870 875 880
Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro Ile Glu
885 890 895
Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln Val Tyr
900 905 910
Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val Thr Leu
915 920 925
Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val Glu Trp
930 935 940
Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu Pro Val
945 950 955 960
Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg Val Glu
965 970 975
Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val Val His
980 985 990
Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg Thr Pro
995 1000 1005
Gly Lys
1010
<210> 80
<211> 997
<212> PRT
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 80
Ala Gln Ala Gly Pro Arg Ala Pro Cys Ala Ala Ala Cys Thr Cys Ala
1 5 10 15
Gly Asp Ser Leu Asp Cys Ser Gly Arg Gly Leu Ala Thr Leu Pro Arg
20 25 30
Asp Leu Pro Ser Trp Thr Arg Ser Leu Asn Leu Ser Tyr Asn Arg Leu
35 40 45
Ser Glu Ile Asp Ser Ala Ala Phe Glu Asp Leu Thr Asn Leu Gln Glu
50 55 60
Val Tyr Leu Asn Ser Asn Glu Leu Thr Ala Ile Pro Ser Leu Gly Ala
65 70 75 80
Ala Ser Ile Gly Val Val Ser Leu Phe Leu Gln His Asn Lys Ile Leu
85 90 95
Ser Val Asp Gly Ser Gln Leu Lys Ser Tyr Leu Ser Leu Glu Val Leu
100 105 110
Asp Leu Ser Ser Asn Asn Ile Thr Glu Ile Arg Ser Ser Cys Phe Pro
115 120 125
Asn Gly Leu Arg Ile Arg Glu Leu Asn Leu Ala Ser Asn Arg Ile Ser
130 135 140
Ile Leu Glu Ser Gly Ala Phe Asp Gly Leu Ser Arg Ser Leu Leu Thr
145 150 155 160
Leu Arg Leu Ser Lys Asn Arg Ile Thr Gln Leu Pro Val Lys Ala Phe
165 170 175
Lys Leu Pro Arg Leu Thr Gln Leu Asp Leu Asn Arg Asn Arg Ile Arg
180 185 190
Leu Ile Glu Gly Leu Thr Phe Gln Gly Leu Asp Ser Leu Glu Val Leu
195 200 205
Arg Leu Gln Arg Asn Asn Ile Ser Arg Leu Thr Asp Gly Ala Phe Trp
210 215 220
Gly Leu Ser Lys Met His Val Leu His Leu Glu Tyr Asn Ser Leu Val
225 230 235 240
Glu Val Asn Ser Gly Ser Leu Tyr Gly Leu Thr Ala Leu His Gln Leu
245 250 255
His Leu Ser Asn Asn Ser Ile Ser Arg Ile Gln Arg Asp Gly Trp Ser
260 265 270
Phe Cys Gln Lys Leu His Glu Leu Ile Leu Ser Phe Asn Asn Leu Thr
275 280 285
Arg Leu Asp Glu Glu Ser Leu Ala Glu Leu Ser Ser Leu Ser Ile Leu
290 295 300
Arg Leu Ser His Asn Ala Ile Ser His Ile Ala Glu Gly Ala Phe Lys
305 310 315 320
Gly Leu Lys Ser Leu Arg Val Leu Asp Leu Asp His Asn Glu Ile Ser
325 330 335
Gly Thr Ile Glu Asp Thr Ser Gly Ala Phe Thr Gly Leu Asp Asn Leu
340 345 350
Ser Lys Leu Thr Leu Phe Gly Asn Lys Ile Lys Ser Val Ala Lys Arg
355 360 365
Ala Phe Ser Gly Leu Glu Ser Leu Glu His Leu Asn Leu Gly Glu Asn
370 375 380
Ala Ile Arg Ser Val Gln Phe Asp Ala Phe Ala Lys Met Lys Asn Leu
385 390 395 400
Lys Glu Leu Tyr Ile Ser Ser Glu Ser Phe Leu Cys Asp Cys Gln Leu
405 410 415
Lys Trp Leu Pro Pro Trp Leu Met Gly Arg Met Leu Gln Ala Phe Val
420 425 430
Thr Ala Thr Cys Ala His Pro Glu Ser Leu Lys Gly Gln Ser Ile Phe
435 440 445
Ser Val Leu Pro Asp Ser Phe Val Cys Asp Asp Phe Pro Lys Pro Gln
450 455 460
Ile Ile Thr Gln Pro Glu Thr Thr Met Ala Val Val Gly Lys Asp Ile
465 470 475 480
Arg Phe Thr Cys Ser Ala Ala Ser Ser Ser Ser Ser Pro Met Thr Phe
485 490 495
Ala Trp Lys Lys Asp Asn Glu Val Leu Ala Asn Ala Asp Met Glu Asn
500 505 510
Phe Ala His Val Arg Ala Gln Asp Gly Glu Val Met Glu Tyr Thr Thr
515 520 525
Ile Leu His Leu Arg His Val Thr Phe Gly His Glu Gly Arg Tyr Gln
530 535 540
Cys Ile Ile Thr Asn His Phe Gly Ser Thr Tyr Ser His Lys Ala Arg
545 550 555 560
Leu Thr Val Asn Val Leu Pro Ser Phe Thr Lys Ile Pro His Asp Ile
565 570 575
Ala Ile Arg Thr Gly Thr Thr Ala Arg Leu Glu Cys Ala Ala Thr Gly
580 585 590
His Pro Asn Pro Gln Ile Ala Trp Gln Lys Asp Gly Gly Thr Asp Phe
595 600 605
Pro Ala Ala Arg Glu Arg Arg Met His Val Met Pro Asp Asp Asp Val
610 615 620
Phe Phe Ile Thr Asp Val Lys Ile Asp Asp Met Gly Val Tyr Ser Cys
625 630 635 640
Thr Ala Gln Asn Ser Ala Gly Ser Val Ser Ala Asn Ala Thr Leu Thr
645 650 655
Val Leu Glu Thr Pro Ser Leu Ala Val Pro Leu Glu Asp Arg Val Val
660 665 670
Thr Val Gly Glu Thr Val Ala Phe Gln Cys Lys Ala Thr Gly Ser Pro
675 680 685
Thr Pro Arg Ile Thr Trp Leu Lys Gly Gly Arg Pro Leu Ser Leu Thr
690 695 700
Glu Arg His His Phe Thr Pro Gly Asn Gln Leu Leu Val Val Gln Asn
705 710 715 720
Val Met Ile Asp Asp Ala Gly Arg Tyr Thr Cys Glu Met Ser Asn Pro
725 730 735
Leu Gly Thr Glu Arg Ala His Ser Gln Leu Ser Ile Leu Pro Thr Pro
740 745 750
Gly Cys Arg Lys Asp Gly Thr Thr Gly Gly Gly Gly Ser Glu Pro Lys
755 760 765
Ser Ser Asp Lys Thr His Thr Ser Pro Pro Ser Pro Ala Pro Glu Leu
770 775 780
Leu Gly Gly Ser Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val
785 790 795 800
Leu Met Ile Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val
805 810 815
Ser Glu Asp Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val
820 825 830
Glu Val His Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser
835 840 845
Thr Leu Arg Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met
850 855 860
Ser Gly Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala
865 870 875 880
Pro Ile Glu Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro
885 890 895
Gln Val Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln
900 905 910
Val Thr Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr
915 920 925
Val Glu Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr
930 935 940
Glu Pro Val Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu
945 950 955 960
Arg Val Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser
965 970 975
Val Val His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser
980 985 990
Arg Thr Pro Gly Lys
995
<210> 81
<211> 2991
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 81
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgagcc aaagtcctct gataagacac acacctctcc accatgccca 2340
gcaccagagc tgctgggagg accaagcgtg ttcatcttcc cacccaagat caaggacgtg 2400
ctgatgatct ccctgtcccc catcgtgacc tgcgtggtgg tggacgtgtc cgaggacgac 2460
cccgacgtgc agatcagttg gttcgtgaac aacgtggaag tgcacaccgc ccagacccag 2520
acccacagag aggactacaa ctccaccctg cgggtggtgt ccgccctgcc catccagcac 2580
caggactgga tgtccggcaa agaattcaag tgcaaagtga acaacaagga cctgcctgcc 2640
cccatcgagc ggaccatctc caagcccaag ggctccgtgc gggctcccca ggtgtacgtg 2700
ctgccccctc cagaggaaga gatgaccaag aagcaggtca cactgacctg catggtcacc 2760
gacttcatgc ccgaggacat ctacgtggaa tggaccaaca atggcaagac cgagctgaac 2820
tacaagaaca ccgagcctgt gctggactcc gacggctcct acttcatgta ctccaagctg 2880
cgggtggaaa agaagaactg ggtcgagcgg aactcctact cctgctccgt ggtgcacgag 2940
ggcctgcaca accaccacac caccaagtcc ttctcccgga cccccggcaa a 2991
<210> 82
<211> 2994
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 82
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgagcc tcggggccct accatcaagc cctgcccccc ttgcaagtgc 2340
cctgccccta atctgctggg cggaccctcc gtgttcatct tcccacccaa gatcaaggac 2400
gtgctgatga tctccctgtc ccccatcgtg acctgcgtgg tggtggacgt gtccgaggac 2460
gaccccgacg tgcagatcag ttggttcgtg aacaacgtgg aagtgcacac cgcccagacc 2520
cagacccaca gagaggacta caactccacc ctgcgggtgg tgtccgccct gcccatccag 2580
caccaggact ggatgtccgg caaagaattc aagtgcaaag tgaacaacaa ggacctgcct 2640
gcccccatcg agcggaccat ctccaagccc aagggctccg tgcgggctcc ccaggtgtac 2700
gtgctgcccc ctccagagga agagatgacc aagaagcagg tcacactgac ctgcatggtc 2760
accgacttca tgcccgagga catctacgtg gaatggacca acaatggcaa gaccgagctg 2820
aactacaaga acaccgagcc tgtgctggac tccgacggct cctacttcat gtactccaag 2880
ctgcgggtgg aaaagaagaa ctgggtcgag cggaactcct actcctgctc cgtggtgcac 2940
gagggcctgc acaaccacca caccaccaag tccttctccc ggacccccgg caaa 2994
<210> 83
<211> 3030
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 83
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatcccgcaa caccggccgc ggcggcgagg agaagaagaa ggagaaggag 2340
aaggaggagc aggaggagcg cgagaccaag acccccgagt gccccagcca cacccagccc 2400
ctgggcgtgt tcatcttccc acccaagatc aaggacgtgc tgatgatctc cctgtccccc 2460
atcgtgacct gcgtggtggt ggacgtgtcc gaggacgacc ccgacgtgca gatcagttgg 2520
ttcgtgaaca acgtggaagt gcacaccgcc cagacccaga cccacagaga ggactacaac 2580
tccaccctgc gggtggtgtc cgccctgccc atccagcacc aggactggat gtccggcaaa 2640
gaattcaagt gcaaagtgaa caacaaggac ctgcctgccc ccatcgagcg gaccatctcc 2700
aagcccaagg gctccgtgcg ggctccccag gtgtacgtgc tgccccctcc agaggaagag 2760
atgaccaaga agcaggtcac actgacctgc atggtcaccg acttcatgcc cgaggacatc 2820
tacgtggaat ggaccaacaa tggcaagacc gagctgaact acaagaacac cgagcctgtg 2880
ctggactccg acggctccta cttcatgtac tccaagctgc gggtggaaaa gaagaactgg 2940
gtcgagcgga actcctactc ctgctccgtg gtgcacgagg gcctgcacaa ccaccacacc 3000
accaagtcct tctcccggac ccccggcaaa 3030
<210> 84
<211> 2991
<212> DNA
<213> Artificial Sequence
<220>
<223> fusion protein
<400> 84
gctcaggctg gacctagggc tccttgcgct gccgcctgca cctgtgcagg cgattctctg 60
gactgcagcg gccggggcct ggccacactg cccagggacc tgccttcctg gaccagatct 120
ctgaacctga gctacaatcg gctgtccgag atcgattctg ccgcctttga ggacctgaca 180
aatctgcagg aggtgtatct gaacagcaat gagctgaccg caatcccctc cctgggagca 240
gcctctatcg gcgtggtgag cctgttcctg cagcacaaca agatcctgag cgtggatggc 300
tcccagctga agagctacct gtctctggag gtgctggacc tgagctccaa caatatcacc 360
gagatcagat ctagctgttt tcctaatggc ctgcggatca gagagctgaa cctggcctct 420
aatcggatca gcatcctgga gtccggcgcc ttcgatggcc tgagcagatc cctgctgaca 480
ctgcgcctgt ccaagaaccg gatcacccag ctgcccgtga aggcctttaa gctgcctagg 540
ctgacacagc tggacctgaa ccggaataga atcaggctga tcgagggcct gaccttccag 600
ggcctggata gcctggaggt gctgcgcctg cagcggaaca atatctcccg cctgacagac 660
ggagcatttt ggggcctgtc taagatgcac gtgctgcacc tggagtacaa tagcctggtg 720
gaggtgaact ctggcagcct gtatggcctg accgccctgc accagctgca cctgtccaac 780
aatagcatca gcagaatcca gagggatggc tggtccttct gccagaagct gcacgagctg 840
atcctgtctt ttaacaatct gaccaggctg gacgaggaga gcctggcaga gctgtcctct 900
ctgtccatcc tgcgcctgtc tcacaatgcc atcagccaca tcgccgaggg cgcctttaag 960
ggcctgaaga gcctgagggt gctggatctg gaccacaacg agatctctgg caccatcgag 1020
gatacaagcg gcgccttcac aggcctggac aatctgtcca agctgaccct gtttggcaac 1080
aagatcaagt ctgtggccaa gcgggccttc tctggcctgg agagcctgga gcacctgaac 1140
ctgggcgaga atgccatcag atccgtgcag ttcgatgcct ttgccaagat gaagaatctg 1200
aaggagctgt acatcagctc cgagagcttc ctgtgcgact gtcagctgaa gtggctgcca 1260
ccttggctga tgggaaggat gctgcaggcc tttgtgaccg ccacatgcgc ccacccagag 1320
agcctgaagg gccagagcat cttctccgtg ctgcccgata gcttcgtgtg cgacgatttt 1380
cctaagccac agatcatcac ccagccagag acaacaatgg ccgtggtggg caaggacatc 1440
cggtttacat gttccgccgc ctctagctcc tctagcccca tgaccttcgc ctggaagaag 1500
gataacgagg tgctggccaa tgccgacatg gagaacttcg cccacgtgag agcccaggat 1560
ggcgaagtga tggagtatac cacaatcctg cacctgcggc acgtgacctt tggccacgag 1620
ggcagatacc agtgcatcat cacaaatcac ttcggctcta cctatagcca caaggccagg 1680
ctgacagtga acgtgctgcc tagctttacc aagatcccac acgacatcgc catcagaaca 1740
ggcaccacag caaggctgga gtgtgcagca accggacacc caaaccctca gatcgcatgg 1800
cagaaggatg gaggcacaga cttccctgca gcccgcgaga ggagaatgca cgtgatgcca 1860
gacgatgacg tgttctttat cacagatgtg aagatcgatg acatgggcgt gtactcctgc 1920
accgcacaga acagcgccgg cagcgtgtcc gccaacgcca ccctgaccgt gctggagaca 1980
ccatccctgg ccgtgcccct ggaggacagg gtggtgaccg tgggcgagac agtggccttt 2040
cagtgtaagg ccaccggctc tccaacacca aggatcacct ggctgaaggg cggcaggccc 2100
ctgagcctga cagagcgcca ccacttcacc cctggcaatc agctgctggt ggtgcagaac 2160
gtgatgatcg atgacgccgg caggtataca tgcgagatga gcaatcctct gggcaccgag 2220
agggcacact cccagctgtc tatcctgcct accccaggct gccggaagga tggcaccaca 2280
ggcggtggcg gatccgaacc gaaatcttct gacaaaaccc acacctctcc gccgtctccg 2340
gctccggaac tgctgggtgg ttcttctgtt ttcatcttcc cacccaagat caaggacgtg 2400
ctgatgatct ccctgtcccc catcgtgacc tgcgtggtgg tggacgtgtc cgaggacgac 2460
cccgacgtgc agatcagttg gttcgtgaac aacgtggaag tgcacaccgc ccagacccag 2520
acccacagag aggactacaa ctccaccctg cgggtggtgt ccgccctgcc catccagcac 2580
caggactgga tgtccggcaa agaattcaag tgcaaagtga acaacaagga cctgcctgcc 2640
cccatcgagc ggaccatctc caagcccaag ggctccgtgc gggctcccca ggtgtacgtg 2700
ctgccccctc cagaggaaga gatgaccaag aagcaggtca cactgacctg catggtcacc 2760
gacttcatgc ccgaggacat ctacgtggaa tggaccaaca atggcaagac cgagctgaac 2820
tacaagaaca ccgagcctgt gctggactcc gacggctcct acttcatgta ctccaagctg 2880
cgggtggaaa agaagaactg ggtcgagcgg aactcctact cctgctccgt ggtgcacgag 2940
ggcctgcaca accaccacac caccaagtcc ttctcccgga cccccggcaa a 2991
Claims (20)
- Lrig-1(leucine-rich and immunoglobulin-like domains 1) 단백질의 세포 외 도메인(extracellular domain) 및 면역글로불린(immunoglobulin) Fc 영역을 포함하는 융합 단백질.
- 제1항에 있어서,
상기 Lrig-1 단백질의 세포 외 도메인은 서열번호 1 또는 3으로 표시되는, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 CH1, CH2, CH3 및 CH4 도메인으로 이루어진 그룹으로부터 선택된 1개 내지 4개 도메인을 포함하는, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 IgG, IgA, IgD, IgE 또는 IgM 유래 Fc 영역, 중쇄 불변영역 2(CH2), 중쇄 불변영역 3(CH3), 힌지(hinge), 이의 단편, 이들의 조합(combination), 또는 이들의 조합을 포함하는 하이브리드 Fc(hybrid Fc)인, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 IgG, IgA, IgD, IgE 또는 IgM 유래의 CH2 및 CH3 도메인을 포함하는, 융합 단백질 - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 IgG1, IgG2, IgG3 또는 IgG4 유래의 CH2 및 CH3 도메인을 포함하는, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 힌지(hinge) 영역을 포함하는, 융합 단백질. - 제7항에 있어서,
상기 면역글로불린 Fc 영역은 IgG, IgA, IgM, IgD, IgE, 또는 아바타셉트(Abatacept) 유래의 힌지 영역을 포함하는, 융합 단백질. - 제7항에 있어서,
상기 힌지(hinge) 영역은 IgG1, IgG2, IgG3, IgG4, IgD 또는 아바타셉트(Abatacept) 유래의 힌지 영역을 포함하는, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 서열번호 5 또는 6으로 표시되는 CH2 및 CH3을 포함하는, 융합 단백질. - 제1항에 있어서,
상기 면역글로불린 Fc 영역은 서열번호 7 내지 10 중 어느 하나로 표시되는 힌지 영역을 포함하는, 융합 단백질. - 제1항에 있어서,
상기 Lrig-1 단백질의 세포 외 도메인은 상기 면역글로불린 Fc 영역의 N-말단 또는 C-말단에 직접 연결되는, 융합 단백질. - 제1항에 있어서,
상기 Lrig-1 단백질의 세포 외 도메인은 링커를 통하여 상기 면역글로불린 Fc 영역의 N-말단 또는 C-말단에 연결되는, 융합 단백질. - 제13항에 있어서,
상기 링커는 서열번호 11로 표시되는 펩타이드 링커; 및 서열번호 12로 표시되는 펩타이드 링커; 중 적어도 하나인, 융합 단백질. - 제1항 내지 제14항 중 어느 한 항의 융합 단백질을 코딩하는 핵산 분자.
- 제15항의 핵산 분자가 삽입된 발현 벡터.
- 제16항의 발현 벡터가 형질 감염된 숙주 세포주.
- 제1항 내지 제14항 중 어느 한 항의 융합 단백질을 유효 성분으로 포함하는 암의 예방 또는 치료용 약학 조성물.
- 제18항에 있어서,
상기 암은 고형암인, 약학 조성물. - 제18항에 있어서,
상기 암은 위암, 간암, 교세포종, 난소암, 대장암, 두경부암, 방광암, 신장세포암, 유방암, 전이암, 전립선암, 췌장암, 흑색종 또는 폐암인, 약학 조성물.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020180048343 | 2018-04-26 | ||
KR20180048343 | 2018-04-26 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020200177336A Division KR20200144524A (ko) | 2018-04-26 | 2020-12-17 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20190124665A true KR20190124665A (ko) | 2019-11-05 |
KR102194644B1 KR102194644B1 (ko) | 2020-12-24 |
Family
ID=68293649
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020190049295A KR102194644B1 (ko) | 2018-04-26 | 2019-04-26 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
KR1020200177336A KR20200144524A (ko) | 2018-04-26 | 2020-12-17 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
KR1020230131463A KR20230146491A (ko) | 2018-04-26 | 2023-10-04 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020200177336A KR20200144524A (ko) | 2018-04-26 | 2020-12-17 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
KR1020230131463A KR20230146491A (ko) | 2018-04-26 | 2023-10-04 | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 |
Country Status (5)
Country | Link |
---|---|
US (1) | US11820801B2 (ko) |
EP (1) | EP3792276A4 (ko) |
KR (3) | KR102194644B1 (ko) |
CN (1) | CN112041333A (ko) |
WO (1) | WO2019209078A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021091359A1 (ko) * | 2019-11-08 | 2021-05-14 | 주식회사 굳티셀 | 조절 t 세포 표면 항원의 에피토프 및 이에 특이적으로 결합하는 항체 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210347849A1 (en) * | 2018-07-24 | 2021-11-11 | Good T Cells, Inc. | Composition for Preventing or Treating Immune-Related Diseases |
CN111285936A (zh) * | 2020-03-11 | 2020-06-16 | 北京双赢科创生物科技有限公司 | 靶向肿瘤的酸性敏感纳米肽段及其应用 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100888022B1 (ko) * | 2006-12-21 | 2009-03-09 | 재단법인 목암생명공학연구소 | 면역글로불린 Fc와 인간 아포리포단백질(a)크링글절편의 융합단백질 LK8Fc |
US20110151433A1 (en) * | 2005-09-27 | 2011-06-23 | Amunix Operating, Inc. | Methods for production of unstructured recombinant polymers and uses thereof |
WO2013184939A2 (en) * | 2012-06-08 | 2013-12-12 | Alkermes, Inc. | Fusion polypeptides comprising an active protein linked to a mucin-domain polypeptide |
US20150010556A1 (en) * | 2012-10-04 | 2015-01-08 | Research Development Foundation | Serine protease molecules and therapies |
US20150239964A1 (en) * | 2011-01-24 | 2015-08-27 | Sang-Kyou Lee | Novel Use of Regulatory T Cell-Specific Surface Protein LRIG-1 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1988007089A1 (en) | 1987-03-18 | 1988-09-22 | Medical Research Council | Altered antibodies |
US5116964A (en) | 1989-02-23 | 1992-05-26 | Genentech, Inc. | Hybrid immunoglobulins |
US6086875A (en) | 1995-01-17 | 2000-07-11 | The Brigham And Women's Hospital, Inc. | Receptor specific transepithelial transport of immunogens |
US6096871A (en) | 1995-04-14 | 2000-08-01 | Genentech, Inc. | Polypeptides altered to contain an epitope from the Fc region of an IgG molecule for increased half-life |
WO1997034631A1 (en) | 1996-03-18 | 1997-09-25 | Board Of Regents, The University Of Texas System | Immunoglobin-like domains with increased half lives |
ES2387028T3 (es) | 2003-12-31 | 2012-09-12 | Merck Patent Gmbh | Proteína de fusión de Fc-eritropoyetina con farmacocinética mejorada |
US8053183B2 (en) * | 2005-07-27 | 2011-11-08 | Oncotherapy Science, Inc. | Method of diagnosing esophageal cancer |
WO2008147143A2 (en) | 2007-05-30 | 2008-12-04 | Postech Academy-Industry Foundation | Immunoglobulin fusion proteins |
CA2695374A1 (en) * | 2007-08-15 | 2009-02-19 | Amunix, Inc. | Compositions and methods for modifying properties of biologically active polypeptides |
CN101891823B (zh) * | 2010-06-11 | 2012-10-03 | 北京东方百泰生物科技有限公司 | 一种Exendin-4及其类似物融合蛋白 |
KR101639015B1 (ko) * | 2013-07-30 | 2016-07-14 | 연세대학교 산학협력단 | 삭사틸린―Fc 융합 단백질 및 이의 용도 |
US20210347849A1 (en) | 2018-07-24 | 2021-11-11 | Good T Cells, Inc. | Composition for Preventing or Treating Immune-Related Diseases |
-
2019
- 2019-04-26 CN CN201980028463.XA patent/CN112041333A/zh active Pending
- 2019-04-26 US US17/050,816 patent/US11820801B2/en active Active
- 2019-04-26 EP EP19793399.7A patent/EP3792276A4/en active Pending
- 2019-04-26 WO PCT/KR2019/005096 patent/WO2019209078A1/ko active Application Filing
- 2019-04-26 KR KR1020190049295A patent/KR102194644B1/ko active IP Right Grant
-
2020
- 2020-12-17 KR KR1020200177336A patent/KR20200144524A/ko not_active IP Right Cessation
-
2023
- 2023-10-04 KR KR1020230131463A patent/KR20230146491A/ko active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110151433A1 (en) * | 2005-09-27 | 2011-06-23 | Amunix Operating, Inc. | Methods for production of unstructured recombinant polymers and uses thereof |
KR100888022B1 (ko) * | 2006-12-21 | 2009-03-09 | 재단법인 목암생명공학연구소 | 면역글로불린 Fc와 인간 아포리포단백질(a)크링글절편의 융합단백질 LK8Fc |
US20150239964A1 (en) * | 2011-01-24 | 2015-08-27 | Sang-Kyou Lee | Novel Use of Regulatory T Cell-Specific Surface Protein LRIG-1 |
KR101847523B1 (ko) * | 2011-01-24 | 2018-05-28 | 연세대학교 산학협력단 | 조절자 T 세포에 특이적으로 존재하는 새로운 표면단백질 Lrig-1의 용도 |
WO2013184939A2 (en) * | 2012-06-08 | 2013-12-12 | Alkermes, Inc. | Fusion polypeptides comprising an active protein linked to a mucin-domain polypeptide |
US20150010556A1 (en) * | 2012-10-04 | 2015-01-08 | Research Development Foundation | Serine protease molecules and therapies |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021091359A1 (ko) * | 2019-11-08 | 2021-05-14 | 주식회사 굳티셀 | 조절 t 세포 표면 항원의 에피토프 및 이에 특이적으로 결합하는 항체 |
Also Published As
Publication number | Publication date |
---|---|
WO2019209078A1 (ko) | 2019-10-31 |
US11820801B2 (en) | 2023-11-21 |
US20210130422A1 (en) | 2021-05-06 |
KR102194644B1 (ko) | 2020-12-24 |
CN112041333A (zh) | 2020-12-04 |
KR20230146491A (ko) | 2023-10-19 |
EP3792276A4 (en) | 2022-02-16 |
JP2021521817A (ja) | 2021-08-30 |
EP3792276A1 (en) | 2021-03-17 |
KR20200144524A (ko) | 2020-12-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109575140B (zh) | 靶向pd-1或pd-l1且靶向vegf家族的双靶向融合蛋白及其用途 | |
AU2017202437B2 (en) | Novel immunoconjugates | |
KR102392142B1 (ko) | 다량체 il-15 기반 분자 | |
DK3075745T3 (en) | Mutated interleukin-2 polypeptides | |
CN107835820B (zh) | 识别癌症特异性IL13Rα2的CAR T细胞 | |
KR20200003845A (ko) | 이중특이성 재조합단백질 및 이의 응용 | |
KR20230146491A (ko) | 신규한 융합 단백질 및 이를 포함하는 암의 예방 또는 치료용 약학적 조성물 | |
CN110234662A (zh) | 组织特异性wnt信号增强分子和其用途 | |
KR20220064986A (ko) | 항-pd-l1 단일-도메인 항체 및 그의 유도체 및 용도 | |
CN107207579A (zh) | 包含三聚体tnf家族配体的抗原结合分子 | |
CN111051350B (zh) | 包含信号调节蛋白α的免疫缀合物 | |
CN108727504A (zh) | 一种ifn与抗pd-l1抗体的融合蛋白及其应用 | |
KR20140091031A (ko) | 저밀도 지단백질-관련 단백질 6 (lrp6) - 반감기 연장제 구축물 | |
TW201138823A (en) | Anti-LRP6 antibodies | |
KR20220031054A (ko) | Egfrviii에 결합하는 단일클론 항체 및 이의 용도 | |
KR102263643B1 (ko) | 면역 관련 질환의 예방 또는 치료용 조성물 | |
KR20230137393A (ko) | Psma 결합 단백질 및 이의 용도 | |
CN112409484B (zh) | 多功能抗体、其制备及其用途 | |
KR20230059789A (ko) | 항-가변 muc1* 항체 및 이의 용도 | |
JP7485368B2 (ja) | 新規な融合タンパク質およびこれを含む癌の予防または治療用薬学的組成物 | |
KR20210017449A (ko) | 신규한 융합 단백질 및 이의 용도 | |
KR20230164118A (ko) | 암 치료를 위한 치료 조합 | |
KR20220088438A (ko) | 글리코실화된 lag3에 대해 특이적인 항체 및 이의 사용 방법 | |
CN117186222A (zh) | Pd-l1结合分子及其用途 | |
JPWO2019209078A5 (ko) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
A107 | Divisional application of patent | ||
GRNT | Written decision to grant |