CN116284426A - anti-GUCY 2C/CD3 bispecific antibody and application thereof - Google Patents
anti-GUCY 2C/CD3 bispecific antibody and application thereof Download PDFInfo
- Publication number
- CN116284426A CN116284426A CN202111497366.0A CN202111497366A CN116284426A CN 116284426 A CN116284426 A CN 116284426A CN 202111497366 A CN202111497366 A CN 202111497366A CN 116284426 A CN116284426 A CN 116284426A
- Authority
- CN
- China
- Prior art keywords
- ser
- gly
- val
- thr
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000027455 binding Effects 0.000 claims abstract description 122
- 238000009739 binding Methods 0.000 claims abstract description 90
- 239000000427 antigen Substances 0.000 claims abstract description 58
- 102000036639 antigens Human genes 0.000 claims abstract description 57
- 108091007433 antigens Proteins 0.000 claims abstract description 57
- 101000899808 Homo sapiens Guanylyl cyclase C Proteins 0.000 claims abstract description 41
- 102100022662 Guanylyl cyclase C Human genes 0.000 claims abstract description 36
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 36
- 239000012634 fragment Substances 0.000 claims abstract description 35
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 154
- 210000004027 cell Anatomy 0.000 claims description 55
- 230000004048 modification Effects 0.000 claims description 35
- 238000012986 modification Methods 0.000 claims description 35
- 108010047041 Complementarity Determining Regions Proteins 0.000 claims description 34
- 201000011510 cancer Diseases 0.000 claims description 31
- 238000000034 method Methods 0.000 claims description 26
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 22
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 22
- 239000000178 monomer Substances 0.000 claims description 21
- 229920001184 polypeptide Polymers 0.000 claims description 21
- 239000008194 pharmaceutical composition Substances 0.000 claims description 19
- 206010009944 Colon cancer Diseases 0.000 claims description 17
- 239000013604 expression vector Substances 0.000 claims description 16
- 230000014509 gene expression Effects 0.000 claims description 15
- 230000035772 mutation Effects 0.000 claims description 13
- 150000001413 amino acids Chemical class 0.000 claims description 11
- 210000004899 c-terminal region Anatomy 0.000 claims description 11
- 208000029742 colonic neoplasm Diseases 0.000 claims description 11
- 239000000833 heterodimer Substances 0.000 claims description 10
- 239000003814 drug Substances 0.000 claims description 9
- 208000002699 Digestive System Neoplasms Diseases 0.000 claims description 7
- 239000004570 mortar (masonry) Substances 0.000 claims description 7
- 108091033319 polynucleotide Proteins 0.000 claims description 7
- 102000040430 polynucleotide Human genes 0.000 claims description 7
- 239000002157 polynucleotide Substances 0.000 claims description 7
- 208000005718 Stomach Neoplasms Diseases 0.000 claims description 6
- 125000000539 amino acid group Chemical group 0.000 claims description 6
- 206010017758 gastric cancer Diseases 0.000 claims description 6
- 229940127121 immunoconjugate Drugs 0.000 claims description 6
- 201000011549 stomach cancer Diseases 0.000 claims description 6
- 208000000461 Esophageal Neoplasms Diseases 0.000 claims description 5
- 206010030155 Oesophageal carcinoma Diseases 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 5
- 201000004101 esophageal cancer Diseases 0.000 claims description 5
- 238000006467 substitution reaction Methods 0.000 claims description 5
- WSNMPAVSZJSIMT-UHFFFAOYSA-N COc1c(C)c2COC(=O)c2c(O)c1CC(O)C1(C)CCC(=O)O1 Chemical group COc1c(C)c2COC(=O)c2c(O)c1CC(O)C1(C)CCC(=O)O1 WSNMPAVSZJSIMT-UHFFFAOYSA-N 0.000 claims description 4
- 230000002159 abnormal effect Effects 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 210000003236 esophagogastric junction Anatomy 0.000 claims description 4
- 102000004127 Cytokines Human genes 0.000 claims description 3
- 108090000695 Cytokines Proteins 0.000 claims description 3
- 102000004190 Enzymes Human genes 0.000 claims description 3
- 108090000790 Enzymes Proteins 0.000 claims description 3
- 206010061902 Pancreatic neoplasm Diseases 0.000 claims description 3
- 208000015634 Rectal Neoplasms Diseases 0.000 claims description 3
- 230000008878 coupling Effects 0.000 claims description 3
- 238000010168 coupling process Methods 0.000 claims description 3
- 238000005859 coupling reaction Methods 0.000 claims description 3
- 229940079593 drug Drugs 0.000 claims description 3
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 claims description 3
- 201000002528 pancreatic cancer Diseases 0.000 claims description 3
- 208000008443 pancreatic carcinoma Diseases 0.000 claims description 3
- 206010038038 rectal cancer Diseases 0.000 claims description 3
- 201000001275 rectum cancer Diseases 0.000 claims description 3
- 201000002314 small intestine cancer Diseases 0.000 claims description 3
- 231100000765 toxin Toxicity 0.000 claims description 3
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 claims description 2
- 238000007792 addition Methods 0.000 claims description 2
- 238000012217 deletion Methods 0.000 claims description 2
- 230000037430 deletion Effects 0.000 claims description 2
- 239000000539 dimer Substances 0.000 claims description 2
- 238000004519 manufacturing process Methods 0.000 claims description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 2
- 239000003053 toxin Substances 0.000 claims description 2
- 230000009452 underexpressoin Effects 0.000 claims description 2
- 230000000259 anti-tumor effect Effects 0.000 abstract description 5
- 230000008685 targeting Effects 0.000 abstract description 5
- 230000002195 synergetic effect Effects 0.000 abstract description 3
- 102000017420 CD3 protein, epsilon/gamma/delta subunit Human genes 0.000 description 90
- 108090000623 proteins and genes Proteins 0.000 description 63
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 57
- 102000004169 proteins and genes Human genes 0.000 description 53
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 47
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 47
- 241000880493 Leptailurus serval Species 0.000 description 46
- 108020004414 DNA Proteins 0.000 description 44
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 44
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 44
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 43
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 38
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 34
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 32
- 101000642536 Apis mellifera Venom serine protease 34 Proteins 0.000 description 31
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 31
- 108010017391 lysylvaline Proteins 0.000 description 30
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 29
- 210000001744 T-lymphocyte Anatomy 0.000 description 29
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 28
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 28
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 27
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 27
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 27
- 108010008355 arginyl-glutamine Proteins 0.000 description 27
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 27
- 239000000523 sample Substances 0.000 description 27
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 25
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 25
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 25
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 25
- 108010073969 valyllysine Proteins 0.000 description 24
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 23
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 23
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 23
- 238000001514 detection method Methods 0.000 description 23
- 108010037850 glycylvaline Proteins 0.000 description 23
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 22
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 22
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 22
- 108010089804 glycyl-threonine Proteins 0.000 description 22
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 21
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 21
- 108010064235 lysylglycine Proteins 0.000 description 21
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 20
- 108010050848 glycylleucine Proteins 0.000 description 20
- 108010031719 prolyl-serine Proteins 0.000 description 20
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 19
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 19
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 18
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 18
- 241001529936 Murinae Species 0.000 description 18
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 18
- 108010010147 glycylglutamine Proteins 0.000 description 18
- 210000004408 hybridoma Anatomy 0.000 description 18
- 108010065920 Insulin Lispro Proteins 0.000 description 17
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 17
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 17
- 108010015792 glycyllysine Proteins 0.000 description 17
- 108010078144 glutaminyl-glycine Proteins 0.000 description 16
- 108010003137 tyrosyltyrosine Proteins 0.000 description 16
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 15
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 15
- 238000002965 ELISA Methods 0.000 description 15
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 15
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 15
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 15
- 108010070643 prolylglutamic acid Proteins 0.000 description 15
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 14
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 14
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 14
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 14
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 14
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 14
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 14
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 14
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 14
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 14
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 14
- 108010051242 phenylalanylserine Proteins 0.000 description 14
- 108010020532 tyrosyl-proline Proteins 0.000 description 14
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 13
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 13
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 13
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 13
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 13
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 13
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 13
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 13
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 13
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 13
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 13
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 13
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 13
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 13
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 13
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 13
- 108010044292 tryptophyltyrosine Proteins 0.000 description 13
- 210000004881 tumor cell Anatomy 0.000 description 13
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 12
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 12
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 12
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 12
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 12
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 12
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 12
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 12
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 12
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 12
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 12
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 12
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 12
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 12
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 12
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 12
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 12
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 11
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 11
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 11
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 11
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 11
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 11
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 11
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 11
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 11
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 11
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 11
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 11
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 11
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 11
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 11
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 11
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 11
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 11
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 11
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 11
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 11
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 11
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 11
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 11
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 11
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 11
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 11
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 11
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 11
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 11
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 11
- 230000002147 killing effect Effects 0.000 description 11
- 108010090894 prolylleucine Proteins 0.000 description 11
- 108010053725 prolylvaline Proteins 0.000 description 11
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 10
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 10
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 10
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 10
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 10
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 10
- 229940098773 bovine serum albumin Drugs 0.000 description 10
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 10
- 239000007924 injection Substances 0.000 description 10
- 238000002347 injection Methods 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 10
- 108010027345 wheylin-1 peptide Proteins 0.000 description 10
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 9
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 9
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 9
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 9
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 9
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 9
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 9
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 9
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 9
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 9
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 9
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 9
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 9
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 9
- 239000000872 buffer Substances 0.000 description 9
- 108010060199 cysteinylproline Proteins 0.000 description 9
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 8
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 8
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 8
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 8
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 8
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 8
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 8
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 8
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 8
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 8
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 8
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 8
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 8
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 8
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 8
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 8
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 8
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 8
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 8
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 8
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 8
- 241000699670 Mus sp. Species 0.000 description 8
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 8
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 8
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 8
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 8
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 8
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 8
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 8
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 8
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 8
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 8
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 8
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 8
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 8
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 8
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 8
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 8
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 8
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 8
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 8
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 8
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 8
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 8
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 8
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 8
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 8
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 8
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 238000002360 preparation method Methods 0.000 description 8
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 8
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 8
- 108010080629 tryptophan-leucine Proteins 0.000 description 8
- 108010038745 tryptophylglycine Proteins 0.000 description 8
- 108010051110 tyrosyl-lysine Proteins 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 7
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 7
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 7
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 7
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 7
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 7
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 7
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 7
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 7
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 7
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 7
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 7
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 7
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 7
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 7
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 7
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 7
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 7
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 7
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 7
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 7
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 7
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 7
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 7
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 7
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 7
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 7
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 7
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 7
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 7
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 7
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 7
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 7
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 7
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 7
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 7
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 7
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 7
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 7
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 7
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 7
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 7
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 7
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 7
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 7
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 7
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 7
- 108010081404 acein-2 Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010012058 leucyltyrosine Proteins 0.000 description 7
- 108010077112 prolyl-proline Proteins 0.000 description 7
- 238000005406 washing Methods 0.000 description 7
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 6
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 6
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 6
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 6
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 6
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 6
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 6
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 6
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 6
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 6
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 6
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 6
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 6
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 6
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 6
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 6
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 6
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 6
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 6
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 6
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 6
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 6
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 6
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 6
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 6
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 6
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 6
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 6
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 6
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 6
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 6
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 6
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 6
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 6
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 6
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 6
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 6
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 6
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 6
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 6
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 6
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 6
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 6
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 6
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 6
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 6
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 230000037396 body weight Effects 0.000 description 6
- 108010054813 diprotin B Proteins 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 6
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 5
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 5
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical compound COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 5
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 5
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 5
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 5
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 5
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 5
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 5
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 5
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 5
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 5
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 5
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 5
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 5
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 5
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 5
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 5
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 5
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 5
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 5
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 5
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 5
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 5
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 5
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 5
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 5
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 5
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 5
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 5
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 5
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 5
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 5
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 5
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 5
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 5
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 5
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 5
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 5
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 5
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 5
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 5
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 5
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 5
- QQFSKBMCAKWHLG-UHFFFAOYSA-N Ile-Phe-Pro-Pro Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(NC(=O)C(N)C(C)CC)CC1=CC=CC=C1 QQFSKBMCAKWHLG-UHFFFAOYSA-N 0.000 description 5
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 5
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 5
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 5
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 5
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 5
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 5
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 5
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 5
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 5
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 5
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 5
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 5
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 5
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 5
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 5
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 5
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 5
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 5
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 5
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 5
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 5
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 5
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 5
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 5
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 5
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 5
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 5
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 5
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 5
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 5
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 5
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 5
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 5
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 5
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 5
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 5
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 5
- BGWSLEYVITZIQP-DCPHZVHLSA-N Trp-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O BGWSLEYVITZIQP-DCPHZVHLSA-N 0.000 description 5
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 5
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 5
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 5
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 5
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 5
- RZAGEHHVNYESNR-RNXOBYDBSA-N Tyr-Trp-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RZAGEHHVNYESNR-RNXOBYDBSA-N 0.000 description 5
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 5
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 5
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 5
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 5
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 5
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 5
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 5
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 5
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 238000004113 cell culture Methods 0.000 description 5
- 239000011248 coating agent Substances 0.000 description 5
- 238000000576 coating method Methods 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 230000002496 gastric effect Effects 0.000 description 5
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 5
- 102000045577 human GUCY2C Human genes 0.000 description 5
- 108010091871 leucylmethionine Proteins 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 108010009962 valyltyrosine Proteins 0.000 description 5
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 4
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 4
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 4
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 4
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 4
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 4
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 4
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 4
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 4
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 4
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 4
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 4
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 4
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 4
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 4
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 4
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 4
- QWTGQXGNNMIUCW-BPUTZDHNSA-N Met-Asn-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QWTGQXGNNMIUCW-BPUTZDHNSA-N 0.000 description 4
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 4
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 4
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 4
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 4
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 4
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 4
- 108091008874 T cell receptors Proteins 0.000 description 4
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 4
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 4
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 4
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 4
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 4
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 4
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 4
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 4
- 239000002250 absorbent Substances 0.000 description 4
- 230000002745 absorbent Effects 0.000 description 4
- 239000002671 adjuvant Substances 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010053037 kyotorphin Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 3
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 3
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 3
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 3
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 3
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 3
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 3
- BPHKULHWEIUDOB-FXQIFTODSA-N Cys-Gln-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BPHKULHWEIUDOB-FXQIFTODSA-N 0.000 description 3
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 3
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 102000000802 Galectin 3 Human genes 0.000 description 3
- 108010001517 Galectin 3 Proteins 0.000 description 3
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 3
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 3
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 3
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 3
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 3
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 3
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 3
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 3
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 3
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 3
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 3
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 3
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 3
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 3
- 102000006833 Multifunctional Enzymes Human genes 0.000 description 3
- 108010047290 Multifunctional Enzymes Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 3
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 3
- 239000004698 Polyethylene Substances 0.000 description 3
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 3
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 3
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 3
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 3
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 3
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 3
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 3
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 3
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 3
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 3
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000022534 cell killing Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000010494 dissociation reaction Methods 0.000 description 3
- 230000005593 dissociations Effects 0.000 description 3
- 239000002552 dosage form Substances 0.000 description 3
- 238000004945 emulsification Methods 0.000 description 3
- 210000001035 gastrointestinal tract Anatomy 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 210000000936 intestine Anatomy 0.000 description 3
- 239000007928 intraperitoneal injection Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 229960005322 streptomycin Drugs 0.000 description 3
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 2
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 2
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 2
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 2
- 241000283074 Equus asinus Species 0.000 description 2
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 2
- 101001037140 Homo sapiens Immunoglobulin heavy variable 3-23 Proteins 0.000 description 2
- 101000978127 Homo sapiens Immunoglobulin lambda variable 7-46 Proteins 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- 102100040220 Immunoglobulin heavy variable 3-23 Human genes 0.000 description 2
- 102100023751 Immunoglobulin lambda variable 7-46 Human genes 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 2
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 2
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 2
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 2
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 2
- ZJPSMXCFEKMZFE-IHPCNDPISA-N Trp-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O ZJPSMXCFEKMZFE-IHPCNDPISA-N 0.000 description 2
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 2
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 2
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- CJDZKZFMAXGUOJ-IHRRRGAJSA-N Val-Cys-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CJDZKZFMAXGUOJ-IHRRRGAJSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 239000000611 antibody drug conjugate Substances 0.000 description 2
- 229940049595 antibody-drug conjugate Drugs 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 2
- 208000024558 digestive system cancer Diseases 0.000 description 2
- 238000007865 diluting Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 201000010231 gastrointestinal system cancer Diseases 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 238000007917 intracranial administration Methods 0.000 description 2
- 238000010255 intramuscular injection Methods 0.000 description 2
- 239000007927 intramuscular injection Substances 0.000 description 2
- 230000002601 intratumoral effect Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 231100000518 lethal Toxicity 0.000 description 2
- 230000001665 lethal effect Effects 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 238000002595 magnetic resonance imaging Methods 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 201000000050 myeloid neoplasm Diseases 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 235000020183 skimmed milk Nutrition 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 238000002198 surface plasmon resonance spectroscopy Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- FSWYUDLVKBSHDX-UHFFFAOYSA-N 1,4,5,8-tetrahydronaphthalene Chemical compound C1C=CCC2=C1CC=CC2 FSWYUDLVKBSHDX-UHFFFAOYSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- IIQIOFVDFOLCHP-UHFFFAOYSA-N Asn-Pro-Ser-Ser Chemical compound NC(=O)CC(N)C(=O)N1CCCC1C(=O)NC(CO)C(=O)NC(CO)C(O)=O IIQIOFVDFOLCHP-UHFFFAOYSA-N 0.000 description 1
- YWFLXGZHZXXINF-BPUTZDHNSA-N Asn-Pro-Trp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 YWFLXGZHZXXINF-BPUTZDHNSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 101000914947 Bungarus multicinctus Long neurotoxin homolog TA-bm16 Proteins 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 231100000023 Cell-mediated cytotoxicity Toxicity 0.000 description 1
- 206010057250 Cell-mediated cytotoxicity Diseases 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 1
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 1
- 206010050685 Cytokine storm Diseases 0.000 description 1
- 101150017853 GUCY2C gene Proteins 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010078321 Guanylate Cyclase Proteins 0.000 description 1
- 102000014469 Guanylate cyclase Human genes 0.000 description 1
- 208000002250 Hematologic Neoplasms Diseases 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- 102000043129 MHC class I family Human genes 0.000 description 1
- 108091054437 MHC class I family Proteins 0.000 description 1
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 1
- 101710160107 Outer membrane protein A Proteins 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- UEJYSALTSUZXFV-SRVKXCTJSA-N Rigin Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UEJYSALTSUZXFV-SRVKXCTJSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- SJPDTIQHLBQPFO-VLCNGCBASA-N Thr-Tyr-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SJPDTIQHLBQPFO-VLCNGCBASA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 102000000591 Tight Junction Proteins Human genes 0.000 description 1
- 108010002321 Tight Junction Proteins Proteins 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 1
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000000538 analytical sample Substances 0.000 description 1
- 230000009830 antibody antigen interaction Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 239000008365 aqueous carrier Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 229960003008 blinatumomab Drugs 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000005890 cell-mediated cytotoxicity Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 239000004567 concrete Substances 0.000 description 1
- 239000000562 conjugate Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000002872 contrast media Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 206010052015 cytokine release syndrome Diseases 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000000857 drug effect Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000005734 heterodimerization reaction Methods 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000013632 homeostatic process Effects 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 229940043515 other immunoglobulins in atc Drugs 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 239000006201 parenteral dosage form Substances 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000004017 serum-free culture medium Substances 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 210000000225 synapse Anatomy 0.000 description 1
- 210000001578 tight junction Anatomy 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000005909 tumor killing Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2803—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily
- C07K16/2809—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily against the T-cell receptor (TcR)-CD3 complex
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6801—Drug-antibody or immunoglobulin conjugates defined by the pharmacologically or therapeutically active agent
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6835—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site
- A61K47/6849—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site the antibody targeting a receptor, a cell surface antigen or a cell surface determinant
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6835—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site
- A61K47/6871—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site the antibody targeting an enzyme
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/40—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against enzymes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/20—Immunoglobulins specific features characterized by taxonomic origin
- C07K2317/24—Immunoglobulins specific features characterized by taxonomic origin containing regions, domains or residues from different species, e.g. chimeric, humanized or veneered
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/30—Immunoglobulins specific features characterized by aspects of specificity or valency
- C07K2317/31—Immunoglobulins specific features characterized by aspects of specificity or valency multispecific
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/56—Immunoglobulins specific features characterized by immunoglobulin fragments variable (Fv) region, i.e. VH and/or VL
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/56—Immunoglobulins specific features characterized by immunoglobulin fragments variable (Fv) region, i.e. VH and/or VL
- C07K2317/565—Complementarity determining region [CDR]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/60—Immunoglobulins specific features characterized by non-natural combinations of immunoglobulin fragments
- C07K2317/62—Immunoglobulins specific features characterized by non-natural combinations of immunoglobulin fragments comprising only variable region components
- C07K2317/622—Single chain antibody (scFv)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/70—Immunoglobulins specific features characterized by effect upon binding to a cell or to an antigen
- C07K2317/73—Inducing cell death, e.g. apoptosis, necrosis or inhibition of cell proliferation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/90—Immunoglobulins specific features characterized by (pharmaco)kinetic aspects or by stability of the immunoglobulin
- C07K2317/92—Affinity (KD), association rate (Ka), dissociation rate (Kd) or EC50 value
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Epidemiology (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Cell Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention provides an anti-GUCY 2C/CD3 bispecific antibody, comprising a domain A which binds to a target molecule A and a domain B which binds to a target molecule B; the target molecule A and the target molecule B are selected from GUCY2C and CD3; the domain a and domain B are selected from an antibody or antigen-binding fragment thereof directed against GUCY2C and an antibody or antigen-binding fragment thereof directed against CD 3. The anti-GUCY 2C/CD3 bispecific antibody has excellent targeting property, specificity and remarkable synergic anti-tumor activity, and has important clinical significance for tumor treatment.
Description
Technical Field
The present application relates to the field of antibody pharmaceuticals, in particular, the present application relates to an anti-GUCY 2C/CD3 bispecific antibody and uses thereof.
Background
Gastrointestinal malignant tumors, including colorectal cancer (CRC), gastric cancer and esophageal cancer, show that according to global cancer statistics report in 2020, the global colorectal cancer lethal number accounts for 9.4% of the total cancer lethal number, and gastric cancer accounts for 7.7%; in China, gastric cancer is 12.4% of all cancers, esophageal cancer is 10%, colorectal cancer is 9.5%, and gastrointestinal malignant tumor is a clinically far unsatisfied treatment field in China or even worldwide. GUCY2C or GCC, guanylate cyclase 2C, belongs to the family of guanylate cyclase receptors. Studies have shown that GUCY2C is overexpressed in a variety of cancers of the gastrointestinal tract, including 90% or more colorectal cancer and 50% or more gastric or gastroesophageal junction cancer at each malignant stage. In physiological states, the expression of GUCY2C is limited to the surface of epithelial cells tightly linked to the lumen of the healthy intestine to maintain homeostasis in the intestine. However, in pathological conditions, the occurrence of tumors can disrupt the tight junction structure of the luminal surface of the intestine, resulting in the exposure of GUCY2C, which may be the target for preferential binding of targeted drugs.
The therapeutic effects of redirecting T cell function have been demonstrated in a number of clinical trials, such as the use of bolafirumab (Blinatumomab) in the treatment of hematological malignancies, and the recently reported effective treatment of solid tumors, such as colorectal and prostate cancer, based on CD3 bispecific antibodies. Based on CD3 bispecific antibodies, one arm binds to a tumor-associated cell surface antigen and the other arm binds to CD3 epsilon protein on T cells, which can be used effectively in the treatment of tumors because they recruit and activate T cell populations specifically target specific antigens that are overexpressed on the tumor-associated cell surface. They do not require binding to mhc class i complex antigen peptides via T Cell Receptors (TCRs) to activate T cells, but rather, direct targeting of recruited T cells to tumor cells expressing cell surface antigens forms an immune synapse to activate T cells, causing Cytotoxic T Lymphocytes (CTLs) to kill tumor cells.
Therefore, in order to further improve the anti-tumor function of the GUCY2C antibody and activate the tumor killing activity of T cells, the development of the bispecific antibody targeting GUCY2C and CD3 simultaneously to treat gastrointestinal malignant tumor of GUCY2C abnormal expression has better clinical application prospect.
Disclosure of Invention
The invention aims to provide an anti-GUCY 2C/CD3 bispecific antibody and medical application thereof.
In a first aspect of the invention, there is provided an anti-GUCY 2C/CD3 bispecific antibody comprising a domain A that binds to a target molecule A and a domain B that binds to a target molecule B; the target molecule A and the target molecule B are selected from GUCY2C and CD3; the domain a and domain B are selected from an antibody or antigen-binding fragment thereof directed against GUCY2C, and an antibody or antigen-binding fragment thereof directed against CD 3.
In another preferred embodiment, the antibody or antigen-binding fragment thereof is a chimeric antibody or antigen-binding fragment thereof, or a humanized antibody or antigen-binding fragment thereof.
In another preferred embodiment, there is provided an anti-GUCY 2C/CD3 bispecific antibody comprising: a) a first domain a, B) a second domain B, and, optionally, further comprising c) an Fc domain; the target molecule A and the target molecule B are selected from GUCY2C and CD3; the domain a and domain B are selected from an antibody or antigen-binding fragment thereof directed against GUCY2C and an antibody or antigen-binding fragment thereof directed against CD 3.
In another preferred example, the first domain a and the second domain B may be 1, 2, 3 or 4.
In another preferred embodiment, the Fc domain comprises 2 Fc polypeptide monomers, each comprising CH2-CH3 in amino to carboxyl order, the Fc polypeptide monomers being linked by disulfide bonds.
In another preferred embodiment, the Fc domain is selected from the Fc domain of a homodimer or the Fc domain of a heterodimer; preferably, the Fc domain of the heterodimer comprises a heterodimeric modification. By heterodimeric modification is meant a modification that can induce heterodimerization of an Fc domain, including but not limited to, a knob-into-hole modification, a sterically hindered modification, a charge modification (charge mutation), a hydrogen bonding modification, a hydrophobic interaction modification, or a combination thereof. More preferably, the modification is in the CH3 region of the Fc domain.
In another preferred embodiment, the bispecific antibody comprises a monomer or dimer of monomers, which may be homologous or heterologous, comprising from amino-to carboxy-terminus a structure selected from the group consisting of:
structure I:
structure II:
structure III:
structure IV:
structure V:
wherein,,
b1, B2, B3, B4, B5 are each independently an antigen-binding fragment that is devoid of or binds to target molecule B, and at least one is not devoid of;
L1, L2, L3, L4, L5, L6 are each independently no or a bond or a linker;
VHA represents the heavy chain variable region that binds to target molecule a; VLA represents the light chain variable region that binds to target molecule a;
CL represents the light chain constant region; CH represents a heavy chain constant region;
"-" represents disulfide or covalent bonds; "-" represents a peptide bond.
In another preferred embodiment, the bispecific antibody comprises a structure selected from the group consisting of:
a) Homodimers formed from monomers of structure I;
b) A heterodimer formed from monomers of structure I and structure II;
c) Monomers of structure III;
d) Monomers of structure IV;
e) Monomers of structure V.
In another preferred embodiment, the antigen binding fragment is selected from scFv, fv, fd, fab, F (ab ') 2 or F (ab').
In another preferred embodiment, the bispecific antibody comprises 1 or 2 heavy chains, and 1 or 2 light chains, which heavy and light chains may be homologous or heterologous.
In another preferred embodiment, the heavy chain comprises from amino-terminus to carboxy-terminus a structure selected from the group consisting of:
a)VHA-CH1-CH2-CH3-L1-scFvB;
b)scFvB-L1-VHA-CH1-CH2-CH3;
c)VHA-CH1-L1-scFvB-CH2-CH3;
d)VHA-CH1-L1-scFvB;
e)VHA-CH1;
f)VHA-CH1-L1-scFvB-L2-VHA-CH1;
g)VHA-CH1-L1-scFvB-L2-VLA-CL;
h)VLA-CL-L1-scFvB-L2-VHA-CH1;
i)VHA-CH1-L1-VHA-CH1-L2-scFvB;
j)VHA-CH1-CH2-CH3;
wherein VHA refers to VH bound to target molecule a, scFvB refers to scFv bound to target molecule B, and L1, L2 are each independently a bond or a linker.
In another preferred embodiment, the light chain comprises from amino-terminus to carboxy-terminus a structure selected from the group consisting of:
k)VLA-CL;
l)scFvB-L3-VLA-CL;
m)VLA-CL-L3-scFvB;
n)VLA-CL-L3-scFvB-L4-VLA-CL;
0)VLA-CL-L3-VLA-CL-L4-scFvB;
Where VLA refers to VL that binds to target molecule A, scFvB refers to scFv that binds to target molecule B, and L3 and L4 are each independently a bond or linker.
In another preferred embodiment, the anti-GUCY 2C/CD3 bispecific antibody comprises a structure selected from the group consisting of:
structure 1: comprising 2 heavy chains a) and 2 light chains k);
structure 2: comprising 2 heavy chains b) and 2 light chains k);
structure 3: comprising 1 heavy chain a), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain a) comprising a mortar structure modification and the CH3 region of heavy chain j) comprising a pestle structure modification;
structure 4: comprising 1 heavy chain a), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain a) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
and (5) a structure 5: comprising 1 heavy chain b), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain b) comprising a mortar structure modification and the CH3 region of heavy chain j) comprising a pestle structure modification;
structure 6: comprising 1 heavy chain b), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain b) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
structure 7: comprising 2 heavy chains c) and 2 light chains k);
structure 8: comprising 1 heavy chain c), 1 heavy chain j), and 2 light chains k), the CH3 region of heavy chain c) comprising a mortar structure modification, the CH3 region of heavy chain j) comprising a pestle structure modification;
Structure 9: comprising 1 heavy chain c), 1 heavy chain j), and 2 light chains k), the CH3 region of heavy chain c) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
structure 10: comprising 1 heavy chain d) and 1 light chain k);
structure 11: comprising 1 heavy chain e) and 1 light chain m);
structure 12: comprising 1 heavy chain f) and 2 light chains k);
structure 13: comprising 1 heavy chain g), 1 heavy chain e) and 1 light chain k);
structure 14: comprising 2 heavy chains e) and 1 light chain n);
structure 15: comprising 1 heavy chain h), 1 heavy chain e) and 1 light chain k);
structure 16: comprising 1 heavy chain i) and 2 light chains k);
structure 17: comprising 2 heavy chains e) and 1 light chain 0);
structure 18: comprising 2 heavy chains j) and 2 light chains l);
structure 19: comprising 2 heavy chains j) and 2 light chains m).
In another preferred embodiment, the anti-GUCY 2C antibody or antigen binding fragment thereof comprises a heavy chain complementarity determining region HCDR1-3 and a light chain complementarity determining region LCDR1-3, wherein:
a) The HCDR-1 amino acid sequence is shown in SEQ ID NO:3, the HCDR-2 amino acid sequence is shown as SEQ ID NO:4, the HCDR-3 amino acid sequence is shown as SEQ ID NO:5 is shown in the figure; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:8, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:9, the LCDR-3 amino acid sequence is shown as SEQ ID NO:10 is shown in the figure; or alternatively, the first and second heat exchangers may be,
b) The HCDR-1 amino acid sequence is shown in SEQ ID NO:13, the HCDR-2 amino acid sequence is shown in SEQ ID NO:14, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 15; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:18, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:19, the amino acid sequence of the LCDR-3 is shown as SEQ ID NO: shown at 20; or alternatively, the first and second heat exchangers may be,
c) The HCDR-1 amino acid sequence is shown in SEQ ID NO:23, the HCDR-2 amino acid sequence is shown in SEQ ID NO:24, the HCDR-3 amino acid sequence is shown in SEQ ID NO: shown at 25; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:28, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:29, the LCDR-3 amino acid sequence is shown in SEQ ID NO: shown at 30; or alternatively, the first and second heat exchangers may be,
d) The HCDR-1 amino acid sequence is shown in SEQ ID NO:33, the HCDR-2 amino acid sequence is shown in SEQ ID NO:34, the HCDR-3 amino acid sequence is shown in SEQ ID NO: indicated at 35; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:38, the LCDR-2 amino acid sequence is shown in SEQ ID NO:39, wherein the amino acid sequence of the LCDR-3 is shown in SEQ ID NO: shown at 40; or alternatively, the first and second heat exchangers may be,
e) The HCDR-1 amino acid sequence is shown in SEQ ID NO:43, the HCDR-2 amino acid sequence is shown in SEQ ID NO:44, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 45; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:48, the LCDR-2 amino acid sequence is shown in SEQ ID NO:49, said LCDR-3 amino acid sequence is as set forth in SEQ ID NO: shown at 50.
In another preferred embodiment, the anti-GUCY 2C antibody or antigen-binding fragment thereof comprises a heavy chain variable region VH or variant thereof and a light chain variable region VL or variant thereof, wherein:
a) The amino acid sequence of VH is shown in SEQ ID NO:51, the amino acid sequence of VL is shown in SEQ ID NO: indicated at 55; or alternatively, the first and second heat exchangers may be,
b) The amino acid sequence of VH is shown in SEQ ID NO:51, the amino acid sequence of VL is shown in SEQ ID NO: 57; or alternatively, the first and second heat exchangers may be,
c) The amino acid sequence of VH is shown in SEQ ID NO:53, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 55; or alternatively, the first and second heat exchangers may be,
d) The amino acid sequence of VH is shown in SEQ ID NO:53, the amino acid sequence of the VL is shown in SEQ ID NO: 57; or alternatively, the first and second heat exchangers may be,
e) The amino acid sequence of VH is shown in SEQ ID NO:59, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 63; or alternatively, the first and second heat exchangers may be,
f) The amino acid sequence of VH is shown in SEQ ID NO:61, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 63; or alternatively, the first and second heat exchangers may be,
g) The amino acid sequence of VH is shown in SEQ ID NO:61, the amino acid sequence of the VL is shown in SEQ ID NO:65, or, alternatively,
h) The amino acid sequence of VH is shown in SEQ ID NO:59, the amino acid sequence of the VL is shown in SEQ ID NO: shown at 65.
In another preferred embodiment, the VH region variant refers to a sequence identical to SEQ ID NO: 51. SEQ ID NO: 53. SEQ ID NO: 59. SEQ ID NO:61 having at least 90%, 95%, 98%, or 99% amino acid sequence homology; the VL region variant refers to a sequence that hybridizes with SEQ ID NO: 55. SEQ ID NO: 57. SEQ ID NO: 63. SEQ ID NO:65, variants having at least 90%, 95%, 98%, or 99% amino acid sequence homology.
In another preferred embodiment, the VH region or VL region comprises 1-10 amino acid mutations; more preferably, the mutation is a substitution mutation.
In another preferred embodiment, the antibody or antigen binding fragment thereof comprises a heavy chain constant region and a light chain constant region; preferably, the heavy chain constant region is selected from human IgG1, human IgG2, human IgG3 or human IgG4, and the light chain constant region is selected from human Kappa (Kappa) or human Lambda (Lambda).
In another preferred embodiment, the human IgG1 heavy chain constant region comprises the amino acid sequence as set forth in SEQ ID NO:109, said human kappa light chain constant region comprising the amino acid sequence as set forth in SEQ ID NO:110, and a sequence of amino acids shown in seq id no.
In another preferred embodiment, the heavy chain constant region and/or the light chain constant region comprises mutated amino acids; more preferably, the human IgG4 heavy chain constant region comprises an S228P mutation.
In another preferred embodiment, the antigen-binding fragment of the anti-CD 3 antibody is monovalent or bivalent; more preferably, the antigen binding fragment of the anti-CD 3 antibody is monovalent.
In another preferred embodiment, the antigen-binding fragment of the anti-CD 3 antibody is an scFv comprising a VH-L1-VL structure or a VL-L1-VH structure from the amino-terminus to the carboxy-terminus, wherein L1 is a bond or a linker.
In another preferred embodiment, the scFv comprises a heavy chain complementarity determining region HCDR1-3 and a light chain complementarity determining region LCDR1-3, wherein the HCDR-1 amino acid sequences are set forth in SEQ ID NO:101, the HCDR-2 amino acid sequence is shown in SEQ ID NO:102, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 103; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:104, the LCDR-2 amino acid sequence is shown as SEQ ID NO:105, the LCDR-3 amino acid sequence is shown as SEQ ID NO: shown at 106.
In another preferred embodiment, the scFv comprises a heavy chain variable region VH and a light chain variable region VL, wherein the VH has an amino acid sequence set forth in SEQ ID NO:107, the amino acid sequence of the VL is shown in SEQ ID NO: shown at 108.
In another preferred embodiment, the scFv comprises the amino acid sequence set forth in SEQ ID NO: 99.
In another preferred embodiment, L1, L2, L3, L4, L5 and L6 are each independently (G4S) n Wherein n is selected from integers from 1 to 6.
In another preferred embodiment, the target molecule a is GUCY2C and the target molecule B is CD3.
In another preferred embodiment, the anti-GUCY 2C/CD3 bispecific antibody is selected from the group consisting of:
SP4VHL-32H2 (structure 2): comprises an amino acid sequence shown in SEQ ID NO:67, and the amino acid sequence of which is set forth in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2-SP4VHL-L (Structure 19): comprises an amino acid sequence shown in SEQ ID NO:73, and the amino acid sequence of which is set forth in SEQ ID NO: 71; or alternatively, the first and second heat exchangers may be,
SP4VHL-32H2-L (Structure 18): comprises an amino acid sequence shown in SEQ ID NO:73, and the amino acid sequence of which is set forth in SEQ ID NO: 75. Or alternatively, the first and second heat exchangers may be,
32H2-SP4VHL (Structure 1): comprises an amino acid sequence shown in SEQ ID NO:77, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2Fab-SP4VHL (Structure 10): comprises an amino acid sequence shown in SEQ ID NO:79, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL (Structure 7): comprises an amino acid sequence shown in SEQ ID NO:81, and the amino acid sequence of which is shown in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
10D7CH1-SP4VHL (structure 7): comprises an amino acid sequence shown in SEQ ID NO:83, and the amino acid sequence of which is set forth in SEQ ID NO: 85; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL-KIH (Structure 9): comprises an amino acid sequence shown in SEQ ID NO:87, the amino acid sequence of which is shown in SEQ ID NO:89, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL-HIK (Structure 8): comprises an amino acid sequence shown in SEQ ID NO:91, and the amino acid sequence of the first heavy chain is shown as SEQ ID NO:93, and a second heavy chain having an amino acid sequence as set forth in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
a derivative polypeptide which is formed by substitution, deletion or addition of one or more amino acid residues in the amino acid sequence of any one of SP4VHL-32H2, 32H2-SP4VHL-L, SP4VHL-32H2-L, 32H2Fab-SP4VHL, 32H2CH1-SP4VHL, 10D7CH1-SP4VHL, 32H2CH1-SP4VHL-KIH, and 32H2CH1-SP4 VHL-hik.
In a second aspect of the invention there is provided a polynucleotide molecule encoding an anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention.
In a third aspect of the invention there is provided an expression vector comprising a polynucleotide molecule according to the second aspect of the invention.
In another preferred embodiment, the expression vector is a virus or a plasmid.
In another preferred embodiment, the expression vector is selected from the group consisting of: pcDNA3.4, pDR1, pcDNA3.1 (+), pcDNA3.1/ZEO (+), pDHFR, pTT5, pDHFF, pGM-CSF or pCHO 1.0.
In a fourth aspect of the invention there is provided a host cell comprising an expression vector according to the third aspect of the invention.
In another preferred embodiment, the host cell is selected from the group consisting of: COS, CHO, NS0, sf9, sf21, DH5 a, BL21 (DE 3), TG1, BL21 (DE 3), 293F or 293E cells.
In a fifth aspect of the present invention, there is provided a method for preparing an anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention, characterized in that the method comprises the steps of:
a) Culturing the host cell according to the fourth aspect of the invention under expression conditions, thereby expressing the anti-GUCY 2C/CD3 bispecific antibody;
b) Isolating and purifying the anti-GUCY 2C/CD3 bispecific antibody described in the step a).
In a sixth aspect of the invention, there is provided a pharmaceutical composition comprising an effective amount of an anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention and one or more pharmaceutically acceptable carriers.
In another preferred embodiment, the pharmaceutical composition is in unit dosage form.
In another preferred embodiment, the dosage form of the pharmaceutical composition comprises a gastrointestinal dosage form or a parenteral dosage form.
In another preferred embodiment, the parenteral administration comprises intravitreal injection, intravenous drip, subcutaneous injection, local injection, intramuscular injection, intratumoral injection, intraperitoneal injection, intracranial injection, or intracavity injection.
In a seventh aspect of the invention there is provided the use of an anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention, or a pharmaceutical composition according to the sixth aspect of the invention, in the manufacture of a medicament for the treatment of cancer.
In another preferred embodiment, the cancer is a GUCY 2C-related cancer; more preferably, the cancer is a cancer of GUCY2C abnormal expression.
In another preferred embodiment, the cancer is a tumor of the gastrointestinal tract or pancreatic cancer; more preferably, the gastrointestinal tumor is selected from the group consisting of rectal cancer, colon cancer, small intestine cancer, stomach cancer, esophageal cancer, and gastro-esophageal junction cancer; even more preferably, the gastrointestinal tumor is a malignant tumor.
In an eighth aspect of the invention, there is provided a method of treating cancer, the method comprising administering to a subject in need thereof an anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention, or a pharmaceutical composition according to the sixth aspect of the invention.
In another preferred embodiment, the cancer is a GUCY 2C-related cancer; more preferably, the cancer is a cancer of GUCY2C abnormal expression.
In another preferred embodiment, the cancer is a tumor of the gastrointestinal tract or pancreatic cancer; more preferably, the gastrointestinal tumor is selected from the group consisting of rectal cancer, colon cancer, small intestine cancer, stomach cancer, esophageal cancer, and gastro-esophageal junction cancer; even more preferably, the gastrointestinal tumor is a malignant tumor.
In a ninth aspect of the invention, there is provided an immunoconjugate comprising:
a) An anti-GUCY 2C/CD3 bispecific antibody according to the first aspect of the invention; and b) a coupling moiety selected from the group consisting of: a detectable label, drug, toxin, cytokine, radionuclide, or enzyme.
In another preferred embodiment, the conjugate moiety is selected from the group consisting of: fluorescent or luminescent markers, radioactive markers, MRI (magnetic resonance imaging) or CT (electronic computed tomography) contrast agents, or enzymes, radionuclides, biotoxins, cytokines capable of producing detectable products.
In another preferred embodiment, the immunoconjugate comprises an antibody-drug conjugate (ADC).
In another preferred embodiment, the immunoconjugate is used for the preparation of a pharmaceutical composition for the treatment of cancer.
In a tenth aspect of the invention there is provided a method of treating cancer, the method comprising administering to a subject in need thereof an immunoconjugate according to the ninth aspect of the invention.
It should be noted that the embodiments or the preferred examples in the summary of the invention are given by way of example, and are not intended to limit the invention. Meanwhile, the structural scheme formed by different designs or selections of positions and numbers of functional element parts (such as an anti-CD 3 antibody or an antigen binding fragment thereof, an anti-GUCY 2C antibody or an antigen binding fragment thereof) based on the anti-GUCY 2C/CD3 bispecific antibody under the invention conception is also within the scope of the invention. For example, the antigen-binding fragment of an anti-CD 3 antibody may be monovalent, bivalent, or multivalent, depending on the desired properties of the bispecific antibody (e.g., affinity, toxicity, anti-tumor activity). The position of the antigen-binding fragment of an anti-CD 3 antibody may be at the N-terminal, C-terminal or intermediate positions of the polypeptide chain (not at the N-terminal or C-terminal ends of the polypeptide chain).
It is understood that within the scope of the present invention, the above-described technical features of the present invention and technical features specifically described below (e.g., in the examples) may be combined with each other to constitute new or preferred technical solutions. And are limited to a space, and are not described in detail herein.
Drawings
FIG. 1 detection of the binding Capacity of murine antibodies to human GUCY2C-Fc protein
FIG. 2 detection of binding Capacity of murine antibody to human colon cancer cell CW2
FIG. 3 detection of binding Activity of murine antibody to human GUCY2C-His protein
FIG. 4 detection of binding Activity of murine antibody to human GUCY2C-His protein
FIG. 5 detection of binding Activity of chimeric antibodies to human GUCY2C-His protein
FIG. 6 detection of binding Activity of humanized antibodies to human GUCY2C-His protein
FIG. 7 detection of binding Activity of humanized antibodies to human GUCY2C-Fc protein
FIG. 8 detection of binding Activity of humanized antibodies against human GUCY2C overexpressing tumor cells
FIG. 9 bispecific antibody constructs (note: anti-GCC is an antibody to 32H2-Hu or other Anti-GUCY 2C, anti-CD3 is an antibody to SP34 or other Anti-CD3 epsilon monomer or CD3 delta/CD 3 epsilon heterodimer; linker is the Linker sequence (G4S) n, which may also be other polypeptide sequences with flexibility, n may be 3, 4, 5 or 6.): wherein fig. 9A represents structure 1, fig. 9B represents structure 2, fig. 9C represents structure 3, fig. 9D represents structure 4, fig. 9E represents structure 5, fig. 9F represents structure 6, fig. 9G represents structure 7, fig. 9H represents structure 8, fig. 9I represents structure 9, fig. 9J represents structure 10, fig. 9K represents structure 11, fig. 9L represents structure 12, fig. 9M represents structure 13, fig. 9N represents structure 14, fig. 9O represents structure 15, fig. 9P represents structure 16, fig. 9Q represents structure 17, fig. 9R represents structure 18, and fig. 9S represents structure 19.
FIG. 10 binding biopsies of anti-GUCY 2C and CD3 bispecific antibodies to GUCY2C-His proteins
FIG. 11 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-His protein
FIG. 12 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-Fc protein
FIG. 13 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-Fc protein
FIG. 14 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 ε protein
FIG. 15 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 ε protein
FIG. 16 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 ε delta heterodimer protein
FIG. 17 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 ε delta heterodimer protein
FIG. 18 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies of the Knob-into-hole structure against GUCY2C-His protein
FIG. 19 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies of Knob-into-hole construction against GUCY2C-Fc protein
FIG. 20 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies of the Knob-into-hole structure on CD3 ε protein
FIG. 21 detection of binding Activity of anti-GUCY 2C and CD3 bispecific antibodies of the Knob-into-hole structure on CD3 ε delta heterodimer protein
FIG. 22 detection of the killing Activity of anti-GUCY 2C and CD3 bispecific antibody-mediated T cells against human colon cancer CW2 cells
FIG. 23 detection of the killing Activity of human colon cancer T84 cells by anti-GUCY 2C and CD3 bispecific antibody-mediated T cells of Knob-into-hole structure
Detailed Description
The present inventors have made extensive and intensive studies to obtain novel bispecific antibodies targeting both GUCY2C and CD3 in various structures. Experimental results show that the anti-GUCY 2C/CD3 bispecific antibody can mediate T cells to kill GUCY2C over-expression tumor cells specifically. The present invention has been completed on the basis of this finding.
Experimental materials
Mouse myeloma cells SP2/0: purchased from ATCC under the designation CRL-1581.
Balb/c mice: purchased from Shanghai Ling Biotechnology Co.
HRP-goat anti-mouse secondary antibody: purchased from Boolon immunotechnology Co., ltd., product number BF03001-1ML.
Donkey anti-mouse PE fluorescent secondary antibody: purchased from Jackson under the trade designation 715-116-150.
Sheep anti-human PE fluorescent secondary antibody: purchased from Jackson under the trade designation 109-115-098.
HRP-goat anti-human IgG Fab secondary antibody: purchased from Sigma, cat# A0293-1ML.
HRP-goat anti-human IgG Fc secondary: purchased from Sigma under the designation A0170-1ML.
96-well plates (elisa plate is not detachable): purchased from costar, cat No. 9018.
PBS buffer: purchased from Shanghai source culture Biotechnology Co., ltd., product number B320KJ.
TMB: purchased from KPL company under the number 52-00-03.
Bovine Serum Albumin (BSA): purchased from a manufacturer under the accession number a600332-0100.
RPMI 1640Medium: purchased from Gibco company under the accession number 61870127.
Penicillin-streptomycin (Penicillin-streptomycin): purchased from Gibco company under the accession number 15140122.
Fetal Bovine Serum (FBS): purchased from Gibco company under the accession number 10091-148.
polyethylene glycol solution: purchased from sigma company under the product number P7181.
hybrid-SFM: purchased from life technologies, cat No. 12045-076.
HAT: purchased from Gibco under the accession number 21060017.
pcDNA 3.4: available from thermo fisher under accession number a14697.
HEK-293F: purchased from Thermo Fisher under accession number a14527.
Terminology
Antibodies to
In the present invention, the term "antibody" refers to a full length antibody, and the term "antigen binding fragment" refers to a fragment derived from an antibody that is capable of binding an epitope of an antigen, including, but not limited to scFv, fv, fd, fab, F (ab ') 2 or F (ab').
In the present invention, the term "full length antibody" refers to an iso-tetralin protein of about 150000 daltons having the same structural characteristics, comprising a variable region and a constant region, consisting of two identical Heavy Chains (HC) and two identical Light Chains (LC). Each heavy chain has a heavy chain variable region (VH) at one end followed by a heavy chain constant region consisting of three domains, CH1, CH2, and CH 3. One end of each light chain has a light chain variable region (VL) and the other end has a light chain constant region comprising a domain CL; the light chain constant region pairs with the CH1 domain of the heavy chain constant region and the light chain variable region pairs with the heavy chain variable region. The constant regions are not directly involved in binding of antibodies to antigens, but they exhibit different effector functions, such as participation in antibody-dependent cell-mediated cytotoxicity (ADCC, anti-independent cell-mediated cytotoxicity), and the like. Heavy chain constant regions include the IgG1, igG2, igG3, igG4 subtypes; the light chain constant region includes Kappa (Kappa) or Lambda (Lambda). The heavy and light chains of an antibody are covalently linked together by disulfide bonds between the CH1 domain of the heavy chain and the CL domain of the light chain, and the two heavy chains of an antibody are covalently linked together by inter-polypeptide disulfide bonds formed between the hinge regions.
In the present invention, the term "variable" means that some portion of the variable region in an antibody differs in sequence, which results in the binding and specificity of each particular antibody for its particular antigen. However, the variability is not evenly distributed throughout the antibody variable region. It is concentrated in three fragments in the heavy and light chain variable regions, known as complementarity-determining region (CDR) or hypervariable regions. The more conserved parts of the variable region are called the Framework Regions (FR). The variable regions of the natural heavy and light chains each comprise four FR regions, which are generally in a β -sheet configuration, connected by three CDRs forming the connecting loops, which in some cases may form part of the β -sheet structure. The CDRs in each chain are held closely together by the FR regions and together with the CDRs of the other chain form the antigen binding site of the antibody (see Kabat et al, NIH publication No.91-3242, vol. I, pp. 647-669 (1991)). CDRs of the heavy chain variable region (VH) and the light chain variable region (VL) are referred to as HCDR and LCDR, respectively.
In the present invention, the term "humanized antibody" refers to an antibody in which the Complementarity Determining Regions (CDRs) of the antibody are derived from a non-human species (e.g., rodent) and the remainder of the antibody molecule, including the framework regions FR and constant regions, are derived from human. Wherein the FR residues of the framework region can be altered to maintain binding affinity. In the present invention, the term "chimeric antibody" refers to an antibody in which the variable region is derived from a non-human species (e.g., rodent) and the constant region is derived from a human.
In the present invention, the term "framework region" (FR) refers to a portion of an antibody that has relatively little change in amino acid composition and arrangement sequence outside of the hypervariable region. The light and heavy chains of an antibody each have four FRs, designated FR1-L, FR2-L, FR3-L, FR-L and FR1-H, FR2-H, FR3-H, FR-H, respectively. Preferably, the FR of the invention is a human antibody FR or a derivative thereof which is substantially identical to a naturally occurring human antibody FR, i.e. has a sequence homology of at least 85%, 90%, 95%, 96%, 97%, 98% or 99%. After knowing the amino acid sequence of the CDRs, one skilled in the art can determine the framework regions FR1-L, FR2-L, FR3-L, FR4-L and/or FR1-H, FR2-H, FR3-H, FR-H sequences.
In the present invention, the term "monoclonal antibody" refers to an antibody obtained from a substantially homogeneous population, the individual antibodies contained in the population being identical except for a few natural mutations that may be present. Monoclonal antibodies are directed against a single determinant on an antigen, with high specificity for a single antigenic site. Monoclonal antibodies can be synthesized by hybridoma culture without contamination by other immunoglobulins.
In the present invention, the term "scFv" refers to a single chain of a polypeptide formed by joining a VH region and a VL region via L. The scFv may comprise a VH-L-VL or VL-L-VH structure.
In the present invention, the terms "anti" and "binding" refer to a non-random binding reaction between two molecules, such as a reaction between an antibody and an antigen against which it is directed. Typically, the antibody is present at less than about 10 -7 M, e.g. less than about 10 -8 M、10 -9 M、10 -10 M、10 -11 An equilibrium dissociation constant (KD) of M or less binds to the antigen. The term "KD" refers to the equilibrium dissociation constant of a particular antibody-antigen interaction, which is used to describe the binding affinity between an antibody and an antigen. The smaller the equilibrium dissociation constant, the tighter the antibody-antigen binding, and the higher the affinity between the antibody and antigen. For example, the binding affinity of an antibody to an antigen is determined in a BIACORE instrument using surface plasmon resonance (Surface Plasmon Resonance, abbreviated SPR) or the relative affinity of an antibody to antigen binding is determined using ELISA.
In the present invention, the term "L" (e.g., L1, L2, L3, L4, L5, L6) may independently represent the absence, bond or linker in different structures. The "bond" (e.g., peptide bond) or "linker" is used to link 2 polypeptide chains. Suitable linkers may be polypeptide sequences having flexibility, examples of which include mono glycine (Gly), or serine (Ser) residues, the identity and sequence of the amino acid residues in the linker may vary with the type of secondary structural element that needs to be achieved in the linker. In the present invention, the linker is (G4S) n Preferably n is 2, 3, 4, 5 or 6. In the summary or embodiments of the present invention, when describing the structure of an antibody, different L (e.g., L1, L2, L3, L4, L5, L6) are used only to schematically represent L at different attachment positions in the structure, and there is no limitation on the structure of L. Different L in the same structure or L in different structures may be the same or different.
In the present invention, the term "anti-GUCY 2C/CD3 bispecific antibody" is a bispecific antibody that binds both the target molecules GUCY2C and CD 3. In some embodiments, to distinguish polypeptide chains of a bispecific antibody, a polypeptide single chain comprising a VHA-CH1 structure is defined as the heavy chain of an anti-GUCY 2C/CD3 bispecific antibody, and a polypeptide single chain not comprising a VHA-CH1 structure is defined as the light chain of an anti-GUCY 2C/CD3 bispecific antibody.
In the present invention, the term "knob-into-hole modification" means that the Fc domain of an antibody comprises amino acid mutations that result in the formation of a knob-into-hole pairing between two Fc polypeptide monomers. For example, amino acids at appropriate sites in the CH3 domain in one Fc polypeptide monomer are substituted with relatively large amino acid residues to form a pestle structure; the amino acid in the corresponding position in the CH3 domain in the other Fc polypeptide monomer is substituted with a relatively small amino acid residue to form a mortar structure.
The bispecific antibodies of the invention can be modified, e.g., by adding, deleting and/or substituting one or more amino acid residues, by techniques well known in the art to further improve or optimize the properties (e.g., affinity) of the bispecific antibodies and to obtain modified results by conventional assay methods.
In the present invention, bispecific antibodies of the invention also include conservative variants thereof, meaning that up to 10, preferably up to 7, more preferably up to 5, and most preferably up to 3 amino acids are replaced by amino acids of similar or similar nature to the amino acid sequence of a bispecific antibody of the invention to form a polypeptide. These conservatively variant polypeptides are preferably generated by amino acid substitutions according to Table A.
Table A
The bispecific antibodies of the invention may be used alone, or in combination or coupling with a detectable label (for diagnostic purposes), a therapeutic agent, or a combination of any of the above.
The amino acid sequence or the protein (polypeptide) structure in the present invention is given in the order from amino-terminus to carboxyl-terminus.
In the present invention, "-" in the structure of a protein (polypeptide) represents a peptide bond.
Coding nucleic acids and expression vectors
The invention also provides polynucleotide molecules encoding the bispecific antibodies described above. The polynucleotides of the invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA, or synthetic DNA. The DNA may be single-stranded or double-stranded. The DNA may be a coding strand or a non-coding strand. In the present invention, the term "expression vector" refers to a vector, such as a plasmid, viral vector (e.g., adenovirus, retrovirus), phage, yeast plasmid, or other vector, carrying an expression cassette for expression of a particular protein of interest or other substance. Such as conventional expression vectors in the art including suitable regulatory sequences, e.g., promoters, terminators, enhancers, and the like, including, but not limited to: viral vectors (e.g., adenovirus, retrovirus), plasmids, phages, yeast plasmids or other vectors. The expression vector preferably comprises pDR1, pcDNA3.4 (+), pDHFR or pTT5.
Once the relevant sequences are obtained, recombinant methods can be used to obtain the relevant sequences in large quantities. This is usually done by cloning it into a vector, transferring it into a cell, and isolating the relevant sequence from the propagated host cell by conventional methods.
The invention also relates to vectors comprising the above-described suitable DNA sequences and suitable promoter or control sequences. These vectors may be used to transform an appropriate host cell to enable expression of the protein.
In the present invention, the term "host cell" is a variety of host cells conventional in the art, as long as the vector is stably self-replicating and the polynucleotide molecule carried can be efficiently expressed. Wherein the host cell comprises a prokaryotic expression cell and a eukaryotic expression cell, preferably the host cell comprises: COS, CHO, NS0, sf9, sf21, DH5 a, BL21 (DE 3), TG1, BL21 (DE 3), 293F or 293E cells.
Pharmaceutical composition and application
The invention also provides a composition. Preferably, the composition is a pharmaceutical composition comprising a bispecific antibody as described above, and a pharmaceutically acceptable carrier. Typically, these materials are formulated in a nontoxic, inert and pharmaceutically acceptable aqueous carrier medium, wherein the pH is typically about 4 to 8, preferably about 5 to 8,5 to 7, or 6 to 8, although the pH may vary depending on the nature of the material being formulated and the condition being treated. The formulated pharmaceutical compositions may be administered by conventional routes including, but not limited to: intravenous injection, intravenous drip, subcutaneous injection, local injection, intramuscular injection, intratumoral injection, intraperitoneal injection (e.g., intraperitoneal), intracranial injection, or intracavity injection.
In the present invention, the term "pharmaceutical composition" means that the bispecific antibody of the present invention can be combined with a pharmaceutically acceptable carrier to form a pharmaceutical formulation composition to exert therapeutic effects more stably, which formulations can ensure the conformational integrity of the amino acid core sequence of the proteins disclosed herein, while also protecting the multifunctional groups of the proteins from degradation (including, but not limited to, aggregation, deamination or oxidation).
The pharmaceutical compositions of the invention comprise a safe and effective amount (e.g., 0.001-99wt%, preferably 0.01-90wt%, more preferably 0.1-80 wt%) of the bispecific antibody of the invention as described above, together with a pharmaceutically acceptable carrier. Such vectors include (but are not limited to): saline, buffer, glucose, water, glycerol, ethanol, and combinations thereof. The pharmaceutical formulation should be compatible with the mode of administration. The pharmaceutical compositions of the invention may be formulated as injectables, e.g. by conventional means using physiological saline or aqueous solutions containing glucose and other adjuvants. The pharmaceutical compositions, such as injections, solutions are preferably manufactured under sterile conditions. The amount of active ingredient administered is a therapeutically effective amount, for example, from about 10 micrograms per kilogram of body weight to about 50 milligrams per kilogram of body weight per day. In addition, bispecific antibodies of the invention can also be used with other therapeutic agents.
When a pharmaceutical composition is used, a safe and effective amount of the bispecific antibody is administered to a mammal, wherein the safe and effective amount is typically at least about 10 micrograms per kilogram of body weight and in most cases no more than about 50 milligrams per kilogram of body weight, preferably the dose is from about 10 micrograms per kilogram of body weight to about 10 milligrams per kilogram of body weight. Of course, the particular dosage should also take into account factors such as the route of administration, the health of the patient, etc., which are within the skill of the skilled practitioner.
In the present invention, the term "effective amount" refers to an amount or dose that produces a desired effect in a treated individual, including an improvement in the condition of the individual, following administration of the pharmaceutical composition of the present invention to a subject. The term "subject" includes, but is not limited to, mammals, such as humans, non-human primates, rats, mice, and the like.
The invention has the main advantages that: the anti-GUCY 2C/CD3 bispecific antibody has excellent targeting property, specificity and remarkable synergic anti-tumor activity, and has important clinical significance for tumor treatment. The concrete steps are as follows:
(1) The anti-GUCY 2C/CD3 bispecific antibody can simultaneously bind to GUCY2C protein on the surface of tumor cells and CD3 protein on the surface of T cells.
(2) The anti-GUCY 2C/CD3 bispecific antibody can recruit and activate T cell groups to specifically target GUCY2C over-expressed tumor cells, thereby achieving the purpose of specifically killing the GUCY2C over-expressed tumor cells.
(3) The anti-GUCY 2C/CD3 bispecific antibodies of the invention do not require binding to MHC class I complex antigen peptides via a T Cell Receptor (TCR) to activate T cells, but rather form immune synapse-activating T cells by recruiting T cells to directly target GUCY 2C-expressing tumor cells.
(4) The anti-GUCY 2C/CD3 bispecific antibody has remarkable synergistic anti-tumor activity, and is superior to the single use or the combined use of any one monoclonal antibody of GUCY2C or CD 3.
The invention is further illustrated below in conjunction with specific embodiments. It is to be understood that these examples are illustrative of the present invention and are not intended to limit the scope of the present invention. The experimental procedure, in which the detailed conditions are not noted in the following examples, is generally followed by routine conditions such as Sambrook et al, molecular cloning: conditions described in the laboratory Manual (New York: cold Spring Harbor Laboratory Press, 1989) or as recommended by the manufacturer.
The sequence information related to the following examples is summarized in the sequence table B.
Table B
The sequence of the invention adopts the numbering rule of the Kabat system.
EXAMPLE 1 preparation and screening of antigen-immunized animals and hybridomas
1.1 antigen expression
The extracellular region gene of GUCY2C (sequence from UniProt, accession number P25092) was constructed into a pcDNA 3.4 expression vector by a conventional gene synthesis and molecular cloning method, and a signal peptide sequence was added to the N-terminal thereof, and a 6 XHis tag was added to the C-terminal thereof, HEK-293F cells were transfected, and after 5d of expression, cell culture supernatants were collected and purified to obtain GUCY2C-His protein. Similarly, HEK-293F cells are transfected after the 6 XHis tag is replaced by the Fc sequence of human IgG1, and GUCY2C-Fc protein is obtained after expression and purification.
1.2 immunization of mice with antigen
Balb/C mice were routinely immunized with GUCY2C-His protein. Balb/C mice were subcutaneously injected at multiple points (GUCY 2C-His protein, 100. Mu.g/mouse/0.5 mL) after emulsification of soluble human GUCY2C-His protein with Freund's complete adjuvant, at day 14, after emulsification of soluble human GUCY2C-His protein with Freund's incomplete adjuvant, injected subcutaneously (GUCY 2C-His protein, 50. Mu.g/mouse/0.5 mL) after emulsification of soluble human GUCY2C-His protein with Freund's incomplete adjuvant, at day 28, injected subcutaneously (GUCY 2C-His protein, 50. Mu.g/mouse/0.5 mL) after three weeks, stimulated by intraperitoneal injection, and after 3-4 days, spleens were taken for fusion experiments.
1.3 preparation and screening of hybridomas
The spleen cells of the mice were PEG-fused with myeloma cells SP2/0 of the mice 3-4 days after the last immunization of the mice using conventional hybridoma protocols. The fused cells were suspended uniformly in complete medium, which was composed of RPMI1640-GLUMAX added to 1% penicillin-streptomycin and 20% fbs,1 x hat. The fused cells were cultured at a rate of 3X 10 4 A total of 60 96-well plates were plated at 37℃at 200. Mu.L/well, CO 2 Culturing in an incubator. After 7-12 days, the supernatant was harvested and hybridoma wells positive for human GUCY2C binding activity were screened by ELISA.
Wherein, ELISA method screens hybridoma holes positive for human GUCY2C binding activity as follows: GUCY2C-Fc was diluted to 1. Mu.g/mL in PBS buffer, 100. Mu.L/well was added to ELISA plates and incubated overnight at 4 ℃. The supernatant was thrown off the next day, then 5% skim milk powder in PBS was added and blocked at 37℃for 2h, and the plates were washed 3 times with PBST for use. The collected hybridoma supernatants were sequentially added to the blocked plates at 100. Mu.L/well and left at 37℃for 1 hour. PBST washing the plate for 3 times, adding HRP-marked goat anti-mouse IgG secondary antibody, and standing at 37 ℃ for 30min; after PBST washing the plate for 3 times, the residual liquid drops are beaten as much as possible on the absorbent paper, 100 mu L of TMB is added into each hole, and the color development is carried out at room temperature and in a dark place for 5min; 50 mu L of 2M H are added to each well 2 SO 4 The stopping solution stops the substrate reaction, the OD value is read at the 450nm position of the multifunctional enzyme-labeled instrument, and the binding capacity of the antibody to be detected and the target antigen GUCY2C is analyzed. A total of 18 hybridoma cell lines were obtained by fusion screening. The above-mentioned materials are mixedAmplifying and screening the obtained 18 Hybridoma cell strains in a serum-containing complete culture medium, centrifuging and changing the solution to a serum-free culture medium hybrid oma-SFM culture medium to ensure that the cell density is 1-2 multiplied by 10 7 Per mL, at 8% CO 2 Culturing for 1 week at 37 ℃, centrifuging to obtain a culture supernatant, and purifying by Protein G affinity chromatography to obtain the anti-human GUCY2C monoclonal antibody Protein.
EXAMPLE 2 binding ability of murine antibody to human GUCY2C-Fc protein
Indirect enzyme-linked immunosorbent assay (ELISA) the binding capacity of the murine antibody obtained in example 1 to human GUCY2C-Fc protein was determined. The specific method comprises the following steps:
GUCY2C-Fc protein was diluted to 1. Mu.g/mL in coating solution (50 mM carbonate coating buffer, pH 9.6) to coat ELISA plate, 4℃overnight; then, 5% of skimmed milk powder is prepared by PBS for sealing, and the mixture is incubated for 2 hours at 37 ℃; after 3 washes of PBST, 1% BSA buffer in PBS of the anti-human GUCY2C antibody protein prepared above was diluted from 10. Mu.g/mL to 11 gradients in 3-fold gradient, and 100. Mu.L/well was added to the ELSIA plate of pre-coated GUCY2C-Fc, and incubated for 1h at 37 ℃; PBST washing the plate for 3 times, adding HRP-marked goat anti-mouse IgG secondary antibody, and standing at 37 ℃ for 30min; after PBST washing the plate 3 times, the residual liquid drop is beaten as much as possible on the absorbent paper, 100 mu L of TMB is added into each hole, the color development is carried out for 5min at room temperature and in the dark, 50 mu L of 2M H is added into each hole 2 SO 4 The termination solution stops the substrate reaction, the OD value is read at 450nm of the multifunctional enzyme-labeled instrument, and the binding capacity of the antibody to be tested and the target antigen human GUCY2C-Fc is analyzed, and the result is shown in figure 1. EC binding of each hybridoma antibody to GUCY2C-Fc 50 As shown in table 1. As can be seen, 32H2G1A1 has the strongest affinity for GUCY 2C-Fc.
Table 1: EC binding of each hybridoma antibody to GUCY2C-Fc 50
Sample of | EC 50 (ng/mL) | Sample of | EC 50 (ng/mL) |
45E8F6B11 | 1.659 | 37A11E5 | 31.97 |
31C62F11 | 8.469 | 48H7F8 | 76.4 |
55G7A7H2 | 21.55 | 50C9F2B8 | 198.1 |
60C8D2C1 | 28.88 | 19D9B9H3 | 899.3 |
43D11B1D2 | 18.03 | 32H2G1A1 | 7.089 |
4G12A4G6 | 13.37 | 49F2B8B8 | 63.05 |
8C7A4 | 2837 | 206E8E6 | 14.03 |
10D7A7 | 30.72 | 208D1B7 | 11.85 |
19B2F2H4 | 7833988 | 21E12B4 | 62.03 |
EXAMPLE 3 binding ability of murine antibody to human colon cancer cell CW2
The binding activity of the murine antibody to the human colon cancer cell CW2 is detected by flow cytometry, and the specific method is as follows:
collecting CW2 cells, centrifuging to remove cell culture solution, and washing with PBS buffer solution for 2 times; count and dilute to 2X 10 with FACS buffer containing 1% BSA 6 Spreading the cells to a 96-well round bottom plate for later use, wherein the cell is 100 mu L/well; the antibody to be tested is added into the cell round bottom plate from 10 mug/mL of 1% BSA buffer solution prepared by PBS according to 11 gradients of 3-fold dilution, and incubated for 1h at 4 ℃; after centrifugation, the supernatant was discarded, washed 3 times with FACS buffer of 1% BSA, 100. Mu.L of donkey anti-mouse PE fluorescent secondary antibody was added per well at a ratio of 1:500 (see fluorescent secondary antibody specification for details), and incubated at 4℃for 1h; the analytical samples were assayed using BD FACSCelesta washed 3 times with FACS buffer of 1% BSA, resuspended with FACS buffer of 1% BSA, 200. Mu.L/well, and the antibody binding cell results are shown in FIG. 2. EC of individual samples on CW2 cell binding 50 See table 2. It can be seen that 10D7A7 has a slightly stronger binding activity to CW2 cells.
Table 2: EC of binding of each hybridoma antibody to CW2 50
Sample of | EC 50 (ng/mL) | Sample of | EC 50 (ng/mL) |
45E8F6B11 | 574.2 | 37A11E5 | 8089 |
31C62F11 | 417.3 | 48H7F8 | 3985 |
55G7A7H2 | 6273 | 50C9F2B8 | 77.95 |
60C8D2C1 | 945.5 | 19D9B9H3 | 1127 |
43D11B1D2 | 364.7 | 32H2G1A1 | 9604 |
4G12A4G6 | 123 | 49F2B8B8 | 11656 |
8C7A4 | 7469 | 206E8E6 | 680.3 |
10D7A7 | 137.2 | 208D1B7 | 635.2 |
19B2F2H4 | 1490 | 21E12B4 | 3421 |
EXAMPLE 4 binding Activity of murine antibody against human GUCY2C-His protein
GUCY2C-His protein was diluted to 0.1. Mu.g/mL with ELISA coating solution, ELISA plates were coated at 100. Mu.L/well, and the coated overnight at 4 ℃. The ELISA plate was washed three times with PBST, unbound antigen was removed, and the ELISA plate was blotted dry on absorbent paper, excess liquid was removed, then 2% BSA in PBS, 200. Mu.L/well, and blocked at room temperature for 2h. Washing with PBST three times, washing out excess blocking solution, drying ELISA plate, removing excess liquid, diluting each monoclonal antibody with 1% BSA prepared by PBST according to 3-time gradient, respectively, diluting with the highest concentration of 200nM, 12 gradients, adding ELISA hole, 100 μl/hole, incubating at room temperature for 1h, and making 2 multiplex holes in parallel for each sample. Unbound or non-specifically bound primary antibody was washed off, HRP-labeled secondary antibody was diluted to appropriate concentration (1:3000 dilution) with antibody dilution, ELISA plate was added, 100 μl/well, and incubated for 1h at room temperature. Wash three times with PBST and beat ELISA plate on absorbent paperDrying, removing excessive liquid, adding TMB color development liquid, developing for 5min at 100 μl/well, adding 2M H per well 2 SO 4 50. Mu.L/well to terminate the color development and its absorbance was measured at a wavelength of 450nm in a multifunctional microplate reader, and the data was analyzed.
As shown in FIGS. 3 and 4, 55G7A7H2 and 32H2G1A1 have higher affinity for GUCY2C-His, 43D11B1D2, 10D7A7 and 37A11E5 times, each sample has an EC for GUCY2C-His binding 50 The values are shown in tables 3 and 4.
Table 3: EC binding of each hybridoma antibody to GUCY2C-His 50
Sample of | EC 50 (nM) |
43D11B1D2 | 0.99 |
45E8F6B11 | 17.89 |
31C6-2F11 | 9.07 |
4G12A4G6 | 5.83 |
60C8D2C1 | \ |
55G7A7H2 | 0.12 |
Table 4: EC binding of each hybridoma antibody to GUCY2C-His 50
Sample of | EC 50 (nM) |
208D1B7 | \ |
206E8E6 | \ |
4G12A4G6 | 13.56 |
32H2G1A1 | 0.26 |
37A11E5 | 0.81 |
10D7A7 | 0.80 |
EXAMPLE 5 hybridoma antibody variable region Gene acquisition and chimeric antibody preparation
The heavy chain variable region and the light chain variable region of hybridomas 55G7A7H2, 32H2G1A1, 43D11B1D2, 10D7A7, 37A11E5 were obtained by a molecular biological-related method, and chimeric antibodies were further constructed.
The RNA of five hybridoma cells 55G7A7H2, 32H2G1A1, 43D11B1D2, 10D7A7, 37A11E5 was extracted by Trizol and mRNA was reverse transcribed to obtain cDNA, which was then used as a template for PCR with the heavy and light chain degenerate primers of murine antibodies (see Antibody Engineering, volume 1,Edited by Roland Kontermann and Stefan D ubel, sequence of the composite primer from page 323), the obtained PCR products were sequenced and analyzed by IMGT database to determine that the obtained sequence was the variable region sequence of murine antibody. The relevant sequence information is summarized in the sequence listing.
The obtained heavy chain variable region sequence of each hybridoma was spliced with a human IgG1 constant region, the light chain variable region sequence was spliced with a human kappa chain constant region, heavy chain and light chain to pcDNA3.4 expression vectors of each chimeric antibody were constructed, HEK-293F cells were transfected for expression and purified to obtain each chimeric antibody, which was designated 55G7-ch, 32H2-ch, 43D11-ch, 10D7-ch, 37A11-ch, respectively.
EXAMPLE 6 binding Activity of chimeric antibodies against human GUCY2C-His protein
The binding capacity of chimeric antibodies to GUCY2C-His was determined by the method of example 4, in which the secondary antibody was an HRP-labeled anti-human IgG Fc secondary antibody, and the experimental results are shown in FIG. 5, in which 32H2-ch, 10D7-ch and 43D11-ch were all capable of binding to GUCY2C-His, indicating that we have been hooked to the correct variable region gene, and that the affinity of 32H2-ch and 10D7-ch for GUCY2C-His was significantly better than that of 43D11-ch, EC 50 0.141nM, 0.113nM and 5.187nM, respectively.
EXAMPLE 7 construction and preparation of humanized antibodies
The amino acid sequences of the light chain variable region and the heavy chain variable region of each candidate murine antibody were analyzed, and the antigen Complementarity Determining Regions (CDRs) and 4 Framework Regions (FRs) of the 32H2G1A1 and 10D7A7 of the murine antibody were determined according to the Kabat rule.
The humanized template that matched best to the non-FR regions of each murine antibody described above was selected from the gemline database. Then the CDR region of the murine antibody is transplanted onto a selected humanized template, the CDR region of the humanized template is replaced, the heavy chain variable region is recombined with the human IgG1 constant region, the light chain variable region is recombined with the human kappa chain constant region, meanwhile, based on the three-dimensional structure of the antibody, the embedded residues, the residues directly interacted with the CDR region and the residues which have important influence on the conformation of VL and VH of each antibody are subjected to back mutation, and finally a plurality of humanized antibodies are obtained, the heavy chain and light chain-to-pcDNA3.4 expression vectors of each humanized antibody are respectively constructed, and HEK-293F cells are transfected for expression and purification to obtain each humanized antibody.
Table 5: sequence listing of variable regions of each humanized antibody
EXAMPLE 8 binding Activity of humanized antibodies to human GUCY2C-His protein
The binding capacity of each humanized antibody to GUCY2C-His was determined by the method of example 6, and the experimental results are shown in FIG. 6, in which humanized antibodies 32H2-Hu and 32H2-HuG have a higher affinity for GUCY2C-His, and 10D7-Hu and 10D7-HuG have the respective ECs of 50 The values are detailed in Table 6.
Table 6: EC of each humanized antibody to GUCY2C-His binding 50
Sample of | EC 50 (nM) |
10D7-Hu | 2.051 |
10D7-HuG | 1.483 |
32H2-Hu | 0.468 |
32H2-HuG | 0.746 |
EXAMPLE 9 binding Activity of humanized antibodies to human GUCY2C-Fc protein
The binding capacity of each humanized antibody to GUCY2C-Fc was determined by the method of example 2, wherein the secondary antibody was an HRP-labeled anti-human IgG Fab, and the experimental results are shown in FIG. 7, in which humanized antibodies 32H2-Hu and 10D7-HuG have higher affinity to GUCY2C-Fc, and 32H2-HuG and 10D7-Hu have slightly weaker respective ECs 50 The values are detailed in Table 7.
Table 7: EC of each humanized antibody to GUCY2C-Fc binding 50
Sample of | EC 50 (nM) |
10D7-Hu | 0.657 |
10D7-HuG | 0.407 |
32H2-Hu | 0.452 |
32H2-HuG | 0.585 |
EXAMPLE 10 binding Activity of humanized antibodies against human GUCY2C overexpressing tumor cells
Reference example 3 determination of binding of humanized antibody to human GUCY2C overexpressing tumor cell CW2-GCC#1 (a cell line highly expressed in GUCY2C obtained by infection of CW2 cells with a lentivirus expressing the GUCY2C gene and a monoclonal screening method) Affinity, as shown in FIG. 8, 32H2-Hu and 32H2-HuG bind efficiently to CW2-GCC # 1 cells, and their ECs 50 96.32nM and 80.47nM, respectively, are significantly better than 10D7-Hu and 10D7-HuG.
EXAMPLE 11 construction and preparation of bispecific antibodies against GUCY2C and CD3
By gene synthesis and conventional molecular cloning methods, the sequences of the Anti-GUCY 2C monoclonal antibody and the Anti-CD3 monoclonal antibody SP34 are constructed into the Anti-GUCY 2C and CD3 bispecific antibodies with the structural formulas shown in figure 9, and for the structure of the bispecific antibody of Knob-into-Hole, the Anti-CD3-scFv can be on the heavy chain of Knob or on the heavy chain of Hole. The Fc terminal of each bispecific antibody may be a structure that retains or removes biological functions such as ADCC and CDC. The heavy chain and light chain sequences of each bispecific antibody were constructed onto expression vector pcDNA3.4, and co-transfected HEK-293F cells were paired for expression and purified to obtain each sample.
Wherein the Anti-CD3-scFv has a specific structure of VH-L-VL, and the VH and VL region sequences are derived from a constructed humanized Anti-CD3 monoclonal antibody SP34. The construction method of the anti-CD3 monoclonal antibody SP34 comprises the following steps:
(1) Acquisition of CDR region sequences of murine anti-human CD3 monoclonal antibody mSP34
The heavy and light chain variable region amino acid sequences of murine anti-human CD3 monoclonal antibodies are derived from SEQ ID NOs 2 and 4, respectively, in US8236308B 2.
mSP34 heavy chain variable region amino acid sequence:
EVKLLESGGGLVQPKGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVKDRFTISRDDSQSILYLQMNNLKTEDTAMYYCVRHGNFGNSYVSWFAYWGQGTLVTVSA
mSP34 light chain variable region amino acid sequence:
QAVVTQESALTTSPGETVTLTCRSSTGAVTTSNYANWVQEKPDHLFTGLIGGTNKRAPGVPARFSGSLIGDKAALTITGAQTEDEAIYFCALWYSNLWVFGGGTKLTVL
the heavy and light chain variable region amino acid sequences of mSP were analyzed to determine mSP the antigen complementarity determining regions (complementarity determining region, CDRs) and framework regions of the heavy and light chains, respectively, according to the Kabat rules. The amino acid sequence of the mSP heavy chain CDR is HCDR1: TYANN (SEQ ID NO: 101), HCDR2: RIRSKYNNYATYYADSVKD (SEQ ID NO: 102) and HCDR3: HGNFGNSYVSWFAY (SEQ ID NO: 103), the amino acid sequence of the light chain CDR is LCDR1: RSSTGAVTTSNYAN (SEQ ID NO: 104), LCDR2: GTNKRAP (SEQ ID NO: 105) and LCDR3: ALWYSNLWV (SEQ ID NO: 106).
(2) The humanization process of the SP34 monoclonal antibody is as follows:
at the position ofhttps://www.ncbi.nlm.nih.gov/igblast/The heavy chain variable region of murine SP34 (mSP) was compared with human IgG germline sequences for homology, IGHV3-23 x 04 was selected as heavy chain CDR grafting template, mSP heavy chain CDRs were grafted into IGHV3-23 x 04 framework regions, and WGQGTLVTVSS was added after HCDR3 as the fourth framework region to obtain CDR grafted heavy chain variable region sequences. Similarly, the light chain variable region of mSP was compared with human IgG germline sequence homology, IGLV7-46 x 01 was selected as light chain CDR grafting template, light chain CDRs of SP34 were grafted into framework regions of IGLV7-46 x 01, and FGQGTKVEIK was added as fourth framework region after LCDR3 to obtain CDR grafted light chain variable region sequences. Some amino acid sites were mutated based on the CDR-grafted variable region. In the case of mutation, the amino acid sequence is Kabat-encoded, and the position of the site is indicated by the Kabat code.
Preferably, for the CDR-grafted heavy chain variable region, S at position 49 is mutated to a, N at position 73 is mutated to D, a at position 93 is mutated to V, and K at position 94 is mutated to R. For CDR-grafted light chain variable regions, F at position 36 was mutated to V, T at position 46 was mutated to G, Y at position 49 was mutated to G, W at position 57 was mutated to G, and T at position 58 was mutated to V. The heavy and light chain variable regions with back mutation sites described above are defined as SP34 humanized heavy and light chain variable regions (SEQ ID NOS: 107 and 108), respectively.
(3) Construction of full Length SP34 monoclonal antibodies
DNA encoding the humanized heavy and light chain variable regions described above was synthesized by Shanghai Biotechnology Inc. Connecting the synthesized humanized heavy chain variable region with the coding gene of a human IgG1 heavy chain constant region (SEQ ID NO: 109) to obtain a full-length humanized heavy chain gene, which is named SP34-HC (amino acid sequence is SEQ ID NO:95, nucleic acid sequence is SEQ ID NO: 96); the humanized light chain variable region was linked to the gene encoding the human Kappa chain constant region (SEQ ID NO: 110) to give a full-length humanized light chain gene designated SP34-LC (amino acid sequence: 97, nucleic acid sequence: 98). The SP34-HC and SP34-LC genes were separately constructed into pcDNA3.4 expression vectors, the obtained heavy chain and light chain expression vectors were transferred into HEK293F cells together using PEI transfection method to express the antibodies, and the antibodies were purified using Protein A affinity chromatography, and the obtained humanized antibody was designated as SP34.
Table 8: sequence listing of variable regions of each bispecific antibody
EXAMPLE 12 binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-His proteins
The binding capacity of each bispecific antibody to GUCY2C-His was measured by the method of example 6, the secondary antibody was HRP-anti-human IgG Fab, and the experimental results are shown in FIG. 10 and FIG. 11, and each bispecific antibody binds EC 50 As shown in tables 9 and 10. As can be seen, the bispecific antibodies 32H2CH1-SP4VHL and 32H2-SP4VHL maintained a higher binding affinity for GUCY2C-His, their ECs 50 The value is equivalent to that of the monoclonal antibody 32H2-Hu and is superior to that of the 10D7-Hu monoclonal antibody and the corresponding bispecific antibody.
Table 9: EC of each bispecific antibody to GUCY2C-his binding 50
Sample of | EC 50 (nM) |
10D7-Hu | 2.01 |
32H2-Hu | 0.24 |
SP4VHL-32H2 | 1.86 |
32H2CH1-SP4VHL | 0.29 |
32H2Fab-SP4VHL | 35.47 |
32H2-SP4VHL-L | 0.49 |
10D7CH1-SP4VHL | 1.23 |
SP34 | \ |
Table 10: EC of each bispecific antibody to GUCY2C-his binding 50
Sample of | EC 50 (nM) |
SP4VHL-32H2 | 2.09 |
32H2CH1-SP4VHL | 0.23 |
32H2Fab-SPVHL | 30.10 |
32H2-SP4VHL-L | 0.49 |
10D7CH1-SP4VHL | 1.18 |
32H2-SP4VHL | 0.26 |
SP4VHL-32H2-L | 1.21 |
EXAMPLE 13 binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-Fc protein
The binding capacity of each bispecific antibody to GUCY2C-Fc was measured by the method of example 9, and the results of the experiments are shown in FIG. 12 and FIG. 13, in which each bispecific antibody binds to EC 50 As shown in tables 11 and 12. As can be seen, the bispecific antibodies 32H2CH1-SP4VHL, 32H2-SP4VHL, 10D7CH1-SP4VHL have relatively good binding activity to GUCY 2C-Fc.
Table 11: EC of each bispecific antibody to GUCY2C-Fc binding 50
Sample of | EC 50 (nM) |
10D7-Hu | 0.53 |
32H2-Hu | 0.23 |
SP4VHL-32H2 | 1.00 |
32H2CH1-SP4VHL | 0.24 |
32H2Fab-SP4VHL | 5.08 |
32H2-SP4VHL-L | 0.38 |
10D7CH1-SP4VHL | 0.29 |
SP34 | \ |
Table 12: EC of each bispecific antibody to GUCY2C-Fc binding 50
Sample of | EC 50 (nM) |
SP4VHL-32H2 | 1.19 |
32H2CH1-SP4VHL | 0.33 |
32H2Fab-SP4VHL | 1.04 |
32H2-SP4VHL-L | 0.43 |
10D7CH1-SP4VHL | 0.43 |
32H2-SP4VHL | 0.29 |
SP4VHL-32H2-L | 0.78 |
EXAMPLE 14 binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 epsilon protein
The binding capacity of each bispecific antibody to CD 3. Epsilon. Was determined by the method of example 9, wherein the antigen coated was CD 3. Epsilon. -ECD-his (preparation of reference example 1, CD 3. Epsilon. -ECD sequence from UniProt, accession number P07766) at a coating concentration of 1. Mu.g/mL, and the experimental results are shown in FIGS. 14 and 15, and each bispecific antibody binds to EC 50 As shown in tables 13 and 14. As can be seen, the bispecific antibodies 32H2CH1-SP4VHL, 10D7CH1-SP4VHL and SP4VHL-32H2, 32H2-SP4VHL have a relatively better affinity for CD3 ε, but slightly weaker than their corresponding monoclonal antibodies SP34.
Table 13: EC of each bispecific antibody to CD3 epsilon binding 50
Table 14: EC of each bispecific antibody to CD3 epsilon binding 50
Sample of | EC 50 (nM) |
SP4VHL-32H2 | 0.59 |
32H2CH1-SP4VHL | 0.37 |
32H2Fab-SP4VHL | 6.12 |
32H2-SP4VHL-L | 1.09 |
10D7CH1-SP4VHL | 0.35 |
32H2-SP4VHL | 0.30 |
SP4VHL-32H2-L | 1.16 |
Example 15 binding Activity of anti-GUCY 2C and CD3 bispecific antibodies against CD3 ε delta heterodimer proteins
The binding capacity of each bispecific antibody to CD 3. Epsilon. Delta. Was determined by the method of example 9, wherein the antigen coated was CD 3. Epsilon. Delta. ECD-his (preparation of reference example 1, CD3. Epsilon. Delta. ECD sequence from UniProt, accession number P07766, P04234) at a coating concentration of 1. Mu.g/mL, and the results of the experiments are shown in FIGS. 16 and 17, with each bispecific antibody binding EC 50 As shown in tables 15 and 16. As can be seen, the bispecific antibodies 32H2CH1-SP4VHL, 10D7CH1-SP4VHL and 32H2-SP4VHL have a relatively better affinity for CD3 εdelta, but slightly weaker than their corresponding monoclonal antibodies SP34.
Table 15: EC of each bispecific antibody to CD3 epsilon delta binding 50
Sample of | EC 50 (nM) |
10D7-Hu | \ |
32H2-Hu | \ |
SP4VHL-32H2 | 0.58 |
32H2CH1-SP4VHL | 0.33 |
32H2Fab-SP4VHL | 4.97 |
32H2-SP4VHL-L | 1.16 |
10D7CH1-SP4VHL | 0.24 |
SP34 | 0.13 |
Table 16: EC of each bispecific antibody to CD3 epsilon delta binding 50
EXAMPLE 16 binding Activity of Knob-into-Hole anti-GUCY 2C and CD3 bispecific antibodies against GUCY2C-His and GUCY2C-Fc proteins
Binding affinities of each bispecific antibody of Knob-into-Hole structure to GUCY2C-His and GUCY2C-Fc were determined by the methods of example 12 and example 13, respectively, and the experimental results are shown in FIG. 18 and FIG. 19, and each sample was bound to EC 50 As shown in tables 17 and 18. As can be seen, the binding affinities of 32H2CH1-SP4VHL-KIH and 32H2CH1-SP4VHL for GUCY2C-His were relatively good, comparable to their corresponding monoclonal antibodies 32H2-Hu, 32H2CH1-SP4VHL-hik and SP4VHL-32H 2-KIH. The binding affinity for GUCY2C-Fc is relatively good at 32H2CH1-SP4VHL, 32H2CH1-SP4VHL-KIH, SP4VHL-32H2-KIH and 32H2CH1-SP4 VHL-hik.
Table 17: EC of each Knob-into-Hole bispecific antibody to GUCY2C-His binding 50
Sample of | EC 50 (nM) |
SP34 | \ |
32H2-Hu | 0.244 |
SP4VHL-32H2 | 1.842 |
SP4VHL-32H2-hik | 0.733 |
SP4VHL-32H2-KIH | 0.405 |
32H2CH1-SP4VHL | 0.188 |
32H2CH1-SP4VHL-hik | 0.428 |
32H2CH1-SP4VHL-KIH | 0.264 |
Table 18: EC of each Knob-into-Hole bispecific antibody to GUCY2C-Fc binding 50
Sample of | EC 50 (nM) |
SP34 | \ |
32H2-Hu | 0.759 |
SP4VHL-32H2 | 2.831 |
SP4VHL-32H2-hik | 1.084 |
SP4VHL-32H2-KIH | 0.752 |
32H2CH1-SP4VHL | 0.521 |
32H2CH1-SP4VHL-hik | 0.869 |
32H2CH1-SP4VHL-KIH | 0.681 |
EXAMPLE 17 binding Activity of Knob-into-Hole anti-GUCY 2C and CD3 bispecific antibodies against CD3 epsilon protein and CD3 epsilon delta heterodimer protein
Binding affinities of each bispecific antibody of Knob-into-Hole structure to CD3 ε and CD3 ε delta proteins were measured by the methods of example 14 and example 15, respectively, and the experimental results are shown in FIG. 20 and FIG. 21, and EC bound to each sample 50 As shown in tables 19 and 20. As can be seen, SP34 is a bivalent 32H2CH1-SP4VHL with a stronger affinity for CD3 ε and CD3 ε δ, but about 2-fold weaker than its corresponding monoclonal antibody SP34, and secondly, the bivalent SP4VHL-32H2 is weaker than SP34 in affinityAbout 5-fold, further, SP34 is a monovalent structure of 32H2CH1-SP4VHL-KIH, 32H2CH1-SP4VHL-hik and SP4VHL-32H2-KIH, SP4VHL-32H2-hik, which are about 10-fold or more weaker than SP34 in affinity for both CD3 ε and CD3 ε δ. In contrast, reduced affinity for CD3 also reduces the "cytokine storm" side effects caused by binding of antibodies to CD 3.
Table 19: EC of CD3 epsilon binding by each Knob-into-Hole bispecific antibody 50
Sample of | EC 50 (nM) |
SP34 | 0.154 |
32H2-Hu | \ |
SP4VHL-32H2 | 0.792 |
SP4VHL-32H2-hik | 4.016 |
SP4VHL-32H2-KIH | 2.373 |
32H2CH1-SP4VHL | 0.309 |
32H2CH1-SP4VHL-hik | 1.921 |
32H2CH1-SP4VHL-KIH | 1.306 |
Table 20: EC of CD3 ε delta binding by each Knob-into Hole bispecific antibody 50
Sample of | EC 50 (nM) |
SP34 | 0.118 |
32H2-Hu | \ |
SP4VHL-32H2 | 0.651 |
SP4VHL-32H2-hik | 5.999 |
SP4VHL-32H2-KIH | 3.206 |
32H2CH1-SP4VHL | 0.252 |
32H2CH1-SP4VHL-hik | 3.720 |
32H2CH1-SP4VHL-KIH | 2.530 |
Example 18 anti-GUCY 2C and CD3 bispecific antibodies mediate the killing Activity of T cells on tumor cells
This example uses the following method to determine the killing activity of anti-GUCY 2C and CD3 bispecific antibodies on tumor cells:
1. CW2 cells in the logarithmic growth phase were counted for digestion prior to T cell sorting and plated at 5000 cells per well in 96-well cell culture plates (1640 complete medium was used).
2. T cells were isolated from PBMCs according to the instructions of Pan T cell Isolation Kit (purchased from Miltenyi Biotec, cat. No. 130-096-535).
3. The T cells were sorted according to 5:1 (E: t=5:1) (2.5E4T cells per well) was added to the 96-well cell culture plates described above; each antibody was simultaneously diluted at 10 concentrations in a 3-fold gradient at a maximum working concentration of 900nM and added to the 96-well cell culture plate described above. At 37℃5% CO 2 The incubator was incubated for 96 hours.
4. Before CTG detection, liquid is directly and lightly thrown away, PBS is used for cleaning for four times (T cells in the liquid are removed), the cleaning effect of the T cells is observed and confirmed under a microscope, and finally 100 mu L of PBS is added.
5. CTG detection reagent (75 mu L/hole) is added, incubated for 8min at room temperature in a dark place, and fluorescence value is measured in a multifunctional enzyme-labeled instrument to analyze data.
The experimental results are shown in FIG. 22, which shows that each sample mediates T cell killing of CW2 cells of human colon cancer 50 As shown in table 21. It can be seen that each bispecific antibody activated T cell has better killing effect on CW2 cells than the single drug effect of SP34, and is relatively superior in terms of 32H2CH1-SP4VHL and 10D7CH1-SP4VHL activities, followed by SP4VHL-32H2 and 32H2-SP4VHL-L.
Table 21: bispecific antibody mediated T cell killing IC of CW2 cells 50 Value of
Sample of | IC 50 (nM) |
10D7CH1-SP4VHL | 0.02 |
SP4VHL-32H2 | 0.46 |
32H2CH1-SP4VHL | 0.05 |
32H2Fab-SP4VHL | 51.05 |
32H2-SP4VHL-L | 0.08 |
SP34 | 212.0 |
Example 19 killing Activity of anti-GUCY 2C and CD3 bispecific antibodies by Knob-into-Hole constructs on tumor cells
This example uses the experimental method described in example 18 to determine the killing activity of human colon cancer T84 cells by anti-GUCY 2C and CD3 bispecific antibodies. The experimental results are shown in FIG. 23, which shows that each sample mediates the killing of T cells to human colon cancer T84 cells 50 As shown in table 22. It can be seen that both SP34 and 32H2-Hu mab are ineffective in mediating specific killing of T84 cells by T cells, whereas each bispecific antibody mediates killing of T84 cells by T cells. Wherein SP34 is the most active 32H2CH1-SP4VHL with bivalent structure, and SP34 is 32H2CH1-SP4VHL-KIH, 32H2CH1-S with monovalent structureP4VHL-hik, SP4VHL-32H2-KIH and SP4VHL-32H2-hik are superior in activity, and further, SP34 and 32H2-Hu are 32H2Fab-SP4VHL of monovalent structure.
Table 22: bispecific antibody mediated T cell killing T84 cell IC 50 Value of
Sample of | IC 50 (nM) |
32H2CH1-SP4VHL-KIH | 0.366 |
32H2CH1-SP4VHL-hik | 0.002 |
32H2CH1-SP4VHL | \ |
SP4VHL-32H2-KIH | 0.963 |
SP4VHL-32H2-hik | 0.142 |
SP4VHL-32H2 | 0.004 |
32H2Fab-SP4VHL | 1.253 |
32H2-Hu | \ |
SP34 | \ |
All documents mentioned in this application are incorporated by reference as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the claims appended hereto.
Sequence listing
<110> Dansheng medicine technology (Shanghai) Co., ltd
<120> an anti-GUCY 2C/CD3 bispecific antibody and uses thereof
<160> 116
<170> SIPOSequenceListing 1.0
<210> 1
<211> 348
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 1
caggtccagc tgcagcagtc tggggctgac ctggcaagac ctggggcctc agtgaagatg 60
tcctgcaagg cttctggcta cacctttact agctacacga tgcactgggt aaaacagagg 120
cctggacagg gtctggaatg gattggatac attaatccta gcagtggtta tactaattac 180
aatcagaagt tccaggacaa ggccacattg actgcagaca aatcctccag cacagcctac 240
atgcaactga gcagcctgac atctgaggac tctgcagtct attactgtgc aagattggga 300
aggatcggcg tgtactgggg ccaaggcacc actcttacag tctcctcc 348
<210> 2
<211> 116
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 2
Gln Val Gln Leu Gln Gln Ser Gly Ala Asp Leu Ala Arg Pro Gly Ala
1 5 10 15
Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr
20 25 30
Thr Met His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45
Gly Tyr Ile Asn Pro Ser Ser Gly Tyr Thr Asn Tyr Asn Gln Lys Phe
50 55 60
Gln Asp Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Ala Tyr
65 70 75 80
Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Leu Gly Arg Ile Gly Val Tyr Trp Gly Gln Gly Thr Thr Leu
100 105 110
Thr Val Ser Ser
115
<210> 3
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 3
Ser Tyr Thr Met His
1 5
<210> 4
<211> 17
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 4
Tyr Ile Asn Pro Ser Ser Gly Tyr Thr Asn Tyr Asn Gln Lys Phe Gln
1 5 10 15
Asp
<210> 5
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 5
Leu Gly Arg Ile Gly Val Tyr
1 5
<210> 6
<211> 318
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 6
caaattgttc tctcccagtc tccagcaatc ctgtctgcat ttccagggga aaaggtcaca 60
ctgacttgca gggccagctc aagtgtaagt ttcatacact ggtaccagca gaagccagga 120
tcctccccca aaccctggat ttatgccaca tccaacctgg cttctggagt ccctgatcgc 180
ttcagtggca gtgggtctgg gacctctttc tctttcacaa tcagcagagt ggaggctgaa 240
gatgctgcca cttattactg ccagcagtgg agtagtaacc cgtggacgtt cggtggaggc 300
accaagctgg aaatcaag 318
<210> 7
<211> 106
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 7
Gln Ile Val Leu Ser Gln Ser Pro Ala Ile Leu Ser Ala Phe Pro Gly
1 5 10 15
Glu Lys Val Thr Leu Thr Cys Arg Ala Ser Ser Ser Val Ser Phe Ile
20 25 30
His Trp Tyr Gln Gln Lys Pro Gly Ser Ser Pro Lys Pro Trp Ile Tyr
35 40 45
Ala Thr Ser Asn Leu Ala Ser Gly Val Pro Asp Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Ser Phe Ser Phe Thr Ile Ser Arg Val Glu Ala Glu
65 70 75 80
Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Trp Thr
85 90 95
Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 8
<211> 10
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 8
Arg Ala Ser Ser Ser Val Ser Phe Ile His
1 5 10
<210> 9
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 9
Ala Thr Ser Asn Leu Ala Ser
1 5
<210> 10
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 10
Gln Gln Trp Ser Ser Asn Pro Trp Thr
1 5
<210> 11
<211> 351
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
gaagtgatgc tggtggagtc tgggggagac ttggtgaagc ctggagggtc cctgaaactc 60
tcctgtgcag cctctggatt cagtttcagg acctatgcca tgtcttgggt tcgccagagt 120
ccggagaaga gtctggagtg ggtcgcaacc attagtagtg gtagtagtta catttactat 180
ccagacagtg tgaaggggcg attcaccgtt ttcagagaca atgccaagaa taccctgtac 240
ctgcaaatga gcagtctgag gtctgaggac tcggccattt attactgtac atgttataga 300
atggaaactt ttgagtactg gggccaaggc accactctca cagtctcctc a 351
<210> 12
<211> 117
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 12
Glu Val Met Leu Val Glu Ser Gly Gly Asp Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ser Pro Glu Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Phe Arg Asp Asn Ala Lys Asn Thr Leu Tyr
65 70 75 80
Leu Gln Met Ser Ser Leu Arg Ser Glu Asp Ser Ala Ile Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Thr
100 105 110
Leu Thr Val Ser Ser
115
<210> 13
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 13
Thr Tyr Ala Met Ser
1 5
<210> 14
<211> 17
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 14
Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val Lys
1 5 10 15
Gly
<210> 15
<211> 8
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 15
Tyr Arg Met Glu Thr Phe Glu Tyr
1 5
<210> 16
<211> 336
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 16
gatgttgtga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tgaagcctcc 60
atctcttgta gatctagtca gagccttgta tacaataatg gaaacaccta tttacattgg 120
tacctgcaga agccaggcca gtctccaaag ctcctaatct acaaagtttc caaccgattt 180
tctggggtcc cagacaggtt cagtggcagt gggtcaggga cagatttcac actcaagatc 240
agcagagtgg aggctgagga tctgggagtt tatttctgct ctcaaagtac acatgttccg 300
ctcacgttcg gtgctgggac caagctggaa ctgaaa 336
<210> 17
<211> 112
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 17
Asp Val Val Met Thr Gln Thr Pro Leu Ser Leu Pro Val Ser Leu Gly
1 5 10 15
Asp Glu Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn
20 25 30
Asn Gly Asn Thr Tyr Leu His Trp Tyr Leu Gln Lys Pro Gly Gln Ser
35 40 45
Pro Lys Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro
50 55 60
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile
65 70 75 80
Ser Arg Val Glu Ala Glu Asp Leu Gly Val Tyr Phe Cys Ser Gln Ser
85 90 95
Thr His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys
100 105 110
<210> 18
<211> 16
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 18
Arg Ser Ser Gln Ser Leu Val Tyr Asn Asn Gly Asn Thr Tyr Leu His
1 5 10 15
<210> 19
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 19
Lys Val Ser Asn Arg Phe Ser
1 5
<210> 20
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 20
Ser Gln Ser Thr His Val Pro Leu Thr
1 5
<210> 21
<211> 357
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 21
gacgtgaaac tcgtggagtc tgggggagtc ttagtgaagc ttggagggtc cctgaaactc 60
tcctgtgcag cctctggatt cactttcagt ggctatttca tgtcttgggt tcgccagact 120
ccagagaaga ggctggagtt ggtcgcagcc attaatagtg atggtggtag cacctactat 180
ccagacactg tgaagggccg attcaccatc tccagagaca atgccaaaaa caccctctac 240
ctgcaaatga gcagtctgaa gtctgaggac acggccttat attactgtgc aagacttgca 300
aggtacctct atgctatgga ctactggggt caaggaacct cagtcaccgt ctcctca 357
<210> 22
<211> 119
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 22
Asp Val Lys Leu Val Glu Ser Gly Gly Val Leu Val Lys Leu Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Gly Tyr
20 25 30
Phe Met Ser Trp Val Arg Gln Thr Pro Glu Lys Arg Leu Glu Leu Val
35 40 45
Ala Ala Ile Asn Ser Asp Gly Gly Ser Thr Tyr Tyr Pro Asp Thr Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr
65 70 75 80
Leu Gln Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Leu Tyr Tyr Cys
85 90 95
Ala Arg Leu Ala Arg Tyr Leu Tyr Ala Met Asp Tyr Trp Gly Gln Gly
100 105 110
Thr Ser Val Thr Val Ser Ser
115
<210> 23
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 23
Gly Tyr Phe Met Ser
1 5
<210> 24
<211> 17
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 24
Ala Ile Asn Ser Asp Gly Gly Ser Thr Tyr Tyr Pro Asp Thr Val Lys
1 5 10 15
Gly
<210> 25
<211> 10
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 25
Leu Ala Arg Tyr Leu Tyr Ala Met Asp Tyr
1 5 10
<210> 26
<211> 318
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 26
caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc 60
atgacctgca gtgccagctc aagtgtaagt tacatgtact ggtaccagca gaagccagga 120
tcctccccca gactcctgat ttatgacaca tccaacctgg cttctggagt ccctgttcgc 180
ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagccgaat ggaggctgaa 240
gatgctgcca cttattactg ccagcagtgg actagttctt catggacgtt cggtggaggc 300
accaagctgg aaatcaaa 318
<210> 27
<211> 106
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 27
Gln Ile Val Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly
1 5 10 15
Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30
Tyr Trp Tyr Gln Gln Lys Pro Gly Ser Ser Pro Arg Leu Leu Ile Tyr
35 40 45
Asp Thr Ser Asn Leu Ala Ser Gly Val Pro Val Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Arg Met Glu Ala Glu
65 70 75 80
Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Thr Ser Ser Ser Trp Thr
85 90 95
Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 28
<211> 10
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 28
Ser Ala Ser Ser Ser Val Ser Tyr Met Tyr
1 5 10
<210> 29
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 29
Asp Thr Ser Asn Leu Ala Ser
1 5
<210> 30
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 30
Gln Gln Trp Thr Ser Ser Ser Trp Thr
1 5
<210> 31
<211> 357
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 31
caggtgcagc tgaagcagtc aggacctggc ctagtgcagc cctcacagag cctgtccatc 60
acctgcacag tctctggttt ctcattaact aactatggtg tacactgggt tcgccagtct 120
ccaggaaagg gtctggagtg gctgggagtg atatggagtg gtggaaggaa agactataat 180
gcagctttca tatccagact gaacatcacc aaggacaatt ccaagagtca agttttcttt 240
acaatgaaca gtctgcattc tgatgacaca gccatatact actgtgccag acatggcacc 300
tacccttact ggtacttcgc tctctggggc gcagggacct cggtcaccat ctcctca 357
<210> 32
<211> 119
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 32
Gln Val Gln Leu Lys Gln Ser Gly Pro Gly Leu Val Gln Pro Ser Gln
1 5 10 15
Ser Leu Ser Ile Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Asn Tyr
20 25 30
Gly Val His Trp Val Arg Gln Ser Pro Gly Lys Gly Leu Glu Trp Leu
35 40 45
Gly Val Ile Trp Ser Gly Gly Arg Lys Asp Tyr Asn Ala Ala Phe Ile
50 55 60
Ser Arg Leu Asn Ile Thr Lys Asp Asn Ser Lys Ser Gln Val Phe Phe
65 70 75 80
Thr Met Asn Ser Leu His Ser Asp Asp Thr Ala Ile Tyr Tyr Cys Ala
85 90 95
Arg His Gly Thr Tyr Pro Tyr Trp Tyr Phe Ala Leu Trp Gly Ala Gly
100 105 110
Thr Ser Val Thr Ile Ser Ser
115
<210> 33
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 33
Asn Tyr Gly Val His
1 5
<210> 34
<211> 16
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 34
Val Ile Trp Ser Gly Gly Arg Lys Asp Tyr Asn Ala Ala Phe Ile Ser
1 5 10 15
<210> 35
<211> 11
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 35
His Gly Thr Tyr Pro Tyr Trp Tyr Phe Ala Leu
1 5 10
<210> 36
<211> 321
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 36
gatatccaaa tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 60
atcagttgca gggcaagtca ggacatcagt aattatttaa actggtatca gcagaaacca 120
gatggaactt ttaaactcct ggtctactac acatcaagat tacagtcagg ggtcccatca 180
aggttcagtg gcagtgggtc tggaacactt tattctctca ccattagcac cctggagcaa 240
gaggatgttg ccacttactt ttgccaacag ggtaaaacgc ttccgttttc gttcggtgga 300
ggcaccaggc tggaaatcaa a 321
<210> 37
<211> 107
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 37
Asp Ile Gln Met Thr Gln Thr Thr Ser Ser Leu Ser Ala Ser Leu Gly
1 5 10 15
Asp Arg Val Thr Ile Ser Cys Arg Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Asp Gly Thr Phe Lys Leu Leu Val
35 40 45
Tyr Tyr Thr Ser Arg Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Leu Tyr Ser Leu Thr Ile Ser Thr Leu Glu Gln
65 70 75 80
Glu Asp Val Ala Thr Tyr Phe Cys Gln Gln Gly Lys Thr Leu Pro Phe
85 90 95
Ser Phe Gly Gly Gly Thr Arg Leu Glu Ile Lys
100 105
<210> 38
<211> 11
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 38
Arg Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn
1 5 10
<210> 39
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 39
Tyr Thr Ser Arg Leu Gln Ser
1 5
<210> 40
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 40
Gln Gln Gly Lys Thr Leu Pro Phe Ser
1 5
<210> 41
<211> 360
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 41
caggttcagg tgcagcagtc tggagttgaa ctgatgaagc ctggggcctc agtgaagata 60
tcctgcaagg ctactggcta ctcattcagt tcttactgga tagagtgggt aaagcagagg 120
cctggacatg gccttgagtg gattggagag atttttcctg gaagtgggac tactacctac 180
aatgagaagt tcaaggacaa ggccacattc actgcagaca catcctccaa cacagcctac 240
atgcaactca gcagcctgac atctgaggac tctgccgtct attattgtgc aaagggtaaa 300
attacgacat actgggtctt cgatgtctgg ggcgcaggga ccacggtcac cgtctcctca 360
<210> 42
<211> 120
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 42
Gln Val Gln Val Gln Gln Ser Gly Val Glu Leu Met Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Ile Ser Cys Lys Ala Thr Gly Tyr Ser Phe Ser Ser Tyr
20 25 30
Trp Ile Glu Trp Val Lys Gln Arg Pro Gly His Gly Leu Glu Trp Ile
35 40 45
Gly Glu Ile Phe Pro Gly Ser Gly Thr Thr Thr Tyr Asn Glu Lys Phe
50 55 60
Lys Asp Lys Ala Thr Phe Thr Ala Asp Thr Ser Ser Asn Thr Ala Tyr
65 70 75 80
Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys
85 90 95
Ala Lys Gly Lys Ile Thr Thr Tyr Trp Val Phe Asp Val Trp Gly Ala
100 105 110
Gly Thr Thr Val Thr Val Ser Ser
115 120
<210> 43
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 43
Ser Tyr Trp Ile Glu
1 5
<210> 44
<211> 17
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 44
Glu Ile Phe Pro Gly Ser Gly Thr Thr Thr Tyr Asn Glu Lys Phe Lys
1 5 10 15
Asp
<210> 45
<211> 11
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 45
Gly Lys Ile Thr Thr Tyr Trp Val Phe Asp Val
1 5 10
<210> 46
<211> 318
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 46
caaattgttc tctcccagtc tccagcaatc ctgtctgcat ctccagggga gaaggtcaca 60
atgacttgca gggccagctc aagtgtaagt tacatgcact ggtaccagca gaagccagga 120
tcctccccca aaccctggat ttatgccaca tccaacctgg cttctggagt ccctgctcgc 180
ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagagt ggaggctgaa 240
gatgctgcca cttattactg ccagcagtgg agtagtaacc cacggacgtt cggtggaggc 300
accaagctgg aaatcaaa 318
<210> 47
<211> 106
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 47
Gln Ile Val Leu Ser Gln Ser Pro Ala Ile Leu Ser Ala Ser Pro Gly
1 5 10 15
Glu Lys Val Thr Met Thr Cys Arg Ala Ser Ser Ser Val Ser Tyr Met
20 25 30
His Trp Tyr Gln Gln Lys Pro Gly Ser Ser Pro Lys Pro Trp Ile Tyr
35 40 45
Ala Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Arg Val Glu Ala Glu
65 70 75 80
Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Arg Thr
85 90 95
Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 48
<211> 10
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 48
Arg Ala Ser Ser Ser Val Ser Tyr Met His
1 5 10
<210> 49
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 49
Ala Thr Ser Asn Leu Ala Ser
1 5
<210> 50
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 50
Gln Gln Trp Ser Ser Asn Pro Arg Thr
1 5
<210> 51
<211> 117
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 51
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ser Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser
115
<210> 52
<211> 351
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 52
gaggtgcagc tggtggagag cggcggcggt ctggtgaagc ctggaggctc tctgagactg 60
tcttgtgctg cctctggctt tacctttagc acctatgcca tgagctgggt gcggcaggcc 120
cccggcaagg gcctggagtg ggtgagcacc atctcttctg gttcttctta tatctattat 180
cctgattctg tgaagggaag attcaccatc tctagagata atgctaagaa tagtctgtat 240
ctgcagatga atagtctgag agctgaggat acagccgtgt attattgtgc tagatataga 300
atggagacct ttgagtattg gggccagggc accctggtga ccgtgagtag t 351
<210> 53
<211> 117
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 53
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser
115
<210> 54
<211> 351
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 54
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag c 351
<210> 55
<211> 112
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 55
Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Leu Gly
1 5 10 15
Gln Pro Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn
20 25 30
Asn Gly Asn Thr Tyr Leu His Trp Phe Gln Gln Arg Pro Gly Gln Ser
35 40 45
Pro Arg Arg Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro
50 55 60
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile
65 70 75 80
Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr Cys Ser Gln Ser
85 90 95
Thr His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105 110
<210> 56
<211> 336
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 56
gatgtggtca tgacccagtc tccactgtcc ctgcctgtga ccctgggcca gcccgcttct 60
atctcttgta gatcttctca gtctctggtg tataataatg gaaataccta tctgcattgg 120
ttccagcaga gacctggaca gtctcctaga aggctgatct ataaggtgtc taacaggttt 180
tctggcgtgc ctgatagatt ttctggctct ggatctggca cagattttac cctgaagatc 240
tctagagtgg aggctgagga tgtgggcgtg tattattgtt ctcagagcac acacgtgcca 300
ctgacatttg gccagggcac aaaggtggaa attaag 336
<210> 57
<211> 112
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 57
Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Leu Gly
1 5 10 15
Gln Pro Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn
20 25 30
Asn Gly Asn Thr Tyr Leu His Trp Tyr Gln Gln Arg Pro Gly Gln Ser
35 40 45
Pro Arg Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro
50 55 60
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile
65 70 75 80
Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Phe Cys Ser Gln Ser
85 90 95
Thr His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105 110
<210> 58
<211> 336
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 58
gacgttgtga tgacacagtc tcctctgtcc ctgccagtga ccctgggaca gcctgcttct 60
atctcttgta gatcttctca gtctctggtg tacaataatg gaaacacata cctgcactgg 120
taccagcaga gacctggaca gtcccctaga ctgctgatct acaaggtgag taatagattt 180
tctggagtgc ctgatcggtt tagcggctct ggctctggca ccgattttac actgaagatc 240
tctagagtgg aggccgagga tgtgggcgtg tacttttgct cccagagcac acacgtgcct 300
ctgacctttg gacagggaac caaggtggag attaag 336
<210> 59
<211> 119
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 59
Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Gly Val His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Val Ile Trp Ser Gly Gly Arg Lys Asp Tyr Asn Ala Ala Phe Ile
50 55 60
Ser Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu
65 70 75 80
Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95
Arg His Gly Thr Tyr Pro Tyr Trp Tyr Phe Ala Leu Trp Gly Gln Gly
100 105 110
Thr Leu Val Thr Val Ser Ser
115
<210> 60
<211> 357
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 60
caggtgcagc tggtggagtc tggcggcggc gtggtgcagc ctggaaggag tctgagactg 60
agttgtgccg ccagcggatt caccttctct aactatggag tgcattgggt gaggcaggct 120
cctggaaagg gcctggagtg ggtggccgtg atttggtctg gcggaagaaa ggattataat 180
gccgccttta tttcaagatt caccatcagc cgcgataaca gcaagaacac cctgtacctg 240
cagatgaaca gcctgagggc tgaggacacc gccgtgtatt actgcgccag gcacgggacc 300
tacccttact ggtacttcgc cctgtggggc cagggcaccc tggtgaccgt gtctagc 357
<210> 61
<211> 119
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 61
Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Val Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Gly Val His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Leu
35 40 45
Gly Val Ile Trp Ser Gly Gly Arg Lys Asp Tyr Asn Ala Ala Phe Ile
50 55 60
Ser Arg Leu Thr Ile Ser Lys Asp Asn Ser Lys Ser Thr Val Tyr Leu
65 70 75 80
Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95
Arg His Gly Thr Tyr Pro Tyr Trp Tyr Phe Ala Leu Trp Gly Gln Gly
100 105 110
Thr Leu Val Thr Val Ser Ser
115
<210> 62
<211> 357
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 62
caggtgcagc tggtggagtc cggcggcggc gtggtgcagc ctggcagatc tctgaggctg 60
agttgtgctg tgagtggctt cacattttct aactatggcg tgcactgggt gagacaggcc 120
cctggaaagg gactggagtg gctgggagtg atctggtccg gaggaagaaa agattataat 180
gctgccttta tttctaggct gacaattagt aaggataatt ctaagtctac cgtgtatctg 240
cagatgaata gtctgagggc tgaggacaca gccgtgtatt actgcgctag acatggaaca 300
tatccttatt ggtattttgc cctgtgggga cagggcaccc ttgtgaccgt gagctct 357
<210> 63
<211> 107
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 63
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Tyr Thr Ser Arg Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Gly Lys Thr Leu Pro Phe
85 90 95
Ser Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 64
<211> 321
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 64
gatatccaga tgacccagtc tcctagctct ctgtctgctt ctgtgggaga tagagtgacc 60
attacatgta gagcttctca ggatatctcc aattatctga attggtatca gcagaaacca 120
ggcaaggccc caaagctgct gatctactat acatctagac tgcagagcgg cgtgccatcc 180
aggttttctg gctccggatc tggaacagat tttaccttta ccattagctc tctgcagcct 240
gaggatatcg ctacatatta ttgtcagcag ggcaagacac tgcctttttc ttttggccag 300
ggcaccaaag tggagatcaa g 321
<210> 65
<211> 107
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 65
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Thr Phe Lys Leu Leu Val
35 40 45
Tyr Tyr Thr Ser Arg Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln Gln Gly Lys Thr Leu Pro Phe
85 90 95
Ser Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 66
<211> 321
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 66
gatatccaga tgacacagtc tccttcctct ctgtctgcct ctgtgggcga tagggtgaca 60
atcacatgta gagcttctca ggatatctcc aattatctga attggtacca gcagaaacct 120
ggcaagacct ttaagctgct ggtgtactat acctccagac tgcagtctgg agtgccatct 180
agattttctg gctctggctc tggaaccgac tacaccttta ccatctctag cctgcagcct 240
gaagatatcg ctacctattt ttgtcagcag ggcaagactc tgccttttag cttcggccag 300
ggaaccaagg tggagatcaa g 321
<210> 67
<211> 716
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 67
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
130 135 140
Ser Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly
145 150 155 160
Gly Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr
165 170 175
Ser Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg
180 185 190
Gly Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg
195 200 205
Phe Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly
210 215 220
Ala Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser
225 230 235 240
Asn Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly
245 250 255
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln
260 265 270
Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg
275 280 285
Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr Ala Met Ser
290 295 300
Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val Ala Thr Ile
305 310 315 320
Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val Lys Gly Arg
325 330 335
Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr Leu Gln Met
340 345 350
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Thr Cys Tyr
355 360 365
Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu Val Thr Val
370 375 380
Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser
385 390 395 400
Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys
405 410 415
Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu
420 425 430
Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu
435 440 445
Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr
450 455 460
Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val
465 470 475 480
Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
485 490 495
Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe
500 505 510
Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val
515 520 525
Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe
530 535 540
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro
545 550 555 560
Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr
565 570 575
Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val
580 585 590
Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala
595 600 605
Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
610 615 620
Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly
625 630 635 640
Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
645 650 655
Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser
660 665 670
Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln
675 680 685
Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His
690 695 700
Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715
<210> 68
<211> 2148
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 68
gaggtgcagc tggtggagtc cggcggcggc ctggtgcagc ccggcggctc cctgaggctg 60
agctgcgccg ccagcggctt caccttcaac acctacgcca tgaactgggt gagacaggct 120
cccggcaagg gcctggagtg ggtggcccgg atcagatcta aatataacaa ttatgctaca 180
tattatgctg attctgtgaa ggataggttt acaattagca gagatgattc taagaataca 240
ctgtatctgc agatgaacag tctgcgtgct gaggatactg cagtgtatta ttgtgtgaga 300
catggaaatt tcggtaattc ttatgtgagc tggtttgctt actggggcca gggaactctg 360
gtgacagtgt cctctggcgg cggaggctct ggcggagggg gcagtggcgg cggtggctct 420
ggaggcggcg gctctcaggc tgtggtgaca caggaacctt ctctgacagt gtctccagga 480
ggaacagtga ctctgacatg tagaagttct actggagctg tgacaacctc taattatgct 540
aactgggtgc agcagaaacc tggccaggct cctagaggtt tgatcggagg tacaaataag 600
agagcacctg gagtgcctgc tagattttct ggctctctgc tgggcggaaa agctgctctg 660
acactgtctg gagctcagcc tgaggatgaa gctgagtatt attgtgctct gtggtactct 720
aatctgtggg tgttcggaca gggcacaaag gtggaaatta agggcggagg cggctctggc 780
ggcggcggaa gcggcggcgg cggctccgag gtgcagctgg tggaatccgg cggaggcctg 840
gtgaaaccag gcggcagcct gagactgtcc tgtgctgcta gcggtttttc ctttagaact 900
tacgctatga gctgggtgag acaggcccca ggaaagtctc tcgaatgggt ggccacaatt 960
agtagcggca gtagctacat ctactaccct gactccgtga agggccggtt taccgtgagc 1020
cgcgataacg ccaagaactc cctgtacctg cagatgaaca gcctgcgcgc cgaggacacc 1080
gccgtgtact actgcacctg ctaccgaatg gagaccttcg agtactgggg ccagggcacc 1140
ctggtgaccg tgagcagcgc cagcaccaag ggccccagcg tgttccccct ggccccctcc 1200
tccaagtcca cctccggcgg caccgctgcc ctgggctgcc tggtgaagga ctacttccct 1260
gagcctgtga ccgtgagctg gaacagcggc gccctgacct ccggcgtgca caccttcccc 1320
gccgtgctgc agtccagcgg cctgtacagc ctgagctctg tggtgaccgt gccaagcagc 1380
agcctgggca cccagaccta catctgtaac gtgaaccaca agcccagcaa caccaaggtg 1440
gataagaagg tggagcctaa gtcctgcgat aagacccaca cctgcccccc ctgccccgcc 1500
cccgagcttc tgggcggccc atccgtgttc ctgttccccc ccaagcctaa ggacaccctg 1560
atgatcagcc gcacccctga ggtgacctgc gtggtggtgg atgtgagcca cgaggaccct 1620
gaggtgaagt tcaactggta cgtggacggc gtggaggtcc ataacgccaa gaccaagccc 1680
agagaggagc agtataacag cacctacagg gtggtgtccg tgctgaccgt gctgcaccag 1740
gactggctga acggcaagga atacaagtgc aaagtgtcca acaaggctct gccagccccc 1800
atcgaaaaga caatctctaa ggccaagggc cagcccaggg agccccaagt gtacaccctg 1860
cctccctcca gagaggagat gaccaagaac caggtgtccc tgacctgcct ggtgaagggc 1920
ttctacccta gcgacatcgc cgtggagtgg gagagcaacg gccagcctga gaacaactat 1980
aagaccaccc ctcccgtgct ggatagtgac ggatctttct ttctgtatag taagctgacc 2040
gtggacaagt ctagatggca gcagggaaat gtgttttctt gttctgtgat gcatgaagcc 2100
ctgcataatc actacaccca gaagtctctg agcctgtccc caggaaag 2148
<210> 69
<211> 219
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 69
Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Leu Gly
1 5 10 15
Gln Pro Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn
20 25 30
Asn Gly Asn Thr Tyr Leu His Trp Tyr Gln Gln Arg Pro Gly Gln Ser
35 40 45
Pro Arg Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro
50 55 60
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile
65 70 75 80
Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Phe Cys Ser Gln Ser
85 90 95
Thr His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105 110
Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu
115 120 125
Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe
130 135 140
Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln
145 150 155 160
Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser
165 170 175
Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
180 185 190
Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser
195 200 205
Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
210 215
<210> 70
<211> 657
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 70
gacgttgtga tgacacagtc tcctctgtcc ctgccagtga ccctgggaca gcctgcttct 60
atctcttgta gatcttctca gtctctggtg tacaataatg gaaacacata cctgcactgg 120
taccagcaga gacctggaca gtcccctaga ctgctgatct acaaggtgag taatagattt 180
tctggagtgc ctgatcggtt tagcggctct ggctctggca ccgattttac actgaagatc 240
tctagagtgg aggccgagga tgtgggcgtg tacttttgct cccagagcac acacgtgcct 300
ctgacctttg gacagggaac caaggtggag attaagagaa cagtggctgc cccatctgtg 360
tttatttttc caccttccga tgagcagctg aagtctggca ccgcctctgt ggtgtgtctg 420
ctgaataatt tctatcctag agaagctaag gtgcagtgga aggtggataa tgctctgcag 480
agtggcaatt ctcaggagag tgtgacagag caggattcta aagattctac atattctctg 540
agcagcaccc tgacactgtc taaggctgat tacgagaagc ataaggtgta tgcttgcgaa 600
gtgacacatc agggactgtc tagccctgtg actaagtctt ttaatagagg cgagtgt 657
<210> 71
<211> 488
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 71
Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Leu Gly
1 5 10 15
Gln Pro Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn
20 25 30
Asn Gly Asn Thr Tyr Leu His Trp Tyr Gln Gln Arg Pro Gly Gln Ser
35 40 45
Pro Arg Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro
50 55 60
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile
65 70 75 80
Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Phe Cys Ser Gln Ser
85 90 95
Thr His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105 110
Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu
115 120 125
Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe
130 135 140
Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln
145 150 155 160
Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser
165 170 175
Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
180 185 190
Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser
195 200 205
Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys Gly Gly Gly Gly Ser
210 215 220
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val Glu
225 230 235 240
Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
245 250 255
Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn Trp Val Arg
260 265 270
Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile Arg Ser Lys
275 280 285
Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys Asp Arg Phe
290 295 300
Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu Gln Met Asn
305 310 315 320
Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val Arg His Gly
325 330 335
Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp Gly Gln Gly
340 345 350
Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
355 360 365
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala Val Val Thr
370 375 380
Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val Thr Leu Thr
385 390 395 400
Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn Trp
405 410 415
Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile Gly Gly Thr
420 425 430
Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser Leu Leu
435 440 445
Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro Glu Asp Glu
450 455 460
Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp Val Phe Gly
465 470 475 480
Gln Gly Thr Lys Val Glu Ile Lys
485
<210> 72
<211> 1464
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 72
gacgttgtga tgacacagtc tcctctgtcc ctgccagtga ccctgggaca gcctgcttct 60
atctcttgta gatcttctca gtctctggtg tacaataatg gaaacacata cctgcactgg 120
taccagcaga gacctggaca gtcccctaga ctgctgatct acaaggtgag taatagattt 180
tctggagtgc ctgatcggtt tagcggctct ggctctggca ccgattttac actgaagatc 240
tctagagtgg aggccgagga tgtgggcgtg tacttttgct cccagagcac acacgtgcct 300
ctgacctttg gacagggaac caaggtggag attaagagaa cagtggctgc cccatctgtg 360
tttatttttc caccttccga tgagcagctg aagtctggca ccgcctctgt ggtgtgtctg 420
ctgaataatt tctatcctag agaagctaag gtgcagtgga aggtggataa tgctctgcag 480
agtggcaatt ctcaggagag tgtgacagag caggattcta aagattctac atattctctg 540
agcagcaccc tgacactgtc taaggctgat tacgagaagc ataaggtgta tgcttgcgaa 600
gtgacacatc agggactgtc tagccctgtg actaagtctt ttaatagagg cgagtgtgga 660
ggcggcggca gcggaggcgg cggctccggc ggcggcggct ctgaggtgca gctggtggag 720
tccggcggcg gcctggtgca gcccggcggc tccctgaggc tgagctgcgc cgccagcggc 780
ttcaccttca acacctacgc catgaactgg gtgagacagg ctcccggcaa gggcctggag 840
tgggtggccc ggatcagatc taaatataac aattatgcta catattatgc tgattctgtg 900
aaggataggt ttacaattag cagagatgat tctaagaata cactgtatct gcagatgaac 960
agtctgcgtg ctgaggatac tgcagtgtat tattgtgtga gacatggaaa tttcggtaat 1020
tcttatgtga gctggtttgc ttactggggc cagggaactc tggtgacagt gtcctctggc 1080
ggcggaggct ctggcggagg gggcagtggc ggcggtggct ctggaggcgg cggctctcag 1140
gctgtggtga cacaggaacc ttctctgaca gtgtctccag gaggaacagt gactctgaca 1200
tgtagaagtt ctactggagc tgtgacaacc tctaattatg ctaactgggt gcagcagaaa 1260
cctggccagg ctcctagagg tttgatcgga ggtacaaata agagagcacc tggagtgcct 1320
gctagatttt ctggctctct gctgggcgga aaagctgctc tgacactgtc tggagctcag 1380
cctgaggatg aagctgagta ttattgtgct ctgtggtact ctaatctgtg ggtgttcgga 1440
cagggcacaa aggtggaaat taag 1464
<210> 73
<211> 447
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 73
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His
210 215 220
Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val
225 230 235 240
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
245 250 255
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
260 265 270
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
275 280 285
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
290 295 300
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
305 310 315 320
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
325 330 335
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
340 345 350
Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu
355 360 365
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
370 375 380
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
385 390 395 400
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
405 410 415
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
420 425 430
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
<210> 74
<211> 1341
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 74
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
gataagaccc acacctgccc cccctgcccc gcccccgagc ttctgggcgg cccatccgtg 720
ttcctgttcc cccccaagcc taaggacacc ctgatgatca gccgcacccc tgaggtgacc 780
tgcgtggtgg tggatgtgag ccacgaggac cctgaggtga agttcaactg gtacgtggac 840
ggcgtggagg tccataacgc caagaccaag cccagagagg agcagtataa cagcacctac 900
agggtggtgt ccgtgctgac cgtgctgcac caggactggc tgaacggcaa ggaatacaag 960
tgcaaagtgt ccaacaaggc tctgccagcc cccatcgaaa agacaatctc taaggccaag 1020
ggccagccca gggagcccca agtgtacacc ctgcctccct ccagagagga gatgaccaag 1080
aaccaggtgt ccctgacctg cctggtgaag ggcttctacc ctagcgacat cgccgtggag 1140
tgggagagca acggccagcc tgagaacaac tataagacca cccctcccgt gctggatagt 1200
gacggatctt tctttctgta tagtaagctg accgtggaca agtctagatg gcagcaggga 1260
aatgtgtttt cttgttctgt gatgcatgaa gccctgcata atcactacac ccagaagtct 1320
ctgagcctgt ccccaggaaa g 1341
<210> 75
<211> 488
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 75
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
130 135 140
Ser Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly
145 150 155 160
Gly Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr
165 170 175
Ser Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg
180 185 190
Gly Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg
195 200 205
Phe Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly
210 215 220
Ala Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser
225 230 235 240
Asn Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly
245 250 255
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp Val Val
260 265 270
Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Leu Gly Gln Pro Ala
275 280 285
Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val Tyr Asn Asn Gly Asn
290 295 300
Thr Tyr Leu His Trp Tyr Gln Gln Arg Pro Gly Gln Ser Pro Arg Leu
305 310 315 320
Leu Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe
325 330 335
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile Ser Arg Val
340 345 350
Glu Ala Glu Asp Val Gly Val Tyr Phe Cys Ser Gln Ser Thr His Val
355 360 365
Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
370 375 380
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
385 390 395 400
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
405 410 415
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
420 425 430
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
435 440 445
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
450 455 460
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
465 470 475 480
Lys Ser Phe Asn Arg Gly Glu Cys
485
<210> 76
<211> 1464
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 76
gaggtgcagc tggtggagtc cggcggcggc ctggtgcagc ccggcggctc cctgaggctg 60
agctgcgccg ccagcggctt caccttcaac acctacgcca tgaactgggt gagacaggct 120
cccggcaagg gcctggagtg ggtggcccgg atcagatcta aatataacaa ttatgctaca 180
tattatgctg attctgtgaa ggataggttt acaattagca gagatgattc taagaataca 240
ctgtatctgc agatgaacag tctgcgtgct gaggatactg cagtgtatta ttgtgtgaga 300
catggaaatt tcggtaattc ttatgtgagc tggtttgctt actggggcca gggaactctg 360
gtgacagtgt cctctggcgg cggaggctct ggcggagggg gcagtggcgg cggtggctct 420
ggaggcggcg gctctcaggc tgtggtgaca caggaacctt ctctgacagt gtctccagga 480
ggaacagtga ctctgacatg tagaagttct actggagctg tgacaacctc taattatgct 540
aactgggtgc agcagaaacc tggccaggct cctagaggtt tgatcggagg tacaaataag 600
agagcacctg gagtgcctgc tagattttct ggctctctgc tgggcggaaa agctgctctg 660
acactgtctg gagctcagcc tgaggatgaa gctgagtatt attgtgctct gtggtactct 720
aatctgtggg tgttcggaca gggcacaaag gtggaaatta agggcggagg cggctctggc 780
ggcggcggaa gcggcggcgg cggctccgac gttgtgatga cacagtctcc tctgtccctg 840
ccagtgaccc tgggacagcc tgcttctatc tcttgtagat cttctcagtc tctggtgtac 900
aataatggaa acacatacct gcactggtac cagcagagac ctggacagtc ccctagactg 960
ctgatctaca aggtgagtaa tagattttct ggagtgcctg atcggtttag cggctctggc 1020
tctggcaccg attttacact gaagatctct agagtggagg ccgaggatgt gggcgtgtac 1080
ttttgctccc agagcacaca cgtgcctctg acctttggac agggaaccaa ggtggagatt 1140
aagagaacag tggctgcccc atctgtgttt atttttccac cttccgatga gcagctgaag 1200
tctggcaccg cctctgtggt gtgtctgctg aataatttct atcctagaga agctaaggtg 1260
cagtggaagg tggataatgc tctgcagagt ggcaattctc aggagagtgt gacagagcag 1320
gattctaaag attctacata ttctctgagc agcaccctga cactgtctaa ggctgattac 1380
gagaagcata aggtgtatgc ttgcgaagtg acacatcagg gactgtctag ccctgtgact 1440
aagtctttta atagaggcga gtgt 1464
<210> 77
<211> 716
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 77
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His
210 215 220
Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val
225 230 235 240
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
245 250 255
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
260 265 270
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
275 280 285
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
290 295 300
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
305 310 315 320
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
325 330 335
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
340 345 350
Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu
355 360 365
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
370 375 380
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
385 390 395 400
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
405 410 415
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
420 425 430
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys Gly
435 440 445
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val
450 455 460
Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu
465 470 475 480
Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met
485 490 495
Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg
500 505 510
Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val
515 520 525
Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr
530 535 540
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
545 550 555 560
Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr
565 570 575
Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser
580 585 590
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln
595 600 605
Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr
610 615 620
Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn
625 630 635 640
Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu
645 650 655
Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser
660 665 670
Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln
675 680 685
Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu
690 695 700
Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
705 710 715
<210> 78
<211> 2148
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 78
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
gataagaccc acacctgccc cccctgcccc gcccccgagc ttctgggcgg cccatccgtg 720
ttcctgttcc cccccaagcc taaggacacc ctgatgatca gccgcacccc tgaggtgacc 780
tgcgtggtgg tggatgtgag ccacgaggac cctgaggtga agttcaactg gtacgtggac 840
ggcgtggagg tccataacgc caagaccaag cccagagagg agcagtataa cagcacctac 900
agggtggtgt ccgtgctgac cgtgctgcac caggactggc tgaacggcaa ggaatacaag 960
tgcaaagtgt ccaacaaggc tctgccagcc cccatcgaaa agacaatctc taaggccaag 1020
ggccagccca gggagcccca agtgtacacc ctgcctccct ccagagagga gatgaccaag 1080
aaccaggtgt ccctgacctg cctggtgaag ggcttctacc ctagcgacat cgccgtggag 1140
tgggagagca acggccagcc tgagaacaac tataagacca cccctcccgt gctggatagt 1200
gacggatctt tctttctgta tagtaagctg accgtggaca agtctagatg gcagcaggga 1260
aatgtgtttt cttgttctgt gatgcatgaa gccctgcata atcactacac ccagaagtct 1320
ctgagcctgt ccccaggaaa gggaggcggc ggcagcggag gcggcggctc cggcggcggc 1380
ggctctgagg tgcagctggt ggagtccggc ggcggcctgg tgcagcccgg cggctccctg 1440
aggctgagct gcgccgccag cggcttcacc ttcaacacct acgccatgaa ctgggtgaga 1500
caggctcccg gcaagggcct ggagtgggtg gcccggatca gatctaaata taacaattat 1560
gctacatatt atgctgattc tgtgaaggat aggtttacaa ttagcagaga tgattctaag 1620
aatacactgt atctgcagat gaacagtctg cgtgctgagg atactgcagt gtattattgt 1680
gtgagacatg gaaatttcgg taattcttat gtgagctggt ttgcttactg gggccaggga 1740
actctggtga cagtgtcctc tggcggcgga ggctctggcg gagggggcag tggcggcggt 1800
ggctctggag gcggcggctc tcaggctgtg gtgacacagg aaccttctct gacagtgtct 1860
ccaggaggaa cagtgactct gacatgtaga agttctactg gagctgtgac aacctctaat 1920
tatgctaact gggtgcagca gaaacctggc caggctccta gaggtttgat cggaggtaca 1980
aataagagag cacctggagt gcctgctaga ttttctggct ctctgctggg cggaaaagct 2040
gctctgacac tgtctggagc tcagcctgag gatgaagctg agtattattg tgctctgtgg 2100
tactctaatc tgtgggtgtt cggacagggc acaaaggtgg aaattaag 2148
<210> 79
<211> 489
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 79
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Gly Gly Gly Gly
210 215 220
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val
225 230 235 240
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser
245 250 255
Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn Trp Val
260 265 270
Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile Arg Ser
275 280 285
Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys Asp Arg
290 295 300
Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu Gln Met
305 310 315 320
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val Arg His
325 330 335
Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp Gly Gln
340 345 350
Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly
355 360 365
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala Val Val
370 375 380
Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val Thr Leu
385 390 395 400
Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn
405 410 415
Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile Gly Gly
420 425 430
Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser Leu
435 440 445
Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro Glu Asp
450 455 460
Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp Val Phe
465 470 475 480
Gly Gln Gly Thr Lys Val Glu Ile Lys
485
<210> 80
<211> 1467
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 80
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
ggaggcggcg gcagcggagg cggcggctcc ggcggcggcg gctctgaggt gcagctggtg 720
gagtccggcg gcggcctggt gcagcccggc ggctccctga ggctgagctg cgccgccagc 780
ggcttcacct tcaacaccta cgccatgaac tgggtgagac aggctcccgg caagggcctg 840
gagtgggtgg cccggatcag atctaaatat aacaattatg ctacatatta tgctgattct 900
gtgaaggata ggtttacaat tagcagagat gattctaaga atacactgta tctgcagatg 960
aacagtctgc gtgctgagga tactgcagtg tattattgtg tgagacatgg aaatttcggt 1020
aattcttatg tgagctggtt tgcttactgg ggccagggaa ctctggtgac agtgtcctct 1080
ggcggcggag gctctggcgg agggggcagt ggcggcggtg gctctggagg cggcggctct 1140
caggctgtgg tgacacagga accttctctg acagtgtctc caggaggaac agtgactctg 1200
acatgtagaa gttctactgg agctgtgaca acctctaatt atgctaactg ggtgcagcag 1260
aaacctggcc aggctcctag aggtttgatc ggaggtacaa ataagagagc acctggagtg 1320
cctgctagat tttctggctc tctgctgggc ggaaaagctg ctctgacact gtctggagct 1380
cagcctgagg atgaagctga gtattattgt gctctgtggt actctaatct gtgggtgttc 1440
ggacagggca caaaggtgga aattaag 1467
<210> 81
<211> 720
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 81
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Gly Gly Gly Gly
210 215 220
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val
225 230 235 240
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser
245 250 255
Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn Trp Val
260 265 270
Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile Arg Ser
275 280 285
Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys Asp Arg
290 295 300
Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu Gln Met
305 310 315 320
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val Arg His
325 330 335
Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp Gly Gln
340 345 350
Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly
355 360 365
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala Val Val
370 375 380
Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val Thr Leu
385 390 395 400
Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn
405 410 415
Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile Gly Gly
420 425 430
Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser Leu
435 440 445
Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro Glu Asp
450 455 460
Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp Val Phe
465 470 475 480
Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly Gly Ser Asp Lys Thr
485 490 495
His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
500 505 510
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
515 520 525
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
530 535 540
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
545 550 555 560
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
565 570 575
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
580 585 590
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
595 600 605
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
610 615 620
Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys
625 630 635 640
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
645 650 655
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
660 665 670
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
675 680 685
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
690 695 700
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715 720
<210> 82
<211> 2160
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 82
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
ggaggcggcg gcagcggagg cggcggctcc ggcggcggcg gctctgaggt gcagctggtg 720
gagtccggcg gcggcctggt gcagcccggc ggctccctga ggctgagctg cgccgccagc 780
ggcttcacct tcaacaccta cgccatgaac tgggtgagac aggctcccgg caagggcctg 840
gagtgggtgg cccggatcag atctaaatat aacaattatg ctacatatta tgctgattct 900
gtgaaggata ggtttacaat tagcagagat gattctaaga atacactgta tctgcagatg 960
aacagtctgc gtgctgagga tactgcagtg tattattgtg tgagacatgg aaatttcggt 1020
aattcttatg tgagctggtt tgcttactgg ggccagggaa ctctggtgac agtgtcctct 1080
ggcggcggag gctctggcgg agggggcagt ggcggcggtg gctctggagg cggcggctct 1140
caggctgtgg tgacacagga accttctctg acagtgtctc caggaggaac agtgactctg 1200
acatgtagaa gttctactgg agctgtgaca acctctaatt atgctaactg ggtgcagcag 1260
aaacctggcc aggctcctag aggtttgatc ggaggtacaa ataagagagc acctggagtg 1320
cctgctagat tttctggctc tctgctgggc ggaaaagctg ctctgacact gtctggagct 1380
cagcctgagg atgaagctga gtattattgt gctctgtggt actctaatct gtgggtgttc 1440
ggacagggca caaaggtgga aattaaggga ggtggatcag ataagaccca cacctgcccc 1500
ccctgccccg cccccgagct tctgggcggc ccatccgtgt tcctgttccc ccccaagcct 1560
aaggacaccc tgatgatcag ccgcacccct gaggtgacct gcgtggtggt ggatgtgagc 1620
cacgaggacc ctgaggtgaa gttcaactgg tacgtggacg gcgtggaggt ccataacgcc 1680
aagaccaagc ccagagagga gcagtataac agcacctaca gggtggtgtc cgtgctgacc 1740
gtgctgcacc aggactggct gaacggcaag gaatacaagt gcaaagtgtc caacaaggct 1800
ctgccagccc ccatcgaaaa gacaatctct aaggccaagg gccagcccag ggagccccaa 1860
gtgtacaccc tgcctccctc cagagaggag atgaccaaga accaggtgtc cctgacctgc 1920
ctggtgaagg gcttctaccc tagcgacatc gccgtggagt gggagagcaa cggccagcct 1980
gagaacaact ataagaccac ccctcccgtg ctggatagtg acggatcttt ctttctgtat 2040
agtaagctga ccgtggacaa gtctagatgg cagcagggaa atgtgttttc ttgttctgtg 2100
atgcatgaag ccctgcataa tcactacacc cagaagtctc tgagcctgtc cccaggaaag 2160
<210> 83
<211> 722
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 83
Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Val Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Gly Val His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Leu
35 40 45
Gly Val Ile Trp Ser Gly Gly Arg Lys Asp Tyr Asn Ala Ala Phe Ile
50 55 60
Ser Arg Leu Thr Ile Ser Lys Asp Asn Ser Lys Ser Thr Val Tyr Leu
65 70 75 80
Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95
Arg His Gly Thr Tyr Pro Tyr Trp Tyr Phe Ala Leu Trp Gly Gln Gly
100 105 110
Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Gly Gly
210 215 220
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln
225 230 235 240
Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg
245 250 255
Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn
260 265 270
Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile
275 280 285
Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys
290 295 300
Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu
305 310 315 320
Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val
325 330 335
Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp
340 345 350
Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly
355 360 365
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala
370 375 380
Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val
385 390 395 400
Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr
405 410 415
Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile
420 425 430
Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly
435 440 445
Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro
450 455 460
Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp
465 470 475 480
Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly Gly Ser Asp
485 490 495
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
500 505 510
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
515 520 525
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
530 535 540
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
545 550 555 560
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
565 570 575
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
580 585 590
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
595 600 605
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
610 615 620
Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu
625 630 635 640
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
645 650 655
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
660 665 670
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
675 680 685
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
690 695 700
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
705 710 715 720
Gly Lys
<210> 84
<211> 2166
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 84
caggtgcagc tggtggagtc cggcggcggc gtggtgcagc ctggcagatc tctgaggctg 60
agttgtgctg tgagtggctt cacattttct aactatggcg tgcactgggt gagacaggcc 120
cctggaaagg gactggagtg gctgggagtg atctggtccg gaggaagaaa agattataat 180
gctgccttta tttctaggct gacaattagt aaggataatt ctaagtctac cgtgtatctg 240
cagatgaata gtctgagggc tgaggacaca gccgtgtatt actgcgctag acatggaaca 300
tatccttatt ggtattttgc cctgtgggga cagggcaccc ttgtgaccgt gagctctgct 360
agtacaaaag gccctagcgt gtttcctttg gctccatcct ccaagagcac atccggcgga 420
actgctgctc tgggatgtct ggtgaaggat tattttcctg agcctgtgac cgtgtcttgg 480
aatagcggcg ccctgacatc cggagtgcac acatttccag ccgtgctgca gtctagcggc 540
ttatacagcc tgagctctgt ggtgactgtg cctagttctt ctctgggcac ccagacatat 600
atttgtaatg tgaatcataa gccttccaat acaaaggtgg ataagaaggt ggaaccaaag 660
tcttgtggag gcggcggcag cggaggcggc ggctccggcg gcggcggctc tgaggtgcag 720
ctggtggagt ccggcggcgg cctggtgcag cccggcggct ccctgaggct gagctgcgcc 780
gccagcggct tcaccttcaa cacctacgcc atgaactggg tgagacaggc tcccggcaag 840
ggcctggagt gggtggcccg gatcagatct aaatataaca attatgctac atattatgct 900
gattctgtga aggataggtt tacaattagc agagatgatt ctaagaatac actgtatctg 960
cagatgaaca gtctgcgtgc tgaggatact gcagtgtatt attgtgtgag acatggaaat 1020
ttcggtaatt cttatgtgag ctggtttgct tactggggcc agggaactct ggtgacagtg 1080
tcctctggcg gcggaggctc tggcggaggg ggcagtggcg gcggtggctc tggaggcggc 1140
ggctctcagg ctgtggtgac acaggaacct tctctgacag tgtctccagg aggaacagtg 1200
actctgacat gtagaagttc tactggagct gtgacaacct ctaattatgc taactgggtg 1260
cagcagaaac ctggccaggc tcctagaggt ttgatcggag gtacaaataa gagagcacct 1320
ggagtgcctg ctagattttc tggctctctg ctgggcggaa aagctgctct gacactgtct 1380
ggagctcagc ctgaggatga agctgagtat tattgtgctc tgtggtactc taatctgtgg 1440
gtgttcggac agggcacaaa ggtggaaatt aagggaggtg gatcagataa aacacacacc 1500
tgtcctccct gtcccgcccc tgagctgctg ggaggaccat ccgtgttcct gtttcctcca 1560
aagccaaagg ataccctgat gatttctaga acacctgagg tcacatgtgt ggtggtggat 1620
gtgtcccatg aagatcctga ggtaaagttt aactggtatg tggatggcgt ggaggtgcac 1680
aacgccaaga ccaagcccag ggaggagcag tacaactcca cctaccgggt ggtgagcgtg 1740
ctgaccgtgc tgcaccagga ctggctgaac ggcaaggagt acaagtgtaa ggtgtccaac 1800
aaggccctgc ctgcacctat cgaaaagacc atctctaagg ccaagggcca gccccgcgag 1860
ccccaggtgt acaccctgcc tccctcccgc gaggagatga ccaagaacca ggtgtccctg 1920
acctgcctgg tgaagggctt ctacccatcc gacatcgccg tggagtggga gtccaacggc 1980
cagcccgaga acaactacaa gaccaccccc cctgtgctgg actccgacgg cagcttcttc 2040
ctgtacagca agctgaccgt ggacaagtcc agatggcagc agggcaacgt gttcagctgc 2100
agcgtgatgc atgaggccct gcacaaccac tacacccaga agtccctgtc cctgagcccc 2160
ggcaag 2166
<210> 85
<211> 214
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 85
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Thr Phe Lys Leu Leu Val
35 40 45
Tyr Tyr Thr Ser Arg Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln Gln Gly Lys Thr Leu Pro Phe
85 90 95
Ser Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys
210
<210> 86
<211> 642
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 86
gatatccaga tgacacagtc tccttcctct ctgtctgcct ctgtgggcga tagggtgaca 60
atcacatgta gagcttctca ggatatctcc aattatctga attggtacca gcagaaacct 120
ggcaagacct ttaagctgct ggtgtactat acctccagac tgcagtctgg agtgccatct 180
agattttctg gctctggctc tggaaccgac tacaccttta ccatctctag cctgcagcct 240
gaagatatcg ctacctattt ttgtcagcag ggcaagactc tgccttttag cttcggccag 300
ggaaccaagg tggagatcaa gagaactgtg gctgcccctt ctgtgtttat cttcccacct 360
tccgatgaac agctgaagtc tggaaccgcc tctgtggtgt gtctgctgaa taacttctac 420
cctagagagg ctaaggtgca gtggaaggtg gataacgctc tgcagtctgg aaattctcag 480
gagtctgtga cagaacagga ttctaaggat tctacttatt ctctgtccag caccctgacc 540
ctgtctaagg ctgattatga gaaacataag gtgtatgctt gtgaggtgac ccaccaggga 600
ctgtctagcc ctgtgaccaa gtctttcaat agaggcgagt gt 642
<210> 87
<211> 720
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 87
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Gly Gly Gly Gly
210 215 220
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val
225 230 235 240
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser
245 250 255
Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn Trp Val
260 265 270
Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile Arg Ser
275 280 285
Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys Asp Arg
290 295 300
Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu Gln Met
305 310 315 320
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val Arg His
325 330 335
Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp Gly Gln
340 345 350
Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly
355 360 365
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala Val Val
370 375 380
Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val Thr Leu
385 390 395 400
Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn
405 410 415
Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile Gly Gly
420 425 430
Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser Leu
435 440 445
Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro Glu Asp
450 455 460
Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp Val Phe
465 470 475 480
Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly Gly Ser Asp Lys Thr
485 490 495
His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
500 505 510
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
515 520 525
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
530 535 540
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
545 550 555 560
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
565 570 575
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
580 585 590
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
595 600 605
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Cys Thr Leu
610 615 620
Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Trp Cys
625 630 635 640
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
645 650 655
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
660 665 670
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
675 680 685
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
690 695 700
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715 720
<210> 88
<211> 2160
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 88
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
ggaggcggcg gcagcggagg cggcggctcc ggcggcggcg gctctgaggt gcagctggtg 720
gagtccggcg gcggcctggt gcagcccggc ggctccctga ggctgagctg cgccgccagc 780
ggcttcacct tcaacaccta cgccatgaac tgggtgagac aggctcccgg caagggcctg 840
gagtgggtgg cccggatcag atctaaatat aacaattatg ctacatatta tgctgattct 900
gtgaaggata ggtttacaat tagcagagat gattctaaga atacactgta tctgcagatg 960
aacagtctgc gtgctgagga tactgcagtg tattattgtg tgagacatgg aaatttcggt 1020
aattcttatg tgagctggtt tgcttactgg ggccagggaa ctctggtgac agtgtcctct 1080
ggcggcggag gctctggcgg agggggcagt ggcggcggtg gctctggagg cggcggctct 1140
caggctgtgg tgacacagga accttctctg acagtgtctc caggaggaac agtgactctg 1200
acatgtagaa gttctactgg agctgtgaca acctctaatt atgctaactg ggtgcagcag 1260
aaacctggcc aggctcctag aggtttgatc ggaggtacaa ataagagagc acctggagtg 1320
cctgctagat tttctggctc tctgctgggc ggaaaagctg ctctgacact gtctggagct 1380
cagcctgagg atgaagctga gtattattgt gctctgtggt actctaatct gtgggtgttc 1440
ggacagggca caaaggtgga aattaaggga ggtggatcag ataagaccca cacctgcccc 1500
ccctgccccg cccccgagct tctgggcggc ccatccgtgt tcctgttccc ccccaagcct 1560
aaggacaccc tgatgatcag ccgcacccct gaggtgacct gcgtggtggt ggatgtgagc 1620
cacgaggacc ctgaggtgaa gttcaactgg tacgtggacg gcgtggaggt ccataacgcc 1680
aagaccaagc ccagagagga gcagtataac agcacctaca gggtggtgtc cgtgctgacc 1740
gtgctgcacc aggactggct gaacggcaag gaatacaagt gcaaagtgtc caacaaggct 1800
ctgccagccc ccatcgaaaa gacaatctct aaggccaagg gccagcccag ggagccccaa 1860
gtgtgcaccc tgcctccctc cagagaggag atgaccaaga accaggtgtc cctgtggtgc 1920
ctggtgaagg gcttctaccc tagcgacatc gccgtggagt gggagagcaa cggccagcct 1980
gagaacaact ataagaccac ccctcccgtg ctggatagtg acggatcttt ctttctgtat 2040
agtaagctga ccgtggacaa gtctagatgg cagcagggaa atgtgttttc ttgttctgtg 2100
atgcatgaag ccctgcataa tcactacacc cagaagtctc tgagcctgtc cccaggaaag 2160
<210> 89
<211> 447
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 89
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His
210 215 220
Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val
225 230 235 240
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
245 250 255
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
260 265 270
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
275 280 285
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
290 295 300
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
305 310 315 320
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
325 330 335
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
340 345 350
Pro Cys Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Ser Cys Ala
355 360 365
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
370 375 380
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
385 390 395 400
Asp Gly Ser Phe Phe Leu Val Ser Lys Leu Thr Val Asp Lys Ser Arg
405 410 415
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
420 425 430
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
<210> 90
<211> 1341
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 90
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
gataagaccc acacctgccc cccctgcccc gcccccgagc ttctgggcgg cccatccgtg 720
ttcctgttcc cccccaagcc taaggacacc ctgatgatca gccgcacccc tgaggtgacc 780
tgcgtggtgg tggatgtgag ccacgaggac cctgaggtga agttcaactg gtacgtggac 840
ggcgtggagg tccataacgc caagaccaag cccagagagg agcagtataa cagcacctac 900
agggtggtgt ccgtgctgac cgtgctgcac caggactggc tgaacggcaa ggaatacaag 960
tgcaaagtgt ccaacaaggc tctgccagcc cccatcgaaa agacaatctc taaggccaag 1020
ggccagccca gggagcccca agtgtacacc ctgcctccct gcagagagga gatgaccaag 1080
aaccaggtgt ccctgtcctg cgctgtgaag ggcttctacc ctagcgacat cgccgtggag 1140
tgggagagca acggccagcc tgagaacaac tataagacca cccctcccgt gctggatagt 1200
gacggatctt tctttctggt tagtaagctg accgtggaca agtctagatg gcagcaggga 1260
aatgtgtttt cttgttctgt gatgcatgaa gccctgcata atcactacac ccagaagtct 1320
ctgagcctgt ccccaggaaa g 1341
<210> 91
<211> 720
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 91
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Gly Gly Gly Gly
210 215 220
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val
225 230 235 240
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser
245 250 255
Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr Ala Met Asn Trp Val
260 265 270
Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile Arg Ser
275 280 285
Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser Val Lys Asp Arg
290 295 300
Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr Leu Gln Met
305 310 315 320
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Val Arg His
325 330 335
Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr Trp Gly Gln
340 345 350
Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly
355 360 365
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gln Ala Val Val
370 375 380
Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly Thr Val Thr Leu
385 390 395 400
Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn
405 410 415
Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly Leu Ile Gly Gly
420 425 430
Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser Leu
435 440 445
Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala Gln Pro Glu Asp
450 455 460
Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn Leu Trp Val Phe
465 470 475 480
Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly Gly Ser Asp Lys Thr
485 490 495
His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser
500 505 510
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
515 520 525
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
530 535 540
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
545 550 555 560
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
565 570 575
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
580 585 590
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr
595 600 605
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
610 615 620
Pro Pro Cys Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Ser Cys
625 630 635 640
Ala Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
645 650 655
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
660 665 670
Ser Asp Gly Ser Phe Phe Leu Val Ser Lys Leu Thr Val Asp Lys Ser
675 680 685
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
690 695 700
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715 720
<210> 92
<211> 2160
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 92
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
ggaggcggcg gcagcggagg cggcggctcc ggcggcggcg gctctgaggt gcagctggtg 720
gagtccggcg gcggcctggt gcagcccggc ggctccctga ggctgagctg cgccgccagc 780
ggcttcacct tcaacaccta cgccatgaac tgggtgagac aggctcccgg caagggcctg 840
gagtgggtgg cccggatcag atctaaatat aacaattatg ctacatatta tgctgattct 900
gtgaaggata ggtttacaat tagcagagat gattctaaga atacactgta tctgcagatg 960
aacagtctgc gtgctgagga tactgcagtg tattattgtg tgagacatgg aaatttcggt 1020
aattcttatg tgagctggtt tgcttactgg ggccagggaa ctctggtgac agtgtcctct 1080
ggcggcggag gctctggcgg agggggcagt ggcggcggtg gctctggagg cggcggctct 1140
caggctgtgg tgacacagga accttctctg acagtgtctc caggaggaac agtgactctg 1200
acatgtagaa gttctactgg agctgtgaca acctctaatt atgctaactg ggtgcagcag 1260
aaacctggcc aggctcctag aggtttgatc ggaggtacaa ataagagagc acctggagtg 1320
cctgctagat tttctggctc tctgctgggc ggaaaagctg ctctgacact gtctggagct 1380
cagcctgagg atgaagctga gtattattgt gctctgtggt actctaatct gtgggtgttc 1440
ggacagggca caaaggtgga aattaaggga ggtggatcag ataagaccca cacctgcccc 1500
ccctgccccg cccccgagct tctgggcggc ccatccgtgt tcctgttccc ccccaagcct 1560
aaggacaccc tgatgatcag ccgcacccct gaggtgacct gcgtggtggt ggatgtgagc 1620
cacgaggacc ctgaggtgaa gttcaactgg tacgtggacg gcgtggaggt ccataacgcc 1680
aagaccaagc ccagagagga gcagtataac agcacctaca gggtggtgtc cgtgctgacc 1740
gtgctgcacc aggactggct gaacggcaag gaatacaagt gcaaagtgtc caacaaggct 1800
ctgccagccc ccatcgaaaa gacaatctct aaggccaagg gccagcccag ggagccccaa 1860
gtgtacaccc tgcctccctg cagagaggag atgaccaaga accaggtgtc cctgtcctgc 1920
gctgtgaagg gcttctaccc tagcgacatc gccgtggagt gggagagcaa cggccagcct 1980
gagaacaact ataagaccac ccctcccgtg ctggatagtg acggatcttt ctttctggtt 2040
agtaagctga ccgtggacaa gtctagatgg cagcagggaa atgtgttttc ttgttctgtg 2100
atgcatgaag ccctgcataa tcactacacc cagaagtctc tgagcctgtc cccaggaaag 2160
<210> 93
<211> 447
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 93
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr
20 25 30
Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val
35 40 45
Ala Thr Ile Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Cys Tyr Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu
100 105 110
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
115 120 125
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
130 135 140
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
145 150 155 160
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
165 170 175
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
180 185 190
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
195 200 205
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His
210 215 220
Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val
225 230 235 240
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
245 250 255
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
260 265 270
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
275 280 285
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
290 295 300
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
305 310 315 320
Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile
325 330 335
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Cys Thr Leu Pro
340 345 350
Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Trp Cys Leu
355 360 365
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
370 375 380
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
385 390 395 400
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
405 410 415
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
420 425 430
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
<210> 94
<211> 1341
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 94
gaggtgcagc tggtggaatc cggcggaggc ctggtgaaac caggcggcag cctgagactg 60
tcctgtgctg ctagcggttt ttcctttaga acttacgcta tgagctgggt gagacaggcc 120
ccaggaaagt ctctcgaatg ggtggccaca attagtagcg gcagtagcta catctactac 180
cctgactccg tgaagggccg gtttaccgtg agccgcgata acgccaagaa ctccctgtac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcac ctgctaccga 300
atggagacct tcgagtactg gggccagggc accctggtga ccgtgagcag cgccagcacc 360
aagggcccca gcgtgttccc cctggccccc tcctccaagt ccacctccgg cggcaccgct 420
gccctgggct gcctggtgaa ggactacttc cctgagcctg tgaccgtgag ctggaacagc 480
ggcgccctga cctccggcgt gcacaccttc cccgccgtgc tgcagtccag cggcctgtac 540
agcctgagct ctgtggtgac cgtgccaagc agcagcctgg gcacccagac ctacatctgt 600
aacgtgaacc acaagcccag caacaccaag gtggataaga aggtggagcc taagtcctgc 660
gataagaccc acacctgccc cccctgcccc gcccccgagc ttctgggcgg cccatccgtg 720
ttcctgttcc cccccaagcc taaggacacc ctgatgatca gccgcacccc tgaggtgacc 780
tgcgtggtgg tggatgtgag ccacgaggac cctgaggtga agttcaactg gtacgtggac 840
ggcgtggagg tccataacgc caagaccaag cccagagagg agcagtataa cagcacctac 900
agggtggtgt ccgtgctgac cgtgctgcac caggactggc tgaacggcaa ggaatacaag 960
tgcaaagtgt ccaacaaggc tctgccagcc cccatcgaaa agacaatctc taaggccaag 1020
ggccagccca gggagcccca agtgtgcacc ctgcctccct ccagagagga gatgaccaag 1080
aaccaggtgt ccctgtggtg cctggtgaag ggcttctacc ctagcgacat cgccgtggag 1140
tgggagagca acggccagcc tgagaacaac tataagacca cccctcccgt gctggatagt 1200
gacggatctt tctttctgta tagtaagctg accgtggaca agtctagatg gcagcaggga 1260
aatgtgtttt cttgttctgt gatgcatgaa gccctgcata atcactacac ccagaagtct 1320
ctgagcctgt ccccaggaaa g 1341
<210> 95
<211> 455
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 95
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr
115 120 125
Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser
130 135 140
Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
145 150 155 160
Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His
165 170 175
Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser
180 185 190
Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys
195 200 205
Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu
210 215 220
Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro
225 230 235 240
Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
245 250 255
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val
260 265 270
Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
275 280 285
Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
290 295 300
Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp
305 310 315 320
Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu
325 330 335
Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg
340 345 350
Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys
355 360 365
Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp
370 375 380
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys
385 390 395 400
Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser
405 410 415
Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser
420 425 430
Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser
435 440 445
Leu Ser Leu Ser Pro Gly Lys
450 455
<210> 96
<211> 1365
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 96
gaagtgcagc tggtggagtc tggcggaggc ctggtgcagc ctggcggatc tctgagactg 60
tcctgtgccg cctctggatt cacctttaat acatatgcta tgaattgggt gaggcaggct 120
ccaggcaagg gactggagtg ggtggctaga attagatcta agtataacaa ttacgctacc 180
tactatgccg attccgtgaa ggatagattc accatctcta gagatgattc taaaaataca 240
ctgtatctgc agatgaactc tctgagagct gaggatacag ctgtgtatta ctgtgtgaga 300
cacggaaatt ttggcaactc ttacgtgtct tggttcgctt attggggcca gggcaccctg 360
gtgacagtgt cttctgcgag caccaaggga ccttccgtgt ttcccctcgc ccccagctcc 420
aaaagcacca gcggcggaac agctgctctc ggctgtctcg tcaaggatta cttccccgag 480
cccgtgaccg tgagctggaa cagcggagcc ctgacaagcg gcgtccacac cttccctgct 540
gtcctacagt cctccggact gtacagcctg agcagcgtgg tgacagtccc tagcagctcc 600
ctgggcaccc agacatatat ttgcaacgtg aatcacaagc ccagcaacac caaggtcgat 660
aagaaggtgg agcctaagtc ctgcgacaag acccacacat gtcccccctg tcccgctcct 720
gaactgctgg gaggcccttc cgtgttcctg ttccccccta agcccaagga caccctgatg 780
atttccagga cacccgaggt gacctgtgtg gtggtggacg tcagccacga ggaccccgag 840
gtgaaattca actggtacgt cgatggcgtg gaggtgcaca acgctaagac caagcccagg 900
gaggagcagt acaattccac ctacagggtg gtgtccgtgc tgaccgtcct ccatcaggac 960
tggctgaacg gcaaagagta taagtgcaag gtgagcaaca aggccctccc tgctcccatc 1020
gagaagacca tcagcaaagc caagggccag cccagggaac ctcaagtcta taccctgcct 1080
cccagcaggg aggagatgac caagaaccaa gtgagcctca catgcctcgt caagggcttc 1140
tatccttccg atattgccgt cgagtgggag tccaacggac agcccgagaa caactacaag 1200
acaacacccc ccgtgctcga ttccgatggc agcttcttcc tgtactccaa gctgaccgtg 1260
gacaagtcca gatggcaaca aggcaacgtc ttcagttgca gcgtcatgca tgaggccctc 1320
cacaaccact acacccagaa gagcctctcc ctgagccctg gaaag 1365
<210> 97
<211> 216
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 97
Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly
1 5 10 15
Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser
20 25 30
Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly
35 40 45
Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe
50 55 60
Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala
65 70 75 80
Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn
85 90 95
Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
100 105 110
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
115 120 125
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
130 135 140
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
145 150 155 160
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
165 170 175
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
180 185 190
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
195 200 205
Lys Ser Phe Asn Arg Gly Glu Cys
210 215
<210> 98
<211> 648
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 98
caggccgtgg tgacccagga gcccagcctg accgtgagcc ccggcggcac cgtgaccctg 60
acctgcaggt cctccaccgg cgccgtgacc acctccaact acgccaactg ggtgcagcag 120
aagcccggcc aggccccaag gggcctgatc ggcggcacca ataagagggc ccccggcgtg 180
cccgctagat tctctggctc tcttctggga ggaaaggctg ctctgacact gtctggagct 240
cagcccgagg atgaggctga atactattgt gctctgtggt attctaatct gtgggtgttt 300
ggccagggaa ctaaagtaga aattaagagg accgtggccg cccccagcgt gttcatcttc 360
cccccctccg atgagcagct gaagagcggc acagccagcg tggtgtgcct gctgaacaac 420
ttctacccca gggaggccaa ggtgcagtgg aaggtggata acgccctgca gagcggcaac 480
agccaggaga gcgtgaccga gcaggatagc aaggacagca catactccct gagctccacc 540
ctgaccctga gcaaggccga ctacgagaag cacaaggtgt acgcctgcga ggtgacccac 600
caggggctga gcagccccgt gacaaagagc ttcaaccggg gggagtgc 648
<210> 99
<211> 254
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 99
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
130 135 140
Ser Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly
145 150 155 160
Gly Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr
165 170 175
Ser Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg
180 185 190
Gly Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg
195 200 205
Phe Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly
210 215 220
Ala Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser
225 230 235 240
Asn Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
245 250
<210> 100
<211> 762
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 100
gaggtgcagc tggtggagtc cggcggcggc ctggtgcagc ccggcggctc cctgaggctg 60
agctgcgccg ccagcggctt caccttcaac acctacgcca tgaactgggt gagacaggct 120
cccggcaagg gcctggagtg ggtggcccgg atcagatcta aatataacaa ttatgctaca 180
tattatgctg attctgtgaa ggataggttt acaattagca gagatgattc taagaataca 240
ctgtatctgc agatgaacag tctgcgtgct gaggatactg cagtgtatta ttgtgtgaga 300
catggaaatt tcggtaattc ttatgtgagc tggtttgctt actggggcca gggaactctg 360
gtgacagtgt cctctggcgg cggaggctct ggcggagggg gcagtggcgg cggtggctct 420
ggaggcggcg gctctcaggc tgtggtgaca caggaacctt ctctgacagt gtctccagga 480
ggaacagtga ctctgacatg tagaagttct actggagctg tgacaacctc taattatgct 540
aactgggtgc agcagaaacc tggccaggct cctagaggtt tgatcggagg tacaaataag 600
agagcacctg gagtgcctgc tagattttct ggctctctgc tgggcggaaa agctgctctg 660
acactgtctg gagctcagcc tgaggatgaa gctgagtatt attgtgctct gtggtactct 720
aatctgtggg tgttcggaca gggcacaaag gtggaaatta ag 762
<210> 101
<211> 5
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 101
Thr Tyr Ala Met Asn
1 5
<210> 102
<211> 19
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 102
Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp Ser
1 5 10 15
Val Lys Asp
<210> 103
<211> 14
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 103
His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe Ala Tyr
1 5 10
<210> 104
<211> 14
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 104
Arg Ser Ser Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Asn
1 5 10
<210> 105
<211> 7
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 105
Gly Thr Asn Lys Arg Ala Pro
1 5
<210> 106
<211> 9
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 106
Ala Leu Trp Tyr Ser Asn Leu Trp Val
1 5
<210> 107
<211> 125
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 107
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
115 120 125
<210> 108
<211> 109
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 108
Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly Gly
1 5 10 15
Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr Ser
20 25 30
Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg Gly
35 40 45
Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg Phe
50 55 60
Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly Ala
65 70 75 80
Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser Asn
85 90 95
Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 109
<211> 330
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 109
Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys
1 5 10 15
Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr
20 25 30
Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser
35 40 45
Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser
50 55 60
Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
65 70 75 80
Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys
85 90 95
Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys
100 105 110
Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
115 120 125
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
130 135 140
Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp
145 150 155 160
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
165 170 175
Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu
180 185 190
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
195 200 205
Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
210 215 220
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu
225 230 235 240
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
245 250 255
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
260 265 270
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
275 280 285
Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn
290 295 300
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
305 310 315 320
Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
325 330
<210> 110
<211> 107
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 110
Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu
1 5 10 15
Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe
20 25 30
Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln
35 40 45
Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser
50 55 60
Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
65 70 75 80
Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser
85 90 95
Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
100 105
<210> 111
<211> 214
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 111
Met Gln Ser Gly Thr His Trp Arg Val Leu Gly Leu Cys Leu Leu Ser
1 5 10 15
Val Gly Val Trp Gly Gln Asp Gly Asn Glu Glu Met Gly Gly Ile Thr
20 25 30
Gln Thr Pro Tyr Lys Val Ser Ile Ser Gly Thr Thr Val Ile Leu Thr
35 40 45
Cys Pro Gln Tyr Pro Gly Ser Glu Ile Leu Trp Gln His Asn Asp Lys
50 55 60
Asn Ile Gly Gly Asp Glu Asp Asp Lys Asn Ile Gly Ser Asp Glu Asp
65 70 75 80
His Leu Ser Leu Lys Glu Phe Ser Glu Leu Glu Gln Ser Gly Tyr Tyr
85 90 95
Val Cys Tyr Pro Arg Gly Ser Lys Pro Glu Asp Ala Asn Phe Tyr Leu
100 105 110
Tyr Leu Arg Ala Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
115 120 125
Gly Gly Gly Ser Pro Ile Glu Glu Leu Glu Asp Arg Val Phe Val Asn
130 135 140
Cys Asn Thr Ser Ile Thr Trp Val Glu Gly Thr Val Gly Thr Leu Leu
145 150 155 160
Ser Asp Ile Thr Arg Leu Asp Leu Gly Lys Arg Ile Leu Asp Pro Arg
165 170 175
Gly Ile Tyr Arg Cys Asn Gly Thr Asp Ile Tyr Lys Asp Lys Glu Ser
180 185 190
Thr Val Gln Val His Tyr Arg Met Cys Gln Ser Cys Val Glu Leu Asp
195 200 205
His His His His His His
210
<210> 112
<211> 132
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 112
Met Gln Ser Gly Thr His Trp Arg Val Leu Gly Leu Cys Leu Leu Ser
1 5 10 15
Val Gly Val Trp Gly Gln Asp Gly Asn Glu Glu Met Gly Gly Ile Thr
20 25 30
Gln Thr Pro Tyr Lys Val Ser Ile Ser Gly Thr Thr Val Ile Leu Thr
35 40 45
Cys Pro Gln Tyr Pro Gly Ser Glu Ile Leu Trp Gln His Asn Asp Lys
50 55 60
Asn Ile Gly Gly Asp Glu Asp Asp Lys Asn Ile Gly Ser Asp Glu Asp
65 70 75 80
His Leu Ser Leu Lys Glu Phe Ser Glu Leu Glu Gln Ser Gly Tyr Tyr
85 90 95
Val Cys Tyr Pro Arg Gly Ser Lys Pro Glu Asp Ala Asn Phe Tyr Leu
100 105 110
Tyr Leu Arg Ala Arg Val Cys Glu Asn Cys Met Glu Met Asp His His
115 120 125
His His His His
130
<210> 113
<211> 716
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 113
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
130 135 140
Ser Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly
145 150 155 160
Gly Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr
165 170 175
Ser Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg
180 185 190
Gly Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg
195 200 205
Phe Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly
210 215 220
Ala Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser
225 230 235 240
Asn Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly
245 250 255
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln
260 265 270
Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg
275 280 285
Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr Ala Met Ser
290 295 300
Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val Ala Thr Ile
305 310 315 320
Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val Lys Gly Arg
325 330 335
Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr Leu Gln Met
340 345 350
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Thr Cys Tyr
355 360 365
Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu Val Thr Val
370 375 380
Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser
385 390 395 400
Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys
405 410 415
Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu
420 425 430
Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu
435 440 445
Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr
450 455 460
Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val
465 470 475 480
Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
485 490 495
Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe
500 505 510
Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val
515 520 525
Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe
530 535 540
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro
545 550 555 560
Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr
565 570 575
Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val
580 585 590
Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala
595 600 605
Lys Gly Gln Pro Arg Glu Pro Gln Val Cys Thr Leu Pro Pro Ser Arg
610 615 620
Glu Glu Met Thr Lys Asn Gln Val Ser Leu Trp Cys Leu Val Lys Gly
625 630 635 640
Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
645 650 655
Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser
660 665 670
Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln
675 680 685
Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His
690 695 700
Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715
<210> 114
<211> 2148
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 114
gaggtgcagc tggtggagtc cggcggcggc ctggtgcagc ccggcggctc cctgaggctg 60
agctgcgccg ccagcggctt caccttcaac acctacgcca tgaactgggt gagacaggct 120
cccggcaagg gcctggagtg ggtggcccgg atcagatcta aatataacaa ttatgctaca 180
tattatgctg attctgtgaa ggataggttt acaattagca gagatgattc taagaataca 240
ctgtatctgc agatgaacag tctgcgtgct gaggatactg cagtgtatta ttgtgtgaga 300
catggaaatt tcggtaattc ttatgtgagc tggtttgctt actggggcca gggaactctg 360
gtgacagtgt cctctggcgg cggaggctct ggcggagggg gcagtggcgg cggtggctct 420
ggaggcggcg gctctcaggc tgtggtgaca caggaacctt ctctgacagt gtctccagga 480
ggaacagtga ctctgacatg tagaagttct actggagctg tgacaacctc taattatgct 540
aactgggtgc agcagaaacc tggccaggct cctagaggtt tgatcggagg tacaaataag 600
agagcacctg gagtgcctgc tagattttct ggctctctgc tgggcggaaa agctgctctg 660
acactgtctg gagctcagcc tgaggatgaa gctgagtatt attgtgctct gtggtactct 720
aatctgtggg tgttcggaca gggcacaaag gtggaaatta agggcggagg cggctctggc 780
ggcggcggaa gcggcggcgg cggctccgag gtgcagctgg tggaatccgg cggaggcctg 840
gtgaaaccag gcggcagcct gagactgtcc tgtgctgcta gcggtttttc ctttagaact 900
tacgctatga gctgggtgag acaggcccca ggaaagtctc tcgaatgggt ggccacaatt 960
agtagcggca gtagctacat ctactaccct gactccgtga agggccggtt taccgtgagc 1020
cgcgataacg ccaagaactc cctgtacctg cagatgaaca gcctgcgcgc cgaggacacc 1080
gccgtgtact actgcacctg ctaccgaatg gagaccttcg agtactgggg ccagggcacc 1140
ctggtgaccg tgagcagcgc cagcaccaag ggccccagcg tgttccccct ggccccctcc 1200
tccaagtcca cctccggcgg caccgctgcc ctgggctgcc tggtgaagga ctacttccct 1260
gagcctgtga ccgtgagctg gaacagcggc gccctgacct ccggcgtgca caccttcccc 1320
gccgtgctgc agtccagcgg cctgtacagc ctgagctctg tggtgaccgt gccaagcagc 1380
agcctgggca cccagaccta catctgtaac gtgaaccaca agcccagcaa caccaaggtg 1440
gataagaagg tggagcctaa gtcctgcgat aagacccaca cctgcccccc ctgccccgcc 1500
cccgagcttc tgggcggccc atccgtgttc ctgttccccc ccaagcctaa ggacaccctg 1560
atgatcagcc gcacccctga ggtgacctgc gtggtggtgg atgtgagcca cgaggaccct 1620
gaggtgaagt tcaactggta cgtggacggc gtggaggtcc ataacgccaa gaccaagccc 1680
agagaggagc agtataacag cacctacagg gtggtgtccg tgctgaccgt gctgcaccag 1740
gactggctga acggcaagga atacaagtgc aaagtgtcca acaaggctct gccagccccc 1800
atcgaaaaga caatctctaa ggccaagggc cagcccaggg agccccaagt gtgcaccctg 1860
cctccctcca gagaggagat gaccaagaac caggtgtccc tgtggtgcct ggtgaagggc 1920
ttctacccta gcgacatcgc cgtggagtgg gagagcaacg gccagcctga gaacaactat 1980
aagaccaccc ctcccgtgct ggatagtgac ggatctttct ttctgtatag taagctgacc 2040
gtggacaagt ctagatggca gcagggaaat gtgttttctt gttctgtgat gcatgaagcc 2100
ctgcataatc actacaccca gaagtctctg agcctgtccc caggaaag 2148
<210> 115
<211> 716
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 115
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asn Thr Tyr
20 25 30
Ala Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Arg Ser Lys Tyr Asn Asn Tyr Ala Thr Tyr Tyr Ala Asp
50 55 60
Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Val Arg His Gly Asn Phe Gly Asn Ser Tyr Val Ser Trp Phe
100 105 110
Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
130 135 140
Ser Gln Ala Val Val Thr Gln Glu Pro Ser Leu Thr Val Ser Pro Gly
145 150 155 160
Gly Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val Thr Thr
165 170 175
Ser Asn Tyr Ala Asn Trp Val Gln Gln Lys Pro Gly Gln Ala Pro Arg
180 185 190
Gly Leu Ile Gly Gly Thr Asn Lys Arg Ala Pro Gly Val Pro Ala Arg
195 200 205
Phe Ser Gly Ser Leu Leu Gly Gly Lys Ala Ala Leu Thr Leu Ser Gly
210 215 220
Ala Gln Pro Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Leu Trp Tyr Ser
225 230 235 240
Asn Leu Trp Val Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly Gly
245 250 255
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln
260 265 270
Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg
275 280 285
Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Arg Thr Tyr Ala Met Ser
290 295 300
Trp Val Arg Gln Ala Pro Gly Lys Ser Leu Glu Trp Val Ala Thr Ile
305 310 315 320
Ser Ser Gly Ser Ser Tyr Ile Tyr Tyr Pro Asp Ser Val Lys Gly Arg
325 330 335
Phe Thr Val Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr Leu Gln Met
340 345 350
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Thr Cys Tyr
355 360 365
Arg Met Glu Thr Phe Glu Tyr Trp Gly Gln Gly Thr Leu Val Thr Val
370 375 380
Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser
385 390 395 400
Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys
405 410 415
Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu
420 425 430
Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu
435 440 445
Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr
450 455 460
Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val
465 470 475 480
Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
485 490 495
Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe
500 505 510
Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val
515 520 525
Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe
530 535 540
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro
545 550 555 560
Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr
565 570 575
Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val
580 585 590
Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala
595 600 605
Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Cys Arg
610 615 620
Glu Glu Met Thr Lys Asn Gln Val Ser Leu Ser Cys Ala Val Lys Gly
625 630 635 640
Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
645 650 655
Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser
660 665 670
Phe Phe Leu Val Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln
675 680 685
Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His
690 695 700
Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
705 710 715
<210> 116
<211> 2148
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 116
gaggtgcagc tggtggagtc cggcggcggc ctggtgcagc ccggcggctc cctgaggctg 60
agctgcgccg ccagcggctt caccttcaac acctacgcca tgaactgggt gagacaggct 120
cccggcaagg gcctggagtg ggtggcccgg atcagatcta aatataacaa ttatgctaca 180
tattatgctg attctgtgaa ggataggttt acaattagca gagatgattc taagaataca 240
ctgtatctgc agatgaacag tctgcgtgct gaggatactg cagtgtatta ttgtgtgaga 300
catggaaatt tcggtaattc ttatgtgagc tggtttgctt actggggcca gggaactctg 360
gtgacagtgt cctctggcgg cggaggctct ggcggagggg gcagtggcgg cggtggctct 420
ggaggcggcg gctctcaggc tgtggtgaca caggaacctt ctctgacagt gtctccagga 480
ggaacagtga ctctgacatg tagaagttct actggagctg tgacaacctc taattatgct 540
aactgggtgc agcagaaacc tggccaggct cctagaggtt tgatcggagg tacaaataag 600
agagcacctg gagtgcctgc tagattttct ggctctctgc tgggcggaaa agctgctctg 660
acactgtctg gagctcagcc tgaggatgaa gctgagtatt attgtgctct gtggtactct 720
aatctgtggg tgttcggaca gggcacaaag gtggaaatta agggcggagg cggctctggc 780
ggcggcggaa gcggcggcgg cggctccgag gtgcagctgg tggaatccgg cggaggcctg 840
gtgaaaccag gcggcagcct gagactgtcc tgtgctgcta gcggtttttc ctttagaact 900
tacgctatga gctgggtgag acaggcccca ggaaagtctc tcgaatgggt ggccacaatt 960
agtagcggca gtagctacat ctactaccct gactccgtga agggccggtt taccgtgagc 1020
cgcgataacg ccaagaactc cctgtacctg cagatgaaca gcctgcgcgc cgaggacacc 1080
gccgtgtact actgcacctg ctaccgaatg gagaccttcg agtactgggg ccagggcacc 1140
ctggtgaccg tgagcagcgc cagcaccaag ggccccagcg tgttccccct ggccccctcc 1200
tccaagtcca cctccggcgg caccgctgcc ctgggctgcc tggtgaagga ctacttccct 1260
gagcctgtga ccgtgagctg gaacagcggc gccctgacct ccggcgtgca caccttcccc 1320
gccgtgctgc agtccagcgg cctgtacagc ctgagctctg tggtgaccgt gccaagcagc 1380
agcctgggca cccagaccta catctgtaac gtgaaccaca agcccagcaa caccaaggtg 1440
gataagaagg tggagcctaa gtcctgcgat aagacccaca cctgcccccc ctgccccgcc 1500
cccgagcttc tgggcggccc atccgtgttc ctgttccccc ccaagcctaa ggacaccctg 1560
atgatcagcc gcacccctga ggtgacctgc gtggtggtgg atgtgagcca cgaggaccct 1620
gaggtgaagt tcaactggta cgtggacggc gtggaggtcc ataacgccaa gaccaagccc 1680
agagaggagc agtataacag cacctacagg gtggtgtccg tgctgaccgt gctgcaccag 1740
gactggctga acggcaagga atacaagtgc aaagtgtcca acaaggctct gccagccccc 1800
atcgaaaaga caatctctaa ggccaagggc cagcccaggg agccccaagt gtacaccctg 1860
cctccctgca gagaggagat gaccaagaac caggtgtccc tgtcctgcgc tgtgaagggc 1920
ttctacccta gcgacatcgc cgtggagtgg gagagcaacg gccagcctga gaacaactat 1980
aagaccaccc ctcccgtgct ggatagtgac ggatctttct ttctggttag taagctgacc 2040
gtggacaagt ctagatggca gcagggaaat gtgttttctt gttctgtgat gcatgaagcc 2100
ctgcataatc actacaccca gaagtctctg agcctgtccc caggaaag 2148
Claims (29)
1. An anti-GUCY 2C/CD3 bispecific antibody comprising a domain a that binds to target molecule a and a domain B that binds to target molecule B; the target molecule A and the target molecule B are selected from GUCY2C and CD3; the domain a and domain B are selected from an antibody or antigen-binding fragment thereof directed against GUCY2C and an antibody or antigen-binding fragment thereof directed against CD 3.
2. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, comprising: a) a first domain a, B) a second domain B, and, optionally, further comprising c) an Fc domain; the target molecule A and the target molecule B are selected from GUCY2C and CD3; the domain a and domain B are selected from an antibody or antigen-binding fragment thereof directed against GUCY2C and an antibody or antigen-binding fragment thereof directed against CD 3.
3. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein said bispecific antibody comprises a monomer or a dimer of monomers, which may be homologous or heterologous, said monomers comprising from amino-terminus to carboxy-terminus a structure selected from the group consisting of:
structure I:
structure II:
structure III:
structure IV:
structure V:
wherein,,
b1, B2, B3, B4, B5 are each independently an antigen-binding fragment that is devoid of or binds to target molecule B, and at least one is not devoid of;
l1, L2, L3, L4, L5, L6 are each independently no or a bond or a linker;
VHA represents the heavy chain variable region that binds to target molecule a; VLA represents the light chain variable region that binds to target molecule a;
CL represents the light chain constant region; CH represents a heavy chain constant region;
"-" represents disulfide or covalent bonds; "-" represents a peptide bond.
4. The anti-GUCY 2C/CD3 bispecific antibody of claim 3, wherein said bispecific antibody comprises a structure selected from the group consisting of seq id no:
a) Homodimers formed from monomers of structure I;
b) A heterodimer formed from monomers of structure I and structure II;
c) Monomers of structure III;
d) Monomers of structure IV;
e) Monomers of structure V.
5. The anti-GUCY 2C/CD3 bispecific antibody of any one of claims 1-4, wherein said antigen binding fragment is selected from scFv, fv, fd, fab, F (ab ') 2 or F (ab').
6. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein said bispecific antibody comprises 1 or 2 heavy chains, and 1 or 2 light chains, which heavy and light chains may be homologous or heterologous.
7. The anti-GUCY 2C/CD3 bispecific antibody of claim 6, wherein the heavy chain comprises from amino-terminus to carboxy-terminus a structure selected from the group consisting of:
a)VHA-CH1-CH2-CH3-L1-scFvB;
b)scFvB-L1-VHA-CH1-CH2-CH3;
c)VHA-CH1-L1-scFvB-CH2-CH3;
d)VHA-CH1-L1-scFvB;
e)VHA-CH1;
f)VHA-CH1-L1-scFvB-L2-VHA-CH1;
g)VHA-CH1-L1-scFvB-L2-VLA-CL;
h)VLA-CL-L1-scFvB-L2-VHA-CH1;
i)VHA-CH1-L1-VHA-CH1-L2-scFvB;
j)VHA-CH1-CH2-CH3;
wherein VHA refers to VH bound to target molecule a, scFvB refers to scFv bound to target molecule B, and L1, L2 are each independently a bond or a linker.
8. The anti-GUCY 2C/CD3 bispecific antibody of claim 6, wherein the light chain comprises from amino-terminus to carboxy-terminus a structure selected from the group consisting of:
k)VLA-CL;
l)scFvB-L3-VLA-CL;
m)VLA-CL-L3-scFvB;
n)VLA-CL-L3-scFvB-L4-VLA-CL;
0)VLA-CL-L3-VLA-CL-L4-scFvB;
Where VLA refers to VL that binds to target molecule A, scFvB refers to scFv that binds to target molecule B, and L3 and L4 are each independently a bond or linker.
9. The anti-GUCY 2C/CD3 bispecific antibody of any one of claims 6-8, comprising a structure selected from the group consisting of seq id no:
structure 1: comprising 2 heavy chains a) and 2 light chains k);
structure 2: comprising 2 heavy chains b) and 2 light chains k);
structure 3: comprising 1 heavy chain a), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain a) comprising a mortar structure modification and the CH3 region of heavy chain j) comprising a pestle structure modification;
structure 4: comprising 1 heavy chain a), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain a) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
and (5) a structure 5: comprising 1 heavy chain b), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain b) comprising a mortar structure modification and the CH3 region of heavy chain j) comprising a pestle structure modification;
structure 6: comprising 1 heavy chain b), 1 heavy chain j) and 2 light chains k), the CH3 region of heavy chain b) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
structure 7: comprising 2 heavy chains c) and 2 light chains k);
structure 8: comprising 1 heavy chain c), 1 heavy chain j), and 2 light chains k), the CH3 region of heavy chain c) comprising a mortar structure modification, the CH3 region of heavy chain j) comprising a pestle structure modification;
Structure 9: comprising 1 heavy chain c), 1 heavy chain j), and 2 light chains k), the CH3 region of heavy chain c) comprising a knob structure modification, the CH3 region of heavy chain j) comprising a hole structure modification;
structure 10: comprising 1 heavy chain d) and 1 light chain k);
structure 11: comprising 1 heavy chain e) and 1 light chain m);
structure 12: comprising 1 heavy chain f) and 2 light chains k);
structure 13: comprising 1 heavy chain g), 1 heavy chain e) and 1 light chain k);
structure 14: comprising 2 heavy chains e) and 1 light chain n);
structure 15: comprising 1 heavy chain h), 1 heavy chain e) and 1 light chain k);
structure 16: comprising 1 heavy chain i) and 2 light chains k);
structure 17: comprising 2 heavy chains e) and 1 light chain 0);
structure 18: comprising 2 heavy chains j) and 2 light chains l);
structure 19: comprising 2 heavy chains j) and 2 light chains m).
10. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the anti-GUCY 2C antibody or antigen-binding fragment thereof comprises heavy chain complementarity determining regions HCDR1-3 and light chain complementarity determining regions LCDR1-3, wherein:
a) The HCDR-1 amino acid sequence is shown in SEQ ID NO:3, the HCDR-2 amino acid sequence is shown as SEQ ID NO:4, the HCDR-3 amino acid sequence is shown as SEQ ID NO:5 is shown in the figure; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:8, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:9, the LCDR-3 amino acid sequence is shown as SEQ ID NO:10 is shown in the figure; or alternatively, the first and second heat exchangers may be,
b) The HCDR-1 amino acid sequence is shown in SEQ ID NO:13, the HCDR-2 amino acid sequence is shown in SEQ ID NO:14, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 15; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:18, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:19, the amino acid sequence of the LCDR-3 is shown as SEQ ID NO: shown at 20; or alternatively, the first and second heat exchangers may be,
c) The HCDR-1 amino acid sequence is shown in SEQ ID NO:23, the HCDR-2 amino acid sequence is shown in SEQ ID NO:24, the HCDR-3 amino acid sequence is shown in SEQ ID NO: shown at 25; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:28, the amino acid sequence of the LCDR-2 is shown as SEQ ID NO:29, the LCDR-3 amino acid sequence is shown in SEQ ID NO: shown at 30; or alternatively, the first and second heat exchangers may be,
d) The HCDR-1 amino acid sequence is shown in SEQ ID NO:33, the HCDR-2 amino acid sequence is shown in SEQ ID NO:34, the HCDR-3 amino acid sequence is shown in SEQ ID NO: indicated at 35; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:38, the LCDR-2 amino acid sequence is shown in SEQ ID NO:39, wherein the amino acid sequence of the LCDR-3 is shown in SEQ ID NO: shown at 40; or alternatively, the first and second heat exchangers may be,
e) The HCDR-1 amino acid sequence is shown in SEQ ID NO:43, the HCDR-2 amino acid sequence is shown in SEQ ID NO:44, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 45; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:48, the LCDR-2 amino acid sequence is shown in SEQ ID NO:49, said LCDR-3 amino acid sequence is as set forth in SEQ ID NO: shown at 50.
11. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the anti-GUCY 2C antibody or antigen-binding fragment thereof comprises a heavy chain variable region VH or variant thereof and a light chain variable region VL or variant thereof, wherein:
a) The amino acid sequence of VH is shown in SEQ ID NO:51, the amino acid sequence of VL is shown in SEQ ID NO: indicated at 55; or alternatively, the first and second heat exchangers may be,
b) The amino acid sequence of VH is shown in SEQ ID NO:51, the amino acid sequence of VL is shown in SEQ ID NO: 57; or alternatively, the first and second heat exchangers may be,
c) The amino acid sequence of VH is shown in SEQ ID NO:53, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 55; or alternatively, the first and second heat exchangers may be,
d) The amino acid sequence of VH is shown in SEQ ID NO:53, the amino acid sequence of the VL is shown in SEQ ID NO: 57; or alternatively, the first and second heat exchangers may be,
e) The amino acid sequence of VH is shown in SEQ ID NO:59, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 63; or alternatively, the first and second heat exchangers may be,
f) The amino acid sequence of VH is shown in SEQ ID NO:61, the amino acid sequence of the VL is shown in SEQ ID NO: indicated at 63; or alternatively, the first and second heat exchangers may be,
g) The amino acid sequence of VH is shown in SEQ ID NO:61, the amino acid sequence of the VL is shown in SEQ ID NO:65, or, alternatively,
h) The amino acid sequence of VH is shown in SEQ ID NO:59, the amino acid sequence of the VL is shown in SEQ ID NO: shown at 65.
12. The anti-GUCY 2C/CD3 bispecific antibody of claim 11, wherein said VH region variant is identical to SEQ ID NO: 51. SEQ ID NO: 53. SEQ ID NO: 59. SEQ ID NO:61 having at least 90%, 95%, 98%, or 99% amino acid sequence homology; the VL region variant refers to a sequence that hybridizes with SEQ ID NO: 55. SEQ ID NO: 57. SEQ ID NO: 63. SEQ ID NO:65, variants having at least 90%, 95%, 98%, or 99% amino acid sequence homology.
13. The anti-GUCY 2C/CD3 bispecific antibody of claim 11, wherein the VH region or VL region comprises 1-10 amino acid mutations; preferably, the mutation is a substitution mutation.
14. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the antigen-binding fragment of the anti-CD 3 antibody is monovalent or bivalent; preferably, the antigen binding fragment of the anti-CD 3 antibody is monovalent.
15. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the antigen-binding fragment of the anti-CD 3 antibody is a scFv comprising a VH-L1-VL structure or a VL-L1-VH structure from amino terminus to carboxy terminus, and wherein L1 is a bond or a linker.
16. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the scFv comprises a heavy chain complementarity determining region HCDR1-3 and a light chain complementarity determining region LCDR1-3, wherein the HCDR-1 amino acid sequence is as set forth in SEQ ID NO:101, the HCDR-2 amino acid sequence is shown in SEQ ID NO:102, the HCDR-3 amino acid sequence is shown in SEQ ID NO: 103; the amino acid sequence of the LCDR-1 is shown as SEQ ID NO:104, the LCDR-2 amino acid sequence is shown as SEQ ID NO:105, the LCDR-3 amino acid sequence is shown as SEQ ID NO: shown at 106.
17. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the scFv comprises a heavy chain variable region VH and a light chain variable region VL, wherein the amino acid sequence of VH is set forth in SEQ ID NO:107, the amino acid sequence of the VL is shown in SEQ ID NO: shown at 108.
18. The anti-GUCY 2C/CD3 bispecific antibody of claim 3, 7, 8 or 15, wherein each of said L1, L2, L3, L4, L5, L6 is independently (G4S) n Wherein n is selected from integers from 1 to 6.
19. The anti-GUCY 2C/CD3 bispecific antibody of claim 1, wherein the target molecule a is GUCY2C and the target molecule B is CD3.
20. The anti-GUCY 2C/CD3 bispecific antibody according to claim 1, characterized in that it is selected from the following bispecific antibodies:
SP4VHL-32H2 (structure 2): comprises an amino acid sequence shown in SEQ ID NO:67, and the amino acid sequence of which is set forth in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2-SP4VHL-L (Structure 19): comprises an amino acid sequence shown in SEQ ID NO:73, and the amino acid sequence of which is set forth in SEQ ID NO: 71; or alternatively, the first and second heat exchangers may be,
SP4VHL-32H2-L (Structure 18): comprises an amino acid sequence shown in SEQ ID NO:73, and the amino acid sequence of which is set forth in SEQ ID NO: 75. Or alternatively, the first and second heat exchangers may be,
32H2-SP4VHL (Structure 1): comprises an amino acid sequence shown in SEQ ID NO:77, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2Fab-SP4VHL (Structure 10): comprises an amino acid sequence shown in SEQ ID NO:79, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL (Structure 7): comprises an amino acid sequence shown in SEQ ID NO:81, and the amino acid sequence of which is shown in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
10D7CH1-SP4VHL (structure 7): comprises an amino acid sequence shown in SEQ ID NO:83, and the amino acid sequence of which is set forth in SEQ ID NO: 85; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL-KIH (Structure 9): comprises an amino acid sequence shown in SEQ ID NO:87, the amino acid sequence of which is shown in SEQ ID NO:89, and the amino acid sequence of SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
32H2CH1-SP4VHL-hik (Structure 8): comprises an amino acid sequence shown in SEQ ID NO:91, and the amino acid sequence of the first heavy chain is shown as SEQ ID NO:93, and a second heavy chain having an amino acid sequence as set forth in SEQ ID NO: 69; or alternatively, the first and second heat exchangers may be,
a derivative polypeptide which is formed by substitution, deletion or addition of one or more amino acid residues in the amino acid sequence of any one of SP4VHL-32H2, 32H2-SP4VHL-L, SP4VHL-32H2-L, 32H2Fab-SP4VHL, 32H2CH1-SP4VHL, 10D7CH1-SP4VHL, 32H2CH1-SP4VHL-KIH, and 32H2CH1-SP4 VHL-hik.
21. A polynucleotide molecule encoding the anti-GUCY 2C/CD3 bispecific antibody of any one of claims 1-20.
22. An expression vector comprising the polynucleotide molecule of claim 21.
23. A host cell comprising the expression vector of claim 22.
24. A method of preparing an anti-GUCY 2C/CD3 bispecific antibody according to any one of claims 1-20, comprising the steps of:
a) Culturing the host cell of claim 23 under expression conditions to express an anti-GUCY 2C/CD3 bispecific antibody;
b) Isolating and purifying the anti-GUCY 2C/CD3 bispecific antibody described in the step a).
25. A pharmaceutical composition comprising an effective amount of the anti-GUCY 2C/CD3 bispecific antibody of any one of claims 1-20, and one or more pharmaceutically acceptable carriers.
26. Use of an anti-GUCY 2C/CD3 bispecific antibody according to any one of claims 1-20, or a pharmaceutical composition according to claim 25, for the manufacture of a medicament for the treatment of cancer.
27. The use of claim 26, wherein the cancer is a GUCY 2C-related cancer;
preferably, the cancer is a GUCY2C abnormal expression.
28. The use of claim 26 or 27, wherein the cancer is a gastrointestinal tumor or pancreatic cancer; preferably, the gastrointestinal tumor is selected from the group consisting of rectal cancer, colon cancer, small intestine cancer, stomach cancer, esophageal cancer, and gastro-esophageal junction cancer; more preferably, the gastrointestinal tumor is a malignant tumor.
29. An immunoconjugate, the immunoconjugate comprising:
a) The anti-GUCY 2C/CD3 bispecific antibody of any one of claims 1-20; and b) a coupling moiety selected from the group consisting of: a detectable label, drug, toxin, cytokine, radionuclide, or enzyme.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111497366.0A CN116284426A (en) | 2021-12-09 | 2021-12-09 | anti-GUCY 2C/CD3 bispecific antibody and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111497366.0A CN116284426A (en) | 2021-12-09 | 2021-12-09 | anti-GUCY 2C/CD3 bispecific antibody and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116284426A true CN116284426A (en) | 2023-06-23 |
Family
ID=86782010
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111497366.0A Pending CN116284426A (en) | 2021-12-09 | 2021-12-09 | anti-GUCY 2C/CD3 bispecific antibody and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116284426A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116574187A (en) * | 2023-07-07 | 2023-08-11 | 浙江时迈药业有限公司 | Antibodies against GUCY2C and uses thereof |
-
2021
- 2021-12-09 CN CN202111497366.0A patent/CN116284426A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116574187A (en) * | 2023-07-07 | 2023-08-11 | 浙江时迈药业有限公司 | Antibodies against GUCY2C and uses thereof |
CN116574187B (en) * | 2023-07-07 | 2024-03-08 | 浙江时迈药业有限公司 | Antibodies against GUCY2C and uses thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102589136B1 (en) | Anti-B7-H3 antibody and its uses | |
CN111018986B (en) | Antibodies against glypican-3 and uses thereof | |
CN107973854B (en) | PDL1 monoclonal antibody and application thereof | |
CN107922494B (en) | anti-PD-1 antibodies and uses thereof | |
JP5752687B2 (en) | Antibodies against ectodomain of ERBB3 and use thereof | |
KR20190133160A (en) | Molecules Including Anti-GPRC5D Antibody and Anti-GPRC5D Antibody | |
KR20190055022A (en) | Anti-HER2 Antibody or Antigen Binding Fragment Thereof, and Chimeric Antigen Receptor Comprising The Same | |
KR20220110177A (en) | Anti-human claudin 18.2 antibody and application thereof | |
KR20130086533A (en) | Novel anti-cmet antibody | |
KR20220113353A (en) | Bispecific antibodies to CEACAM5 and CD3 | |
KR20190121294A (en) | Molecules Including Anti-CD3 Antibodies, and Anti-CD3 Antibodies | |
KR102405278B1 (en) | ALK7 binding proteins and uses thereof | |
CN111196854A (en) | OX40 antibody and preparation method and application thereof | |
CN110713537A (en) | SEMA4D antibody and preparation method and application thereof | |
JP2010509931A (en) | New antiproliferative compounds | |
CN113321734A (en) | anti-CD 47/anti-PD-L1 antibodies and uses thereof | |
CN114478769B (en) | anti-TIGIT antibody, and pharmaceutical composition and use thereof | |
CN112500485A (en) | anti-B7-H3 antibody and application thereof | |
JP2022539344A (en) | Anti-CEA antibody and its application | |
CN111253487B (en) | CD19 antibodies and uses thereof | |
KR102457751B1 (en) | Activin type 2 receptor binding protein and uses thereof | |
CN108137695B (en) | Sialylated diluisia a expressed on glycoproteins other than glycolipids as a functional cancer target and antibodies thereto | |
CN109912716B (en) | EGFR antibody and preparation method and application thereof | |
CN116284426A (en) | anti-GUCY 2C/CD3 bispecific antibody and application thereof | |
JP2023109923A (en) | Humanized antibodies against globo h and uses thereof in cancer treatments |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication |