KR20220005002A - 재조합 인플루엔자 항원 - Google Patents
재조합 인플루엔자 항원 Download PDFInfo
- Publication number
- KR20220005002A KR20220005002A KR1020217036388A KR20217036388A KR20220005002A KR 20220005002 A KR20220005002 A KR 20220005002A KR 1020217036388 A KR1020217036388 A KR 1020217036388A KR 20217036388 A KR20217036388 A KR 20217036388A KR 20220005002 A KR20220005002 A KR 20220005002A
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- ser
- asn
- leu
- thr
- Prior art date
Links
- 206010022000 influenza Diseases 0.000 title claims abstract description 49
- 239000000427 antigen Substances 0.000 title description 4
- 108091007433 antigens Proteins 0.000 title description 4
- 102000036639 antigens Human genes 0.000 title description 4
- 101710154606 Hemagglutinin Proteins 0.000 claims abstract description 271
- 101710093908 Outer capsid protein VP4 Proteins 0.000 claims abstract description 271
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 claims abstract description 271
- 101710176177 Protein A56 Proteins 0.000 claims abstract description 271
- 239000000185 hemagglutinin Substances 0.000 claims abstract description 268
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 264
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 264
- 229920001184 polypeptide Polymers 0.000 claims abstract description 262
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 84
- 150000001413 amino acids Chemical class 0.000 claims abstract description 57
- 239000012634 fragment Substances 0.000 claims abstract description 46
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 44
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 43
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 43
- 230000002163 immunogen Effects 0.000 claims abstract description 40
- 241000712431 Influenza A virus Species 0.000 claims abstract description 23
- 208000037797 influenza A Diseases 0.000 claims abstract description 12
- 241000712461 unidentified influenza virus Species 0.000 claims description 54
- 239000013598 vector Substances 0.000 claims description 40
- 238000000034 method Methods 0.000 claims description 32
- 210000004027 cell Anatomy 0.000 claims description 31
- 210000004899 c-terminal region Anatomy 0.000 claims description 14
- 238000003776 cleavage reaction Methods 0.000 claims description 13
- 230000007017 scission Effects 0.000 claims description 13
- 238000000746 purification Methods 0.000 claims description 12
- 229960005486 vaccine Drugs 0.000 claims description 10
- 230000028993 immune response Effects 0.000 claims description 9
- 108091005804 Peptidases Proteins 0.000 claims description 7
- 239000004365 Protease Substances 0.000 claims description 7
- 239000000203 mixture Substances 0.000 claims description 6
- 238000001514 detection method Methods 0.000 claims description 3
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 3
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims 1
- 230000000890 antigenic effect Effects 0.000 abstract description 3
- 230000036039 immunity Effects 0.000 abstract 1
- 235000001014 amino acid Nutrition 0.000 description 99
- 229940024606 amino acid Drugs 0.000 description 81
- 108010076504 Protein Sorting Signals Proteins 0.000 description 53
- 230000035772 mutation Effects 0.000 description 50
- 108010089804 glycyl-threonine Proteins 0.000 description 35
- 108010092854 aspartyllysine Proteins 0.000 description 33
- 108010061238 threonyl-glycine Proteins 0.000 description 33
- 108090000623 proteins and genes Proteins 0.000 description 31
- 102000004169 proteins and genes Human genes 0.000 description 30
- 241000700605 Viruses Species 0.000 description 29
- 108010050848 glycylleucine Proteins 0.000 description 26
- 235000018102 proteins Nutrition 0.000 description 26
- 238000005829 trimerization reaction Methods 0.000 description 25
- 241000880493 Leptailurus serval Species 0.000 description 24
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 24
- 125000000539 amino acid group Chemical group 0.000 description 24
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 24
- 238000001542 size-exclusion chromatography Methods 0.000 description 24
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 23
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 21
- 108010047857 aspartylglycine Proteins 0.000 description 21
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- 108010012581 phenylalanylglutamate Proteins 0.000 description 20
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 19
- 108010034529 leucyl-lysine Proteins 0.000 description 19
- 239000013638 trimer Substances 0.000 description 19
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 18
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 17
- 108010028295 histidylhistidine Proteins 0.000 description 17
- 108010064235 lysylglycine Proteins 0.000 description 17
- 239000000178 monomer Substances 0.000 description 17
- 108010073969 valyllysine Proteins 0.000 description 17
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 16
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 16
- 208000015181 infectious disease Diseases 0.000 description 16
- 108010057821 leucylproline Proteins 0.000 description 16
- 108010031719 prolyl-serine Proteins 0.000 description 16
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 15
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 15
- 108010015792 glycyllysine Proteins 0.000 description 15
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 14
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 14
- 108010005233 alanylglutamic acid Proteins 0.000 description 14
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 14
- 239000012228 culture supernatant Substances 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 230000000087 stabilizing effect Effects 0.000 description 14
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 13
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- 108010068265 aspartyltyrosine Proteins 0.000 description 13
- 108010078144 glutaminyl-glycine Proteins 0.000 description 13
- 239000012528 membrane Substances 0.000 description 13
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 12
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 12
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 12
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 12
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 12
- 108010029020 prolylglycine Proteins 0.000 description 12
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 11
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 11
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 11
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 11
- 108010079364 N-glycylalanine Proteins 0.000 description 11
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 11
- 108010077245 asparaginyl-proline Proteins 0.000 description 11
- 238000004113 cell culture Methods 0.000 description 11
- 108010060199 cysteinylproline Proteins 0.000 description 11
- 201000010099 disease Diseases 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 108010081551 glycylphenylalanine Proteins 0.000 description 11
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 11
- 229960000310 isoleucine Drugs 0.000 description 11
- 108010004914 prolylarginine Proteins 0.000 description 11
- 108010026333 seryl-proline Proteins 0.000 description 11
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 10
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 10
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 10
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 10
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 10
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 10
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 10
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 10
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 10
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 10
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 10
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 10
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 10
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 10
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 10
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 10
- 108010008355 arginyl-glutamine Proteins 0.000 description 10
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 10
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 10
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 10
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 10
- 108010087823 glycyltyrosine Proteins 0.000 description 10
- 238000004519 manufacturing process Methods 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 108010020532 tyrosyl-proline Proteins 0.000 description 10
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 9
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 9
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 9
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 9
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 9
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 9
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 9
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 9
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 9
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 9
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 9
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 9
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 9
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 9
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 9
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 9
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 9
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 9
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 9
- 235000013601 eggs Nutrition 0.000 description 9
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 9
- 229960003971 influenza vaccine Drugs 0.000 description 9
- 210000004962 mammalian cell Anatomy 0.000 description 9
- 230000003472 neutralizing effect Effects 0.000 description 9
- 108010080629 tryptophan-leucine Proteins 0.000 description 9
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 8
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 8
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 8
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 8
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 8
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 8
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 8
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 8
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 8
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 8
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 8
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 8
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 8
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 8
- 241001500351 Influenzavirus A Species 0.000 description 8
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 8
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 8
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 8
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 8
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 8
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 8
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 8
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 8
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 8
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 8
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 8
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 8
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 8
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 8
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 8
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 8
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 8
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 8
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 8
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 8
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010085325 histidylproline Proteins 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- 108010070643 prolylglutamic acid Proteins 0.000 description 8
- 108010003137 tyrosyltyrosine Proteins 0.000 description 8
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 7
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 7
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 7
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 7
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 7
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 7
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 7
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 7
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 7
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 7
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 7
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 7
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 7
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 7
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 7
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 7
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 7
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 7
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 7
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 7
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 7
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 7
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 7
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 7
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 7
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 7
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 7
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 7
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 description 7
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 7
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 7
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 230000001086 cytosolic effect Effects 0.000 description 7
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 7
- 108010027338 isoleucylcysteine Proteins 0.000 description 7
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 7
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- FATXTKJILXPNJL-UHFFFAOYSA-N 2-[[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 FATXTKJILXPNJL-UHFFFAOYSA-N 0.000 description 6
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 6
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 6
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 6
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 6
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 6
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 6
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 6
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 6
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 6
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 6
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 6
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 6
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 6
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 6
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 6
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 6
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 6
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 6
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 6
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 6
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 6
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 6
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 6
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 6
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 6
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 6
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 6
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 6
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 6
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 6
- 238000002965 ELISA Methods 0.000 description 6
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 6
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 6
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 6
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 6
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 6
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 6
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 6
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 6
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 6
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 6
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 6
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 6
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 6
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 6
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 6
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 6
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 6
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 6
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 6
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 6
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 6
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 6
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 6
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 6
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 6
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 6
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 6
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 6
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 6
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 6
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 6
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 6
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 6
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 6
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 6
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 6
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 6
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 6
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 6
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 6
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 6
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 6
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 6
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 6
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 6
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 6
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 6
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 6
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 6
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 6
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 6
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 6
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 6
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 6
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 6
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 6
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 6
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 6
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 6
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 6
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 6
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 6
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 6
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 6
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 6
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 6
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 6
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 6
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 6
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 6
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 6
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 6
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 6
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 6
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 6
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 6
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 6
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 6
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 6
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 6
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 6
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 6
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010070783 alanyltyrosine Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010069495 cysteinyltyrosine Proteins 0.000 description 6
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 230000014759 maintenance of location Effects 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 230000035882 stress Effects 0.000 description 6
- 108010084932 tryptophyl-proline Proteins 0.000 description 6
- 241000701161 unidentified adenovirus Species 0.000 description 6
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 5
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 5
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 5
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 5
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 5
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 5
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 5
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 5
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 5
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 5
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 5
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 5
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 5
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 5
- 108700039887 Essential Genes Proteins 0.000 description 5
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 5
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 5
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 5
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 5
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 5
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 5
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 5
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 5
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 5
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 5
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 5
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 5
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 5
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 5
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 5
- 241000713196 Influenza B virus Species 0.000 description 5
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 5
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 5
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 5
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 5
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 5
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 5
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 5
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 5
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 5
- BPDXWKVZNCKUGG-BZSNNMDCSA-N Lys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N BPDXWKVZNCKUGG-BZSNNMDCSA-N 0.000 description 5
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 5
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 5
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 5
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 5
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 5
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 5
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 5
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 5
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 5
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 5
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 5
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 5
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 5
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 5
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 5
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 5
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 5
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 5
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 5
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 5
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 5
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 5
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 5
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 5
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 5
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 5
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 5
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 230000013595 glycosylation Effects 0.000 description 5
- 238000006206 glycosylation reaction Methods 0.000 description 5
- 108010084389 glycyltryptophan Proteins 0.000 description 5
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 5
- 238000003752 polymerase chain reaction Methods 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 238000011282 treatment Methods 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- 230000009385 viral infection Effects 0.000 description 5
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 4
- 238000012815 AlphaLISA Methods 0.000 description 4
- 235000006576 Althaea officinalis Nutrition 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 4
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 4
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 4
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 4
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 4
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 4
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 4
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 4
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 4
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 4
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 4
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 4
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 4
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 4
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 4
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 4
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 4
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 4
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 4
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 4
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 4
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 4
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 4
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 4
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 4
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 4
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 4
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 4
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 4
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 4
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 4
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 4
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 4
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 4
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 4
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 4
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 4
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 4
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 4
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 4
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 4
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 4
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 4
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 4
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 4
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 4
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 4
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 4
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 4
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 4
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 4
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 4
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 4
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 4
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 4
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 4
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 4
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 4
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 4
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 4
- 108010006232 Neuraminidase Proteins 0.000 description 4
- 102000005348 Neuraminidase Human genes 0.000 description 4
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 4
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 4
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 4
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 4
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 4
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 4
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 4
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 4
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 4
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 4
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 4
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 4
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 4
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 4
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 4
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 4
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 4
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 4
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 4
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 4
- 239000002671 adjuvant Substances 0.000 description 4
- 238000007413 biotinylation Methods 0.000 description 4
- 230000006287 biotinylation Effects 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 239000003937 drug carrier Substances 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000005558 fluorometry Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 4
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 239000008194 pharmaceutical composition Substances 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 3
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 3
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 3
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 3
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 3
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 3
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 3
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 3
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 3
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 3
- 241000287828 Gallus gallus Species 0.000 description 3
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 3
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 3
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 3
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 3
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 3
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 3
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 3
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 3
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 3
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 3
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 3
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 3
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 3
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 3
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 3
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 3
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 3
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 3
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 3
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 3
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 3
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 3
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 3
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 3
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 3
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 3
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 3
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 3
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 3
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 3
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 3
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 3
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 3
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 3
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 3
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 3
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 3
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 3
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 3
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 3
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 3
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 3
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 3
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 3
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 3
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 3
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 3
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 3
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 3
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 3
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 3
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 3
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- 239000013543 active substance Substances 0.000 description 3
- 230000006978 adaptation Effects 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 3
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 230000034994 death Effects 0.000 description 3
- 231100000517 death Toxicity 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 238000002844 melting Methods 0.000 description 3
- 230000008018 melting Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 3
- 239000002105 nanoparticle Substances 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 230000002265 prevention Effects 0.000 description 3
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 2
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 2
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 2
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 2
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 2
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 2
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 2
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 2
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- HTOZUYZQPICRAP-BPUTZDHNSA-N Asp-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N HTOZUYZQPICRAP-BPUTZDHNSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 2
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 2
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 2
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 2
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 2
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 2
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 2
- AFYGNOJUTMXQIG-FXQIFTODSA-N Cys-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N AFYGNOJUTMXQIG-FXQIFTODSA-N 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 2
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 2
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 2
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 2
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 2
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 2
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 2
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 2
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 2
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 2
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 2
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 2
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 2
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 2
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 2
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 2
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 2
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 2
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 2
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 2
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 2
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 2
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 2
- UXZMINKIEWBEQU-SZMVWBNQSA-N His-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N UXZMINKIEWBEQU-SZMVWBNQSA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 2
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 2
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 2
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 2
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 2
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- 101900159346 Influenza A virus Hemagglutinin Proteins 0.000 description 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- VFQOCUQGMUXTJR-DCAQKATOSA-N Leu-Cys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N VFQOCUQGMUXTJR-DCAQKATOSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 2
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 2
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 2
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 2
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 2
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 2
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 2
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 2
- 102000011931 Nucleoproteins Human genes 0.000 description 2
- 108010061100 Nucleoproteins Proteins 0.000 description 2
- YQNBKXUTWBRQCS-BVSLBCMMSA-N Phe-Arg-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 YQNBKXUTWBRQCS-BVSLBCMMSA-N 0.000 description 2
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 2
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 2
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 2
- QEFHBVDWKFFKQI-PMVMPFDFSA-N Phe-His-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QEFHBVDWKFFKQI-PMVMPFDFSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 2
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 2
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 2
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 2
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 2
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- FRVUYKWGPCQRBL-GUBZILKMSA-N Pro-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 FRVUYKWGPCQRBL-GUBZILKMSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 2
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 2
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 2
- CJNCVBHTDXKTMJ-CYDGBPFRSA-N Ser-Asp-Lys-Pro Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(O)=O CJNCVBHTDXKTMJ-CYDGBPFRSA-N 0.000 description 2
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 2
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 2
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 239000012505 Superdex™ Substances 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 2
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 2
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 2
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 2
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 2
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 2
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 2
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 2
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 2
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 2
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 2
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 2
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 2
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 2
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 2
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 2
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 230000000840 anti-viral effect Effects 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000012575 bio-layer interferometry Methods 0.000 description 2
- 238000007707 calorimetry Methods 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- -1 etc.) Chemical compound 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010017446 glycyl-prolyl-arginyl-proline Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 208000037798 influenza B Diseases 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000009545 invasion Effects 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 238000013081 phylogenetic analysis Methods 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 208000023504 respiratory system disease Diseases 0.000 description 2
- 230000001932 seasonal effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 229960000814 tetanus toxoid Drugs 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010025432 tyrosyl-alanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 239000000277 virosome Substances 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 1
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- 208000031504 Asymptomatic Infections Diseases 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- 108010071134 CRM197 (non-toxic variant of diphtheria toxin) Proteins 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010049048 Cholera Toxin Proteins 0.000 description 1
- 102000009016 Cholera Toxin Human genes 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- ONIBWKKTOPOVIA-SCSAIBSYSA-N D-Proline Chemical compound OC(=O)[C@H]1CCCN1 ONIBWKKTOPOVIA-SCSAIBSYSA-N 0.000 description 1
- 229930182820 D-proline Natural products 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 208000004739 Egg Hypersensitivity Diseases 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 101710146739 Enterotoxin Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 238000005033 Fourier transform infrared spectroscopy Methods 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- NEDQVOQDDBCRGG-UHFFFAOYSA-N Gly Gly Thr Tyr Chemical compound NCC(=O)NCC(=O)NC(C(O)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 NEDQVOQDDBCRGG-UHFFFAOYSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108060003393 Granulin Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- YBDOQKVAGTWZMI-XIRDDKMYSA-N His-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YBDOQKVAGTWZMI-XIRDDKMYSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- 102100021628 Histatin-3 Human genes 0.000 description 1
- 101000898505 Homo sapiens Histatin-3 Proteins 0.000 description 1
- 241000598171 Human adenovirus sp. Species 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- VLCMCYDZJCWPQT-VKOGCVSHSA-N Ile-Met-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N VLCMCYDZJCWPQT-VKOGCVSHSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 101100028758 Influenza A virus (strain A/Swine/Wisconsin/1/1967 H1N1) PB1-F2 gene Proteins 0.000 description 1
- 102100027612 Kallikrein-11 Human genes 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 208000032420 Latent Infection Diseases 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- HQPHMEPBNUHPKD-XIRDDKMYSA-N Leu-Cys-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N HQPHMEPBNUHPKD-XIRDDKMYSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- YLMIDMSLKLRNHX-HSCHXYMDSA-N Leu-Trp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YLMIDMSLKLRNHX-HSCHXYMDSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000701076 Macacine alphaherpesvirus 1 Species 0.000 description 1
- 101710141347 Major envelope glycoprotein Proteins 0.000 description 1
- 241000272527 Mareca penelope Species 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 1
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000282339 Mustela Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000712464 Orthomyxoviridae Species 0.000 description 1
- 101150103639 PB1 gene Proteins 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 241000845082 Panama Species 0.000 description 1
- 108010081690 Pertussis Toxin Proteins 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- CTNODEMQIKCZGQ-JYJNAYRXSA-N Phe-Gln-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 CTNODEMQIKCZGQ-JYJNAYRXSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- 206010035737 Pneumonia viral Diseases 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 1
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 1
- GQHAIUPYZPTADF-FDARSICLSA-N Trp-Ile-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 GQHAIUPYZPTADF-FDARSICLSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- RZRDCZDUYHBGDT-BVSLBCMMSA-N Trp-Met-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RZRDCZDUYHBGDT-BVSLBCMMSA-N 0.000 description 1
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 1
- ZHDQRPWESGUDST-JBACZVJFSA-N Trp-Phe-Gln Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ZHDQRPWESGUDST-JBACZVJFSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 1
- 101710152431 Trypsin-like protease Proteins 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 206010058874 Viraemia Diseases 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 238000012436 analytical size exclusion chromatography Methods 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 230000027645 antigenic variation Effects 0.000 description 1
- 239000002221 antipyretic Substances 0.000 description 1
- 229940125716 antipyretic agent Drugs 0.000 description 1
- 239000003443 antiviral agent Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 230000007012 clinical effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000005860 defense response to virus Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 238000012631 diagnostic technique Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 208000037771 disease arising from reactivation of latent virus Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 239000000147 enterotoxin Substances 0.000 description 1
- 231100000655 enterotoxin Toxicity 0.000 description 1
- 244000309457 enveloped RNA virus Species 0.000 description 1
- 230000017188 evasion or tolerance of host immune response Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000012395 formulation development Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000005182 global health Effects 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 244000052637 human pathogen Species 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 1
- 230000002519 immonomodulatory effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000002434 immunopotentiative effect Effects 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000034217 membrane fusion Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000007764 o/w emulsion Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000001048 orange dye Substances 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 238000001874 polarisation spectroscopy Methods 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012340 reverse transcriptase PCR Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 1
- 108010061514 sialic acid receptor Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 229940054967 vanquish Drugs 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000007502 viral entry Effects 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 208000009421 viral pneumonia Diseases 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/145—Orthomyxoviridae, e.g. influenza virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
- A61P31/16—Antivirals for RNA viruses for influenza or rhinoviruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5256—Virus expressing foreign proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10041—Use of virus, viral particle or viral elements as a vector
- C12N2710/10043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16121—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Veterinary Medicine (AREA)
- Genetics & Genomics (AREA)
- Gastroenterology & Hepatology (AREA)
- Public Health (AREA)
- Pulmonology (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- General Chemical & Material Sciences (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Communicable Diseases (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
Abstract
본 발명은 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드로서, 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 W이고;
(b) 위치 432의 아미노산은 I이고/이거나, 위치 380의 아미노산은 I인 아미노산 서열을 포함하고;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 폴리펩티드, 이의 면역원성 단편, 상기 폴리펩티드 또는 면역원성 단편을 암호화하는 핵산 분자, 및 이들의 용도를 제공한다.
(a) 위치 355의 아미노산은 W이고;
(b) 위치 432의 아미노산은 I이고/이거나, 위치 380의 아미노산은 I인 아미노산 서열을 포함하고;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 폴리펩티드, 이의 면역원성 단편, 상기 폴리펩티드 또는 면역원성 단편을 암호화하는 핵산 분자, 및 이들의 용도를 제공한다.
Description
본 발명은 적어도 부분적으로, HHS가 수여한 계약 HHSO100201700018C에 따른 정부 지원으로 이루어졌다. 정부는 본 발명에 있어서 소정의 권리를 갖는다.
본 발명은 의약 분야에 관한 것이다. 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드, 상기 폴리펩티드를 암호화하는 핵산, 이를 포함하는 약제학적 조성물, 및 이들의 사용 방법이 본 명세서에서 제공된다.
인플루엔자 바이러스는 중증도가 준임상적 감염으로부터 사망을 야기할 수 있는 원발성 바이러스성 폐렴까지의 범위인 호흡기 질환(통상 "인플루엔자" 또는 "플루"로 지칭)을 야기하는 주요 인간 병원체이다. 감염의 임상 효과는 인플루엔자 균주의 발병력 및 숙주의 노출, 병력, 연령 및 면역 상태에 따라 달라진다. 매년 전 세계적으로 대략 10억 명의 사람들이 인플루엔자 바이러스로 감염을 겪어, 3백만 내지 5백만의 사례에서 중증의 병이 야기되고, 300,000 내지 500,000건의 인플루엔자 관련 사망이 발생하는 것으로 추정된다. 대부분의 이들 감염은 기여도가 보다 적은 인플루엔자 B 바이러스와 함께, H1 또는 H3 적혈구응집소 서브타입을 지니는 인플루엔자 A 바이러스에 기인할 수 있으며, 이에 따라, 대표적인 3가지 모두가 계절성 백신에 포함된다. 현행의 면역화 관행은 효과적인 계절성 인플루엔자 백신의 시기 적절한 생성을 가능하게 하기 위하여, 유행하는 인플루엔자 바이러스의 조기 확인에 의존한다. 다음 계절 동안 우세적일 균주를 예측함에 있어서 내재적인 어려움과 별개로, 항바이러스 내성 및 면역 회피 또한 현행의 백신이 이환 및 사망을 예방하지 못하게 한다. 이에 더하여, 동물 병원소로부터 비롯되고, 인간 대 인간 확산을 증가시키도록 재배열된(reassorted) 고도의 발병력의 바이러스 균주에 의해 야기되는 범유행의 가능성은 세계의 보건에 대하여 상당한 현실적인 위협을 제기한다.
인플루엔자 A 바이러스는 자연에서 널리 분포되어 있으며, 다양한 조류 및 포유동물을 감염시킬 수 있다. 인플루엔자 바이러스는 오르토믹소비리대(Orthomyxoviridae)의 과에 속하는 외피(enveloped) RNA 바이러스이다. 이들의 게놈은 11종의 상이한 단백질, 1종의 핵단백질(NP), 3종의 중합효소 단백질(PA, PB1 및 PB2), 2종의 기질 단백질(M1 및 M2), 3종의 비 구조 단백질(NS1, NS2 및 PB1-F2) 및 2종의 외부 당단백질: 적혈구응집소(HA) 및 뉴라미니다제(NA)를 암호화하는 8개의 단일 가닥 RNA 절편으로 구성된다. 바이러스는 HA 및 NA 단백질의 항원 구조의 차이를 기반으로 하여 분류되며, 이들의 상이한 조합은 특정 인플루엔자 바이러스 균주로 더 분류되는 독특한 바이러스 서브타입을 나타낸다. 모든 공지되어 있는 서브타입이 조류에서 관찰될 수 있지만, 현재 유행하는 인간 인플루엔자 A 서브타입은 H1N1 및 H3N2이다. 계통발생 분석에 의해, 다음과 같은 2개의 주요 그룹으로의 적혈구응집소의 세분이 증명되었다: 특히, 계통발생 그룹 1에서 H1, H2, H5 및 H9 서브타입, 그리고 특히, 계통발생 그룹 2에서 H3, H4 및 H7 서브타입.
인플루엔자 B형 바이러스 균주는 엄격하게 인간형이다. 인플루엔자 B형 바이러스 균주에서의 HA의 항원 변이는 A형 균주에서 관찰되는 것보다 더 작다. B/야마가타/16/88(B/야마가타로도 지칭) 및 B/빅토리아/2/87(B/빅토리아) 계통에 의해 대표되는 인플루엔자 B 바이러스의 2종의 유전적 및 항원적으로 별개의 계통이 인간에서 유행하고 있다. 인플루엔자 B 바이러스에 의해 야기되는 질환의 스펙트럼은 일반적으로 인플루엔자 A 바이러스에 의해 야기되는 것보다 더 경증이지만, 입원을 필요로 하는 중증의 병이 인플루엔자 B 감염에서 여전히 빈번하게 관찰된다.
인플루엔자 바이러스를 중화시키는 항체가 주로 적혈구응집소(HA)에 대해 유도되는 것이 알려져 있다. 적혈구응집소는 바이러스 코트에 고정되는 삼량체 당단백질이며 이중의 기능을 갖는다: 세포 표면 수용체 시알산으로의 결합을 담당하는 기능, 및 이것이 흡수 후에 바이러스 및 엔도솜 막의 융합을 매개하여 세포의 세포질 내의 바이러스 RNA의 방출을 야기하는 기능. HA는 소위 헤드 도메인(head domain) 및 스템 도메인(stem domain)을 포함한다. 바이러스 막으로의 부착은 스템 도메인에 연결된 C-말단 고정(anchoring) 서열(막횡단 도메인으로도 알려져 있음)에 의해 매개된다. 단백질은 지정된 루프에서 번역 후 절단되어, 2가지 폴리펩티드, HA1 및 HA2를 생성한다(전체 서열은 HA0으로 지칭). 막 원위 헤드 도메인은 주로 HA1으로부터 유래되며, 막 근위 스템 도메인은 주로 HA2로부터 유래된다(도 1).
인플루엔자 바이러스는 어디에나 존재하기 때문에, 바이러스에 의한 감염을 방지하는 것은 거의 불가능하다. 백신접종은 인플루엔자 유행병 및 범유행을 통제하는 데 중요한 역할을 한다. 많은 인플루엔자 백신은 달걀에서 바이러스의 재배열, 적응 및 성장을 포함하는 방법에 의해 제조된다. 그러나 이러한 기존 방법에는 한계가 있다. 모든 인플루엔자 바이러스 균주가 달걀에서 잘 자라는 것은 아니므로, 적응하거나 바이러스 재배열체를 구성해야 한다. 제조 중 HA의 변화는 순환하는 균주와 다르고 최적이 아닌 보호 수준을 제공할 수 있는 균주로 이어질 수 있다. 또 다른 단점은 달걀 알레르기가 있는 사람들이 달걀 기반 백신의 잔류 달걀 단백질에 과민반응을 보일 수 있다는 점이다. 더욱이, 달걀 기반 방법은 연속된 달걀의 공급에 의존하며, 이는 가금류의 질환의 경우에서와 같이 공급의 중단에 취약할 수 있다. 달걀 공급에 의존하지 않고 백신 단백질 제조가 달걀 기반 방법보다 더욱 엄격하게 통제되는 방법을 사용하여 백신을 제조할 필요가 있다. 세포 배양에서 제조된 HA의 재조합 형태(rHA)는 달걀에서 얻은 것에 대한 인플루엔자 백신의 대안적인 항원 공급원으로 사용된다. 그러나 이러한 방법을 사용하여 rHA의 면역원성 및 규칙적인 4차 구조를 유지하고 높은 수율의 삼량체 rHA를 보장하는 문제가 발생하였다. 따라서, 기존의 문제를 해결하는 인플루엔자 백신 또는 진단을 위한 대안적인 항원 공급 방법이 여전히 필요하다.
본 발명의 일부 양태가 아래에 요약된다. 추가 양태는 본 특허 출원의 발명의 상세한 설명, 실시예, 도면, 및 청구범위 섹션에 기술되어 있다.
제1 양태에서, 본 발명은 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드에 관한 것으로, 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 트립토판(W)이고;
(b) 위치 432의 아미노산은 이소류신(I)이고/이거나, 위치 380의 아미노산은 I인 아미노산 서열을 포함하며;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 HA 폴리펩티드에 관한 것이다.
추가 양태에서, 본 발명은 적어도 2개의 HA 폴리펩티드를 포함하는 다량체 폴리펩티드, 특히 본원에 기술된 바와 같은 3개의 HA 폴리펩티드를 포함하는 삼량체 폴리펩티드에 관한 것이다.
본 발명에 따르면 놀랍게도 재조합 인플루엔자 HA 폴리펩티드, 특히 재조합 삼량체 HA 폴리펩티드가 야생형 HA 폴리펩티드와 비교하여 높은 수준으로 수득될 수 있고, 이종 아미노산 서열, 예컨대, 이종 삼량체화 도메인의 추가 없이 더 큰 안정성을 나타내는 증가된 용융 온도를 갖는 것으로 나타났다. 또한, 본 발명의 HA 폴리펩티드는 항체 CR9114, CR8020 및/또는 CR6261과 같은(그러나 이에 한정되지 않음) HA 폴리펩티드에 대한 항-HA 항체의 결합에 의해 제시된 바와 같이 정확하게 접힌다. 따라서, 폴리펩티드는 대상체, 특히 인간 대상체에게 투여되는 경우 HA에 대한 면역 반응을 유도할 수 있다. 삼량체 폴리펩티드는 야생형 천연 HA의 4차 구조를 포함하고, 따라서 HA 분자의 막 근위 줄기의 보존된 에피토프를 포함하는 천연 에피토프를 면역계에 제시한다.
추가의 양태에서, 본 발명은 재조합 인플루엔자 HA 폴리펩티드를 암호화하는 핵산 분자를 제공한다.
또 다른 양태에서, 본 발명은 인플루엔자 HA 폴리펩티드를 암호화하는 핵산 분자를 포함하는 벡터, 특히 재조합 아데노바이러스 벡터를 제공한다.
또 다른 양태에서, 본 발명은 본 발명에 따른 인플루엔자 HA 폴리펩티드, 핵산 분자 및/또는 벡터, 및 약제학적으로 허용 가능한 담체를 포함하는 면역원성 조성물을 제공한다.
추가의 양태에서, 본 발명은 의약으로서 사용하기 위한, 특히 인플루엔자 질환, 특히 계통발생 그룹 1 및/또는 2로부터의 인플루엔자 바이러스 A 균주에 의해 유발된 질환 또는 병태의 예방 및/또는 치료를 위한 백신으로서 사용하기 위한 인플루엔자 HA 폴리펩티드, 상기 인플루엔자 HA 폴리펩티드를 암호화하는 핵산 분자, 및/또는 상기 핵산 분자를 포함하는 벡터를 제공한다.
본 발명은 또한 필요로 하는 대상체에서 인플루엔자 HA에 대한 면역 반응의 유도 방법을 제공하며, 방법은 대상체에게 본 발명에 따른 인플루엔자 HA 폴리펩티드, 핵산 분자, 및/또는 벡터를 투여하는 단계를 포함한다. 또 다른 양태에서, 인플루엔자 질환 감염 위험이 있는 것으로 확인된 사람과 같이 필요로 하는 사람에게 전술한 바와 같은 폴리펩티드 또는 면역원성 조성물을 투여하는 단계를 포함하는 인플루엔자 질환에 대한 예방 및/또는 백신접종 방법이 제공된다.
또 다른 양태에서, 위에서 정의된 바와 같은 재조합 HA 폴리펩티드를 제조하는 방법으로서, 원핵생물 또는 진핵생물 세포, 예컨대, 포유동물 세포, 예를 들어, CHO 세포, 또는 곤충 세포에서 전술한 핵산 분자를 발현시키는 단계를 포함하고, 상기 세포로부터 rHA를 정제/단리하는 단계를 선택적으로 추가로 포함하는 방법이 제공된다.
또 다른 양태에서, 본 발명은 연구 도구 또는 진단 도구로서, 또는 항체의 인플루엔자 억제제 생산을 위한 표적으로서 HA 폴리펩티드의 용도를 제공한다.
도 1.
A. 표시된 돌연변이의 위치를 갖는 본 발명의 폴리펩티드 단량체의 3차원 표현. 암회색의 적혈구응집소(HA)의 헤드와 연회색의 스템; B. 표시된 돌연변이의 위치를 갖는 본 발명의 폴리펩티드 단량체의 개략도(검은색: 헤드; 연회색: 스템).
도 2. 도 2a. 인플루엔자 HA의 계통수. 인플루엔자 A 그룹 1 및 그룹 2 및 인플루엔자 B의 상이한 서브타입이 표시되어 있다; 도 2b. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준. 마지막 컬럼은 야생형(WT) HA와 비교하여 본 발명의 안정화된 가용성 HA 삼량체의 발현 수준의 배수 증가를 나타낸다; 도 2c. 크기 배제 크로마토그래피(SEC) 프로파일 - 점선은 WT HA를 나타내고, 검은색의 실선은 본 발명에 따른 안정화된 HA 폴리펩티드를 나타낸다.
도 3. 도 3a. 본 발명의 정제된 삼량체(T) 폴리펩티드(검은색 선) 및 폴드온(Foldon) 삼량체화 도메인을 갖는 WT 폴리펩티드(회색 선)의 SEC 분석(UFV4239는 폴드온 도메인을 포함하지 않음에 유의함). WT-폴드온 정제된 폴리펩티드는 -80℃에서 보관 후 피크의 폭 넓어짐 및 다량체 형성(*)을 나타낸다. UFV4239의 누락된 삼량체화 도메인으로 인해, 단량체(M)만 발현되고 정제되었다; 도 3b 및 도 3c. 시차 주사 형광측정법(DSF)에 의한 정제된 폴리펩티드의 온도 안정성 분석(℃ 단위의 Tm50 값); 도 3d. 단일클론 항체(mAb) CR6261, CR8020, CT149, CR9114 및 다중도메인 항체 MD3606의 결합(ELISA, EC50 값).
도 4. A. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준; B. EXPI-293 세포 배양 상청액의 SEC 분석. UFV181007은 돌연변이 K380I 및 E432I(검은색의 점선)를 포함한다. UFV181005은 돌연변이 H355W 및 M478I(회색의 점선)를 포함한다. 안정화 돌연변이의 조합(UFV1810009, 검은색 선).
도 5. A. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준. 마지막 컬럼은 야생형(WT) HA와 비교하여 본 발명의 안정화된 가용성 HA 삼량체의 발현 수준의 배수 증가를 나타낸다; B. 크기 배제 크로마토그래피(SEC) 프로파일- 점선은 WT HA를 나타내고, 검은색의 실선은 본 발명에 따른 추가의 안정화된 HA 폴리펩티드를 나타낸다.
도 6. 온도 스트레스 전후의 정제된 안정화 HA의 크기 배제 크로마토그래피(SEC) 프로파일. 실험 전(점선)과 4℃(검은색 실선) 및 37℃(회색 실선)에서 60일의 인큐베이션 후 폴리펩티드에 대한 프로파일을 나타내었다.
도 2. 도 2a. 인플루엔자 HA의 계통수. 인플루엔자 A 그룹 1 및 그룹 2 및 인플루엔자 B의 상이한 서브타입이 표시되어 있다; 도 2b. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준. 마지막 컬럼은 야생형(WT) HA와 비교하여 본 발명의 안정화된 가용성 HA 삼량체의 발현 수준의 배수 증가를 나타낸다; 도 2c. 크기 배제 크로마토그래피(SEC) 프로파일 - 점선은 WT HA를 나타내고, 검은색의 실선은 본 발명에 따른 안정화된 HA 폴리펩티드를 나타낸다.
도 3. 도 3a. 본 발명의 정제된 삼량체(T) 폴리펩티드(검은색 선) 및 폴드온(Foldon) 삼량체화 도메인을 갖는 WT 폴리펩티드(회색 선)의 SEC 분석(UFV4239는 폴드온 도메인을 포함하지 않음에 유의함). WT-폴드온 정제된 폴리펩티드는 -80℃에서 보관 후 피크의 폭 넓어짐 및 다량체 형성(*)을 나타낸다. UFV4239의 누락된 삼량체화 도메인으로 인해, 단량체(M)만 발현되고 정제되었다; 도 3b 및 도 3c. 시차 주사 형광측정법(DSF)에 의한 정제된 폴리펩티드의 온도 안정성 분석(℃ 단위의 Tm50 값); 도 3d. 단일클론 항체(mAb) CR6261, CR8020, CT149, CR9114 및 다중도메인 항체 MD3606의 결합(ELISA, EC50 값).
도 4. A. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준; B. EXPI-293 세포 배양 상청액의 SEC 분석. UFV181007은 돌연변이 K380I 및 E432I(검은색의 점선)를 포함한다. UFV181005은 돌연변이 H355W 및 M478I(회색의 점선)를 포함한다. 안정화 돌연변이의 조합(UFV1810009, 검은색 선).
도 5. A. OCTET(항-His2 센서)에 의해 결정된 단백질 발현 수준. 마지막 컬럼은 야생형(WT) HA와 비교하여 본 발명의 안정화된 가용성 HA 삼량체의 발현 수준의 배수 증가를 나타낸다; B. 크기 배제 크로마토그래피(SEC) 프로파일- 점선은 WT HA를 나타내고, 검은색의 실선은 본 발명에 따른 추가의 안정화된 HA 폴리펩티드를 나타낸다.
도 6. 온도 스트레스 전후의 정제된 안정화 HA의 크기 배제 크로마토그래피(SEC) 프로파일. 실험 전(점선)과 4℃(검은색 실선) 및 37℃(회색 실선)에서 60일의 인큐베이션 후 폴리펩티드에 대한 프로파일을 나타내었다.
정의
본 발명에서 사용되는 용어의 정의가 아래에 제공된다.
본 발명에 따른 아미노산은 20개의 자연 발생(또는 '표준' 아미노산) 또는 이의 변이체, 예를 들어, D-프롤린(프롤린의 D-거울상 이성질체), 또는 자연적으로 단백질에서 관찰되지 않는 임의의 변이체, 예를 들어, 노르류신 중 임의의 것일 수 있다. 표준 아미노산은 이들의 특성을 기반으로 몇 개의 군으로 분류될 수 있다. 중요한 인자는 전하, 친수성 또는 소수성, 크기 및 기능기이다. 이들 특성은 단백질 구조 및 단백질-단백질 상호작용에 중요하다. 다른 시스테인 잔기와 공유적 이황화 결합(또는 이황화 가교)을 형성할 수 있는 시스테인, 폴리펩티드 백본에 대하여 고리를 형성하는 프롤린, 및 다른 아미노산보다 더 유연한 글리신과 같이, 일부 아미노산은 특별한 특성을 갖는다. 표 3은 표준 아미노산의 약어 및 특성을 나타낸다.
본 명세서에서 사용되는 용어 "포함되는" 또는 "포함하는"은 용어 "제한 없이"가 뒤따르는 것으로 간주된다.
본 명세서에서 사용되는 용어 "감염"은 세포 또는 대상체 내의 바이러스의 증식 및/또는 존재에 의한 침습을 의미한다. 일 구현예에서, 감염은 "활성" 감염, 즉, 바이러스가 세포 또는 대상체에서 복제하고 있는 것이다. 이러한 감염은 바이러스에 의해 처음 감염된 세포, 조직 및/또는 기관으로부터 다른 세포, 조직 및/또는 기관으로의 바이러스의 확산을 특징으로 한다. 감염은 또한 잠복 감염, 즉, 바이러스가 복제하고 있지 않은 것일 수도 있다. 특정 구현예에서, 감염은 세포 또는 대상체에서의 바이러스의 존재로부터 야기되거나, 또는 바이러스에 의한 세포 또는 대상체의 침습에 의해 야기되는 병리적 상태를 지칭한다.
인플루엔자 바이러스는 전형적으로 속 A, B 및 C의 인플루엔자 바이러스 타입으로 분류된다. 본 명세서에서 사용되는 용어 "인플루엔자 바이러스 서브타입"은 헤마글루티닌(H) 및 뉴라미다제(N) 바이러스 표면 단백질의 조합을 특징으로 하는 인플루엔자 A 바이러스 변이체를 말한다. 본 발명에 따르면, 인플루엔자 바이러스 서브타입은 이들의 H 번호에 의해, 예를 들어, "H3 서브타입의 HA를 포함하는 인플루엔자 바이러스", "H3 서브타입의 인플루엔자 바이러스" 또는 "H3 인플루엔자"에 의해, 또는 H 번호 및 N 번호의 조합, 예를 들어, "인플루엔자 바이러스 서브타입 H3N2" 또는 "H3N2"에 의해 지칭될 수 있다. 용어 "서브타입"은 구체적으로 보통 돌연변이로부터 야기되는 각각의 서브타입 내의 모든 개별 "균주"를 포함하며, 천연 분리주 및 인공 돌연변이체 또는 재배열체 등을 포함하여, 상이한 병리학적 프로파일을 보인다. 이러한 균주는 또한 바이러스 서브타입의 다양한 "분리주"로 지칭될 수도 있다. 따라서, 본 명세서에서 사용되는 용어 "균주" 및 "분리주"는 상호교환 가능하게 사용될 수 있다. 인간 인플루엔자 바이러스 균주 또는 분리주에 대한 현행의 명명법은 예를 들어, A/모스크바/10/2000(H3N2)과 같이, 보통 괄호 안에 제공되는 HA 및 NA의 항원 설명과 함께, 바이러스의 타입(속), 즉, A, B 또는 C, 처음 분리한 지리학적 위치, 균주 번호 및 분리한 해를 포함한다. 비 인간 균주는 또한 명명에 있어 기원하는 숙주를 포함한다.
인플루엔자 A 바이러스 서브타입은 이들의 계통발생 그룹을 참조하여 더 분류될 수 있다. 계통발생 분석에 의해, 다음과 같은 2개의 주요 그룹으로의 적혈구응집소의 세분이 증명되었다: 특히, 계통발생 그룹 1("그룹 1" 인플루엔자 바이러스)의 H1, H2, H5 및 H9 서브타입 및 특히, 계통발생 그룹 2("그룹 2" 인플루엔자 바이러스)의 H3, H4, H7 및 H10 서브타입.
본 명세서에서 사용되는 용어 "인플루엔자 바이러스 질환" 또는 "인플루엔자"는 대상체에서의 인플루엔자 바이러스, 예를 들어, 인플루엔자 A 또는 B 바이러스의 존재로부터 야기되는 병리학적 상태를 지칭한다. 본 명세서에서 사용되는 용어 "질환" 및 "장애"는 상호교환 가능하게 사용된다. 특정 구현예에서, 이 용어는 인플루엔자 바이러스에 의한 대상체의 감염에 의해 야기되는 호흡기 병을 지칭한다.
본 명세서에서 사용되는 용어 "핵산" 또는 "핵산 분자"는 폴리뉴클레오티드, 예컨대, DNA 분자(예를 들어, cDNA 또는 게놈 DNA) 및 RNA 분자(예를 들어, mRNA) 및 뉴클레오티드 유사체를 사용하여 생성된 DNA 또는 RNA의 유사체를 포함하고자 한 것이다. 핵산은 단일 가닥 또는 이중 가닥일 수 있다. 핵산 분자는 화학적 또는 생화학적으로 변형될 수 있거나, 또는 당업자에 의해 용이하게 인정될 바와 같이, 비 천연 또는 유도체화된 뉴클레오티드 염기를 함유할 수 있다. 이러한 변형은 예를 들어, 표지, 메틸화, 하나 이상의 자연 발생 뉴클레오티드의 유사체로의 치환, 뉴클레오티드 사이 변형, 예를 들어, 비 하전 연결기(예를 들어, 메틸 포스포네이트, 포스포트리에스테르, 포스포르아미데이트, 카르바메이트 등), 하전된 연결기(예를 들어, 포스포로티오에이트, 포스포로디티오에이트 등), 펜덴트 부분(예를 들어, 폴리펩티드), 인터칼레이터(intercalator)(예를 들어, 아크리딘, 소랄렌 등), 킬레이터, 알킬레이터 및 변형된 연결기(예를 들어, 알파 아노머 핵산 등)를 포함한다. 핵산 서열에 대한 언급은 달리 명시되지 않는 한 이의 상보체를 포함한다. 따라서, 특정 서열을 갖는 핵산 분자에 대한 언급은 이의 상보 서열을 지니는 이의 상보 가닥을 포함하는 것으로 이해되어야 한다. 상보 가닥은 또한 예를 들어, 안티-센스 치료법, 혼성화 프로브 및 PCR 프라이머에 유용하다.
본 명세서에서 사용되는 바와 같이, HA에서 아미노산의 넘버링은 문헌[Winter et al. (Nature 292: 72-75, 1981)]에 기술된 바와 같이, H3 넘버링에 기반한다. 따라서, 본 발명의 폴리펩티드의 아미노산 잔기 또는 아미노산 위치의 넘버링은 문헌[Winter et al. (1981)]의 도 2에서 기술되고 도시된 바와 같이, H3 HA의 아미노산의 넘버링(특히, A/아이치/2/68의 HA의 아미노산 위치의 넘버링)에 해당한다. 특히, 넘버링은 서열번호 1의 아미노산 위치의 넘버링에 해당한다. 예를 들어, 표현 "위치 355의 아미노산"은 문헌[Winter et al. (1981)]의 H3 넘버링에 따른 위치 355에 있는 아미노산 잔기, 즉, 서열번호 1의 위치 355에 있는 아미노산 잔기를 지칭한다. 다른 인플루엔자 바이러스 균주 및/또는 서브타입, 예컨대, 예를 들어, H1, H5, 또는 H7 HA에서 균등한 아미노산이 서열 정렬에 의해 결정될 수 있음을 당업자는 이해할 것이다. 따라서, 예를 들어, 서열번호 1과 비교하여 추가의 아미노산 잔기가 부가되거나 제거된 경우, 상이한 HA 서열이 상이한 넘버링 시스템을 가질 수 있음을 주목해야 하고, 이를 당업자는 이해할 것이다. 이와 같이, 특정 아미노산 잔기가 그 번호로 지칭될 경우, 이러한 설명은 주어진 아미노산 서열의 시작부터 셀 때 정확히 해당 번호가 매겨진 위치에 위치한 아미노산에만 한정되지 않고, 오히려 해당 잔기가 동일한 정확한 번호가 매겨진 위치에 있지 않더라도, 예를 들어, HA 서열이 서열번호 1보다 짧거다 긴 경우, 또는 서열번호 1과 비교하여 삽입 또는 결실이 있는 경우, 모든 HA 서열의 균등한/상응하는 아미노산 잔기가 의도되는 것으로 이해된다. 당업자는 예를 들어 주어진 HA 서열을 서열번호 1에 정렬함으로써, 본원에 인용된 특정 번호가 매겨진 잔기 중 임의의 것과 상응하는/동등한 아미노산 위치가 무엇인지 쉽게 결정할 수 있다. 따라서, 인플루엔자 HA 단백질의 특정 아미노산 잔기가 지칭되는 구현예에서, 본 발명은 오로지 정확한 번호가 매겨진 아미노산 위치에서만의 소정의 아미노산 잔기(예를 들어, 위치 355의 트립토판(W) 및/또는 위치 432 및/또는 380의 이소류신(I)의 존재)를 갖는 서열들로 한정되지 않음이 이해된다.
"폴리펩티드"는 당업자에게 공지된 바와 같이 아미드 결합에 의해 연결된 아미노산의 중합체를 지칭한다. 본 명세서에서 사용되는 용어는 공유적 아미드 결합에 의해 연결된 단일 폴리펩티드 사슬을 지칭할 수 있다. 이 용어는 또한 비 공유 상호작용, 예를 들어, 이온 접촉, 수소 결합, 반데르발스 접촉 및 소수성 접촉에 의해 회합된 다수의 폴리펩티드 사슬을 지칭할 수 있다. 당업자는 이 용어가 예를 들어, 번역 후 가공, 예를 들어, 신호 펩티드 절단, 이황화 결합 형성, 글리코실화(예를 들어, N-연결 및 O-연결 글리코실화), 프로테아제 절단 및 지질 변형(예를 들어, S-팔미토일화)에 의해 변형된 폴리펩티드를 포함함을 인식할 것이다.
본 명세서에서 사용되는 용어 "야생형"은 자연적으로 순환하는 인플루엔자 바이러스로부터의 HA를 지칭한다.
발명의 상세한 설명
인플루엔자 바이러스는 세계 공중 보건에 상당한 영향을 미치며, 매년 수백만의 중증의 병의 사례, 수천의 사망 및 상당한 경제적 손실을 야기한다. 현행의 3가 인플루엔자 백신은 백신 균주 및 밀접하게 관련된 분리주에 대하여 강력한 중화 항체 반응을 유도하나, 서브타입 내의 더욱 분기된 균주 또는 다른 서브타입으로 드물게 확대된다. 또한, 적절한 백신 균주의 선택은 많은 시험감염을 제시하며, 준최적의 보호를 빈번하게 야기한다. 추가로, 다음의 유행성 바이러스가 발생할 시기와 장소를 비롯하여 이의 서브타입을 예측하는 것은 현재 불가능하다.
적혈구응집소(HA)는 중화 항체의 주요 표적인 인플루엔자 A 바이러스 유래의 주요 외피 당단백질이다. 적혈구응집소는 유입 과정 동안 2가지 주요 기능을 갖는다. 먼저, 적혈구응집소는 시알산 수용체와의 상호작용을 통하여 표적 세포의 표면으로의 바이러스의 부착을 매개한다. 두 번째로, 바이러스의 세포내 이입 후에, 적혈구응집소는 이후에 바이러스 및 엔도솜 막의 융합을 매개하여, 이의 게놈을 표적 세포의 세포질로 방출시킨다.
HA는 단량체당 약 500개 아미노산의 엑토도메인을 포함하는 삼량체 단백질이고, 각각이 이황화 결합에 의해 연결되는 2개의 폴리펩티드 HA1 및 HA2를 함유하는 세 개의 동일한 서브유닛(단량체)을 포함한다. 각각의 단량체는 처음에는 HA0으로 발현되고, 이후에 상기 이황화 결합을 통해 연결되는 HA1 및 HA2 도메인으로 숙주 프로테아제에 의해 절단된다.
대다수의 N 말단 도메인(HA1 도메인, 약 320 내지 330개 아미노산 길이)은 바이러스 중화 항체에 의해 인식되는 대부분의 결정기 및 수용체 결합 부위를 함유하는 막-원위 구형 도메인(헤드 도메인)을 형성한다. 보다 작은 C 말단 도메인(HA2 도메인, 약 180개 아미노산 길이)은 세포 또는 바이러스 막에 구형 도메인을 고정시키는 스템 유사 구조(스템 도메인)를 형성한다. 가장 보존된 영역 중 하나는 절단 부위 근처의 서열, 특히 HA2 N 말단의 23개 아미노산(융합 단백질)이며, 이는 모든 인플루엔자 A 바이러스 서브타입 간에 보존된다. 이러한 영역의 부분은 HA 전구체 분자(HA0)에서 표면 루프로서 노출되나, HA0이 HA1 및 HA2로 절단되는 경우 접근 불가능하게 된다.
위에서 언급한 바와 같이, 인플루엔자 HA 단백질은 바이러스의 표면에서 발견되는 주요한 단백질이다. 비리온의 표면에서 발견되는 HA는 삼량체 형태이다. 삼량체는 3개의 단량체 각각의 카복시 말단 단부의 막횡단 스패닝 서열에 의해 바이러스 막에 고정되어 있다. 인플루엔자 백신의 주요 보호 효능은 HA 단백질에 대한 항-헤마글루티닌 항체에 기인한다. 이는 구조적으로 관련 있는 HA 단백질에 대한 면역 반응을 높이는 것의 중요성을 강조한다.
인플루엔자 A 바이러스 적혈구응집소(HA0)의 엑토도메인을 나타내는 가용성 폴리펩티드를 제조하기 위하여, HA는 고유의 막횡단 및 세포질 도메인 없이 발현될 필요가 있다. 안정적인 삼량체형 가용성 야생형(WT) HA의 발현은 종종 포유동물 세포에서 매우 빈약하다. 적어도 삼량체화 수준을 개선하기 위하여, 이종 삼량체화 도메인(예를 들어, 폴드온 삼량체화 도메인; 문헌[Stevens et al. Science 303(5665):1866-1870, 2004])은 종종 폴리펩티드의 C 말단에 유전적으로 융합된다. 불행히도, 이종 삼량체화 도메인의 부가는 원치 않는 네오에피토프를 도입하고 종종 발현 수준을 감소시키거나, 또는 폴리펩티드의 4차 구조를 변경할 수 있다.
본 발명은 안정적인 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드로서, 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 W이고;
(b) 위치 432의 아미노산은 I이고/이거나, 위치 380의 아미노산은 I인 아미노산 서열을 포함하고;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 HA 폴리펩티드를 제공한다.
본 발명에 따르면, 놀랍게도 안정적인 재조합 HA 폴리펩티드, 특히 가용성 HA 삼량체 폴리펩티드가 HA 폴리펩티드의 코어에서 특정 아미노산 돌연변이의 존재에 의해, 폴드온 도메인 또는 임의의 다른 이종 삼량체화 도메인의 부가 없이, 수득될 수 있음이 밝혀졌다.
따라서, 특정 양태에서, 본 발명은 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드에 관한 것으로, 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 W로 돌연변이되고;
(b) 위치 432의 아미노산은 I로 돌연변이되고/되거나, 위치 380의 아미노산은 I로 돌연변이된 아미노산 서열을 포함하고;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 HA 폴리펩티드를 제공한다. 돌연변이는 "매립형" 돌연변이이기 때문에, 즉 이들 잔기의 측쇄가 단백질 표면에 노출되지 않으므로, HA 폴리펩티드의 항원성은 바뀌지 않을 것이다.
특정 구현예에서, 폴리펩티드는 위치 355의 아미노산, 특히 히스티딘(H)의 트립토판(W)으로의 돌연변이 및 위치 432 및/또는 380의 아미노산의 이소류신(I)으로의 돌연변이를 포함한다.
예를 들어, 위치 432의 아미노산의 I로의 돌연변이를 도입함으로써 위치 432의 아미노산 I와 조합하여, 예를 들어, 위치 355의 아미노산, 특히 H의 W로의 돌연변이를 도입함으로써 위치 355에 아미노산 잔기 W를 갖거나; 또는 예를 들어, 위치 432 및 380에서 I로의 돌연변이를 도입함으로써 위치 432의 I와 위치 380의 I의 조합을 갖는 본 발명의 HA 폴리펩티드는 이러한 아미노산 돌연변이가 없는 HA 폴리펩티드와 비교하여 포유동물 세포에서 증가된 발현 수준, 삼량체화 경향 증가(예를 들어, AlphaLISA, Octet 및 SEC에 의해 측정됨), 및/또는 증가된 수준의 열 안정성(예를 들어, 동적 주사 형광측정법/열량측정법(DSF/DSC)에 의해 측정됨)을 보여준다. 또한, 본 발명의 폴리펩티드에 대한 시험한 모든 항체의 결합 강도는 (Octet 및 ELISA에 의해 측정 시) 5 nM 미만이다. 이는 폴리펩티드가 (1차, 2차, 3차 및 4차 구조와 관련하여) 천연의 야생형 HA와 구조적으로 동일함을 분명히 보여준다. 신규한 HA 폴리펩티드는 나아가 링커, 태그, 또는 삼량체화 도메인 서열과 같은 임의의 인공 (이종) 서열의 존재를 필요로 하지 않는다.
특정 구현예에서, 폴리펩티드는 위치 355의 아미노산, 특히 히스티딘(H)의 트립토판(W)으로의 돌연변이 및 위치 432 및/또는 380의 아미노산의 이소류신(I)으로의 돌연변이를 포함한다.
특정 구현예에서, HA 폴리펩티드는
(a)
위치 388의 아미노산은 M이고/이거나;
(b)
위치 478의 아미노산은 I인 아미노산 서열을 포함한다.
이러한 돌연변이는 적어도 특정 HA 서브타입에서 HA 폴리펩티드의 안정성을 추가로 증가시키는 것으로 나타났다.
특정 구현예에서, 상기 HA 단량체는 프로테아제 절단 부위를 포함하지 않는다. 위에 기술된 바와 같이, (HA1 및 HA2의) 인플루엔자 HA0 단백질의 절단은 이의 활성에 요구되어, 숙주 엔도솜 막의 바이러스 막과의 융합을 초래하여 바이러스 게놈의 표적 세포로의 진입을 촉진한다. 특정 구현예에서, 본 발명의 폴리펩티드는 천연 프로테아제 절단 부위를 포함한다. 따라서, HA1 및 HA2에 걸쳐 이어지는 Arg(R) 내지 Gly(G) 서열(즉, 아미노산 위치 329 및 330)은 트립신 및 트립신 유사 프로테아제의 인식 부위이고, 전형적으로 적혈구응집소 활성화를 위하여 절단된다(도 1의 A). 특정 구현예에서, 프로테아제 절단 부위는 위치 329의 아미노산 잔기의 아르기닌(R) 또는 리신(K) 이외의 임의의 아미노산으로의 돌연변이에 의해 제거되었다. 특정 구현예에서, 위치 329의 아미노산 잔기는 아르기닌(R)이 아니다. 바람직한 구현예에서, 폴리펩티드는 위치 329의 아미노산의 글루타민(Q)으로의 돌연변이를 포함한다. 따라서, 특정 구현예에서, 본 발명의 폴리펩티드는 투여 후에 시험관 내 또는 생체 내에서 생산되는 동안 또는 그 이후에 분자의 추정상의 절단을 방지하기 위하여 절단 부위 녹아웃 돌연변이 R329Q를 포함한다. 이로써, 절단 부위 녹아웃 돌연변이, 예를 들어, R329Q 돌연변이는 낮은 pH에 의해 유발된 구조 변화에 대한 무감응을 보장하고 HA의 융합 전 구조를 보존한다.
본 발명에 따르면, HA1 및/또는 HA2 도메인은 인플루엔자 HA 폴리펩티드의 완전한(즉, 전장) HA1 및/또는 HA2 도메인을 포함할 수 있거나, 또는 이들은 HA1 및/또는 HA2 도메인의 적어도 일부를 포함할 수 있다.
분비된 (가용성) HA 폴리펩티드를 생성하기 위하여, 특정 구현예에서 HA 단량체는 절단된 HA2 도메인을 포함한다. 따라서, 특정 구현예에서 본 발명의 폴리펩티드 내의 HA 단량체는 막횡단 및 세포질 도메인을 포함하지 않는다. 특히, 특정 구현예에서, 폴리펩티드 단량체는 C 말단 단부에서 절단된 HA2 도메인을 포함한다. 따라서, 본 발명에 따른 절단된 HA2 도메인은 HA2 도메인의 C 말단 및/또는 N 말단 단부에서 하나 이상의 아미노산 잔기의 결실에 의해 전장 HA2 서열보다 짧다. 따라서, 본 발명은 또한 HA의 세포외 도메인(엑토도메인, ECD)을 포함하거나 이로 구성된 재조합 HA 폴리펩티드를 제공한다.
특정 구현예에서, 위치 515의 아미노산에 해당하는 아미노산으로 시작하는 HA2 도메인의 C 말단 부분이 결실되어, 실질적으로 전체 막횡단 및 세포질 도메인이 제거되었다.
특정 구현예에서, 엑토도메인의 C 말단의 하나 이상의 아미노산이 또한 결실되었다. 본 발명에 따르면, HA2 도메인의 더 큰 부분이 결실되는 경우에도, 안정적인 가용성 및 삼량체 HA 폴리펩티드가 제공될 수 있음이 밝혀졌다. 따라서, 특정 구현예에서, 위치 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 또는 514의 아미노산 서열에서 시작하는 HA2 도메인의 C 말단 부분이 결실되어(위의 문헌[Winter et al.]에서 기술된 바와 같은 H3 넘버링을 따름) 세포에서 발현된 후 가용성 폴리펩티드를 생성하였다.
유사하게, HA1 도메인은 완전할 수 있거나(즉, 전장 HA1 도메인), 또는 이의 적어도 일부일 수 있다. 특정 구현예에서, 폴리펩티드는 절단된 HA1 도메인을 포함한다. HA1 도메인은 HA1 도메인의 N 및/또는 C 말단 단부에서 절단될 수 있다.
특정 구현예에서, HA 폴리펩티드는 신호 서열을 포함하지 않는다. 신호 서열(때때로 신호 펩티드, 표적화 신호, 국소화 신호, 국소화 서열, 전달 펩티드, 리더 서열 또는 리더 펩티드라고도 함)은 분비 경로로 향하는 새로 합성된 단백질의 대부분의 N 말단에 존재하는 짧은 펩티드(보통 16 내지 30개 아미노산의 길이)이다. 신호 서열은 세포가 단백질, 일반적으로 세포막으로 전위하도록 자극하는 기능을 한다. 많은 경우에, 신호 펩티드를 포함하는 아미노산은 최종 목적지에 도달하면 단백질로부터 절단된다. 인플루엔자 HA에서, 신호 서열은 전형적으로 (H3 넘버링에 따라 위치 -6에서 위치 10까지의 아미노산에 해당하는) 전장 HA0의 아미노산 서열의 처음 16개의 아미노산을 포함한다.
또한, 본 발명은 HA 폴리펩티드의 면역원성 단편을 제공한다. 특정 구현예에서, 헤드 도메인을 구성하는 HA1 도메인의 적어도 일부가 결실되어 본 발명의 HA 폴리펩티드의 면역원성 단편, 예컨대, 무 헤드 HA 폴리펩티드(즉, 스템 단독 폴리펩티드)를 제공할 수 있다.
본 발명의 폴리펩티드는 인플루엔자 A 바이러스의 인플루엔자 바이러스 적혈구응집소(HA)를 나타낸다(그로부터 유래된다). 전술한 바와 같이, 인플루엔자 A는 그룹 1과 그룹 2의 두 개의 주요 그룹으로 나눌 수 있는 여러 서브타입의 HA를 함유한다(도 2a). 본 발명의 폴리펩티드의 안정화 돌연변이는 인플루엔자 A의 모든 적혈구응집소 유형에 적용될 수 있다.
특정 구현예에서, HA1 및 HA2 도메인은 그룹 1 또는 그룹 2 인플루엔자 A 바이러스로부터 유래된다. 특정 구현예에서, HA1 및 HA2 도메인은 동일한 그룹 1 또는 그룹 2 바이러스로부터 유래된다. 특정한 다른 구현예에서, HA1 및 HA2 도메인은 상이한 그룹 1 또는 상이한 그룹 2 바이러스로부터 유래되거나, 또는 HA1 및 HA2 도메인은 상이한 그룹으로부터의 인플루엔자 A 바이러스로부터 유래된다. 예를 들어, HA2 도메인은 그룹 1 바이러스로부터 유래되고, HA2 도메인은 그룹 2 바이러스로부터 유래되거나, 또는 그 반대의 경우도 마찬가지이다. 특정 구현예에서, 헤드 도메인(즉, 헤드 도메인을 형성하는 HA1 도메인의 적어도 일부는 스템 도메인과 상이한 인플루엔자 바이러스(즉, 인플루엔자 HA 폴리펩티드의 스템 도메인을 형성하는 HA2 도메인의 일부)로부터 유래된다.
일부 특정 구현예에서, HA1 및/또는 HA2 도메인은 다음으로 구성된 군으로부터 선택되는 인플루엔자 A 바이러스로부터 유래된다: H1 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/캘리포니아/07/2009 또는 A/미시간/45/2015; H2 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/Env/MPU3156/2005; H5 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/붉은머리오리(Eurasian Wigeon)/MPF461/2007; H9 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/홍콩/1073/1999; H3 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 H/홍콩/1/1968 또는 A/파나마/2007/1999; H14 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/청둥오리(Mallard)/아스트라칸/263/1982; H7 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/청둥오리/네덜란드/12/2000; 및 H10 서브타입의 HA를 포함하는 인플루엔자 바이러스, 예를 들어, 인플루엔자 바이러스 A/닭/독일/N/1949. 당업자는 본 발명의 폴리펩티드가 또한 그룹 1 또는 그룹 2로부터의 다른 인플루엔자 A 바이러스 균주의 HA로부터 유래될 수 있음을 이해할 것이다.
특정 바람직한 구현예에서, HA 서브타입(즉, 그룹 1 또는 그룹 2)에 따라 HA 폴리펩티드 또는 이의 면역원성 단편은 결합 분자 CR9114, CR6261, CR8020 및/또는 MD3606에 결합한다. 따라서, (서열번호 2의 중쇄 가변 영역 및 서열번호 3의 경쇄 가변 영역을 포함하는) 항체 CR6261 및/또는 (서열번호 6의 중쇄 가변 영역 및 서열번호 7의 경쇄 가변 영역을 포함하는) 항체 CR9114, 및/또는 (서열번호 4의 중쇄 가변 영역 및 서열번호 5의 경쇄 가변 영역을 포함하는) 항체 CR8020 및/또는 다중도메인 항체 MD3606(서열번호 8)의 특정 에피토프를 제시하는 신규한 HA 폴리펩티드가 제공된다. 본 발명의 폴리펩티드는 단독으로 또는 다른 예방적 및/또는 치료적 처치와 조합하여 생체 내 투여될 때 인플루엔자 바이러스 중화 항체를 유도하는 데 사용될 수 있다.
특정 구현예에서, 본 발명의 HA 폴리펩티드 또는 이의 면역원성 단편은 예를 들어, 중합체, 리포솜, 바이로솜, 바이러스 유사 입자, 또는 자가 조립 나노입자와 같은 나노입자에 연결된다. 폴리펩티드는 나노입자와 조합되거나, 나노입자 내에 캡시드화되거나 나노입자에 접합(예를 들어, 공유적으로 연결되거나 흡착)될 수 있다.
본 발명은 추가로 전술한 바와 같이 적어도 2개의 HA 폴리펩티드 또는 이의 면역원성 단편을 포함하는 다량체 폴리펩티드를 제공한다.
특정 바람직한 구현예에서, 다량체 폴리펩티드는 삼량체이고, 전술한 바와 같이 3개의 HA 폴리펩티드 또는 이의 면역원성 단편을 포함한다.
따라서, 특정 구현예에서, 본 발명은 안정화된 재조합 안정화된 삼량체 인플루엔자 A 적혈구응집소(HA) 폴리펩티드 또는 이의 면역원성 단편을 제공하며, 상기 폴리펩티드는 3개의 HA 단량체를 포함하고, 상기 HA 단량체는 각각 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 W이고;
(b) 위치 432의 아미노산은 I이거나, 또는 위치 432의 아미노산이 I이고 위치 380의 아미노산이 I인 아미노산 서열을 포함하며;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따른다.
위에 언급한 바와 같이, 본 발명에 따르면, 예를 들어, 위치 432의 아미노산의 I로의 돌연변이를 도입함으로써 위치 432의 아미노산 I와 조합하여, 예를 들어, 위치 355의 아미노산의 W로의 돌연변이를 도입함으로써 위치 355에 아미노산 잔기 W를 가짐으로써; 또는 예를 들어, 위치 432 및 380에서 I로의 돌연변이를 도입함으로써 위치 432의 I와 위치 380의 I의 조합을 가짐으로써, 안정적인 HA 삼량체의 발현 수준 및 삼량체화 둘 다가 증가될 수 있음이 밝혀졌다. 따라서, 본 발명의 HA 폴리펩티드는 이러한 아미노산 돌연변이가 없는 HA 폴리펩티드와 비교하여 포유동물 세포에서 증가된 발현 수준, 삼량체화 경향 증가(예를 들어, AlphaLISA, Octet 및 SEC에 의해 측정됨), 및/또는 증가된 수준의 열 안정성(예를 들어, 동적 주사 형광측정법/열량측정법(DSF/DSC)에 의해 측정됨)을 보여준다.
특정 구현예에서, 본 발명의 HA 폴리펩티드는 40℃에서 적어도 3일 동안 안정적이다.
본 발명은 또한 본 발명의 인플루엔자 HA 폴리펩티드 또는 이의 면역원성 단편을 암호화하는 핵산 분자를 제공한다. 유전자 코드의 축중(degeneracy)의 결과로 다수의 상이한 핵산 분자가 동일한 폴리펩티드를 암호화할 수 있다는 점은 당업자에 의해 이해된다. 당업자는 폴리펩티드가 발현될 임의의 특정한 숙주 유기체의 코돈 활용을 반영하기 위하여 통상적인 기법을 이용하여 기재된 폴리뉴클레오티드에 의하여 암호화된 폴리펩티드 서열에 영향을 주지 않는 뉴클레오티드 치환을 제조할 수 있다는 것도 알고 있다. 따라서, 달리 명시되지 않는 한, "아미노산 서열을 암호화하는 핵산 분자"는 서로의 축중 형태이면서 동일한 아미노산 서열을 암호화하는 모든 뉴클레오티드 서열을 포함한다.
특정 구현예에서, 인플루엔자 HA 폴리펩티드 또는 이의 면역원성 단편을 암호화하는 핵산 분자는 인간 세포와 같은 포유동물 세포에서의 발현을 위하여 코돈 최적화된다. 코돈 최적화의 방법은 공지되어 있고 이전에 기술된 바 있다(예를 들어, WO 96/09378).
본 발명은 추가로 위에서 정의된 바와 같은 재조합 HA 폴리펩티드 또는 이의 면역원성 단편을 제조하는 방법을 제공하며, 이는 원핵생물(예를 들어, 대장균) 또는 진핵생물 세포(예를 들어, 포유동물 세포, 예컨대, CHO 또는 PER.C6)에서 전술한 핵산 분자를 발현시키는 단계를 포함하고, 상기 방법은 상기 세포로부터 재조합 HA 폴리펩티드 또는 이의 면역원성 단편을 정제/단리하는 단계를 선택적으로 포함한다. 재조합 인플루엔자 HA 폴리펩티드 또는 이의 면역원성 단편은 본원에 기술된 기술을 포함하여, 당업자에게 재조합 폴리펩티드를 제조하기에 적합한 것으로 간주되는 임의의 기술에 따라 제조될 수 있다. 따라서, 본 발명의 폴리펩티드는 당해 분야에 공지되어 있는 표준 방법에 의해 DNA 서열로서 합성되고, 적합한 제한 효소와 당해 분야에 공지되어 있는 방법을 사용하여 클로닝되고, 이후에 시험관 내 또는 생체 내에서 발현될 수 있다. 본 발명의 HA 폴리펩티드 또는 이의 면역원성 단편을 암호화하는 뉴클레오티드 서열은 당업자에게 잘 알려진 기술에 따라 합성 및/또는 클로닝 및 발현될 수 있다. 예를 들어, 문헌[Sambrook, et al, Molecular Cloning, A Laboratory Manual, Vols. 1-3, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1989)] 참조. 인플루엔자 백신을 제조하기 위한 재조합 DNA 기술의 사용은 몇 가지 이점을 제공한다. 여기에는 달걀에서 감염성 바이러스의 적응 및 통과 단계를 피하는 것과 더 안전하고 엄격하게 통제되는 조건 하에서 더 고도로 정제된 단백질의 생산이 포함된다. 더욱이, 바이러스 불활성화 단계를 포함할 필요가 없다. 임의의 적합한 클로닝 및 발현 시스템을 사용하여 본 발명의 HA 폴리펩티드를 재조합적으로 생산할 수 있다.
바람직한 구현예에서, 폴리펩티드 또는 이의 면역원성 단편은 포유동물 세포에서 생산된다. 특정 구현예에서, 폴리펩티드는 적합한 세포(예를 들어, 포유류 세포)에서 발현될 때 글리코실화된다. 따라서, 폴리펩티드는 1개 이상의 천연 및/또는 도입된(즉, 비 천연) 글리코실화 모티프를 함유할 수 있다.
적혈구응집소 서열은 중합효소 연쇄 반응(PCR) 또는 역전사효소 PCR, 역공학과 같은 당해 분야에 공지된 표준 재조합 방법에 의해 생산될 수 있거나, 또는 DNA가 합성될 수 있다. PCR의 경우, 공개적으로 이용 가능한 데이터베이스에서 이용할 수 있는 적혈구응집소 뉴클레오티드 서열을 사용하여 프라이머를 준비할 수 있다. 폴리뉴클레오티드 구축물은 PCR 카세트로부터 조립될 수 있고 숙주 세포에서 증식을 위한 선택 가능한 마커를 함유하는 벡터로 순차적으로 클로닝될 수 있다. 그런 다음, 재조합 벡터는 주사, 형질감염 또는 전기천공 또는 기타 방법(예를 들어, 인산칼슘 형질감염, DEAE-덱스트란 매개 형질감염, 양이온성 지질 매개 형질감염, 전기천공)에 의해 숙주 세포 내로 도입될 수 있다. 리포펙타민(Invitrogen, 미국 캘리포니아 주 칼스배드 소재)과 같은 상업적인 형질감염 시약도 이용 가능하다.
HA 폴리펩티드 또는 이의 면역원성 단편은 음이온 및/또는 양이온 교환 크로마토그래피, 친화도 크로마토그래피를 포함하는 당해 분야에 공지된 방법에 의해 재조합 세포 배양물로부터 회수 및 단리/정제될 수 있다. SDS-PAGE와 같은 기술을 사용하여 이러한 분리/정제 기술에서 용리된 단백질 분획을 분석할 수 있다. 이러한 방법은 당업자에게 잘 알려져 있으므로, 여기에서는 상세하게 제시하지 않을 것이다. 또한, 정제된 폴리펩티드를 당해 분야에 공지된 분광학적 방법(예를 들어, 원편광 분광법, 푸리에 변환 적외 분광법 및 NMR 분광법 또는 X-선 결정학)에 의해 분석하여, 헬릭스 및 베타 시트와 같은 요망되는 구조의 존재를 조사할 수 있다. ELISA, AlphaLISA, 바이오층 간섭법(Octet) 및 FACS 등을 사용하여, CR6261 및/또는 CR9114와 같은 광범위 중화 항체에 대한 본 발명의 폴리펩티드의 결합을 조사할 수 있다. 따라서, 올바른 형태를 갖는 본 발명에 따른 폴리펩티드가 선택될 수 있다. 삼량체 함량은 예를 들어, 비 환원 조건 하에서의 SDS 겔 전기영동, CR6261 및/또는 CR9114와 같은 광범위 중화 항체의 항체 Fab 단편의 존재 하에서의 크기 배제 크로마토그래피 및 차등 표지된 항체를 이용한 AlphaLISA를 이용하여 분석할 수 있다. 폴리펩티드의 안정성은 온도 스트레스, 냉동-해동 주기, 증가된 단백질 농도, 또는 교반 후에 위에 기술된 바와 같이 평가할 수 있다. 폴리펩티드의 용융 온도는 시차 주사 형광측정법(DSF)에 의해 추가로 평가할 수 있다.
일부 구현예에서 본 발명은 서열번호 10, 서열번호 12, 서열번호 14, 서열번호 16, 서열번호 18, 서열번호 20, 서열번호 22, 서열번호 24, 서열번호 26, 서열번호 28, 서열번호 33, 서열번호 34, 서열번호 35, 서열번호 36, 서열번호 38, 서열번호 40, 서열번호 42, 서열번호 44, 서열번호 47, 서열번호 50, 서열번호 51 및 서열번호 52로 구성된 군으로부터 선택되는 인플루엔자 HA 아미노산 서열 중 임의의 하나 또는 이러한 아미노산 서열과 적어도 약 40% 또는 50% 또는 60% 또는 65% 또는 70% 또는 75% 또는 80% 또는 85% 또는 90% 또는 95% 또는 98% 또는 99%의 동일성을 갖는 이의 임의의 변이체 또는 단편으로부터 유래되거나, 이를 포함하거나, 이로 구성되는 재조합 인플루엔자 HA 폴리펩티드를 제공하며, 여기서, 인플루엔자 HA 폴리펩티드는 위치 355에 트립토판(W) 및 위치 432 및/또는 380에 이소류신(I)을 포함하고, 아미노산 넘버링은 서열번호 1의 서열에 기초하거나, 예를 들어 서열번호 1에 대한 HA 아미노산 서열의 정렬에 의해 결정된 바와 같이, 이러한 아미노산 위치에 상응하는 아미노산 위치에서 이루어진다.
특정 구현예에서 본 발명은 서열번호 10의 아미노산 잔기 18 내지 518, 서열번호 12의 아미노산 잔기 18 내지 518, 서열번호 14의 아미노산 잔기 16 내지 514, 서열번호 16의 아미노산 잔기 17 내지 516, 서열번호 18의 아미노산 잔기 19 내지 512, 서열번호 20의 아미노산 잔기 17 내지 521, 서열번호 22의 아미노산 잔기 17 내지 521, 서열번호 24의 아미노산 잔기 18 내지 523, 서열번호 26의 아미노산 잔기 19 내지 515, 서열번호 28의 아미노산 잔기 17 내지 515, 서열번호 33의 아미노산 잔기 17 내지 521, 서열번호 34의 아미노산 18 내지 518, 서열번호 35의 아미노산 18 내지 518, 서열번호 36의 아미노산 18 내지 517, 서열번호 38의 아미노산 18 내지 518, 서열번호 40의 아미노산 17 내지 521, 서열번호 42의 아미노산 17 내지 521, 서열번호 44의 아미노산 17 내지 521, 서열번호 47의 아미노산 17 내지 519, 서열번호 50의 아미노산 17 내지 521, 서열번호 51의 아미노산 19 내지 515, 또는 서열번호 52의 아미노산 17 내지 514로부터 유래되거나, 이를 포함하거나, 이로 구성되는 재조합 인플루엔자 HA 폴리펩티드를 제공한다.
특정 구현예에서, HA 폴리펩티드는 서열번호 10, 서열번호 12, 서열번호 14, 서열번호 16, 서열번호 18, 서열번호 20, 서열번호 22, 서열번호 24, 서열번호 26, 서열번호 28, 서열번호 33, 서열번호 34 or 서열번호 35, 서열번호 36, 서열번호 38, 서열번호 40, 서열번호 42, 서열번호 44, 서열번호 47, 서열번호 50, 서열번호 51 및 서열번호 52로부터 유래되거나, 이를 포함하거나, 이로 구성된 아미노산 서열을 포함한다.
본 발명은 추가로 본 발명의 HA 폴리펩티드 또는 이의 면역원성 단편을 암호화하는 핵산 분자를 포함하는 벡터에 관한 것이다.
특정 구현예에서, 벡터는 인간 재조합 아데노바이러스이다. 따라서, 본 발명은 본 발명에 따른 HA 폴리펩티드 또는 이의 면역원성 단편을 암호화하는 핵산 분자를 포함하는 재조합 아데노바이러스 벡터를 제공한다. 재조합 아데노바이러스 벡터는 막결합 HA를 암호화할 수 있고, 따라서 HA2 도메인을 포함하고, 막횡단 및 세포질 도메인을 포함하는 HA 폴리펩티드를 암호화할 수 있다. 아데노벡터는 또한 가용성 폴리펩티드를 암호화할 수 있고, 따라서 절단된 HA2 도메인을 포함하는 HA 폴리펩티드를 암호화할 수 있다.
재조합 아데노바이러스 벡터의 제조는 당해 분야에 잘 알려져 있다. 본 명세서에서 사용되는 바와 같이, 아데노바이러스에 대한 '재조합'이라는 용어는 아데노바이러스가 인간의 손에 의해 변형되었고, 예컨대, 아데노바이러스가 그 안에 활발하게 클로닝된 변경된 말단부를 가지고/가지거나 이종 유전자를 포함함, 즉, 자연 발생적인 야생형 아데노바이러스가 아님을 내포한다. 특정 구현예에서, 본 발명에 따른 아데노바이러스 벡터는 바이러스 복제에 요구되는 아데노바이러스 게놈의 E1 영역, 예컨대, E1a 영역 및/또는 E1b 영역의 적어도 하나의 필수적인 유전자 기능이 결핍되어 있다. 특정 구현예에서, 본 발명에 따른 아데노바이러스 벡터는 필수적이지 않은 E3 영역의 적어도 일부가 결핍되어 있다. 특정 구현예에서, 벡터는 E1 영역의 적어도 하나의 필수 유전자 기능 및 필수적이지 않은 E3 영역의 적어도 일부가 결핍되어 있다. 아데노바이러스 벡터는 "다중적으로 결핍"될 수 있는데, 이는 아데노바이러스 벡터가 아데노바이러스 게놈의 둘 이상의 영역들 각각에서 하나 이상의 필수적인 유전자 기능이 결핍되어 있음을 의미한다. 예를 들면, 위에서 언급된 E1-결핍된 또는 E1-, E3-결핍된 아데노바이러스 벡터는 E4 영역의 적어도 하나의 필수적인 유전자 및/또는 E2 영역(예컨대, E2A 영역 및/또는 E2B 영역)의 적어도 하나의 필수적인 유전자가 더 결핍되어 있을 수 있다. 아데노바이러스 벡터, 이의 구축 방법 및 이의 증식 방법은 당해 분야에 잘 알려져 있고, 예를 들어, 미국 특허 번호 5,559,099, 5,837,511, 5,846,782, 5,851,806, 5,994,106, 5,994,128, 5,965,541, 5,981,225, 6,040,174, 6,020,191, 및 6,113,913에 기술되어 있다.
특정 구현예에서, 아데노바이러스는 혈청형 26의 인간 아데노바이러스이다.
본 발명은 본 발명에 따른 HA 폴리펩티드, 이의 면역원성 단편, 핵산, 및/또는 벡터, 및 약제학적으로 허용 가능한 담체를 포함하는 면역원성 조성물을 추가로 제공한다. 본 발명은 특히 치료적 유효량의 본 발명의 폴리펩티드, 면역원성 단편, 핵산, 및/또는 벡터를 포함하는 약제학적 조성물에 관한 것이다. 약제학적 조성물은 약제학적으로 허용 가능한 담체를 더 포함한다. 본 발명의 문맥에서, 용어 "약제학적으로 허용 가능한"은 담체가, 사용되는 투여량 및 농도에서, 이들이 투여되는 대상체에게 원치 않는 또는 유해한 효과를 야기하지 않을 것을 의미한다. 이러한 약제학적으로 허용 가능한 담체 및 부형제는 당해 분야에 잘 알려져 있다(문헌[Remington's Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack Publishing Company [1990]]; 문헌[Pharmaceutical Formulation Development of Peptides and Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis [2000]]; 및 문헌[Handbook of Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., Pharmaceutical Press [2000]] 참조). 용어 "담체"는 폴리펩티드, 핵산, 및/또는 벡터와 함께 투여되는 희석제, 부형제, 또는 비히클을 지칭한다. 염수 용액 및 덱스트로스 및 글리세롤 수용액이 특히 주사 가능한 용액을 위한 액체 담체로서 사용될 수 있다.
본 발명은 또한 의약으로서의 사용을 위한, 본 명세서에서 기술된 HA 폴리펩티드, 면역원성 단편, 핵산, 및/또는 벡터에 관한 것이다. 본 발명은 특히 인플루엔자 바이러스에 대해, 특히 인플루엔자 바이러스의 HA 분자에 대해, 바람직하게는 중화 항체를 이끌어내는 것을 포함하여, 면역 반응을 유도하는 데 사용하기 위한, 본 명세서에서 기술된 HA 폴리펩티드, 핵산, 및/또는 벡터에 관한 것이다. 바람직한 구현예에서, 본 발명은 인플루엔자 백신으로서의 사용을 위한, 본 명세서에서 기술된 HA 폴리펩티드, 면역원성 단편, 핵산, 및/또는 벡터에 관한 것이다.
본 발명은 또한 필요로 하는 대상체에서 인플루엔자 A 바이러스에 대한 면역 반응의 유도 방법, 특히 항체를 이끌어내는 방법에 관한 것으로, 방법은 상기 대상체에게 위에서 기술된 HA 폴리펩티드, 면역원성 단편, 핵산 분자 및/또는 벡터를 투여하는 단계를 포함한다. 본 발명에 따른 대상체는 바람직하게는 인플루엔자 바이러스에 감염될 수 있거나, 다르게는 인플루엔자 바이러스에 대한 면역 반응의 유도로부터 이익을 얻을 수 있는 포유류이며, 예를 들어, 이러한 대상체는 설치류, 예를 들어, 마우스, 페럿, 애완동물 또는 가축, 또는 비 인간 영장류 또는 인간이다. 바람직하게는, 대상체는 인간 대상체, 예컨대, 인플루엔자 질환 감염 위험이 있는 것으로 확인된 사람이다.
특정 구현예에서, 본 발명의 HA 폴리펩티드, 면역원성 단편, 핵산 분자 및/또는 벡터는 보강제와 조합하여 투여된다. 보강제는 본 발명의 폴리펩티드, 핵산 분자 및/또는 벡터의 투여 전에, 투여와 동시에, 또는 투여 후에 투여될 수 있다. 적합한 보강제의 예에는 알루미늄 염, 예를 들어, 수산화알루미늄 및/또는 인산알루미늄; 스쿠알렌-물 에멀젼, 예를 들어, MF59를 비롯한 오일-에멀젼 조성물(또는 수중유 조성물)(예를 들어, WO 90/14837 참조); 사포닌 제형, 예를 들어, QS21 및 면역자극 복합체(ISCOMS)(예를 들어, US 5,057,540; WO 90/03184, WO 96/11711, WO 2004/004762, WO 2005/002620 참조); 예로서, 모노포스포릴 지질 A(MPL), 3-O-데아실화 MPL(3dMPL), CpG-모티프 함유 올리고뉴클레오티드, ADP-리보실화 박테리아 독소 또는 이의 돌연변이체, 예를 들어, E. 콜라이 이열성 장독소 LT, 콜레라 독소 CT, 백일해 독소 PT 또는 파상풍 톡소이드 TT, 매트릭스 M, 또는 이의 조합이 있는, 박테리아 또는 미생물 유도체가 포함된다. 또한, 공지되어 있는 면역증강 기술, 예를 들어, 본 발명의 폴리펩티드를 면역 반응을 향상시키는 것으로 당해 분야에 공지되어 있는 단백질(예를 들어, 파상풍 톡소이드, CRM197, rCTB, 박테리아 플라젤린 또는 기타의 것)에 융합시키는 것, 또는 폴리펩티드를 바이로솜에 포함시키는 것, 또는 이의 조합이 사용될 수 있다. 또한, 예를 들어, 동일한 아데노벡터에 의해 공동 전달되거나 암호화되는 유전적 보강제가 사용될 수 있다.
본 발명에 따른 HA 폴리펩티드, 면역원성 단편, 핵산 분자, 및/또는 벡터의 투여는 표준 투여 경로를 사용하여 수행될 수 있다. 비 제한적인 예에는 비경구 투여, 예를 들어, 정맥내, 피내, 경피, 근육내, 피하 등 또는 점막 투여, 예를 들어, 비내, 구강 등이 포함된다. 당업자는 면역 반응을 유도하기 위하여 본 발명에 따른 폴리펩티드, 핵산 분자 및/또는 벡터를 투여하는 다양한 가능성을 결정할 수 있을 것이다.
본 발명은 추가로 필요로 하는 대상체에서 인플루엔자 바이러스 질환의 예방 및/또는 치료, 바람직하게는 예방 방법을 제공하며, 방법은 상기 대상체에게 본 명세서에 기술된 치료적 유효량의 HA 폴리펩티드, 면역원성 단편, 핵산 분자 및/또는 벡터를 투여하는 단계를 포함한다. 치료적 유효량은 인플루엔자 바이러스에 의한 감염으로부터 야기되는 질환 또는 병태를 예방, 완화 및/또는 치료하는 데 유효한 폴리펩티드, 면역원성 단편, 핵산 분자, 및/또는 벡터의 양을 지칭한다. 예방은 인플루엔자 바이러스의 확산의 억제 또는 감소, 또는 인플루엔자 바이러스에 의한 감염과 관련된 증상 중 하나 이상의 발병, 발생 또는 진행의 억제 또는 감소를 포함한다. 본 명세서에서 사용되는 바와 같이, 개선은 인플루엔자 감염의 가시적이거나 또는 감지 가능한 질환 증상, 바이러스 혈증, 또는 임의의 다른 측정 가능한 징후의 감소를 의미할 수 있다.
치료를 필요로 하는 대상체는 인플루엔자 바이러스 감염에 기인한 병태로 이미 고통 받는 대상체뿐만 아니라, 인플루엔자 바이러스 감염이 예방되어야 하는 대상체도 포함한다. 따라서, 본 발명의 폴리펩티드, 면역원성 단편, 핵산 및/또는 벡터는 미경험 대상체, 즉, 인플루엔자 바이러스 감염에 의해 야기되는 질환을 갖지 않거나, 인플루엔자 바이러스 감염으로 감염된 적이 없으며 현재 감염되지 않은 대상체, 또는 인플루엔자 바이러스에 이미 감염된 적이 있는 대상체에게 투여될 수 있다.
일 구현예에서, 예방 및/또는 치료는 인플루엔자 바이러스 감염에 민감한 환자군에서 표적화될 수 있다. 이러한 환자군은, 예를 들어 노인(예를 들어, 50세 이상, 60세 이상, 그리고 바람직하게는 65세 이상), 어린이(예를 들어, 5세 이하, 1세 이하), 입원 환자, 면역손상 대상체, 및 항바이러스 화합물 치료를 받았으나 부적절한 항바이러스 반응을 나타낸 환자를 포함하나, 이에 한정되지 않는다.
본 발명의 폴리펩티드, 면역원성 단편, 핵산 분자 및/또는 벡터는 하나 이상의 다른 활성제, 예컨대, 대안적인 인플루엔자 백신, 단일클론 항체, 항바이러스제, 항박테리아제, 및/또는 면역조절제와 병용하여 대상체에게 투여될 수 있다. 하나 이상의 다른 활성제는 인플루엔자 바이러스 질환의 치료 및/또는 예방에 유리할 수 있거나 인플루엔자 바이러스 질환과 관련된 증상 또는 병태를 개선할 수 있다. 일부 구현예에서, 하나 이상의 다른 활성제는 진통제, 해열 약제 또는 호흡을 완화하거나 보조하는 치료제이다.
또한, 본 발명의 HA 폴리펩티드 또는 이의 단편은 연구 도구로서, 진단 도구로서, 또는 항체 시약 또는 치료 항체의 생산을 위한 표적으로서 사용될 수 있다. 예를 들어, 일부 구현예에서 HA 폴리펩티드는 예를 들어, ELISA 분석, 비아코어(Biacore)/SPR 결합 분석 및/또는 당해 분야에 공지된 항체 결합에 대한 임의의 기타 분석에서 항-HA 항체의 분석, 이의 결합 및/또는 이의 역가의 측정을 위한 분석물로서 유용할 수 있다. 또 다른 예로서, 본 발명의 HA 폴리펩티드는 항-HA 항체의 효능을 분석 및/또는 비교하는 데 사용될 수 있다.
본 발명의 HA 폴리펩티드 또는 이의 단편은 또한 치료 항체 및/또는 연구 도구로서 또는 임의의 기타 원하는 용도에 사용될 수 있는 항체의 생성에 유용할 수 있다. 예를 들어, 본 발명의 HA 폴리펩티드는 연구 도구 및/또는 치료제로서 사용하기 위한 HA 단백질에 대한 항체를 얻기 위하여 비 인간 동물의 면역화에 사용될 수 있다. 그러면 단일클론성 또는 다클론성일 수 있는 이러한 항체 및/또는 이러한 항체를 생성하는 세포를 이러한 동물로부터 얻을 수 있다.
진단 도구로서 사용하기 위한 본 발명의 폴리펩티드는 주어진 분석에 적합한 임의의 검출 기술에 유용한 태그를 포함할 수 있다. 사용된 태그는 사용된 특정 검출/분석/진단 기술 및/또는 방법에 따라 달라질 것이다. 방법은 용액에서 수행될 수 있거나, 또는 본 발명의 폴리펩티드(들)는 담체 또는 기질, 예를 들어, 미세역가 플레이트(예를 들어, ELISA용), 막 및 비드 등에 결합되거나 부착될 수 있다.
본 발명을 다음의 실시예 및 도면에서 추가로 설명한다. 실시예는 본 발명의 범주를 어떠한 방식으로든 제한하고자 하지 않는다.
실시예
실시예 1:
가용성 HA 폴리펩티드 - 본 발명의 폴리펩티드의 구조 및 설계 요소
인플루엔자 A 바이러스 적혈구응집소(HA0)의 엑토도메인을 나타내는 가용성 폴리펩티드를 제조하기 위하여, HA는 고유의 막횡단 및 세포질 도메인 없이 발현될 필요가 있다. 안정적인 삼량체형 가용성 야생형(WT) HA의 발현은 종종 포유동물 세포에서 매우 빈약하다. 적어도 삼량체화 수준을 개선하기 위하여, 폴드온 삼량체화 도메인은 종종 폴리펩티드의 C 말단에 유전적으로 융합된다. 불행히도, 폴드온 도메인의 부가는 원치 않는 네오에피토프를 도입하고 종종 발현 수준을 감소시키거나, 또는 폴리펩티드의 구조를 변경할 수 있다. 본 발명에 따르면, 가용성의 안정적인 HA 삼량체의 발현 및 삼량체화 수준은 폴드온 또는 임의의 기타 비 천연 삼량체화 서열의 부가 없이 HA 폴리펩티드의 코어에, 특히 아미노산 위치 355 및 432에서, 또는 아미노산 위치 355 및 380 및 432에서, 특정 아미노산 돌연변이를 도입함으로써 증가될 수 있음이 밝혀졌다. 본 발명의 HA 단량체에서 아미노산 위치의 넘버링에 대해서는 문헌[Winter et al. 1981]에 의한 H3 넘버링이 사용됨을 유의한다(위 참조). 따라서, 본 발명의 HA 폴리펩티드 단량체에서 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 (서열번호 1의 아미노산 서열을 갖는) 기준 H3N2 균주 A/아이치/2/68로부터의 HA 내 아미노산 위치의 넘버링에 따른다.
본 발명에 따른 주요 돌연변이의 주요 구조적 요소 및 위치는 인플루엔자 A H1A/캘리포니아/07/2009 균주의 HA에서 도 1의 A에 도시되어 있다(도 1의 A). 도시된 바와 같이, HA 단량체는 절단된 HA2 도메인(특히 HA2 도메인은 아미노산 위치 514 다음에 절단됨(즉, HA2 도메인의 C 말단 부분이 위치 515의 아미노산으로부터 시작하여 결실됨)을 포함하여, 막횡단 및 세포질 도메인을 결실시키고 HA의 가용성 엑토도메인을 생성한다(도 1의 B).
본 발명의 폴리펩티드는 위치 329(도 1의 B)에서 천연 일염기성 절단 부위인 아미노산 아르기닌(R)의 예를 들어, 글루타민(Q)으로의 돌연변이에 의해 프로테아제 절단에 대해 저항성이 되도록 만들어질 수 있다. 천연의 전장 HA와 대조적으로, R329Q 돌연변이를 포함하는 폴리펩티드는 세린 프로테아제(예를 들어, 트립신)에 의해 절단될 수 없다. HA의 절단은 단백질이 막 융합 및 바이러스 진입에 필요한 구조적 변화를 겪을 수 있도록 한다.
실시예 2:
상이한 서브타입에서 야생형 HA와 비교하여 가용성의 안정화된 HA의 발현
이 실시예에서, 그룹 1 및 그룹 2 둘 다로부터의 인플루엔자 바이러스로부터의 몇몇 HA를 선택하고 안정화된 가용성 삼량체 HA 폴리펩티드로서 발현시키고, 이들의 각각의 야생형 가용성 HA 엑토도메인(즉, 막횡단 및 세포질내 도메인이 없음)과 비교하였다. 본 발명에 따르면, 위치 355의 트립토판(W) 및 위치 380 및 432의 이소류신(I)은, 이들 아미노산이 HA 아미노 서열에 아직 존재하지 않은 경우, 인간에서의 8개의 가장 많은 순환 서브타입(도 2a)을 포함한 5개의 상이한 그룹 1 균주 및 5개의 상이한 그룹 2 균주의 HA의 아미노산 서열에 도입되었다. 또한, 일부 폴리펩티드의 위치 388에서 A-나선의 상단에 메티오닌(M)이 도입되었다. A/청둥오리/네덜란드/12/200(UFV181146) 및 A/닭/독일/N/1949(UFV181147)로부터 유래된 폴리펩티드를 제외하고 위치 478에서 이소류신이 도입되거나 WT 서열에 이미 존재하는 경우 유지되었다. Expi293F 배양 상청액에서 본 발명의 폴리펩티드의 발현 수준 및 삼량체화를 본 발명의 돌연변이가 없는 각각의 가용성 WT 폴리펩티드와 비교하였다.
표 1은 제조된 본 발명에 따른 폴리펩티드를 나타낸다.
[표 1]
본 발명의 폴리펩티드
+는 상기 위치에서 상기 아미노산의 존재를 의미하고; 빈 세포는 상기 아미노산의 부재(즉, 야생형 아미노산 잔기의 존재)를 의미한다.
도 2 및 표 1에 열거된 폴리펩티드를 암호화하는 DNA 단편을 합성하고(Genscript) pcDNA2004 발현 벡터(증진된 CMV 프로모터가 있는 변형된 pcDNA3 플라스미드)에 클로닝하였다. 본 발명의 폴리펩티드는 부위 특이적 비오틴화, 스크리닝 및 정제 목적을 위한 C 말단 링커-소르타제(Sortase)-링커-His 태그를 포함하였고, 진핵생물 Expi293F 현탁 세포주에서 마이크로 규모(200 μL)로 생산되었다. 야생형(WT) 전장(FL) HA 폴리펩티드는 스크리닝 목적을 위한 링커-His 태그를 함유하였다.
세포를 ExpiFectamine 293 형질감염 키트(Gibco , ThermoFisher Scientific)를 이용하여 2.5E+06vc/mL의 세포 밀도에서 96-하프 딥 웰 플레이트(System Duetz)에서 산업 등급의 DNA(0.01 EU/μg 이하의 내독소 수준 및 90% 이상의 초나선 함량)로 일시적으로 형질감염시키고, 37℃, 250 rpm, 8% CO2 및 75%의 습도에서 Expi293 발현 배지(Gibco, ThermoFisher Scientific)를 함유하는 진탕 플라스크에서 인큐베이션하였다. 분비된 폴리펩티드를 함유하는 세포 배양 상청액을 제3일에 수확하고, 원심분리(400xg에서 10분) 후 여과(96웰 필터 플레이트, 0.22 μm PVDF 막, Corning)하여 정화하였다.
수확된 배양 상청액의 발현된 가용성 HA 폴리펩티드의 수준을 OCTET 플랫폼(ForteBio)을 이용한 바이오층 간섭법에 의해 평가하였다. 요컨대, 항-HIS(HIS2) 바이오센서(ForteBio)를 사용하여, 정제된 폴리펩티드 UFV180436의 잘 정의된 기준 배치의 희석물 계열의 결합 이동을 측정하여 표준 곡선을 확립하였다. 이어서, 본 발명의 폴리펩티드를 함유하는 (동역학 완충액(ForteBio)에) 사전 희석된 세포 배양 상청액의 결합 이동을 측정하고, 폴리펩티드의 농도를 확립된 표준 곡선을 사용하여 계산하였다.
Expi293F 세포 배양 수확물에서 발현된 폴리펩티드의 존재 및 이의 4차 구조(폴리펩티드가 단량체, 삼량체 또는 다량체인지를 나타냄)는 BEH 200A 컬럼(Waters, 주입 부피 40 μL, 유속 0.35 mL/분)을 구비한 Vanquish 시스템(ThermoFisher Scientific)을 사용하여 초고성능 액체 크로마토그래피(UHPLC)에서 분석적 크기 배제 크로마토그래피(SEC)에 의해 평가하였다. 용리는 Helios 광산란 검출기(Wyatt Technologies)에 의해 모니터링하였다. SEC 프로파일을 Astra 6 소프트웨어 패키지(Wyatt Technology)로 분석하였다.
결과 및 결론
상이한 균주의 야생형 HA에서 위치 355의 트립토판 및 위치 380 및/또는 432의 이소류신의 도입은 OCTET에 의해 결정된 바와 같이 본 발명의 테스트한 모든 폴리펩티드에 대해 발현의 증가를 초래하였다(도 2b).
조 세포 배양 상청액의 SEC 분석은 본 발명의 폴리펩티드에 안정화 돌연변이를 도입할 때 모든 가용성의 안정화된 HA에 대해, 각각의 야생형 HA 엑토도메인에 대해 관찰된 삼량체 피크보다 더 긴 6분에서 7분 사이의 체류 시간에 뚜렷한 삼량체(T) 피크가 나타남을 보여주었다(도 2c). 상이한 인플루엔자 HA 서브타입 간의 체류 시간의 차이는 글리코실화 수준 및 복잡성의 차이로 인한 것일 수 있음을 주목한다.
종합하면, 데이터는 본 발명의 HA 폴리펩티드로의 돌연변이 355W, 380I 및/또는 432I의 도입은 안정적인 가용성 삼량체 HA의 증가된 발현 및 형성을 초래함을 확인시켜 준다.
실시예 3:
폴드온 삼량체화 도메인을 함유하는 야생형 HA와 비교한 정제된 삼량체 전장 HA의 시험관 내 특성화
중요한 안정화 돌연변이 355W, 380I 및/또는 432I의 기여를 추가로 특성화하기 위하여, 돌연변이를 H1 균주 A/캘리포니아/07/2009(UFV181009), A/미시간/45/2015((UFV181091), 및 H3 균주 A/홍콩/1/1968(UFV180660) 및 A/인디애나/11/2011(UFV181099)로부터 유래된 HA 엑토도메인 폴리펩티드(즉, TM 및 IC 도메인 제외)에 도입하고, 폴드온 삼량체화 도메인을 함유하는 야생형(WT) HA 엑토도메인과 비교하였다(폴드온 삼량체화 도메인이 결여된 UFV4239(서열번호 29)는 제외). 폴리펩티드는 표 2에 나타낸 바와 같은 아미노산을 포함하였다. 모든 폴리펩티드는 정제 및 스크리닝 목적을 위한 His 태그를 추가로 포함하고, ExpiCHO 세포에서 생산된 후, 정제 및 특성화되었다.
본 발명의 폴리펩티드를 암호화하는 DNA 단편을 실시예 2에 기술된 바와 같이 합성하였다. 폴리펩티드를 ExpiCHO 현탁 세포(350 mL 규모)에서 생산하였고, 제조업체의 프로토콜에 따라 ExpiFectamine 형질감염 시약(Gibco, ThermoFisher Scientific)을 사용하여 각각의 산업 등급의 DNA로 일시적 형질감염에 의해 ExpiCHO 발현 배지에서 배양하였다. ExpiFectamine CHO 증진제 및 ExpiCHO 피드(Gibco, ThermoFisher Scientific)를 제조업체의 프로토콜에 따라 형질감염 1일 후 세포 배양물에 첨가하였다. ExpiCHO 형질감염된 세포 현탁액을 32℃, 5% CO2에서 인큐베이션하고, 분비된 폴리펩티드를 함유하는 배양 상청액을 7 내지 11일 사이에 수확하였다. 배양 상청액을 원심분리에 이어, 0.2 μm 보틀 탑 필터(Corning)를 이용한 여과에 의해 정화하였다.
수확된 배양 상청액으로부터, 본 발명의 his 태그가 붙은 폴리펩티드 및 폴드온 삼량체화 도메인을 함유하는 각각의 야생형 균주를 KTA Avant 25 시스템(GE Healthcare Life Sciences)을 사용하여 2단계 프로토콜에 따라 정제하였다. 먼저, 사전 패킹된 cOmplete His-태그 정제 컬럼(Roche)을 사용하여 고정화된 금속 친화도 크로마토그래피를 수행하고, 1 mM의 이미다졸로 세척하고, 300 mM의 이미다졸로 용리시켰다. 두 번째로, HiLoad Superdex 200 pg 26/600 컬럼(GE Healthcare Life Sciences)을 사용하여 크기 배제 크로마토그래피를 수행하였다. 삼량체 피크 분획을 모아 냉동하고 -80℃에서 보관하였다(1개월 및 6개월).
본 발명의 정제된 폴리펩티드의 삼량체 함량을 실시예 2에 기술된 바와 같이 초고성능 액체 크로마토그래피(UHPLC)에서 분석적 SEC에 의해 평가하였다. 각각의 정제된 폴리펩티드 20 μg을 주입하고, 컬럼에 흘려보냈다.
정제된 폴리펩티드의 열 안정성을 6 μg의 폴리펩티드 용액에 첨가된 Sypro Orange 염료(ThermoFisher Scientific)의 형광 발광을 모니터링함으로써 시차 주사 형광측정법(DSF)에 의해 결정하였다. 온도가 25℃에서 95℃(시간당 60℃)로 점진적으로 증가하면, 폴리펩티드가 펼쳐지고 형광 염료가 노출된 소수성 잔기에 결합하여 특징적인 발광 변화를 초래한다. 용융 곡선을 ViiA7 실시간 PCR 장치(Applied BioSystems)를 사용하여 측정하고, Tm50 값을 Spotfire suite(Tibco Software Inc.)에 의해 계산하였다. Tm50 값은 단백질의 50%가 펼쳐지는 온도를 나타내며, 따라서 폴리펩티드의 온도 안정성에 대한 척도이다.
정제된 폴리펩티드의 3차원 입체형태를 ELISA에서 항원성을 테스트함으로써 평가하였다(항체 결합의 EC50 값). 이를 위하여, 폴리펩티드를 10 nM의 농도로 코팅하고, 출발 농도로서 70 nM을 사용하여, 일련의 단일클론 항체(mAb) 희석액: 특히 CR6261(그룹 1 특이적), CR8020(그룹 2 특이적), CR9114(그룹 1 및 2 모두 특이적), 및 MD3606(그룹 1 및 2 특이적 다중도메인 항체)과 인큐베이션하였다. 항체 결합을 2차 항체 항-인간 Fc HRP(마우스 항 인간 IgG, Jackson ImmunoResearch)와의 인큐베이션에 의해 결정하였고, POD 기질의 첨가에 의해 가시화하였다. EnSight™ 다중모드 플레이트 판독기(PerkinElmer)를 사용하여 판독을 수행하였다. 두 가지 독립적인 실험의 EC50 값을 Spotfire 제품군(Tibco Software Inc.)과 도 3d에 열거된 평균 및 표준 편차를 이용하여 계산하였다.
결과 및 결론
[표 2]
본 발명의 폴리펩티드
SEC 분석 결과는 상이한 인플루엔자 HA 균주의 본 발명의 폴리펩티드 내로의 (안정화) 아미노산의 존재(또는 동시 도입)가 고도로 순수하고 안정적인 가용성 삼량체 HA 폴리펩티드의 정제를 가능하게 함을 확인시켜 주었다. 아미노산의 안정화 효과는 H1A/캘리포니아/07/2009(UFV181009)에서 유래된 정제된 폴리펩티드의 경우 가장 잘 관찰되었으며, 여기서 상응하는 야생형 구축물(UFV4239, 서열번호 29)은 폴드온 삼량체화 도메인을 보유하지 않았고 단량체 피크만을 생성하였으나, 본 발명의 안정화된 폴리펩티드는 고순도 삼량체 피크를 보여준다(도 3a). 다른 H1 균주 A/미시간/45/2015의 야생형 HA 분자와 H3 균주 A/홍콩/1/1968 및 A/인디애나/11/2011은 추가적인 C 말단 폴드온 도메인을 가지고 발현되었고 삼량체 HA를 형성하였다. 그러나 본 발명의 각각의 안정화된 폴리펩티드와 달리, 폴드온 도메인을 갖는 야생형 HA의 삼량체 피크는 더 넓고 비대칭적이며, 바람직하지 않은 형태(*) 또는 덜 컴팩트한 접힘의 대안적인 고분자량 및/또는 저분자량의 폴리펩티드의 존재를 시사하는 숄더(shoulder)를 보여주었다(Seok et al., Sci. Cell Rep. 8;7:1-7540, 2017]).
모든 폴리펩티드의 추가 특성화는 안정화 아미노산을 포함하는 본 발명의 폴리펩티드가 폴드온 삼량체화 도메인이 있거나 없는(UFV4239, 서열번호 29) WT 폴리펩티드와 비교하여 상당히 더 높은 열 안정성을 나타냄을 보여주었다(도 3b 및 3c).
도입된 안정화 돌연변이는 매립형 돌연변이(즉, 표면이 아니라 HA 폴리펩티드 내부에 있음)이므로, 단량체 또는 삼량체 HA의 표면에 영향을 미치지 않을 것이다. HA 표면의 무결성을 확인하기 위하여, 널리 알려진 폭넓게 중화하는 항체 패널의 폴리펩티드에 대한 결합을 ELISA로 평가하였다. 본 발명의 야생형 및 안정화된 폴리펩티드는 이들의 예상되는 결합 폭에 따라 모든 항체에 대해 낮은 nM 범위의 EC50 값으로 비슷한 결합을 나타내었다. 본 발명의 H3 A/홍콩/1/1968 및 H3 A/인디애나/11/2011 유래 HA 폴리펩티드에 대한 CR9114 결합의 개선(약 4 내지 8배)이 관찰되었다(도 3d).
결론적으로, 본 실시예에 기술된 본 발명의 폴리펩티드는 고순도 삼량체 폴리펩티드로서 세포 배양 상청액으로부터 정제되었고, (폴드온 삼량체화 도메인이 있거나 없는) WT HA와 비교하여 개선된 열 안정성을 나타내었고 적절하게 접혔다.
실시예 4:
안정화 돌연변이의 조합의 특성화
본 발명의 폴리펩티드에서 안정화 돌연변이를 조합하는 것의 유익한 효과를 평가하기 위하여, 조합 355W + 478I 및 조합 380I + 432I를 H1 균주 A/캘리포니아/07/2009의 HA 엑토도메인에 단계적으로 도입하였다(도 4의 A, '.'는 첫 번째 줄에 열거된 H1 야생형(WT) 잔기의 변경되지 않은 존재를 나타낸다).
본 발명의 폴리펩티드를 암호화하는 DNA 단편을 실시예 2에 기술된 바와 같이 합성하였다. 부위 특이적 비오틴화 및 스크리닝 및 정제 목적을 위한 C 말단 링커-소르타제-링커-His 태그를 포함하는 폴리펩티드를 실시예 2에 기술된 바와 같이 진핵생물 Expi293F 세포에서 마이크로 규모(200 μL)로 생산하였다. 발현된 폴리펩티드의 수준을 OCTET에 의해 결정하였고, 삼량체 함량을 실시예 2에 기술된 바와 같이 분석적 SEC에 의해 분석하였다.
결과 및 결론
안정화 돌연변이의 상이한 조합을 갖는 본 발명의 폴리펩티드의 발현 수준의 평가는 UFV181007(서열번호 35)에 존재하는 돌연변이 380I 및 432I가 발현에 영향을 미치지 않았지만 WT 구축물과 비교하여 유의하게 삼량체의 수준을 증가시켰음을 보여주었다(도 4의 B). 돌연변이 355W 및 478I(예를 들어, UFV181005: 서열번호 34)의 부가는 현저한 발현의 증가를 초래하지만(도 4의 A), 삼량체의 형성은 관찰되지 않았다(도 4의 B). 355W, 478I, 380I 및 432I(예를 들어, UFV181009: 서열번호 10)를 조합할 때 발현 수준이 모두 증가되었고(도 4의 A), 삼량체 함량은 세포 배양 상청액에서 유의하게 개선되었다(도 4의 B).
결론적으로, 돌연변이 355W 및 478I는 본 발명의 폴리펩티드의 발현 수준을 증가시킨 반면, 돌연변이 380I 및 432I는 삼량체 형성을 개선하였다. 안정화 돌연변이의 조합은 본 발명의 폴리펩티드의 발현 및 삼량체 수준을 상승적으로 증가시켰다.
실시예 5: 다양한 HA 서브타입에서 야생형 HA와 비교하여 추가적인 가용성의 안정화된 HA의 발현
이 실시예에서, 추가로 안정화된 HA를 발현시키고, 각각의 야생형 가용성 HA 엑토도메인과 비교하였다(도 5의 A). 위치 355의 트립토판(W) 및 위치 380 및 432의 이소류신(I)이 2개의 추가 그룹 1 균주 및 4개의 추가 그룹 2 균주의 HA의 아미노산 서열에 도입되었다. 형질감염 3일 후 Expi293F 배양 상청액에서 폴리펩티드의 발현 수준 및 삼량체화를 본 발명의 돌연변이가 없는 각각의 가용성 WT 폴리펩티드와 비교하였다. 표 4는 제조된 본 발명에 따른 추가의 폴리펩티드를 나타낸다.
본 발명의 폴리펩티드를 암호화하는 DNA 단편을 실시예 2에 기술된 바와 같이 합성하였다. 플라스미드를 실시예 2에 기술된 바와 같이 진핵생물 Expi293F 세포에서 마이크로 규모(200 μL)로 형질감염시켰다. 스크리닝 및 정제 목적을 위한 C 말단 링커 His-태그를 포함한 모든 폴리펩티드가 발현된 반면, 안정화된 폴리펩티드는 부위 특이적 비오틴화를 위한 His 태그 앞에 추가의 소르타제-링커 서열을 포함한다. 발현된 폴리펩티드의 수준을 OCTET에 의해 결정하였고, 삼량체 함량을 실시예 2에 기술된 바와 같이 분석적 SEC에 의해 분석하였다.
결과 및 결론
실시예 2에서 관찰된 바와 같이, 상이한 균주의 야생형 HA에서 위치 355의 트립토판 및 위치 380 및 432의 이소류신의 도입은 OCTET 측정에 기초하여 추가로 테스트한 이러한 모든 폴리펩티드의 발현의 증가를 초래하였다. 하나의 예외는 H1 A/사우스캐롤라이나/1/1918 (UFV181084) 유래 HA에 대해 관찰되었는데, 이는 OCTET에 의해 결정된 바와 같이 약간의 감소를 보였지만(도 5의 A), SEC의 곡선 아래 면적을 기초로 하지 않았다(도 5의 B).
조 세포 배양 상청액의 SEC 분석은 모든 추가적인 가용성의 안정화된 HA에 안정화 돌연변이를 도입할 때, 각각의 야생형 HA 엑토도메인과 비교하여 더 많은 삼량체 폴리펩티드(T) 및 더 적은 단량체 폴리펩티드(M) 및 고분자량 종이 관찰되었음을 보여주었다(도 5의 B). 실시예 2에서 주목한 바와 같이, 상이한 인플루엔자 HA 서브타입 간의 체류 시간의 차이는 글리코실화 수준 및 복잡성의 차이로 인한 것일 가능성이 높다.
종합하면, 데이터는 본 발명의 추가적인 HA 폴리펩티드로의 돌연변이 355W, 380I 및/또는 432I의 도입이 안정적인 가용성 삼량체 HA의 증가된 발현 및 형성을 초래함을 확인시켜 준다.
실시예 6: 정제된 삼량체 전장 HA의 시험관 내 특성화(추가 데이터)
이 실시예에서는 추가적인 안정화된 HA를 발현시키고, 정제하고, 장기간 온도 스트레스에 노출시켰다. 이들 HA, UFV190839(서열번호 50), UFV190068(서열번호 51) 및 UFV190841(서열번호 52)은 각각 H3 A/홍콩/1/1968 H7 A/청둥오리/NL/12/2000, 및 H10 A/닭/독일/N/1949에서 유래되었다. 요컨대, 정제된 삼량체 폴리펩티드를 4℃(냉장고) 및 37℃(인큐베이터)에서 60일 동안 보관한 후, 분석적 SEC에 의해 단백질 무결성을 평가하였다.
본 발명에 따르면, 위치 355의 트립토판(W) 및 위치 380 및 432의 이소류신(I)이 3개의 상이한 그룹 2 균주의 HA의 아미노산 서열에 도입되었다.
본 발명의 폴리펩티드를 암호화하는 DNA 단편을 실시예 2에 기술된 바와 같이 합성하였다. 폴리펩티드를 실시예 3에 기술된 바와 같이 진핵생물 ExpiCHO 세포에서 중간 규모(30 mL)로 생산하였고, 5일째에 수확하였다. 부위 특이적 비오틴화, 스크리닝 및 정제 목적을 위한 C 말단 링커-소르타제-링커 His-태그를 포함하는 모든 폴리펩티드를 발현시켰다. 단백질을 실시예 3에 기술된 바와 같이 2단계 공정에 의해 정제하였으나, 이제는 HiLoad Superdex 200 16/600 컬럼이 사용되었다(GE Healthcare Life Sciences). 발현된 폴리펩티드의 수준을 OCTET에 의해 결정하였고, 현재 Unix-C 300 A 컬럼(Sepax Technologies)이 사용되었다는 일탈과 함께 실시예 2에 기술된 바와 같이 삼량체 함량을 분석적 SEC에 의해 분석하였다.
결과 및 결론
SEC 분석 결과는 정제 후 수득된 안정화 아미노산을 포함하는 본 발명의 폴리펩티드가 고도로 순수하고 삼량체임을 나타내었다. 나아가, 가용성 HA 폴리펩티드는 온도 스트레스에 저항적이었다. 4℃ 및 37℃에서 60일의 인큐베이션은 스트레스 전 물질에 대해 관찰된 것과 비교하여 단백질의 양 및 삼량체 상태에 영향을 미치지 않았으며(도 6), 37℃에서 인큐베이션 후 H10 유래 HA에 대해 오로지 소량의 삼량체 이외의 폴리펩티드가 관찰되었다(약 4.75분의 체류 시간).
실시예 2에서 주목한 바와 같이, 상이한 인플루엔자 HA 서브타입 간의 체류 시간의 차이는 글리코실화 수준 및 복잡성의 차이로 인한 것일 가능성이 높다. 또한, 60일 동안 스트레스를 가한 물질과 비교하여 출발 물질에 대해 관찰된 체류 시간의 작은 차이는 컬럼 노화로 인한 것일 가능성이 높다(즉, 유사한 이동이 내부 대조군에 대해 관찰됨).
결론적으로, 본 실시예에 기술된 본 발명의 폴리펩티드는 고순도 삼량체 폴리펩티드로서 배양 상청액으로부터 정제되었고, 60일의 기간 동안 온도 스트레스에 대해 고도로 불활성인 것으로 나타났다.
[표 3]
표준 아미노산, 약어 및 특성
서열
서열번호 1 CAA24269.1 적혈구응집소(인플루엔자 A 바이러스(A/아이치/2/1968(H3N2) (신호 서열 제외))
QDLPGNDNST ATLCLGHHAV PNGTLVKTIT DDQIEVTNAT ELVQSSSTGK 50
ICNNPHRILD GIDCTLIDAL LGDPHCDVFQ NETWDLFVER SKAFSNCYPY 100
DVPDYASLRS LVASSGTLEF ITEGFTWTGV TQNGGSNACK RGPGSGFFSR 150
LNWLTKSGST YPVLNVTMPN NDNFDKLYIW GIHHPSTNQE QTSLYVQASG 200
RVTVSTRRSQ QTIIPNIGSR PWVRGLSSRI SIYWTIVKPG DVLVINSNGN 250
LIAPRGYFKM RTGKSSIMRS DAPIDTCISE CITPNGSIPN DKPFQNVNKI 300
TYGACPKYVK QNTLKLATGM RNVPEKQTRG LFGAIAGFIE NGWEGMIDGW 350
YGFRHQNSEG TGQAADLKST QAAIDQINGK LNRVIEKTNE KFHQIEKEFS 400
EVEGRIQDLE KYVEDTKIDL WSYNAELLVA LENQHTIDLT DSEMNKLFEK 450
TRRQLRENAE EMGNGCFKIY HKCDNACIES IRNGTYDHDV YRDEALNNRF 500
QIKGVELKSG YKDWILWISF AISCFLLCVV LLGFIMWACQ RGNIRCNICI 550
CR6261 VH 단백질(서열번호 2)
EVQLVESGAEVKKPGSSVKVSCKASGGPFRSYAISWVRQAPGQGPEWMGGIIPIFGTTKYAPKFQGRVTITADDFAGTVYMELSSLRSEDTAMYYCAKHMGYQVRETMDVWGKGTTVTVSS
CR6261 VL 단백질(서열번호 3)
QSVLTQPPSVSAAPGQKVTISCSGSSSNIGNDYVSWYQQLPGTAPKLLIYDNNKRPSGIPDRFSGSKSGTSATLGITGLQTGDEANYYCATWDRRPTAYVVFGGGTKLTVL
CR8020 VH 단백질(서열번호 4)
QVQLQQSGAEVKTPGASVKVSCKASGYTFTSFGVSWIRQAPGQGLEWIGWISAYNGDTYYAQKFQARVTMTTDTSTTTAYMEMRSLRSDDTAVYYCAREPPLFYSSWSLDNWGQGTLVTVSS
CR8020 VL 단백질(서열번호 5)
EIVLTQSPGTLSLSPGERATLSCRASQSVSMNYLAWFQQKPGQAPRLLIYGASRRATGIPDRISGSGSGTDFTLTISRLEPADFAVYYCQQYGTSPRTFGQGAKVEIK
CR9114 VH 단백질(서열번호 6)
QVQLVQSGAEVKKPGSSVKVSCKSSGGTSNNYAISWVRQAPGQGLDWMGGISPIFGSTAYAQKFQGRVTISADIFSNTAYMELNSLTSEDTAVYFCARHGNYYYYSGMDVWGQGTTVTVSS
CR9114 VL 단백질(서열번호 7)
SYVLTQPPAVSGTPGQRVTISCSGSDSNIGRRSVNWYQQFPGTAPKLLIYSNDQRPSVVPDRFSGSKSGTSASLAISGLQSEDEAEYYCAAWDDSLKGAVFGGGTQLTVL
MD3606 단백질(서열번호 8)
EVQLVESGGGLVQPGGSLRLSCAVSISIFDIYAMDWYRQAPGKQRDLVATSFRDGSTNYADSVKGRFTISRDNAKNTLYLQMNSLKPEDTAVYLCHVSLYRDPLGVAGGMGVYWGKGALVTVSSGGGGSGGGGSEVQLVESGGGLVQAGGSLKLSCAASGRTYAMGWFRQAPGKEREFVAHINALGTRTYYSDSVKGRFTISRDNAKNTEYLEMNNLKPEDTAVYYCTAQGQWRAAPVAVAAEYEFWGQGTQVTVSSGGGGSGGGGSEVQLVESGGGLVQPGGSLRLSCAATGFTLENKAIGWFRQTPGSEREGVLCISKSGSWTYYTDSMRGRFTISRDNAENTVYLQMDSLKPEDTAVYYCATTTAGGGLCWDGTTFSRLASSWGQGTQVTVSSGGGGSGGGGSEVQLVESGGGLVQPGGSLKLSCAASGFTFSTSWMYWLRQAPGKGLEWVSVINTDGGTYYADSVKDRFTISRDNAKDTLYLQMSSLKSEDTAVYYCAKDWGGPEPTRGQGTQVTVSSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK
서열번호 9: UFV181157(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDEITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEIDGSHHHHHH
서열번호 10: UFV181009(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHWQNEQGSGYAADLKSTQNAIDEITNIVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLINERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCIESVKNGTYDYPKYSEEAKLNREEIDSGSLPETGGGSHHHHHH
서열번호 11: UFV181134(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFTTANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLNQSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSRYSKKFKPEIATRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFTMERNAGSGIIISDTPVHDCNTTCQTPEGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSNVKNLYEKVRNQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREKIDGSHHHHHH
서열번호 12: UFV181091(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFTTANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLNQSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSRYSKKFKPEIATRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFTMERNAGSGIIISDTPVHDCNTTCQTPEGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHWQNEQGSGYAADLKSTQNAIDKITNIVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLINERTLDYHDSNVKNLYEKVRNQLKNNAKEIGNGCFEFYHKCDNTCIESVKNGTYDYPKYSEEAKLNREKIDSGSLPETGGGSHHHHHH
서열번호 13: UFV181153(신호 펩티드 및 태그는 밑줄 표시됨)
MAIIYLILLFAAVRGDQICIGYHSNNSTEKVDTILERNVTVTHAQDILEKTHNGKLCKLNGIPPLELGDCSIAGWLLGNPECDRLLTVPEWSYIMEKENPRNGLCYPGSFNDYEELKHLLSSVTHFEKVKILPRDRWTQHTTTGGSRACAVSGNPSFFRNMVWLTKKGSNYPIAKGSYNNTSGEQMLIIWGVHHPNDDAEQRTLYQNVGTYVSVGTSTLNKRSVPEIATRPKVNGQGGRMEFSWTILDMLDTINFESTGNLIAPEYGFRISKRGSSGIMKTEGTLENCETKCQTPLGAINTTLPFHNIHPLTIGECPKYVKSERLVLATGLRNVPQIESRGLFGAIAGFIEGGWQGMVDGWYGYHHSNDQGSGYAADKESTQRAIDGITNKVNSVIEKMNTQFEAVGKEFNNLEKRLENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQLRDNAKELGNGCFEFYHKCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGSHHHHHH
서열번호 14: UFV181154(신호 펩티드 및 태그는 밑줄 표시됨)
MAIIYLILLFAAVRGDQICIGYHSNNSTEKVDTILERNVTVTHAQDILEKTHNGKLCKLNGIPPLELGDCSIAGWLLGNPECDRLLTVPEWSYIMEKENPRNGLCYPGSFNDYEELKHLLSSVTHFEKVKILPRDRWTQHTTTGGSRACAVSGNPSFFRNMVWLTKKGSNYPIAKGSYNNTSGEQMLIIWGVHHPNDDAEQRTLYQNVGTYVSVGTSTLNKRSVPEIATRPKVNGQGGRMEFSWTILDMLDTINFESTGNLIAPEYGFRISKRGSSGIMKTEGTLENCETKCQTPLGAINTTLPFHNIHPLTIGECPKYVKSERLVLATGLRNVPQIESRGLFGAIAGFIEGGWQGMVDGWYGYHWSNDQGSGYAADKESTQRAIDGITNIVNSVIEKMNTQFEAVGKEFNNLEKRLENLNKKMEDGFLDVWTYNAELLVLMINERTLDFHDSNVKNLYDKVRMQLRDNAKELGNGCFEFYHKCDDECINSVKNGTYDYPKYEEESKLNRNEIKSGSLPETGGGSHHHHHH
서열번호 15: UFV181158(신호 펩티드 및 태그는 밑줄 표시됨)
MEKIVLLFAIVSLVQSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCSLNGVKPLILRDCSVAGWLLGNPMCDEFLNVPEWSYIVEKDSPINGLCYPGDFNDYEELKHLLSSTNHFEKIQIIPRSSWSNHDASSGVSSACPYNGRSSFFRNVVWLIKKNNAYPTIKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYVSVGTSTLNQRSVPEIATRPKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAIMKSGLEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSDRLVLATGLRNVPQRETRGLFGAIAGFIEGGWQGMVDGWYGYLHSNEQGSGYAADKESTQKAIDGITNKINSIIDKMNTQFEAVGKEFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDDECMESVRNGTYDYPQYSEEARLNREEISGSHHHHHH
서열번호 16: UFV181159(신호 펩티드 및 태그는 밑줄 표시됨)
MEKIVLLFAIVSLVQSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCSLNGVKPLILRDCSVAGWLLGNPMCDEFLNVPEWSYIVEKDSPINGLCYPGDFNDYEELKHLLSSTNHFEKIQIIPRSSWSNHDASSGVSSACPYNGRSSFFRNVVWLIKKNNAYPTIKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYVSVGTSTLNQRSVPEIATRPKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAIMKSGLEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSDRLVLATGLRNVPQRETRGLFGAIAGFIEGGWQGMVDGWYGYLWSNEQGSGYAADKESTQKAIDGITNIINSIIDKMNTQFEAVGKEFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMINERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDDECIESVRNGTYDYPQYSEEARLNREEISSGSLPETGGGSHHHHHH
서열번호 17: UFV181155(신호 펩티드 및 태그는 밑줄 표시됨)
METISLITILLVVTASNADKICIGHQSTNSTETVDTLTETNVPVTHAKELLHTEHNGMLCATSLGHPLILDTCTIEGLVYGNPSCDLLLGGREWSYIVERSSAVNGTCYPGNVENLEELRTLFSSASSYQRIQIFPDTTWNVTYTGTSRACSGSFYRSMRWLTQKSGFYPVQDAQYTNNRGKSILFVWGIHHPPTYTEQTNLYIRNDTTTSVTTEDLNRTFKPVIGPRPLVNGLQGRIDYYWSVLKPGQTLRVRSNGNLIAPWYGHVLSGGSHGRILKTDLKGGNCVVQCQTEKGGLNSTLPFHNISKYAFGTCPKYVRVNSLKLAVGLRNVPARSSRGLFGAIAGFIEGGWPGLVAGWYGFQHSNDQGVGMAADRDSTQKAIDKITSKVNNIVDKMNKQYEIIDHEFSEVETRLNMINNKIDDQIQDVWAYNAELLVLLENQKTLDEHDANVNNLYNKVKRALGSNAMEDGKGCFELYHKCDDQCMETIRNGTYNRRKYREESRLERQKIEGSHHHHHH
서열번호 18: UFV181156(신호 펩티드 및 태그는 밑줄 표시됨)
METISLITILLVVTASNADKICIGHQSTNSTETVDTLTETNVPVTHAKELLHTEHNGMLCATSLGHPLILDTCTIEGLVYGNPSCDLLLGGREWSYIVERSSAVNGTCYPGNVENLEELRTLFSSASSYQRIQIFPDTTWNVTYTGTSRACSGSFYRSMRWLTQKSGFYPVQDAQYTNNRGKSILFVWGIHHPPTYTEQTNLYIRNDTTTSVTTEDLNRTFKPVIGPRPLVNGLQGRIDYYWSVLKPGQTLRVRSNGNLIAPWYGHVLSGGSHGRILKTDLKGGNCVVQCQTEKGGLNSTLPFHNISKYAFGTCPKYVRVNSLKLAVGLRNVPARSSRGLFGAIAGFIEGGWPGLVAGWYGFQWSNDQGVGMAADRDSTQKAIDKITSIVNNIVDKMNKQYEIIDHEFSEVETRLNMINNKIDDQIQDVWAYNAELLVLLINQKTLDEHDANVNNLYNKVKRALGSNAMEDGKGCFELYHKCDDQCIETIRNGTYNRRKYREESRLERQKIESGSLPETGGGSHHHHHH
서열번호 19: UFV181141(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYIFCLALGQDLPGNDNSTATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGIDCTLIDALLGDPHCDVFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFITEGFTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVTMPNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQTIIPNIGSRPWVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFKMRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGLFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGTYDHDVYRDEALNNRFQIKGVGSHHHHHH
서열번호 20: UFV180660(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYIFCLALGQDLPGNDNSTATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGIDCTLIDALLGDPHCDVFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFITEGFTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVTMPNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQTIIPNIWSRPWVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFKMRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGLFGAIAGFIENGWEGMIDGWYGFRWQNSEGTGQAADLKSTQAAIDQINGILNRVIEKMNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGNYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 21: UFV181137(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVSNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHQILDGENCTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVAQNGTSSACKRRSNKSFFSRLNWLHQLKYKYPALNVTMPNNEKFDKLYIWGVHHPSTDSDQISIYAQASGRVTVSTKRSQQTVIPNIGSSPWVRGVSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVGSHHHHHH
서열번호 22: UFV181096(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVSNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHQILDGENCTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVAQNGTSSACKRRSNKSFFSRLNWLHQLKYKYPALNVTMPNNEKFDKLYIWGVHHPSTDSDQISIYAQASGRVTVSTKRSQQTVIPNIGSSPWVRGVSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRWQNSEGTGQAADLKSTQAAINQINGILNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 23: UFV181145(신호 펩티드 및 태그는 밑줄 표시됨)
MIALILVALALSHTAYSQITNGTTGNPIICLGHHAVENGTSVKTLTDNHVEVVSAKELVETNHTDELCPSPLKLVDGQDCDLINGALGSPGCDRLQDTTWDVFIERPTAVDTCYPFDVPDYQSLRSILASSGSLEFIAEQFTWNGVKVDGSSSACLRGGRNSFFSRLNWLTKETNGNYGPINVTKENTGSYVRLYLWGVHHPSSDNEQTDLYKVATGRVTVSTRSDQISIVPNIGSRPRVRNQSGRISIYWTLVNPGDSIIFNSIGNLIAPRGHYKISKSTKSTVLKSDKRIGSCTSPCLTDKGSIQSDKPFQNVSRIAIGNCPKYVKQGSLMLATGMRNIPGKQAKGLFGAIAGFIENGWQGLIDGWYGFRHQNAEGTGTAADLKSTQAAIDQINGKLNRLIEKTNEKYHQIEKEFEQVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDVTDSEMNKLFERVRRQLRENAEDQGNGCFEIFHQCDNNCIESIRNGTYDHNIYRDEAINNRIKINPVGSHHHHHH
서열번호 24: UFV180661(신호 펩티드 및 태그는 밑줄 표시됨)
MIALILVALALSHTAYSQITNGTTGNPIICLGHHAVENGTSVKTLTDNHVEVVSAKELVETNHTDELCPSPLKLVDGQDCDLINGALGSPGCDRLQDTTWDVFIERPTAVDTCYPFDVPDYQSLRSILASSGSLEFIAEQFTWNGVKVDGSSSACLRGGRNSFFSRLNWLTKETNGNYGPINVTKENTGSYVRLYLWGVHHPSSDNEQTDLYKVATGRVTVSTRSDQISIVPNIGSRPRVRNQSGRISIYWTLVNPGDSIIFNSIGNLIAPRGHYKISKSTKSTVLKSDKRIGSCTSPCLTDKGSIQSDKPFQNVSRIAIGNCPKYVKQGSLMLATGMRNIPGKQAKGLFGAIAGFIENGWQGLIDGWYGFRWQNAEGTGTAADLKSTQAAIDQINGILNRLIEKMNEKYHQIEKEFEQVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDVTDSEMNKLFERVRRQLRENAEDQGNGCFEIFHQCDNNCIESIRNGTYDHNIYRDEAINNRIKINPVSGSLPETGGGSHHHHHH
서열번호 25: UFV181146(신호 펩티드 및 태그는 밑줄 표시됨)
MNTQILVFALMAIIPTNADKICLGHHAVSNGTKVNTLTERGVEVVNATETVERTNVPRICSKGKRTVDLGQCGLLGTITGPPQCDQFLEFSADLIIERREGSDVCYPGKFVNEEALRQILRESGGIDKETMGFTYSGIRTNGATSACRRSGSSFYAEMKWLLSNTDNAAFPQMTKSYKNTRKDPALIIWGIHHSGSTTEQTKLYGSGNKLITVGSSNYQQSFVPSPGARPQVNGQSGRIDFHWLILNPNDTVTFSFNGAFIAPDRASFLRGKSMGIQSGVQVDANCEGDCYHSGGTIISNLPFQNINSRAVGKCPRYVKQESLLLATGMKNVPEIPKGRGLFGAIAGFIENGWEGLIDGWYGFRHQNAQGEGTAADYKSTQSAIDQITGKLNRLIEKTNQQFELIDNEFTEVEKQIGNVINWTRDSMTEVWSYNAELLVAMENQHTIDLADSEMNKLYERVKRQLRENAEEDGTGCFEIFHKCDDDCMASIRNNTYDHSKYREEAMQNRIQIDPVGSHHHHHH
서열번호 26: UFV180664(신호 펩티드 및 태그는 밑줄 표시됨)
MNTQILVFALMAIIPTNADKICLGHHAVSNGTKVNTLTERGVEVVNATETVERTNVPRICSKGKRTVDLGQCGLLGTITGPPQCDQFLEFSADLIIERREGSDVCYPGKFVNEEALRQILRESGGIDKETMGFTYSGIRTNGATSACRRSGSSFYAEMKWLLSNTDNAAFPQMTKSYKNTRKDPALIIWGIHHSGSTTEQTKLYGSGNKLITVGSSNYQQSFVPSPGARPQVNGQSGRIDFHWLILNPNDTVTFSFNGAFIAPDRASFLRGKSMGIQSGVQVDANCEGDCYHSGGTIISNLPFQNINSRAVGKCPRYVKQESLLLATGMKNVPEIPKGRGLFGAIAGFIENGWEGLIDGWYGFRWQNAQGEGTAADYKSTQSAIDQITGILNRLIEKMNQQFELIDNEFTEVEKQIGNVINWTRDSMTEVWSYNAELLVAMINQHTIDLADSEMNKLYERVKRQLRENAEEDGTGCFEIFHKCDDDCMASIRNNTYDHSKYREEAMQNRIQIDPVSGSLPETGGGSHHHHHH
서열번호 27: UFV181147(신호 펩티드 및 태그는 밑줄 표시됨)
MYKVVVIIALLGAVKGLDRICLGHHAVANGTIVKTLTNEQEEVTNATETVESTNLNKLCMKGRSYKDLGNCHPVGMLIGTPVCDPHLTGTWDTLIERENAIAHCYPGATINEEALRQKIMESGGISKMSTGFTYGSSINSAGTTKACMRNGGDSFYAELKWLVSKTKGQNFPQTTNTYRNTDTAEHLIIWGIHHPSSTQEKNDLYGTQSLSISVESSTYQNNFVPVVGARPQVNGQSGRIDFHWTLVQPGDNITFSHNGGLIAPSRVSKLTGRGLGIQSEALIDNSCESKCFWRGGSINTKLPFQNLSPRTVGQCPKYVNQRSLLLATGMRNVPEVVQGRGLFGAIAGFIENGWEGMVDGWYGFRHQNAQGTGQAADYKSTQAAIDQITGKLNRLIEKTNTEFESIESEFSETEHQIGNVINWTKDSITDIWTYQAELLVAMENQHTIDMADSEMLNLYERVRKQLRQNAEEDGKGCFEIYHTCDDSCMESIRNNTYDHSQYREEALLNRLNINSVGSHHHHHH
서열번호 28: UFV180662(신호 펩티드 및 태그는 밑줄 표시됨)
MYKVVVIIALLGAVKGLDRICLGHHAVANGTIVKTLTNEQEEVTNATETVESTNLNKLCMKGRSYKDLGNCHPVGMLIGTPVCDPHLTGTWDTLIERENAIAHCYPGATINEEALRQKIMESGGISKMSTGFTYGSSINSAGTTKACMRNGGDSFYAELKWLVSKTKGQNFPQTTNTYRNTDTAEHLIIWGIHHPSSTQEKNDLYGTQSLSISVESSTYQNNFVPVVGARPQVNGQSGRIDFHWTLVQPGDNITFSHNGGLIAPSRVSKLTGRGLGIQSEALIDNSCESKCFWRGGSINTKLPFQNLSPRTVGQCPKYVNQRSLLLATGMRNVPEVVQGRGLFGAIAGFIENGWEGMVDGWYGFRWQNAQGTGQAADYKSTQAAIDQITGILNRLIEKMNTEFESIESEFSETEHQIGNVINWTKDSITDIWTYQAELLVAMINQHTIDMADSEMLNLYERVRKQLRQNAEEDGKGCFEIYHTCDDSCMESIRNNTYDHSQYREEALLNRLNINSSGSLPETGGGSHHHHHH
서열번호 29: UFV4239(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPSIQSQGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDEITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEIDGRSLVPRGSGHHHHHH
서열번호 30: UFV180843(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFTTANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLNQSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSRYSKKFKPEIATRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFTMERNAGSGIIISDTPVHDCNTTCQTPEGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSNVKNLYEKVRNQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREKIDSGSLVPSGSPGSGYIPEAPRDGQAYVRKDGEWVLLSTFLGGSLPETGGGSHHHHHH
서열번호 31: UFV180436(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYIFCLALGQDLPGNDNSTATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGIDCTLIDALLGDPHCDVFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFITEGFTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVTMPNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQTIIPNIGSRPWVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFKMRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGLFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGTYDHDVYRDEALNNRFQSGSLVPSGSPGSGYIPEAPRDGQAYVRKDGEWVLLSTFLGGSLPETGGGSHHHHHH
서열번호 32: UFV170466(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIVALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSNSSFFSRLNWLTHLNFKYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYNHDVYRDEALNNRFQSGSLVPRGSGSGYIPEAPRDGQAYVRKDGEWVLLSTFLGGSEPEA
서열번호 33: UFV181099(신호 펩티드 및 태그는 밑줄 표시됨)
MKTIVALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSNSSFFSRLNWLTHLNFKYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRWQNSEGRGQAADLKSTQAAIDQINGILNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYNHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 34: UFV181005(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHWQNEQGSGYAADLKSTQNAIDEITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCIESVKNGTYDYPKYSEEAKLNREEIDSGSLPETGGGSHHHHHH
서열번호 35: UFV181007(신호 펩티드 및 태그는 밑줄 표시됨)
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDEITNIVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLINERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEIDSGSLPETGGGSHHHHHH
서열번호 36: UFV181090 (신호 펩티드 및 태그는 밑줄 표시됨)
MKVKLLVLLCTFTATYADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENSHNGKLCLLKGIAPLQLGNCSVAGWILGNPECELLISKESWSYIVEKPNPENGTCYPGHFADYEELREQLSSVSSFERFEIFPKESSWPNHTVTGVSASCSHNGESSFYRNLLWLTGKNGLYPNLSKSYANNKEKEVLVLWGVHHPPNIGDQKALYHTENAYVSVVSSHYSRKFTPEIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFALSRGFGSGIINSNAPMDKCDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHWQNEQGSGYAADQKSTQNAINGITNIVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFIDIWTYNAELLVLLINERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDECIESVKNGTYDYPKYSEESKLNREKIDSGSLPETGGGSHHHHHH
서열번호 37: UFV181135 (신호 펩티드 및 태그는 밑줄 표시됨)
MKVKLLVLLCTFTATYADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENSHNGKLCLLKGIAPLQLGNCSVAGWILGNPECELLISKESWSYIVEKPNPENGTCYPGHFADYEELREQLSSVSSFERFEIFPKESSWPNHTVTGVSASCSHNGESSFYRNLLWLTGKNGLYPNLSKSYANNKEKEVLVLWGVHHPPNIGDQKALYHTENAYVSVVSSHYSRKFTPEIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFALSRGFGSGIINSNAPMDKCDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLRNIPSIQSQGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFIDIWTYNAELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDECMESVKNGTYDYPKYSEESKLNREKIDGSHHHHHH
서열번호 38: UFV181084 (신호 펩티드 및 태그는 밑줄 표시됨)
MEARLLVLLCAFAATNADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCKLKGIAPLQLGKCNIAGWLLGNPECDLLLTASSWSYIVETSNSENGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKTSSWPNHETTKGVTAACSYAGASSFYRNLLWLTKKGSSYPKLSKSYVNNKGKEVLVLWGVHHPPTGTDQQSLYQNADAYVSVGSSKYNRRFTPEIAARPKVRDQAGRMNYYWTLLEPGDTITFEATGNLIAPWYAFALNRGSGSGIITSDAPVHDCNTKCQTPHGAINSSLPFQNIHPVTIGECPKYVRSTKLRMATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHWQNEQGSGYAADQKSTQNAIDGITNIVNSVIEKMNTQFTAVGKEFNNLERRIENLNKKVDDGFLDIWTYNAELLVLLINERTLDFHDSNVRNLYEKVKSQLKNNAKEIGNGCFEFYHKCDDACIESVRNGTYDYPKYSEESKLNREEIDSGSLPETGGGSHHHHHH
서열번호 39: UFV181131 (신호 펩티드 및 태그는 밑줄 표시됨)
MEARLLVLLCAFAATNADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCKLKGIAPLQLGKCNIAGWLLGNPECDLLLTASSWSYIVETSNSENGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKTSSWPNHETTKGVTAACSYAGASSFYRNLLWLTKKGSSYPKLSKSYVNNKGKEVLVLWGVHHPPTGTDQQSLYQNADAYVSVGSSKYNRRFTPEIAARPKVRDQAGRMNYYWTLLEPGDTITFEATGNLIAPWYAFALNRGSGSGIITSDAPVHDCNTKCQTPHGAINSSLPFQNIHPVTIGECPKYVRSTKLRMATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAIDGITNKVNSVIEKMNTQFTAVGKEFNNLERRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDFHDSNVRNLYEKVKSQLKNNAKEIGNGCFEFYHKCDDACMESVRNGTYDYPKYSEESKLNREEIDGSHHHHHH
서열번호 40: UFV181095 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKNCTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFINEDFNWTGVAQDGKSYTCKRGSVNSFFSRLNWLHKLEYKYPALNVTMPNNGKFDKLYIWGVHHPSTDSDQTSLYVRASGRVTVSTKRSQQTVIPNIGSRPWVRGLSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRNGKSSIMRSDAPIGNCSSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRWQNSEGTGQAADLKSTQAAIDQINGILNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 41: UFV181140 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKNCTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFINEDFNWTGVAQDGKSYTCKRGSVNSFFSRLNWLHKLEYKYPALNVTMPNNGKFDKLYIWGVHHPSTDSDQTSLYVRASGRVTVSTKRSQQTVIPNIGSRPWVRGLSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRNGKSSIMRSDAPIGNCSSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVGSHHHHHH
서열번호 42: UFV181093 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYIFCQVLAQNLPGNDNSTATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKNCTLIDALLGDPHCDGFQNEKWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFINEGFNWTGVTQNGGSYACKRGPDKSFFSRLNWLYESESTYPVLNVTMPNNDNFDKLYIWGVHHPSTDKEQTNLYVQASGRVTVSTKRSQQTIIPNVGSRPWVRGLSSRISIYWTIVKPGDILLINSNGNLIAPRGYFKIRTGKSSIMRSDAPIGTCSSECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMIDGWYGFRWQNSEGTGQAADLKSTQAAIDQINGILNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 43: UFV181136 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYIFCQVLAQNLPGNDNSTATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKNCTLIDALLGDPHCDGFQNEKWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFINEGFNWTGVTQNGGSYACKRGPDKSFFSRLNWLYESESTYPVLNVTMPNNDNFDKLYIWGVHHPSTDKEQTNLYVQASGRVTVSTKRSQQTIIPNVGSRPWVRGLSSRISIYWTIVKPGDILLINSNGNLIAPRGYFKIRTGKSSIMRSDAPIGTCSSECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVGSHHHHHH
서열번호 44: UFV181097 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNDESFNWTGVTQNGTSSSCKRRSNNSFFSRLNWLTHLKFKYPALNVTMPNNEKFDKLYIWGVHHPVTDNDQIFLYAQASGRITVSTKRSQQTVIPNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRWQNSEGIGQAADLKSTQAAINQINGILNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 45: UFV181138 (신호 펩티드 및 태그는 밑줄 표시됨)
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNDESFNWTGVTQNGTSSSCKRRSNNSFFSRLNWLTHLKFKYPALNVTMPNNEKFDKLYIWGVHHPVTDNDQIFLYAQASGRITVSTKRSQQTVIPNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGIGQAADLKSTQAAINQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVGSHHHHHH
서열번호 46: UFV181148 (신호 펩티드 및 태그는 밑줄 표시됨)
MLSIVILFLLVAENSSQNYTGNPVICMGHHAVANGTMVKTLTDDQVEVVTAQELVESQNLPELCPSPLRLVDGQTCDIINGALGSPGCDHLNGAEWDVFIERPNAMDTCYPFDVPDYQSLRSILANNGKFEFIAEEFQWTTVKQNGKSGACKRANVNDFFRRLNWLVKSDRNAYPLQNLTKVNNGDYARLYIWGVHHPSTDTEQTNLYKNNPGRVTVSTKTSQTSVIPNIGSRPWVRGQSGRISFYWTIVEPGDLIVFNTIGNLIAPRGHYKLNNQKKGTILNTAIPIGSCVSKCHTDKGSLSTTKPFQNISRIAIGDCPKYVKQGSLKLATGMRNIPEKASRGLFGAIAGFIENGWQGLIDGWYGFRHQNAEGTGTAADLKSTQAAIDQINGKLNRLIEKTNEKYHQIEKEFEQVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDVTDSEMNKLFERVRRQLRENAEDKGNGCFEIFHKCDNNCIESIRNGTYDHDIYRDEAINNRFQIQGVGSHHHHHH
서열번호 47: UFV181149 (신호 펩티드 및 태그는 밑줄 표시됨)
MLSIVILFLLVAENSSQNYTGNPVICMGHHAVANGTMVKTLTDDQVEVVTAQELVESQNLPELCPSPLRLVDGQTCDIINGALGSPGCDHLNGAEWDVFIERPNAMDTCYPFDVPDYQSLRSILANNGKFEFIAEEFQWTTVKQNGKSGACKRANVNDFFRRLNWLVKSDRNAYPLQNLTKVNNGDYARLYIWGVHHPSTDTEQTNLYKNNPGRVTVSTKTSQTSVIPNIGSRPWVRGQSGRISFYWTIVEPGDLIVFNTIGNLIAPRGHYKLNNQKKGTILNTAIPIGSCVSKCHTDKGSLSTTKPFQNISRIAIGDCPKYVKQGSLKLATGMRNIPEKASRGLFGAIAGFIENGWQGLIDGWYGFRWQNAEGTGTAADLKSTQAAIDQINGILNRLIEKTNEKYHQIEKEFEQVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDVTDSEMNKLFERVRRQLRENAEDKGNGCFEIFHKCDNNCIESIRNGTYDHDIYRDEAINNRFQIQGVSGSLPETGGGSHHHHHH
서열번호 50: UFV190839 (신호 펩티드 및 태그는 밑줄 표시됨
MKTIIALSYIFCLALGQDLPGNDNSTATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGIDCTLIDALLGDPHCDVFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFITEGFTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVTMPNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQTIIPNIGSRPWVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFKMRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGLFGAIAGFIENGWEGMIDGWYGFRWQNSEGTGQAADLKSTQAAIDQINGILNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALINQHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGTYDHDVYRDEALNNRFQIKGVSGSLPETGGGSHHHHHH
서열번호 51: UFV190068 (신호 펩티드 및 태그는 밑줄 표시됨) MNTQILVFALMAIIPTNADKICLGHHAVSNGTKVNTLTERGVEVVNATETVERTNVPRICSKGKRTVDLGQCGLLGTITGPPQCDQFLEFSADLIIERREGSDVCYPGKFVNEEALRQILRESGGIDKETMGFTYSGIRTNGATSACRRSGSSFYAEMKWLLSNTDNAAFPQMTKSYKNTRKDPALIIWGIHHSGSTTEQTKLYGSGNKLITVGSSNYQQSFVPSPGARPQVNGQSGRIDFHWLILNPNDTVTFSFNGAFIAPDRASFLRGKSMGIQSGVQVDANCEGDCYHSGGTIISNLPFQNINSRAVGKCPRYVKQESLLLATGMKNVPEIPKGRGLFGAIAGFIENGWEGLIDGWYGFRWQNAQGEGTAADYKSTQSAIDQITGILNRLIEKTNQQFELIDNEFTEVEKQIGNVINWTRDSMTEVWSYNAELLVAMINQHTIDLADSEMNKLYERVKRQLRENAEEDGTGCFEIFHKCDDDCMASIRNNTYDHSKYREEAMQNRIQIDPVSGSLPETGGGSHHHHHH
서열번호 52: UFV190841 (신호 펩티드 및 태그는 밑줄 표시됨)
MYKVVVIIALLGAVKGDRICLGHHAVANGTIVKTLTNEQEEVTNATETVESTNLNKLCMKGRSYKDLGNCHPVGMLIGTPVCDPHLTGTWDTLIERENAIAHCYPGATINEEALRQKIMESGGISKMSTGFTYGSSINSAGTTKACMRNGGDSFYAELKWLVSKTKGQNFPQTTNTYRNTDTAEHLIIWGIHHPSSTQEKNDLYGTQSLSISVESSTYQNNFVPVVGARPQVNGQSGRIDFHWTLVQPGDNITFSHNGGLIAPSRVSKLTGRGLGIQSEALIDNSCESKCFWRGGSINTKLPFQNLSPRTVGQCPKYVNQRSLLLATGMRNVPEVVQGRGLFGAIAGFIENGWEGMVDGWYGFRWQNAQGTGQAADYKSTQAAIDQITGILNRLIEKTNTEFESIESEFSETEHQIGNVINWTKDSITDIWTYQAELLVAMINQHTIDMADSEMLNLYERVRKQLRQNAEEDGKGCFEIYHTCDDSCMESIRNNTYDHSQYREEALLNRLNINSSGSLPETGGGSHHHHHH
SEQUENCE LISTING
<110> Janssen Vaccines & Prevention B.V.
<120> RECOMBINANT INFLUENZA ANTIGENS
<130> CRU6016WOPCT1
<140> PCT/EP2020/061335
<141> 2020-04-23
<150> US 62/838,690
<151> 2019-04-25
<160> 52
<170> PatentIn version 3.5
<210> 1
<211> 550
<212> PRT
<213> Artificial Sequence
<220>
<223> CAA24269.1 haemagglutinin (Influenza A virus
(A/Aichi/2/1968(H3N2) (excluding signal sequence)
<400> 1
Gln Asp Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
1 5 10 15
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asp Asp
20 25 30
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
35 40 45
Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys
50 55 60
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Val Phe Gln
65 70 75 80
Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
85 90 95
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
100 105 110
Ala Ser Ser Gly Thr Leu Glu Phe Ile Thr Glu Gly Phe Thr Trp Thr
115 120 125
Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Gly
130 135 140
Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Ser Gly Ser Thr
145 150 155 160
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
165 170 175
Leu Tyr Ile Trp Gly Ile His His Pro Ser Thr Asn Gln Glu Gln Thr
180 185 190
Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Arg Arg
195 200 205
Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
210 215 220
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
225 230 235 240
Asp Val Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
245 250 255
Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
260 265 270
Pro Ile Asp Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
275 280 285
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
290 295 300
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
305 310 315 320
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Gly Ala Ile Ala
325 330 335
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
340 345 350
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
355 360 365
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val
370 375 380
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
385 390 395 400
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
405 410 415
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
420 425 430
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
435 440 445
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Glu Met Gly Asn
450 455 460
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Glu Ser
465 470 475 480
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
485 490 495
Asn Asn Arg Phe Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys
500 505 510
Asp Trp Ile Leu Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys
515 520 525
Val Val Leu Leu Gly Phe Ile Met Trp Ala Cys Gln Arg Gly Asn Ile
530 535 540
Arg Cys Asn Ile Cys Ile
545 550
<210> 2
<211> 121
<212> PRT
<213> Artificial Sequence
<220>
<223> CR6261 VH
<400> 2
Glu Val Gln Leu Val Glu Ser Gly Ala Glu Val Lys Lys Pro Gly Ser
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Pro Phe Arg Ser Tyr
20 25 30
Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly Pro Glu Trp Met
35 40 45
Gly Gly Ile Ile Pro Ile Phe Gly Thr Thr Lys Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Val Thr Ile Thr Ala Asp Asp Phe Ala Gly Thr Val Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Met Tyr Tyr Cys
85 90 95
Ala Lys His Met Gly Tyr Gln Val Arg Glu Thr Met Asp Val Trp Gly
100 105 110
Lys Gly Thr Thr Val Thr Val Ser Ser
115 120
<210> 3
<211> 111
<212> PRT
<213> Artificial Sequence
<220>
<223> CR6261 VL
<400> 3
Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15
Lys Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Asn Asp
20 25 30
Tyr Val Ser Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu
35 40 45
Ile Tyr Asp Asn Asn Lys Arg Pro Ser Gly Ile Pro Asp Arg Phe Ser
50 55 60
Gly Ser Lys Ser Gly Thr Ser Ala Thr Leu Gly Ile Thr Gly Leu Gln
65 70 75 80
Thr Gly Asp Glu Ala Asn Tyr Tyr Cys Ala Thr Trp Asp Arg Arg Pro
85 90 95
Thr Ala Tyr Val Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu
100 105 110
<210> 4
<211> 122
<212> PRT
<213> Artificial Sequence
<220>
<223> CR8020 VH
<400> 4
Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Val Lys Thr Pro Gly Ala
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Phe
20 25 30
Gly Val Ser Trp Ile Arg Gln Ala Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45
Gly Trp Ile Ser Ala Tyr Asn Gly Asp Thr Tyr Tyr Ala Gln Lys Phe
50 55 60
Gln Ala Arg Val Thr Met Thr Thr Asp Thr Ser Thr Thr Thr Ala Tyr
65 70 75 80
Met Glu Met Arg Ser Leu Arg Ser Asp Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Glu Pro Pro Leu Phe Tyr Ser Ser Trp Ser Leu Asp Asn Trp
100 105 110
Gly Gln Gly Thr Leu Val Thr Val Ser Ser
115 120
<210> 5
<211> 108
<212> PRT
<213> Artificial Sequence
<220>
<223> CR8020 VL
<400> 5
Glu Ile Val Leu Thr Gln Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly
1 5 10 15
Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gln Ser Val Ser Met Asn
20 25 30
Tyr Leu Ala Trp Phe Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu
35 40 45
Ile Tyr Gly Ala Ser Arg Arg Ala Thr Gly Ile Pro Asp Arg Ile Ser
50 55 60
Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Arg Leu Glu
65 70 75 80
Pro Ala Asp Phe Ala Val Tyr Tyr Cys Gln Gln Tyr Gly Thr Ser Pro
85 90 95
Arg Thr Phe Gly Gln Gly Ala Lys Val Glu Ile Lys
100 105
<210> 6
<211> 121
<212> PRT
<213> Artificial Sequence
<220>
<223> CR9114 VH
<400> 6
Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ser
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ser Ser Gly Gly Thr Ser Asn Asn Tyr
20 25 30
Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Asp Trp Met
35 40 45
Gly Gly Ile Ser Pro Ile Phe Gly Ser Thr Ala Tyr Ala Gln Lys Phe
50 55 60
Gln Gly Arg Val Thr Ile Ser Ala Asp Ile Phe Ser Asn Thr Ala Tyr
65 70 75 80
Met Glu Leu Asn Ser Leu Thr Ser Glu Asp Thr Ala Val Tyr Phe Cys
85 90 95
Ala Arg His Gly Asn Tyr Tyr Tyr Tyr Ser Gly Met Asp Val Trp Gly
100 105 110
Gln Gly Thr Thr Val Thr Val Ser Ser
115 120
<210> 7
<211> 110
<212> PRT
<213> Artificial Sequence
<220>
<223> CR9114 VL
<400> 7
Ser Tyr Val Leu Thr Gln Pro Pro Ala Val Ser Gly Thr Pro Gly Gln
1 5 10 15
Arg Val Thr Ile Ser Cys Ser Gly Ser Asp Ser Asn Ile Gly Arg Arg
20 25 30
Ser Val Asn Trp Tyr Gln Gln Phe Pro Gly Thr Ala Pro Lys Leu Leu
35 40 45
Ile Tyr Ser Asn Asp Gln Arg Pro Ser Val Val Pro Asp Arg Phe Ser
50 55 60
Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln
65 70 75 80
Ser Glu Asp Glu Ala Glu Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu
85 90 95
Lys Gly Ala Val Phe Gly Gly Gly Thr Gln Leu Thr Val Leu
100 105 110
<210> 8
<211> 749
<212> PRT
<213> Artificial Sequence
<220>
<223> MD3606
<400> 8
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Val Ser Ile Ser Ile Phe Asp Ile Tyr
20 25 30
Ala Met Asp Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Asp Leu Val
35 40 45
Ala Thr Ser Phe Arg Asp Gly Ser Thr Asn Tyr Ala Asp Ser Val Lys
50 55 60
Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu
65 70 75 80
Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Leu Cys His
85 90 95
Val Ser Leu Tyr Arg Asp Pro Leu Gly Val Ala Gly Gly Met Gly Val
100 105 110
Tyr Trp Gly Lys Gly Ala Leu Val Thr Val Ser Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val Glu Ser Gly Gly Gly
130 135 140
Leu Val Gln Ala Gly Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly
145 150 155 160
Arg Thr Tyr Ala Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg
165 170 175
Glu Phe Val Ala His Ile Asn Ala Leu Gly Thr Arg Thr Tyr Tyr Ser
180 185 190
Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn
195 200 205
Thr Glu Tyr Leu Glu Met Asn Asn Leu Lys Pro Glu Asp Thr Ala Val
210 215 220
Tyr Tyr Cys Thr Ala Gln Gly Gln Trp Arg Ala Ala Pro Val Ala Val
225 230 235 240
Ala Ala Glu Tyr Glu Phe Trp Gly Gln Gly Thr Gln Val Thr Val Ser
245 250 255
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val
260 265 270
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser
275 280 285
Cys Ala Ala Thr Gly Phe Thr Leu Glu Asn Lys Ala Ile Gly Trp Phe
290 295 300
Arg Gln Thr Pro Gly Ser Glu Arg Glu Gly Val Leu Cys Ile Ser Lys
305 310 315 320
Ser Gly Ser Trp Thr Tyr Tyr Thr Asp Ser Met Arg Gly Arg Phe Thr
325 330 335
Ile Ser Arg Asp Asn Ala Glu Asn Thr Val Tyr Leu Gln Met Asp Ser
340 345 350
Leu Lys Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala Thr Thr Thr Ala
355 360 365
Gly Gly Gly Leu Cys Trp Asp Gly Thr Thr Phe Ser Arg Leu Ala Ser
370 375 380
Ser Trp Gly Gln Gly Thr Gln Val Thr Val Ser Ser Gly Gly Gly Gly
385 390 395 400
Ser Gly Gly Gly Gly Ser Glu Val Gln Leu Val Glu Ser Gly Gly Gly
405 410 415
Leu Val Gln Pro Gly Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly
420 425 430
Phe Thr Phe Ser Thr Ser Trp Met Tyr Trp Leu Arg Gln Ala Pro Gly
435 440 445
Lys Gly Leu Glu Trp Val Ser Val Ile Asn Thr Asp Gly Gly Thr Tyr
450 455 460
Tyr Ala Asp Ser Val Lys Asp Arg Phe Thr Ile Ser Arg Asp Asn Ala
465 470 475 480
Lys Asp Thr Leu Tyr Leu Gln Met Ser Ser Leu Lys Ser Glu Asp Thr
485 490 495
Ala Val Tyr Tyr Cys Ala Lys Asp Trp Gly Gly Pro Glu Pro Thr Arg
500 505 510
Gly Gln Gly Thr Gln Val Thr Val Ser Ser Asp Lys Thr His Thr Cys
515 520 525
Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu
530 535 540
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
545 550 555 560
Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys
565 570 575
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
580 585 590
Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu
595 600 605
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
610 615 620
Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys
625 630 635 640
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
645 650 655
Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
660 665 670
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
675 680 685
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
690 695 700
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
705 710 715 720
Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
725 730 735
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
740 745
<210> 9
<211> 526
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181157
<400> 9
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Gly Ser His His His His His His
515 520 525
<210> 10
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181009
<400> 10
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His Trp Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Ile Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Ile Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Ile Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly
515 520 525
Ser His His His His His His
530 535
<210> 11
<211> 526
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181134
<400> 11
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Thr Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Ser Asn Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asn Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Asn Gln Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Thr Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Thr Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Thr Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Thr Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Glu Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Val Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Lys Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Asn Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Lys Ile Asp Gly Ser His His His His His His
515 520 525
<210> 12
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181091
<400> 12
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Thr Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Ser Asn Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asn Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Asn Gln Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Thr Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Thr Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Thr Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Thr Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Glu Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Val Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His Trp Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Lys Ile Thr Asn Ile Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Ile Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Asn Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Ile Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Lys Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly
515 520 525
Ser His His His His His His
530 535
<210> 13
<211> 522
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181153
<400> 13
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Ala Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ser Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Gln Asp Ile Leu
35 40 45
Glu Lys Thr His Asn Gly Lys Leu Cys Lys Leu Asn Gly Ile Pro Pro
50 55 60
Leu Glu Leu Gly Asp Cys Ser Ile Ala Gly Trp Leu Leu Gly Asn Pro
65 70 75 80
Glu Cys Asp Arg Leu Leu Thr Val Pro Glu Trp Ser Tyr Ile Met Glu
85 90 95
Lys Glu Asn Pro Arg Asn Gly Leu Cys Tyr Pro Gly Ser Phe Asn Asp
100 105 110
Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Val Thr His Phe Glu Lys
115 120 125
Val Lys Ile Leu Pro Arg Asp Arg Trp Thr Gln His Thr Thr Thr Gly
130 135 140
Gly Ser Arg Ala Cys Ala Val Ser Gly Asn Pro Ser Phe Phe Arg Asn
145 150 155 160
Met Val Trp Leu Thr Lys Lys Gly Ser Asn Tyr Pro Ile Ala Lys Gly
165 170 175
Ser Tyr Asn Asn Thr Ser Gly Glu Gln Met Leu Ile Ile Trp Gly Val
180 185 190
His His Pro Asn Asp Asp Ala Glu Gln Arg Thr Leu Tyr Gln Asn Val
195 200 205
Gly Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Lys Arg Ser Val
210 215 220
Pro Glu Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Gly Gly Arg Met
225 230 235 240
Glu Phe Ser Trp Thr Ile Leu Asp Met Leu Asp Thr Ile Asn Phe Glu
245 250 255
Ser Thr Gly Asn Leu Ile Ala Pro Glu Tyr Gly Phe Arg Ile Ser Lys
260 265 270
Arg Gly Ser Ser Gly Ile Met Lys Thr Glu Gly Thr Leu Glu Asn Cys
275 280 285
Glu Thr Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro
290 295 300
Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val
305 310 315 320
Lys Ser Glu Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln
325 330 335
Ile Glu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
340 345 350
Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn
355 360 365
Asp Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Arg Ala
370 375 380
Ile Asp Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn
385 390 395 400
Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Asn Asn Leu Glu Lys Arg
405 410 415
Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp
420 425 430
Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu
435 440 445
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met
450 455 460
Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe
465 470 475 480
Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr
485 490 495
Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu
500 505 510
Ile Lys Gly Ser His His His His His His
515 520
<210> 14
<211> 531
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181154
<400> 14
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Ala Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ser Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Gln Asp Ile Leu
35 40 45
Glu Lys Thr His Asn Gly Lys Leu Cys Lys Leu Asn Gly Ile Pro Pro
50 55 60
Leu Glu Leu Gly Asp Cys Ser Ile Ala Gly Trp Leu Leu Gly Asn Pro
65 70 75 80
Glu Cys Asp Arg Leu Leu Thr Val Pro Glu Trp Ser Tyr Ile Met Glu
85 90 95
Lys Glu Asn Pro Arg Asn Gly Leu Cys Tyr Pro Gly Ser Phe Asn Asp
100 105 110
Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Val Thr His Phe Glu Lys
115 120 125
Val Lys Ile Leu Pro Arg Asp Arg Trp Thr Gln His Thr Thr Thr Gly
130 135 140
Gly Ser Arg Ala Cys Ala Val Ser Gly Asn Pro Ser Phe Phe Arg Asn
145 150 155 160
Met Val Trp Leu Thr Lys Lys Gly Ser Asn Tyr Pro Ile Ala Lys Gly
165 170 175
Ser Tyr Asn Asn Thr Ser Gly Glu Gln Met Leu Ile Ile Trp Gly Val
180 185 190
His His Pro Asn Asp Asp Ala Glu Gln Arg Thr Leu Tyr Gln Asn Val
195 200 205
Gly Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Lys Arg Ser Val
210 215 220
Pro Glu Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Gly Gly Arg Met
225 230 235 240
Glu Phe Ser Trp Thr Ile Leu Asp Met Leu Asp Thr Ile Asn Phe Glu
245 250 255
Ser Thr Gly Asn Leu Ile Ala Pro Glu Tyr Gly Phe Arg Ile Ser Lys
260 265 270
Arg Gly Ser Ser Gly Ile Met Lys Thr Glu Gly Thr Leu Glu Asn Cys
275 280 285
Glu Thr Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro
290 295 300
Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val
305 310 315 320
Lys Ser Glu Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln
325 330 335
Ile Glu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
340 345 350
Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His Trp Ser Asn
355 360 365
Asp Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Arg Ala
370 375 380
Ile Asp Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Asn
385 390 395 400
Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Asn Asn Leu Glu Lys Arg
405 410 415
Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp
420 425 430
Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Ile Asn Glu Arg Thr Leu
435 440 445
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met
450 455 460
Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe
465 470 475 480
Tyr His Lys Cys Asp Asp Glu Cys Ile Asn Ser Val Lys Asn Gly Thr
485 490 495
Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu
500 505 510
Ile Lys Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His His
515 520 525
His His His
530
<210> 15
<211> 524
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181158
<400> 15
Met Glu Lys Ile Val Leu Leu Phe Ala Ile Val Ser Leu Val Gln Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Leu Glu Lys Thr His Asn Gly Lys Leu Cys Ser Leu Asn Gly Val Lys
50 55 60
Pro Leu Ile Leu Arg Asp Cys Ser Val Ala Gly Trp Leu Leu Gly Asn
65 70 75 80
Pro Met Cys Asp Glu Phe Leu Asn Val Pro Glu Trp Ser Tyr Ile Val
85 90 95
Glu Lys Asp Ser Pro Ile Asn Gly Leu Cys Tyr Pro Gly Asp Phe Asn
100 105 110
Asp Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Thr Asn His Phe Glu
115 120 125
Lys Ile Gln Ile Ile Pro Arg Ser Ser Trp Ser Asn His Asp Ala Ser
130 135 140
Ser Gly Val Ser Ser Ala Cys Pro Tyr Asn Gly Arg Ser Ser Phe Phe
145 150 155 160
Arg Asn Val Val Trp Leu Ile Lys Lys Asn Asn Ala Tyr Pro Thr Ile
165 170 175
Lys Arg Ser Tyr Asn Asn Thr Asn Gln Glu Asp Leu Leu Val Leu Trp
180 185 190
Gly Ile His His Pro Asn Asp Ala Ala Glu Gln Thr Lys Leu Tyr Gln
195 200 205
Asn Pro Thr Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Gln Arg
210 215 220
Ser Val Pro Glu Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Ser Gly
225 230 235 240
Arg Met Glu Phe Phe Trp Thr Ile Leu Lys Pro Asn Asp Ala Ile Asn
245 250 255
Phe Glu Ser Asn Gly Asn Phe Ile Ala Pro Glu Tyr Ala Tyr Lys Ile
260 265 270
Val Lys Lys Gly Asp Ser Ala Ile Met Lys Ser Gly Leu Glu Tyr Gly
275 280 285
Asn Cys Asn Thr Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser
290 295 300
Met Pro Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys
305 310 315 320
Tyr Val Lys Ser Asp Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Val
325 330 335
Pro Gln Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile
340 345 350
Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr Leu His
355 360 365
Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln
370 375 380
Lys Ala Ile Asp Gly Ile Thr Asn Lys Ile Asn Ser Ile Ile Asp Lys
385 390 395 400
Met Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Asn Asn Leu Glu
405 410 415
Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp
420 425 430
Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg
435 440 445
Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val
450 455 460
Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe
465 470 475 480
Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Glu Ser Val Arg Asn
485 490 495
Gly Thr Tyr Asp Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Asn Arg
500 505 510
Glu Glu Ile Ser Gly Ser His His His His His His
515 520
<210> 16
<211> 533
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181159
<400> 16
Met Glu Lys Ile Val Leu Leu Phe Ala Ile Val Ser Leu Val Gln Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Leu Glu Lys Thr His Asn Gly Lys Leu Cys Ser Leu Asn Gly Val Lys
50 55 60
Pro Leu Ile Leu Arg Asp Cys Ser Val Ala Gly Trp Leu Leu Gly Asn
65 70 75 80
Pro Met Cys Asp Glu Phe Leu Asn Val Pro Glu Trp Ser Tyr Ile Val
85 90 95
Glu Lys Asp Ser Pro Ile Asn Gly Leu Cys Tyr Pro Gly Asp Phe Asn
100 105 110
Asp Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Thr Asn His Phe Glu
115 120 125
Lys Ile Gln Ile Ile Pro Arg Ser Ser Trp Ser Asn His Asp Ala Ser
130 135 140
Ser Gly Val Ser Ser Ala Cys Pro Tyr Asn Gly Arg Ser Ser Phe Phe
145 150 155 160
Arg Asn Val Val Trp Leu Ile Lys Lys Asn Asn Ala Tyr Pro Thr Ile
165 170 175
Lys Arg Ser Tyr Asn Asn Thr Asn Gln Glu Asp Leu Leu Val Leu Trp
180 185 190
Gly Ile His His Pro Asn Asp Ala Ala Glu Gln Thr Lys Leu Tyr Gln
195 200 205
Asn Pro Thr Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Gln Arg
210 215 220
Ser Val Pro Glu Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Ser Gly
225 230 235 240
Arg Met Glu Phe Phe Trp Thr Ile Leu Lys Pro Asn Asp Ala Ile Asn
245 250 255
Phe Glu Ser Asn Gly Asn Phe Ile Ala Pro Glu Tyr Ala Tyr Lys Ile
260 265 270
Val Lys Lys Gly Asp Ser Ala Ile Met Lys Ser Gly Leu Glu Tyr Gly
275 280 285
Asn Cys Asn Thr Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser
290 295 300
Met Pro Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys
305 310 315 320
Tyr Val Lys Ser Asp Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Val
325 330 335
Pro Gln Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile
340 345 350
Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr Leu Trp
355 360 365
Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln
370 375 380
Lys Ala Ile Asp Gly Ile Thr Asn Ile Ile Asn Ser Ile Ile Asp Lys
385 390 395 400
Met Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Asn Asn Leu Glu
405 410 415
Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp
420 425 430
Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Ile Asn Glu Arg
435 440 445
Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val
450 455 460
Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe
465 470 475 480
Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Ile Glu Ser Val Arg Asn
485 490 495
Gly Thr Tyr Asp Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Asn Arg
500 505 510
Glu Glu Ile Ser Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His
515 520 525
His His His His His
530
<210> 17
<211> 520
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181155
<400> 17
Met Glu Thr Ile Ser Leu Ile Thr Ile Leu Leu Val Val Thr Ala Ser
1 5 10 15
Asn Ala Asp Lys Ile Cys Ile Gly His Gln Ser Thr Asn Ser Thr Glu
20 25 30
Thr Val Asp Thr Leu Thr Glu Thr Asn Val Pro Val Thr His Ala Lys
35 40 45
Glu Leu Leu His Thr Glu His Asn Gly Met Leu Cys Ala Thr Ser Leu
50 55 60
Gly His Pro Leu Ile Leu Asp Thr Cys Thr Ile Glu Gly Leu Val Tyr
65 70 75 80
Gly Asn Pro Ser Cys Asp Leu Leu Leu Gly Gly Arg Glu Trp Ser Tyr
85 90 95
Ile Val Glu Arg Ser Ser Ala Val Asn Gly Thr Cys Tyr Pro Gly Asn
100 105 110
Val Glu Asn Leu Glu Glu Leu Arg Thr Leu Phe Ser Ser Ala Ser Ser
115 120 125
Tyr Gln Arg Ile Gln Ile Phe Pro Asp Thr Thr Trp Asn Val Thr Tyr
130 135 140
Thr Gly Thr Ser Arg Ala Cys Ser Gly Ser Phe Tyr Arg Ser Met Arg
145 150 155 160
Trp Leu Thr Gln Lys Ser Gly Phe Tyr Pro Val Gln Asp Ala Gln Tyr
165 170 175
Thr Asn Asn Arg Gly Lys Ser Ile Leu Phe Val Trp Gly Ile His His
180 185 190
Pro Pro Thr Tyr Thr Glu Gln Thr Asn Leu Tyr Ile Arg Asn Asp Thr
195 200 205
Thr Thr Ser Val Thr Thr Glu Asp Leu Asn Arg Thr Phe Lys Pro Val
210 215 220
Ile Gly Pro Arg Pro Leu Val Asn Gly Leu Gln Gly Arg Ile Asp Tyr
225 230 235 240
Tyr Trp Ser Val Leu Lys Pro Gly Gln Thr Leu Arg Val Arg Ser Asn
245 250 255
Gly Asn Leu Ile Ala Pro Trp Tyr Gly His Val Leu Ser Gly Gly Ser
260 265 270
His Gly Arg Ile Leu Lys Thr Asp Leu Lys Gly Gly Asn Cys Val Val
275 280 285
Gln Cys Gln Thr Glu Lys Gly Gly Leu Asn Ser Thr Leu Pro Phe His
290 295 300
Asn Ile Ser Lys Tyr Ala Phe Gly Thr Cys Pro Lys Tyr Val Arg Val
305 310 315 320
Asn Ser Leu Lys Leu Ala Val Gly Leu Arg Asn Val Pro Ala Arg Ser
325 330 335
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
340 345 350
Pro Gly Leu Val Ala Gly Trp Tyr Gly Phe Gln His Ser Asn Asp Gln
355 360 365
Gly Val Gly Met Ala Ala Asp Arg Asp Ser Thr Gln Lys Ala Ile Asp
370 375 380
Lys Ile Thr Ser Lys Val Asn Asn Ile Val Asp Lys Met Asn Lys Gln
385 390 395 400
Tyr Glu Ile Ile Asp His Glu Phe Ser Glu Val Glu Thr Arg Leu Asn
405 410 415
Met Ile Asn Asn Lys Ile Asp Asp Gln Ile Gln Asp Val Trp Ala Tyr
420 425 430
Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Gln Lys Thr Leu Asp Glu
435 440 445
His Asp Ala Asn Val Asn Asn Leu Tyr Asn Lys Val Lys Arg Ala Leu
450 455 460
Gly Ser Asn Ala Met Glu Asp Gly Lys Gly Cys Phe Glu Leu Tyr His
465 470 475 480
Lys Cys Asp Asp Gln Cys Met Glu Thr Ile Arg Asn Gly Thr Tyr Asn
485 490 495
Arg Arg Lys Tyr Arg Glu Glu Ser Arg Leu Glu Arg Gln Lys Ile Glu
500 505 510
Gly Ser His His His His His His
515 520
<210> 18
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181156
<400> 18
Met Glu Thr Ile Ser Leu Ile Thr Ile Leu Leu Val Val Thr Ala Ser
1 5 10 15
Asn Ala Asp Lys Ile Cys Ile Gly His Gln Ser Thr Asn Ser Thr Glu
20 25 30
Thr Val Asp Thr Leu Thr Glu Thr Asn Val Pro Val Thr His Ala Lys
35 40 45
Glu Leu Leu His Thr Glu His Asn Gly Met Leu Cys Ala Thr Ser Leu
50 55 60
Gly His Pro Leu Ile Leu Asp Thr Cys Thr Ile Glu Gly Leu Val Tyr
65 70 75 80
Gly Asn Pro Ser Cys Asp Leu Leu Leu Gly Gly Arg Glu Trp Ser Tyr
85 90 95
Ile Val Glu Arg Ser Ser Ala Val Asn Gly Thr Cys Tyr Pro Gly Asn
100 105 110
Val Glu Asn Leu Glu Glu Leu Arg Thr Leu Phe Ser Ser Ala Ser Ser
115 120 125
Tyr Gln Arg Ile Gln Ile Phe Pro Asp Thr Thr Trp Asn Val Thr Tyr
130 135 140
Thr Gly Thr Ser Arg Ala Cys Ser Gly Ser Phe Tyr Arg Ser Met Arg
145 150 155 160
Trp Leu Thr Gln Lys Ser Gly Phe Tyr Pro Val Gln Asp Ala Gln Tyr
165 170 175
Thr Asn Asn Arg Gly Lys Ser Ile Leu Phe Val Trp Gly Ile His His
180 185 190
Pro Pro Thr Tyr Thr Glu Gln Thr Asn Leu Tyr Ile Arg Asn Asp Thr
195 200 205
Thr Thr Ser Val Thr Thr Glu Asp Leu Asn Arg Thr Phe Lys Pro Val
210 215 220
Ile Gly Pro Arg Pro Leu Val Asn Gly Leu Gln Gly Arg Ile Asp Tyr
225 230 235 240
Tyr Trp Ser Val Leu Lys Pro Gly Gln Thr Leu Arg Val Arg Ser Asn
245 250 255
Gly Asn Leu Ile Ala Pro Trp Tyr Gly His Val Leu Ser Gly Gly Ser
260 265 270
His Gly Arg Ile Leu Lys Thr Asp Leu Lys Gly Gly Asn Cys Val Val
275 280 285
Gln Cys Gln Thr Glu Lys Gly Gly Leu Asn Ser Thr Leu Pro Phe His
290 295 300
Asn Ile Ser Lys Tyr Ala Phe Gly Thr Cys Pro Lys Tyr Val Arg Val
305 310 315 320
Asn Ser Leu Lys Leu Ala Val Gly Leu Arg Asn Val Pro Ala Arg Ser
325 330 335
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
340 345 350
Pro Gly Leu Val Ala Gly Trp Tyr Gly Phe Gln Trp Ser Asn Asp Gln
355 360 365
Gly Val Gly Met Ala Ala Asp Arg Asp Ser Thr Gln Lys Ala Ile Asp
370 375 380
Lys Ile Thr Ser Ile Val Asn Asn Ile Val Asp Lys Met Asn Lys Gln
385 390 395 400
Tyr Glu Ile Ile Asp His Glu Phe Ser Glu Val Glu Thr Arg Leu Asn
405 410 415
Met Ile Asn Asn Lys Ile Asp Asp Gln Ile Gln Asp Val Trp Ala Tyr
420 425 430
Asn Ala Glu Leu Leu Val Leu Leu Ile Asn Gln Lys Thr Leu Asp Glu
435 440 445
His Asp Ala Asn Val Asn Asn Leu Tyr Asn Lys Val Lys Arg Ala Leu
450 455 460
Gly Ser Asn Ala Met Glu Asp Gly Lys Gly Cys Phe Glu Leu Tyr His
465 470 475 480
Lys Cys Asp Asp Gln Cys Ile Glu Thr Ile Arg Asn Gly Thr Tyr Asn
485 490 495
Arg Arg Lys Tyr Arg Glu Glu Ser Arg Leu Glu Arg Gln Lys Ile Glu
500 505 510
Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His His His His
515 520 525
His
<210> 19
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181141
<400> 19
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Ala Leu Gly
1 5 10 15
Gln Asp Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asp Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Val Phe Gln
85 90 95
Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Thr Glu Gly Phe Thr Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Gly
145 150 155 160
Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Ser Gly Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asn Gln Glu Gln Thr
195 200 205
Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Arg Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Val Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Asp Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Glu Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Gly Ser His His His His His
515 520 525
His
<210> 20
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180660
<400> 20
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Ala Leu Gly
1 5 10 15
Gln Asp Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asp Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Val Phe Gln
85 90 95
Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Thr Glu Gly Phe Thr Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Gly
145 150 155 160
Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Ser Gly Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asn Gln Glu Gln Thr
195 200 205
Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Arg Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Ile Trp Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Val Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Asp Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Met Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Glu Ser
485 490 495
Ile Arg Asn Gly Asn Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 21
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181137
<400> 21
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Ser Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Lys Glu Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asn Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Ala Gln Asn Gly Thr Ser Ser Ala Cys Lys Arg Arg Ser Asn
145 150 155 160
Lys Ser Phe Phe Ser Arg Leu Asn Trp Leu His Gln Leu Lys Tyr Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Ser Asp Gln Ile
195 200 205
Ser Ile Tyr Ala Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Ser Pro Trp Val Arg
225 230 235 240
Gly Val Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asn Gln Ile Asn Gly Lys Leu Asn Arg Leu
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Gly Ser His His His His His
515 520 525
His
<210> 22
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181096
<400> 22
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Ser Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Lys Glu Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asn Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Ala Gln Asn Gly Thr Ser Ser Ala Cys Lys Arg Arg Ser Asn
145 150 155 160
Lys Ser Phe Phe Ser Arg Leu Asn Trp Leu His Gln Leu Lys Tyr Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Ser Asp Gln Ile
195 200 205
Ser Ile Tyr Ala Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Ser Pro Trp Val Arg
225 230 235 240
Gly Val Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asn Gln Ile Asn Gly Ile Leu Asn Arg Leu
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 23
<211> 531
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181145
<400> 23
Met Ile Ala Leu Ile Leu Val Ala Leu Ala Leu Ser His Thr Ala Tyr
1 5 10 15
Ser Gln Ile Thr Asn Gly Thr Thr Gly Asn Pro Ile Ile Cys Leu Gly
20 25 30
His His Ala Val Glu Asn Gly Thr Ser Val Lys Thr Leu Thr Asp Asn
35 40 45
His Val Glu Val Val Ser Ala Lys Glu Leu Val Glu Thr Asn His Thr
50 55 60
Asp Glu Leu Cys Pro Ser Pro Leu Lys Leu Val Asp Gly Gln Asp Cys
65 70 75 80
Asp Leu Ile Asn Gly Ala Leu Gly Ser Pro Gly Cys Asp Arg Leu Gln
85 90 95
Asp Thr Thr Trp Asp Val Phe Ile Glu Arg Pro Thr Ala Val Asp Thr
100 105 110
Cys Tyr Pro Phe Asp Val Pro Asp Tyr Gln Ser Leu Arg Ser Ile Leu
115 120 125
Ala Ser Ser Gly Ser Leu Glu Phe Ile Ala Glu Gln Phe Thr Trp Asn
130 135 140
Gly Val Lys Val Asp Gly Ser Ser Ser Ala Cys Leu Arg Gly Gly Arg
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Glu Thr Asn Gly
165 170 175
Asn Tyr Gly Pro Ile Asn Val Thr Lys Glu Asn Thr Gly Ser Tyr Val
180 185 190
Arg Leu Tyr Leu Trp Gly Val His His Pro Ser Ser Asp Asn Glu Gln
195 200 205
Thr Asp Leu Tyr Lys Val Ala Thr Gly Arg Val Thr Val Ser Thr Arg
210 215 220
Ser Asp Gln Ile Ser Ile Val Pro Asn Ile Gly Ser Arg Pro Arg Val
225 230 235 240
Arg Asn Gln Ser Gly Arg Ile Ser Ile Tyr Trp Thr Leu Val Asn Pro
245 250 255
Gly Asp Ser Ile Ile Phe Asn Ser Ile Gly Asn Leu Ile Ala Pro Arg
260 265 270
Gly His Tyr Lys Ile Ser Lys Ser Thr Lys Ser Thr Val Leu Lys Ser
275 280 285
Asp Lys Arg Ile Gly Ser Cys Thr Ser Pro Cys Leu Thr Asp Lys Gly
290 295 300
Ser Ile Gln Ser Asp Lys Pro Phe Gln Asn Val Ser Arg Ile Ala Ile
305 310 315 320
Gly Asn Cys Pro Lys Tyr Val Lys Gln Gly Ser Leu Met Leu Ala Thr
325 330 335
Gly Met Arg Asn Ile Pro Gly Lys Gln Ala Lys Gly Leu Phe Gly Ala
340 345 350
Ile Ala Gly Phe Ile Glu Asn Gly Trp Gln Gly Leu Ile Asp Gly Trp
355 360 365
Tyr Gly Phe Arg His Gln Asn Ala Glu Gly Thr Gly Thr Ala Ala Asp
370 375 380
Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn
385 390 395 400
Arg Leu Ile Glu Lys Thr Asn Glu Lys Tyr His Gln Ile Glu Lys Glu
405 410 415
Phe Glu Gln Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu
420 425 430
Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala
435 440 445
Leu Glu Asn Gln His Thr Ile Asp Val Thr Asp Ser Glu Met Asn Lys
450 455 460
Leu Phe Glu Arg Val Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Gln
465 470 475 480
Gly Asn Gly Cys Phe Glu Ile Phe His Gln Cys Asp Asn Asn Cys Ile
485 490 495
Glu Ser Ile Arg Asn Gly Thr Tyr Asp His Asn Ile Tyr Arg Asp Glu
500 505 510
Ala Ile Asn Asn Arg Ile Lys Ile Asn Pro Val Gly Ser His His His
515 520 525
His His His
530
<210> 24
<211> 540
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180661
<400> 24
Met Ile Ala Leu Ile Leu Val Ala Leu Ala Leu Ser His Thr Ala Tyr
1 5 10 15
Ser Gln Ile Thr Asn Gly Thr Thr Gly Asn Pro Ile Ile Cys Leu Gly
20 25 30
His His Ala Val Glu Asn Gly Thr Ser Val Lys Thr Leu Thr Asp Asn
35 40 45
His Val Glu Val Val Ser Ala Lys Glu Leu Val Glu Thr Asn His Thr
50 55 60
Asp Glu Leu Cys Pro Ser Pro Leu Lys Leu Val Asp Gly Gln Asp Cys
65 70 75 80
Asp Leu Ile Asn Gly Ala Leu Gly Ser Pro Gly Cys Asp Arg Leu Gln
85 90 95
Asp Thr Thr Trp Asp Val Phe Ile Glu Arg Pro Thr Ala Val Asp Thr
100 105 110
Cys Tyr Pro Phe Asp Val Pro Asp Tyr Gln Ser Leu Arg Ser Ile Leu
115 120 125
Ala Ser Ser Gly Ser Leu Glu Phe Ile Ala Glu Gln Phe Thr Trp Asn
130 135 140
Gly Val Lys Val Asp Gly Ser Ser Ser Ala Cys Leu Arg Gly Gly Arg
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Glu Thr Asn Gly
165 170 175
Asn Tyr Gly Pro Ile Asn Val Thr Lys Glu Asn Thr Gly Ser Tyr Val
180 185 190
Arg Leu Tyr Leu Trp Gly Val His His Pro Ser Ser Asp Asn Glu Gln
195 200 205
Thr Asp Leu Tyr Lys Val Ala Thr Gly Arg Val Thr Val Ser Thr Arg
210 215 220
Ser Asp Gln Ile Ser Ile Val Pro Asn Ile Gly Ser Arg Pro Arg Val
225 230 235 240
Arg Asn Gln Ser Gly Arg Ile Ser Ile Tyr Trp Thr Leu Val Asn Pro
245 250 255
Gly Asp Ser Ile Ile Phe Asn Ser Ile Gly Asn Leu Ile Ala Pro Arg
260 265 270
Gly His Tyr Lys Ile Ser Lys Ser Thr Lys Ser Thr Val Leu Lys Ser
275 280 285
Asp Lys Arg Ile Gly Ser Cys Thr Ser Pro Cys Leu Thr Asp Lys Gly
290 295 300
Ser Ile Gln Ser Asp Lys Pro Phe Gln Asn Val Ser Arg Ile Ala Ile
305 310 315 320
Gly Asn Cys Pro Lys Tyr Val Lys Gln Gly Ser Leu Met Leu Ala Thr
325 330 335
Gly Met Arg Asn Ile Pro Gly Lys Gln Ala Lys Gly Leu Phe Gly Ala
340 345 350
Ile Ala Gly Phe Ile Glu Asn Gly Trp Gln Gly Leu Ile Asp Gly Trp
355 360 365
Tyr Gly Phe Arg Trp Gln Asn Ala Glu Gly Thr Gly Thr Ala Ala Asp
370 375 380
Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn
385 390 395 400
Arg Leu Ile Glu Lys Met Asn Glu Lys Tyr His Gln Ile Glu Lys Glu
405 410 415
Phe Glu Gln Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu
420 425 430
Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala
435 440 445
Leu Ile Asn Gln His Thr Ile Asp Val Thr Asp Ser Glu Met Asn Lys
450 455 460
Leu Phe Glu Arg Val Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Gln
465 470 475 480
Gly Asn Gly Cys Phe Glu Ile Phe His Gln Cys Asp Asn Asn Cys Ile
485 490 495
Glu Ser Ile Arg Asn Gly Thr Tyr Asp His Asn Ile Tyr Arg Asp Glu
500 505 510
Ala Ile Asn Asn Arg Ile Lys Ile Asn Pro Val Ser Gly Ser Leu Pro
515 520 525
Glu Thr Gly Gly Gly Ser His His His His His His
530 535 540
<210> 25
<211> 523
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181146
<400> 25
Met Asn Thr Gln Ile Leu Val Phe Ala Leu Met Ala Ile Ile Pro Thr
1 5 10 15
Asn Ala Asp Lys Ile Cys Leu Gly His His Ala Val Ser Asn Gly Thr
20 25 30
Lys Val Asn Thr Leu Thr Glu Arg Gly Val Glu Val Val Asn Ala Thr
35 40 45
Glu Thr Val Glu Arg Thr Asn Val Pro Arg Ile Cys Ser Lys Gly Lys
50 55 60
Arg Thr Val Asp Leu Gly Gln Cys Gly Leu Leu Gly Thr Ile Thr Gly
65 70 75 80
Pro Pro Gln Cys Asp Gln Phe Leu Glu Phe Ser Ala Asp Leu Ile Ile
85 90 95
Glu Arg Arg Glu Gly Ser Asp Val Cys Tyr Pro Gly Lys Phe Val Asn
100 105 110
Glu Glu Ala Leu Arg Gln Ile Leu Arg Glu Ser Gly Gly Ile Asp Lys
115 120 125
Glu Thr Met Gly Phe Thr Tyr Ser Gly Ile Arg Thr Asn Gly Ala Thr
130 135 140
Ser Ala Cys Arg Arg Ser Gly Ser Ser Phe Tyr Ala Glu Met Lys Trp
145 150 155 160
Leu Leu Ser Asn Thr Asp Asn Ala Ala Phe Pro Gln Met Thr Lys Ser
165 170 175
Tyr Lys Asn Thr Arg Lys Asp Pro Ala Leu Ile Ile Trp Gly Ile His
180 185 190
His Ser Gly Ser Thr Thr Glu Gln Thr Lys Leu Tyr Gly Ser Gly Asn
195 200 205
Lys Leu Ile Thr Val Gly Ser Ser Asn Tyr Gln Gln Ser Phe Val Pro
210 215 220
Ser Pro Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile Asp
225 230 235 240
Phe His Trp Leu Ile Leu Asn Pro Asn Asp Thr Val Thr Phe Ser Phe
245 250 255
Asn Gly Ala Phe Ile Ala Pro Asp Arg Ala Ser Phe Leu Arg Gly Lys
260 265 270
Ser Met Gly Ile Gln Ser Gly Val Gln Val Asp Ala Asn Cys Glu Gly
275 280 285
Asp Cys Tyr His Ser Gly Gly Thr Ile Ile Ser Asn Leu Pro Phe Gln
290 295 300
Asn Ile Asn Ser Arg Ala Val Gly Lys Cys Pro Arg Tyr Val Lys Gln
305 310 315 320
Glu Ser Leu Leu Leu Ala Thr Gly Met Lys Asn Val Pro Glu Ile Pro
325 330 335
Lys Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly
340 345 350
Trp Glu Gly Leu Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ala
355 360 365
Gln Gly Glu Gly Thr Ala Ala Asp Tyr Lys Ser Thr Gln Ser Ala Ile
370 375 380
Asp Gln Ile Thr Gly Lys Leu Asn Arg Leu Ile Glu Lys Thr Asn Gln
385 390 395 400
Gln Phe Glu Leu Ile Asp Asn Glu Phe Thr Glu Val Glu Lys Gln Ile
405 410 415
Gly Asn Val Ile Asn Trp Thr Arg Asp Ser Met Thr Glu Val Trp Ser
420 425 430
Tyr Asn Ala Glu Leu Leu Val Ala Met Glu Asn Gln His Thr Ile Asp
435 440 445
Leu Ala Asp Ser Glu Met Asn Lys Leu Tyr Glu Arg Val Lys Arg Gln
450 455 460
Leu Arg Glu Asn Ala Glu Glu Asp Gly Thr Gly Cys Phe Glu Ile Phe
465 470 475 480
His Lys Cys Asp Asp Asp Cys Met Ala Ser Ile Arg Asn Asn Thr Tyr
485 490 495
Asp His Ser Lys Tyr Arg Glu Glu Ala Met Gln Asn Arg Ile Gln Ile
500 505 510
Asp Pro Val Gly Ser His His His His His His
515 520
<210> 26
<211> 532
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180664
<400> 26
Met Asn Thr Gln Ile Leu Val Phe Ala Leu Met Ala Ile Ile Pro Thr
1 5 10 15
Asn Ala Asp Lys Ile Cys Leu Gly His His Ala Val Ser Asn Gly Thr
20 25 30
Lys Val Asn Thr Leu Thr Glu Arg Gly Val Glu Val Val Asn Ala Thr
35 40 45
Glu Thr Val Glu Arg Thr Asn Val Pro Arg Ile Cys Ser Lys Gly Lys
50 55 60
Arg Thr Val Asp Leu Gly Gln Cys Gly Leu Leu Gly Thr Ile Thr Gly
65 70 75 80
Pro Pro Gln Cys Asp Gln Phe Leu Glu Phe Ser Ala Asp Leu Ile Ile
85 90 95
Glu Arg Arg Glu Gly Ser Asp Val Cys Tyr Pro Gly Lys Phe Val Asn
100 105 110
Glu Glu Ala Leu Arg Gln Ile Leu Arg Glu Ser Gly Gly Ile Asp Lys
115 120 125
Glu Thr Met Gly Phe Thr Tyr Ser Gly Ile Arg Thr Asn Gly Ala Thr
130 135 140
Ser Ala Cys Arg Arg Ser Gly Ser Ser Phe Tyr Ala Glu Met Lys Trp
145 150 155 160
Leu Leu Ser Asn Thr Asp Asn Ala Ala Phe Pro Gln Met Thr Lys Ser
165 170 175
Tyr Lys Asn Thr Arg Lys Asp Pro Ala Leu Ile Ile Trp Gly Ile His
180 185 190
His Ser Gly Ser Thr Thr Glu Gln Thr Lys Leu Tyr Gly Ser Gly Asn
195 200 205
Lys Leu Ile Thr Val Gly Ser Ser Asn Tyr Gln Gln Ser Phe Val Pro
210 215 220
Ser Pro Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile Asp
225 230 235 240
Phe His Trp Leu Ile Leu Asn Pro Asn Asp Thr Val Thr Phe Ser Phe
245 250 255
Asn Gly Ala Phe Ile Ala Pro Asp Arg Ala Ser Phe Leu Arg Gly Lys
260 265 270
Ser Met Gly Ile Gln Ser Gly Val Gln Val Asp Ala Asn Cys Glu Gly
275 280 285
Asp Cys Tyr His Ser Gly Gly Thr Ile Ile Ser Asn Leu Pro Phe Gln
290 295 300
Asn Ile Asn Ser Arg Ala Val Gly Lys Cys Pro Arg Tyr Val Lys Gln
305 310 315 320
Glu Ser Leu Leu Leu Ala Thr Gly Met Lys Asn Val Pro Glu Ile Pro
325 330 335
Lys Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly
340 345 350
Trp Glu Gly Leu Ile Asp Gly Trp Tyr Gly Phe Arg Trp Gln Asn Ala
355 360 365
Gln Gly Glu Gly Thr Ala Ala Asp Tyr Lys Ser Thr Gln Ser Ala Ile
370 375 380
Asp Gln Ile Thr Gly Ile Leu Asn Arg Leu Ile Glu Lys Met Asn Gln
385 390 395 400
Gln Phe Glu Leu Ile Asp Asn Glu Phe Thr Glu Val Glu Lys Gln Ile
405 410 415
Gly Asn Val Ile Asn Trp Thr Arg Asp Ser Met Thr Glu Val Trp Ser
420 425 430
Tyr Asn Ala Glu Leu Leu Val Ala Met Ile Asn Gln His Thr Ile Asp
435 440 445
Leu Ala Asp Ser Glu Met Asn Lys Leu Tyr Glu Arg Val Lys Arg Gln
450 455 460
Leu Arg Glu Asn Ala Glu Glu Asp Gly Thr Gly Cys Phe Glu Ile Phe
465 470 475 480
His Lys Cys Asp Asp Asp Cys Met Ala Ser Ile Arg Asn Asn Thr Tyr
485 490 495
Asp His Ser Lys Tyr Arg Glu Glu Ala Met Gln Asn Arg Ile Gln Ile
500 505 510
Asp Pro Val Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His
515 520 525
His His His His
530
<210> 27
<211> 524
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181147
<400> 27
Met Tyr Lys Val Val Val Ile Ile Ala Leu Leu Gly Ala Val Lys Gly
1 5 10 15
Leu Asp Arg Ile Cys Leu Gly His His Ala Val Ala Asn Gly Thr Ile
20 25 30
Val Lys Thr Leu Thr Asn Glu Gln Glu Glu Val Thr Asn Ala Thr Glu
35 40 45
Thr Val Glu Ser Thr Asn Leu Asn Lys Leu Cys Met Lys Gly Arg Ser
50 55 60
Tyr Lys Asp Leu Gly Asn Cys His Pro Val Gly Met Leu Ile Gly Thr
65 70 75 80
Pro Val Cys Asp Pro His Leu Thr Gly Thr Trp Asp Thr Leu Ile Glu
85 90 95
Arg Glu Asn Ala Ile Ala His Cys Tyr Pro Gly Ala Thr Ile Asn Glu
100 105 110
Glu Ala Leu Arg Gln Lys Ile Met Glu Ser Gly Gly Ile Ser Lys Met
115 120 125
Ser Thr Gly Phe Thr Tyr Gly Ser Ser Ile Asn Ser Ala Gly Thr Thr
130 135 140
Lys Ala Cys Met Arg Asn Gly Gly Asp Ser Phe Tyr Ala Glu Leu Lys
145 150 155 160
Trp Leu Val Ser Lys Thr Lys Gly Gln Asn Phe Pro Gln Thr Thr Asn
165 170 175
Thr Tyr Arg Asn Thr Asp Thr Ala Glu His Leu Ile Ile Trp Gly Ile
180 185 190
His His Pro Ser Ser Thr Gln Glu Lys Asn Asp Leu Tyr Gly Thr Gln
195 200 205
Ser Leu Ser Ile Ser Val Glu Ser Ser Thr Tyr Gln Asn Asn Phe Val
210 215 220
Pro Val Val Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile
225 230 235 240
Asp Phe His Trp Thr Leu Val Gln Pro Gly Asp Asn Ile Thr Phe Ser
245 250 255
His Asn Gly Gly Leu Ile Ala Pro Ser Arg Val Ser Lys Leu Thr Gly
260 265 270
Arg Gly Leu Gly Ile Gln Ser Glu Ala Leu Ile Asp Asn Ser Cys Glu
275 280 285
Ser Lys Cys Phe Trp Arg Gly Gly Ser Ile Asn Thr Lys Leu Pro Phe
290 295 300
Gln Asn Leu Ser Pro Arg Thr Val Gly Gln Cys Pro Lys Tyr Val Asn
305 310 315 320
Gln Arg Ser Leu Leu Leu Ala Thr Gly Met Arg Asn Val Pro Glu Val
325 330 335
Val Gln Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn
340 345 350
Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly Phe Arg His Gln Asn
355 360 365
Ala Gln Gly Thr Gly Gln Ala Ala Asp Tyr Lys Ser Thr Gln Ala Ala
370 375 380
Ile Asp Gln Ile Thr Gly Lys Leu Asn Arg Leu Ile Glu Lys Thr Asn
385 390 395 400
Thr Glu Phe Glu Ser Ile Glu Ser Glu Phe Ser Glu Thr Glu His Gln
405 410 415
Ile Gly Asn Val Ile Asn Trp Thr Lys Asp Ser Ile Thr Asp Ile Trp
420 425 430
Thr Tyr Gln Ala Glu Leu Leu Val Ala Met Glu Asn Gln His Thr Ile
435 440 445
Asp Met Ala Asp Ser Glu Met Leu Asn Leu Tyr Glu Arg Val Arg Lys
450 455 460
Gln Leu Arg Gln Asn Ala Glu Glu Asp Gly Lys Gly Cys Phe Glu Ile
465 470 475 480
Tyr His Thr Cys Asp Asp Ser Cys Met Glu Ser Ile Arg Asn Asn Thr
485 490 495
Tyr Asp His Ser Gln Tyr Arg Glu Glu Ala Leu Leu Asn Arg Leu Asn
500 505 510
Ile Asn Ser Val Gly Ser His His His His His His
515 520
<210> 28
<211> 532
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180662
<400> 28
Met Tyr Lys Val Val Val Ile Ile Ala Leu Leu Gly Ala Val Lys Gly
1 5 10 15
Leu Asp Arg Ile Cys Leu Gly His His Ala Val Ala Asn Gly Thr Ile
20 25 30
Val Lys Thr Leu Thr Asn Glu Gln Glu Glu Val Thr Asn Ala Thr Glu
35 40 45
Thr Val Glu Ser Thr Asn Leu Asn Lys Leu Cys Met Lys Gly Arg Ser
50 55 60
Tyr Lys Asp Leu Gly Asn Cys His Pro Val Gly Met Leu Ile Gly Thr
65 70 75 80
Pro Val Cys Asp Pro His Leu Thr Gly Thr Trp Asp Thr Leu Ile Glu
85 90 95
Arg Glu Asn Ala Ile Ala His Cys Tyr Pro Gly Ala Thr Ile Asn Glu
100 105 110
Glu Ala Leu Arg Gln Lys Ile Met Glu Ser Gly Gly Ile Ser Lys Met
115 120 125
Ser Thr Gly Phe Thr Tyr Gly Ser Ser Ile Asn Ser Ala Gly Thr Thr
130 135 140
Lys Ala Cys Met Arg Asn Gly Gly Asp Ser Phe Tyr Ala Glu Leu Lys
145 150 155 160
Trp Leu Val Ser Lys Thr Lys Gly Gln Asn Phe Pro Gln Thr Thr Asn
165 170 175
Thr Tyr Arg Asn Thr Asp Thr Ala Glu His Leu Ile Ile Trp Gly Ile
180 185 190
His His Pro Ser Ser Thr Gln Glu Lys Asn Asp Leu Tyr Gly Thr Gln
195 200 205
Ser Leu Ser Ile Ser Val Glu Ser Ser Thr Tyr Gln Asn Asn Phe Val
210 215 220
Pro Val Val Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile
225 230 235 240
Asp Phe His Trp Thr Leu Val Gln Pro Gly Asp Asn Ile Thr Phe Ser
245 250 255
His Asn Gly Gly Leu Ile Ala Pro Ser Arg Val Ser Lys Leu Thr Gly
260 265 270
Arg Gly Leu Gly Ile Gln Ser Glu Ala Leu Ile Asp Asn Ser Cys Glu
275 280 285
Ser Lys Cys Phe Trp Arg Gly Gly Ser Ile Asn Thr Lys Leu Pro Phe
290 295 300
Gln Asn Leu Ser Pro Arg Thr Val Gly Gln Cys Pro Lys Tyr Val Asn
305 310 315 320
Gln Arg Ser Leu Leu Leu Ala Thr Gly Met Arg Asn Val Pro Glu Val
325 330 335
Val Gln Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn
340 345 350
Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly Phe Arg Trp Gln Asn
355 360 365
Ala Gln Gly Thr Gly Gln Ala Ala Asp Tyr Lys Ser Thr Gln Ala Ala
370 375 380
Ile Asp Gln Ile Thr Gly Ile Leu Asn Arg Leu Ile Glu Lys Met Asn
385 390 395 400
Thr Glu Phe Glu Ser Ile Glu Ser Glu Phe Ser Glu Thr Glu His Gln
405 410 415
Ile Gly Asn Val Ile Asn Trp Thr Lys Asp Ser Ile Thr Asp Ile Trp
420 425 430
Thr Tyr Gln Ala Glu Leu Leu Val Ala Met Ile Asn Gln His Thr Ile
435 440 445
Asp Met Ala Asp Ser Glu Met Leu Asn Leu Tyr Glu Arg Val Arg Lys
450 455 460
Gln Leu Arg Gln Asn Ala Glu Glu Asp Gly Lys Gly Cys Phe Glu Ile
465 470 475 480
Tyr His Thr Cys Asp Asp Ser Cys Met Glu Ser Ile Arg Asn Asn Thr
485 490 495
Tyr Asp His Ser Gln Tyr Arg Glu Glu Ala Leu Leu Asn Arg Leu Asn
500 505 510
Ile Asn Ser Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His
515 520 525
His His His His
530
<210> 29
<211> 534
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV4239
<400> 29
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Gln Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Gly Arg Ser Leu Val Pro Arg Gly Ser Gly
515 520 525
His His His His His His
530
<210> 30
<211> 574
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180843
<400> 30
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Thr Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Ser Asn Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asn Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Asn Gln Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Thr Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Thr Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Thr Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Thr Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Glu Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Val Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Lys Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Asn Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Lys Ile Asp Ser Gly Ser Leu Val Pro Ser Gly Ser Pro
515 520 525
Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val
530 535 540
Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Gly Ser
545 550 555 560
Leu Pro Glu Thr Gly Gly Gly Ser His His His His His His
565 570
<210> 31
<211> 573
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV180436
<400> 31
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Ala Leu Gly
1 5 10 15
Gln Asp Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asp Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Val Phe Gln
85 90 95
Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Thr Glu Gly Phe Thr Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Gly
145 150 155 160
Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Ser Gly Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asn Gln Glu Gln Thr
195 200 205
Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Arg Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Val Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Asp Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Glu Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ser Gly Ser Leu Val Pro Ser Gly Ser Pro Gly
515 520 525
Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg
530 535 540
Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Gly Ser Leu
545 550 555 560
Pro Glu Thr Gly Gly Gly Ser His His His His His His
565 570
<210> 32
<211> 562
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV170466
<400> 32
Met Lys Thr Ile Val Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Ile Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Glu Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro Gln Cys Asp Gly Phe Gln
85 90 95
Asn Lys Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asn Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Thr Ser Ser Ala Cys Ile Arg Arg Ser Asn
145 150 155 160
Ser Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr His Leu Asn Phe Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Gln Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Gly Thr Asp Lys Asp Gln Ile
195 200 205
Phe Leu Tyr Ala Gln Ser Ser Gly Arg Ile Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Ala Val Ile Pro Asn Ile Gly Ser Arg Pro Arg Ile Arg
225 230 235 240
Asn Ile Pro Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Ser Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Arg Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Leu
385 390 395 400
Ile Gly Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asn His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ser Gly Ser Leu Val Pro Arg Gly Ser Gly Ser
515 520 525
Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys
530 535 540
Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Gly Ser Glu Pro
545 550 555 560
Glu Ala
<210> 33
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181099
<400> 33
Met Lys Thr Ile Val Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Ile Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Glu Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro Gln Cys Asp Gly Phe Gln
85 90 95
Asn Lys Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asn Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Thr Ser Ser Ala Cys Ile Arg Arg Ser Asn
145 150 155 160
Ser Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr His Leu Asn Phe Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Gln Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Gly Thr Asp Lys Asp Gln Ile
195 200 205
Phe Leu Tyr Ala Gln Ser Ser Gly Arg Ile Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Ala Val Ile Pro Asn Ile Gly Ser Arg Pro Arg Ile Arg
225 230 235 240
Asn Ile Pro Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Ser Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Arg Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Leu
385 390 395 400
Ile Gly Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asn His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 34
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181005
<400> 34
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His Trp Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Ile Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly
515 520 525
Ser His His His His His His
530 535
<210> 35
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181007
<400> 35
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Ile Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Ile Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly
515 520 525
Ser His His His His His His
530 535
<210> 36
<211> 534
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181090
<400> 36
Met Lys Val Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asn Ser His Asn Gly Lys Leu Cys Leu Leu Lys Gly Ile
50 55 60
Ala Pro Leu Gln Leu Gly Asn Cys Ser Val Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Leu Leu Ile Ser Lys Glu Ser Trp Ser Tyr Ile
85 90 95
Val Glu Lys Pro Asn Pro Glu Asn Gly Thr Cys Tyr Pro Gly His Phe
100 105 110
Ala Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Glu Ser Ser Trp Pro Asn His Thr
130 135 140
Val Thr Gly Val Ser Ala Ser Cys Ser His Asn Gly Glu Ser Ser Phe
145 150 155 160
Tyr Arg Asn Leu Leu Trp Leu Thr Gly Lys Asn Gly Leu Tyr Pro Asn
165 170 175
Leu Ser Lys Ser Tyr Ala Asn Asn Lys Glu Lys Glu Val Leu Val Leu
180 185 190
Trp Gly Val His His Pro Pro Asn Ile Gly Asp Gln Lys Ala Leu Tyr
195 200 205
His Thr Glu Asn Ala Tyr Val Ser Val Val Ser Ser His Tyr Ser Arg
210 215 220
Lys Phe Thr Pro Glu Ile Ala Lys Arg Pro Lys Val Arg Asp Gln Glu
225 230 235 240
Gly Arg Ile Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr Ile
245 250 255
Ile Phe Glu Ala Asn Gly Asn Leu Ile Ala Pro Arg Tyr Ala Phe Ala
260 265 270
Leu Ser Arg Gly Phe Gly Ser Gly Ile Ile Asn Ser Asn Ala Pro Met
275 280 285
Asp Lys Cys Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser
290 295 300
Ser Leu Pro Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro
305 310 315 320
Lys Tyr Val Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn
325 330 335
Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
340 345 350
Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His
355 360 365
Trp Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr
370 375 380
Gln Asn Ala Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu
385 390 395 400
Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu
405 410 415
Glu Arg Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Ile
420 425 430
Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Ile Asn Glu
435 440 445
Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys
450 455 460
Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys
465 470 475 480
Phe Glu Phe Tyr His Lys Cys Asn Asp Glu Cys Ile Glu Ser Val Lys
485 490 495
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn
500 505 510
Arg Glu Lys Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser
515 520 525
His His His His His His
530
<210> 37
<211> 525
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181135
<400> 37
Met Lys Val Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asn Ser His Asn Gly Lys Leu Cys Leu Leu Lys Gly Ile
50 55 60
Ala Pro Leu Gln Leu Gly Asn Cys Ser Val Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Leu Leu Ile Ser Lys Glu Ser Trp Ser Tyr Ile
85 90 95
Val Glu Lys Pro Asn Pro Glu Asn Gly Thr Cys Tyr Pro Gly His Phe
100 105 110
Ala Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Glu Ser Ser Trp Pro Asn His Thr
130 135 140
Val Thr Gly Val Ser Ala Ser Cys Ser His Asn Gly Glu Ser Ser Phe
145 150 155 160
Tyr Arg Asn Leu Leu Trp Leu Thr Gly Lys Asn Gly Leu Tyr Pro Asn
165 170 175
Leu Ser Lys Ser Tyr Ala Asn Asn Lys Glu Lys Glu Val Leu Val Leu
180 185 190
Trp Gly Val His His Pro Pro Asn Ile Gly Asp Gln Lys Ala Leu Tyr
195 200 205
His Thr Glu Asn Ala Tyr Val Ser Val Val Ser Ser His Tyr Ser Arg
210 215 220
Lys Phe Thr Pro Glu Ile Ala Lys Arg Pro Lys Val Arg Asp Gln Glu
225 230 235 240
Gly Arg Ile Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr Ile
245 250 255
Ile Phe Glu Ala Asn Gly Asn Leu Ile Ala Pro Arg Tyr Ala Phe Ala
260 265 270
Leu Ser Arg Gly Phe Gly Ser Gly Ile Ile Asn Ser Asn Ala Pro Met
275 280 285
Asp Lys Cys Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser
290 295 300
Ser Leu Pro Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro
305 310 315 320
Lys Tyr Val Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn
325 330 335
Ile Pro Ser Ile Gln Ser Gln Gly Leu Phe Gly Ala Ile Ala Gly Phe
340 345 350
Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His
355 360 365
His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr
370 375 380
Gln Asn Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu
385 390 395 400
Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu
405 410 415
Glu Arg Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Ile
420 425 430
Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu
435 440 445
Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys
450 455 460
Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys
465 470 475 480
Phe Glu Phe Tyr His Lys Cys Asn Asp Glu Cys Met Glu Ser Val Lys
485 490 495
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn
500 505 510
Arg Glu Lys Ile Asp Gly Ser His His His His His His
515 520 525
<210> 38
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181084
<400> 38
Met Glu Ala Arg Leu Leu Val Leu Leu Cys Ala Phe Ala Ala Thr Asn
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Ser His Asn Gly Lys Leu Cys Lys Leu Lys Gly Ile
50 55 60
Ala Pro Leu Gln Leu Gly Lys Cys Asn Ile Ala Gly Trp Leu Leu Gly
65 70 75 80
Asn Pro Glu Cys Asp Leu Leu Leu Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Ser Asn Ser Glu Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Lys Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Glu
130 135 140
Thr Thr Lys Gly Val Thr Ala Ala Cys Ser Tyr Ala Gly Ala Ser Ser
145 150 155 160
Phe Tyr Arg Asn Leu Leu Trp Leu Thr Lys Lys Gly Ser Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Val Asn Asn Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Val His His Pro Pro Thr Gly Thr Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Ser Val Gly Ser Ser Lys Tyr Asn
210 215 220
Arg Arg Phe Thr Pro Glu Ile Ala Ala Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Ala Gly Arg Met Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Ile Ala Pro Trp Tyr Ala Phe
260 265 270
Ala Leu Asn Arg Gly Ser Gly Ser Gly Ile Ile Thr Ser Asp Ala Pro
275 280 285
Val His Asp Cys Asn Thr Lys Cys Gln Thr Pro His Gly Ala Ile Asn
290 295 300
Ser Ser Leu Pro Phe Gln Asn Ile His Pro Val Thr Ile Gly Glu Cys
305 310 315 320
Pro Lys Tyr Val Arg Ser Thr Lys Leu Arg Met Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr
355 360 365
His Trp Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Gly Ile Thr Asn Ile Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Asn
405 410 415
Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Ile Asn
435 440 445
Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Arg Asn Leu Tyr Glu
450 455 460
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asp Ala Cys Ile Glu Ser Val
485 490 495
Arg Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly
515 520 525
Ser His His His His His His
530 535
<210> 39
<211> 526
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181131
<400> 39
Met Glu Ala Arg Leu Leu Val Leu Leu Cys Ala Phe Ala Ala Thr Asn
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Ser His Asn Gly Lys Leu Cys Lys Leu Lys Gly Ile
50 55 60
Ala Pro Leu Gln Leu Gly Lys Cys Asn Ile Ala Gly Trp Leu Leu Gly
65 70 75 80
Asn Pro Glu Cys Asp Leu Leu Leu Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Ser Asn Ser Glu Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Lys Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Glu
130 135 140
Thr Thr Lys Gly Val Thr Ala Ala Cys Ser Tyr Ala Gly Ala Ser Ser
145 150 155 160
Phe Tyr Arg Asn Leu Leu Trp Leu Thr Lys Lys Gly Ser Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Val Asn Asn Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Val His His Pro Pro Thr Gly Thr Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Ala Tyr Val Ser Val Gly Ser Ser Lys Tyr Asn
210 215 220
Arg Arg Phe Thr Pro Glu Ile Ala Ala Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Ala Gly Arg Met Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Ile Ala Pro Trp Tyr Ala Phe
260 265 270
Ala Leu Asn Arg Gly Ser Gly Ser Gly Ile Ile Thr Ser Asp Ala Pro
275 280 285
Val His Asp Cys Asn Thr Lys Cys Gln Thr Pro His Gly Ala Ile Asn
290 295 300
Ser Ser Leu Pro Phe Gln Asn Ile His Pro Val Thr Ile Gly Glu Cys
305 310 315 320
Pro Lys Tyr Val Arg Ser Thr Lys Leu Arg Met Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Gly Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Asn
405 410 415
Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Arg Asn Leu Tyr Glu
450 455 460
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asp Ala Cys Met Glu Ser Val
485 490 495
Arg Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Gly Ser His His His His His His
515 520 525
<210> 40
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181095
<400> 40
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Arg Ile Leu Asp Gly Lys Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Lys Glu Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Asn Glu Asp Phe Asn Trp Thr
130 135 140
Gly Val Ala Gln Asp Gly Lys Ser Tyr Thr Cys Lys Arg Gly Ser Val
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu His Lys Leu Glu Tyr Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Gly Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Ser Asp Gln Thr
195 200 205
Ser Leu Tyr Val Arg Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Asn Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Asn Cys Ser Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Leu
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Arg Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 41
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181140
<400> 41
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Arg Ile Leu Asp Gly Lys Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Lys Glu Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Asn Glu Asp Phe Asn Trp Thr
130 135 140
Gly Val Ala Gln Asp Gly Lys Ser Tyr Thr Cys Lys Arg Gly Ser Val
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu His Lys Leu Glu Tyr Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Gly Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Ser Asp Gln Thr
195 200 205
Ser Leu Tyr Val Arg Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Asn Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Asn Cys Ser Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Leu
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Arg Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Gly Ser His His His His His
515 520 525
His
<210> 42
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181093
<400> 42
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Gln Val Leu Ala
1 5 10 15
Gln Asn Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Arg Ile Leu Asp Gly Lys Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Glu Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Asn Glu Gly Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Tyr Ala Cys Lys Arg Gly Pro Asp
145 150 155 160
Lys Ser Phe Phe Ser Arg Leu Asn Trp Leu Tyr Glu Ser Glu Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Lys Glu Gln Thr
195 200 205
Asn Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Val Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Thr Cys Ser Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 43
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181136
<400> 43
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Gln Val Leu Ala
1 5 10 15
Gln Asn Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Arg Ile Cys Asp Ser Pro His Arg Ile Leu Asp Gly Lys Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln
85 90 95
Asn Glu Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Asn Glu Gly Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Tyr Ala Cys Lys Arg Gly Pro Asp
145 150 155 160
Lys Ser Phe Phe Ser Arg Leu Asn Trp Leu Tyr Glu Ser Glu Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Lys Glu Gln Thr
195 200 205
Asn Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Val Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Thr Cys Ser Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Gly Ser His His His His His
515 520 525
His
<210> 44
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181097
<400> 44
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Ile Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Gly Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro Gln Cys Asp Gly Phe Gln
85 90 95
Asn Lys Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asp Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Thr Ser Ser Ser Cys Lys Arg Arg Ser Asn
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr His Leu Lys Phe Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Val Thr Asp Asn Asp Gln Ile
195 200 205
Phe Leu Tyr Ala Gln Ala Ser Gly Arg Ile Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Arg Pro Arg Ile Arg
225 230 235 240
Asn Ile Pro Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Ile Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asn Gln Ile Asn Gly Ile Leu Asn Arg Leu
385 390 395 400
Ile Gly Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 45
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181138
<400> 45
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Leu Cys Leu Val Phe Ala
1 5 10 15
Gln Lys Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Ile Val Lys Thr Ile Thr Asn Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Gly Ile Cys Asp Ser Pro His Gln Ile Leu Asp Gly Glu Asn Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro Gln Cys Asp Gly Phe Gln
85 90 95
Asn Lys Lys Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Tyr Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Asn Asp Glu Ser Phe Asn Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Thr Ser Ser Ser Cys Lys Arg Arg Ser Asn
145 150 155 160
Asn Ser Phe Phe Ser Arg Leu Asn Trp Leu Thr His Leu Lys Phe Lys
165 170 175
Tyr Pro Ala Leu Asn Val Thr Met Pro Asn Asn Glu Lys Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Val Thr Asp Asn Asp Gln Ile
195 200 205
Phe Leu Tyr Ala Gln Ala Ser Gly Arg Ile Thr Val Ser Thr Lys Arg
210 215 220
Ser Gln Gln Thr Val Ile Pro Asn Ile Gly Ser Arg Pro Arg Ile Arg
225 230 235 240
Asn Ile Pro Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Ile Leu Leu Ile Asn Ser Thr Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Ile Arg Ser Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Gly Lys Cys Asn Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Arg Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Arg Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Ile Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Val Asp Gly Trp Tyr Gly
355 360 365
Phe Arg His Gln Asn Ser Glu Gly Ile Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asn Gln Ile Asn Gly Lys Leu Asn Arg Leu
385 390 395 400
Ile Gly Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Arg Thr Lys Lys Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Gly Ser His His His His His
515 520 525
His
<210> 46
<211> 527
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181148
<400> 46
Met Leu Ser Ile Val Ile Leu Phe Leu Leu Val Ala Glu Asn Ser Ser
1 5 10 15
Gln Asn Tyr Thr Gly Asn Pro Val Ile Cys Met Gly His His Ala Val
20 25 30
Ala Asn Gly Thr Met Val Lys Thr Leu Thr Asp Asp Gln Val Glu Val
35 40 45
Val Thr Ala Gln Glu Leu Val Glu Ser Gln Asn Leu Pro Glu Leu Cys
50 55 60
Pro Ser Pro Leu Arg Leu Val Asp Gly Gln Thr Cys Asp Ile Ile Asn
65 70 75 80
Gly Ala Leu Gly Ser Pro Gly Cys Asp His Leu Asn Gly Ala Glu Trp
85 90 95
Asp Val Phe Ile Glu Arg Pro Asn Ala Met Asp Thr Cys Tyr Pro Phe
100 105 110
Asp Val Pro Asp Tyr Gln Ser Leu Arg Ser Ile Leu Ala Asn Asn Gly
115 120 125
Lys Phe Glu Phe Ile Ala Glu Glu Phe Gln Trp Thr Thr Val Lys Gln
130 135 140
Asn Gly Lys Ser Gly Ala Cys Lys Arg Ala Asn Val Asn Asp Phe Phe
145 150 155 160
Arg Arg Leu Asn Trp Leu Val Lys Ser Asp Arg Asn Ala Tyr Pro Leu
165 170 175
Gln Asn Leu Thr Lys Val Asn Asn Gly Asp Tyr Ala Arg Leu Tyr Ile
180 185 190
Trp Gly Val His His Pro Ser Thr Asp Thr Glu Gln Thr Asn Leu Tyr
195 200 205
Lys Asn Asn Pro Gly Arg Val Thr Val Ser Thr Lys Thr Ser Gln Thr
210 215 220
Ser Val Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg Gly Gln Ser
225 230 235 240
Gly Arg Ile Ser Phe Tyr Trp Thr Ile Val Glu Pro Gly Asp Leu Ile
245 250 255
Val Phe Asn Thr Ile Gly Asn Leu Ile Ala Pro Arg Gly His Tyr Lys
260 265 270
Leu Asn Asn Gln Lys Lys Gly Thr Ile Leu Asn Thr Ala Ile Pro Ile
275 280 285
Gly Ser Cys Val Ser Lys Cys His Thr Asp Lys Gly Ser Leu Ser Thr
290 295 300
Thr Lys Pro Phe Gln Asn Ile Ser Arg Ile Ala Ile Gly Asp Cys Pro
305 310 315 320
Lys Tyr Val Lys Gln Gly Ser Leu Lys Leu Ala Thr Gly Met Arg Asn
325 330 335
Ile Pro Glu Lys Ala Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
340 345 350
Ile Glu Asn Gly Trp Gln Gly Leu Ile Asp Gly Trp Tyr Gly Phe Arg
355 360 365
His Gln Asn Ala Glu Gly Thr Gly Thr Ala Ala Asp Leu Lys Ser Thr
370 375 380
Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Leu Ile Glu
385 390 395 400
Lys Thr Asn Glu Lys Tyr His Gln Ile Glu Lys Glu Phe Glu Gln Val
405 410 415
Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile
420 425 430
Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu Asn Gln
435 440 445
His Thr Ile Asp Val Thr Asp Ser Glu Met Asn Lys Leu Phe Glu Arg
450 455 460
Val Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Lys Gly Asn Gly Cys
465 470 475 480
Phe Glu Ile Phe His Lys Cys Asp Asn Asn Cys Ile Glu Ser Ile Arg
485 490 495
Asn Gly Thr Tyr Asp His Asp Ile Tyr Arg Asp Glu Ala Ile Asn Asn
500 505 510
Arg Phe Gln Ile Gln Gly Val Gly Ser His His His His His His
515 520 525
<210> 47
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV181149
<400> 47
Met Leu Ser Ile Val Ile Leu Phe Leu Leu Val Ala Glu Asn Ser Ser
1 5 10 15
Gln Asn Tyr Thr Gly Asn Pro Val Ile Cys Met Gly His His Ala Val
20 25 30
Ala Asn Gly Thr Met Val Lys Thr Leu Thr Asp Asp Gln Val Glu Val
35 40 45
Val Thr Ala Gln Glu Leu Val Glu Ser Gln Asn Leu Pro Glu Leu Cys
50 55 60
Pro Ser Pro Leu Arg Leu Val Asp Gly Gln Thr Cys Asp Ile Ile Asn
65 70 75 80
Gly Ala Leu Gly Ser Pro Gly Cys Asp His Leu Asn Gly Ala Glu Trp
85 90 95
Asp Val Phe Ile Glu Arg Pro Asn Ala Met Asp Thr Cys Tyr Pro Phe
100 105 110
Asp Val Pro Asp Tyr Gln Ser Leu Arg Ser Ile Leu Ala Asn Asn Gly
115 120 125
Lys Phe Glu Phe Ile Ala Glu Glu Phe Gln Trp Thr Thr Val Lys Gln
130 135 140
Asn Gly Lys Ser Gly Ala Cys Lys Arg Ala Asn Val Asn Asp Phe Phe
145 150 155 160
Arg Arg Leu Asn Trp Leu Val Lys Ser Asp Arg Asn Ala Tyr Pro Leu
165 170 175
Gln Asn Leu Thr Lys Val Asn Asn Gly Asp Tyr Ala Arg Leu Tyr Ile
180 185 190
Trp Gly Val His His Pro Ser Thr Asp Thr Glu Gln Thr Asn Leu Tyr
195 200 205
Lys Asn Asn Pro Gly Arg Val Thr Val Ser Thr Lys Thr Ser Gln Thr
210 215 220
Ser Val Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg Gly Gln Ser
225 230 235 240
Gly Arg Ile Ser Phe Tyr Trp Thr Ile Val Glu Pro Gly Asp Leu Ile
245 250 255
Val Phe Asn Thr Ile Gly Asn Leu Ile Ala Pro Arg Gly His Tyr Lys
260 265 270
Leu Asn Asn Gln Lys Lys Gly Thr Ile Leu Asn Thr Ala Ile Pro Ile
275 280 285
Gly Ser Cys Val Ser Lys Cys His Thr Asp Lys Gly Ser Leu Ser Thr
290 295 300
Thr Lys Pro Phe Gln Asn Ile Ser Arg Ile Ala Ile Gly Asp Cys Pro
305 310 315 320
Lys Tyr Val Lys Gln Gly Ser Leu Lys Leu Ala Thr Gly Met Arg Asn
325 330 335
Ile Pro Glu Lys Ala Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
340 345 350
Ile Glu Asn Gly Trp Gln Gly Leu Ile Asp Gly Trp Tyr Gly Phe Arg
355 360 365
Trp Gln Asn Ala Glu Gly Thr Gly Thr Ala Ala Asp Leu Lys Ser Thr
370 375 380
Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Leu Ile Glu
385 390 395 400
Lys Thr Asn Glu Lys Tyr His Gln Ile Glu Lys Glu Phe Glu Gln Val
405 410 415
Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile
420 425 430
Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile Asn Gln
435 440 445
His Thr Ile Asp Val Thr Asp Ser Glu Met Asn Lys Leu Phe Glu Arg
450 455 460
Val Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Lys Gly Asn Gly Cys
465 470 475 480
Phe Glu Ile Phe His Lys Cys Asp Asn Asn Cys Ile Glu Ser Ile Arg
485 490 495
Asn Gly Thr Tyr Asp His Asp Ile Tyr Arg Asp Glu Ala Ile Asn Asn
500 505 510
Arg Phe Gln Ile Gln Gly Val Ser Gly Ser Leu Pro Glu Thr Gly Gly
515 520 525
Gly Ser His His His His His His
530 535
<210> 48
<400> 48
000
<210> 49
<400> 49
000
<210> 50
<211> 538
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV190839
<400> 50
Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Ala Leu Gly
1 5 10 15
Gln Asp Leu Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30
His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asp Asp
35 40 45
Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr
50 55 60
Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys
65 70 75 80
Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Val Phe Gln
85 90 95
Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn
100 105 110
Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val
115 120 125
Ala Ser Ser Gly Thr Leu Glu Phe Ile Thr Glu Gly Phe Thr Trp Thr
130 135 140
Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Gly
145 150 155 160
Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Thr Lys Ser Gly Ser Thr
165 170 175
Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys
180 185 190
Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asn Gln Glu Gln Thr
195 200 205
Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Arg Arg
210 215 220
Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg
225 230 235 240
Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly
245 250 255
Asp Val Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly
260 265 270
Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala
275 280 285
Pro Ile Asp Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile
290 295 300
Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala
305 310 315 320
Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met
325 330 335
Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Gly Ala Ile Ala
340 345 350
Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly
355 360 365
Phe Arg Trp Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys
370 375 380
Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Ile Leu Asn Arg Val
385 390 395 400
Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser
405 410 415
Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
420 425 430
Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Ile
435 440 445
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
450 455 460
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
465 470 475 480
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Glu Ser
485 490 495
Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
500 505 510
Asn Asn Arg Phe Gln Ile Lys Gly Val Ser Gly Ser Leu Pro Glu Thr
515 520 525
Gly Gly Gly Ser His His His His His His
530 535
<210> 51
<211> 532
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV190068
<400> 51
Met Asn Thr Gln Ile Leu Val Phe Ala Leu Met Ala Ile Ile Pro Thr
1 5 10 15
Asn Ala Asp Lys Ile Cys Leu Gly His His Ala Val Ser Asn Gly Thr
20 25 30
Lys Val Asn Thr Leu Thr Glu Arg Gly Val Glu Val Val Asn Ala Thr
35 40 45
Glu Thr Val Glu Arg Thr Asn Val Pro Arg Ile Cys Ser Lys Gly Lys
50 55 60
Arg Thr Val Asp Leu Gly Gln Cys Gly Leu Leu Gly Thr Ile Thr Gly
65 70 75 80
Pro Pro Gln Cys Asp Gln Phe Leu Glu Phe Ser Ala Asp Leu Ile Ile
85 90 95
Glu Arg Arg Glu Gly Ser Asp Val Cys Tyr Pro Gly Lys Phe Val Asn
100 105 110
Glu Glu Ala Leu Arg Gln Ile Leu Arg Glu Ser Gly Gly Ile Asp Lys
115 120 125
Glu Thr Met Gly Phe Thr Tyr Ser Gly Ile Arg Thr Asn Gly Ala Thr
130 135 140
Ser Ala Cys Arg Arg Ser Gly Ser Ser Phe Tyr Ala Glu Met Lys Trp
145 150 155 160
Leu Leu Ser Asn Thr Asp Asn Ala Ala Phe Pro Gln Met Thr Lys Ser
165 170 175
Tyr Lys Asn Thr Arg Lys Asp Pro Ala Leu Ile Ile Trp Gly Ile His
180 185 190
His Ser Gly Ser Thr Thr Glu Gln Thr Lys Leu Tyr Gly Ser Gly Asn
195 200 205
Lys Leu Ile Thr Val Gly Ser Ser Asn Tyr Gln Gln Ser Phe Val Pro
210 215 220
Ser Pro Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile Asp
225 230 235 240
Phe His Trp Leu Ile Leu Asn Pro Asn Asp Thr Val Thr Phe Ser Phe
245 250 255
Asn Gly Ala Phe Ile Ala Pro Asp Arg Ala Ser Phe Leu Arg Gly Lys
260 265 270
Ser Met Gly Ile Gln Ser Gly Val Gln Val Asp Ala Asn Cys Glu Gly
275 280 285
Asp Cys Tyr His Ser Gly Gly Thr Ile Ile Ser Asn Leu Pro Phe Gln
290 295 300
Asn Ile Asn Ser Arg Ala Val Gly Lys Cys Pro Arg Tyr Val Lys Gln
305 310 315 320
Glu Ser Leu Leu Leu Ala Thr Gly Met Lys Asn Val Pro Glu Ile Pro
325 330 335
Lys Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly
340 345 350
Trp Glu Gly Leu Ile Asp Gly Trp Tyr Gly Phe Arg Trp Gln Asn Ala
355 360 365
Gln Gly Glu Gly Thr Ala Ala Asp Tyr Lys Ser Thr Gln Ser Ala Ile
370 375 380
Asp Gln Ile Thr Gly Ile Leu Asn Arg Leu Ile Glu Lys Thr Asn Gln
385 390 395 400
Gln Phe Glu Leu Ile Asp Asn Glu Phe Thr Glu Val Glu Lys Gln Ile
405 410 415
Gly Asn Val Ile Asn Trp Thr Arg Asp Ser Met Thr Glu Val Trp Ser
420 425 430
Tyr Asn Ala Glu Leu Leu Val Ala Met Ile Asn Gln His Thr Ile Asp
435 440 445
Leu Ala Asp Ser Glu Met Asn Lys Leu Tyr Glu Arg Val Lys Arg Gln
450 455 460
Leu Arg Glu Asn Ala Glu Glu Asp Gly Thr Gly Cys Phe Glu Ile Phe
465 470 475 480
His Lys Cys Asp Asp Asp Cys Met Ala Ser Ile Arg Asn Asn Thr Tyr
485 490 495
Asp His Ser Lys Tyr Arg Glu Glu Ala Met Gln Asn Arg Ile Gln Ile
500 505 510
Asp Pro Val Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His
515 520 525
His His His His
530
<210> 52
<211> 531
<212> PRT
<213> Artificial Sequence
<220>
<223> UFV190841
<400> 52
Met Tyr Lys Val Val Val Ile Ile Ala Leu Leu Gly Ala Val Lys Gly
1 5 10 15
Asp Arg Ile Cys Leu Gly His His Ala Val Ala Asn Gly Thr Ile Val
20 25 30
Lys Thr Leu Thr Asn Glu Gln Glu Glu Val Thr Asn Ala Thr Glu Thr
35 40 45
Val Glu Ser Thr Asn Leu Asn Lys Leu Cys Met Lys Gly Arg Ser Tyr
50 55 60
Lys Asp Leu Gly Asn Cys His Pro Val Gly Met Leu Ile Gly Thr Pro
65 70 75 80
Val Cys Asp Pro His Leu Thr Gly Thr Trp Asp Thr Leu Ile Glu Arg
85 90 95
Glu Asn Ala Ile Ala His Cys Tyr Pro Gly Ala Thr Ile Asn Glu Glu
100 105 110
Ala Leu Arg Gln Lys Ile Met Glu Ser Gly Gly Ile Ser Lys Met Ser
115 120 125
Thr Gly Phe Thr Tyr Gly Ser Ser Ile Asn Ser Ala Gly Thr Thr Lys
130 135 140
Ala Cys Met Arg Asn Gly Gly Asp Ser Phe Tyr Ala Glu Leu Lys Trp
145 150 155 160
Leu Val Ser Lys Thr Lys Gly Gln Asn Phe Pro Gln Thr Thr Asn Thr
165 170 175
Tyr Arg Asn Thr Asp Thr Ala Glu His Leu Ile Ile Trp Gly Ile His
180 185 190
His Pro Ser Ser Thr Gln Glu Lys Asn Asp Leu Tyr Gly Thr Gln Ser
195 200 205
Leu Ser Ile Ser Val Glu Ser Ser Thr Tyr Gln Asn Asn Phe Val Pro
210 215 220
Val Val Gly Ala Arg Pro Gln Val Asn Gly Gln Ser Gly Arg Ile Asp
225 230 235 240
Phe His Trp Thr Leu Val Gln Pro Gly Asp Asn Ile Thr Phe Ser His
245 250 255
Asn Gly Gly Leu Ile Ala Pro Ser Arg Val Ser Lys Leu Thr Gly Arg
260 265 270
Gly Leu Gly Ile Gln Ser Glu Ala Leu Ile Asp Asn Ser Cys Glu Ser
275 280 285
Lys Cys Phe Trp Arg Gly Gly Ser Ile Asn Thr Lys Leu Pro Phe Gln
290 295 300
Asn Leu Ser Pro Arg Thr Val Gly Gln Cys Pro Lys Tyr Val Asn Gln
305 310 315 320
Arg Ser Leu Leu Leu Ala Thr Gly Met Arg Asn Val Pro Glu Val Val
325 330 335
Gln Gly Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly
340 345 350
Trp Glu Gly Met Val Asp Gly Trp Tyr Gly Phe Arg Trp Gln Asn Ala
355 360 365
Gln Gly Thr Gly Gln Ala Ala Asp Tyr Lys Ser Thr Gln Ala Ala Ile
370 375 380
Asp Gln Ile Thr Gly Ile Leu Asn Arg Leu Ile Glu Lys Thr Asn Thr
385 390 395 400
Glu Phe Glu Ser Ile Glu Ser Glu Phe Ser Glu Thr Glu His Gln Ile
405 410 415
Gly Asn Val Ile Asn Trp Thr Lys Asp Ser Ile Thr Asp Ile Trp Thr
420 425 430
Tyr Gln Ala Glu Leu Leu Val Ala Met Ile Asn Gln His Thr Ile Asp
435 440 445
Met Ala Asp Ser Glu Met Leu Asn Leu Tyr Glu Arg Val Arg Lys Gln
450 455 460
Leu Arg Gln Asn Ala Glu Glu Asp Gly Lys Gly Cys Phe Glu Ile Tyr
465 470 475 480
His Thr Cys Asp Asp Ser Cys Met Glu Ser Ile Arg Asn Asn Thr Tyr
485 490 495
Asp His Ser Gln Tyr Arg Glu Glu Ala Leu Leu Asn Arg Leu Asn Ile
500 505 510
Asn Ser Ser Gly Ser Leu Pro Glu Thr Gly Gly Gly Ser His His His
515 520 525
His His His
530
Claims (18)
- 재조합 인플루엔자 A 적혈구응집소(HA) 폴리펩티드로서, 인플루엔자 A 바이러스 HA의 HA1 및 HA2 도메인을 포함하고,
(a) 위치 355의 아미노산은 W이고;
(b) 위치 432의 아미노산은 I이고/이거나, 위치 380의 아미노산은 I인 아미노산 서열을 포함하고;
HA 폴리펩티드의 아미노산 서열의 아미노산 위치의 넘버링은 기준 H3N2 인플루엔자 균주, 특히 기준 균주 H3N2 A/아이치/2/68(서열번호 1)로부터의 HA의 아미노산 서열의 아미노산의 넘버링에 따르는 것인 HA 폴리펩티드. - 제1항에 있어서,
(a) 위치 388의 아미노산은 M이고/이거나;
(b) 위치 478의 아미노산은 I인 아미노산 서열을 포함하는 것인 HA 폴리펩티드. - 제1항 또는 제2항에 있어서, HA1 및 HA2 도메인 사이에 프로테아제 절단 부위를 포함하지 않는 것인 HA 폴리펩티드.
- 제1항 내지 제3항 중 어느 한 항에 있어서, HA1 및 HA2 도메인은 그룹 1 및/또는 그룹 2 인플루엔자 A 바이러스로부터 유래되는 것인 HA 폴리펩티드.
- 제1항 내지 제4항 중 어느 한 항에 있어서, 절단된 HA1 및/또는 HA2 도메인을 포함하는 것인 HA 폴리펩티드.
- 제5항에 있어서, 막횡단 및 세포질내 도메인이 HA2 도메인으로부터 결실된 것인 HA 폴리펩티드.
- 제5항 또는 제6항에 있어서, 위치 515의 아미노산에 해당하는 아미노산으로 시작하는 HA2 도메인의 적어도 C 말단 부분은 결실된 것인 HA 폴리펩티드.
- 제1항 내지 제7항 중 어느 한 항에 있어서, (절단된) HA2 도메인의 C 말단에 위치한 검출 및/또는 정제 태그를 포함하는 것인 HA 폴리펩티드.
- 제1항 내지 제8항 중 어느 한 항의 폴리펩티드의 면역원성 단편.
- 제1항 내지 제8항 중 어느 한 항의 적어도 두 개의 HA 폴리펩티드, 또는 제9항의 면역원성 단편을 포함하는 다량체 폴리펩티드.
- 제10항에 있어서, 폴리펩티드는 삼량체이고, 제1항 내지 제8항 중 어느 한 항의 세 개의 HA 폴리펩티드를 포함하는 것인 다량체 폴리펩티드.
- 제1항 내지 제8항, 제10항, 또는 제11항 중 어느 한 항의 HA 폴리펩티드, 또는 제9항의 면역원성 단편을 암호화하는 핵산.
- 제11항의 핵산 분자를 포함하는 벡터.
- 제12항에 있어서, 재조합 아데노바이러스 벡터인 것인, 벡터.
- 제1항 내지 제8항, 제10항, 또는 제11항 중 어느 한 항의 재조합 HA 폴리펩티드, 또는 제9항의 면역원성 단편을 제조하는 방법으로서, 원핵생물 또는 진핵생물 세포에서 제12항의 핵산 분자를 발현시키는 단계를 포함하고, 상기 세포로부터 HA 폴리펩티드 또는 이의 단편을 단리하는 단계를 추가로 선택적으로 포함하는 방법.
- 제1항 내지 제8항, 제10항 또는 제11항 중 어느 한 항의 HA 폴리펩티드, 제9항의 면역원성 단편, 제12항의 핵산, 및/또는 제13항 또는 제14항의 벡터, 및 약제학적으로 허용 가능한 담체를 포함하는 면역원성 조성물.
- 인플루엔자 바이러스에 대한 면역 반응을 유도하는 데 사용하기 위한, 제1항 내지 제8항, 제10항 또는 제11항 중 어느 한 항의 HA 폴리펩티드, 제9항의 면역원성 단편, 제12항의 핵산, 및/또는 제13항 또는 제14항의 벡터.
- 백신으로서 사용하기 위한, 제1항 내지 제8항, 제10항 또는 제11항 중 어느 한 항의 HA 폴리펩티드, 제9항의 면역원성 단편, 제12항의 핵산, 및/또는 제13항 또는 제14항의 벡터.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962838690P | 2019-04-25 | 2019-04-25 | |
US62/838,690 | 2019-04-25 | ||
PCT/EP2020/061335 WO2020216844A1 (en) | 2019-04-25 | 2020-04-23 | Recombinant influenza antigens |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20220005002A true KR20220005002A (ko) | 2022-01-12 |
Family
ID=70476193
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020217036388A KR20220005002A (ko) | 2019-04-25 | 2020-04-23 | 재조합 인플루엔자 항원 |
Country Status (11)
Country | Link |
---|---|
US (1) | US20220204567A1 (ko) |
EP (1) | EP3959228A1 (ko) |
JP (1) | JP2022530439A (ko) |
KR (1) | KR20220005002A (ko) |
CN (1) | CN113597428A (ko) |
AU (1) | AU2020263900A1 (ko) |
BR (1) | BR112021020907A2 (ko) |
CA (1) | CA3137448A1 (ko) |
EA (1) | EA202192921A1 (ko) |
MX (1) | MX2021012991A (ko) |
WO (1) | WO2020216844A1 (ko) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10456462B2 (en) | 2015-07-07 | 2019-10-29 | Janssen Vaccines & Preventions B.V. | Vaccine against RSV |
WO2017174564A1 (en) | 2016-04-05 | 2017-10-12 | Janssen Vaccines & Prevention B.V. | Vaccine against rsv |
PL3464331T3 (pl) | 2016-05-30 | 2021-04-19 | Janssen Vaccines & Prevention B.V. | Stabilizowane przedfuzyjne białka F RSV |
CN117693359A (zh) * | 2021-06-10 | 2024-03-12 | 乔治亚大学研究基金会股份有限公司 | 作为免疫原的广泛反应性病毒抗原、其组合物和使用方法 |
WO2023126982A1 (en) * | 2021-12-31 | 2023-07-06 | Mynvax Private Limited | Polypeptide fragments, immunogenic composition against influenza virus, and implementations thereof |
TW202408567A (zh) * | 2022-05-06 | 2024-03-01 | 法商賽諾菲公司 | 用於核酸疫苗之訊息序列 |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5057540A (en) | 1987-05-29 | 1991-10-15 | Cambridge Biotech Corporation | Saponin adjuvant |
NZ230747A (en) | 1988-09-30 | 1992-05-26 | Bror Morein | Immunomodulating matrix comprising a complex of at least one lipid and at least one saponin; certain glycosylated triterpenoid saponins derived from quillaja saponaria molina |
JPH0832638B2 (ja) | 1989-05-25 | 1996-03-29 | カイロン コーポレイション | サブミクロン油滴乳剤を含んで成るアジュバント製剤 |
FR2705686B1 (fr) | 1993-05-28 | 1995-08-18 | Transgene Sa | Nouveaux adénovirus défectifs et lignées de complémentation correspondantes. |
US5851806A (en) | 1994-06-10 | 1998-12-22 | Genvec, Inc. | Complementary adenoviral systems and cell lines |
EP0784690B1 (en) | 1994-06-10 | 2006-08-16 | Genvec, Inc. | Complementary adenoviral vector systems and cell lines |
US5846782A (en) | 1995-11-28 | 1998-12-08 | Genvec, Inc. | Targeting adenovirus with use of constrained peptide motifs |
US5965541A (en) | 1995-11-28 | 1999-10-12 | Genvec, Inc. | Vectors and methods for gene transfer to cells |
US5559099A (en) | 1994-09-08 | 1996-09-24 | Genvec, Inc. | Penton base protein and methods of using same |
US5786464C1 (en) | 1994-09-19 | 2012-04-24 | Gen Hospital Corp | Overexpression of mammalian and viral proteins |
AUPM873294A0 (en) | 1994-10-12 | 1994-11-03 | Csl Limited | Saponin preparations and use thereof in iscoms |
SI0833934T2 (sl) | 1995-06-15 | 2013-04-30 | Crucell Holland B.V. | Pakirni sistemi za humani rekombinantni adenovirus za uporabo v genski terapiji |
US5837511A (en) | 1995-10-02 | 1998-11-17 | Cornell Research Foundation, Inc. | Non-group C adenoviral vectors |
US6020191A (en) | 1997-04-14 | 2000-02-01 | Genzyme Corporation | Adenoviral vectors capable of facilitating increased persistence of transgene expression |
US5981225A (en) | 1998-04-16 | 1999-11-09 | Baylor College Of Medicine | Gene transfer vector, recombinant adenovirus particles containing the same, method for producing the same and method of use of the same |
US6113913A (en) | 1998-06-26 | 2000-09-05 | Genvec, Inc. | Recombinant adenovirus |
SE0202110D0 (sv) | 2002-07-05 | 2002-07-05 | Isconova Ab | Iscom preparation and use thereof |
SE0301998D0 (sv) | 2003-07-07 | 2003-07-07 | Isconova Ab | Quil A fraction with low toxicity and use thereof |
US9163068B2 (en) * | 2009-11-03 | 2015-10-20 | The United States of America as represented by the Secretary of the Department of Health and Human Services, National Institutes of Health, Office of Technology Transfer | Influenza virus recombinant proteins |
SG11201402633UA (en) * | 2011-11-28 | 2014-09-26 | Crucell Holland Bv | Influenza virus vaccines and uses thereof |
-
2020
- 2020-04-23 EP EP20720893.5A patent/EP3959228A1/en active Pending
- 2020-04-23 CA CA3137448A patent/CA3137448A1/en active Pending
- 2020-04-23 BR BR112021020907A patent/BR112021020907A2/pt unknown
- 2020-04-23 EA EA202192921A patent/EA202192921A1/ru unknown
- 2020-04-23 WO PCT/EP2020/061335 patent/WO2020216844A1/en unknown
- 2020-04-23 AU AU2020263900A patent/AU2020263900A1/en active Pending
- 2020-04-23 KR KR1020217036388A patent/KR20220005002A/ko active Search and Examination
- 2020-04-23 JP JP2021563156A patent/JP2022530439A/ja active Pending
- 2020-04-23 US US17/594,576 patent/US20220204567A1/en active Pending
- 2020-04-23 MX MX2021012991A patent/MX2021012991A/es unknown
- 2020-04-23 CN CN202080019855.2A patent/CN113597428A/zh active Pending
Also Published As
Publication number | Publication date |
---|---|
MX2021012991A (es) | 2021-12-10 |
WO2020216844A1 (en) | 2020-10-29 |
BR112021020907A2 (pt) | 2022-04-19 |
US20220204567A1 (en) | 2022-06-30 |
EP3959228A1 (en) | 2022-03-02 |
CA3137448A1 (en) | 2020-10-29 |
JP2022530439A (ja) | 2022-06-29 |
CN113597428A (zh) | 2021-11-02 |
AU2020263900A1 (en) | 2021-10-14 |
EA202192921A1 (ru) | 2022-02-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102463632B1 (ko) | 인플루엔자 바이러스 백신 및 이의 용도 | |
US10328144B2 (en) | Influenza virus vaccines and uses thereof | |
KR102252163B1 (ko) | 인플루엔자 바이러스 백신 및 그의 용도 | |
KR20220005002A (ko) | 재조합 인플루엔자 항원 | |
KR20140099515A (ko) | 인플루엔자 바이러스 백신 및 이의 용도 | |
US11905314B2 (en) | Influenza virus vaccines and uses thereof | |
US20230250135A1 (en) | Influenza virus vaccines and uses thereof | |
KR20220082042A (ko) | 인플루엔자 바이러스 백신 및 이의 용도 | |
IL294808A (en) | History of pyrido]2,3-e]oxazine as agricultural chemicals | |
JP7167088B2 (ja) | インフルエンザウイルスワクチンおよびその使用 | |
EA045051B1 (ru) | Вакцины против вируса гриппа и пути их применения | |
NZ715583B2 (en) | Influenza virus vaccines and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination |