CN114716560B - Human papilloma virus 18 chimeric protein and application thereof - Google Patents
Human papilloma virus 18 chimeric protein and application thereof
- Publication number
- CN114716560B CN202110002251.3A CN202110002251A
- Authority
- CN
- China
- Prior art keywords
- ser
- val
- leu
- thr
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 94
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 91
- 241000388169 Alphapapillomavirus 7 Species 0.000 title abstract description 4
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 222
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 187
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 115
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 84
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 72
- 229920001184 polypeptide Polymers 0.000 claims abstract description 66
- 241001631646 Papillomaviridae Species 0.000 claims abstract description 25
- 235000018102 proteins Nutrition 0.000 claims description 176
- 235000001014 amino acid Nutrition 0.000 claims description 128
- 150000001413 amino acids Chemical class 0.000 claims description 128
- 108020004705 Codon Proteins 0.000 claims description 42
- 108091033319 polynucleotide Proteins 0.000 claims description 41
- 239000002157 polynucleotide Substances 0.000 claims description 41
- 102000040430 polynucleotide Human genes 0.000 claims description 41
- 241000238631 Hexapoda Species 0.000 claims description 40
- 241000588724 Escherichia coli Species 0.000 claims description 30
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 30
- 241000701806 Human papillomavirus Species 0.000 claims description 29
- 229960005486 vaccine Drugs 0.000 claims description 28
- 239000004471 Glycine Substances 0.000 claims description 22
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 21
- 208000009608 Papillomavirus Infections Diseases 0.000 claims description 18
- 239000002245 particle Substances 0.000 claims description 17
- 239000002671 adjuvant Substances 0.000 claims description 15
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 14
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims description 14
- 101000641175 Human papillomavirus type 18 Major capsid protein L1 Proteins 0.000 claims description 13
- 206010008342 Cervix carcinoma Diseases 0.000 claims description 11
- 206010028980 Neoplasm Diseases 0.000 claims description 11
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 claims description 11
- 201000011510 cancer Diseases 0.000 claims description 11
- 201000010881 cervical cancer Diseases 0.000 claims description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 11
- 201000010099 disease Diseases 0.000 claims description 9
- 235000003704 aspartic acid Nutrition 0.000 claims description 7
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 7
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 7
- 239000013598 vector Substances 0.000 claims description 6
- 238000002360 preparation method Methods 0.000 claims description 5
- 208000032271 Malignant tumor of penis Diseases 0.000 claims description 4
- 208000002471 Penile Neoplasms Diseases 0.000 claims description 4
- 206010034299 Penile cancer Diseases 0.000 claims description 4
- 208000003445 Mouth Neoplasms Diseases 0.000 claims description 3
- 206010031096 Oropharyngeal cancer Diseases 0.000 claims description 3
- 206010057444 Oropharyngeal neoplasm Diseases 0.000 claims description 3
- 208000006842 Tonsillar Neoplasms Diseases 0.000 claims description 3
- 208000012987 lip and oral cavity carcinoma Diseases 0.000 claims description 3
- 201000006958 oropharynx cancer Diseases 0.000 claims description 3
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 3
- 206010046885 vaginal cancer Diseases 0.000 claims description 3
- 208000013139 vaginal neoplasm Diseases 0.000 claims description 3
- 230000002265 prevention Effects 0.000 claims description 2
- 208000022361 Human papillomavirus infectious disease Diseases 0.000 description 81
- 230000008696 hypoxemic pulmonary vasoconstriction Effects 0.000 description 77
- 108010077245 asparaginyl-proline Proteins 0.000 description 55
- 108010047495 alanylglycine Proteins 0.000 description 54
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 54
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 52
- 210000004027 cell Anatomy 0.000 description 49
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 48
- 108010068265 aspartyltyrosine Proteins 0.000 description 44
- 238000003780 insertion Methods 0.000 description 37
- 230000037431 insertion Effects 0.000 description 37
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 31
- 210000004899 c-terminal region Anatomy 0.000 description 31
- 238000013461 design Methods 0.000 description 31
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 29
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 29
- 230000014509 gene expression Effects 0.000 description 29
- 108010037850 glycylvaline Proteins 0.000 description 29
- 238000003786 synthesis reaction Methods 0.000 description 29
- 230000015572 biosynthetic process Effects 0.000 description 28
- 108010092114 histidylphenylalanine Proteins 0.000 description 28
- 108010057821 leucylproline Proteins 0.000 description 28
- 108010064235 lysylglycine Proteins 0.000 description 28
- 108010026333 seryl-proline Proteins 0.000 description 28
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 27
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 27
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 27
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 27
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 27
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 27
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 27
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 27
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 27
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 27
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 27
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 27
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 27
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 27
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 27
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 27
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 27
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 27
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 27
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 27
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 27
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 27
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 27
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 27
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 27
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 27
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 27
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 27
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 27
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 27
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 27
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 27
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 27
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 27
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 27
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 27
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 27
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 27
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 27
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 27
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 27
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 27
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 27
- 108010053037 kyotorphin Proteins 0.000 description 27
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 27
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 27
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 27
- 230000003472 neutralizing effect Effects 0.000 description 27
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 27
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 26
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 25
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 25
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 25
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 25
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 25
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 25
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 25
- 108010060199 cysteinylproline Proteins 0.000 description 25
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 23
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 23
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 23
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 23
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 23
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 23
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 23
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 23
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 23
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 23
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 23
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 23
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 23
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 23
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 23
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 23
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 23
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 23
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 23
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 23
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 23
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 23
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 23
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 23
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 23
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 23
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 23
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 23
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 23
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 23
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 23
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 23
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 23
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 23
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 23
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 23
- 108010078144 glutaminyl-glycine Proteins 0.000 description 23
- 108010012058 leucyltyrosine Proteins 0.000 description 23
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 22
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 22
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 22
- 108010054813 diprotin B Proteins 0.000 description 22
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 22
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 21
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 21
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 21
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 21
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 21
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 21
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 21
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 21
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 21
- ZGKXAUIVGIBISK-SZMVWBNQSA-N Glu-His-Trp Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O ZGKXAUIVGIBISK-SZMVWBNQSA-N 0.000 description 21
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 21
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 21
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 21
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 21
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 21
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 21
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 21
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 21
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 21
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 21
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 21
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 21
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 21
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 21
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 21
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 21
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 21
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 21
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 21
- 108010044940 alanylglutamine Proteins 0.000 description 21
- 108010018006 histidylserine Proteins 0.000 description 21
- 108010003700 lysyl aspartic acid Proteins 0.000 description 21
- 108010050848 glycylleucine Proteins 0.000 description 20
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 19
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 19
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 19
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 19
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 19
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 19
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 19
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 19
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 19
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 19
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 19
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 19
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 19
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 19
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 19
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 19
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 19
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 19
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 19
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 19
- 108010062796 arginyllysine Proteins 0.000 description 19
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 17
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 17
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 17
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 17
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 17
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 17
- 238000000034 method Methods 0.000 description 16
- 108700042752 tyrosyl-prolyl-leucyl-glycine Proteins 0.000 description 16
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 15
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 15
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 15
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 15
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 15
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 15
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 15
- 108010079317 prolyl-tyrosine Proteins 0.000 description 15
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 14
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 14
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 14
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 14
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 13
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 13
- 108010087924 alanylproline Proteins 0.000 description 13
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 13
- 238000006467 substitution reaction Methods 0.000 description 13
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 12
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 12
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 11
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 11
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 11
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 11
- 210000002966 serum Anatomy 0.000 description 11
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 10
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 9
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 9
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 9
- 241001112090 Pseudovirus Species 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 108010020532 tyrosyl-proline Proteins 0.000 description 9
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 8
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 8
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 8
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 230000005847 immunogenicity Effects 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 7
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 7
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 7
- 230000003053 immunization Effects 0.000 description 7
- 238000002649 immunization Methods 0.000 description 7
- 208000015181 infectious disease Diseases 0.000 description 7
- 238000006386 neutralization reaction Methods 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 231100000590 oncogenic Toxicity 0.000 description 7
- 230000002246 oncogenic effect Effects 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N 1-(2-azaniumylacetyl)pyrrolidine-2-carboxylate Chemical compound NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- WPXFILQZNKUYQO-BZSNNMDCSA-N 2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 WPXFILQZNKUYQO-BZSNNMDCSA-N 0.000 description 6
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 6
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 6
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 6
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 6
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 6
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 6
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 6
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 6
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 6
- 241000341655 Human papillomavirus type 16 Species 0.000 description 6
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 6
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 6
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 6
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 6
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 6
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 6
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 6
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 6
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 6
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 6
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 6
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 6
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 6
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 6
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 6
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 6
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 6
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 6
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 5
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 241000701447 unidentified baculovirus Species 0.000 description 5
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 4
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 4
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 4
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 4
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 4
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 4
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 4
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 4
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 4
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 4
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 4
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 4
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 4
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 4
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 4
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 4
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 4
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 4
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 4
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 4
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 4
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 4
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 4
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 4
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 4
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 4
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 4
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 4
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 4
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 4
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 4
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 4
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- 102000007079 Peptide Fragments Human genes 0.000 description 4
- 108010033276 Peptide Fragments Proteins 0.000 description 4
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 4
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 4
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 4
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 4
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 4
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 4
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 4
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 4
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 4
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 4
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 4
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 4
- DBMMKEHYWIZTPN-JYJNAYRXSA-N Val-Cys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N DBMMKEHYWIZTPN-JYJNAYRXSA-N 0.000 description 4
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 4
- 238000009825 accumulation Methods 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010029384 tryptophyl-histidine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 3
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 3
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 3
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 3
- 101150075239 L1 gene Proteins 0.000 description 3
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000002296 dynamic light scattering Methods 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 3
- 230000001681 protective effect Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- CMBDUPIBCOEWNE-BJDJZHNGSA-N Asp-Leu-Asp-Gln Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CMBDUPIBCOEWNE-BJDJZHNGSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 2
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 2
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 2
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 102100036407 Thioredoxin Human genes 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 2
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 238000005571 anion exchange chromatography Methods 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 238000005277 cation exchange chromatography Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 229910052802 copper Inorganic materials 0.000 description 2
- 239000010949 copper Substances 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 239000002552 dosage form Substances 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 230000001024 immunotherapeutic effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000003921 particle size analysis Methods 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 2
- 229920000053 polysorbate 80 Polymers 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 2
- 229940023143 protein vaccine Drugs 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000036555 skin type Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 108060008226 thioredoxin Proteins 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000004627 transmission electron microscopy Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- LLQHSBBZNDXTIV-UHFFFAOYSA-N 6-[5-[[4-[2-(2,3-dihydro-1H-inden-2-ylamino)pyrimidin-5-yl]piperazin-1-yl]methyl]-4,5-dihydro-1,2-oxazol-3-yl]-3H-1,3-benzoxazol-2-one Chemical compound C1C(CC2=CC=CC=C12)NC1=NC=C(C=N1)N1CCN(CC1)CC1CC(=NO1)C1=CC2=C(NC(O2)=O)C=C1 LLQHSBBZNDXTIV-UHFFFAOYSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- 206010059313 Anogenital warts Diseases 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- 238000011725 BALB/c mouse Methods 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 206010004146 Basal cell carcinoma Diseases 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 108010071134 CRM197 (non-toxic variant of diphtheria toxin) Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710169873 Capsid protein G8P Proteins 0.000 description 1
- 229940124957 Cervarix Drugs 0.000 description 1
- 108010049048 Cholera Toxin Proteins 0.000 description 1
- 102000009016 Cholera Toxin Human genes 0.000 description 1
- 244000050510 Cunninghamia lanceolata Species 0.000 description 1
- 208000037845 Cutaneous squamous cell carcinoma Diseases 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 102000016736 Cyclin Human genes 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 229940124897 Gardasil Drugs 0.000 description 1
- 102400001301 Gasdermin-B, C-terminal Human genes 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 101710156564 Major tail protein Gp23 Proteins 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 101710157639 Minor capsid protein Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- UEEJHVSXFDXPFK-UHFFFAOYSA-N N-dimethylaminoethanol Chemical compound CN(C)CCO UEEJHVSXFDXPFK-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710136297 Protein VP2 Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- 206010044002 Tonsil cancer Diseases 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- 206010047741 Vulval cancer Diseases 0.000 description 1
- 208000004354 Vulvar Neoplasms Diseases 0.000 description 1
- 208000000260 Warts Diseases 0.000 description 1
- ABUBSBSOTTXVPV-UHFFFAOYSA-H [U+6].CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O Chemical compound [U+6].CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O ABUBSBSOTTXVPV-UHFFFAOYSA-H 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 125000000129 anionic group Chemical group 0.000 description 1
- 239000003945 anionic surfactant Substances 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000003093 cationic surfactant Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229940102767 gardasil 9 Drugs 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000000703 high-speed centrifugation Methods 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 230000005965 immune activity Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 238000011587 new zealand white rabbit Methods 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 229960002566 papillomavirus vaccine Drugs 0.000 description 1
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000003118 sandwich ELISA Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 201000010153 skin papilloma Diseases 0.000 description 1
- 201000010106 skin squamous cell carcinoma Diseases 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940031351 tetravalent vaccine Drugs 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 210000000689 upper leg Anatomy 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
- C12N15/866—Baculoviral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0601—Invertebrate cells or tissues, e.g. insect cells; Culture media therefor
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5258—Virus-like particles
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/735—Fusion polypeptide containing domain for protein-protein interaction containing a domain for self-assembly, e.g. a viral coat protein (includes phage display)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2510/00—Genetically modified cells
- C12N2510/02—Cells for production
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/14011—Baculoviridae
- C12N2710/14111—Nucleopolyhedrovirus, e.g. autographa californica nucleopolyhedrovirus
- C12N2710/14141—Use of virus, viral particle or viral elements as a vector
- C12N2710/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20023—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20051—Methods of production or purification of viral material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20051—Methods of production or purification of viral material
- C12N2710/20052—Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/103—Plasmid DNA for invertebrates
- C12N2800/105—Plasmid DNA for invertebrates for insects
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Abstract
The invention relates to a human papillomavirus type 18 chimeric protein and applications thereof. Specifically, the invention relates to a papillomavirus chimeric protein that comprises, or consists of, an HPV type 18 L1 protein or a mutant of the HPV type 18 L1 protein and a polypeptide from the HPV type 59 L2 protein inserted into a surface region of the HPV type 18 L1 protein or the mutant thereof, wherein the amino acid sequence of the HPV type 18 L1 protein is shown in SEQ ID NO. 1 and the amino acid sequence of the HPV type 59 L2 protein is shown in SEQ ID NO. 2.
Description
Technical Field
The invention relates to the field of biotechnology, in particular to a human papillomavirus chimeric protein, a pentamer or virus-like particle formed from the chimeric protein, and the use of the human papillomavirus chimeric pentamer or chimeric virus-like particle in preparing vaccines for preventing papillomavirus infection and diseases induced by the infection.
Background
Human papillomaviruses (HPV) are a class of small, non-enveloped DNA viruses that infect epithelial tissues. Based on amino acid sequence homology of the major capsid protein L1, more than 200 types have been identified, which are classified into the alpha, beta, gamma, mu, and eta genera. According to the site of infection, HPV is divided into mucosal types and cutaneous types. Mucosal HPV types mainly infect the genitourinary, perianal, and oropharyngeal mucosa and skin, all belong to the alpha genus, and are further divided into oncogenic HPV, which has transforming activity, and low-risk HPV (LR-HPV), which induces benign hyperplasia. Oncogenic HPV comprises 12 common high-risk types (including HPV types 16, -18, -31, -33, -35, -39, -45, -51, -52, -56, -58, -59, etc.), 1 possible high-risk type (HPV 68), and 10 rarer suspected high-risk types (HPV types 26, -30, -34, -53, -66, -67, -69, -70, -73, -82, -85, etc.). All oncogenic HPV-positive cancer tissues have been found to exhibit specific E6*I mRNA expression, reduced expression of the tumor suppressor genes Rb/p53 and of cyclin CD1, and increased expression of p16INK4a, indicating that infection with any oncogenic HPV type carries the same cancer risk. There are about 12 low-risk HPV types (HPV types 6, -7, -11, -13, -32, -40, -42, -43, -44, -54, -74, -91, etc.), of which HPV types 6 and 11 together induce 90% of perianal and genital condyloma acuminatum and most recurrent respiratory papillomas. Cutaneous HPV types mainly infect skin tissues outside the above sites; some (HPV 2, -27, -57) induce cutaneous warts, and others (HPV 5, -8, -38, etc.) are associated with the development of cutaneous squamous cell carcinoma and basal cell carcinoma.
The malignant tumors currently identified as associated with oncogenic HPV infection are cervical cancer, vaginal cancer, labial cancer, penile cancer, perianal cancer, oropharyngeal cancer, tonsillar cancer, and oral cancer, among which cervical cancer poses the greatest risk. Cervical cancer is the third most common malignancy in women worldwide, with an annual incidence of about 527,000 cases, of which 285,000 occur in Asia; the annual incidence in China is 75,000. The 12 common high-risk HPV types together induce 95.2%-96.5% of cervical cancers, and the remaining 10 rarer possible and suspected high-risk HPV types together induce about 3.29% of cervical cancers. HPV type 16 is a globally prevalent high-risk type with the highest detection rate in HPV-associated tumors, such as cervical cancer, perianal cancer, penile cancer, and vulvar cancer, and in precancerous lesions. The detection rates of HPV16 and HPV18 in cervical cancer can reach 50-60% and 20%, respectively.
HPV L1 virus-like particles (L1 VLPs) mainly induce type-specific neutralizing antibodies and protective responses, so the scope of vaccine protection can only be expanded by increasing the number of L1 VLP types. The three HPV vaccines on the market are all L1 VLP vaccines: the bivalent vaccine from GSK (Cervarix, HPV 16/-18), and the quadrivalent vaccine (Gardasil, HPV 6/-11/-16/-18) and nine-valent vaccine (Gardasil-9, HPV 6/-11/-16/-18/-31/-33/-45/-52/-58) from Merck. Even the nine-valent vaccine, which has the widest protection range, covers only 7 high-risk types and 2 low-risk types (HPV 6/-11) and provides no protection against cutaneous types. Moreover, the number of L1 VLP types in a vaccine cannot be increased without limit, so L1 VLP vaccines can hardly meet the needs of preventing HPV infection-related diseases.
The minor capsid protein L2 of HPV is not immunologically active in its natural state, but N-terminal polypeptides of L2 can induce cross-neutralizing antibodies and cross-protective responses. However, their immunogenicity is weak, the titers of the induced antibodies are low, and antisera against L2 of a single type cross-neutralize only a limited number of types. A variety of conserved epitope peptides capable of inducing neutralizing antibodies have so far been found only in 16L2N, in which aa.17-38 is the main neutralizing epitope region. The monoclonal antibody RG-1, which recognizes this region, cross-neutralizes the largest number of types, so the region is also called the RG-1 epitope peptide. aa.21-31 is the core sequence of the neutralizing epitope, and studies of RG-1 epitope peptides retain the region homologous to aa.21-31 regardless of the sequence length used.
The RG-1 types reported in vaccine studies include HPV type 4 RG-1, HPV type 6 RG-1, HPV type 16 RG-1, HPV type 17 RG-1, HPV type 31 RG-1, HPV type 33 RG-1, HPV type 45 RG-1, HPV type 51 RG-1, HPV type 58 RG-1, etc. [C. Schellenbacher et al., The Journal of Investigative Dermatology 2013, 133(12): 2706-13; H. Seitz et al., Vaccine 2014, 32(22): 2610-2617; B. Huber et al., PLoS One 2015, 10(3): e0120152; B. Huber et al., PLoS One 2017, 12(1): e0169533; X. Chen et al., Oncotarget 2017, 8(38): 63333-63344; X. Chen et al., Human Vaccines & Immunotherapeutics 2018, 14(8): 2025-2033; PCT/CN2017/075402]. The modes used include VLP surface display, bacterial protein surface display (bacterial thioredoxin Trx, flagellin, cholera toxin mutant CRM197), igγr-targeted engineered antibodies, and tandem fusion of L2 polypeptides of multiple types containing RG-1 epitopes. However, studies have shown that many RG-1 epitope peptide vaccines perform poorly. For example, the 3 cVLPs displaying HPV type 4 RG-1, HPV type 6 RG-1 and HPV type 17 RG-1 induced low titers of neutralizing antibodies against HPV16, and cross-neutralization titers were undetectable [B. Huber et al., PLoS One 2017, 12(1): e0169533; X. Chen et al., Oncotarget 2017, 8(38): 63333-63344]; the 18 cVLP displaying HPV45 type RG-1 induced a very low titer of neutralizing antibodies against HPV18 (only 1/100 of that of the type 18 L1 VLP) and cross-neutralized only oncogenic HPV45, 70 and 39, at very low titers of at most 100 [B. Huber et al., PLoS One 2015, 10(3): e0120152]; and antiserum against the Trx fusion protein surface-displaying type 51 RG-1 had a narrow cross-neutralization range, with a highest cross-neutralizing antibody titer of only 500 [H. Seitz et al., Vaccine 2014, 32(22): 2610-2617].
In contrast, the type 16 RG1-cVLP reported by Schellenbacher et al. and the type 31 RG1-cVLP, type 33 RG1-cVLP and type 58 RG1-cVLP previously reported by the inventors have better immunological activity: the titer of neutralizing antibodies against the backbone type (HPV16) is as high as 10^5 (comparable to that induced by the type 16 L1 VLP), and the L2-dependent cross-neutralizing antibodies induced by the corresponding RG-1 epitopes have a broad neutralization range and relatively high titers (up to 6400) [C. Schellenbacher et al., The Journal of Investigative Dermatology 2013, 133(12): 2706-13; X. Chen et al., Oncotarget 2017, 8(38): 63333-63344; X. Chen et al., Human Vaccines & Immunotherapeutics 2018, 14(8): 2025-2033; PCT/CN2017/075402].
The above data suggest that RG-1 epitope peptides derived from different HPV types differ greatly in immunogenicity. In earlier work, the inventors compared the immunogenicity of type 58 RG-1 and type 6 RG-1 and found that type 58 RG-1 epitope peptide antiserum cross-neutralized more types (13) at higher titers (up to 3200), whereas type 6 RG-1 epitope peptide antiserum neutralized fewer types (9) at very low titers (at most 100) [X. Chen et al., Oncotarget 2017, 8(38): 63333-63344]. This shows that, although the RG-1 epitope peptide region is strongly conserved among types, RG-1 peptides of different types differ in immunogenicity. Therefore, when any one L2 aa.17-36 homologous polypeptide is selected to construct a chimeric protein vaccine, its immunological activity is unpredictable.
On the other hand, the studies of HPV16 cVLP vaccines reported by Schellenbacher and Wang showed that, even when the type 16 RG-1 epitope peptide is inserted into the surface region of the same type 16 L1 VLP vector, the immunological activity of the resulting type 16 RG1-cVLPs differs significantly depending on the flanking sequences of the type 16 RG-1 core epitope peptide, the insertion site and the insertion mode. The best was the cVLP with type 16 RG-1 inserted into the DE loop region of type 16 L1, and the worst was the cVLP with the type 16 RG-1 core sequence inserted into the h4 region of type 16 L1. In addition, Chen and Box each reported type 33 RG-1 cVLPs built on different vectors, namely HPV16 L1 VLPs and HPV18 L1 VLPs, respectively. Although both reports selected the DE loop as the insertion site, the insertion positions differed by 1 amino acid and the epitope peptide lengths differed by 2 amino acids, and the type 33 RG-1-dependent cross-neutralizing antibodies induced by the two type 33 RG1-cVLPs differed markedly in activity: 33RG1-16 cVLP antiserum cross-neutralized at least 12 types (2 of them with titers > 1000), whereas 33RG1-18 cVLP antiserum cross-neutralized only 7 types, and for 6 of them the neutralization titers (4 with titers < 100) were far lower than those of the 33RG1-16 cVLP antiserum.
Thus, there is a need to develop a vaccine based on HPV L1 and HPV L2 chimeric proteins that is capable of producing high titer neutralizing antibodies against more HPV types of viruses.
Disclosure of Invention
To solve the above technical problems, the inventors selected several type 59 RG-1 epitope peptides of different lengths for the study of HPV cVLPs. The results show that the HPV18 type cVLPs obtained by the invention are strongly immunogenic, and that the serum neutralizing antibodies they induce can neutralize multiple HPV types of the α7 subgenus at high titers.
Through comparative analysis of the neutralizing activity of immune sera against 8 different types of RG-1 epitope in Example 1, the inventors unexpectedly found that immune serum against the type 59 RG-1 epitope peptide cross-neutralizes at least 17 types. In particular, its neutralization titers against HPV45, 59 and 16 are 10^3 or above, its α7-neutralizing activity is the best, and its cross-neutralizing activity against HPV16 is comparable to that of the more strongly immunogenic 16RG-1.
Accordingly, the present invention is directed to a human papillomavirus chimeric protein for preparing a vaccine for preventing papillomavirus infection and infection-induced diseases.
The present invention is based on the unexpected findings of the inventors: insertion of an HPV type 59 L2 protein polypeptide into the surface region of a full-length or truncated HPV type 18 L1 protein increases the immunogenicity of the HPV type 59 L2 protein polypeptide, and the resulting chimeric protein can be expressed at high levels in E. coli or insect cell expression systems, can be assembled into VLPs, and can elicit broad-spectrum protective immune responses against multiple HPV types from different genera/subgenera. Relevant experimental results are provided in the examples herein.
In view of the above objects, the present invention provides, in one aspect, a human papillomavirus chimeric protein having a backbone of HPV type 18 L1 protein or a mutant of HPV type 18 L1 protein, said backbone carrying at least one chimerically inserted polypeptide derived from HPV type 59 L2 protein.
That is, in a first aspect of the present invention, there is provided a human papillomavirus chimeric protein comprising or consisting of an HPV18 type L1 protein or a mutant of HPV18 type L1 protein and a polypeptide from HPV59 type L2 protein inserted into the surface region of said HPV18 type L1 protein or mutant of HPV18 type L1 protein, wherein the amino acid sequence of said HPV18 type L1 protein is as shown in SEQ ID NO.1 and the amino acid sequence of said HPV59 type L2 protein is as shown in SEQ ID NO. 2.
In a preferred embodiment of the human papillomavirus chimeric protein according to the invention, said mutant of the HPV18 type L1 protein is the protein obtained by truncating 0-8 amino acids at the N-terminus and/or 0-32 amino acids at the C-terminus of said HPV18 type L1 protein.
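The truncation described above can be sketched as a simple slicing operation. This is only a minimal illustration: the function name `truncate_l1` and the placeholder sequence `toy_l1` are hypothetical, and the placeholder is not SEQ ID No. 1.

```python
def truncate_l1(seq: str, n_trunc: int = 0, c_trunc: int = 0) -> str:
    """Remove n_trunc residues from the N-terminus and c_trunc residues
    from the C-terminus, within the limits stated in the text (0-8, 0-32)."""
    if not 0 <= n_trunc <= 8:
        raise ValueError("N-terminal truncation must be 0-8 residues")
    if not 0 <= c_trunc <= 32:
        raise ValueError("C-terminal truncation must be 0-32 residues")
    if n_trunc + c_trunc >= len(seq):
        raise ValueError("truncation would remove the whole sequence")
    return seq[n_trunc:len(seq) - c_trunc]


# Hypothetical placeholder backbone (NOT SEQ ID No. 1), used only to show lengths.
toy_l1 = "M" + "A" * 99

# C-terminal truncation by 32 residues, as in the preferred mutant.
delta_c = truncate_l1(toy_l1, c_trunc=32)
print(len(toy_l1), len(delta_c))
```

With a real backbone, the same call with `c_trunc=32` would give the C-terminally truncated mutant used for the insect cell constructs.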
In a preferred embodiment of the human papillomavirus chimeric protein according to the invention, the mutant of HPV type 18 L1 protein is selected from:
a mutant with 32 truncated amino acids at the C end of the amino acid sequence shown in SEQ ID No. 1;
a mutant (mut 1) in which amino acids 477, 478, 484, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G) and amino acids 485, 500, 502 are substituted with serine (S);
a mutant (mut 2) in which amino acids 477, 478, 485, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G) and amino acids 486, 500, 502 are substituted with serine (S);
a mutant (mut 3) in which amino acids 477, 478, 484, 496, 499, 502, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G), amino acids 485, 500 are substituted with serine (S), and amino acid 504 is substituted with aspartic acid (D);
a mutant (mut 4) in which amino acids 477, 478, 485, 496, 502, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G), amino acids 486, 500 are substituted with serine (S), and amino acids 499, 504 are substituted with aspartic acid (D);
a mutant (mut 5) in which amino acids 477, 484, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G) and amino acids 485, 500, 502 are substituted with serine (S); and
a mutant (mut 6) in which amino acids 477, 485, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are substituted with glycine (G) and amino acids 486, 500, 502 are substituted with serine (S).
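The point-substitution mutants listed above can be written as maps from 1-based positions to replacement residues. The sketch below encodes mut 1 as given in the text. The helper `apply_substitutions` and the placeholder backbone `toy_l1` are hypothetical illustrations, and the placeholder is not SEQ ID No. 1.

```python
# mut 1 from the list above: G at 477, 478, 484, 496, 499, 504, 506; S at 485, 500, 502.
MUT1 = {
    477: "G", 478: "G", 484: "G", 496: "G", 499: "G", 504: "G", 506: "G",
    485: "S", 500: "S", 502: "S",
}


def apply_substitutions(seq: str, subs: dict) -> str:
    """Apply point substitutions given as {1-based position: new residue}."""
    residues = list(seq)
    for pos, new_aa in subs.items():
        if not 1 <= pos <= len(residues):
            raise ValueError(f"position {pos} is outside the sequence")
        residues[pos - 1] = new_aa  # convert 1-based position to 0-based index
    return "".join(residues)


# Hypothetical placeholder backbone (NOT SEQ ID No. 1), long enough for position 506.
toy_l1 = "K" * 507

mut1 = apply_substitutions(toy_l1, MUT1)
changed = [i + 1 for i, (a, b) in enumerate(zip(toy_l1, mut1)) if a != b]
print(changed)
```

The other mutants (mut 2 to mut 6) would be encoded the same way, with aspartic acid (D) entries added where the text specifies them.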
In a preferred embodiment of the human papillomavirus chimeric protein according to the invention, said polypeptide from the HPV type 59 L2 protein is selected from any consecutive 8-33 amino acid fragment within the region of amino acids 1-50 shown in SEQ ID No. 2; preferably, the polypeptide from HPV59 type L2 protein is the HPV59 type L2 protein RG-1 epitope peptide or a mutant epitope peptide thereof; further preferably, the polypeptide from HPV type 59 L2 protein consists of an amino acid sequence selected from the group consisting of: amino acids 17 to 32 shown in SEQ ID No.2, amino acids 16 to 35 shown in SEQ ID No.2, amino acids 17 to 37 shown in SEQ ID No.2, amino acids 16 to 37 shown in SEQ ID No.2, and sequences obtained by extending or truncating the above amino acid sequences by 1 to 7 amino acids at the N- and/or C-terminus.
Most preferably, the amino acid sequence of the polypeptide from HPV59 type L2 protein is shown as SEQ ID No.3, SEQ ID No.4, SEQ ID No.5 or SEQ ID No. 6.
Alternatively, the polypeptide from HPV59 type L2 protein is a polypeptide obtained by extension or truncation of 1-7 amino acids at the N-terminus and/or 1-7 amino acids at the C-terminus of the amino acid sequence shown in SEQ ID No. 3.
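The fragment definitions above use 1-based, inclusive amino acid coordinates. As a rough sketch, the helpers below extract a fragment by such coordinates and enumerate every window of 8-33 consecutive residues inside the aa.1-50 region. The names `fragment` and `candidate_windows` are hypothetical and not taken from the patent.

```python
def fragment(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq (1-based, inclusive)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


def candidate_windows(region_end: int = 50, min_len: int = 8, max_len: int = 33):
    """List (start, end) pairs for all consecutive fragments of min_len..max_len
    residues lying entirely within residues 1..region_end."""
    return [
        (start, start + length - 1)
        for length in range(min_len, max_len + 1)
        for start in range(1, region_end - length + 2)
    ]


windows = candidate_windows()

# The four coordinate ranges named in the text, with their lengths in residues.
named_ranges = [(17, 32), (16, 35), (17, 37), (16, 37)]
for start, end in named_ranges:
    print(f"aa.{start}-{end}: {end - start + 1} residues, in window set: {(start, end) in windows}")
print("total candidate windows:", len(windows))
```

Applying `fragment` to the actual SEQ ID No. 2 sequence with the ranges in `named_ranges` would give the preferred peptides.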
Alternatively, the polypeptide from HPV type 59 L2 protein may also be a polypeptide having greater than 60%, preferably greater than 70%, more preferably greater than 80%, still more preferably greater than 90%, and even more preferably greater than 95% sequence identity to the amino acid sequence set forth in SEQ ID No.3, SEQ ID No.4, SEQ ID No.5 or SEQ ID No. 6.
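The text does not say how sequence identity is calculated. The sketch below assumes the simplest case, an ungapped position-by-position comparison of two equal-length peptides; a real comparison would normally use a pairwise alignment tool. The input peptides are the HPV35 and HPV82 RG-1 peptides from Table 1 of Example 1, used only as illustrative data (they are not SEQ ID No. 3 to 6), and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("this simple sketch only compares equal-length sequences")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# RG-1 epitope peptides from Table 1 (HPV35 and HPV82), illustrative input only.
hpv35 = "TQLYRTCKAAGTCPPDVIPKVEG"
hpv82 = "TQLYSTCKAAGTCPPDVIPKVKG"

pid = percent_identity(hpv35, hpv82)
print(f"HPV35 vs HPV82 RG-1 peptide identity: {pid:.1f}%")
print("above the 90% threshold:", pid > 90)
```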
In a preferred embodiment of the human papillomavirus chimeric protein according to the invention, the HPV type 18 L1 protein may be derived from, for example, but not limited to, the L1 proteins of HPV18 variants in the NCBI database, such as ATL15214.1, ATL14646.1, ARS43458.1, ARS43428.1, ARS43449.1, AGU90430.1, etc. Preferably, the amino acid sequence of the HPV18 type L1 protein is shown as SEQ ID No. 1.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, the polypeptide from HPV type 59 L2 protein is inserted into the surface region of HPV type 18 L1 protein or a mutant of HPV type 18 L1 protein; preferably, it is inserted into the DE loop or h4 region of said HPV18 type L1 protein or mutant of HPV18 type L1 protein; more preferably, said polypeptide from HPV59 type L2 protein is inserted between amino acids 134 and 135, between amino acids 137 and 138, between amino acids 432 and 433, or between amino acids 434 and 435 of said HPV18 type L1 protein or a mutant of said HPV18 type L1 protein by direct insertion, or is inserted into the region of amino acids 121 to 124, amino acids 131 to 138, amino acids 431 to 433, or amino acids 432 to 435 of said HPV18 type L1 protein or a mutant of said HPV18 type L1 protein by non-isometric substitution.
As used herein, the term "direct insertion" refers to insertion of a selected peptide fragment between two adjacent amino acids. For example, direct insertion between amino acids 134 and 135 of SEQ ID NO.1 refers to the insertion of a selected peptide fragment directly between amino acids 134 and 135 of SEQ ID NO. 1.
As used herein, the term "non-isometric substitution" refers to the insertion of a selected peptide fragment into a specified amino acid interval after deletion of the sequence of the specified amino acid interval. For example, a non-isometric substitution in the region of amino acids 121 to 124 of SEQ ID NO.1 refers to the insertion of a selected peptide fragment between amino acids 121 and 124 of SEQ ID NO.1 after deletion of amino acids 122-123 of SEQ ID NO. 1. Optionally, in the direct insertion or non-isometric substitution mode, the polypeptide from HPV type 59 L2 protein comprises a linker 1 to 3 amino acid residues long at its N- and/or C-terminus.
Optionally, the linker is composed of any combination of amino acids selected from glycine (G), serine (S), alanine (A) and proline (P). Preferably, the N-terminal linker is a G (glycine) P (proline) linker and the C-terminal linker is a P (proline) linker.
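The two insertion modes and the optional linkers defined above can be illustrated with string operations on 1-based residue positions. This is a minimal sketch: the function names and the toy backbone and insert are hypothetical, and neither toy string is a sequence from the sequence listing.

```python
def direct_insertion(backbone: str, insert: str, after_pos: int,
                     n_linker: str = "", c_linker: str = "") -> str:
    """Insert the peptide between residues after_pos and after_pos + 1 (1-based)."""
    if not 1 <= after_pos < len(backbone):
        raise ValueError("insertion site must lie between two backbone residues")
    return backbone[:after_pos] + n_linker + insert + c_linker + backbone[after_pos:]


def non_isometric_substitution(backbone: str, insert: str, left_pos: int, right_pos: int,
                               n_linker: str = "", c_linker: str = "") -> str:
    """Delete the residues strictly between left_pos and right_pos (1-based),
    then place the peptide between left_pos and right_pos."""
    if not 1 <= left_pos < right_pos <= len(backbone):
        raise ValueError("invalid interval")
    return backbone[:left_pos] + n_linker + insert + c_linker + backbone[right_pos - 1:]


# Toy data for illustration only (lower case marks the inserted peptide).
backbone = "ABCDEFGHIJ"
peptide = "xyz"

print(direct_insertion(backbone, peptide, 4))               # between residues 4 and 5
print(direct_insertion(backbone, peptide, 4, "GP", "P"))    # with GP and P linkers
print(non_isometric_substitution(backbone, peptide, 2, 5))  # residues 3-4 deleted
```

Direct insertion lengthens the backbone by exactly the insert (plus linkers), whereas non-isometric substitution changes the length by the insert length minus the number of deleted residues, which is why the replacement is "non-isometric".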
In a preferred embodiment of the papillomavirus chimeric protein of the present invention, in the direct insertion mode, the amino acid sequence of the polypeptide derived from the HPV59 type L2 protein is SEQ ID No.4 or SEQ ID No.5, and the insertion site is between amino acid 137 and amino acid 138 of the HPV18 type L1 protein or a mutant of the HPV18 type L1 protein C-terminally truncated by 32 amino acids, and the obtained papillomavirus chimeric protein amino acid sequence is shown as SEQ ID No.7, SEQ ID No.8, SEQ ID No.9 or SEQ ID No. 10.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in the direct insertion mode, the amino acid sequence of the polypeptide from HPV59 type L2 protein is SEQ ID No.3, the insertion site is between amino acids 432 and 433 or between amino acids 434 and 435 of the HPV18 type L1 protein or mutants of the HPV18 type L1 protein truncated by 32 amino acids at the C-terminus, and the obtained papillomavirus chimeric protein amino acid sequence is shown as SEQ ID No.11, SEQ ID No.12, SEQ ID No.13 or SEQ ID No. 14.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in the direct insertion mode, the amino acid sequence of the polypeptide from the HPV59 type L2 protein is the sequence shown in SEQ ID No.4 or SEQ ID No.6 containing a GP linker at the N-terminal and a P linker at the C-terminal, and the insertion site is between amino acid 134 and amino acid 135 of the HPV18 type L1 protein or a mutant of the HPV18 type L1 protein truncated by 32 amino acids at the C-terminal, and the obtained papillomavirus chimeric protein amino acid sequence is shown in SEQ ID No.15, SEQ ID No.16, SEQ ID No.17 or SEQ ID No. 18.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in the non-isometric substitution mode, after deleting the amino acid 132-137 region of the HPV18 type L1 protein or the mutant of HPV18 type L1 protein with 32 truncated amino acids at the C-terminus, a polypeptide from HPV59 type L2 protein is inserted between amino acids 131 and 138 of the HPV18 type L1 protein or the mutant of HPV18 type L1 protein with 32 truncated amino acids at the C-terminus, the polypeptide from HPV59 type L2 protein has an added glycine-proline linker at the N-terminus, the amino acid sequence of the polypeptide from HPV59 type L2 protein is shown as SEQ ID No.3, and the amino acid sequence of the obtained papillomavirus chimeric protein is shown as SEQ ID No.19 or SEQ ID No. 20.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in said non-isometric substitution mode, after deletion of the amino acid 122-123 region of said HPV18 type L1 protein or of the mutant of said HPV18 type L1 protein truncated by 32 amino acids at the C-terminus, a polypeptide from HPV59 type L2 protein is inserted between amino acids 121 and 124 of said HPV18 type L1 protein or of the mutant of said HPV18 type L1 protein truncated by 32 amino acids at the C-terminus, in said direct insertion mode, the amino acid sequence of said polypeptide from HPV59 type L2 protein is shown in SEQ ID No.3, and the amino acid sequence of the obtained papillomavirus chimeric protein is shown in SEQ ID No.21 or SEQ ID No. 22.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in said non-isometric substitution mode, after deleting amino acid 432 of said HPV18 type L1 protein or of said mutant of HPV18 type L1 protein C-terminally truncated by 32 amino acids, a polypeptide from HPV59 type L2 protein is inserted between amino acids 431 and 433 of said HPV18 type L1 protein or of said mutant of HPV18 type L1 protein C-terminally truncated by 32 amino acids, in said direct insertion mode, the amino acid sequence of said polypeptide from HPV59 type L2 protein is as shown in SEQ ID No.3, and the amino acid sequence of the obtained papillomavirus chimeric protein is as shown in SEQ ID No.23 or SEQ ID No. 24.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, in said non-isometric substitution mode, after deletion of amino acids 433-434 of said HPV18 type L1 protein or of said mutant of HPV18 type L1 protein truncated by 32 amino acids at the C-terminus, a polypeptide from HPV59 type L2 protein is inserted between amino acids 432 and 435 of said HPV18 type L1 protein or of said mutant of HPV18 type L1 protein truncated by 32 amino acids at the C-terminus, in said direct insertion mode the amino acid sequence of said polypeptide from HPV59 type L2 protein is as shown in SEQ ID No.3, and the amino acid sequence of the obtained papillomavirus chimeric protein is as shown in SEQ ID No.25 or SEQ ID No. 26.
In a preferred embodiment of the human papillomavirus chimeric protein of the invention, the polypeptide represented by SEQ ID No.4 is chimeric by direct insertion between amino acids 137 and 138 of said HPV18 type L1 protein mutant, said HPV18 type L1 protein mutant being selected from the group consisting of:
a mutant in which amino acids 477, 478, 484, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G) and amino acids 485, 500, 502 are replaced with serine (S), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.27 (mut 1); or,
a mutant in which amino acids 477, 478, 485, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G) and amino acids 486, 500, 502 are replaced with serine (S), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.28 (mut 2); or,
a mutant in which amino acids 477, 478, 484, 496, 499, 502, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G), amino acids 485, 500 are replaced with serine (S) and amino acid 504 is replaced with aspartic acid (D), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.29 (mut 3); or,
a mutant in which amino acids 477, 478, 485, 496, 502, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G), amino acids 486, 500 are replaced with serine (S) and amino acids 499, 504 are replaced with aspartic acid (D), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.30 (mut 4); or,
a mutant in which amino acids 477, 484, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G) and amino acids 485, 500, 502 are replaced with serine (S), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.31 (mut 5); or,
a mutant in which amino acids 477, 485, 496, 499, 504, 506 of the amino acid sequence shown in SEQ ID No.1 are replaced with glycine (G) and amino acids 486, 500, 502 are replaced with serine (S), and the obtained papillomavirus chimeric protein has an amino acid sequence shown in SEQ ID No.32 (mut 6).
Another aspect of the invention relates to polynucleotides encoding the papillomavirus chimeric proteins described above.
The invention also provides a vector containing the polynucleotide and a cell containing the vector.
The polynucleotide sequences encoding the papillomavirus chimeric protein are suitable for different expression systems. Optionally, these nucleotide sequences are whole-gene optimized with E. coli codons and can be expressed at high levels in an E. coli expression system; or they are whole-gene optimized with insect cell codons and can be expressed at high levels in an insect cell expression system.
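As a rough illustration of codon-based design, the sketch below back-translates a peptide using one fixed codon per amino acid. The codon choices in `PREFERRED_CODON` are illustrative assumptions (each is a valid standard-code codon commonly described as frequent in E. coli). They are not the codon table used by the inventors, and practical whole-gene optimization also weighs factors such as GC content and mRNA structure. The example peptide is the HPV59 RG-1 peptide from Table 1 of Example 1.

```python
# Illustrative one-codon-per-residue table (an assumption, not from the patent).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(protein: str) -> str:
    """Back-translate a protein into DNA, one fixed codon per residue."""
    return "".join(PREFERRED_CODON[aa] for aa in protein)


# HPV59 RG-1 epitope peptide from Table 1 (illustrative input).
peptide = "LYKTCKQAGTCPSDVINKVEGTT"
dna = back_translate(peptide)
print(len(peptide), "residues ->", len(dna), "nucleotides")
print(dna)
```

For an insect cell expression system, the same function would be used with a different, host-specific codon table.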
The present invention also provides a polymer, preferably a papillomavirus chimeric pentamer or chimeric virus-like particle, comprising or formed from the papillomavirus chimeric protein described above.
The present invention also provides the use of the above papillomavirus chimeric proteins, papillomavirus chimeric pentamers or the above papillomavirus chimeric virus-like particles in the preparation of a vaccine for preventing papillomavirus infection and/or diseases induced by said papillomavirus infection, preferably, said papillomavirus infection-induced diseases including, but not limited to, cervical cancer, vaginal cancer, labial cancer, penile cancer, perianal cancer, oropharyngeal cancer, tonsil cancer and oral cancer;
preferably, the papillomavirus infection is one or more infections selected from the group consisting of: HPV16, HPV18, HPV26, HPV31, HPV33, HPV35, HPV39, HPV45, HPV51, HPV52, HPV53, HPV56, HPV58, HPV59, HPV66, HPV68, HPV70, HPV73; HPV6, HPV11, HPV2, HPV5, HPV27 and HPV57.
The present invention also provides a vaccine for preventing papillomavirus infection and infection-induced diseases, comprising the above papillomavirus chimeric pentamer or chimeric virus-like particle, an adjuvant, and a vaccine excipient or carrier; preferably, the vaccine further comprises virus-like particles or chimeric virus-like particles of at least one mucosal-tropic and/or cutaneous-tropic HPV type. The content of each virus-like particle is an amount effective to induce a protective immune response.
Optionally, the adjuvant is a human adjuvant.
Description and explanation of related terms in the invention
According to the present invention, the term "insect cell expression system" includes insect cells, recombinant baculoviruses, recombinant Bacmid and expression vectors. Wherein the insect cells are derived from commercially available cells, exemplified herein but not limited to: Sf9, Sf21, High Five.
According to the present invention, the term "prokaryotic expression system" includes, but is not limited to, E. coli expression systems. Wherein the expression host bacteria are derived from commercially available strains, exemplified herein but not limited to: BL21 (DE3), BL21 (DE3) pLysS, C43 (DE3), Rosetta-gami B (DE3).
According to the present invention, examples of the term "full length HPV type 18 L1 protein" include, but are not limited to, full-length L1 proteins equal in length to the protein numbered ATL15070.1 in the NCBI database.
A gene fragment of a "truncated HPV18 type L1 protein" refers to a deletion of 1 or more amino acid-encoding nucleotides at its 5 'and/or 3' end compared to the wild type HPV18 type L1 protein gene, wherein the full length sequence of the "wild type HPV18 type L1 protein" is such as, but not limited to, the following sequences in the NCBI database: ATL15214.1, ATL14646.1, ARS43458.1, ARS43428.1, ARS43449.1, AGU90430.1, and the like.
According to the present invention, the term "vaccine excipient or carrier" refers to a compound selected from one or more of the group including, but not limited to: a pH adjustor, a surfactant, and an ion strength enhancer. For example, pH modifiers such as but not limited to phosphate buffers, surfactants including cationic, anionic, or nonionic surfactants such as but not limited to polysorbate 80 (Tween-80), and ionic strength enhancers such as but not limited to sodium chloride.
According to the present invention, the term "human adjuvant" refers to adjuvants that are clinically applicable to the human body, including various adjuvants that are currently approved and that may be approved in the future, such as, but not limited to, aluminum adjuvants, MF59, and various forms of adjuvant compositions.
According to the invention, the vaccine of the invention may take a patient acceptable form, including but not limited to oral or injection, preferably injection.
According to the invention, the vaccine of the invention is preferably used in unit dosage forms, wherein the dose of the protein virus-like particles in the unit dosage form is in the range of 5 μg to 100 μg, e.g. 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 μg, and ranges between any two of the above; preferably 30 μg to 60 μg.
Drawings
Fig. 1A-1B: expression identification of chimeric proteins in E.coli and insect cells in example 6 of the invention. The results showed that 26 chimeric proteins could be expressed in E.coli or insect cells, with 6 chimeric proteins expressed in both expression systems.
Fig. 1A: expression identification of chimeric proteins in E. coli: 1 is 18L1DE 137-138/59dES; 2 is 18L1DE 137-138/59dE; 3 is 18L1h4 432-433/59dE; 4 is 18L1h4 434-435/59dE; 5 is 18L1DE 134-135/59dES; 6 is 18L1DE 134-135/59dE; 7 is 18L1DE 131-138/59dE; 8 is 18L1DE 121-124/59dE; 9 is 18L1h4 431-433/59dE; 10 is 18L1h4 432-435/59dE; 11 is 18L1DE 137-138/59dES-mut1; 12 is 18L1DE 137-138/59dES-mut2; 13 is 18L1DE 137-138/59dES-mut3; 14 is 18L1DE 137-138/59dES-mut4; 15 is 18L1DE 137-138/59dES-mut5; 16 is 18L1DE 137-138/59dES-mut6;
Fig. 1B: expression identification of chimeric proteins in insect cells: 1 is 18L1ΔCDE 137-138/59dES; 2 is 18L1ΔCDE 137-138/59dE; 3 is 18L1ΔCh4 432-433/59dE; 4 is 18L1ΔCh4 434-435/59dE; 5 is 18L1ΔCDE 134-135/59dES; 6 is 18L1ΔCDE 134-135/59dE; 7 is 18L1ΔCDE 131-138/59dE; 8 is 18L1ΔCDE 121-124/59dE; 9 is 18L1ΔCh4 431-433/59dE; 10 is 18L1ΔCh4 432-435/59dE; 11 is 18L1DE 137-138/59dES-mut1; 12 is 18L1DE 137-138/59dES-mut2; 13 is 18L1DE 137-138/59dES-mut3; 14 is 18L1DE 137-138/59dES-mut4; 15 is 18L1DE 137-138/59dES-mut5; 16 is 18L1DE 137-138/59dES-mut6.
Fig. 2A-2D: dynamic light scattering analysis results of cVLPs obtained after purification in example 6 of the present invention. The results show that the hydrodynamic diameters of the virus-like particles formed by the 18L1ΔCDE 134-135/59dE, 18L1ΔCDE 134-135/59dES, 18L1ΔCDE 137-138/59dE and 18L1ΔCDE 137-138/59dES recombinant proteins are 106.8 nm, 113.3 nm, 114.7 nm and 122.9 nm, respectively, and the percentage of particle assembly is 100%.
Fig. 2A: 18L1ΔCDE 134-135/59dE; Fig. 2B: 18L1ΔCDE 134-135/59dES; Fig. 2C: 18L1ΔCDE 137-138/59dE; Fig. 2D: 18L1ΔCDE 137-138/59dES.
Fig. 3A-3D: transmission electron microscopy observations of the cVLPs obtained after purification in example 7 of the invention. A large number of virus-like particles are visible in the field of view, and the particles are uniform in size. The cVLPs are about 50 nm in diameter, similar in size to VLPs of the L1 protein. Bar = 50 nm.
Fig. 3A: 18L1ΔCDE 134-135/59dE; Fig. 3B: 18L1ΔCDE 134-135/59dES; Fig. 3C: 18L1ΔCDE 137-138/59dE; Fig. 3D: 18L1ΔCDE 137-138/59dES.
Fig. 4: the neutralizing activity of chimeric VLP mouse immune serum in example 10 of the invention against α7 subgeneric HPV pseudoviruses was tested. * : p <0.05.
Detailed Description
The invention will be further illustrated by the following non-limiting examples. As is well known to those skilled in the art, many modifications can be made to the invention without departing from its spirit, and such modifications also fall within the scope of the invention. The following examples merely illustrate the invention and, since embodiments necessarily vary, should not be construed as limiting its scope. The terminology used in the description is for the purpose of describing particular embodiments only and is not intended to be limiting; the scope of the present invention is defined in the appended claims.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Preferred methods and materials of the invention are described below, but any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the invention. The following experimental methods are all methods described in conventional methods or product specifications unless otherwise specified, and the experimental materials used are readily available from commercial companies unless otherwise specified. All publications mentioned in this specification are herein incorporated by reference to disclose and describe the methods and/or materials in the publications.
Example 1: immunocompetence detection of different types of RG-1 epitope peptides
RG-1 epitope peptides of HPV35, -39, -51, -53, -56, -59, -68 and -82 were prepared by chemical synthesis (Shanghai Jier Biochemical Co., Ltd.); the epitope peptide sequences are shown in Table 1. To increase the immunogenicity of the synthetic peptides, each peptide was coupled to keyhole limpet hemocyanin (KLH) after activation of its carboxyl group with 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (EDC, CAS No. 25952-53-8).
New Zealand white rabbits weighing 2.0-2.5 kg were randomly divided into groups of 2-4 animals. Four days before immunization, 15 mg of inactivated DH5α (treated with PBS containing 0.5% v/v formaldehyde at 37 °C for 24-48 hours), thoroughly mixed with an equal volume of Freund's complete adjuvant, was injected at multiple points on the back for immune stimulation. For the first immunization, 1 mg of KLH-polypeptide thoroughly mixed with an equal volume of Freund's complete adjuvant was injected at multiple points on the back and the inner thigh. Booster immunizations were given 4 times at 2-week intervals, each with 0.5 mg of KLH-polypeptide antigen thoroughly mixed with an equal volume of Freund's incomplete adjuvant. Two weeks after the last immunization, blood was collected and serum was isolated.
The neutralizing antibody titers of the immune sera were measured using 17 HPV pseudoviruses; the results are shown in Table 2. The 59RG-1 epitope peptide had the best immunological activity: its antiserum neutralized all 17 types tested, with neutralizing antibody titers of 10^3 or above against HPV45, -59 and -16, and titers between 500 and 1000 against HPV5, -31, -18, -39, -68 and -57. Notably, the 59RG-1 epitope peptide antiserum had high levels of neutralizing antibodies against all 5 α7-HPVs tested.
Methods for polypeptide synthesis, pseudovirus preparation and pseudovirus neutralization experiments are disclosed, for example, in patents CN 104418942A and CN 108676057A.
TABLE 1 sequence of synthesized different types of RG-1 epitope peptides
Identifier | Synthetic peptide sequence | SEQ ID NO. |
HPV35 | TQLYRTCKAAGTCPPDVIPKVEG | 44 |
HPV39 | STLYRTCKQSGTCPPDVVDKVEG | 45 |
HPV51 | TQLYSTCKAAGTCPPDVVNKVEG | 46 |
HPV53 | TQLYQTCKQSGTCPEDVINKIEH | 47 |
HPV56 | TQLYKTCKLSGTCPEDVVNKIEQ | 48 |
HPV59 | LYKTCKQAGTCPSDVINKVEGTT | 49 |
HPV68 | STLYKTCKQSGTCPPDVINKVEG | 50 |
HPV82 | TQLYSTCKAAGTCPPDVIPKVKG | 51 |
TABLE 2 serum neutralizing antibody titres induced by different RG1-KLH conjugated peptides in rabbits
Example 2: synthesis of chimeric L1 protein gene and construction of expression vector
The 26 chimeric L1 proteins are as follows:
1) Chimeric L1 protein 18L1DE 137-138/59dES: the backbone is the full-length HPV18 L1 protein (sequence shown in SEQ ID No. 1), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES chimeric protein is shown in SEQ ID No. 7. The polynucleotide sequence encoding 18L1DE 137-138/59dES was optimized for E. coli codon usage and constructed by whole-gene synthesis;
2) Chimeric L1 protein 18L1DE 137-138/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), with the aa.16-35 polypeptide of the HPV59 L2 protein (SEQ ID No. 5) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dE chimeric protein is shown in SEQ ID No. 9. The polynucleotide sequence encoding 18L1DE 137-138/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
3) Chimeric L1 protein 18L1h4 432-433/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), with the aa.17-37 polypeptide of the HPV59 L2 protein (SEQ ID No. 3) inserted directly at the aa.432/433 site of its h4 region. The amino acid sequence of the 18L1h4 432-433/59dE chimeric protein is shown in SEQ ID No. 11. The polynucleotide sequence encoding 18L1h4 432-433/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
4) Chimeric L1 protein 18L1h4 434-435/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), with the aa.17-37 polypeptide of the HPV59 L2 protein (SEQ ID No. 3) inserted directly at the aa.434/435 site of its h4 region. The amino acid sequence of the 18L1h4 434-435/59dE chimeric protein is shown in SEQ ID No. 13. The polynucleotide sequence encoding 18L1h4 434-435/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
5) Chimeric L1 protein 18L1DE 134-135/59dES: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), with the aa.17-32 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus and a P linker at its C-terminus (i.e. the sequence of SEQ ID No. 4 with glycine-proline added at the N-terminus and proline added at the C-terminus), inserted directly at the aa.134/135 site of its DE loop. The amino acid sequence of the 18L1DE 134-135/59dES chimeric protein is shown in SEQ ID No. 15. The polynucleotide sequence encoding 18L1DE 134-135/59dES was optimized for E. coli codon usage and constructed by whole-gene synthesis;
6) Chimeric L1 protein 18L1DE 134-135/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), with the aa.16-37 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus and a P linker at its C-terminus (i.e. the sequence of SEQ ID No. 6 with glycine-proline added at the N-terminus and proline added at the C-terminus), inserted directly at the aa.134/135 site of its DE loop. The amino acid sequence of the 18L1DE 134-135/59dE chimeric protein is shown in SEQ ID No. 17. The polynucleotide sequence encoding 18L1DE 134-135/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
7) Chimeric L1 protein 18L1DE 131-138/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), in which the aa.132-137 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus, is fused between aa.131/138 (a non-isometric substitution in the aa.132-137 region of the HPV18 L1 protein). The insert has the sequence of SEQ ID No. 3 with glycine-proline added at the N-terminus. The amino acid sequence of the 18L1DE 131-138/59dE chimeric protein is shown in SEQ ID No. 19. The polynucleotide sequence encoding 18L1DE 131-138/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
8) Chimeric L1 protein 18L1DE 121-124/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), in which the aa.122-123 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.121/124 (a non-isometric substitution in the aa.122-123 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1DE 121-124/59dE chimeric protein is shown in SEQ ID No. 21. The polynucleotide sequence encoding 18L1DE 121-124/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
9) Chimeric L1 protein 18L1h4 431-433/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), in which aa.432 is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.431/433 (a non-isometric substitution in the aa.431-433 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1h4 431-433/59dE chimeric protein is shown in SEQ ID No. 23. The polynucleotide sequence encoding 18L1h4 431-433/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
10) Chimeric L1 protein 18L1h4 432-435/59dE: the backbone is the full-length HPV18 L1 protein (SEQ ID No. 1), in which the aa.433-434 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.432/435 (a non-isometric substitution in the aa.432-435 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1h4 432-435/59dE chimeric protein is shown in SEQ ID No. 25. The polynucleotide sequence encoding 18L1h4 432-435/59dE was optimized for E. coli codon usage and constructed by whole-gene synthesis;
11) Chimeric L1 protein 18L1ΔCDE 137-138/59dES: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1ΔCDE 137-138/59dES chimeric protein is shown in SEQ ID No. 8. The polynucleotide sequence encoding 18L1ΔCDE 137-138/59dES was optimized for insect cell codon usage and constructed by whole-gene synthesis; the nucleotide sequence is shown in SEQ ID No. 33;
12) Chimeric L1 protein 18L1ΔCDE 137-138/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.16-35 polypeptide of the HPV59 L2 protein (SEQ ID No. 5) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1ΔCDE 137-138/59dE chimeric protein is shown in SEQ ID No. 10. The polynucleotide sequence encoding 18L1ΔCDE 137-138/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis; the nucleotide sequence is shown in SEQ ID No. 34;
13) Chimeric L1 protein 18L1ΔCh4 432-433/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.17-37 polypeptide of the HPV59 L2 protein (SEQ ID No. 3) inserted directly at the aa.432/433 site of its h4 region. The amino acid sequence of the 18L1ΔCh4 432-433/59dE chimeric protein is shown in SEQ ID No. 12. The polynucleotide sequence encoding 18L1ΔCh4 432-433/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis;
14) Chimeric L1 protein 18L1ΔCh4 434-435/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.17-37 polypeptide of the HPV59 L2 protein (SEQ ID No. 3) inserted directly at the aa.434/435 site of its h4 region. The amino acid sequence of the 18L1ΔCh4 434-435/59dE chimeric protein is shown in SEQ ID No. 14. The polynucleotide sequence encoding 18L1ΔCh4 434-435/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis;
15) Chimeric L1 protein 18L1ΔCDE 134-135/59dES: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.17-32 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus and a P linker at its C-terminus (i.e. the sequence of SEQ ID No. 4 with glycine-proline added at the N-terminus and proline added at the C-terminus), inserted directly at the aa.134/135 site of its DE loop. The amino acid sequence of the 18L1ΔCDE 134-135/59dES chimeric protein is shown in SEQ ID No. 16. The polynucleotide sequence encoding 18L1ΔCDE 134-135/59dES was optimized for insect cell codon usage and constructed by whole-gene synthesis; the nucleotide sequence is shown in SEQ ID No. 35;
16) Chimeric L1 protein 18L1ΔCDE 134-135/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), with the aa.16-37 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus and a P linker at its C-terminus (i.e. the sequence of SEQ ID No. 6 with glycine-proline added at the N-terminus and proline added at the C-terminus), inserted directly at the aa.134/135 site of its DE loop. The amino acid sequence of the 18L1ΔCDE 134-135/59dE chimeric protein is shown in SEQ ID No. 18. The polynucleotide sequence encoding 18L1ΔCDE 134-135/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis; the nucleotide sequence is shown in SEQ ID No. 36;
17) Chimeric L1 protein 18L1ΔCDE 131-138/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), in which the aa.132-137 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein, carrying a GP linker at its N-terminus, is fused between aa.131/138 (a non-isometric substitution in the aa.132-137 region of the HPV18 L1 protein). The insert has the sequence of SEQ ID No. 3 with glycine-proline added at the N-terminus. The amino acid sequence of the 18L1ΔCDE 131-138/59dE chimeric protein is shown in SEQ ID No. 20. The polynucleotide sequence encoding 18L1ΔCDE 131-138/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis; the nucleotide sequence is shown in SEQ ID No. 37;
18) Chimeric L1 protein 18L1ΔCDE 121-124/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), in which the aa.122-123 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.121/124 (a non-isometric substitution in the aa.122-123 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1ΔCDE 121-124/59dE chimeric protein is shown in SEQ ID No. 22. The polynucleotide sequence encoding 18L1ΔCDE 121-124/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis;
19) Chimeric L1 protein 18L1ΔCh4 431-433/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), in which aa.432 is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.431/433 (a non-isometric substitution in the aa.431-433 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1ΔCh4 431-433/59dE chimeric protein is shown in SEQ ID No. 24. The polynucleotide sequence encoding 18L1ΔCh4 431-433/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis;
20) Chimeric L1 protein 18L1ΔCh4 432-435/59dE: the backbone is the HPV18 L1 protein truncated by 32 amino acids at the C-terminus (the sequence of SEQ ID No. 1 with the C-terminal 32 amino acids removed), in which the aa.433-434 region is deleted and the aa.17-37 polypeptide of the HPV59 L2 protein is fused between aa.432/435 (a non-isometric substitution in the aa.432-435 region of the HPV18 L1 protein). The amino acid sequence of the insert is shown in SEQ ID No. 3. The amino acid sequence of the 18L1ΔCh4 432-435/59dE chimeric protein is shown in SEQ ID No. 26. The polynucleotide sequence encoding 18L1ΔCh4 432-435/59dE was optimized for insect cell codon usage and constructed by whole-gene synthesis;
21) Chimeric L1 protein 18L1DE 137-138/59dES-mut1: the backbone is mutant mut1 of the full-length HPV18 L1 protein (i.e. amino acids 477, 478, 484, 496, 499, 504 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine (G) and amino acids 485, 500 and 502 are replaced by serine (S)), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut1 chimeric protein is shown in SEQ ID No. 27. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut1 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut1 is shown in SEQ ID No. 38;
22) Chimeric L1 protein 18L1DE 137-138/59dES-mut2: the backbone is mutant mut2 of the full-length HPV18 L1 protein (i.e. amino acids 477, 478, 485, 496, 499, 504 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine and amino acids 486, 500 and 502 are replaced by serine), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut2 chimeric protein is shown in SEQ ID No. 28. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut2 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut2 is shown in SEQ ID No. 39;
23) Chimeric L1 protein 18L1DE 137-138/59dES-mut3: the backbone is mutant mut3 of the full-length HPV18 L1 protein (i.e. amino acids 477, 478, 484, 496, 499, 502 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine, amino acids 485 and 500 are replaced by serine, and amino acid 504 is replaced by aspartic acid), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut3 chimeric protein is shown in SEQ ID No. 29. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut3 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut3 is shown in SEQ ID No. 40;
24) Chimeric L1 protein 18L1DE 137-138/59dES-mut4: the backbone is mutant mut4 of the full-length HPV18 L1 protein (i.e. amino acids 477, 478, 485, 496, 502 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine, amino acids 486 and 500 are replaced by serine, and amino acids 499 and 504 are replaced by aspartic acid), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut4 chimeric protein is shown in SEQ ID No. 30. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut4 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut4 is shown in SEQ ID No. 41;
25) Chimeric L1 protein 18L1DE 137-138/59dES-mut5: the backbone is mutant mut5 of the full-length HPV18 L1 protein (i.e. amino acids 477, 484, 496, 499, 504 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine and amino acids 485, 500 and 502 are replaced by serine), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut5 chimeric protein is shown in SEQ ID No. 31. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut5 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut5 is shown in SEQ ID No. 42;
26) Chimeric L1 protein 18L1DE 137-138/59dES-mut6: the backbone is mutant mut6 of the full-length HPV18 L1 protein (i.e. amino acids 477, 485, 496, 499, 504 and 506 of the sequence shown in SEQ ID No. 1 are replaced by glycine and amino acids 486, 500 and 502 are replaced by serine), with the aa.17-32 polypeptide of the HPV59 L2 protein (SEQ ID No. 4) inserted directly at the aa.137/138 site of its DE loop. The amino acid sequence of the 18L1DE 137-138/59dES-mut6 chimeric protein is shown in SEQ ID No. 32. The polynucleotide sequence encoding 18L1DE 137-138/59dES-mut6 was optimized for E. coli or insect cell codon usage and constructed by whole-gene synthesis; the insect cell codon-optimized polynucleotide sequence of 18L1DE 137-138/59dES-mut6 is shown in SEQ ID No. 43.
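The two construction modes used in the list above (direct insertion between two adjacent residues, and non-isometric substitution in which a short region is deleted and replaced by the insert) can be sketched in a few lines of Python. This is an illustrative sketch only, not the inventors' procedure: the function names `build_chimera` and `chimera_length` and the toy backbone string are hypothetical, and the only values taken from the document are the 507-residue length of SEQ ID No. 1, the 32-residue C-terminal truncation, the insert lengths and the insertion sites.

```python
def build_chimera(backbone, left, right, insert, n_linker="", c_linker=""):
    """Keep backbone residues 1..left and right..end (1-based numbering)
    and place the (optionally linker-flanked) insert between them.

    right == left + 1  -> direct insertion, nothing is deleted
    right >  left + 1  -> residues left+1 .. right-1 are deleted (substitution)
    """
    if not (1 <= left < right <= len(backbone)):
        raise ValueError("need 1 <= left < right <= len(backbone)")
    return backbone[:left] + n_linker + insert + c_linker + backbone[right - 1:]


def chimera_length(backbone_len, left, right, insert_len, linker_len=0):
    """Expected length of the chimera without building the sequence."""
    deleted = right - left - 1
    return backbone_len - deleted + insert_len + linker_len


# Toy backbone (hypothetical, for illustration only)
toy = "ABCDEFGHIJ"
print(build_chimera(toy, 3, 4, "xyz"))  # direct insertion between residues 3 and 4
print(build_chimera(toy, 3, 6, "xyz"))  # residues 4-5 deleted, insert fused between 3 and 6

# Length bookkeeping with the numbers given in the text
FULL_LEN = 507              # length of SEQ ID No. 1 per the sequence listing
TRUNC_LEN = FULL_LEN - 32   # C-terminally truncated backbone
print(chimera_length(FULL_LEN, 137, 138, 16))     # 18L1DE 137-138/59dES
print(chimera_length(TRUNC_LEN, 137, 138, 16))    # 18L1ΔCDE 137-138/59dES
print(chimera_length(FULL_LEN, 131, 138, 21, 2))  # 18L1DE 131-138/59dE with GP linker
```

Using 1-based residue numbers in the function signature keeps the calls aligned with the "aa.137/138" style of site naming used throughout this example.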
The E. coli codon-optimized chimeric protein genes were digested with NdeI/XhoI and inserted into the commercial expression vector pET22b (Novagen). The insect cell codon-optimized chimeric protein genes were digested with BamHI/EcoRI and inserted into the commercial expression vector pFastBac1 (Invitrogen). Expression vectors containing the chimeric protein genes were thus obtained. The 16 expression vectors for E. coli are: pET22b-18L1DE 137-138/59dES, pET22b-18L1DE 137-138/59dE, pET22b-18L1h4 432-433/59dE, pET22b-18L1h4 434-435/59dE, pET22b-18L1DE 134-135/59dES, pET22b-18L1DE 134-135/59dE, pET22b-18L1DE 131-138/59dE, pET22b-18L1DE 121-124/59dE, pET22b-18L1h4 431-433/59dE, pET22b-18L1h4 432-435/59dE, pET22b-18L1DE 137-138/59dES-mut1, pET22b-18L1DE 137-138/59dES-mut2, pET22b-18L1DE 137-138/59dES-mut3, pET22b-18L1DE 137-138/59dES-mut4, pET22b-18L1DE 137-138/59dES-mut5 and pET22b-18L1DE 137-138/59dES-mut6. The 16 expression vectors for insect cells are: pFastBac1-18L1ΔCDE 137-138/59dES, pFastBac1-18L1ΔCDE 137-138/59dE, pFastBac1-18L1ΔCh4 432-433/59dE, pFastBac1-18L1ΔCh4 434-435/59dE, pFastBac1-18L1ΔCDE 134-135/59dES, pFastBac1-18L1ΔCDE 134-135/59dE, pFastBac1-18L1ΔCDE 131-138/59dE, pFastBac1-18L1ΔCDE 121-124/59dE, pFastBac1-18L1ΔCh4 431-433/59dE, pFastBac1-18L1ΔCh4 432-435/59dE, pFastBac1-18L1DE 137-138/59dES-mut1, pFastBac1-18L1DE 137-138/59dES-mut2, pFastBac1-18L1DE 137-138/59dES-mut3, pFastBac1-18L1DE 137-138/59dES-mut4, pFastBac1-18L1DE 137-138/59dES-mut5 and pFastBac1-18L1DE 137-138/59dES-mut6. The methods of restriction digestion, ligation and cloning are well known and are disclosed, for example, in patent CN101293918B.
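When a synthetic gene is cloned with a pair of restriction enzymes, the coding sequence itself normally must not contain those recognition sites. The text does not describe such a check, so the following is only a hedged sketch of how one might be done: the helper `internal_sites` and the toy sequence are hypothetical, while the recognition sequences are the standard ones for the four enzymes named above.

```python
# Standard recognition sequences of the enzymes named in the text.
# All four are palindromic, so scanning one strand is sufficient.
SITES = {
    "NdeI": "CATATG",
    "XhoI": "CTCGAG",
    "BamHI": "GGATCC",
    "EcoRI": "GAATTC",
}


def internal_sites(orf, enzymes):
    """Return {enzyme: [1-based start positions]} for every site found in orf."""
    orf = orf.upper()
    hits = {}
    for enzyme in enzymes:
        site = SITES[enzyme]
        positions = []
        start = orf.find(site)
        while start != -1:
            positions.append(start + 1)          # report 1-based positions
            start = orf.find(site, start + 1)    # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


# Toy coding sequence (hypothetical) containing one BamHI site
toy_orf = "ATGGCTGGATCCAAATAA"
print(internal_sites(toy_orf, ["BamHI", "EcoRI"]))  # pFastBac1 cloning pair
print(internal_sites(toy_orf, ["NdeI", "XhoI"]))    # pET22b cloning pair
```

An empty result for the chosen enzyme pair would indicate that the insert can be excised intact; any hit would normally be removed by a synonymous codon change during codon optimization.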
The amino acid sequence of the polypeptide used in the present invention is as follows:
HPV18 type L1 full-length amino acid
HPV59 type L2 full-length amino acid sequence
HPV59 type L2 aa.17-37
LYKTCKQAGTCPSDVINKVEG SEQ ID NO.3
HPV59 type L2 aa.17-32
LYKTCKQAGTCPSDVI SEQ ID NO.4
HPV59 type L2 aa.16-35
DLYKTCKQAGTCPSDVINKV SEQ ID NO.5
HPV59 type L2 aa.16-37
DLYKTCKQAGTCPSDVINKVEG SEQ ID NO.6
18L1DE 137-138/59dES
18L1ΔCDE 137-138/59dES
18L1DE 137-138/59dE
18L1ΔCDE 137-138/59dE
18L1h4 432-433/59dE
18L1ΔCh4 432-433/59dE
18L1h4 434-435/59dE
18L1ΔCh4 434-435/59dE
18L1DE 134-135/59dES
18L1ΔCDE 134-135/59dES
18L1DE 134-135/59dE
18L1ΔCDE 134-135/59dE
18L1DE 131-138/59dE
18L1ΔCDE 131-138/59dE
18L1DE 121-124/59dE
18L1ΔCDE 121-124/59dE
18L1h4 431-433/59dE
18L1ΔCh4 431-433/59dE
18L1h4 432-435/59dE
18L1ΔCh4 432-435/59dE
18L1DE 137-138/59dES-mut1
18L1DE 137-138/59dES-mut2
18L1DE 137-138/59dES-mut3
18L1DE 137-138/59dES-mut4
18L1DE 137-138/59dES-mut5
18L1DE 137-138/59dES-mut6
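The four HPV59 L2 fragments listed above (SEQ ID NO.3 to NO.6) are overlapping windows of the same region, and some constructs add a GP linker at the N-terminus and a P linker at the C-terminus. As a small consistency check, sketched here with hypothetical helper names (`span_length`, `check_fragments`, `add_linkers`), one can confirm that each fragment has the length implied by its residue range and sits at the expected offset within the longest fragment, aa.16-37.

```python
# Fragment sequences exactly as listed in the document
FRAGMENTS = {
    "aa.17-37": "LYKTCKQAGTCPSDVINKVEG",   # SEQ ID NO.3
    "aa.17-32": "LYKTCKQAGTCPSDVI",        # SEQ ID NO.4
    "aa.16-35": "DLYKTCKQAGTCPSDVINKV",    # SEQ ID NO.5
    "aa.16-37": "DLYKTCKQAGTCPSDVINKVEG",  # SEQ ID NO.6
}


def span_length(label):
    """Number of residues implied by a label such as 'aa.17-32'."""
    start, end = label.split(".")[1].split("-")
    return int(end) - int(start) + 1


def check_fragments(fragments=FRAGMENTS, reference="aa.16-37"):
    """Check each fragment against its label and against the reference window."""
    ref_seq = fragments[reference]
    ref_start = int(reference.split(".")[1].split("-")[0])
    report = {}
    for label, seq in fragments.items():
        start = int(label.split(".")[1].split("-")[0])
        offset = start - ref_start
        report[label] = (
            len(seq) == span_length(label)
            and ref_seq[offset:offset + len(seq)] == seq
        )
    return report


def add_linkers(seq, n_linker="GP", c_linker="P"):
    """Insert as used at the aa.134/135 site: GP + fragment + P."""
    return n_linker + seq + c_linker


print(check_fragments())
print(add_linkers(FRAGMENTS["aa.17-32"]))
```

The linker-flanked string printed last corresponds to the insert described for the 134-135/59dES constructs; for the 131-138 constructs only the N-terminal GP is added, which `add_linkers(seq, c_linker="")` would reproduce.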
The nucleotide sequence encoding the chimeric proteins of the invention is shown below:
18L1ΔCDE 137-138/59dES nt
18L1ΔCDE 137-138/59dE nt
18L1ΔCDE 134-135/59dES nt
18L1ΔCDE 134-135/59dE nt
18L1ΔCDE 131-138/59dE nt
18L1DE 137-138/59dES-mut1 nt
18L1DE 137-138/59dES-mut2 nt
18L1DE 137-138/59dES-mut3 nt
18L1DE 137-138/59dES-mut4 nt
18L1DE 137-138/59dES-mut5 nt
18L1DE 137-138/59dES-mut6 nt
Example 3: construction of recombinant Bacmid and recombinant baculovirus of chimeric L1 protein gene
The recombinant expression vectors containing the chimeric L1 genes, namely pFastBac1-18L1ΔCDE 137-138/59dES, pFastBac1-18L1ΔCDE 137-138/59dE, pFastBac1-18L1ΔCh4 432-433/59dE, pFastBac1-18L1ΔCh4 434-435/59dE, pFastBac1-18L1ΔCDE 134-135/59dES, pFastBac1-18L1ΔCDE 134-135/59dE, pFastBac1-18L1ΔCDE 131-138/59dE, pFastBac1-18L1ΔCDE 121-124/59dE, pFastBac1-18L1ΔCh4 431-433/59dE, pFastBac1-18L1ΔCh4 432-435/59dE, pFastBac1-18L1DE 137-138/59dES-mut1, pFastBac1-18L1DE 137-138/59dES-mut2, pFastBac1-18L1DE 137-138/59dES-mut3, pFastBac1-18L1DE 137-138/59dES-mut4, pFastBac1-18L1DE 137-138/59dES-mut5 and pFastBac1-18L1DE 137-138/59dES-mut6, were each used to transform competent E. coli DH10Bac cells. Recombinant Bacmids were obtained by screening and used to transfect Sf9 insect cells, and the recombinant baculoviruses were amplified in the Sf9 cells. Methods for screening recombinant Bacmids and amplifying recombinant baculoviruses are well known and are disclosed, for example, in patent CN101148661B.
Example 4: expression of genes of chimeric L1 proteins in Sf9 cells
Sf9 cells were infected with each of the 16 recombinant baculoviruses carrying a chimeric L1 gene to express the chimeric L1 proteins. After culture at 27 °C for about 88 hours, the fermentation broth was collected and centrifuged at 3000 rpm for 15 min, the supernatant was discarded, and the cells were washed with PBS for expression identification and purification. Methods of infection and expression are disclosed, for example, in patent CN101148661B.
Example 5: expression of chimeric L1 protein genes in E.coli
The recombinant expression vectors containing the chimeric L1 genes, namely pET22b-18L1DE 137-138/59dES, pET22b-18L1DE 137-138/59dE, pET22b-18L1h4 432-433/59dE, pET22b-18L1h4 434-435/59dE, pET22b-18L1DE 134-135/59dES, pET22b-18L1DE 134-135/59dE, pET22b-18L1DE 131-138/59dE, pET22b-18L1DE 121-124/59dE, pET22b-18L1h4 431-433/59dE, pET22b-18L1h4 432-435/59dE, pET22b-18L1DE 137-138/59dES-mut1, pET22b-18L1DE 137-138/59dES-mut2, pET22b-18L1DE 137-138/59dES-mut3, pET22b-18L1DE 137-138/59dES-mut4, pET22b-18L1DE 137-138/59dES-mut5 and pET22b-18L1DE 137-138/59dES-mut6, were each used to transform E. coli BL21 (DE3).
A single colony was inoculated into 3 ml of LB medium containing ampicillin and cultured overnight at 37 °C. The overnight culture was diluted 1:100 into LB medium and cultured at 37 °C for about 3 hours; when the OD600 reached 0.8-1.0, IPTG was added to a final concentration of 0.5 μM, the culture was continued at 16 °C for 12 hours, and the cells were collected.
Example 6: expression identification of chimeric L1 proteins
For each of the cells expressing the different chimeric L1 proteins described in Example 4 and Example 5, 1×10⁶ cells were resuspended in 200 μl of PBS, mixed with 50 μl of 6× loading buffer and denatured at 75 °C for 8 minutes; 10 μl of each sample was then analyzed by SDS-PAGE and Western blot. As shown in FIGS. 1A-1B, all 26 chimeric L1 proteins were expressed at high levels in insect cells or in the prokaryotic expression system. 18L1DE 137-138/59dES, 18L1DE 137-138/59dE, 18L1h4 432-433/59dE, 18L1h4 434-435/59dE, 18L1DE 134-135/59dES, 18L1DE 134-135/59dE, 18L1DE 131-138/59dE, 18L1DE 121-124/59dE, 18L1h4 431-433/59dE, 18L1h4 432-435/59dE, 18L1DE 137-138/59dES-mut1, 18L1DE 137-138/59dES-mut2, 18L1DE 137-138/59dES-mut3, 18L1DE 137-138/59dES-mut4, 18L1DE 137-138/59dES-mut5 and 18L1DE 137-138/59dES-mut6 are approximately 59 kDa in size, and the remaining 10 proteins are approximately 55 kDa. Methods of SDS-PAGE and Western blot identification are disclosed, for example, in patent CN101148661B.
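The sample preparation and band sizes reported above lend themselves to a quick plausibility check. The sketch below is not part of the described method: it assumes an average residue mass of about 110 Da (a common rule of thumb, not a value from this document) and uses chain lengths derived from the 507-residue backbone of SEQ ID No. 1 plus a 16-residue insert. The function names are hypothetical.

```python
AVG_RESIDUE_MASS_DA = 110  # rule-of-thumb average; an assumption, not from the source


def approx_mass_kda(n_residues):
    """Rough molecular mass estimate from chain length."""
    return round(n_residues * AVG_RESIDUE_MASS_DA / 1000, 1)


def cell_equivalents_per_lane(cells=1_000_000, pbs_ul=200, buffer_ul=50, loaded_ul=10):
    """Cell equivalents loaded per lane for the volumes given in the text."""
    return cells * loaded_ul / (pbs_ul + buffer_ul)


full_length_chimera = 507 + 16        # e.g. 18L1DE 137-138/59dES
truncated_chimera = 507 - 32 + 16     # e.g. 18L1ΔCDE 137-138/59dES

print(approx_mass_kda(full_length_chimera))  # estimate for a full-length-backbone chimera
print(approx_mass_kda(truncated_chimera))    # estimate for a truncated-backbone chimera
print(cell_equivalents_per_lane())           # cell equivalents in 10 μl of a 250 μl sample
```

The estimates are only approximate, but they are consistent with the reported pattern that the 16 chimeras on full-length or mutant backbones migrate a few kDa above the 10 chimeras on the C-terminally truncated backbone.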
Example 7: comparison of expression level of chimeric L1 protein in insect cells
Cells (1×10⁶) expressing the 18L1 backbone protein truncated by 32 amino acids at the C-terminus, or expressing the chimeric L1 proteins built on the C-terminally truncated 18L1 backbone or on the 6 18L1 mutant backbones, as described in Example 4, were resuspended in 200 μl of PBS and disrupted by sonication (Ningbo Xinzhi ultrasonic disrupter, 2# probe, 100 W, 5 s on, 7 s off, 3 min total), then centrifuged at 12000 rpm for 10 min. The lysate supernatant was collected and its L1 content was measured by sandwich ELISA, a well-known method disclosed, for example, in patent CN104513826A.
An ELISA plate was coated with an HPV18 L1 monoclonal antibody prepared by the inventors and incubated overnight at 4 °C. The plate was blocked with 5% BSA-PBST for 2 h at room temperature and washed 3 times with PBST. The lysates were serially diluted 2-fold in PBS, and the HPV18 L1 VLP standard was likewise diluted in a gradient from 2 μg/ml to 0.0625 μg/ml; the dilutions were added to the plate at 100 μl per well and incubated at 37 °C for 1 h. The plate was washed 3 times with PBST, and HPV18 L1 rabbit polyclonal antibody diluted 1:3000 was added at 100 μl per well and incubated at 37 °C for 1 h. The plate was washed 3 times with PBST, and HRP-labeled goat anti-mouse IgG (1:3000 dilution, Zhongshan Golden Bridge Co.) was added and incubated at 37 °C for 45 minutes. The plate was washed 5 times with PBST, 100 μl of OPD substrate (Sigma) was added to each well, color was developed at 37 °C for 5 minutes, the reaction was stopped with 50 μl of 2 M sulfuric acid, and the absorbance was measured at 490 nm. The concentrations of HPV18 L1 protein and 18L1 chimeric proteins in the lysates were calculated from the standard curve.
As shown in Table 3, the expression levels of 18L1ΔCDE 134-135/59dES, 18L1ΔCDE 137-138/59dES, 18L1ΔCh4 431-433/59dE and 18L1ΔCh4 432-435/59dE of the present invention were very high and comparable to that of the HPV18 L1 backbone. In addition, the chimeric proteins 18L1DE 137-138/59dES-mut1, 18L1DE 137-138/59dES-mut4 and 18L1DE 137-138/59dES-mut5, which use 18L1 mutants with C-terminal amino acid substitutions as the backbone, were expressed at higher levels than the HPV18 L1 backbone and the C-terminally truncated chimeric proteins.
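The final step, reading lysate concentrations off the standard curve, can be illustrated as follows. This is a minimal sketch under stated assumptions, not the calculation actually used: the text does not say how the curve was fitted, so piecewise-linear interpolation is assumed here, and the absorbance values are invented purely to make the example run. Only the 2-fold standard series from 2 μg/ml down to 0.0625 μg/ml comes from the document.

```python
def standard_concentrations(top=2.0, lowest=0.0625):
    """2-fold dilution series of the VLP standard, in μg/ml."""
    concs = []
    c = top
    while c >= lowest:
        concs.append(c)
        c /= 2
    return concs


def interpolate_concentration(od, curve):
    """Piecewise-linear interpolation on a list of (concentration, OD) pairs."""
    pts = sorted(curve, key=lambda p: p[1])
    if not (pts[0][1] <= od <= pts[-1][1]):
        raise ValueError("OD outside the range of the standard curve")
    for (c0, od0), (c1, od1) in zip(pts, pts[1:]):
        if od0 <= od <= od1:
            if od1 == od0:
                return c0
            return c0 + (c1 - c0) * (od - od0) / (od1 - od0)


def lysate_concentration(od, curve, dilution_factor):
    """Concentration in the undiluted lysate, in μg/ml."""
    return interpolate_concentration(od, curve) * dilution_factor


concs = standard_concentrations()
# Hypothetical absorbance readings at 490 nm, invented for illustration only
ods = [1.6, 0.8, 0.4, 0.2, 0.1, 0.05]
curve = list(zip(concs, ods))

print(concs)
print(lysate_concentration(0.6, curve, dilution_factor=4))
```

Readings that fall outside the standard range raise an error rather than being extrapolated, which is why the lysates are measured as a 2-fold dilution series: at least one dilution should land within the curve.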
TABLE 3 chimeric L1 protein expression level analysis
Example 8: purification of chimeric L1 proteins and dynamic light scattering particle size analysis
An appropriate amount of chimeric L1 cell fermentation broth was taken, and the cells were resuspended in 10 ml of PBS. PMSF was added to a final concentration of 1 mg/ml, and the cells were disrupted by sonication (Ningbo Xinzhi ultrasonic disrupter, 6# probe, 200 W, 5 s on, 7 s off, 10 min total). The lysate supernatant was collected for purification, and all purification steps were carried out at room temperature. VLPs were depolymerized by adding 4% β-mercaptoethanol (w/w) to the lysate, and the samples were filtered through a 0.22 μm filter and then purified sequentially by DMAE anion exchange chromatography or CM cation exchange chromatography (elution with 20 mM Tris, 180 mM NaCl, 4% β-ME, pH 7.9), TMAE anion exchange chromatography or Q cation exchange chromatography (elution with 20 mM Tris, 180 mM NaCl, 4% β-ME, pH 7.9) and hydroxyapatite chromatography (elution with 100 mM NaH2PO4, 30 mM NaCl, 4% β-ME, pH 6.0). The purified product was concentrated using a Planova ultrafiltration system, and the buffer was exchanged (20 mM NaH2PO4, 500 mM NaCl, pH 6.0) to promote VLP assembly. The above purification methods are disclosed, for example, in patents CN101293918B and CN1976718A.
During assembly of the purified chimeric proteins, 18L1h4 431-433/59dE, 18L1h4 432-435/59dE, 18L1ΔCh4 431-433/59dE and 18L1ΔCh4 432-435/59dE were found to aggregate severely, whereas no aggregation was observed after assembly of the other chimeric proteins. The assembled chimeric protein solutions were subjected to DLS particle size analysis (Zetasizer Nano ZS dynamic light scattering instrument, Malvern); the results are shown in Table 4, and the DLS analysis charts of 18L1ΔCDE 134-135/59dE, 18L1ΔCDE 134-135/59dES, 18L1ΔCDE 137-138/59dE and 18L1ΔCDE 137-138/59dES are shown in FIGS. 2A to 2D.
TABLE 4 chimeric L1 protein DLS analysis
Example 9: transmission electron microscopy of chimeric VLPs
The chimeric proteins were each purified by the chromatographic method described in Example 8. Copper grids were prepared with the assembled chimeric proteins, stained with 1% uranyl acetate, dried thoroughly and observed with a JEM-1400 electron microscope (Olympus). The results show that the chimeric proteins expressed in both E. coli and insect cells can assemble into cVLPs with a diameter of about 50 nm. Electron micrographs of the 18L1ΔCDE 134-135/59dE, 18L1ΔCDE 134-135/59dES, 18L1ΔCDE 137-138/59dE and 18L1ΔCDE 137-138/59dES cVLPs are shown in FIGS. 3A to 3D. Methods of copper grid preparation and electron microscopy are disclosed, for example, in patent CN101148661B.
Example 10: mouse immunization and neutralizing antibody titer assay for chimeric VLPs
BALB/c mice of 4-6 weeks of age were randomly grouped, 5 animals per group, combined with 10. Mu.g cVLP in combination with Al (OH) 3 50 μg and MPL adjuvant 5 μg immunized mice. Subcutaneous injections were performed and immunized 4 times at weeks 0,4,7, 10. Tail vein blood collection was carried out 2 weeks after the 4 th immunization, and serum was separated.
The results of the detection of neutralizing antibody titers in immune serum using 24 HPV pseudoviruses showed that the level and neutralization range of cross-neutralizing antibodies induced after immunization of mice with various crps produced by escherichia coli and insect cell expression systems were different. Wherein, as shown in Table 5, the insect cells expressed 18L 1. Delta. CDE 134-135 59dES and 18L1 ΔCDE 137-138 The 59dES cVLP antiserum can neutralize at least 23 pseudoviruses, 18L1ΔCDE 134-135 59dE and 18L1ΔCDE 137-138 The 59dE cVLP immune serum can neutralize at least 19 pseudoviruses. It is worth mentioning that 18L1 ΔCDE 134-135 59dES and 18L1 ΔCDE 137-138 59dES cVLP antiserum neutralizes all 6 detectable alpha 7-HPVs, particularly 18L 1. Delta. CDE 137-138 The antibody titer of the 59dES cVLP for cross-neutralizing alpha 7-HPV is more than 250, and the cVLP with the highest cross-neutralizing alpha 7-HPV capability is reported at present.
In addition, cVLPs constructed in the present invention from the 18L1 mutant with a 32-amino-acid C-terminal truncation also induced high levels of neutralizing antibodies in mice immunized by this strategy. The level of each neutralizing antibody induced by 18L1DE137-138/59dES-mut4 was comparable to that induced by 18L1ΔCDE137-138/59dES; notably, the HPV39 and HPV59 neutralizing antibody titers of the 18L1DE137-138/59dES-mut4 immune serum were both greater than 10^3 (as shown in Table 5 and FIG. 4).
Methods for pseudovirus preparation and pseudovirus neutralization experiments are disclosed, for example, in patent CN 104418942A.
Table 5. Neutralizing antibody titers induced in mice by different cVLPs
* ND means that no neutralizing antibodies were detected at the lowest dilution.
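The construct names used above encode an insertion site and an inserted peptide, and this can be checked against the sequence listing that follows. As a minimal sketch (the function names `insert_after` and `expected_length` are illustrative and not part of the patent), the Python below rebuilds residues 129-160 of SEQ ID NO: 7 by inserting SEQ ID NO: 4 between residues 137 and 138 of the HPV18 L1 backbone (SEQ ID NO: 1), and reproduces the <211> lengths of SEQ ID NOs: 7 to 10 under the assumption that "ΔC" denotes removal of the 32 C-terminal residues. Only the residues shown are taken from the listing; the interpretation of the naming scheme is inferred from comparing the listed sequences.

```python
# Residues copied from the sequence listing (three-letter codes).
L1_129_144 = "Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg".split()
SEQ4 = "Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile".split()
SEQ7_129_160 = (
    "Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln "
    "Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg"
).split()


def insert_after(fragment, peptide, site, first_residue=1):
    """Insert `peptide` after 1-based residue number `site`.

    `first_residue` is the residue number of fragment[0], so that a
    partial backbone fragment can be used (hypothetical helper).
    """
    cut = site - first_residue + 1
    if not 0 <= cut <= len(fragment):
        raise ValueError("insertion site lies outside the fragment")
    return fragment[:cut] + peptide + fragment[cut:]


def expected_length(backbone_len, insert_len, c_truncation=0):
    """Length of a chimera after one insertion and an optional C-terminal truncation."""
    return backbone_len + insert_len - c_truncation


# Insert SEQ ID NO: 4 between residues 137 and 138 of the L1 fragment.
chimera_fragment = insert_after(L1_129_144, SEQ4, site=137, first_residue=129)
print(chimera_fragment == SEQ7_129_160)

# Lengths to compare with the <211> fields of SEQ ID NOs: 7, 8, 9 and 10.
print(expected_length(507, 16), expected_length(507, 16, 32))
print(expected_length(507, 20), expected_length(507, 20, 32))
```

The same check can be extended to the 134-135 constructs, where the listed sequences additionally show flanking Gly-Pro and Pro residues around the inserted peptide.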
Sequence listing
<110> basic medical institute of the national academy of medical science
<120> a chimeric protein of human papillomavirus 18 and use thereof
<130> 300263CG
<160> 51
<170> SIPOSequenceListing 1.0
<210> 1
<211> 507
<212> PRT
<213> HPV 18
<400> 1
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
435 440 445
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
450 455 460
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro Thr
465 470 475 480
Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Lys
485 490 495
Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
500 505
<210> 2
<211> 464
<212> PRT
<213> HPV 59
<400> 2
Met Val Ser His Arg Ala Ala Arg Arg Lys Arg Ala Ser Ala Thr Asp
1 5 10 15
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
20 25 30
Asn Lys Val Glu Gly Thr Thr Leu Ala Asp Lys Ile Leu Gln Trp Thr
35 40 45
Ser Leu Gly Ile Phe Leu Gly Gly Leu Gly Ile Gly Thr Gly Ser Gly
50 55 60
Thr Gly Gly Arg Thr Gly Tyr Ile Pro Leu Gly Gly Arg Thr Asn Thr
65 70 75 80
Ile Val Asp Val Ser Pro Ala Lys Pro Pro Val Val Ile Glu Pro Val
85 90 95
Gly Pro Thr Asp Pro Ser Ile Val Thr Leu Val Glu Asp Ser Ser Val
100 105 110
Ile Thr Ser Gly Ala Pro Ala Pro Thr Phe Thr Gly Thr Ser Gly Phe
115 120 125
Glu Ile Ser Thr Ser Ser Thr Thr Thr Pro Ala Val Leu Asp Ile Thr
130 135 140
Pro Thr Ser Ser Val Gln Ile Ser Ser Ser Ser Phe Ile Asn Pro Ala
145 150 155 160
Phe Thr Asp Pro Ser Val Ile Glu Val Pro Gln Thr Gly Glu Ile Ser
165 170 175
Gly Asn Ile Leu Ile Ser Thr Pro Thr Ser Gly Ala His Gly Tyr Glu
180 185 190
Glu Ile Pro Met Gln Thr Phe Ala Thr Glu Gly Thr Gly Leu Glu Pro
195 200 205
Ile Ser Ser Thr Pro Asn Pro Thr Val Arg Arg Val Ala Gly Pro Arg
210 215 220
Leu Tyr Ser Arg Ala Asn Gln Gln Val Arg Val Ser Asp Ala Asn Phe
225 230 235 240
Leu Thr Arg Pro Ser Thr Phe Val Thr Tyr Asp Asn Pro Ala Tyr Asp
245 250 255
Pro Ile Asp Thr Thr Leu Thr Phe Asp Pro Ser Ser Glu Val Pro Asp
260 265 270
Pro Asp Phe Met Asp Ile Val Arg Leu His Arg Pro Ala Leu Thr Ser
275 280 285
Arg Arg Ser Thr Val Arg Phe Ser Arg Leu Gly Gln Arg Ala Thr Met
290 295 300
Phe Thr Arg Ser Gly Lys Gln Ile Gly Ala Arg Val His Phe Tyr His
305 310 315 320
Asp Ile Ser Pro Ile Pro His Ala Glu Asn Ile Glu Leu Gln Pro Leu
325 330 335
Val Ser Ser Gln Ala Ala Thr Asp Asp Ile Tyr Asp Ile Tyr Ala Asp
340 345 350
Ile Thr Asp Glu Ala Pro Thr Ser Thr Ala Asn Thr Ala Phe Thr Ile
355 360 365
Pro Lys Ser Ser Phe Gln Ser Leu Ser Leu Thr Arg Ser Ala Ser Ser
370 375 380
Thr Phe Ser Asn Val Thr Val Pro Leu Ala Thr Ala Trp Asp Val Pro
385 390 395 400
Val Asn Thr Gly Pro Asp Ile Val Leu Pro Asn Thr Asn Ile Val Gly
405 410 415
Pro Thr Tyr Ser Thr Thr Pro Phe Thr Thr Ile Gln Ser Ile Asn Ile
420 425 430
Glu Gly Thr Asn Tyr Phe Leu Trp Pro Ile Tyr Tyr Phe Leu Pro Arg
435 440 445
Lys Arg Lys Arg Val Pro Tyr Phe Phe Thr Asp Gly Ser Met Ala Phe
450 455 460
<210> 3
<211> 21
<212> PRT
<213> HPV 59
<400> 3
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
Asn Lys Val Glu Gly
20
<210> 4
<211> 16
<212> PRT
<213> HPV 59
<400> 4
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
<210> 5
<211> 20
<212> PRT
<213> HPV 59
<400> 5
Asp Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val
1 5 10 15
Ile Asn Lys Val
20
<210> 6
<211> 22
<212> PRT
<213> HPV 59
<400> 6
Asp Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val
1 5 10 15
Ile Asn Lys Val Glu Gly
20
<210> 7
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES
<400> 7
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro Thr
485 490 495
Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Lys
500 505 510
Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520
<210> 8
<211> 491
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(491)
<223> 18L1ΔCDE137-138/59dES
<400> 8
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 9
<211> 527
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(527)
<223> 18L1DE137-138/59dE
<400> 9
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asp Leu Tyr Lys Thr Cys Lys
130 135 140
Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Asn Val Ser
145 150 155 160
Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu
165 170 175
Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly
180 185 190
Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu
195 200 205
Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly
210 215 220
Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val
225 230 235 240
Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln
245 250 255
Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg
260 265 270
Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly
275 280 285
Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala
290 295 300
Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val
305 310 315 320
Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala
325 330 335
Gln Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr
340 345 350
Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr
355 360 365
Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr
370 375 380
Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys
385 390 395 400
Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn
405 410 415
Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr
420 425 430
Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr
435 440 445
Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg
485 490 495
Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr
500 505 510
Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 10
<211> 495
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(495)
<223> 18L1ΔCDE137-138/59dE
<400> 10
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asp Leu Tyr Lys Thr Cys Lys
130 135 140
Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Asn Val Ser
145 150 155 160
Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu
165 170 175
Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly
180 185 190
Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu
195 200 205
Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly
210 215 220
Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val
225 230 235 240
Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln
245 250 255
Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg
260 265 270
Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly
275 280 285
Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala
290 295 300
Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val
305 310 315 320
Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala
325 330 335
Gln Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr
340 345 350
Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr
355 360 365
Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr
370 375 380
Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys
385 390 395 400
Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn
405 410 415
Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr
420 425 430
Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr
435 440 445
Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 11
<211> 528
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(528)
<223> 18L1h4432-433/59dE
<400> 11
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala
500 505 510
Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 12
<211> 496
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(496)
<223> 18L1ΔCh4432-433/59dE
<400> 12
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 13
<211> 528
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(528)
<223> 18L1h4434-435/59dE
<400> 13
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp
435 440 445
Val Ile Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala
500 505 510
Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 14
<211> 496
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(496)
<223> 18L1ΔCh4434-435/59dE
<400> 14
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp
435 440 445
Val Ile Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 15
<211> 526
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1DE134-135/59dES
<400> 15
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala
130 135 140
Gly Thr Cys Pro Ser Asp Val Ile Pro Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 16
<211> 494
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCDE134-135/59dES
<400> 16
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala
130 135 140
Gly Thr Cys Pro Ser Asp Val Ile Pro Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 17
<211> 532
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(532)
<223> 18L1DE134-135/59dE
<400> 17
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Asp Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Pro Ala
145 150 155 160
Thr Ser Asn Val Ser Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr
165 170 175
Lys Gln Thr Gln Leu Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu
180 185 190
His Trp Ala Lys Gly Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly
195 200 205
Asp Cys Pro Pro Leu Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp
210 215 220
Met Val Asp Thr Gly Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp
225 230 235 240
Thr Lys Cys Glu Val Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr
245 250 255
Pro Asp Tyr Leu Gln Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe
260 265 270
Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg
275 280 285
Ala Gly Thr Met Gly Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly
290 295 300
Thr Gly Met Arg Ala Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro
305 310 315 320
Ser Gly Ser Ile Val Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr
325 330 335
Trp Leu His Lys Ala Gln Gly His Asn Asn Gly Val Cys Trp His Asn
340 345 350
Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr
355 360 365
Ile Cys Ala Ser Thr Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr
370 375 380
Lys Phe Lys Gln Tyr Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe
385 390 395 400
Ile Phe Gln Leu Cys Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr
405 410 415
Ile His Ser Met Asn Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val
420 425 430
Pro Pro Pro Pro Thr Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln
435 440 445
Ser Val Ala Ile Thr Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys
450 455 460
Asp Pro Tyr Asp Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys
465 470 475 480
Phe Ser Leu Asp Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val
485 490 495
Gln Ala Gly Leu Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser
500 505 510
Ala Pro Ser Ala Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val
515 520 525
Arg Ala Arg Lys
530
<210> 18
<211> 500
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(500)
<223> 18L1ΔCDE134-135/59dE
<400> 18
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Asp Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Pro Ala
145 150 155 160
Thr Ser Asn Val Ser Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr
165 170 175
Lys Gln Thr Gln Leu Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu
180 185 190
His Trp Ala Lys Gly Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly
195 200 205
Asp Cys Pro Pro Leu Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp
210 215 220
Met Val Asp Thr Gly Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp
225 230 235 240
Thr Lys Cys Glu Val Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr
245 250 255
Pro Asp Tyr Leu Gln Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe
260 265 270
Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg
275 280 285
Ala Gly Thr Met Gly Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly
290 295 300
Thr Gly Met Arg Ala Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro
305 310 315 320
Ser Gly Ser Ile Val Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr
325 330 335
Trp Leu His Lys Ala Gln Gly His Asn Asn Gly Val Cys Trp His Asn
340 345 350
Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr
355 360 365
Ile Cys Ala Ser Thr Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr
370 375 380
Lys Phe Lys Gln Tyr Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe
385 390 395 400
Ile Phe Gln Leu Cys Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr
405 410 415
Ile His Ser Met Asn Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val
420 425 430
Pro Pro Pro Pro Thr Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln
435 440 445
Ser Val Ala Ile Thr Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys
450 455 460
Asp Pro Tyr Asp Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys
465 470 475 480
Phe Ser Leu Asp Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val
485 490 495
Gln Ala Gly Leu
500
<210> 19
<211> 524
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(524)
<223> 18L1DE131-138/59dE
<400> 19
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys
130 135 140
Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Val Ser Glu Asp Val
145 150 155 160
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
165 170 175
Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys
180 185 190
Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys
195 200 205
Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
210 215 220
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
225 230 235 240
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
245 250 255
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
260 265 270
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val
275 280 285
Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly
290 295 300
Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp
305 310 315 320
Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His
325 330 335
Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp
340 345 350
Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro
355 360 365
Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His
370 375 380
Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr
385 390 395 400
Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile
405 410 415
Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu
420 425 430
Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys
435 440 445
Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe
450 455 460
Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr
465 470 475 480
Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro
485 490 495
Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser
500 505 510
Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520
<210> 20
<211> 492
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(492)
<400> 20
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys
130 135 140
Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Val Ser Glu Asp Val
145 150 155 160
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
165 170 175
Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys
180 185 190
Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys
195 200 205
Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
210 215 220
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
225 230 235 240
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
245 250 255
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
260 265 270
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val
275 280 285
Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly
290 295 300
Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp
305 310 315 320
Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His
325 330 335
Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp
340 345 350
Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro
355 360 365
Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His
370 375 380
Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr
385 390 395 400
Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile
405 410 415
Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu
420 425 430
Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys
435 440 445
Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe
450 455 460
Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr
465 470 475 480
Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 21
<211> 526
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1DE121-124/59dE
<400> 21
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Lys Thr Cys Lys Gln
115 120 125
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Lys
130 135 140
Leu Asp Asp Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 22
<211> 494
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCDE121-124/59dE
<400> 22
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Lys Thr Cys Lys Gln
115 120 125
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Lys
130 135 140
Leu Asp Asp Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 23
<211> 527
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(527)
<223> 18L1h4431-433/59dE
<400> 23
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Leu
420 425 430
Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn
435 440 445
Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg
485 490 495
Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr
500 505 510
Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 24
<211> 495
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(495)
<223> 18L1ΔCh4431-433/59dE
<400> 24
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Leu
420 425 430
Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn
435 440 445
Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 25
<211> 526
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1h4432-435/59dE
<400> 25
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 26
<211> 494
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCh4432-435/59dE
<400> 26
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 27
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut1
<400> 27
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 28
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut2
<400> 28
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 29
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut3
<400> 29
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Gly Val Asp Ala Gly Lys
515 520
<210> 30
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut4
<400> 30
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Asp Ser Val Gly Val Asp Ala Gly Lys
515 520
<210> 31
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut5
<400> 31
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Lys Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 32
<211> 523
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut6
<400> 32
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Lys Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 33
<211> 1477
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1477)
<223> 18L1ΔCDE137-138/59dES nt
<400> 33
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctctaat 1477
<210> 34
<211> 1489
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1489)
<223> 18L1ΔCDE137-138/59dE nt
<400> 34
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cgacctgtac 420
aagacctgca agcaggccgg tacctgcccc tccgacgtca tcaacaaggt caacgtcagc 480
gaagatgtcc gcgataacgt cagcgtggac tacaaacaaa cccaactgtg catcctcggt 540
tgcgcacccg ccatcggcga gcattgggcc aagggtaccg cctgcaagag caggcccctg 600
agccaaggtg actgtccacc cctggagttg aagaataccg tcctcgagga cggcgacatg 660
gtggacaccg gctacggcgc aatggatttc tccaccctgc aggacaccaa gtgcgaagtg 720
cccctcgaca tctgccaaag catctgcaag taccccgact acctgcagat gagcgccgac 780
ccctacggcg actccatgtt cttctgtctg agaagggaac aattgttcgc ccgccacttc 840
tggaaccgcg ccggcaccat gggcgatacc gtcccccagt ccctgtacat caagggtacc 900
ggcatgaggg ccagccccgg ttcatgcgtc tacagcccaa gcccctccgg tagcatcgtc 960
acaagcgatt cccaactctt caacaagccc tactggctgc acaaagccca aggccacaat 1020
aacggcgtct gttggcacaa ccagctgttc gtcaccgtcg tggacacaac caggtccaca 1080
aacctgacca tctgcgccag cacccaaagc cccgtgcccg gccagtacga cgccacaaag 1140
ttcaaacaat actcacgcca cgtcgaagag tacgacctcc aattcatctt ccaactctgc 1200
accatcaccc tgaccgccga cgtcatgtcc tacatccact ccatgaactc atccatcctg 1260
gaagactgga atttcggcgt cccaccaccc cccaccacct ccctcgtcga cacctacagg 1320
ttcgtgcaga gcgtcgccat cacatgccag aaagacgccg cccccgccga gaacaaagac 1380
ccatacgaca aactgaaatt ctggaacgtc gacctgaaag agaaattcag cctggatctg 1440
gaccagtacc cattgggcag gaagttcctc gtccaggcgg gtctctaat 1489
<210> 35
<211> 1486
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1486)
<223> 18L1ΔCDE134-135/59dES nt
<400> 35
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg caggaccact gtacaagacc 420
tgcaagcagg ccggtacctg cccctccgac gtcatcccag caaccagcaa cgtcagcgaa 480
gatgtccgcg ataacgtcag cgtggactac aaacaaaccc aactgtgcat cctcggttgc 540
gcacccgcca tcggcgagca ttgggccaag ggtaccgcct gcaagagcag gcccctgagc 600
caaggtgact gtccacccct ggagttgaag aataccgtcc tcgaggacgg cgacatggtg 660
gacaccggct acggcgcaat ggatttctcc accctgcagg acaccaagtg cgaagtgccc 720
ctcgacatct gccaaagcat ctgcaagtac cccgactacc tgcagatgag cgccgacccc 780
tacggcgact ccatgttctt ctgtctgaga agggaacaat tgttcgcccg ccacttctgg 840
aaccgcgccg gcaccatggg cgataccgtc ccccagtccc tgtacatcaa gggtaccggc 900
atgagggcca gccccggttc atgcgtctac agcccaagcc cctccggtag catcgtcaca 960
agcgattccc aactcttcaa caagccctac tggctgcaca aagcccaagg ccacaataac 1020
ggcgtctgtt ggcacaacca gctgttcgtc accgtcgtgg acacaaccag gtccacaaac 1080
ctgaccatct gcgccagcac ccaaagcccc gtgcccggcc agtacgacgc cacaaagttc 1140
aaacaatact cacgccacgt cgaagagtac gacctccaat tcatcttcca actctgcacc 1200
atcaccctga ccgccgacgt catgtcctac atccactcca tgaactcatc catcctggaa 1260
gactggaatt tcggcgtccc accacccccc accacctccc tcgtcgacac ctacaggttc 1320
gtgcagagcg tcgccatcac atgccagaaa gacgccgccc ccgccgagaa caaagaccca 1380
tacgacaaac tgaaattctg gaacgtcgac ctgaaagaga aattcagcct ggatctggac 1440
cagtacccat tgggcaggaa gttcctcgtc caggcgggtc tctaat 1486
<210> 36
<211> 1504
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1504)
<223> 18L1ΔCDE134-135/59dE nt
<400> 36
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg caggaccaga cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acaaggtcga aggaccagca 480
accagcaacg tcagcgaaga tgtccgcgat aacgtcagcg tggactacaa acaaacccaa 540
ctgtgcatcc tcggttgcgc acccgccatc ggcgagcatt gggccaaggg taccgcctgc 600
aagagcaggc ccctgagcca aggtgactgt ccacccctgg agttgaagaa taccgtcctc 660
gaggacggcg acatggtgga caccggctac ggcgcaatgg atttctccac cctgcaggac 720
accaagtgcg aagtgcccct cgacatctgc caaagcatct gcaagtaccc cgactacctg 780
cagatgagcg ccgaccccta cggcgactcc atgttcttct gtctgagaag ggaacaattg 840
ttcgcccgcc acttctggaa ccgcgccggc accatgggcg ataccgtccc ccagtccctg 900
tacatcaagg gtaccggcat gagggccagc cccggttcat gcgtctacag cccaagcccc 960
tccggtagca tcgtcacaag cgattcccaa ctcttcaaca agccctactg gctgcacaaa 1020
gcccaaggcc acaataacgg cgtctgttgg cacaaccagc tgttcgtcac cgtcgtggac 1080
acaaccaggt ccacaaacct gaccatctgc gccagcaccc aaagccccgt gcccggccag 1140
tacgacgcca caaagttcaa acaatactca cgccacgtcg aagagtacga cctccaattc 1200
atcttccaac tctgcaccat caccctgacc gccgacgtca tgtcctacat ccactccatg 1260
aactcatcca tcctggaaga ctggaatttc ggtgtcccac caccccccac cacctccctc 1320
gtcgacacct acaggttcgt gcagagcgtc gccatcacat gccagaaaga cgccgccccc 1380
gccgagaaca aagacccata cgacaaactg aaattctgga acgtcgacct gaaagagaaa 1440
ttcagcctgg atctggacca gtacccattg ggcaggaagt tcctcgtcca ggcgggtctc 1500
taat 1504
<210> 37
<211> 1480
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1480)
<223> 18L1ΔCDE131-138/59dE nt
<400> 37
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tccggtcccc tgtacaagac ctgcaagcag 420
gccggtacct gcccctccga cgtcatcaac aaggtcgaag gaaacgtcag cgaagatgtc 480
cgcgataacg tcagcgtgga ctacaaacaa acccaactgt gcatcctcgg ttgcgcaccc 540
gccatcggcg agcattgggc caagggtacc gcctgcaaga gcaggcccct gagccaaggt 600
gactgtccac ccctggagtt gaagaatacc gtcctcgagg acggcgacat ggtggacacc 660
ggctacggcg caatggattt ctccaccctg caggacacca agtgcgaagt gcccctcgac 720
atctgccaaa gcatctgcaa gtaccccgac tacctgcaga tgagcgccga cccctacggc 780
gactccatgt tcttctgtct gagaagggaa caattgttcg cccgccactt ctggaaccgc 840
gccggcacca tgggcgatac cgtcccccag tccctgtaca tcaagggtac cggcatgagg 900
gccagccccg gttcatgcgt ctacagccca agcccctccg gtagcatcgt cacaagcgat 960
tcccaactct tcaacaagcc ctactggctg cacaaagccc aaggccacaa taacggcgtc 1020
tgttggcaca accagctgtt cgtcaccgtc gtggacacaa ccaggtccac aaacctgacc 1080
atctgcgcca gcacccaaag ccccgtgccc ggccagtacg acgccacaaa gttcaaacaa 1140
tactcacgcc acgtcgaaga gtacgacctc caattcatct tccaactctg caccatcacc 1200
ctgaccgccg acgtcatgtc ctacatccac tccatgaact catccatcct ggaagactgg 1260
aatttcggcg tcccaccacc ccccaccacc tccctcgtcg acacctacag gttcgtgcag 1320
agcgtcgcca tcacatgcca gaaagacgcc gcccccgccg agaacaaaga cccatacgac 1380
aaactgaaat tctggaacgt cgacctgaaa gagaaattca gcctggatct ggaccagtac 1440
ccattgggca ggaagttcct cgtccaggcg ggtctctaat 1480
<210> 38
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut1 nt
<400> 38
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 39
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut2 nt
<400> 39
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 40
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut3 nt
<400> 40
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gggcgtggac 1560
gctggcaaat aat 1573
<210> 41
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut4 nt
<400> 41
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccgacagcgt gggcgtggac 1560
gctggcaaat aat 1573
<210> 42
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut5 nt
<400> 42
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggca aaccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 43
<211> 1573
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<220>
<221> exon
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut6 nt
<400> 43
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggca aaccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 44
<211> 23
<212> PRT
<213> HPV 35
<400> 44
Thr Gln Leu Tyr Arg Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Pro Lys Val Glu Gly
20
<210> 45
<211> 23
<212> PRT
<213> HPV 39
<400> 45
Ser Thr Leu Tyr Arg Thr Cys Lys Gln Ser Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Val Asp Lys Val Glu Gly
20
<210> 46
<211> 23
<212> PRT
<213> HPV 51
<400> 46
Thr Gln Leu Tyr Ser Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Val Asn Lys Val Glu Gly
20
<210> 47
<211> 23
<212> PRT
<213> HPV 53
<400> 47
Thr Gln Leu Tyr Gln Thr Cys Lys Gln Ser Gly Thr Cys Pro Glu Asp
1 5 10 15
Val Ile Asn Lys Ile Glu His
20
<210> 48
<211> 23
<212> PRT
<213> HPV 56
<400> 48
Thr Gln Leu Tyr Lys Thr Cys Lys Leu Ser Gly Thr Cys Pro Glu Asp
1 5 10 15
Val Val Asn Lys Ile Glu Gln
20
<210> 49
<211> 23
<212> PRT
<213> HPV 59
<400> 49
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
Asn Lys Val Glu Gly Thr Thr
20
<210> 50
<211> 23
<212> PRT
<213> HPV 68
<400> 50
Ser Thr Leu Tyr Lys Thr Cys Lys Gln Ser Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Asn Lys Val Glu Gly
20
<210> 51
<211> 23
<212> PRT
<213> HPV 82
<400> 51
Thr Gln Leu Tyr Ser Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Pro Lys Val Lys Gly
20
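The eight 23-residue L2 peptides listed above (SEQ ID NO: 44 to 51) are easier to compare in one-letter code. The following is a minimal sketch, not part of the patent: it converts the three-letter notation of the sequence listing to one-letter code and reports the positions at which two peptides of equal length differ. The function and variable names are hypothetical; the two example strings are copied from SEQ ID NO: 44 (HPV 35) and SEQ ID NO: 51 (HPV 82).

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


def differences(a: str, b: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, residue in a, residue in b) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Residues copied from the sequence listing (position rulers omitted).
SEQ_44_HPV35 = ("Thr Gln Leu Tyr Arg Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp "
                "Val Ile Pro Lys Val Glu Gly")
SEQ_51_HPV82 = ("Thr Gln Leu Tyr Ser Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp "
                "Val Ile Pro Lys Val Lys Gly")

if __name__ == "__main__":
    hpv35 = to_one_letter(SEQ_44_HPV35)
    hpv82 = to_one_letter(SEQ_51_HPV82)
    print(hpv35)
    print(hpv82)
    print(differences(hpv35, hpv82))
```

This is a plain position-by-position comparison; it does not align the sequences, so it is only meaningful for peptides of the same length that start at the same residue.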
Claims (12)
1. A human papillomavirus chimeric protein consisting of an HPV type 18 L1 protein or a mutant of the HPV type 18 L1 protein and a polypeptide from an HPV type 59 L2 protein inserted into the surface region of the HPV type 18 L1 protein or the mutant of the HPV type 18 L1 protein, wherein:
the amino acid sequence of the HPV type 18 L1 protein is shown in SEQ ID NO: 1;
the mutant of the HPV type 18 L1 protein is selected from any one of the following:
a mutant in which 32 amino acids are truncated from the C-terminus of the amino acid sequence shown in SEQ ID NO: 1;
a mutant in which amino acids 477, 478, 485, 496, 502 and 506 of the amino acid sequence shown in SEQ ID NO: 1 are substituted with glycine (G), amino acids 486 and 500 are substituted with serine (S), and amino acids 499 and 504 are substituted with aspartic acid (D); and
a mutant in which amino acids 477, 484, 496, 499, 504 and 506 of the amino acid sequence shown in SEQ ID NO: 1 are substituted with glycine (G) and amino acids 485, 500 and 502 are substituted with serine (S);
the polypeptide from the HPV type 59 L2 protein is selected from the polypeptides shown in any one of SEQ ID NO: 4, SEQ ID NO: 5 and SEQ ID NO: 6; and
wherein the amino acid sequence of the human papillomavirus chimeric protein is shown in any one of SEQ ID NO: 8, 10, 16, 18, 30 and 31.
2. A polynucleotide encoding the human papillomavirus chimeric protein of claim 1.
3. The polynucleotide of claim 2, wherein the sequence of the polynucleotide is a whole-gene sequence optimized with E. coli codons or a whole-gene sequence optimized with insect cell codons.
4. The polynucleotide of claim 2 or 3, wherein the sequence of the polynucleotide is as set out in any one of SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 41 and SEQ ID NO: 42.
5. A vector comprising the polynucleotide of claim 4.
6. A cell comprising the vector of claim 5.
7. A multimer that is a chimeric pentamer or chimeric virus-like particle formed from the human papillomavirus chimeric protein of claim 1.
8. Use of the human papillomavirus chimeric protein according to claim 1 or the multimer according to claim 7 in the preparation of a vaccine for preventing papillomavirus infection and/or a disease induced by papillomavirus infection, wherein, for the human papillomavirus chimeric protein shown in SEQ ID NO: 31, the papillomavirus is one or more types selected from the group consisting of: HPV18, HPV39, HPV45, HPV59, HPV68 and HPV70; and for the human papillomavirus chimeric protein shown in any one of SEQ ID NO: 8, 10, 16, 18 and 30, the papillomavirus is one or more types selected from the group consisting of: HPV18, HPV26, HPV33, HPV35, HPV39, HPV45, HPV52, HPV53, HPV59, HPV66, HPV70, HPV73, HPV6, HPV11, HPV2, HPV5 and HPV57.
9. The use according to claim 8, wherein the papillomavirus infection-induced disease is selected from cervical cancer, vaginal cancer, labial cancer, penile cancer, perianal cancer, oropharyngeal cancer, tonsillar cancer, and oral cancer.
10. A vaccine for preventing papillomavirus infection and/or papillomavirus infection-induced disease comprising the human papillomavirus chimeric protein of claim 1 or the multimer of claim 7, and an adjuvant, vaccine excipient, or carrier.
11. The vaccine for preventing papillomavirus infection and/or papillomavirus infection-induced disease according to claim 10, further comprising virus-like particles or chimeric virus-like particles of at least one HPV type of the mucosa-tropic group and/or the skin-tropic group.
12. The vaccine for preventing papillomavirus infection and/or papillomavirus infection-induced disease according to claim 10 or 11, wherein the adjuvant is an adjuvant for human use.
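Claim 1 defines three L1 mutants relative to SEQ ID NO: 1: a 32-residue C-terminal truncation and two sets of point substitutions at 1-based positions. As an illustration only, the sketch below applies those operations to a protein sequence held as a one-letter string. It does not describe how the mutants were actually constructed. The helper names are hypothetical, and `PLACEHOLDER_L1` is an artificial poly-alanine string of arbitrary length standing in for SEQ ID NO: 1, which is not reproduced here.

```python
# Substitution tables transcribed from claim 1 (1-based positions in SEQ ID NO: 1).
MUTANT_2_SUBSTITUTIONS = {
    **dict.fromkeys((477, 478, 485, 496, 502, 506), "G"),  # to glycine
    **dict.fromkeys((486, 500), "S"),                      # to serine
    **dict.fromkeys((499, 504), "D"),                      # to aspartic acid
}
MUTANT_3_SUBSTITUTIONS = {
    **dict.fromkeys((477, 484, 496, 499, 504, 506), "G"),  # to glycine
    **dict.fromkeys((485, 500, 502), "S"),                 # to serine
}


def truncate_c_terminus(seq: str, n: int) -> str:
    """Remove the last n residues of seq."""
    if not 0 <= n < len(seq):
        raise ValueError("n must be non-negative and smaller than the sequence length")
    return seq[: len(seq) - n]


def substitute(seq: str, substitutions: dict[int, str]) -> str:
    """Replace residues at the given 1-based positions."""
    residues = list(seq)
    for position, amino_acid in substitutions.items():
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        residues[position - 1] = amino_acid
    return "".join(residues)


# ASSUMPTION: artificial stand-in, NOT the real HPV18 L1 sequence or its length.
PLACEHOLDER_L1 = "A" * 520

if __name__ == "__main__":
    truncated = truncate_c_terminus(PLACEHOLDER_L1, 32)
    mutant_2 = substitute(PLACEHOLDER_L1, MUTANT_2_SUBSTITUTIONS)
    mutant_3 = substitute(PLACEHOLDER_L1, MUTANT_3_SUBSTITUTIONS)
    print(len(truncated))
    print(mutant_2[470:510])
    print(mutant_3[470:510])
```

To use the sketch with real data, replace `PLACEHOLDER_L1` with the one-letter sequence of SEQ ID NO: 1 from the sequence listing.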
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110002251.3A CN114716560B (en) | 2021-01-04 | 2021-01-04 | Human papilloma virus 18 chimeric protein and application thereof |
EP21913268.5A EP4261232A1 (en) | 2021-01-04 | 2021-09-26 | Human papillomavirus type 18 chimeric protein and use thereof |
PCT/CN2021/120583 WO2022142523A1 (en) | 2021-01-04 | 2021-09-26 | Human papillomavirus type 18 chimeric protein and use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110002251.3A CN114716560B (en) | 2021-01-04 | 2021-01-04 | Human papilloma virus 18 chimeric protein and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114716560A (en) | 2022-07-08 |
CN114716560B (en) | 2024-02-02 |
Family
ID=82233870
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110002251.3A Active CN114716560B (en) | 2021-01-04 | 2021-01-04 | Human papilloma virus 18 chimeric protein and application thereof |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4261232A1 (en) |
CN (1) | CN114716560B (en) |
WO (1) | WO2022142523A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114716560B (en) * | 2021-01-04 | 2024-02-02 | 中国医学科学院基础医学研究所 | Human papilloma virus 18 chimeric protein and application thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102497880A (en) * | 2009-06-25 | 2012-06-13 | 葛兰素史密丝克莱恩生物有限公司 | Novel human papillomavirus (HPV) protein constructs and their use in the prevention of HPV disease |
CN111662389A (en) * | 2020-06-05 | 2020-09-15 | 广州中医药大学(广州中医药研究院) | SARS-CoV-2 fusion protein and vaccine composition thereof |
WO2022142523A1 (en) * | 2021-01-04 | 2022-07-07 | 中国医学科学院基础医学研究所 | Human papillomavirus type 18 chimeric protein and use thereof |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0413510D0 (en) | 2004-06-16 | 2004-07-21 | Glaxosmithkline Biolog Sa | Vaccine |
CN101148661B (en) | 2006-09-18 | 2013-01-02 | 中国医学科学院基础医学研究所 | Human papilloma virus 16 type coat protein virus-like particles, preparation method and use thereof |
CN101293918B (en) | 2007-04-29 | 2013-03-27 | 北京万泰生物药业股份有限公司 | Truncated human papillomavirus type 16 L1 protein |
CN101835796A (en) * | 2007-06-26 | 2010-09-15 | 财团法人日本健康科学振兴财团 | Vaccine antigen that induces cross-reactive neutralizing antibodies against high-risk human papillomavirus |
MX2012010481A (en) * | 2010-03-11 | 2012-10-09 | Rinat Neuroscience Corp | ANTIBODIES WITH pH DEPENDENT ANTIGEN BINDING. |
CN102153656B (en) * | 2011-01-12 | 2014-11-19 | 广州市元通医药科技有限公司 | Vaccine for chimeric virus-like particles and preparation method thereof |
CN104418942A (en) | 2013-08-30 | 2015-03-18 | 长春百克生物科技股份公司 | Truncated L1 proteins of human papilloma virus (HPV), virus-like particles as well as preparation method and application of virus-like particles |
CN104513826B (en) | 2013-09-29 | 2020-10-20 | 上海泽润生物科技有限公司 | Human papilloma virus gene, vector, strain and expression method |
CN107188966B (en) * | 2016-03-15 | 2020-03-31 | 中国医学科学院基础医学研究所 | Papilloma virus chimeric protein and application thereof |
CN107188967B (en) * | 2016-03-15 | 2020-03-31 | 中国医学科学院基础医学研究所 | Papilloma virus chimeric protein and application thereof |
JP2019511531A (en) * | 2016-04-13 | 2019-04-25 | メディミューン,エルエルシー | Use of amino acids as stabilizing compounds in pharmaceutical compositions containing high concentrations of protein based therapeutics |
CN108676057A (en) | 2018-06-19 | 2018-10-19 | 南京肽业生物科技有限公司 | Solid phase peptide synthesis device |
2021
- 2021-01-04: CN application CN202110002251.3A, patent CN114716560B (en), status: Active
- 2021-09-26: EP application EP21913268.5A, publication EP4261232A1 (en), status: Pending
- 2021-09-26: WO application PCT/CN2021/120583, publication WO2022142523A1 (en), status: Application Filing
Non-Patent Citations (4)
Title |
---|
Broad Cross-Protection Is Induced in Preclinical Models by a Human Papillomavirus Vaccine Composed of L1/L2 Chimeric Virus-Like Particles; Mathieu Boxus et al.; Journal of Virology; Vol. 90 (No. 14); 6315-6325 * |
Chen, Z. et al. L2 [human papillomavirus 59], AGU90687.1. GenBank. 2013, FEATURES. * |
van der Weele, P. et al. L1 [human papillomavirus 18] - Protein, ATL15086.1. GenBank. 2017, FEATURES. * |
Chen Wantao (ed.). Clinical Oral Immunology. Shanghai Jiao Tong University Press, 2010, Vol. 1 (1st ed.), 308. * |
Also Published As
Publication number | Publication date |
---|---|
WO2022142523A1 (en) | 2022-07-07 |
EP4261232A1 (en) | 2023-10-18 |
CN114716560A (en) | 2022-07-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10882887B2 (en) | Papillomavirus chimeric protein and application thereof | |
CN107188967B (en) | Papilloma virus chimeric protein and application thereof | |
US7976848B2 (en) | Optimized expression of HPV 58 L1 in yeast | |
Tyler et al. | Immunization with a consensus epitope from human papillomavirus L2 induces antibodies that are broadly neutralizing | |
US10940194B2 (en) | Mutant of L1 protein of human papillomavirus type 58 | |
Huber et al. | A chimeric 18L1-45RG1 virus-like particle vaccine cross-protects against oncogenic alpha-7 human papillomavirus types | |
WO2022142525A1 (en) | Human papillomavirus type 58 chimeric protein and use thereof | |
US20120087936A1 (en) | Therapeutic and prophylactic vaccine for the treatment and prevention of papillomavirus infection | |
JP2010263899A (en) | Chimeric human papillomavirus 16l1 protein including l2 peptide, virus-like particle prepared therefrom, and method for preparing the particle | |
WO2022111021A1 (en) | C-terminally modified human papillomavirus type 11 l1 protein and use thereof | |
CN114716560B (en) | Human papilloma virus 18 chimeric protein and application thereof | |
Chen et al. | Human papillomavirus 16L1-58L2 chimeric virus-like particles elicit durable neutralizing antibody responses against a broad-spectrum of human papillomavirus types | |
Zhang et al. | A rationally designed flagellin-L2 fusion protein induced serum and mucosal neutralizing antibodies against multiple HPV types | |
EP4273174A1 (en) | Human papillomavirus type 31 chimeric protein and use thereof | |
WO2022111020A1 (en) | C-terminus modified human papillomavirus type 6 l1 protein and use thereof | |
Bian et al. | Human papillomavirus type 16 L1E7 chimeric capsomeres have prophylactic and therapeutic efficacy against papillomavirus in mice | |
US20240002447A1 (en) | Modified human papillomavirus type 52 l1 protein and use thereof | |
US8715681B2 (en) | Minimal motifs of linear B-cell epitopes in L1 protein from human papillomavirus type 58 and their applications | |
US10329328B2 (en) | HPV-related fusion protein and applications thereof | |
Kwak | Development of prophylactic human papillomavirus vaccines |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||