CN114716560A - 一种人乳头瘤病毒18型嵌合蛋白及其用途 - Google Patents
一种人乳头瘤病毒18型嵌合蛋白及其用途 Download PDFInfo
- Publication number
- CN114716560A CN114716560A CN202110002251.3A CN202110002251A CN114716560A CN 114716560 A CN114716560 A CN 114716560A CN 202110002251 A CN202110002251 A CN 202110002251A CN 114716560 A CN114716560 A CN 114716560A
- Authority
- CN
- China
- Prior art keywords
- ser
- val
- leu
- thr
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 94
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 91
- 241000388169 Alphapapillomavirus 7 Species 0.000 title abstract description 4
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 229
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 213
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 118
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 78
- 229920001184 polypeptide Polymers 0.000 claims abstract description 69
- 241001631646 Papillomaviridae Species 0.000 claims abstract description 25
- 235000018102 proteins Nutrition 0.000 claims description 206
- 150000001413 amino acids Chemical group 0.000 claims description 160
- 235000001014 amino acid Nutrition 0.000 claims description 150
- 241000701806 Human papillomavirus Species 0.000 claims description 81
- 210000004027 cell Anatomy 0.000 claims description 51
- 108020004705 Codon Proteins 0.000 claims description 42
- 241000238631 Hexapoda Species 0.000 claims description 40
- 238000003780 insertion Methods 0.000 claims description 40
- 230000037431 insertion Effects 0.000 claims description 40
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 36
- 239000002157 polynucleotide Substances 0.000 claims description 35
- 108091033319 polynucleotide Proteins 0.000 claims description 34
- 102000040430 polynucleotide Human genes 0.000 claims description 34
- 241000588724 Escherichia coli Species 0.000 claims description 31
- 239000004471 Glycine Substances 0.000 claims description 27
- 229960005486 vaccine Drugs 0.000 claims description 27
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 26
- 208000009608 Papillomavirus Infections Diseases 0.000 claims description 19
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 18
- 239000002245 particle Substances 0.000 claims description 18
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims description 18
- 238000006467 substitution reaction Methods 0.000 claims description 17
- 239000002671 adjuvant Substances 0.000 claims description 15
- 206010008342 Cervix carcinoma Diseases 0.000 claims description 11
- 241000341655 Human papillomavirus type 16 Species 0.000 claims description 11
- 206010028980 Neoplasm Diseases 0.000 claims description 11
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 claims description 11
- 201000010881 cervical cancer Diseases 0.000 claims description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 11
- 201000010099 disease Diseases 0.000 claims description 10
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 9
- 201000011510 cancer Diseases 0.000 claims description 9
- 235000003704 aspartic acid Nutrition 0.000 claims description 8
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 8
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 8
- 210000004899 c-terminal region Anatomy 0.000 claims description 8
- 208000015181 infectious disease Diseases 0.000 claims description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N 1-(2-azaniumylacetyl)pyrrolidine-2-carboxylate Chemical compound NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 claims description 7
- 239000013598 vector Substances 0.000 claims description 6
- 238000002360 preparation method Methods 0.000 claims description 5
- 230000002265 prevention Effects 0.000 claims description 5
- 208000032271 Malignant tumor of penis Diseases 0.000 claims description 4
- 208000002471 Penile Neoplasms Diseases 0.000 claims description 4
- 206010034299 Penile cancer Diseases 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 4
- 229920000642 polymer Polymers 0.000 claims description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 3
- 208000003445 Mouth Neoplasms Diseases 0.000 claims description 3
- 206010031096 Oropharyngeal cancer Diseases 0.000 claims description 3
- 206010057444 Oropharyngeal neoplasm Diseases 0.000 claims description 3
- 208000006842 Tonsillar Neoplasms Diseases 0.000 claims description 3
- 208000012987 lip and oral cavity carcinoma Diseases 0.000 claims description 3
- 201000006958 oropharynx cancer Diseases 0.000 claims description 3
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 3
- 206010046885 vaginal cancer Diseases 0.000 claims description 3
- 208000013139 vaginal neoplasm Diseases 0.000 claims description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 2
- 235000004279 alanine Nutrition 0.000 claims description 2
- 125000000539 amino acid group Chemical group 0.000 claims description 2
- 239000004973 liquid crystal related substance Substances 0.000 claims 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 79
- 108010077245 asparaginyl-proline Proteins 0.000 description 55
- 108010047495 alanylglycine Proteins 0.000 description 54
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 54
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 52
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 48
- 108010068265 aspartyltyrosine Proteins 0.000 description 44
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 31
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 29
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 29
- 108010037850 glycylvaline Proteins 0.000 description 29
- 108010092114 histidylphenylalanine Proteins 0.000 description 28
- 108010057821 leucylproline Proteins 0.000 description 28
- 108010064235 lysylglycine Proteins 0.000 description 28
- 230000003472 neutralizing effect Effects 0.000 description 28
- 108010026333 seryl-proline Proteins 0.000 description 28
- 238000003786 synthesis reaction Methods 0.000 description 28
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 27
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 27
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 27
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 27
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 27
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 27
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 27
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 27
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 27
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 27
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 27
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 27
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 27
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 27
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 27
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 27
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 27
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 27
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 27
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 27
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 27
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 27
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 27
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 27
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 27
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 27
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 27
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 27
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 27
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 27
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 27
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 27
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 27
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 27
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 27
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 27
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 27
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 27
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 27
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 27
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 27
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 27
- 230000015572 biosynthetic process Effects 0.000 description 27
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 27
- 108010053037 kyotorphin Proteins 0.000 description 27
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 27
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 27
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 27
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 27
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 26
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 25
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 25
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 25
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 25
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 25
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 25
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 25
- 108010060199 cysteinylproline Proteins 0.000 description 25
- 230000014509 gene expression Effects 0.000 description 24
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 23
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 23
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 23
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 23
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 23
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 23
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 23
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 23
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 23
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 23
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 23
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 23
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 23
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 23
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 23
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 23
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 23
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 23
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 23
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 23
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 23
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 23
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 23
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 23
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 23
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 23
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 23
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 23
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 23
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 23
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 23
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 23
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 23
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 23
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 23
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 23
- 108010078144 glutaminyl-glycine Proteins 0.000 description 23
- 108010012058 leucyltyrosine Proteins 0.000 description 23
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 22
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 22
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 22
- 108010054813 diprotin B Proteins 0.000 description 22
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 22
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 21
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 21
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 21
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 21
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 21
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 21
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 21
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 21
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 21
- ZGKXAUIVGIBISK-SZMVWBNQSA-N Glu-His-Trp Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O ZGKXAUIVGIBISK-SZMVWBNQSA-N 0.000 description 21
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 21
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 21
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 21
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 21
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 21
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 21
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 21
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 21
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 21
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 21
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 21
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 21
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 21
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 21
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 21
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 21
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 21
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 21
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 21
- 108010044940 alanylglutamine Proteins 0.000 description 21
- 108010018006 histidylserine Proteins 0.000 description 21
- 108010003700 lysyl aspartic acid Proteins 0.000 description 21
- 108010050848 glycylleucine Proteins 0.000 description 20
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 19
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 19
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 19
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 19
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 19
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 19
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 19
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 19
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 19
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 19
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 19
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 19
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 19
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 19
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 19
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 19
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 19
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 19
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 19
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 19
- 108010062796 arginyllysine Proteins 0.000 description 19
- 238000005457 optimization Methods 0.000 description 18
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 17
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 17
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 17
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 17
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 17
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 17
- 108700042752 tyrosyl-prolyl-leucyl-glycine Proteins 0.000 description 16
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 15
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 15
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 15
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 15
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 15
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 15
- 238000013461 design Methods 0.000 description 15
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 15
- 238000000034 method Methods 0.000 description 15
- 108010079317 prolyl-tyrosine Proteins 0.000 description 15
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 14
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 14
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 14
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 14
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 13
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 13
- 108010087924 alanylproline Proteins 0.000 description 13
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 12
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 12
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 11
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 11
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 11
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 11
- 239000002773 nucleotide Substances 0.000 description 11
- 125000003729 nucleotide group Chemical group 0.000 description 11
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 10
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 9
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 9
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 9
- 241001112090 Pseudovirus Species 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 108010020532 tyrosyl-proline Proteins 0.000 description 9
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 8
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 8
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 8
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 230000003053 immunization Effects 0.000 description 8
- 238000002649 immunization Methods 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 210000002966 serum Anatomy 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 7
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 238000006386 neutralization reaction Methods 0.000 description 7
- 231100000590 oncogenic Toxicity 0.000 description 7
- 230000002246 oncogenic effect Effects 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- WPXFILQZNKUYQO-BZSNNMDCSA-N 2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 WPXFILQZNKUYQO-BZSNNMDCSA-N 0.000 description 6
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 6
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 6
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 6
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 6
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 6
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 6
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 6
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 6
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 6
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 6
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 6
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 6
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 6
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 6
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 6
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 6
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 6
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 6
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 6
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 6
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 6
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 6
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 6
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 6
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 6
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 6
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 6
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 5
- 208000022361 Human papillomavirus infectious disease Diseases 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 5
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 4
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 4
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 4
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 4
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 4
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 4
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 4
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 4
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 4
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 4
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 4
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 4
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 4
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 4
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 4
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 4
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 4
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 4
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 4
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 4
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 4
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 4
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 4
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 4
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 4
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 4
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 4
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 4
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 4
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 4
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 4
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 4
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- 102000007079 Peptide Fragments Human genes 0.000 description 4
- 108010033276 Peptide Fragments Proteins 0.000 description 4
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 4
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 4
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 4
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 4
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 4
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 4
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 4
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 4
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 4
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 4
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 4
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 4
- DBMMKEHYWIZTPN-JYJNAYRXSA-N Val-Cys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N DBMMKEHYWIZTPN-JYJNAYRXSA-N 0.000 description 4
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 230000005965 immune activity Effects 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000001681 protective effect Effects 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010029384 tryptophyl-histidine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 238000002525 ultrasonication Methods 0.000 description 4
- 241000701447 unidentified baculovirus Species 0.000 description 4
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 3
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 3
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 3
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 3
- 101000641175 Human papillomavirus type 18 Major capsid protein L1 Proteins 0.000 description 3
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 3
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000002296 dynamic light scattering Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- CMBDUPIBCOEWNE-BJDJZHNGSA-N Asp-Leu-Asp-Gln Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CMBDUPIBCOEWNE-BJDJZHNGSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 2
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 2
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 2
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 102100036407 Thioredoxin Human genes 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 2
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 206010047741 Vulval cancer Diseases 0.000 description 2
- 208000004354 Vulvar Neoplasms Diseases 0.000 description 2
- 238000005571 anion exchange chromatography Methods 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000005277 cation exchange chromatography Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 229910052802 copper Inorganic materials 0.000 description 2
- 239000010949 copper Substances 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 239000002552 dosage form Substances 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 206010020718 hyperplasia Diseases 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 210000004201 immune sera Anatomy 0.000 description 2
- 229940042743 immune sera Drugs 0.000 description 2
- 230000001024 immunotherapeutic effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 230000036210 malignancy Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000003921 particle size analysis Methods 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 2
- 229920000053 polysorbate 80 Polymers 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 2
- 229940023143 protein vaccine Drugs 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000036555 skin type Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 108060008226 thioredoxin Proteins 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 238000004627 transmission electron microscopy Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 201000005102 vulva cancer Diseases 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- -1 Amino groups Chemical group 0.000 description 1
- 206010059313 Anogenital warts Diseases 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- 238000011725 BALB/c mouse Methods 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 206010004146 Basal cell carcinoma Diseases 0.000 description 1
- 108010071134 CRM197 (non-toxic variant of diphtheria toxin) Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710169873 Capsid protein G8P Proteins 0.000 description 1
- 229940124957 Cervarix Drugs 0.000 description 1
- 108010049048 Cholera Toxin Proteins 0.000 description 1
- 102000009016 Cholera Toxin Human genes 0.000 description 1
- 208000037845 Cutaneous squamous cell carcinoma Diseases 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 102000016736 Cyclin Human genes 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 229940124897 Gardasil Drugs 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- 101150075239 L1 gene Proteins 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 101710156564 Major tail protein Gp23 Proteins 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 101710163801 Minor capsid protein L2 Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- UEEJHVSXFDXPFK-UHFFFAOYSA-N N-dimethylaminoethanol Chemical compound CN(C)CCO UEEJHVSXFDXPFK-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- 206010044002 Tonsil cancer Diseases 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- 208000000260 Warts Diseases 0.000 description 1
- ABUBSBSOTTXVPV-UHFFFAOYSA-H [U+6].CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O Chemical compound [U+6].CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O.CC([O-])=O ABUBSBSOTTXVPV-UHFFFAOYSA-H 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004520 agglutination Effects 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 125000000129 anionic group Chemical group 0.000 description 1
- 239000003945 anionic surfactant Substances 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 229940031416 bivalent vaccine Drugs 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000003093 cationic surfactant Substances 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229940102767 gardasil 9 Drugs 0.000 description 1
- 210000004392 genitalia Anatomy 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 238000011587 new zealand white rabbit Methods 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 229960002566 papillomavirus vaccine Drugs 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 238000003118 sandwich ELISA Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 201000010153 skin papilloma Diseases 0.000 description 1
- 201000010106 skin squamous cell carcinoma Diseases 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940031351 tetravalent vaccine Drugs 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 210000000689 upper leg Anatomy 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
- C12N15/866—Baculoviral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0601—Invertebrate cells or tissues, e.g. insect cells; Culture media therefor
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5258—Virus-like particles
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/545—Medicinal preparations containing antigens or antibodies characterised by the dose, timing or administration schedule
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/555—Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
- A61K2039/55505—Inorganic adjuvants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/58—Medicinal preparations containing antigens or antibodies raising an immune response against a target which is not the antigen used for immunisation
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/70—Multivalent vaccine
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/735—Fusion polypeptide containing domain for protein-protein interaction containing a domain for self-assembly, e.g. a viral coat protein (includes phage display)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2510/00—Genetically modified cells
- C12N2510/02—Cells for production
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/14011—Baculoviridae
- C12N2710/14111—Nucleopolyhedrovirus, e.g. autographa californica nucleopolyhedrovirus
- C12N2710/14141—Use of virus, viral particle or viral elements as a vector
- C12N2710/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20023—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20051—Methods of production or purification of viral material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20051—Methods of production or purification of viral material
- C12N2710/20052—Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/103—Plasmid DNA for invertebrates
- C12N2800/105—Plasmid DNA for invertebrates for insects
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Virology (AREA)
- Molecular Biology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medicinal Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Physics & Mathematics (AREA)
- General Chemical & Material Sciences (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Gastroenterology & Hepatology (AREA)
- Immunology (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Cell Biology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
本发明涉及一种人乳头瘤病毒18型嵌合蛋白及其用途。具体地,本发明涉及一种乳头瘤病毒嵌合蛋白,其包含HPV18型L1蛋白或HPV18型L1蛋白的突变体以及插入所述HPV18型L1蛋白或HPV18型L1蛋白的突变体的表面区的来自HPV59型L2蛋白的多肽、或由其组成,其中所述HPV18型L1蛋白的氨基酸序列如SEQ ID NO.1所示,所述HPV59型L2蛋白的氨基酸序列如SEQ ID NO.2所示。
Description
技术领域
本发明涉及生物技术领域,具体涉及一种人乳头瘤病毒嵌合蛋白,及由其形成的五聚体或病毒样颗粒,以及人乳头瘤病毒嵌合蛋白、人乳头瘤病毒嵌合五聚体或人乳头瘤病毒嵌合病毒样颗粒在制备预防乳头瘤病毒感染及感染诱发的疾病的疫苗中的用途。
背景技术
人乳头瘤病毒(human papillomavirus,HPV)是一类感染上皮组织的无包膜小DNA病毒。根据人乳头瘤病毒的主要外壳蛋白L1氨基酸的同源性,已鉴定了200 多型的该病毒,分为α、β、γ、μ、η属。根据感染部位不同又分为黏膜型及皮肤型。黏膜型HPV主要感染泌尿生殖器、肛周及口咽部的粘膜皮肤,均为α属,并进一步分为有转化活性的致癌型(oncogenic HPV)及诱发良性增生的低危型 (low-risk HPV,LR-HPV)。致癌型HPV包括12种常见的高危型(包括HPV16、 -18、-31、-33、-35、-39、-45、-51、-52、-56、-58、-59型等),1种可能的高危型(HPV68),及10余种十分少见的可疑高危型(HPV26、-30、-34、-53、-66、 -67、-69、-70、-73、-82、-85型等)。研究发现,所有致癌型HPV阳性癌组织均呈现特异性E6*I mRNA表达、抑癌基因Rb/P53及细胞周期蛋白CD1的表达降低及p16INK 4a表达增高,表明感染任何一种致癌型HPV罹患癌症的风险是一样的。低危型HPV有约12种(HPV6、-7、-11、-13、-32、-40、-42、-43、-44、 -54、-74、-91型等),其中HPV6、-11型合计诱发90%的肛周生殖器尖锐湿疣及绝大多数呼吸道复发性乳头瘤。皮肤型HPV主要感染上述部位之外的皮肤组织,其中一些型别(HPV2、-27、-57)诱发皮肤疣状增生,另一些型别(HPV5、-8、 -38等)与皮肤鳞状细胞癌及基底细胞癌的发生相关。
致癌型HPV感染相关的恶性肿瘤目前已确定的有:宫颈癌、阴道癌、阴唇癌、阴茎癌、肛门肛周癌、口咽癌、扁桃体癌及口腔癌,其中以宫颈癌的危害最大。宫颈癌是世界范围第三高发的妇女恶性肿瘤,年发病率约52.7万,其中亚洲地区28.5万;中国的年发病数7.5万。12种常见高危型HPV累计诱发 95.2%-96.5%的宫颈癌,其余10多种少见的可能及可疑的高危型累计诱发约 3.29%的宫颈癌。HPV16型是全球范围内的优势流行高危型,在HPV相关的肿瘤如宫颈癌、肛周癌、阴茎癌、外阴癌等及癌前病变中的检出率最高。HPV16及 HPV18在世界范围内宫颈癌中的检出率分别达50-60%及~20%。12种常见高危型HPV累计诱发95.2%-96.5%的宫颈癌,其余10多种少见的可能及可疑的高危型累计诱发约3.29%的宫颈癌。
HPV L1病毒样颗粒(L1 virus-like particle,L1 VLP)主要诱发型别特异性中和抗体和保护反应,只能通过增加L1 VLP的型别来扩大疫苗的保护范围。上市的3 种HPV疫苗均为L1 VLP疫苗,分别是GSK的二价苗(Cervarix,HPV16/-18), Merck的四价苗(Gardasil,HPV6/-11/-16/-18)和九价苗(Gardasil-9, HPV6/-11/-16/-18/-31/-33/-45/-52/-58),其中保护范围最宽的九价苗才只涵盖有限的7种高危型、2种低危型(HPV6/-11),而且不能预防皮肤型。另外L1 VLP疫苗不能通过无限制的增加L1 VLP的型别来扩大保护范围,因此L1 VLP疫苗难以满足HPV感染相关疾病的预防要求。
HPV的次要衣壳蛋白L2在天然状态下没有免疫活性,但L2的N端多肽可诱发交叉中和抗体及交叉保护反应,只是免疫原性弱,诱发抗体的滴度低,而且单型L2抗血清的交叉中和型别有限。目前仅在16L2N中发现了多种可诱发中和抗体的保守表位肽,其中aa.17-38为其主要中和表位区,识别该区域的单抗RG-1交叉中和的型别最多,因此该区域又称RG-1表位肽,其中aa.21-31为其中和表位的核心序列,RG-1表位肽的相关研究,无论序列的长短,均保留aa.21-31的同源区。
已经报道的用于疫苗研究的RG-1型别有HPV4型RG-1、HPV6型RG-1、 HPV16型RG-1、HPV17型RG-1、HPV31型RG-1、HPV33型RG-1、HPV45型 RG-1、HPV51型RG-1、HPV58型RG-1等[C.Schellenbacher et al.,The Journal of investigative dermatology 2013,133(12):2706-13;H.Seitz et al.,Vaccine 2014, 32(22):2610-2617;B.Huber et al.,PLoS One 2015,10(3):e0120152;B.Huber et al., PLoS One 2017,12(1):e0169533;X.Chen et al.,Oncotarget 2017,8(38):63333-63344; X.Chen et al.,HumanVaccines&Immunotherapeutics 2018,14(8):2025-2033; PCT/CN2017/075402],采用的方式包括VLP表面展示、细菌蛋白表面展示(细菌硫氧还蛋白Trx、鞭毛蛋白、霍乱毒素突变体CRM197)、靶向IgγR改造抗体及含RG-1表位的多型L2多肽串联融合。但研究表明,多种RG-1表位肽相关疫苗的活性结果较差,如表面展示HPV4型RG-1、HPV6型RG-1、HPV17型RG-1的 3种16型cVLP诱发产生的HPV16的中和抗体滴度很低,交叉中和滴度未能检测到[B.Huber etal.,PLoS One 2017,12(1):e0169533;X.Chen et al.,Oncotarget 2017, 8(38):63333-63344];展示HPV45型RG-1的18cVLP诱发产生的HPV18的中和抗体滴度很低(仅为18型L1VLP的1/100),而且仅交叉中和致癌型HPV45、70 及39,滴度很低,最高的仅为100[B.Huber etal.,PLoS One 2015, 10(3):e0120152];表面展示51型RG1的Trx融合蛋白抗血清的交叉范围窄,交叉中和抗体滴度最高的仅为500[H.Seitz et al.,Vaccine 2014,32(22):2610-2617]。相反,Schellenbacher等人报道的16型RG1-cVLP及本发明人之前报道的31型RG1-cVLP、33型RG1-cVLP及58型RG1-cVLP的免疫活性较好,骨架型别VLP 诱发的HPV16中和抗体滴度高达105(与16型L1 VLP诱发的相当),相应RG-1 表位诱发的L2依赖的交叉中和抗体中和范围广、滴度相对较高(最高的可达 6400)[C.Schellenbacher et al.,The Journal ofinvestigative dermatology 2013, 133(12):2706-13;X.Chen et al.,Oncotarget2017,8(38):63333-63344;X.Chen et al.,Human Vaccines&Immunotherapeutics 2018,14(8):2025-2033; PCT/CN2017/075402]。
上述数据提示,不同型别HPV来源的RG-1表位肽的免疫原性存在非常大的差异。本发明人在之前的文献中比较了58型RG-1及6型RG-1的免疫原性,发现58型RG-1表位肽抗血清交叉中和的型别多(13个型别)、滴度也较高(最高的达3200),而6型RG-1表位肽抗血清的中和型别较少(9个型别)、滴度很低 (最高的仅为100)[X.Chen et al.,Oncotarget 2017,8(38):63333-63344]。表明尽管 RG-1表位肽区在不同型别间具有较强的保守性,但不同型别的RG-1的免疫原性存在差异,因此任选1种L2 aa.17-36同源多肽,构建嵌合蛋白疫苗,其免疫活性是无法预测的。
另一方面,Schellenbacher和Wang的报道的HPV16 cVLP疫苗研究显示,同是将16型RG-1表位肽插入16型L1 VLP载体的表面区,由于16型RG-1核心表位肽序列的旁侧序列及插入位点和插入方式的差异,获得的多种不同的16型 RG1-cVLP的免疫活性具有显著差异,其中最好的是在16型L1的DE环区插入 16型RG-1的cVLP,最差的是在16型L1的h4区插入16型RG-1核心序列的 cVLP。另外,Chen和Boxus均报道了33型RG-1的cVLP,但采用的载体不同,分别是HPV16 L1 VLP及18L1 VLP,虽然两篇报道均选择了DE环作为插入位点,但是插入区相差1个氨基酸,表位肽长度相差2个氨基酸,获得的两种33型 RG1-cVLP诱发产生的33型RG-1依赖的交叉中和抗体的活性差异十分显著,33 型RG1-cVLP抗血清可交叉中和至少12种型别(其中2种型别的滴度>1000),而33型RG1-18cVLP抗血清仅交叉中和7种型别,其中6种型别的中和滴度(其中4种型别的滴度均<100)均远较33RG1-16cVLP抗血清的低。
因而,目前需要开发一种能够针对更多HPV型别的病毒产生高滴度中和抗体的基于HPV L1与HPV L2嵌合蛋白的疫苗。
发明内容
为解决上述技术问题,本发明人选用了多种不同长度的59型RG-1表位肽,用于HPVcVLP的研究,结果显示,本发明获得的HPV18型cVLP的免疫原性很强,其诱发的血清中和抗体可高滴度中和α7亚属的多个型别的HPV。
本发明人在实施例1中对8个不同型别RG-1表位免疫血清的中和活性比较分析,发明人意外的发现59型RG-1表位肽的免疫血清可交叉中和至少17种型别,特别是中和HPV45、59及16型的滴度均在103以上,是目前报道的中和α7属高危型HPV型别活性最好的,且同时对HPV16的交叉中和活性又可与目前报道的免疫原性较强的16RG-1相当。
有鉴于此,本发明的目的在于提供一种人乳头瘤病毒嵌合蛋白,用于制备预防乳头瘤病毒感染及感染诱发的疾病的疫苗。
本发明基于本发明人以下的意想不到的发现:在全长或截短型HPV18型L1 蛋白的表面区插入HPV59型L2蛋白多肽,可提高HPV59型L2蛋白多肽的免疫原性,获得的嵌合蛋白在大肠杆菌或昆虫细胞表达系统中可高水平表达,该嵌合蛋白可组装成VLP,并可诱发针对来自不同属/亚属的多种型别HPV的广谱保护性免疫反应。在本文实施例中提供了相关的实验结果。
基于上述目的,本发明一方面提供了一种人乳头瘤病毒嵌合蛋白,其骨架是HPV18 L1蛋白或HPV18 L1蛋白的突变体,所述的骨架上嵌合至少一个来自 HPV59型L2蛋白的多肽。
即,在本发明的第一方面中,本发明提供了一种人乳头瘤病毒嵌合蛋白,其包含HPV18型L1蛋白或HPV18型L1蛋白的突变体以及插入所述HPV18型L1 蛋白或HPV18型L1蛋白的突变体的表面区的来自HPV59型L2蛋白的多肽、或由其组成,其中所述HPV18型L1蛋白的氨基酸序列如SEQ ID NO.1所示,所述 HPV59型L2蛋白的氨基酸序列如SEQ ID NO.2所示。
在根据本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述的HPV18 型L1蛋白的突变体是在所述的HPV18型L1蛋白的N端截短0-8个氨基酸和/或C 端截短0-32个氨基酸的所获得的蛋白。
在根据本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述HPV18型 L1蛋白的突变体选自:
将SEQ ID No.1所示的氨基酸序列的C端截短32个氨基酸的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、504、 506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体 (mut1);
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、499、504、 506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体 (mut2);
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、502、 506置换为甘氨酸(G)、将氨基酸485、500置换为丝氨酸(S)且将氨基酸504 置换为天冬氨酸(D)的突变体(mut3);
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、502、506置换为甘氨酸(G)、将氨基酸486、500置换为丝氨酸(S)且将氨基酸499、504 置换为天冬氨酸(D)的突变体(mut4);
将SEQ ID No.1所示氨基酸序列的氨基酸477、484、496、499、504、506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体(mut5);和
将SEQ ID No.1所示氨基酸序列的氨基酸477、485、496、499、504、506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体(mut6)。
在根据本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述来自 HPV59型L2蛋白的多肽选自SEQ ID No.2所示氨基酸的1-50区域内的任意连续 8-33个氨基酸的片段;优选地,所述来自HPV59型L2蛋白的多肽为HPV型59 型L2蛋白RG-1表位肽或其突变体表位肽;进一步优选地,所述来自HPV59型 L2蛋白的多肽由选自以下的氨基酸序列组成:SEQID No.2所示的氨基酸17至32、 SEQ ID No.2所示的氨基酸16至35、SEQ ID No.2所示的氨基酸17至37、SEQ ID No.2所示的氨基酸16至37、以及在上述氨基酸序列的N端和/或C端延长或截短 1至7个氨基酸的序列。
最优选地,所述来自HPV59型L2蛋白的多肽的氨基酸序列如SEQ ID No.3、 SEQ IDNo.4、SEQ ID No.5或SEQ ID No.6所示。
可选地,所述来自HPV59型L2蛋白的多肽是在SEQ ID No.3所示的氨基酸序列的N端延长或截短1-7个氨基酸和/或C端延长或截短1-7个氨基酸所获得的多肽。
可选地,所述来自HPV59型L2蛋白的多肽还可以是与SEQ ID No.3、SEQ ID No.4、SEQ ID No.5或SEQ ID No.6所示的氨基酸序列具有大于60%、优选大于 70%、优选大于80%、大于90%、甚至更优选大于95%序列同一性的多肽。
在根据本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述HPV18型 L1蛋白可来自,例如但不限于NCBI数据库中的ATL15214.1、ATL14646.1、 ARS43458.1、ARS43428.1、ARS43449.1、AGU90430.1等来自HPV18变异株的 L1蛋白。优选地,所述的HPV18型L1蛋白的氨基酸序列如SEQ ID No.1所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述来自HPV59型 L2蛋白的多肽插入HPV18型L1蛋白或HPV18型L1蛋白的突变体的表面区;优选地,插入所述的HPV18型L1蛋白或HPV18型L1蛋白的突变体的DE环或h4 区;更优选地,所述来自HPV59型L2蛋白的多肽通过直接插入的方式插入所述的HPV18型L1蛋白或所述HPV18型L1蛋白的突变体的氨基酸134和氨基酸135 之间、或氨基酸137和138之间、或氨基酸432和433之间、或氨基酸434和435 之间,或者通过非等长置换的方式插入所述的HPV18型L1蛋白或所述HPV18型 L1蛋白的突变体的氨基酸121至124区域、或氨基酸131至138区域、或氨基酸 431-433区域、或氨基酸432-435区域。
如本文所用,术语“直接插入”是指在相邻两个氨基酸之间插入所选择的肽片段。例如,在SEQ ID NO.1的氨基酸134和氨基酸135之间的直接插入指的是将所选择的肽片段直接插入到SEQ ID NO.1的氨基酸134和氨基酸135之间。
如本文所用,术语“非等长置换”指的是在删除指定氨基酸区间的序列后,将所选的肽片段插入到指定的氨基酸区间。例如,在SEQ ID NO.1的氨基酸121 至124区域的非等长置换指的是,删除SEQ ID NO.1的氨基酸122-123之后,将所选择的肽片段插入到SEQ IDNO.1的氨基酸氨基酸121至124之间。可选地,在所述直接插入或非等长置换的方式中,所述来自HPV59型L2蛋白的多肽在其 N端和/或C端包含1至3个氨基酸残基长的连接子。
可选地,所述的连接子由选自甘氨酸(G)、丝氨酸(S)、丙氨酸(A)及脯氨酸(P)的氨基酸任意组合构成。优选地,N端选用G(甘氨酸)P(脯氨酸) 连接子,C端选用P(脯氨酸)连接子。
在本发明的乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是SEQ ID No.4或SEQ ID No.5,插入位点为所述的HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18 型L1蛋白的突变体的氨基酸137和氨基酸138之间,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.7、SEQ IDNo.8、SEQ ID No.9或SEQ ID No.10所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是SEQ ID No.3,插入位点为所述的HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸432和433之间或氨基酸434和435之间,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.11、SEQ IDNo.12、SEQ ID No.13或SEQ ID No.14所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是N端含有GP连接子且C端含有P连接子的SEQ ID No.4或SEQ ID No.6所示序列,插入位点为所述的HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸134和氨基酸135之间,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.15、SEQ ID No.16、SEQ ID No.17或SEQ ID No.18所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述非等长置换的方式中,删除所述HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型 L1蛋白的突变体的氨基酸132-137区域后,在HPV18型L1蛋白或C端截短32 个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸131及138之间插入来自 HPV59型L2蛋白的多肽,所述来自HPV59型L2蛋白的多肽其N端增加了甘氨酸-脯氨酸连接子,所述来自HPV59型L2蛋白的多肽的氨基酸序列如SEQ ID No.3所示,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.19、或SEQ IDNo.20所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述非等长置换的方式中,删除所述HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型 L1蛋白的突变体的氨基酸122-123区域后,在HPV18型L1蛋白或C端截短32 个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸121及124之间插入来自 HPV59型L2蛋白的多肽,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是SEQ ID No.3所示,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.21或SEQ ID No.22所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述非等长置换的方式中,删除所述HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型 L1蛋白的突变体的氨基酸432后,在HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸431及433之间插入来自HPV59型 L2蛋白的多肽,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是SEQ ID No.3所示,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.23或SEQ ID No.24所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,在所述非等长置换的方式中,删除所述HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型 L1蛋白的突变体的氨基酸433-434后,在HPV18型L1蛋白或C端截短32个氨基酸的所述HPV18型L1蛋白的突变体的氨基酸432及435之间插入来自HPV59 型L2蛋白的多肽,在所述直接插入的方式中,所述来自HPV59型L2蛋白的多肽的氨基酸序列是SEQ ID No.3所示,获得的乳头瘤病毒嵌合蛋白氨基酸序列如 SEQ ID No.25或SEQ ID No.26所示。
在本发明的人乳头瘤病毒嵌合蛋白的优选的实施方案中,所述SEQ ID No.4 所示的多肽通过直接插入的方式嵌合于所述HPV18型L1蛋白突变体的氨基酸137 和138之间,所述HPV18型L1蛋白突变体选自:
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、504、 506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.27所示(mut1);或者,
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、499、504、 506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.28所示(mut2);或者,
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、502、 506置换为甘氨酸(G)、将氨基酸485、500置换为丝氨酸(S)且将氨基酸504 置换为天冬氨酸(D)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.29所示(mut3);或者,
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、502、506置换为甘氨酸(G)、将氨基酸486、500置换为丝氨酸(S)且将氨基酸499、504 置换为天冬氨酸(D)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.30所示(mut4);或者,
将SEQ ID No.1所示氨基酸序列的氨基酸477、484、496、499、504、506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.31所示(mut5);或者,
将SEQ ID No.1所示氨基酸序列的氨基酸477、485、496、499、504、506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体,获得的乳头瘤病毒嵌合蛋白氨基酸序列如SEQ ID No.32所示(mut6)。
本发明的另一方面涉及编码上述的乳头瘤病毒嵌合蛋白的多核苷酸。
本发明还提供了包含上述的多核苷酸的载体,以及包含所述的载体的细胞。
本发明涉及的编码上述的乳头瘤病毒嵌合蛋白的多核苷酸序列适用于不同的表达系统。可选地,这些核苷酸序列采用大肠杆菌密码子进行全基因优化,可在大肠杆菌表达系统中高水平表达;或采用昆虫细胞密码子进行全基因优化,可在昆虫细胞表达系统中高水平表达。
本发明还提供了一种多聚物,优选地,该多聚物为乳头瘤病毒嵌合五聚体或嵌合病毒样颗粒,其含有上述的乳头瘤病毒嵌合蛋白,或者由上述的乳头瘤病毒嵌合蛋白所形成。
本发明还提供了上述的乳头瘤病毒嵌合蛋白、乳头瘤病毒嵌合五聚体或上述的乳头瘤病毒嵌合病毒样颗粒在制备预防乳头瘤病毒感染和/或所述乳头瘤病毒感染诱发的疾病的疫苗中的用途,优选地,所述乳头瘤病毒感染诱发的疾病包括但不限于宫颈癌、阴道癌、阴唇癌、阴茎癌、肛门肛周癌、口咽癌、扁桃体癌及口腔癌;
优选地,所述乳头瘤病毒感染为一种或多种选自以下乳头瘤病毒型别的感染:HPV16、HPV18、HPV26、HPV31、HPV33、HPV35、HPV39、HPV45、HPV51、 HPV52、HPV53、HPV56、HPV58、HPV59、HPV66、HPV68、HPV70、HPV73; HPV6、HPV11、HPV2、HPV5、HPV27和HPV57。
本发明还提供了一种用于预防乳头瘤病毒感染及感染诱发的疾病的疫苗,其包含上述的乳头瘤病毒嵌合五聚体或嵌合病毒样颗粒、佐剂、以及疫苗用赋形剂或载体,优选地,还包含至少一种嗜黏膜组和/或嗜皮肤组的HPV的病毒样颗粒或嵌合病毒样颗粒。其中,这些病毒样颗粒的含量分别为能诱发保护性免疫反应的有效量。
可选地,所述佐剂为人用佐剂。
发明中相关术语的说明及解释
根据本发明,术语“昆虫细胞表达系统”包括昆虫细胞、重组杆状病毒、重组Bacmid及表达载体。其中昆虫细胞来源于市场上可得到的细胞,在此举例但不限于:Sf9,Sf21,High Five。
根据本发明,术语“原核表达系统”包括但不限于大肠杆菌表达系统。其中表达宿主菌来源于市场上可得到的菌株,在此举例但不限于:BL21(DE3), BL21(DE3)plysS,C43(DE3),Rosetta-gami B(DE3)。
根据本发明,术语“全长HPV18型L1蛋白”的例子包括但不限于NCBI数据库中编号为ATL15070.1的蛋白等长的全长L1蛋白。
“截短型HPV18型L1蛋白”的基因片段指的是其与野生型HPV 18型L1蛋白基因相比,在其5’端和/或3’端缺失编码1个或多个氨基酸的核苷酸,其中“野生型 HPV18型L1蛋白”的全长序列例如但不限于NCBI数据库中的如下序列: ATL15214.1、ATL14646.1、ARS43458.1、ARS43428.1、ARS43449.1、 AGU90430.1等。
根据本发明,术语“疫苗用赋形剂或载体”是指选自一种或多种,包括但不限于:pH调节剂,表面活性剂,离子强度增强剂。例如,pH调节剂举例但不限于磷酸盐缓冲液,表面活性剂包括阳离子、阴离子或非离子型表面活性剂,举例但不限于聚山梨酯80(Tween-80),离子强度增强剂举例但不限于氯化钠。
根据本发明,术语“人用佐剂”是指在临床上可应用于人体的佐剂,包括当前已获得批准的和将来可能获得批准的各种佐剂,例如但不限于铝佐剂、MF59及各种形式的佐剂组合物。
根据本发明,本发明的疫苗可采用患者可接受的形式,包括但不限于口服或者注射,优选注射。
根据本发明,本发明疫苗优选单位剂型使用,其中单位剂型中蛋白病毒样颗粒的剂量为5μg-100μg,例如5、10、15、20、25、30、35、40、45、50、55、60、 65、70、75、80、85、90、95、100μg、以及上述任意两个数值之间的范围;优选 30μg-60μg。
附图说明
图1A-图1B:本发明实施例6中嵌合蛋白在大肠杆菌及昆虫细胞中的表达鉴定。结果显示,26种嵌合蛋白均可在大肠杆菌或昆虫细胞中表达,其中有6种嵌合蛋白可在两种表达系统中表达。
图1A:嵌合蛋白在大肠杆菌中的表达鉴定:1为18L1DE137-138/59dES;2为18L1DE137-138/59dE;3为18L1h4432-433/59dE;4为18L1h4434-435/59dE;5为 18L1DE134-135/59dES;6为18L1DE134-135/59dE;7为18L1DE131-138/59dE;8为18L1DE121-124/59dE,9为18L1h4431-433/59dE;10为18L1h4432-435/59dE;11为 18L1DE137-138/59dES-mut1;12为18L1DE137-138/59dES-mut2;13为 18L1DE137-138/59dES-mut3;14为18L1DE137-138/59dES-mut4;15为 18L1DE137-138/59dES-mut5;16为18L1DE137-138/59dES-mut6;
图1B:嵌合蛋白在昆虫细胞中的表达鉴定:1为18L1ΔCDE137-138/59dES;2 为18L1ΔCDE137-138/59dE;3为18L1ΔCh4432-433/59dE;4为18L1ΔCh4434-435/59dE; 5为18L1ΔCDE134-135/59dES;6为18L1ΔCDE134-135/59dE;7为 18L1ΔCDE131-138/59dE;8为18L1ΔCDE121-124/59dE,9为18L1ΔCh4431-433/59dE;10 为18L1ΔCh4432-435/59dE;11为18L1DE137-138/59dES-mut1;12为 18L1DE137-138/59dES-mut2;13为18L1DE137-138/59dES-mut3;14为18L1DE137-138/59dES-mut4;15为18L1DE137-138/59dES-mut5;16为 18L1DE137-138/59dES-mut6。
图2A-图2D:本发明实施例6中纯化后获得的cVLP的动态光散射分析结果。结果显示18L1ΔCDE134-135/59dE、18L1ΔCDE134-135/59dES、18L1ΔCDE137-138/59dE 及18L1ΔCDE137-138/59dES重组蛋白形成的病毒样颗粒水化动力学直径分别为 106.8nm、113.3nm、114.7nm和122.9nm,颗粒组装的百分比均为100%。
图2A:18L1ΔCDE134-135/59dE;图2B:18L1ΔCDE134-135/59dES;图2C: 18L1ΔCDE137-138/59dE;图2D:18L1ΔCDE137-138/59dES。
图3A-图3D:本发明实施例7中纯化后获得的cVLP的透射电镜观察结果。视野中可见大量的病毒样颗粒,颗粒均一度好。cVLP直径为50nm左右,与L1 蛋白的VLP的大小相似。Bar=50nm。
图3A:18L1ΔCDE134-135/59dE;图3B:18L1ΔCDE134-135/59dES;图3C: 18L1ΔCDE137-138/59dE;图3D.18L1ΔCDE137-138/59dES。
图4:本发明实施例10中的嵌合VLP小鼠免疫血清对α7亚属HPV假病毒的中和活性检测。*:P<0.05。
具体实施方式
下面将通过下述非限制性实施例进一步说明本发明,本领域技术人员公知,在不背离本发明精神的情况下,可以对本发明做出许多修改,这样的修改也落入本发明的范围。下面的实施例仅用于说明本发明,而不应视为限定本发明的范围,因为实施方案必然是多样的。本说明书中使用的用语仅是为了阐述特定的实施方案,而非作为限制,本发明的范围已界定在所附的权利要求中。
除非特别说明,本说明书中所使用的所有技术和科学用语均和本案所属技术领域的技术人员所普遍明了的意义相同。下面就本发明的优选方法和材料加以叙述,但是与本说明书中所述方法和材料类似或等效的任何方法和材料均可用以实施或测试本发明。下述实验方法如无特别说明,均为常规方法或产品说明书所描述的方法,所使用的实验材料如无特别说明,均可容易地从商业公司获取。本说明书中所提到的所有公开文献均被并入于此作为参考,以揭示并说明所述公开文献中的方法和/或材料。
实施例1:不同型别RG-1表位肽的免疫活性检测
采用化学合成法合成HPV35、-39、-51、-53、-56、-59、-68、-82的RG-1表位肽,表位肽序列如表1所示,多肽由上海吉尔生化有限公司合成,为了提高合成肽的免疫原性,各合成肽通过1-(3-二甲氨基丙基)-3-乙基碳二亚胺盐酸盐 (EDC,CAS No.25952-53-8)活化羧基后与钥孔血蓝蛋白(KLH)偶联。
取2.0-2.5kg体重的新西兰大白兔,随机分组,每组2-4只,于免疫前4天背部多点皮下注射15mg灭活的与等体积弗氏完全佐剂充分混匀的DH5a(含0.5%v/v 甲醛的PBS,37℃处理24-48h)进行免疫刺激,首次免疫采用背部、大腿内侧多点皮下注射1mg与等体积弗氏完全佐剂充分混匀的KLH-多肽。加强免疫4次,每次间隔2周,加强免疫的抗原为0.5mg与等体积弗氏不完全佐剂(充分混匀的 KLH-多肽。最后一次免疫后2周采血,分离血清。
使用17种HPV假病毒对免疫血清的中和抗体滴度进行检测,结果如表2所示。59RG-1表位肽的免疫活性最好,其抗血清可中和所有17种检测型别,其中 HPV45,-59,-16的中和抗体的滴度均在103以上,HPV5,-31,-18,-39,-68,-57的中和抗体滴度在500-1000之间。值得注意的是,59RG-1表位肽抗血清对检测的5种α7-HPV的中和抗体水平均很高。
多肽合成、假病毒制备及假病毒中和实验的方法均是公开的,例如专利CN104418942A及108676057A。
表1.合成的不同型别RG-1表位肽的序列
型别 | 合成肽序列 | SEQ ID NO. |
HPV35 | TQLYRTCKAAGTCPPDVIPKVEG | 44 |
HPV39 | STLYRTCKQSGTCPPDVVDKVEG | 45 |
HPV51 | TQLYSTCKAAGTCPPDVVNKVEG | 46 |
HPV53 | TQLYQTCKQSGTCPEDVINKIEH | 47 |
HPV56 | TQLYKTCKLSGTCPEDVVNKIEQ | 48 |
HPV59 | LYKTCKQ AGTCP SDVIN KVEGTT | 49 |
HPV68 | STLYKTCKQSGTCPPDVINKVEG | 50 |
HPV82 | TQLYSTCKAAGTCPPDVIPKVKG | 51 |
表2.不同的RG1-KLH偶联肽在兔子中诱发的血清中和抗体滴度
实施例2:嵌合L1蛋白的基因的合成及表达载体构建
26种嵌合L1蛋白,分别为:
1)嵌合L1蛋白18L1DE137-138/59dES:骨架为全长HPV18型L1蛋白(序列如SEQ IDNo.1所示),在其DE环aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32多肽(如SEQ IDNo.4所示),18L1DE137-138/59dES嵌合蛋白的氨基酸序列如SEQ ID No.7所示。编码18L1DE137-138/59dES的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
2)嵌合L1蛋白18L1DE137-138/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),在其DE环aa.137/138位点,直接插入HPV59型L2蛋白的 aa.16-35多肽(如SEQID No.5所示),18L1DE137-138/59dE嵌合蛋白的氨基酸序列如SEQ ID No.9所示。编码18L1DE137-138/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
3)嵌合L1蛋白18L1h4432-433/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),在其h4区aa.432/433位点,直接插入HPV59型L2蛋白的 aa.17-37多肽(如SEQID No.3所示),18L1h4432-433/59dE嵌合蛋白的氨基酸序列如SEQ ID No.11所示。编码18L1h4432-433/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
4)嵌合L1蛋白18L1h4434-435/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),在其h4区aa.434/435位点,直接插入HPV59型L2蛋白的 aa.17-37多肽(如SEQID No.3所示),18L1h4434-435/59dE嵌合蛋白的氨基酸序列如SEQ ID No.13所示。编码18L1h4434-435/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
5)嵌合L1蛋白18L1DE134-135/59dES:骨架为全长HPV18型L1蛋白(序列如SEQ IDNo.1所示),在其DE环aa.134/135位点,直接插入N端含GP连接子且C端含P连接子的HPV59型L2蛋白的aa.17-32多肽(即在SEQ ID No.4所示序列的N端添加甘氨酸-脯氨酸,且在C端添加脯氨酸),18L1DE134-135/59dES嵌合蛋白的氨基酸序列如SEQ ID No.15所示。编码18L1DE134-135/59dES的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
6)嵌合L1蛋白18L1DE134-135/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),在其DE环aa.134/135位点,直接插入N端含GP连接子且 C端含P连接子的HPV59型L2蛋白的aa.16-37多肽(即在SEQ ID No.6所示序列的N端添加甘氨酸-脯氨酸,且在C端添加脯氨酸),18L1DE134-135/59dE嵌合蛋白的氨基酸序列如SEQ ID No.17所示。编码18L1DE134-135/59dES的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
7)嵌合L1蛋白18L1DE131-138/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),删除其aa.132-137区域,并在aa.131/138之间融合N端包含 GP连接子的HPV59型L2蛋白的aa.17-37多肽(在HPV 18型L1蛋白的aa.132-137 区域非等长置换插入),插入片段的氨基酸序列为SEQ ID No.3所示序列的N端添加甘氨酸-脯氨酸,18L1DE131-138/59dE嵌合蛋白的氨基酸序列如SEQ ID No.19 所示。编码18L1DE131-138/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
8)嵌合L1蛋白18L1DE121-124/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),删除其aa.122-123区域,并在aa.121/124之间融合HPV59 型L2蛋白的aa.17-37多肽(在HPV 18型L1蛋白的aa.122-133区域非等长置换插入),插入片段的氨基酸序列如SEQ ID No.3所示,18L1DE121-124/59dE嵌合蛋白的氨基酸序列如SEQ ID No.21所示。编码18L1DE121-124/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
9)嵌合L1蛋白18L1h4431-433/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),删除其aa.432,并在aa.431/433之间融合HPV59型L2蛋白的aa.17-37多肽(在HPV 18型L1蛋白的aa.431-433区域非等长置换插入),插入片段的氨基酸序列如SEQ IDNo.3所示,18L1h4431-433/59dE嵌合蛋白的氨基酸序列如SEQ ID No.23所示。编码18L1h4431-433/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
10)嵌合L1蛋白18L1h4432-435/59dE:骨架为全长HPV18型L1蛋白(序列如 SEQ IDNo.1所示),删除其aa.433-434区域,并在aa.432/435之间融合HPV59 型L2蛋白的aa.17-37多肽(在HPV 18型L1蛋白的aa.432-435区域非等长置换插入),插入片段的氨基酸序列如SEQ ID No.3所示,18L1h4432-435/59dE嵌合蛋白的氨基酸序列如SEQ ID No.25所示。编码18L1h4433-434/59dE的多核苷酸序列经大肠杆菌密码子优化设计,采用全基因合成的方式构建;
11)嵌合L1蛋白18L1ΔCDE137-138/59dES:骨架为C端截短32个氨基酸的HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其DE环 aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32多肽(如SEQ ID No.4所示),18L1ΔCDE137-138/59dES嵌合蛋白的氨基酸序列如SEQ ID No.8所示。编码 18L1ΔCDE137-138/59dES的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建,其核苷酸序列如SEQ ID No.33所示;
12)嵌合L1蛋白18L1ΔCDE137-138/59dE:骨架为C端截短32个氨基酸的 HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其DE环 aa.137/138位点,直接插入HPV59型L2蛋白的aa.16-35多肽(如SEQ ID No.5所示),18L1ΔCDE137-138/59dE嵌合蛋白的氨基酸序列如SEQ ID No.10所示。编码 18L1ΔCDE137-138/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建,其核苷酸序列如SEQ ID No.34所示;
13)嵌合L1蛋白18L1ΔCh4432-433/59dE:骨架为C端截短32个氨基酸的HPV18 型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其h4区aa.432/433 位点,直接插入HPV59型L2蛋白的aa.17-37多肽(如SEQ ID No.3所示), 18L1ΔCh4432-433/59dE嵌合蛋白的氨基酸序列如SEQ ID No.12所示。编码 18L1ΔCh4432-433/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建;
14)嵌合L1蛋白18L1ΔCh4434-435/59dE:骨架为C端截短32个氨基酸的HPV18 型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其h4区aa.434/435 位点,直接插入HPV59型L2蛋白的aa.17-37多肽(如SEQ ID No.3所示), 18L1ΔCh4434-435/59dE嵌合蛋白的氨基酸序列如SEQ ID No.14所示。编码 18L1ΔCh4434-435/59dE的多核苷酸序列经昆虫细胞码子优化设计,采用全基因合成的方式构建;
15)嵌合L1蛋白18L1ΔCDE134-135/59dES:骨架为C端截短32个氨基酸的 HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其DE环 aa.134/135位点,直接插入N端含GP连接子且C端含P连接子的HPV59型L2 蛋白的aa.17-32多肽(即在SEQ ID No.4所示序列的N端添加甘氨酸-脯氨酸,且在C端添加脯氨酸),18L1ΔCDE134-135/59dES嵌合蛋白的氨基酸序列如SEQ ID No.16所示。编码18L1ΔCDE134-135/59dES的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建,其核苷酸序列如SEQ ID No.35所示;
16)嵌合L1蛋白18L1ΔCDE134-135/59dE:骨架为C端截短32个氨基酸的 HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),在其DE环 aa.134/135位点,直接插入N端含GP连接子且C端含P连接子的HPV59型L2 蛋白的aa.16-37多肽(即在SEQ ID No.6所示序列的N端添加甘氨酸-脯氨酸,且在C端添加脯氨酸),18L1ΔCDE134-135/59dE嵌合蛋白的氨基酸序列如SEQ ID No.18所示。编码18L1ΔCDE134-135/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建,其核苷酸序列如SEQ ID No.36所示;
17)嵌合L1蛋白18L1ΔCDE131-138/59dE:骨架为C端截短32个氨基酸的 HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),删除其aa.132-137 区域,并在aa.131/138之间融合N端包含GP连接子的HPV59型L2蛋白的aa.17-37 多肽(在HPV 18型L1蛋白的aa.132-137区域非等长置换插入),插入片段的氨基酸序列为SEQ ID No.3所示序列的N端添加甘氨酸-脯氨酸,18L1DE131-138/59dE 嵌合蛋白的氨基酸序列如SEQ ID No.20所示。编码18L1DE131-138/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建,其核苷酸序列如SEQ ID No.37所示;
18)嵌合L1蛋白18L1ΔCDE121-124/59dE:骨架为C端截短32个氨基酸的 HPV18型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),删除其aa.122-123 区域,并在aa.121/124之间融合HPV59型L2蛋白的aa.17-37多肽(在HPV 18型 L1蛋白的aa.122-133区域非等长置换插入),插入片段的氨基酸序列如SEQ ID No.3所示,18L1ΔCDE121-124/59dE嵌合蛋白的氨基酸序列如SEQ ID No.22所示。编码18L1ΔCDE121-124/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建;
19)嵌合L1蛋白18L1ΔCh4431-433/59dE:骨架为C端截短32个氨基酸的HPV18 型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),删除其aa.432,并在 aa.431/433之间融合HPV59型L2蛋白的aa.17-37多肽(在HPV 18型L1蛋白的 aa.431-433区域非等长置换插入),插入片段的氨基酸序列如SEQ ID No.3所示, 18L1ΔCh4431-433/59dE嵌合蛋白的氨基酸序列如SEQ ID No.24所示。编码 18L1ΔCh4431-433/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建;
20)嵌合L1蛋白18L1ΔCh4432-435/59dE:骨架为C端截短32个氨基酸的HPV18 型L1蛋白(序列SEQ ID No.1的C端截短32个氨基酸),删除其aa.433-434区域,并在aa.432/435之间融合HPV59型L2蛋白的aa.17-37多肽(在HPV 18型 L1蛋白的aa.432-435区域非等长置换插入),插入片段的氨基酸序列如SEQ ID No.3所示,18L1ΔCh4432-435/59dE嵌合蛋白的氨基酸序列如SEQ ID No.26所示。编码18L1ΔCh4432-435/59dE的多核苷酸序列经昆虫细胞密码子优化设计,采用全基因合成的方式构建;
21)嵌合L1蛋白18L1DE137-138/59dES-mut1:骨架为全长HPV18型L1蛋白的突变体mut1(即将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、504、506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体),在其DE环aa.137/138位点,直接插入HPV59型L2蛋白的 aa.17-32多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut1嵌合蛋白的氨基酸序列如SEQ ID No.27所示。编码18L1DE137-138/59dES-mut1的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut1的多核苷酸序如 SEQ IDNo.38所示;
22)嵌合L1蛋白18L1DE137-138/59dES-mut2:骨架为全长HPV18型L1蛋白的突变体mut2(即将SEQ ID No.1所示序列的氨基酸477、478、485、496、499、 504、506置换为甘氨酸且将氨基酸486、500、502置换为丝氨酸),在其DE环 aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut2嵌合蛋白的氨基酸序列如SEQ ID No.28所示。编码18L1DE137-138/59dES-mut2的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut2的多核苷酸序如SEQ ID No.39所示;
23)嵌合L1蛋白18L1DE137-138/59dES-mut3:骨架为全长HPV18型L1蛋白的突变体mut3(即将SEQ ID No.1所示序列的氨基酸477、478、484、496、499、 502、506置换为甘氨酸、将氨基酸485、500置换为丝氨酸且将氨基酸504置换为天冬氨酸),在其DE环aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32 多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut3嵌合蛋白的氨基酸序列如SEQ ID No.29所示。编码18L1DE137-138/59dES-mut3的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut3的多核苷酸序如SEQ IDNo.40所示;
24)嵌合L1蛋白18L1DE137-138/59dES-mut4:骨架为全长HPV18型L1蛋白的突变体mut4(即将SEQ ID No.1所示序列的氨基酸477、478、485、496、502、 506置换为甘氨酸、将氨基酸486、500置换为丝氨酸且将氨基酸499、504置换为天冬氨酸),在其DE环aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32 多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut4嵌合蛋白的氨基酸序列如SEQ ID No.30所示。编码18L1DE137-138/59dES-mut4的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut4的多核苷酸序如SEQ IDNo.41所示;
25)嵌合L1蛋白18L1DE137-138/59dES-mut5:骨架为全长HPV18型L1蛋白的突变体mut5(即将SEQ ID No.1所示序列的氨基酸477、484、496、499、504、 506置换为甘氨酸且将氨基酸485、500、502置换为丝氨酸),在其DE环 aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut5嵌合蛋白的氨基酸序列如SEQ ID No.31所示。编码18L1DE137-138/59dES-mut5的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut5的多核苷酸序如SEQ ID No.42所示;
26)嵌合L1蛋白18L1DE137-138/59dES-mut6:骨架为全长HPV18型L1蛋白的突变体mut6(即将SEQ ID No.1所示序列的氨基酸477、485、496、499、504、 506置换为甘氨酸且将氨基酸486、500、502置换为丝氨酸),在其DE环 aa.137/138位点,直接插入HPV59型L2蛋白的aa.17-32多肽(如SEQ ID No.4所示),18L1DE137-138/59dES-mut6嵌合蛋白的氨基酸序列如SEQ ID No.32所示。编码18L1DE137-138/59dES-mut6的多核苷酸序列经大肠杆菌密码子优化设计或昆虫细胞密码子优化设计,采用全基因合成的方式构建,其中采用昆虫细胞密码子优化的18L1DE137-138/59dES-mut5的多核苷酸序如SEQ ID No.43所示。
大肠杆菌密码子优化的嵌合蛋白基因经NdeI/XhoI酶切后,分别插入商业化的表达载体pET22b(Novagen公司生产)。昆虫细胞密码子优化的嵌合蛋白基因经BamHI/EcoRI酶切后,分别插入商业化表达载体pFastBac1(Invitrogen公司生产)中。得到包含嵌合蛋白基因的表达载体,用于大肠杆菌的表达载体共16个,分别为:pET22b-18L1DE137-138/59dES,pET22b-18L1DE137-138/59dE, pET22b-18L1h4432-433/59dE,pET22b-18L1h4434-435/59dE,pET22b-18L1DE134-135/59dES,pET22b-18L1DE134-135/59dE, pET22b-18L1DE131-138/59dE,pET22b-18L1DE121-124/59dE, pET22b-18L1h4431-433/59dE,pET22b-18L1h4432-435/59dE,pET22b-18L1DE137-138/59dES-mut1,pET22b-18L1DE137-138/59dES-mut2, pET22b-18L1DE137-138/59dES-mut3,pET22b-18L1DE137-138/59dES-mut4, pET22b-18L1DE137-138/59dES-mut5,pET22b-18L1DE137-138/59dES-mut6;用于昆虫细的表达载体共16个,分别为:pFastBac1-18L1ΔCDE137-138/59dES, pFastBac1-18L1ΔCDE137-138/59dE,pFastBac1-18L1ΔCh4432-433/59dE, pFastBac1-18L1ΔCh4434-435/59dE,pFastBac1-18L1ΔCDE134-135/59dES,pFastBac1-18L1ΔCDE134-135/59dE,pFastBac1-18L1ΔCDE131-138/59dE, pFastBac1-18L1ΔCDE121-124/59dE,pFastBac1-18L1ΔCh4431-433/59dE, pFastBac1-18L1ΔCh4432-435/59dE,pFastBac1-18L1DE137-138/59dES-mut1, pFastBac1-18L1DE137-138/59dES-mut2,pFastBac1-18L1DE137-138/59dES-mut3, pFastBac1-18L1DE137-138/59dES-mut4,pFastBac1-18L1DE137-138/59dES-mut5, pFastBac1-18L1DE137-138/59dES-mut6。上述酶切、连接及克隆构建的方法都是公知的,例如专利CN101293918B。
本发明中使用的多肽的氨基酸序列如下所示:
HPV18型L1全长氨基酸
HPV59型L2全长氨基酸序列
HPV 59型L2 aa.17-37
LYKTCKQAGTCPSDVINKVEG SEQ ID NO.3
HPV 59型L2 aa.17-32
LYKTCKQAGTCPSDVI SEQ ID NO.4
HPV 59型L2 aa.16-35
DLYKTCKQAGTCPSDVINKV SEQ ID NO.5
HPV 59型L2 aa.16-37
DLYKTCKQAGTCPSDVINKVEG SEQ ID NO.6
18L1DE137-138/59dES
18L1ΔCDE137-138/59dES
18L1DE137-138/59dE
18L1ΔCDE137-138/59dE
18L1h4432-433/59dE
18L1ΔCh4432-433/59dE
18L1h4434-435/59dE
18L1ΔCh4434-435/59dE
18L1DE134-135/59dES
18L1ΔCDE134-135/59dES
18L1DE134-135/59dE
18L1ΔCDE134-135/59dE
18L1DE131-138/59dE
18L1ΔCDE131-138/59dE
18L1DE121-124/59dE
18L1ΔCDE121-124/59dE
18L1h4431-433/59dE
18L1ΔCh4431-433/59dE
18L1h4432-435/59dE
18L1ΔCh4432-435/59dE
18L1DE137-138/59dES-mut1
18L1DE137-138/59dES-mut2
18L1DE137-138/59dES-mut3
18L1DE137-138/59dES-mut4
18L1DE137-138/59dES-mut5
18L1DE137-138/59dES-mut6
编码本发明嵌合蛋白的核苷酸序列如下所示:
18L1ΔCDE137-138/59dES nt
18L1ΔCDE137-138/59dE nt
18L1ΔCDE134-135/59dES nt
18L1ΔCDE134-135/59dE nt
18L1ΔCDE131-138/59dE nt
18L1DE137-138/59dES-mut1 nt
18L1DE137-138/59dES-mut2 nt
18L1DE137-138/59dES-mut3 nt
18L1DE137-138/59dES-mut4 nt
18L1DE137-138/59dES-mut5 nt
18L1DE137-138/59dES-mut6 nt
实施例3:嵌合L1蛋白的基因的重组Bacmid及重组杆状病毒的构建
分别使用包含嵌合L1基因的重组表达载体pFastBac1-18L1ΔCDE137-138/59dES,pFastBac1-18L1ΔCDE137-138/59dE, pFastBac1-18L1ΔCh4432-433/59dE,pFastBac1-18L1ΔCh4434-435/59dE, pFastBac1-18L1ΔCDE134-135/59dES,pFastBac1-18L1ΔCDE134-135/59dE,pFastBac1-18L1ΔCDE131-138/59dE,pFastBac1-18L1ΔCDE121-124/59dE, pFastBac1-18L1ΔCh4431-433/59dE,pFastBac1-18L1ΔCh4432-435/59dE, pFastBac1-18L1DE137-138/59dES-mut1,pFastBac1-18L1DE137-138/59dES-mut2, pFastBac1-18L1DE137-138/59dES-mut3,pFastBac1-18L1DE137-138/59dES-mut4, pFastBac1-18L1DE137-138/59dES-mut5,pFastBac1-18L1DE137-138/59dES-mut6转化大肠杆菌DH10Bac感受态,筛选获得重组Bacmid,然后用重组Bacmid转染昆虫细胞Sf9,在Sf9内扩增重组杆状病毒。重组Bacmid的筛选及重组杆状病毒的扩增方法都是公知的,例如专利CN101148661B。
实施例4:嵌合L1蛋白的基因在Sf9细胞中的表达
Sf9细胞分别接种16种嵌合L1基因的重组杆状病毒,进行嵌合L1蛋白的表达,27℃培养约88h后收发酵液,3000rpm离心15min,弃上清,用PBS洗涤细胞后,用于表达鉴定及纯化。感染表达的方法是公开的,例如专利CN 101148661 B。
实施例5:嵌合L1蛋白的基因在大肠杆菌中的表达
分别使用包含嵌合L1基因的重组表达载体pET22b-18L1DE137-138/59dES, pET22b-18L1DE137-138/59dE,pET22b-18L1h4432-433/59dE, pET22b-18L1h4434-435/59dE,pET22b-18L1DE134-135/59dES, pET22b-18L1DE134-135/59dE,pET22b-18L1DE131-138/59dE, pET22b-18L1DE121-124/59dE,pET22b-18L1h4431-433/59dE, pET22b-18L1h4432-435/59dE,pET22b-18L1DE137-138/59dES-mut1, pET22b-18L1DE137-138/59dES-mut2,pET22b-18L1DE137-138/59dES-mut3, pET22b-18L1DE137-138/59dES-mut4,pET22b-18L1DE137-138/59dES-mut5,pET22b-18L1DE137-138/59dES-mut6转化大肠杆菌BL21(DE3)。
挑单克隆接种到3ml含氨苄青霉素的LB培养基中,37℃培养过夜。将过夜培养的菌液按1:100的比例加入LB培养基中,37℃培养3h左右,待OD600达到0.8-1.0之间,加IPTG至终浓度0.5μM,16℃培养约12h,收取菌液。
实施例6:嵌合L1蛋白的表达鉴定
取实施例4及实施例5中所述表达不同嵌合L1蛋白的细胞各1×106个,重悬于200μl PBS溶液中,加入6×Loading Buffer 50μl,75℃变性8分钟,分别取10 μl进行SDS-PAGE电泳及Western blot鉴定。结果如图1A-图1B所示,26种嵌合L1蛋白均可在昆虫细胞或原核表达系统中高水平表达,其中 18L1DE137-138/59dES,18L1DE137-138/59dE,18L1h4432-433/59dE,18L1h4434-435/59dE,18L1DE134-135/59dES,18L1DE134-135/59dE, 18L1DE131-138/59dE,18L1DE121-124/59dE,18L1h4431-433/59dE, 18L1h4432-435/59dE,18L1DE137-138/59dES-mut1,18L1DE137-138/59dES-mut2, 18L1DE137-138/59dES-mut3,18L1DE137-138/59dES-mut4,18L1DE137-138/59dES-mut5,18L1DE137-138/59dES-mut6大小约59kDa,其余10种蛋白大小约55kDa。SDS-PAGE电泳及Western blot鉴定的方法是公开的,例如专利 CN101148661B。
实施例7:嵌合L1蛋白在昆虫细胞中的表达量比较
取C端截短32个氨基酸的18L1骨架蛋白的表达细胞以及实施例4中所述的以C端截短32个氨基酸的18L1为骨架或以6种18L1突变体为骨架的嵌合L1蛋白表达细胞各1×106个,重悬于200μl PBS溶液中,采用超声破碎法(宁波新芝超声破碎仪,2#探头,100W,超声5s,间隔7s,总时间3min)破碎细胞, 12000rpm高速离心10分钟。收取裂解上清,采用夹心ELISA法检测上清中的L1 含量,该方法是公知的,例如专利CN104513826A。
使用本发明人制备的HPV18 L1单克隆抗体包被酶标板,80ng/孔,4℃孵育过夜;使用5%BSA-PBST室温封闭2h,再用PBST洗板3次。用PBS将裂解上清进行连续2倍稀释,并且将HPV18L1 VLP标准品也进行梯度稀释,浓度从2μ g/ml-0.0625μg/ml,分别加入酶标板,每孔100μl,37℃孵育1h。用PBST洗板3次,加入1:3000稀释的HPV18L1兔多抗,每孔100μl,37℃孵育1h。用 PBST洗板3次,加入1:3000稀释的HRP标记的山羊抗小鼠IgG(1:3000稀释,中杉金桥公司),37℃孵育45分钟。用PBST洗板5次,每孔加入100μl OPD底物(Sigma公司),37℃显色5分钟,用50μl 2M硫酸终止反应,在490nm处测定吸光值。依据标准曲线计算裂解上清中HPV18L1蛋白及18L1嵌合蛋白的浓度。
结果如表3所示,本发明的18L1ΔCDE134-135/59dES、 18L1ΔCDE137-138/59dES、18L1ΔCh4431-433/59dE及18L1ΔCh4432-435/59dE的表达量很高,与HPV18L1骨架的相当;此外,以C端氨基酸置换的18L1突变体为骨架的嵌合蛋白18L1DE137-138/59dES-mut1、18L1DE137-138/59dES-mut4、 18L1DE137-138/59dES-mut5的表达量均高于HPV18L1骨架及相应的C端截短的嵌合蛋白。
表3.嵌合L1蛋白表达量分析
实施例8:嵌合L1蛋白的纯化及动态光散射粒径分析
取嵌合L1的细胞发酵液适量,使用10ml PBS重悬细胞,加PMSF至终浓度 1mg/ml,超声破碎(宁波新芝超声破碎仪,6#探头,200W,超声5s,间隔7s,总时间10min),取破碎上清进行纯化,纯化步骤在室温进行。在裂解液中加入 4%β-巯基乙醇(w/w)对VLP进行解聚,然后使用0.22μm滤器过滤样品,依次使用DMAE阴离子交换层析或CM阳离子交换层析(20mMTris,180mM NaCl,4%β-ME,pH7.9洗脱)、TMAE阴离子交换层析或Q阳离子交换层析(20mMTris,180 mM NaCl,4%β-ME,pH7.9洗脱)及羟基磷灰石层析(100mM NaH2PO4,30mM NaCl,4%β-ME,pH 6.0洗脱)纯化。纯化产物采用Planova超滤系统进行浓缩,并更换缓冲液(20mMNaH2PO4,500mM NaCl,pH6.0)促使VLP组装。以上纯化方法均是公开的,例如专利CN101293918B、CN1976718A等。
嵌合蛋白纯品组装过程中发现,18L1h4431-433/59dE、18L1h4432-435/59dE、 18L1ΔCh4431-433/59dE及18L1ΔCh4432-435/59dE严重凝集,其他嵌合蛋白组装后未观察到凝集现象。取组装后的嵌合蛋白溶液进行DLS粒径分析(Zetasizer Nano ZS 90动态光散射仪,Malvern公司),结果如表4所示,其中18L1ΔCDE134-135/59dE、 18L1ΔCDE134-135/59dES、18L1ΔCDE137-138/59dE、18L1ΔCDE137-138/59dES的DLS 分析图如图2A至图2D所示。
表4.嵌合L1蛋白DLS分析
实施例9:嵌合VLP的透射电镜观察
按实施例8所述的层析纯化方法,分别纯化嵌合蛋白,使用组装后嵌合制备铜网,并用1%醋酸铀进行染色,充分干燥后使用JEM-1400电镜(奥林巴斯)进行观察。结果显示,大肠杆菌及昆虫细胞表达的嵌合蛋白均可组装成直径约为 50nm的cVLP。其中18L1ΔCDE134-135/59dE、18L1ΔCDE134-135/59dES、 18L1ΔCDE137-138/59dE、18L1ΔCDE137-138/59dEScVLP的电镜图片如图3A至图3D 所示。铜网制备及电镜观察的方法均是公开的,例如专利CN101148661 B。
实施例10:嵌合VLP的小鼠免疫及中和抗体滴度测定
取4-6周龄的BALB/c小鼠,随机分组,每组5只,用10μg cVLP联合 Al(OH)3 50μg及MPL佐剂5μg免疫小鼠。皮下注射,于第0,4,7,10周免疫,共4次。第4次免疫后2周尾静脉采血,分离血清。
使用24种HPV假病毒对免疫血清的中和抗体滴度进行检测,结果显示大肠杆菌及昆虫细胞表达体系生产的各种cVLP免疫小鼠后,诱发的交叉中和抗体水平及中和范围各有不同。其中,如表5所示,昆虫细胞表达的 18L1ΔCDE134-135/59dES及18L1ΔCDE137-138/59dEScVLP抗血清可中和至少23个型别的假病毒,18L1ΔCDE134-135/59dE及18L1ΔCDE137-138/59dEcVLP免疫血清可中和至少19个型别的假病毒。值得一提的是,18L1ΔCDE134-135/59dES及18L1ΔCDE137-138/59dES cVLP抗血清可中和所有的6种检测用的α7-HPV,特别是 18L1ΔCDE137-138/59dES cVLP,交叉中和α7-HPV的抗体滴度都在250以上,是目前报道的交叉中和α7-HPV能力最强的cVLP。
此外,本发明中采用C端截短32个氨基酸的18L1突变体构建的cVLP,采用上述策略免疫小鼠后,也可诱发的高水平的中和抗体,其中 18L1DE137-138/59dES-mut4诱发的各型中和抗体水平均与18L1ΔCDE137-138/59dES的相当,值得注意的是,18L1DE137-138/59dES-mut4免疫血清的HPV39和HPV59中和抗体滴度均大于103(如表5、图4所示)。
假病毒制备及假病毒中和实验的方法均是公开的,例如专利CN 104418942A。
表5不同cVLP在小鼠中诱发的中和抗体滴度
*ND表示在最低稀释度下未检测到中和抗体。
序列表
<110> 中国医学科学院基础医学研究所
<120> 一种人乳头瘤病毒18型嵌合蛋白及其用途
<130> 300263CG
<160> 51
<170> SIPOSequenceListing 1.0
<210> 1
<211> 507
<212> PRT
<213> HPV 18
<400> 1
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
435 440 445
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
450 455 460
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro Thr
465 470 475 480
Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Lys
485 490 495
Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
500 505
<210> 2
<211> 464
<212> PRT
<213> HPV 59
<400> 2
Met Val Ser His Arg Ala Ala Arg Arg Lys Arg Ala Ser Ala Thr Asp
1 5 10 15
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
20 25 30
Asn Lys Val Glu Gly Thr Thr Leu Ala Asp Lys Ile Leu Gln Trp Thr
35 40 45
Ser Leu Gly Ile Phe Leu Gly Gly Leu Gly Ile Gly Thr Gly Ser Gly
50 55 60
Thr Gly Gly Arg Thr Gly Tyr Ile Pro Leu Gly Gly Arg Thr Asn Thr
65 70 75 80
Ile Val Asp Val Ser Pro Ala Lys Pro Pro Val Val Ile Glu Pro Val
85 90 95
Gly Pro Thr Asp Pro Ser Ile Val Thr Leu Val Glu Asp Ser Ser Val
100 105 110
Ile Thr Ser Gly Ala Pro Ala Pro Thr Phe Thr Gly Thr Ser Gly Phe
115 120 125
Glu Ile Ser Thr Ser Ser Thr Thr Thr Pro Ala Val Leu Asp Ile Thr
130 135 140
Pro Thr Ser Ser Val Gln Ile Ser Ser Ser Ser Phe Ile Asn Pro Ala
145 150 155 160
Phe Thr Asp Pro Ser Val Ile Glu Val Pro Gln Thr Gly Glu Ile Ser
165 170 175
Gly Asn Ile Leu Ile Ser Thr Pro Thr Ser Gly Ala His Gly Tyr Glu
180 185 190
Glu Ile Pro Met Gln Thr Phe Ala Thr Glu Gly Thr Gly Leu Glu Pro
195 200 205
Ile Ser Ser Thr Pro Asn Pro Thr Val Arg Arg Val Ala Gly Pro Arg
210 215 220
Leu Tyr Ser Arg Ala Asn Gln Gln Val Arg Val Ser Asp Ala Asn Phe
225 230 235 240
Leu Thr Arg Pro Ser Thr Phe Val Thr Tyr Asp Asn Pro Ala Tyr Asp
245 250 255
Pro Ile Asp Thr Thr Leu Thr Phe Asp Pro Ser Ser Glu Val Pro Asp
260 265 270
Pro Asp Phe Met Asp Ile Val Arg Leu His Arg Pro Ala Leu Thr Ser
275 280 285
Arg Arg Ser Thr Val Arg Phe Ser Arg Leu Gly Gln Arg Ala Thr Met
290 295 300
Phe Thr Arg Ser Gly Lys Gln Ile Gly Ala Arg Val His Phe Tyr His
305 310 315 320
Asp Ile Ser Pro Ile Pro His Ala Glu Asn Ile Glu Leu Gln Pro Leu
325 330 335
Val Ser Ser Gln Ala Ala Thr Asp Asp Ile Tyr Asp Ile Tyr Ala Asp
340 345 350
Ile Thr Asp Glu Ala Pro Thr Ser Thr Ala Asn Thr Ala Phe Thr Ile
355 360 365
Pro Lys Ser Ser Phe Gln Ser Leu Ser Leu Thr Arg Ser Ala Ser Ser
370 375 380
Thr Phe Ser Asn Val Thr Val Pro Leu Ala Thr Ala Trp Asp Val Pro
385 390 395 400
Val Asn Thr Gly Pro Asp Ile Val Leu Pro Asn Thr Asn Ile Val Gly
405 410 415
Pro Thr Tyr Ser Thr Thr Pro Phe Thr Thr Ile Gln Ser Ile Asn Ile
420 425 430
Glu Gly Thr Asn Tyr Phe Leu Trp Pro Ile Tyr Tyr Phe Leu Pro Arg
435 440 445
Lys Arg Lys Arg Val Pro Tyr Phe Phe Thr Asp Gly Ser Met Ala Phe
450 455 460
<210> 3
<211> 21
<212> PRT
<213> HPV 59
<400> 3
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
Asn Lys Val Glu Gly
20
<210> 4
<211> 16
<212> PRT
<213> HPV 59
<400> 4
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
<210> 5
<211> 20
<212> PRT
<213> HPV 59
<400> 5
Asp Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val
1 5 10 15
Ile Asn Lys Val
20
<210> 6
<211> 22
<212> PRT
<213> HPV 59
<400> 6
Asp Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val
1 5 10 15
Ile Asn Lys Val Glu Gly
20
<210> 7
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES
<400> 7
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro Thr
485 490 495
Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Lys
500 505 510
Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520
<210> 8
<211> 491
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(491)
<223> 18L1ΔCDE137-138/59dES
<400> 8
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 9
<211> 527
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(527)
<223> 18L1DE137-138/59dE
<400> 9
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asp Leu Tyr Lys Thr Cys Lys
130 135 140
Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Asn Val Ser
145 150 155 160
Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu
165 170 175
Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly
180 185 190
Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu
195 200 205
Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly
210 215 220
Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val
225 230 235 240
Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln
245 250 255
Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg
260 265 270
Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly
275 280 285
Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala
290 295 300
Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val
305 310 315 320
Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala
325 330 335
Gln Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr
340 345 350
Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr
355 360 365
Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr
370 375 380
Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys
385 390 395 400
Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn
405 410 415
Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr
420 425 430
Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr
435 440 445
Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg
485 490 495
Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr
500 505 510
Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 10
<211> 495
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(495)
<223> 18L1ΔCDE137-138/59dE
<400> 10
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asp Leu Tyr Lys Thr Cys Lys
130 135 140
Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Asn Val Ser
145 150 155 160
Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu
165 170 175
Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly
180 185 190
Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu
195 200 205
Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly
210 215 220
Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val
225 230 235 240
Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln
245 250 255
Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg
260 265 270
Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly
275 280 285
Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala
290 295 300
Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val
305 310 315 320
Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala
325 330 335
Gln Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr
340 345 350
Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr
355 360 365
Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr
370 375 380
Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys
385 390 395 400
Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn
405 410 415
Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr
420 425 430
Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr
435 440 445
Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 11
<211> 528
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(528)
<223> 18L1h4432-433/59dE
<400> 11
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala
500 505 510
Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 12
<211> 496
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(496)
<223> 18L1ΔCh4432-433/59dE
<400> 12
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 13
<211> 528
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(528)
<223> 18L1h4434-435/59dE
<400> 13
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp
435 440 445
Val Ile Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala
500 505 510
Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 14
<211> 496
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(496)
<223> 18L1ΔCh4434-435/59dE
<400> 14
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp
435 440 445
Val Ile Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp
450 455 460
Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp
465 470 475 480
Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 15
<211> 526
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1DE134-135/59dES
<400> 15
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala
130 135 140
Gly Thr Cys Pro Ser Asp Val Ile Pro Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 16
<211> 494
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCDE134-135/59dES
<400> 16
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala
130 135 140
Gly Thr Cys Pro Ser Asp Val Ile Pro Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 17
<211> 532
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(532)
<223> 18L1DE134-135/59dE
<400> 17
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Asp Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Pro Ala
145 150 155 160
Thr Ser Asn Val Ser Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr
165 170 175
Lys Gln Thr Gln Leu Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu
180 185 190
His Trp Ala Lys Gly Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly
195 200 205
Asp Cys Pro Pro Leu Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp
210 215 220
Met Val Asp Thr Gly Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp
225 230 235 240
Thr Lys Cys Glu Val Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr
245 250 255
Pro Asp Tyr Leu Gln Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe
260 265 270
Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg
275 280 285
Ala Gly Thr Met Gly Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly
290 295 300
Thr Gly Met Arg Ala Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro
305 310 315 320
Ser Gly Ser Ile Val Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr
325 330 335
Trp Leu His Lys Ala Gln Gly His Asn Asn Gly Val Cys Trp His Asn
340 345 350
Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr
355 360 365
Ile Cys Ala Ser Thr Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr
370 375 380
Lys Phe Lys Gln Tyr Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe
385 390 395 400
Ile Phe Gln Leu Cys Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr
405 410 415
Ile His Ser Met Asn Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val
420 425 430
Pro Pro Pro Pro Thr Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln
435 440 445
Ser Val Ala Ile Thr Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys
450 455 460
Asp Pro Tyr Asp Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys
465 470 475 480
Phe Ser Leu Asp Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val
485 490 495
Gln Ala Gly Leu Arg Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser
500 505 510
Ala Pro Ser Ala Thr Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val
515 520 525
Arg Ala Arg Lys
530
<210> 18
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(500)
<223> 18L1ΔCDE134-135/59dE
<400> 18
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Gly Pro Asp Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Pro Ala
145 150 155 160
Thr Ser Asn Val Ser Glu Asp Val Arg Asp Asn Val Ser Val Asp Tyr
165 170 175
Lys Gln Thr Gln Leu Cys Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu
180 185 190
His Trp Ala Lys Gly Thr Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly
195 200 205
Asp Cys Pro Pro Leu Glu Leu Lys Asn Thr Val Leu Glu Asp Gly Asp
210 215 220
Met Val Asp Thr Gly Tyr Gly Ala Met Asp Phe Ser Thr Leu Gln Asp
225 230 235 240
Thr Lys Cys Glu Val Pro Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr
245 250 255
Pro Asp Tyr Leu Gln Met Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe
260 265 270
Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala Arg His Phe Trp Asn Arg
275 280 285
Ala Gly Thr Met Gly Asp Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly
290 295 300
Thr Gly Met Arg Ala Ser Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro
305 310 315 320
Ser Gly Ser Ile Val Thr Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr
325 330 335
Trp Leu His Lys Ala Gln Gly His Asn Asn Gly Val Cys Trp His Asn
340 345 350
Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Leu Thr
355 360 365
Ile Cys Ala Ser Thr Gln Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr
370 375 380
Lys Phe Lys Gln Tyr Ser Arg His Val Glu Glu Tyr Asp Leu Gln Phe
385 390 395 400
Ile Phe Gln Leu Cys Thr Ile Thr Leu Thr Ala Asp Val Met Ser Tyr
405 410 415
Ile His Ser Met Asn Ser Ser Ile Leu Glu Asp Trp Asn Phe Gly Val
420 425 430
Pro Pro Pro Pro Thr Thr Ser Leu Val Asp Thr Tyr Arg Phe Val Gln
435 440 445
Ser Val Ala Ile Thr Cys Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys
450 455 460
Asp Pro Tyr Asp Lys Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys
465 470 475 480
Phe Ser Leu Asp Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val
485 490 495
Gln Ala Gly Leu
500
<210> 19
<211> 524
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(524)
<223> 18L1DE131-138/59dE
<400> 19
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys
130 135 140
Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Val Ser Glu Asp Val
145 150 155 160
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
165 170 175
Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys
180 185 190
Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys
195 200 205
Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
210 215 220
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
225 230 235 240
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
245 250 255
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
260 265 270
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val
275 280 285
Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly
290 295 300
Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp
305 310 315 320
Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His
325 330 335
Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp
340 345 350
Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro
355 360 365
Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His
370 375 380
Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr
385 390 395 400
Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile
405 410 415
Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu
420 425 430
Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys
435 440 445
Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe
450 455 460
Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr
465 470 475 480
Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg Lys Pro
485 490 495
Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser
500 505 510
Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520
<210> 20
<211> 492
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(492)
<400> 20
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Gly Pro Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys
130 135 140
Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Val Ser Glu Asp Val
145 150 155 160
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
165 170 175
Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys
180 185 190
Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys
195 200 205
Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
210 215 220
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
225 230 235 240
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
245 250 255
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
260 265 270
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val
275 280 285
Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly
290 295 300
Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp
305 310 315 320
Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His
325 330 335
Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp
340 345 350
Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro
355 360 365
Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His
370 375 380
Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr
385 390 395 400
Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile
405 410 415
Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu
420 425 430
Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys
435 440 445
Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe
450 455 460
Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr
465 470 475 480
Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 21
<211> 526
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1DE121-124/59dE
<400> 21
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Lys Thr Cys Lys Gln
115 120 125
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Lys
130 135 140
Leu Asp Asp Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 22
<211> 494
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCDE121-124/59dE
<400> 22
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Lys Thr Cys Lys Gln
115 120 125
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Lys Val Glu Gly Asn Lys
130 135 140
Leu Asp Asp Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu
145 150 155 160
Asp Val Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys
165 170 175
Ile Leu Gly Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr
180 185 190
Ala Cys Lys Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu
195 200 205
Leu Lys Asn Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr
210 215 220
Gly Ala Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro
225 230 235 240
Leu Asp Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met
245 250 255
Ser Ala Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu
260 265 270
Gln Leu Phe Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp
275 280 285
Thr Val Pro Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser
290 295 300
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr
305 310 315 320
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
325 330 335
Gly His Asn Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val
340 345 350
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln
355 360 365
Ser Pro Val Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser
370 375 380
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
385 390 395 400
Ile Thr Leu Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser
405 410 415
Ser Ile Leu Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
420 425 430
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys
435 440 445
Gln Lys Asp Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 23
<211> 527
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(527)
<223> 18L1h4431-433/59dE
<400> 23
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Leu
420 425 430
Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn
435 440 445
Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg
485 490 495
Arg Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr
500 505 510
Thr Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 24
<211> 495
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(495)
<223> 18L1ΔCh4431-433/59dE
<400> 24
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Leu
420 425 430
Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile Asn
435 440 445
Lys Val Glu Gly Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys
450 455 460
Leu Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu
465 470 475 480
Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490 495
<210> 25
<211> 526
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(526)
<223> 18L1h4432-435/59dE
<400> 25
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Arg
485 490 495
Lys Pro Thr Ile Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr
500 505 510
Ser Ser Lys Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys
515 520 525
<210> 26
<211> 494
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(494)
<223> 18L1ΔCh4432-435/59dE
<400> 26
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
435 440 445
Asn Lys Val Glu Gly Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu
450 455 460
Lys Phe Trp Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp
465 470 475 480
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
485 490
<210> 27
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut1
<400> 27
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 28
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut2
<400> 28
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 29
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut3
<400> 29
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Gly Val Asp Ala Gly Lys
515 520
<210> 30
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut4
<400> 30
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Gly Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Asp Ser Val Gly Val Asp Ala Gly Lys
515 520
<210> 31
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut5
<400> 31
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Lys Pro Thr
485 490 495
Ile Gly Pro Gly Ser Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 32
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(523)
<223> 18L1DE137-138/59dES-mut6
<400> 32
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Leu Tyr Lys Thr Cys Lys Gln
130 135 140
Ala Gly Thr Cys Pro Ser Asp Val Ile Asn Val Ser Glu Asp Val Arg
145 150 155 160
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
165 170 175
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
180 185 190
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
195 200 205
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
210 215 220
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
225 230 235 240
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
245 250 255
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
260 265 270
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
275 280 285
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
290 295 300
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
325 330 335
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
355 360 365
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
370 375 380
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
385 390 395 400
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
405 410 415
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
420 425 430
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
435 440 445
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
450 455 460
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
465 470 475 480
Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Arg Gly Lys Pro Thr
485 490 495
Ile Gly Pro Arg Gly Ser Ser Ala Pro Ser Ala Thr Thr Ser Ser Gly
500 505 510
Pro Ala Gly Ser Val Ser Val Gly Ala Gly Lys
515 520
<210> 33
<211> 1477
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1477)
<223> 18L1ΔCDE137-138/59dES nt
<400> 33
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctctaat 1477
<210> 34
<211> 1489
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1489)
<223> 18L1ΔCDE137-138/59dE nt
<400> 34
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cgacctgtac 420
aagacctgca agcaggccgg tacctgcccc tccgacgtca tcaacaaggt caacgtcagc 480
gaagatgtcc gcgataacgt cagcgtggac tacaaacaaa cccaactgtg catcctcggt 540
tgcgcacccg ccatcggcga gcattgggcc aagggtaccg cctgcaagag caggcccctg 600
agccaaggtg actgtccacc cctggagttg aagaataccg tcctcgagga cggcgacatg 660
gtggacaccg gctacggcgc aatggatttc tccaccctgc aggacaccaa gtgcgaagtg 720
cccctcgaca tctgccaaag catctgcaag taccccgact acctgcagat gagcgccgac 780
ccctacggcg actccatgtt cttctgtctg agaagggaac aattgttcgc ccgccacttc 840
tggaaccgcg ccggcaccat gggcgatacc gtcccccagt ccctgtacat caagggtacc 900
ggcatgaggg ccagccccgg ttcatgcgtc tacagcccaa gcccctccgg tagcatcgtc 960
acaagcgatt cccaactctt caacaagccc tactggctgc acaaagccca aggccacaat 1020
aacggcgtct gttggcacaa ccagctgttc gtcaccgtcg tggacacaac caggtccaca 1080
aacctgacca tctgcgccag cacccaaagc cccgtgcccg gccagtacga cgccacaaag 1140
ttcaaacaat actcacgcca cgtcgaagag tacgacctcc aattcatctt ccaactctgc 1200
accatcaccc tgaccgccga cgtcatgtcc tacatccact ccatgaactc atccatcctg 1260
gaagactgga atttcggcgt cccaccaccc cccaccacct ccctcgtcga cacctacagg 1320
ttcgtgcaga gcgtcgccat cacatgccag aaagacgccg cccccgccga gaacaaagac 1380
ccatacgaca aactgaaatt ctggaacgtc gacctgaaag agaaattcag cctggatctg 1440
gaccagtacc cattgggcag gaagttcctc gtccaggcgg gtctctaat 1489
<210> 35
<211> 1486
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1486)
<223> 18L1ΔCDE134-135/59dES nt
<400> 35
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg caggaccact gtacaagacc 420
tgcaagcagg ccggtacctg cccctccgac gtcatcccag caaccagcaa cgtcagcgaa 480
gatgtccgcg ataacgtcag cgtggactac aaacaaaccc aactgtgcat cctcggttgc 540
gcacccgcca tcggcgagca ttgggccaag ggtaccgcct gcaagagcag gcccctgagc 600
caaggtgact gtccacccct ggagttgaag aataccgtcc tcgaggacgg cgacatggtg 660
gacaccggct acggcgcaat ggatttctcc accctgcagg acaccaagtg cgaagtgccc 720
ctcgacatct gccaaagcat ctgcaagtac cccgactacc tgcagatgag cgccgacccc 780
tacggcgact ccatgttctt ctgtctgaga agggaacaat tgttcgcccg ccacttctgg 840
aaccgcgccg gcaccatggg cgataccgtc ccccagtccc tgtacatcaa gggtaccggc 900
atgagggcca gccccggttc atgcgtctac agcccaagcc cctccggtag catcgtcaca 960
agcgattccc aactcttcaa caagccctac tggctgcaca aagcccaagg ccacaataac 1020
ggcgtctgtt ggcacaacca gctgttcgtc accgtcgtgg acacaaccag gtccacaaac 1080
ctgaccatct gcgccagcac ccaaagcccc gtgcccggcc agtacgacgc cacaaagttc 1140
aaacaatact cacgccacgt cgaagagtac gacctccaat tcatcttcca actctgcacc 1200
atcaccctga ccgccgacgt catgtcctac atccactcca tgaactcatc catcctggaa 1260
gactggaatt tcggcgtccc accacccccc accacctccc tcgtcgacac ctacaggttc 1320
gtgcagagcg tcgccatcac atgccagaaa gacgccgccc ccgccgagaa caaagaccca 1380
tacgacaaac tgaaattctg gaacgtcgac ctgaaagaga aattcagcct ggatctggac 1440
cagtacccat tgggcaggaa gttcctcgtc caggcgggtc tctaat 1486
<210> 36
<211> 1504
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1504)
<223> 18L1ΔCDE134-135/59dE nt
<400> 36
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg caggaccaga cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acaaggtcga aggaccagca 480
accagcaacg tcagcgaaga tgtccgcgat aacgtcagcg tggactacaa acaaacccaa 540
ctgtgcatcc tcggttgcgc acccgccatc ggcgagcatt gggccaaggg taccgcctgc 600
aagagcaggc ccctgagcca aggtgactgt ccacccctgg agttgaagaa taccgtcctc 660
gaggacggcg acatggtgga caccggctac ggcgcaatgg atttctccac cctgcaggac 720
accaagtgcg aagtgcccct cgacatctgc caaagcatct gcaagtaccc cgactacctg 780
cagatgagcg ccgaccccta cggcgactcc atgttcttct gtctgagaag ggaacaattg 840
ttcgcccgcc acttctggaa ccgcgccggc accatgggcg ataccgtccc ccagtccctg 900
tacatcaagg gtaccggcat gagggccagc cccggttcat gcgtctacag cccaagcccc 960
tccggtagca tcgtcacaag cgattcccaa ctcttcaaca agccctactg gctgcacaaa 1020
gcccaaggcc acaataacgg cgtctgttgg cacaaccagc tgttcgtcac cgtcgtggac 1080
acaaccaggt ccacaaacct gaccatctgc gccagcaccc aaagccccgt gcccggccag 1140
tacgacgcca caaagttcaa acaatactca cgccacgtcg aagagtacga cctccaattc 1200
atcttccaac tctgcaccat caccctgacc gccgacgtca tgtcctacat ccactccatg 1260
aactcatcca tcctggaaga ctggaatttc ggtgtcccac caccccccac cacctccctc 1320
gtcgacacct acaggttcgt gcagagcgtc gccatcacat gccagaaaga cgccgccccc 1380
gccgagaaca aagacccata cgacaaactg aaattctgga acgtcgacct gaaagagaaa 1440
ttcagcctgg atctggacca gtacccattg ggcaggaagt tcctcgtcca ggcgggtctc 1500
taat 1504
<210> 37
<211> 1480
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1480)
<223> 18L1ΔCDE131-138/59dE nt
<400> 37
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tccggtcccc tgtacaagac ctgcaagcag 420
gccggtacct gcccctccga cgtcatcaac aaggtcgaag gaaacgtcag cgaagatgtc 480
cgcgataacg tcagcgtgga ctacaaacaa acccaactgt gcatcctcgg ttgcgcaccc 540
gccatcggcg agcattgggc caagggtacc gcctgcaaga gcaggcccct gagccaaggt 600
gactgtccac ccctggagtt gaagaatacc gtcctcgagg acggcgacat ggtggacacc 660
ggctacggcg caatggattt ctccaccctg caggacacca agtgcgaagt gcccctcgac 720
atctgccaaa gcatctgcaa gtaccccgac tacctgcaga tgagcgccga cccctacggc 780
gactccatgt tcttctgtct gagaagggaa caattgttcg cccgccactt ctggaaccgc 840
gccggcacca tgggcgatac cgtcccccag tccctgtaca tcaagggtac cggcatgagg 900
gccagccccg gttcatgcgt ctacagccca agcccctccg gtagcatcgt cacaagcgat 960
tcccaactct tcaacaagcc ctactggctg cacaaagccc aaggccacaa taacggcgtc 1020
tgttggcaca accagctgtt cgtcaccgtc gtggacacaa ccaggtccac aaacctgacc 1080
atctgcgcca gcacccaaag ccccgtgccc ggccagtacg acgccacaaa gttcaaacaa 1140
tactcacgcc acgtcgaaga gtacgacctc caattcatct tccaactctg caccatcacc 1200
ctgaccgccg acgtcatgtc ctacatccac tccatgaact catccatcct ggaagactgg 1260
aatttcggcg tcccaccacc ccccaccacc tccctcgtcg acacctacag gttcgtgcag 1320
agcgtcgcca tcacatgcca gaaagacgcc gcccccgccg agaacaaaga cccatacgac 1380
aaactgaaat tctggaacgt cgacctgaaa gagaaattca gcctggatct ggaccagtac 1440
ccattgggca ggaagttcct cgtccaggcg ggtctctaat 1480
<210> 38
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut1 nt
<400> 38
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 39
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut2 nt
<400> 39
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 40
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut3 nt
<400> 40
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gggcgtggac 1560
gctggcaaat aat 1573
<210> 41
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut4 nt
<400> 41
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggcg gtccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccgacagcgt gggcgtggac 1560
gctggcaaat aat 1573
<210> 42
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut5 nt
<400> 42
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggca aaccgacgat tggccctggc 1500
tctcgttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 43
<211> 1573
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<221> 外显子
<222> (1)..(1573)
<223> 18L1DE137-138/59dES-mut6 nt
<400> 43
atggctctct ggagaccctc cgataacaca gtgtacttgc ccccccccag cgtcgcccgc 60
gtcgtgaaca cagacgacta cgtcaccagg acctcaatct tctaccacgc cggttcaagc 120
cgcctgctga ccgtcggcaa cccctacttc cgcgtccccg ccggtggcgg taacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgcgtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcatgcgccg gcgtcgagat cggtaggggc caacccctgg gcgtcggttt gtccggccac 360
cccttctaca acaagctgga cgataccgag tcctcccacg cagcaaccag cctgtacaag 420
acctgcaagc aggccggtac ctgcccctcc gacgtcatca acgtcagcga agatgtccgc 480
gataacgtca gcgtggacta caaacaaacc caactgtgca tcctcggttg cgcacccgcc 540
atcggcgagc attgggccaa gggtaccgcc tgcaagagca ggcccctgag ccaaggtgac 600
tgtccacccc tggagttgaa gaataccgtc ctcgaggacg gcgacatggt ggacaccggc 660
tacggcgcaa tggatttctc caccctgcag gacaccaagt gcgaagtgcc cctcgacatc 720
tgccaaagca tctgcaagta ccccgactac ctgcagatga gcgccgaccc ctacggcgac 780
tccatgttct tctgtctgag aagggaacaa ttgttcgccc gccacttctg gaaccgcgcc 840
ggcaccatgg gcgataccgt cccccagtcc ctgtacatca agggtaccgg catgagggcc 900
agccccggtt catgcgtcta cagcccaagc ccctccggta gcatcgtcac aagcgattcc 960
caactcttca acaagcccta ctggctgcac aaagcccaag gccacaataa cggcgtctgt 1020
tggcacaacc agctgttcgt caccgtcgtg gacacaacca ggtccacaaa cctgaccatc 1080
tgcgccagca cccaaagccc cgtgcccggc cagtacgacg ccacaaagtt caaacaatac 1140
tcacgccacg tcgaagagta cgacctccaa ttcatcttcc aactctgcac catcaccctg 1200
accgccgacg tcatgtccta catccactcc atgaactcat ccatcctgga agactggaat 1260
ttcggcgtcc caccaccccc caccacctcc ctcgtcgaca cctacaggtt cgtgcagagc 1320
gtcgccatca catgccagaa agacgccgcc cccgccgaga acaaagaccc atacgacaaa 1380
ctgaaattct ggaacgtcga cctgaaagag aaattcagcc tggatctgga ccagtaccca 1440
ttgggcagga agttcctcgt ccaggcgggt ctccgtggca aaccgacgat tggccctcgt 1500
ggctcttctg ccccgtcggc cacgaccagc agcggccctg ccggtagcgt gagcgtgggc 1560
gctggcaaat aat 1573
<210> 44
<211> 23
<212> PRT
<213> HPV 35
<400> 44
Thr Gln Leu Tyr Arg Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Pro Lys Val Glu Gly
20
<210> 45
<211> 23
<212> PRT
<213> HPV 39
<400> 45
Ser Thr Leu Tyr Arg Thr Cys Lys Gln Ser Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Val Asp Lys Val Glu Gly
20
<210> 46
<211> 23
<212> PRT
<213> HPV 51
<400> 46
Thr Gln Leu Tyr Ser Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Val Asn Lys Val Glu Gly
20
<210> 47
<211> 23
<212> PRT
<213> HPV 53
<400> 47
Thr Gln Leu Tyr Gln Thr Cys Lys Gln Ser Gly Thr Cys Pro Glu Asp
1 5 10 15
Val Ile Asn Lys Ile Glu His
20
<210> 48
<211> 23
<212> PRT
<213> HPV 56
<400> 48
Thr Gln Leu Tyr Lys Thr Cys Lys Leu Ser Gly Thr Cys Pro Glu Asp
1 5 10 15
Val Val Asn Lys Ile Glu Gln
20
<210> 49
<211> 23
<212> PRT
<213> HPV 59
<400> 49
Leu Tyr Lys Thr Cys Lys Gln Ala Gly Thr Cys Pro Ser Asp Val Ile
1 5 10 15
Asn Lys Val Glu Gly Thr Thr
20
<210> 50
<211> 23
<212> PRT
<213> HPV 68
<400> 50
Ser Thr Leu Tyr Lys Thr Cys Lys Gln Ser Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Asn Lys Val Glu Gly
20
<210> 51
<211> 23
<212> PRT
<213> HPV 82
<400> 51
Thr Gln Leu Tyr Ser Thr Cys Lys Ala Ala Gly Thr Cys Pro Pro Asp
1 5 10 15
Val Ile Pro Lys Val Lys Gly
20
Claims (10)
1.一种人乳头瘤病毒嵌合蛋白,其包含HPV18型L1蛋白或HPV18型L1蛋白的突变体以及插入所述HPV18型L1蛋白或HPV18型L1蛋白的突变体的表面区的来自HPV59型L2蛋白的多肽、或由其组成,其中所述HPV18型L1蛋白的氨基酸序列如SEQ ID NO.1所示,所述HPV59型L2蛋白的氨基酸序列如SEQ ID NO.2所示,优选地,所述HPV18型L1蛋白的突变体选自以下任一种:
将SEQ ID No.1所示的氨基酸序列的C端截短32个氨基酸的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、504、506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、499、504、506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、484、496、499、502、506置换为甘氨酸(G)、将氨基酸485、500置换为丝氨酸(S)且将氨基酸504置换为天冬氨酸(D)的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、478、485、496、502、506置换为甘氨酸(G)、将氨基酸486、500置换为丝氨酸(S)且将氨基酸499、504置换为天冬氨酸(D)的突变体;
将SEQ ID No.1所示氨基酸序列的氨基酸477、484、496、499、504、506置换为甘氨酸(G)且将氨基酸485、500、502置换为丝氨酸(S)的突变体;和
将SEQ ID No.1所示氨基酸序列的氨基酸477、485、496、499、504、506置换为甘氨酸(G)且将氨基酸486、500、502置换为丝氨酸(S)的突变体;
更优选地,所述来自HPV59型L2蛋白的多肽选自SEQ ID No.3、SEQ ID No.4、SEQ IDNo.5或SEQ ID No.6中任一项所示的多肽;
进一步优选地,
所述来自HPV59型L2蛋白的多肽插入所述HPV18型L1蛋白或HPV18型L1蛋白的突变体的表面区,所述表面区为DE环和/或h4区,优选地,所述来自HPV59型L2蛋白的多肽通过直接插入的方式插入所述HPV18型L1蛋白或HPV18型L1蛋白的突变体的氨基酸137和138之间、氨基酸432和433之间、或氨基酸434和435之间,或者,所述来自HPV59型L2蛋白的多肽通过非等长置换的方式插入所述HPV18型L1蛋白或HPV18型L1蛋白的突变体的氨基酸121至124区域、或氨基酸131至138区域、或氨基酸431-433区域、或氨基酸432-435区域;
更优选地,
其中所述来自HPV59型L2蛋白的多肽在其N端和/或C端包含1至3个氨基酸残基的连接子,所述连接子包含一种或多种选自以下的氨基酸:甘氨酸、丝氨酸、丙氨酸和脯氨酸,优选地,在N端的连接子由甘氨酸-脯氨酸组成,并且在C端的连接子由脯氨酸组成。
2.根据权利要求1所述的人乳头瘤病毒嵌合蛋白,其中所述人乳头瘤病毒嵌合蛋白的氨基酸序列如SEQ ID NO:7-32中的任一项所示。
3.一种多核苷酸,其编码权利要求1或2所述的人乳头瘤病毒嵌合蛋白,优选地,所述多核苷酸的序列采用大肠杆菌密码子进行全基因优化或采用昆虫细胞密码子进行全基因优化,更优选地,所述多核苷酸的序列如SEQ ID No.33、SEQ ID No.34、SEQ ID No.35、SEQ IDNo.36、SEQ ID No.37、SEQ ID No.38、SEQ ID No.39、SEQ ID No.40、SEQ ID No.41、SEQ IDNo.42和SEQ ID No.43中任一项所示。
4.一种载体,其包含如权利要求3所述的多核苷酸。
5.一种细胞,其包含如权利要求4所述的载体。
6.一种多聚物,该多聚物为嵌合五聚体或嵌合病毒样颗粒,其含有权利要求1或2所述的人乳头瘤病毒嵌合蛋白,或者由权利要求1或2所述的人乳头瘤病毒嵌合蛋白所形成。
7.如权利要求1或2所述的人乳头瘤病毒嵌合蛋白或如权利要求6所述的多聚物在制备预防乳头瘤病毒感染和/或乳头瘤病毒感染诱发的疾病的疫苗中的用途,优选地,乳头瘤病毒感染诱发的疾病选自宫颈癌、阴道癌、阴唇癌、阴茎癌、肛门肛周癌、口咽癌、扁桃体癌和口腔癌;
优选地,所述乳头瘤病毒感染为一种或多种选自以下乳头瘤病毒型别的感染:HPV16、HPV18、HPV26、HPV31、HPV33、HPV35、HPV39、HPV45、HPV51、HPV52、HPV53、HPV56、HPV58、HPV59、HPV66、HPV68、HPV70、HPV73;HPV6、HPV11、HPV2、HPV5、HPV27和HPV57。
8.一种用于预防乳头瘤病毒感染和/或乳头瘤病毒感染诱发的疾病的疫苗,其包含权利要求1或2所述的人乳头瘤病毒嵌合蛋白或如权利要求6所述的多聚物、佐剂、以及疫苗用赋形剂或载体。
9.根据权利要求8所述的用于预防乳头瘤病毒感染和/或乳头瘤病毒感染诱发的疾病的疫苗,还包含至少一种嗜黏膜组和/或嗜皮肤组的HPV的病毒样颗粒或嵌合病毒样颗粒。
10.根据权利要求8或9所述的用于预防乳头瘤病毒感染和/或乳头瘤病毒感染诱发的疾病的疫苗,其中所述佐剂为人用佐剂。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110002251.3A CN114716560B (zh) | 2021-01-04 | 2021-01-04 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
PCT/CN2021/120583 WO2022142523A1 (zh) | 2021-01-04 | 2021-09-26 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
EP21913268.5A EP4261232A4 (en) | 2021-01-04 | 2021-09-26 | CHIMERIC PROTEIN TYPE 18 OF THE HUMAN PAPILLOMAVIRUS AND ITS USE |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110002251.3A CN114716560B (zh) | 2021-01-04 | 2021-01-04 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114716560A true CN114716560A (zh) | 2022-07-08 |
CN114716560B CN114716560B (zh) | 2024-02-02 |
Family
ID=82233870
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110002251.3A Active CN114716560B (zh) | 2021-01-04 | 2021-01-04 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4261232A4 (zh) |
CN (1) | CN114716560B (zh) |
WO (1) | WO2022142523A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114716560B (zh) * | 2021-01-04 | 2024-02-02 | 中国医学科学院基础医学研究所 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110229489A1 (en) * | 2010-03-11 | 2011-09-22 | Rinat Neuroscience Corp. | Antibodies with pH Dependent Antigen Binding |
CN102497880A (zh) * | 2009-06-25 | 2012-06-13 | 葛兰素史密丝克莱恩生物有限公司 | 新的人乳头状瘤病毒(hpv)蛋白构建体及其在预防hpv疾病中的用途 |
US20190060241A1 (en) * | 2016-04-13 | 2019-02-28 | Medimmune, Llc | Use of amino acids as stabilizing compounds in pharmaceutical compositions containing high concentrations of protein-based therapeutic agents |
CN111662389A (zh) * | 2020-06-05 | 2020-09-15 | 广州中医药大学(广州中医药研究院) | 一种SARS-CoV-2的融合蛋白及其疫苗组合物 |
WO2022142523A1 (zh) * | 2021-01-04 | 2022-07-07 | 中国医学科学院基础医学研究所 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0413510D0 (en) | 2004-06-16 | 2004-07-21 | Glaxosmithkline Biolog Sa | Vaccine |
CN101148661B (zh) | 2006-09-18 | 2013-01-02 | 中国医学科学院基础医学研究所 | 人乳头瘤病毒16型外壳蛋白病毒样颗粒及其制备方法和用途 |
BRPI0811016B1 (pt) | 2007-04-29 | 2021-09-21 | Xiamen Innovax Biotech Co., Ltd. | Proteína li truncada do papiloma vírus humano tipo 16 |
MX2009014246A (es) * | 2007-06-26 | 2010-03-31 | Japan Health Science Found | Antigeno de vacuna con la capacidad de inducir anticuerpo de neutralizacion y reaccion cruzada contra virus de papiloma humano del tipo de alto riesgo. |
CN102153656B (zh) * | 2011-01-12 | 2014-11-19 | 广州市元通医药科技有限公司 | 一种嵌合病毒样颗粒疫苗及其制备方法 |
CN104418942A (zh) | 2013-08-30 | 2015-03-18 | 长春百克生物科技股份公司 | 截短的人乳头瘤病毒的l1蛋白、其类病毒颗粒及其制备方法和应用 |
CN112280792B (zh) | 2013-09-29 | 2022-06-24 | 上海泽润生物科技有限公司 | 人乳头瘤病毒基因,及载体,菌株,表达方法 |
CN107188966B (zh) * | 2016-03-15 | 2020-03-31 | 中国医学科学院基础医学研究所 | 一种乳头瘤病毒嵌合蛋白及其用途 |
CN107188967B (zh) * | 2016-03-15 | 2020-03-31 | 中国医学科学院基础医学研究所 | 一种乳头瘤病毒嵌合蛋白及其用途 |
CN108676057A (zh) | 2018-06-19 | 2018-10-19 | 南京肽业生物科技有限公司 | 固相多肽合成装置 |
-
2021
- 2021-01-04 CN CN202110002251.3A patent/CN114716560B/zh active Active
- 2021-09-26 WO PCT/CN2021/120583 patent/WO2022142523A1/zh active Application Filing
- 2021-09-26 EP EP21913268.5A patent/EP4261232A4/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102497880A (zh) * | 2009-06-25 | 2012-06-13 | 葛兰素史密丝克莱恩生物有限公司 | 新的人乳头状瘤病毒(hpv)蛋白构建体及其在预防hpv疾病中的用途 |
US20110229489A1 (en) * | 2010-03-11 | 2011-09-22 | Rinat Neuroscience Corp. | Antibodies with pH Dependent Antigen Binding |
US20190060241A1 (en) * | 2016-04-13 | 2019-02-28 | Medimmune, Llc | Use of amino acids as stabilizing compounds in pharmaceutical compositions containing high concentrations of protein-based therapeutic agents |
CN111662389A (zh) * | 2020-06-05 | 2020-09-15 | 广州中医药大学(广州中医药研究院) | 一种SARS-CoV-2的融合蛋白及其疫苗组合物 |
WO2022142523A1 (zh) * | 2021-01-04 | 2022-07-07 | 中国医学科学院基础医学研究所 | 一种人乳头瘤病毒18型嵌合蛋白及其用途 |
Non-Patent Citations (3)
Title |
---|
CHEN, Z.等: "L2 [human papillomavirus 59]", 《GENBANK》, vol. 90687 * |
MATHIEU BOXUS等: "Broad Cross-Protection Is Induced in Preclinical Models by a Human Papillomavirus Vaccine Composed of L1/L2 Chimeric Virus-Like Particles", JOURNAL OF VIROLOGY, vol. 90, no. 14, pages 6315 - 6325 * |
VAN DER WEELE, P.等: "L1 [human papillomavirus 18] - Protein", 《GENBANK》, vol. 15086, pages 308 * |
Also Published As
Publication number | Publication date |
---|---|
EP4261232A1 (en) | 2023-10-18 |
EP4261232A4 (en) | 2024-07-17 |
CN114716560B (zh) | 2024-02-02 |
WO2022142523A1 (zh) | 2022-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107188966B (zh) | 一种乳头瘤病毒嵌合蛋白及其用途 | |
Wang et al. | Human papillomavirus vaccine against cervical cancer: Opportunity and challenge | |
CN107188967B (zh) | 一种乳头瘤病毒嵌合蛋白及其用途 | |
US7976848B2 (en) | Optimized expression of HPV 58 L1 in yeast | |
US20120087937A1 (en) | Novel compositions | |
Tyler et al. | Immunization with a consensus epitope from human papillomavirus L2 induces antibodies that are broadly neutralizing | |
WO2022142525A1 (zh) | 一种人乳头瘤病毒58型嵌合蛋白及其用途 | |
US8039001B2 (en) | Therapeutic and prophylactic vaccine for the treatment and prevention of papillomavirus infection | |
WO2022111021A1 (zh) | 一种c端改造的人乳头瘤病毒11型l1蛋白及其用途 | |
Chen et al. | Human papillomavirus 16L1-58L2 chimeric virus-like particles elicit durable neutralizing antibody responses against a broad-spectrum of human papillomavirus types | |
CN114716560B (zh) | 一种人乳头瘤病毒18型嵌合蛋白及其用途 | |
EP4273174A1 (en) | Human papillomavirus type 31 chimeric protein and use thereof | |
WO2022111020A1 (zh) | 一种c端改造的人乳头瘤病毒6型l1蛋白及其用途 | |
WO2022111022A1 (zh) | 一种改造的人乳头瘤病毒52型l1蛋白及其用途 | |
US8715681B2 (en) | Minimal motifs of linear B-cell epitopes in L1 protein from human papillomavirus type 58 and their applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |