CN1954068A - Hla-结合肽,其前体,编码这些肽序列的dna片段和重组载体 - Google Patents
Hla-结合肽,其前体,编码这些肽序列的dna片段和重组载体 Download PDFInfo
- Publication number
- CN1954068A CN1954068A CN200580013767.7A CN200580013767A CN1954068A CN 1954068 A CN1954068 A CN 1954068A CN 200580013767 A CN200580013767 A CN 200580013767A CN 1954068 A CN1954068 A CN 1954068A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- val
- gly
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Virology (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
本发明提供一种与HLA-A型分子结合的HLA-结合肽,所述HLA-结合肽包含至少一种选自SEQ ID NOS:1-183的氨基酸序列,不少于8个且不多于11个氨基酸残基。通过预测程序,利用图1中所述的积极学习实验方法,所有氨基酸序列被预测具有与人HLA-A型分子的结合性质。
Description
技术领域
本发明涉及HLA-结合肽,其前体,编码这些肽序列的DNA片段和重组载体。
背景技术
当感染病毒,诸如丙型肝炎病毒(HCV)时,在天生免疫系统防卫之后,就会诱导病毒特异的免疫反应以消除所述病毒。
当诱导特异性免疫反应时,分离的病毒粒子通过中和抗体丧失其传染性,并随后被消除。换言之,病毒感染的细胞由细胞毒性T淋巴细胞(CTLs)裂解。CTL将HLA I型分子呈递的表位肽识别成抗原。这种表位肽长度上有8-11个氨基酸。因此,为了开发针对病毒的治疗性疫苗,至关重要的是鉴别病毒表位肽。
这种技术从专利公报1是公知的。专利公报1描述了形成自特异性氨基酸序列的寡肽具有与HLA结合的性质。
[专利公报1]日本专利申请公开No.H8-151396(1996)
发明内容
然而,上述专利公报中所述的常规技术在下述几点尚有改进的余地。
首先,不清楚上述公报中的HLA-结合肽是否与HLA分子有效结合,因此仍有改进与HLA结合的性质的余地。
其次,上述公报叙述了其中的HLA-结合肽具有与HLA-DQ4结合的性质。然而,尚不清楚该肽是否与HLA-A2型分子(HLA-A*0201基因产物等)结合,该分子在欧美人中是常见的,以及是否与HLA-A24型分子(HLA-A*2402基因产物等)结合,该分子在日本人是常见的。
在上述情形下,实现了本发明,并且提供对特异类型的HLA分子显示高亲和性结合的HLA-结合肽。
根据本发明,提供了与HLA-A型分子结合的HLA-结合肽,其含有至少一种氨基酸序列并且由不少于8个且不多于11个氨基酸残基组成,所述氨基酸序列选自SEQ ID NOS:1-183。
而且,根据本发明,提供了含有至少一种氨基酸序列的HLA-结合肽,所述氨基酸序列选自SEQ ID NOS:1,2,3,5,8,12,13,14,16,17,18,19,22,23,25,27,34,37,38,40,42,45,48,49,52,53,54,55,56,58,59,60,61,62,63,64,65,67,71,72,74,75,76,84,86,87,90,91,92,93,94,96,97,98,100,101,102,104,106,107,108,109,110,112,123,124,126,127,131,132,133,134,135,136,137,139,141,142,146,147,149,150,152,162,170,173,176,177,和179。
此外,根据本发明,提供了与HLA-A型分子结合的HLA-结合肽,其含有一种氨基酸序列并且由不少于8个且不多于11个氨基酸残基,所述氨基酸序列是在上述HLA-结合肽中所含的氨基酸序列基础上,通过缺失,替换,或添加一个或两个氨基酸残基而形成的。
以此,含有一种氨基酸序列的构建体也可显示与上述HLA-结合肽相似的效果,所述氨基酸序列是在具有与HLA-A型分子结合的性质的特异氨基酸序列中,通过缺失,替换,或添加一个或数个氨基酸残基而形成的。
而且,根据本发明,提供了含有为上述HLA-结合肽编码的DNA序列的DNA区段。
此外,根据本发明,提供了含有为上述HLA-结合肽编码的DNA序列的重组载体。
而且,根据本发明,提供了在哺乳动物体内变成上述HLA-结合肽的HLA-结合肽前体。
根据本发明,由于包括特异氨基酸序列,可获得在与HLA-A型分子结合方面具有优异性质的HLA-结合肽。
附图简述
上述目的,其他目的,特征和优点从下文所述的优选实施方案,参照附图可变得更加显而易见。
[图1]示意图用于说明实施方案中所用的积极学习实验设计。
实施本发明的最佳方式
实施本发明的方式参照附图解释如下。在所有附图中,相同的组成元件以相同的标号和符号表示,以便不会重复解释。
<实施方案1>
在该实施方案中,含有一种氨基酸序列并由不少于8个且不多于11个氨基酸残基组成的肽用作HLA-结合肽的候选物,利用积极学习实验方法(日本专利申请公开No.H11-316754(1999))得到假设,通过该假设预测,所述氨基酸序列与HLA分子结合的-log Kd值为3或以上。根据结合实验的结果,已确认这些肽真正为HLA-结合肽。
结果,可有效获得大量HLA-结合肽,由于它们含有一种氨基酸序列,其在与HLA-A型分子结合方面具有优异性质,所述氨基酸序列与HLA分子结合的-log Kd值为3或以上。
具体而言,与该实施方案有关的HLA-结合肽为与HLA-A型分子结合,含有至少一种氨基酸序列(选自SEQ ID NOS:1-183,这将在下文描述),并由不少于8个且不多于11个氨基酸残基组成的HLA-结合肽。
在人HLA-A型中,约50%的日本人为HLA-A24型。对于欧美人,诸如德国人,为HLA-A2型。
所有在此所述的序列为由HCV(丙型肝炎病毒)某基因组蛋白中所含的9个氨基酸残基组成的序列。
序列SEQ ID NOS:1-44示于下表1。
表1
HLA-A24-结合肽
SEQ ID NO | D90208预测分 | 预测分 | SEQ名称 | 结合实验数据 |
1 | ILPCSFTFL | 6.9039 | 674 | 7.6571 |
2 | VILDSFDPI | 6.293 | 2251 | 5.32417 |
3 | RYAPVCKPL | 6.2755 | 2132 | 6.14848 |
4 | FWAKHMWNF | 6.0822 | 1760 | |
5 | ALYDVVSTL | 6.0484 | 2593 | 6.38942 |
6 | TVLSDFKTW | 6.0021 | 1986 | |
7 | PYIEQGMQL | 5.9628 | 1716 | |
8 | WHYPCTVNF | 5.921 | 616 | 6.38729 |
9 | KFPPALPIW | 5.8652 | 2280 | |
10 | TYSTYCKFL | 5.8658 | 1292 | |
11 | AYSQQTRGL | 5.831 | 1031 | |
12 | AQPGYPWPL | 5.8258 | 77 | 5.36419 |
13 | ILNTHFFSI | 5.8071 | 2843 | 7.89519 |
14 | SYTWTGALI | 5.8059 | 2422 | 7.12954 |
15 | SPPAVPQTF | 5.7982 | 1215 | |
16 | LLPRRGPRL | 5.7503 | 36 | 7.71195 |
17 | ALYGYWPLL | 5.7447 | 789 | 6.98038 |
18 | LMTHFFSIL | 5.7443 | 2844 | 5.9169 |
19 | LLKRLHQWI | 5.7425 | 1956 | 6.857254 |
20 | YILLLFLLL | 5.738 | 718 | |
21 | ARPDYNPPL | 5.7226 | 2289 | |
22 | AYYSMVGNW | 5.7076 | 360 | 6.46991 |
23 | PLARLIWWL | 5.6847 | 838 | 6.17696 |
24 | SQLDLSGWF | 5.6728 | 2962 | |
25 | SMLIDPSHI | 5.6657 | 2173 | 6.94013 |
26 | EYILLLFLL | 5.6643 | 717 | |
27 | ILLGPADSF | 5.6526 | 1010 | 5.50208 |
28 | LNPSVAATL | 5.6281 | 1254 | |
29 | GLLSFLVFF | 5.6226 | 764 | |
30 | YVYDHLTPL | 5.6148 | 948 | |
31 | HYAPRPCGI | 5.5954 | 488 | |
32 | GLIHLHRNI | 5.595 | 688 | |
33 | HYRDVLKEN | 5.5928 | 2482 | |
34 | YYKVFLARL | 5.5825 | 834 | 7.24746 |
35 | CMVDYPYRL | 5.566 | 607 | |
36 | AVIPDREVL | 5.5541 | 1693 | |
37 | NFSRCWVAL | 5.5315 | 234 | 6.28275 |
38 | VFSDMETKL | 5.5307 | 975 | 7.30704 |
39 | VWPLLLLLL | 5.5297 | 703 | |
40 | ITYSTYCKF | 5.5171 | 1291 | 6.97507 |
41 | IEPLDLPQI | 5.5049 | 2873 | |
42 | LLSTTEWQI | 5.4989 | 666 | 8.33563 |
43 | PLLREEVVF | 5.4208 | 2189 | |
44 | ATPPGSITV | 4.2466 | 1349 |
序列SEQ ID NOS:1-44为由HCV D90208株某基因组蛋白(SEQ IDNO:184)中所含的9个氨基酸残基组成的序列,这将在下文描述。序列SEQ ID NOS:1-44为通过上述方法预测具有优良的与HLA-A24型分子结合的序列。SEQ ID NOS:1-44以结合降低的顺序排列。即,SEQ IDNO:1为预测具有最佳结合的序列。各个序列与HLA-A24型分子结合的预测分和结合实验数据以-log Kd值的形式表示。
序列SEQ ID NOS:45-83示于下表2。
表2
HLA-A24-结合肽
SEQ ID NO | D89815预测分 | 预测分 | SEQ名称 | 结合实验数据 |
45 | VILDSFEPL | 6.4276 | 2251 | 5.00343 |
46 | ILPCSYTTL | 6.131 | 674 | |
47 | FWAKHMWNF | 6.0822 | 1760 | |
48 | ALYDVVSTL | 6.0484 | 2593 | 6.38942 |
49 | AFYGVWPLL | 5.9676 | 789 | 7.7344 |
50 | PYIEQGMQL | 5.9628 | 1716 | |
51 | TPPAVPQTF | 5.9302 | 1215 | |
52 | WHYPCTVNF | 5.921 | 616 | 6.38729 |
53 | GILPFFMFF | 5.9182 | 764 | 7.69551 |
54 | GLIHLHQNI | 5.879 | 688 | 5.85566 |
55 | LMCAVHFEL | 5.8442 | 876 | 6.59126 |
56 | TVLADFKFW | 5.8411 | 1986 | 6.51874 |
57 | AYSQQTRGL | 5.831 | 1031 | |
58 | AQPGYPWPL | 5.8258 | 77 | 5.36419 |
59 | PLLRDEVTF | 5.8128 | 2139 | 5.08926 |
60 | ILMTHFFSI | 5.8071 | 2843 | 7.89519 |
61 | SYTWTGALI | 5.8059 | 2422 | 7.12954 |
62 | ATPPGSVTF | 5.7779 | 1349 | 6.51124 |
63 | LLPRRGPRL | 5.7503 | 36 | 7.71195 |
64 | LMTHFFSIL | 5.7443 | 2844 | 5.9169 |
65 | LLKRLHQWI | 5.7425 | 1956 | 6.85724 |
66 | ARPDYNPPL | 5.7226 | 2289 | |
67 | AYYSMVGNW | 5.7076 | 360 | 6.46991 |
68 | KFPAAMPVW | 5.7062 | 2280 | |
69 | QYTLLFNIL | 5.7028 | 1804 | |
70 | LVPGAAYAF | 5.6865 | 782 | |
71 | RYAPACKPL | 5.6851 | 2132 | 6.75756 |
72 | FLARLIWWL | 5.6847 | 838 | 6.17696 |
73 | SQLDLSGWF | 5.6728 | 2962 | |
74 | SMLTDPSHI | 5.6657 | 2173 | 6.94014 |
75 | ILLGPADSR | 5.6526 | 1010 | 5.50208 |
76 | WLRDVWDWI | 5.6315 | 1976 | 6.34379 |
77 | YVVLLFLLL | 5.6308 | 718 | |
78 | LNPSVAATL | 5.6281 | 1254 | |
79 | YVVDHLIPL | 5.6148 | 948 | |
80 | HYRDVLKEM | 5.5928 | 2482 | |
81 | TLRRHVDLL | 5.5762 | 257 | |
82 | AVIPDREVL | 5.5541 | 1693 | |
83 | FLISQLFTF | 5.5528 | 285 |
序列SEQ ID NOS:45-83为由HCV D89815株某基因组蛋白(SEQID NO:185)中所含的9个氨基酸残基组成的序列。序列SEQ IDNOS:45-83为通过上述方法预测具有优良的与HLA-A24型分子结合的序列。SEQ ID NOS:45-83以结合降低的顺序排列。即,SEQ ID NO:45为预测具有最佳结合的序列。各个序列与HLA-A24型分子结合的预测分和结合实验数据以-log Kd值的形式表示。
序列SEQ ID NOS:84-123示于下表3。
表3
HLA-A24-结合肽
SEQ ID NO | pBRT703′X预测分 | 预测分 | SEQ名称 | 结合实验数据 |
84 | VILDSFEFL | 6.7012 | 2251 | 5.00343 |
85 | ILPCSYTTL | 6.2441 | 674 | |
86 | GILPFFMFF | 6.1234 | 764 | 7.69551 |
87 | PLLRDEVTF | 6.0964 | 2138 | 5.08926 |
88 | TPPAVPQTF | 6.0934 | 1215 | |
89 | FWAKHMWNF | 6.0822 | 1760 | |
90 | TVLADFKTW | 5.9355 | 1986 | 6.51874 |
91 | ALYDVVSTL | 5.9179 | 2593 | 6.38942 |
92 | WHYPCTVNF | 5.8742 | 616 | 6.38729 |
93 | ATPPGSVTF | 5.8681 | 1349 | 6.51124 |
94 | GLIHIHQNI | 5.8476 | 688 | 5.85566 |
95 | AYSQQTRGL | 5.831 | 1031 | |
96 | ILMTHFFSI | 5.8217 | 2843 | 7.88519 |
97 | FLARLIWWL | 5.7815 | 838 | 6.17698 |
98 | AFYGVWPLL | 5.7373 | 789 | 7.7544 |
99 | FLISQLFTF | 5.7347 | 285 | |
100 | ILLGPADSF | 5.719 | 1010 | 5.80208 |
101 | SNLTDPSHI | 5.8922 | 2173 | 6.94014 |
102 | AYYSMVGNW | 5.6746 | 360 | 6.46991 |
103 | QYTLLFNIL | 5.6682 | 1804 | |
104 | LLKRLHQWI | 5.6343 | 1858 | 6.85724 |
105 | SQLDLSGWF | 5.5993 | 2962 | |
106 | WLRDVWDWI | 5.5818 | 1976 | 6.34379 |
107 | ITYSTYGKF | 5.5352 | 1291 | 6.37373 |
108 | SYTWTGALI | 5.5253 | 2422 | 7.12954 |
109 | LLSTTEWQI | 5.5182 | 666 | 8.33563 |
110 | RYAPACKPL | 5.5076 | 2132 | 6.75756 |
111 | RLIWWLQYF | 5.5035 | 841 | |
112 | VLADFKTWL | 5.4871 | 1987 | 6.63423 |
113 | LVPGAAYAF | 5.4661 | 782 | |
114 | DLPQIIQRL | 5.4605 | 2877 | |
115 | WICTVLADF | 5.4521 | 1983 | |
118 | FYGVWPLLL | 5.4409 | 790 | |
117 | LLLSILGPL | 5.4385 | 891 | |
118 | HYRDVLKEM | 5.4331 | 2482 | |
119 | LIWWLQYFI | 5.4328 | 842 | |
120 | AVIPDREVL | 5.4247 | 1693 | |
121 | TRPPHGNWF | 5.4243 | 542 | |
122 | KFPAAMPVW | 5.424 | 2280 | |
123 | VFPDLGVRV | 5.3898 | 2580 | 6.73918 |
序列SEQ ID NOS:84-123为由HCV pBRT703′X株某基因组蛋白(SEQ ID NO:186)中所含的9个氨基酸残基组成的序列,这将在下文描述。序列SEQ ID NOS:84-123为通过上述方法预测具有优良的与HLA-A24型分子结合的序列。SEQ ID NOS:84-123以结合降低的顺序排列。即,SEQ ID NO:84为预测具有最佳结合的序列。各个序列与HLA-A24型分子结合的预测分和结合实验数据以-log Kd值的形式表示。
序列SEQ ID NOS:124-183示于下表4。
表4
HLA-A2-结合肽
SEQ ID NO | pBRT703′X预测分 | 预测分 | SEQ名称 | 结合实验数据 |
124 | KLLPRLPGV | 5.9316 | 1998 | 0.70728 |
125 | DHPSTEDLV | 5.925 | 1872 | |
126 | YLYGIGSAV | 5.8812 | 741 | 5.58617 |
127 | YLNTPGLPV | 5.7437 | 1542 | 5.67247 |
128 | CLLLLSYGV | 5.7302 | 2994 | |
129 | LLLSYGYGI | 5.8529 | 2998 | |
130 | LLCPSGHVV | 5.8239 | 1162 | |
131 | AILSPGALV | 5.8128 | 1885 | 6.24349 |
132 | SLIGYPYFV | 5.5608 | 906 | 5.86200 |
133 | DVWGWIGTV | 5.5367 | 1879 | 4.97855 |
134 | VIPASEDYV | 5.558 | 1425 | 5.24145 |
135 | RALAHGVRV | 5.5481 | 149 | 5.28381 |
136 | LSDGSWSTV | 5.5257 | 2400 | 6.22313 |
137 | KLQDCTKLV | 5.4922 | 3726 | 5.25202 |
138 | YCLTTGSYV | 5.4899 | 1873 | |
139 | SNLTDPSRI | 5.4885 | 2173 | 5.55941 |
140 | AAFCSAMYY | 5.4454 | 269 | |
141 | YSPGETNRV | 5.4058 | 2898 | 6.05123 |
142 | YTNYDQDLV | 5.4048 | 1101 | 5.37802 |
143 | LRDEVTFQV | 5.4015 | 2141 | |
144 | LAALTGTYV | 5.3812 | 041 | |
145 | CEPEFCYTV | 5.3645 | 2182 | |
146 | CNSADLEYV | 5.3581 | 1348 | 4.80983 |
147 | YFPDLGYRV | 5.3548 | 2580 | 6.62403 |
148 | YCFTPSPVV | 5.3268 | 507 | |
149 | YLQASLIRV | 5.2832 | 901 | 5.46327 |
150 | KQAEAAAPV | 5.2818 | 1741 | 5.41584 |
151 | LLLALFPRA | 5.2655 | 799 | |
152 | YLDDHYRDV | 5.2542 | 2478 | 8.51154 |
153 | FSPRRHETV | 5.2377 | 293 | |
154 | SVIDCNTCV | 5.3252 | 1453 | |
155 | GLIRACTLV | 5.1743 | 917 | |
156 | TYNFTIFKV | 5.1707 | 321 | |
157 | EHGGNIYRV | 5.1851 | 2238 | |
158 | PLLRHHRHV | 5.1643 | 2448 | |
159 | QLDLSGWFV | 5.1835 | 2983 | |
160 | TLAARNASV | 5.1583 | 245 | |
161 | RLGAYQNEV | 5.1393 | 1327 | |
162 | YILDSFEPL | 5.138 | 2251 | 5.38729 |
163 | AALENLYVL | 5.1347 | 748 | |
164 | LLEDTDTPI | 5.1223 | 2546 | |
165 | YVTSTWYLV | 5.1189 | 1655 | |
166 | FSLDPTFTI | 5.1183 | 1464 | |
197 | TIPASAYEY | 5.1158 | 186 | |
168 | DLLEDTDTF | 5.091 | 2544 | |
169 | LLLSLLGPL | 5.0753 | 891 | |
170 | VLADFKTWL | 5.0725 | 1987 | 8.04896 |
171 | SILCTCTVL | 5.071 | 1325 | |
172 | AGDNFPYIV | 5.0851 | 1579 | |
173 | ILPCSYTTL | 5.0843 | 674 | 8.37008 |
174 | YAAEEYVEV | 5.0509 | 2055 | |
175 | LAVAYEPYV | 5.0484 | 887 | |
176 | ALYCVVSTL | 5.0301 | 2593 | 6.14967 |
177 | FLARLINWL | 5.0259 | 838 | 5.67557 |
178 | RLLAPITAY | 5.0244 | 1024 | |
179 | WLRCVWDWI | 5.0131 | 1978 | 5.38168 |
180 | CYNGACWTV | 6.0181 | 1073 | |
181 | YYYCHLTPL | 5.0087 | 848 | |
182 | TVVLIESTV | 5.0081 | 2332 | |
183 | AARALAHGV | 5.0044 | 147 |
序列SEQ ID NOS:124-183为由HCV pBRT703′X株某基因组蛋白(SEQ ID NO:186)中所含的9个氨基酸残基组成的序列,这将在下文中描述。序列SEQ ID NOS:124-183为通过上述方法预测具有优良的与HLA-A2型分子结合的序列。SEQ ID NOS:124-183以结合降低的顺序排列。即,SEQ ID NO:124为预测具有最佳结合的序列。各个序列与HLA-A2型分子结合的预测分和结合实验数据以-log Kd值的形式表示。
尽管下文详细描述,但在表1-表4中,显然在预测分和结合实验数据之间存在相关性。即,尽管有小的误差,但可以说,通过上述方法预测具有高的与HLA-A型分子结合的肽经实验发现具有高的与HLA-A型分子结合。
既然通过这种实验设计方法,没有用于发现HLA-结合肽的常规技术,则只有很少量的HLA-结合肽经实验确认具有HLA-结合性质。由此,即使当由9个氨基酸残基组成的肽通过常规方法随机合成,进行实验以寻找该肽是否与HLA分子结合时,则只有约1/100的可能性找到一种结合的-log Kd值超过6的肽。
根据该实施方案,如上所述,由于利用通过实验设计方法寻找HLA-结合肽的技术,可找到多至183个HLA-结合肽序列。此外,当有些获得的HLA-结合肽通过实验检查结合时,经确认,所有已进行实验的序列显示等于或高于预测的与HLA的优异结合。
在这些序列中,含有选自下列至少一种氨基酸序列的HLA-结合肽,通过实验确认与人HLA-A型分子结合:SEQ ID NOS:1,2,3,5,8,12,13,14,16,17,18,19,22,23,25,27,34,37,38,40,42,45,48,49,52,53,54,55,56,58,59,60,61,62,63,64,65,67,71,72,74,75,76,84,86,87,90,91,92,93,94,96,97,98,100,101,102,104,106,107,108,109,110,112,123,124,126,127,131,132,133,134,135,136,137,139,141,142,146,147,149,150,152,162,170,173,176,177,和179。因此,可以肯定的说,正是HLA-结合肽在与人HLA-A型分子结合方面具有优异性质。
与本发明实施方案有关的HLA-结合肽与HLA分子结合的-log Kd值为3或以上,尤其优选5或以上,更优选5.4或以上。
在生化领域,已知-log Kd值约为3的结合能力,是肽能否真正与MHC,诸如HLA结合的阈值。因此,如果与HLA分子结合的-log Kd值为3或以上,则可以说,其是HLA-结合肽。
此外,如果与HLA分子结合的-log Kd值为5或以上,由于所得肽在与HLA分子结合方面具有优异性质,则其可合适地用于开发免疫疾病等的有效治疗药物,预防药物等。
而且,如果与HLA分子结合的-log Kd值为5.4或以上,则所得肽在与HLA分子结合方面具有尤其良好的性质,则其可合适地用于开发免疫疾病等的更有效治疗药物,预防药物等。
此外,可以安排成,与本发明实施方案有关的HLA-结合肽由不少于8个且不多于11个氨基酸残基组成。
以此方式,如果肽由不少于8个且不多于11个氨基酸残基组成,则其在与HLA分子结合方面具有优异性质。而且,胞毒T淋巴细胞(CTL)特异性识别病毒抗原(CTL表位),该抗原由8-11个氨基酸组成,以HLAI型分子被呈递在感染有病毒等的细胞的表面上,并通过破坏被感染的细胞而消除病毒。重要的是制备这种CTL表位,该表位由特异于病毒等的旨在制备针对该病毒等的治疗或预防用疫苗。
例如,上述HLA-结合肽可以是仅由氨基酸残基组成的肽,但不特别限于此。例如,它可以是任选修饰以糖链或脂肪酸基团等的HLA-结合肽前体,条件是本发明的效果不被削弱。这种前体发生变化,包括通过消化酶等在活的哺乳动物体,诸如人消化器官中消化变成HLA-结合肽,由此显示与上述HLA-结合肽所示的效果相似。
此外,上述HLA-结合肽可以是与人HLA-A24型分子结合的肽。
而且,上述HLA-结合肽可以是与人HLA-A2型分子结合的肽。
根据该组成,由于获得的肽与HLA-A24型分子结合,HLA-A24型分子在亚洲人,诸如日本人中是常见的,因此该肽可用于开发对亚洲人,诸如日本人尤其有效的治疗药物,预防药物等。
此外,根据该组成,由于获得的肽与HLA-A2型分子结合,除日本人之外,HLA-A2型分子还在欧美人中是常见的,因此该肽可用于开发对除日本人之外还对欧美人尤其有效的治疗药物,预防药物等。
上述HLA-结合肽中所含的氨基酸序列可以是衍生自HCV某基因组蛋白的氨基酸序列,但不特别限于此。例如,其可以是衍生自HCV蛋白的氨基酸序列,衍生自雪松花粉蛋白的氨基酸序列等。而且,其可以含有衍生自具有其他病原性或变应原性的蛋白的氨基酸序列。
例如,当其含有衍生自HCV包膜蛋白的氨基酸序列时,可获得治疗,预防由HCV引起的疾病的HLA-结合肽。
<实施方案2>
根据该实施方案,提供了与HLA-A型分子结合的HLA-结合肽,含有在上述HLA-结合肽中所含的氨基酸序列基础上通过缺失,替换,或添加一个或两个氨基酸残基而形成的氨基酸序列,并且由不少于8个且不多于11个氨基酸残基组成。
如下文所述,即使组成包括在与HLA-A型分子结合的特异氨基酸序列中通过缺失,替换,或添加一个或数个氨基酸残基而形成的氨基酸序列,仍显示与上述实施方案1有关的HLA-结合肽相似的效果。
尽管上述HCV D90208株,D89815株,和pBRT703′X株(D89815的突变亚克隆)的多肽的氨基酸序列彼此部分相异,但由于D90208株某基因组蛋白中现有的若干9-聚体肽的-log Kd值的预测数据和实验数据之间的相关性高,即,基于预测数据确定为结合的序列在实验数据中显示良好的-log Kd值,则可预测D89815株和pBRT703′X株(D89815的突变亚克隆)会显示-log Kd值在预测数据中具有较好的名次排列。因此,可以预测,即使D89815株和pBRT703′X株(D89815的突变亚克隆)某些基因组蛋白中的氨基酸序列,是在显示结合性质的氨基酸序列的基础上通过替换一个或两个氨基酸残基而形成的氨基酸序列,仍将同样显示优异的HLA-结合性质。
即,可以预测,即使氨基酸序列是在与HLA-A型分子结合方面具有优异性质的SEQ ID NOS:1-83所示氨基酸序列的基础上通过缺失,替换或添加一个或两个氨基酸残基而形成的,将以相似方式显示优异的HLA-结合性质。
从另一观点来看,可以预测,即使氨基酸序列是在通过上述方法预测在与HLA-A型分子结合方面具有优异性质的氨基酸序列的基础上通过缺失,替换或添加一个或数个氨基酸残基而形成的,仍会以相似方式显示HLA-结合性质。被替换的氨基酸残基优选为相互之间具有相似性质的氨基酸残基,例如,两个皆为疏水性氨基酸残基。
而且,利用本领域技术人员公知的方法,可以生产实施方案1和实施方案2中所述的HLA-结合肽。例如,所述结合肽可通过固相或液相方法人工合成。或者,这些HLA-结合肽通过编码这些HLA-结合肽的DNA区段或重组载体的表达而生产。所得的这些HLA-结合肽通过本领域技术人员公知的方法而鉴别。例如,采用Edman降解,质谱等鉴别是可能的。
<实施方案3>
根据本实施方案,提供了含有为上述HLA-结合肽编码的DNA序列的DNA区段。由于与本实施方案有关的DNA区段含有特异DNA序列,因此,其可表达上述HLA-结合肽。
当利用与本实施方案有关的DNA区段表达上述HLA-结合肽时,通过将DNA区段导入细胞,可实现表达,或通过商用人工蛋白表达试剂盒,可实现表达。
此外,通过将上述DNA区段导入,例如人细胞,可实现连续表达。由此,通过将编码HLA-结合肽的DNA区段导入细胞,而非将HLA-结合肽自身导入细胞,可使HLA-结合肽组成型存在于细胞中。当HLA-结合肽用作疫苗时,这种组成型表达的能力对于提高疫苗的功效是有利的。
而且,与本实施方案有关的DNA区段可通过本领域技术人员公知的方法生产。例如,其可通过商用DNA合成仪等人工合成。或者,其可通过限制性酶等从HCV基因组中片断化。或者,其可利用引物对通过PCR方法从HCV基因组中扩增。所得的DNA区段利用本领域技术人员公知的方法而鉴别。例如,其可通过商用DNA测序仪而鉴别。
<实施方案4>
根据本实施方案,提供了含有为上述HLA-结合肽编码的DNA序列的重组载体。由于与本实施方案有关的重组载体含有特异DNA序列,上述HLA-结合肽可得以表达。
当上述HLA-结合肽通过与本实施方案有关的重组载体表达时,通过将该重组载体导入细胞可实现表达,或通过商用人工蛋白表达试剂盒可实现表达。
此外,通过将上述重组载体导入,例如人细胞,可实现连续表达。由此,通过将编码HLA-结合肽的重组载体导入细胞,而非将HLA-结合肽自身导入细胞,可使HLA-结合肽连续存在于细胞中。当HLA-结合肽用作疫苗时,这种连续表达的能力对提高疫苗的功效是有利的。
而且,在上述重组载体中,通过在参与转录和表达的调节区中使用某序列,例如位于编码上述HLA-结合肽的DNA序列上游的启动子区,HLA-结合肽的表达量可以高精度控制。而且,在参与复制的调节区中通过使用某序列,例如重组载体的起点区,重组载体在细胞中的拷贝数可得以高精度控制。
此外,上述重组载体可任选含有除编码上述HLA-结合肽的NDA序列以外的序列。例如,其可含有标记基因序列,诸如药物抗性基因。
而且,与本实施方案有关的重组载体可利用本领域技术人员公知的方法生产。例如,其通过在某限制性酶切位点处切割商用载体,诸如pBR322或pUC19的多克隆位点,然后插入上述DNA区段至该位点,并连接而获得。此外,所得重组载体利用本领域技术人员公知的方法可鉴别。例如,通过琼脂糖凝胶电泳,可确认由预定限制性酶切割的DNA区段的长度是否与商用载体,诸如pBR322或pUC19的限制性图谱一致。此外,通过DNA测序仪等可鉴别上述DNA序列是否包含在从多克隆位点切割得到的DNA序列中。
尽管上文描述了本发明的实施方案,但它们仅列举作为本发明的例子,除上述之外,可采用多种不同的变例。
例如,在上述实施方案中,HLA-结合肽含有衍生自HCV某基因组蛋白(SEQ ID NOS:184,185,186)的氨基酸序列,但可采用含有衍生自另一HCV的氨基酸序列的HLA-结合肽。在这种情形下,所述结合肽可用于治疗与其衍生的蛋白有关的多种免疫疾病。
此外,可采用除HCV以外的病原菌,诸如HIV病毒的HLA-结合肽,也可采用含有衍生自蛋白,诸如雪松花粉过敏原等,或癌细胞的氨基酸序列的HLA-结合肽。
这样,如果含有通过上述方法预测具有优异HLA-结合性质的氨基序列,则可以预计,所述结合肽将显示优异的HLA-结合性质,这与经实验确认时的相似。由此,这些HLA-结合肽可合适地主要用于治疗或预防传染病(流感,SARS,HIV,HCV等),并且也用于癌症免疫治疗,过敏性疾病(花粉过敏(花粉热),风湿病,遗传性过敏症,哮喘等),自身免疫疾病等。
(实施例)
本发明下文将参照实施例进一步解释,但本发明并不限于此。
具体而言,基于积极学习实验设计,实现本发明实施例中的预测,实验和评价步骤,并且一般而言,重复下述步骤。在此采用的积极学习实验设计的示意图示于图1中。
(1)低级顺序学习算法的试验(将在下文描述)进行一次。即,从累积数据中通过随机采样生成多个假设,关于随机表达的候选查询点(肽),显示预测值的最大分布的点挑选作为查询点进行实验。
(2)被挑选的查询点处的肽通过合成和纯化方法(将在下文描述)而制备,以及实际结合能力通过实验测量(将在下文描述),并添加至累积数据。
在本发明实施例中,Hidden Markov模型的监督学习算法用作低级顺序学习算法,以223种肽的起始数据开始,根据实验预测和挑选20-30种肽,上述步骤重复四次,获得总共341个数据点。
更具体而言,在本发明实施例的积极学习方法中,根据实验设计和合成了20-30种肽,在这些肽含有的氨基酸序列中,排列了9/20种氨基酸。测量了与HLA分子的结合强度(结合能力)。获得的结合能力(Kd值,以摩尔浓度表示)作为实验结果。当结合能力高时,该肽被挑选作为HLA-结合肽的候选物,用作疫苗材料。
所得结果输入装配学习机的学习系统,采用Hidden Markov模型作为数学算法,并创建了规则。学习机采样不同结果以制备规则。通过学习机表述的规则具有不同的组成。视需要,所得规则和实验数据存储作为累积数据。
从超过209=5000亿个肽序列中,通过规则挑选用于随后实验的候选物,并且重复上述步骤。在此阶段,不同的规则适用于实验候选物,以及实验结果的预测产生分歧的候选物进行实验。这样,由于实验结果的预测产生分歧的候选物进行随后实验,提高了预测的最终精度。
以此,多个学习机进行选择性采样,其中给出不同预测的样品被挑选作为实验候选物,可有效增加信息,以及获得高精度的假设(规则)。重复上述步骤四次,给出优异结果,如下文实施例所述。重复7次或以上给出甚至更好的结果。
根据这种积极学习方法,可减少由9个氨基酸残基组成的肽的结合实验的重复次数,其本应对HLA-结合肽的所有候选物进行5000亿或以上的组合。在积极学习方法中,通过实验形成规则,对数十个通过应用规则预测的候选序列重复实验。由此,实验次数可削减,初始筛选的时间和费用大大减少。
此外,采用规则预测肽与HLA结合的命中率可达70-80%,所述规则通过积极学习方法获得,而通过其他公知技术,诸如锚定方法得到的命中率低至约30%。
<肽的合成和纯化>
利用Fmoc氨基酸,通过Merrifield固相方法,手工合成肽。脱保护之后,利用C18柱进行反相HPLC纯化,给出95%或以上的纯度。采用MALDI-TOF质谱仪(Voyager DE RP,PerSeptive),鉴别肽和确认其纯度。通过Micro BCA分析(Pierce Corp.),利用BSA作为标准蛋白,进行肽的定量分析。
<肽与HLA-A24型分子结合的实验>
肽与HLA-A24型分子(HLA-A*2402基因的产物)结合的能力,利用表达HLA-A24型分子的C1R-A24细胞(由熊本大学的MasafumiTakiguchi教授制备的细胞,在征得其同意下,通过Ehime University的Masaki Yasukawa助理教授提供),加以测量。
C1R-A24细胞首先暴露于酸性条件,pH3.3,30秒,由此解离和去除轻链β2m(其与HLA I型分子共同相连),以及原先与HLA-A*2402分子结合的内源肽。中和后,纯化的β2m加至C1R-A24细胞,所得产物添加至系列稀释的肽中,冰上温育4小时。利用荧光标记的单克隆抗体17A12染色,所述抗体识别HLA-A*2402分子,肽,和β2m三个成员的结合(MHC-pep),这三个成员在温育过程中已重新结合。随后,MHC-pep计数/C1R-A24细胞(与上述荧光抗体的荧光强度成比例)利用FACScan荧光活化的细胞分拣仪(Becton DickinsonBiosciences)定量测量。HLA-A24型分子和肽之间的结合解离常数Kd值,通过公开的方法(Udaka等,Immunogenetics,51,816-828,2000),根据每个细胞的平均荧光强度而计算。
<肽与HLA-A2型分子结合的实验>
肽与HLA-A2型分子(HLA-A*0201基因的产物)结合的能力,利用表达HLA-A*0201的株JY细胞,加以测量。
JY细胞首先暴露于酸性条件,pH3.8,30秒,由此解离和去除与HLA-A*0201分子非共价连接的轻链β2m和内源肽。中和后,进行重新结合实验。
上述JY细胞和纯化的β2m加至分级系列稀释的结合能力被测量的肽,冰上温育4小时。利用结合型特异的荧光标记的单克隆抗体BB7.2,对已重新结合高达该点的HLA-A*0201分子染色。
随后,每个细胞的荧光量,利用流式细胞仪测量,而解离常数Kd值,通过公开的方法(Udaka等,Immunogenetics,51,816-828,2000),以摩尔浓度计算。
<评价结果>
获得的预测结果和实验结果示于上文表1-4中。
表1中的序列SEQ ID NOS:1-44由登记在GenBank中的HCVD90208株某基因组蛋白的全长序列中所含的9个氨基酸残基组成。此外,序列SEQ ID NOS:1-44具有较好的与HLA-A24型分子结合,正如通过假设预测,所述假设由实施方案1中所述的实验设计方法获得。SEQ ID NOS:1-44以结合降低的顺序排列。即,SEQ ID NO:1为预测具有最佳结合的序列。HCV D90208某基因组蛋白的全长氨基酸序列示于下列SEQ ID NO:184
(MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGP
RLGVRATRKTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGN
EGMGWAGWLLSPRGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADL
MGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNLPGCSFSIFLL
ALLSCLTIPASAYEVRNVSGIYHVTNDCSNSSIVYEAADMIMHTPGC
VPCVRESNFSRCWVALTPTLAARNSSIPTTTIRRHVDLLVGAAALCS
AMYVGDLCGSVFLVSQLFTFSPRRYETVQDCNCSIYPGHVSGHRMA
WDMMMNWSPTTALVVSQLLRIPQAVVDMVAGAHWGVLAGLAYY
SMVGNWAKVLIVMLLFAGVDGHTHVTGGRVASSTQSLVSWLSQGP
SQKIQLVNTNGSWHINRTALNCNDSLQTGFIAALFYAHRFNASGCPE
RMASCRPIDEFAQGWGPITHDMPESSDQRPYCWHYAPRPCGIVPAS
QVCGPVYCFTPSPVVVGTTDRFGAPTYSWGENETDVLLLSNTRPPQ
GNWFGCTWMNSTGFTKTCGGPPCNIGGVGNNTLVCPTDCFRKHPE
ATYTKCGSGPWLTPRCMVDYPYRLWHYPCTVNFTVFKVRMYVGG
VEHRLNAACNWTRGERCDLEDRDRSELSPLLLSTTEWQILPCSFTTL
PALSTGLIHLHRNIVDVQYLYGIGSAVVSFAIKWEYILLLFLLLADAR
VCACLWMMLLIAQAEATLENLVVLNAASVAGAHGLLSFLVFFCAA
WYIKGRLVPGAAYALYGVWPLLLLLLALPPRAYAMDREMAASCGG
AVFVGLVLLTLSPYYKVFLARLIWWLQYFITRAEAHLQVWVPPLNV
RGGRDAIILLTCAVHPELIFDITKLLLAILGPLMVLQAGITRVPYFVRA
QGLIRACMLVRKVAGGHYVQMAFMKLAALTGTYVYDHLTPLRDW
AHAGLRDLAVAVEPVVFSDMETKLITWGADTAACGDIISGLPVSAR
RGKEILLGPADSFGEQGWRLLAPITAYSQQTRGLLGCIITSLTGRDKN
QVDGEVQVLSTATQSFLATCVNGVCWTVYHGAGSKTLAGPKGPIT
QMYTNVDQDLVGWPAPPGARSMTPCTCGSSDLYLVTRHADVVPVR
RRGDSRGSLLSPRPISYLKGSSGGPLLCPSGHVVGIFRAAVCTRGVA
KAVDFIPVESMETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKS
TKVPAAYAAQGYKVLVLNPSVAATLGFGAYMSKAHGIEPNIRTGVR
TITTGGPITYSTYCKFLADGGCSGGAYDIIICDECHSTDSTTILGIGTV
LDQAETAGARLVVLATATPPGSITVPHPNIEEVALSNTGEIPFYGKAI
PIEAIKGGRHLIFCHSKKKCDELAAKLTGLGLNAVAYYRGLDVSVIP
TSGDVVVVATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIET
TTLPQDAVSRAQRRGRTGRGRSGIYRFVTPGERPSGMFDSSVLCECY
DAGCAWYELTPAETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTH
IDAHFLSQTKQAGDNLPYLVAYQATVCARAQAPPPSWDQMWKCLI
RLKPTLHGPTPLLYRLGAVQNEVTLTHPITKYIMACMSADLEVVTST
WVLVGGVLAALAAYCLTTGSVVIVGRIILSGRPAVIPDREVLYQEFD
EMEECASHLPYIEQGMQLAEQFKQKALGLLQTATKQAEAAAPVVES
KWRALEVFWAKHMWNFISGIQYLAGLSTLPGNPAIASLMAFTASITS
PLTTQNTLLFNILGGWVAAQLAPPSAASAFVGAGIAGAAVGSIGLGK
VLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPAILSPGAL
VVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE
SDAAARVTQILSSLTITQLLKRLHQWINEDCSTPCSGSWLKDVWDWI
CTVLSDFKTWLQSKLLPRLPGLPFLSCQRGYKGVWRGDGIMQTTCP
CGAQITGHVKNGSMRIVGPKTCSNTWHGTFPINAYTTGPCTPSPAPN
YSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFF
TEVDGVRLHRYAPVCKPLLREEVVFQVGLNQYLVGSQLPCEPEPDV
AVLTSMLTDPSHITAETAKRRLARGSPPSLASSSASQLSAPSLKATCT
THHDSPDADLIEANLLWRQEMGGNITRVESENKVVILDSFDPIRAVE
DEREISVPAEILRKPRKFPPALPIWARPDYNPPLLESWKDPDYVPPVV
HGCPLPSTKAPPIPPPRRKRTVVLTESTVSSALAELATKTFGSSGSSA
VDSGTATGPPDQASDDGDKGSDVESYSSMPPLEGEPGDPDLSDGSW
STVSGEAGEDVVCCSMSYTWTGALITPCAAEESKLPINPLSNSLLRH
HSMVYSTTSRSASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTV
KARLLSIEEACKLTPPHSAKSKFGYGAKDVRSLSSRAVNHIRSVWED
LLEDTETPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEK
MALYDVVSTLPQAVMGPSYGFQYSPGQRVEFLVNTWKSKKCPMGF
SYDTRCFDSTVTENDIRTEESIYQCCDLAPEARQAIRSLTERLYVGGP
LTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKATAACRAAKLQD
CTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPPGDPPQP
EYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETVRH
TPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYG
ACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLR
VWRHRARSVRAKLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAAS
QLDLSGWFVAGYNGGDIYHSLSRARPRWFMLCLLLLSVGVGIYLLP
NR).
序列SEQ ID NOS:45-83由登记在GenBank中的HCV D89815株某基因组蛋白的全长序列中所含的9个氨基酸残基组成。此外,序列SEQ ID NOS:45-83具有较好的与HLA-A24型分子结合,正如通过假设预测,所述假设由实施方案1中所述的实验设计方法获得。SEQ IDNOS:45-83以结合降低的顺序排列。即,SEQ ID NO:45为预测具有最佳结合的序列。HCV D89815株某基因组蛋白的全长氨基酸序列示于下列SEQ ID NO:185
(MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGP
RLGVRATRKTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGN
EGLGWAGWLLSPRGSRPSWGPNDPRRRSRNLGKVIDTLTCGFADLM
GYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNLPGCSFSIFLLAL
LSCLTIPASAYEVRNVSGIYHVTNDCSNSSIVYEAADVIMHAPGCVP
CVRENNSSRCWVALTPTLAARNASVPTTTLRRHVDLLVGTAAFCSA
MYVGDLCGSVFLISQLFTFSPRRHETVQDCNCSIYPGHVSGHRMAW
DMMMNWSPTAALVVSQLLRIPQAVMDMVAGAHWGVLAGLAYYS
MVGNWAKVLIVMLLFAGVDGHTRVTGGVQGHVTSTLTSLFRPGAS
QKIQLVNTNGSWHINRTALNCNDSLKTGFLAALFYTHKFNASGCPE
RMASCRSIDKFDQGWGPITYAQPDNSDQRPYCWHYAPRQCGIVPAS
QVCGPVYCFTPSPVVVGTTDRFGAPTYNWGDNETDVLLLNNTRPPH
GNWFGCTWMNSTGFTKTCGGPPCNIRGVGNNTLTCPTDCFRKHPD
ATYTKCGSGPWLTPRCLVDYPYRLWHYPCTVNFTIFKVRMYVGGV
EHRLDAACNWTRGERCDLEDRDRAELSPLLLSTTEWQILPCSYTTLP
dALSTGLIHLHQNIVDIQYLYGIGSAVVSIAIKWEYVVLLFLLLADARV
CACLWMMLLIAQAEAALENLVVLNAASVVGAHGMLPFFMFFCAA
WYMKGRLVPGAAYAFYGVWPLLLLLLALPPRAYAMDREMVASCG
GGVFVGLALLTLSPYCKVFLARLIWWLQYFITKAEAHLQVSLPPLNV
RGGRDAIILLMCAVHPELIFDITKLLLSILGPLMVLQASLIRVPYFVRA
QGLIRACMLVRKAAGGHYVQMAFVKLAALTGTYVYDHLTPLQDW
AHVGLRDLAVAVEPVVFSAMETKVITWGADTAACGDIISGLPVSAR
RGKEILLGPADSFEGQGWRLLAPITAYSQQTRGLLGCIITSLTGRDKN
QVEGEVQVVSTAKQSFLATCVNGACWTVFHGAGSKTLAAAKGPIT
QMYTNVDQDLVGWPAPPGARSLTPCTCGSSDLYLVTRHADVIPVRR
RGDSRGSLLSPRPISYLKGSSGGPLLCPSGHVVGIFRAAVCTRGVAK
AVDFIPVESMETTMRSPVFTDNSTPPAVPQTFQVAHLHAPTGSGKST
KVPAAYAAQGYMVLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVR
TITTGAPITYSTYGKFLADGGCSGGAYDIIICDECHSTDSTSILGIGTV
LDQAETVGARFVVLATATPPGSITFPHPNIEEVPLANTGEIPFYAKTIP
IEVIRGGRHLIFCHSKKKCDELPAKLSALGLNAVAYYRGLDVSVIPA
SGDVVVVATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETT
TVPQDAVSRTQRRGRTGRGRRGIYRFVTPGERPSAMFDSSVLCECY
DAGCAWYELTPAETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTH
IDAHFLSQTKQAGDNFPYLVAYQATVCARAKAPPPSWDQMWKCLI
RLKPTLHGPTPLLYRLGAVQNEVTLTHPITKYIMACMSADLEVVTST
WVLVGGVLAALAAYCLTTGSVVIVGRIILSGRPAVIPDREVLYQEFD
EMEECASHLPYIEQGMQLAEQFKQKALGLLQTATKQAEAAAPVVES
KWRALETFWAKHMWNFISGIQYLAGLSTLPGNPAIASLMAFTASITS
PLATQYTLLFNILGGWVAAQLAPPSAASAFVGAGIAGAAVGSIGLGK
VLVDILAGYGAGVAGALVAFKVMSGDMPSTEDLVNLLPAILSPGAL
VVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE
SDAAARVTQILSNLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWI
CTVLADFKTWLQSKLLPRLPGVPFFSCQRGYKGVWRGDGIMYTTCP
CGAQITGHVKNGSMRIVGPRTCSNTWHGTFPINAYTTGPCTPSPAPN
YSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFF
TELDGVRLHRYAPACKPLLRDEVTFQVGLNQYTVGSQLPCEPEPDV
TVVTSMLTDPSHITAEAARRRLARGSPPSLAGSSASQLSALSLKATCT
THHGAPDTDLIEANLLWRQEMGGNITRVESENKIVILDSFEPLRAEE
DEREVSAAAEILRKTRKFPAAMPVWARPDYNPPLLESWKNPDYVPP
VVHGCPLPPTKAPPIPPPRRKRTVVLTESTVSSALAELATKTFGGSGS
SAVDSGTATGPPDQASAEGDAGSDAESYSSMPPLEGEPGDPDLSDGS
WSTVSEEASEDVVCCSMSYTWTGALITPCAAEESKLPINALSNPLLR
HHNMVYSTTSRSASLRQKKVTFDRMQVLDDHYRDVLKEMKAKAS
TVKAKLLSVEEACKLTPPHSAKSKFGYGAKDVRSLSSRAVNHIRSV
WKDLLEDTDTPIQTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRV
CEKMALYDVVSTLPQAVMGSSYGFQYSPKQRVEFLVNTWKAKKCP
MGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQAIRSLTERLYI
GGPMTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAAAACRAAK
LQDCTMLVCGDDLVVICDSAGTQEDAASLRVFTEAMTRYSAPPGDP
PQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQI
YGATYSIEPLDLPQIIQRLHGLSAFSLHSYSPGEINRVASCLRKLGVPP
LRVWRHRARSVRAKLLSQGGRAATCGKYLFNWAVKTKLKLTPIPE
ASQLDLSGWFVAGYSGGDIYHSLSRARPRWFMWCLLLLSVGVGIYL
LPNR).
序列SEQ ID NOS:84-123由HCV pBRT703′X株(D89815的突变亚克隆)某基因组蛋白中所含的9个氨基酸残基组成,所述毒株获自大阪大学微生物疾病研究所的Yoshiharu Matsuura教授。此外,序列SEQ IDNOS:84-123具有较好的与HLA-A24型分子结合,正如通过假设预测,所述假设由实施方案1中所述的实验设计方法获得。SEQ IDNOS:84-123以结合降低的顺序排列。即,SEQ ID NO:84为预测具有最佳结合的序列。HCV pBRT703′X株(D89815的突变亚克隆)某基因组蛋白的全长氨基酸序列示于下列SEQ ID NO:186
(MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGP
RLGVRATRKTSERSQPRGRRQPIPKARHPEGRAWAQPGYPWPLYGN
EGMGWAGWLLSPRGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADL
MGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNLPGCSFSIFLL
ALLSCLTIPASAYEVRNVSGIYHVTNDCSNSSIVYEAADVIMHAPGC
VPCVRENNSSRCWVALTPTLAARNASVPTTTLRRHVDLLVGTAAFC
SAMYVGDLCGSVFLISQLFTFSPRRHETVQDCNCSIYPGHVSGHRMA
WDMMMNWSPTAALVVSQLLRIPQAVMDMVAGAHWGVLAGLAYY
SMVGNWAKVLIVMLLFAGVDGHTRVTGGVQGHVTSTLTSLFRPGA
SQKIQLVNTNGSWHINRTALNCNDSLKTGFLAALFYTHKFNASGCPE
RMASCRSIDKFDQGWGPITYAQPDNSDQRPYCWHYAPRQCGIVPAS
QVCGPVYCFTPSPVVVGTTDRFGAPTYNWGDNETDVLLLNNTRPPH
GNWFGCTWMNSTGFTKTCGGPPCNIRGVGNNTLTCPTDCFRKHPD
ATYTKCGSGPWLTPRCLVDYPYRLWHYPCTVNFTIFKVRMYVGGV
EHRLDAACNWTRGERCDLEDRDRAELSPLLLSTTEWQILPCSYTTLP
ALSTGLIHLHQNIVDIQYLYGIGSAVVSIAIKWEYVVLLFLLLADARV
CACLWMMLLIAQAEAALENLVVLNAASVAGAHGILPFFMFFCAAW
YMKGRLVPGAAYAFYGVWPLLLLLLALPPRAYAMDREMAASCGG
GVFVGLALLTLSPYCKVFLARLIWWLQYFITKAEAHLQVWVPPLNV
RAGRDAIILLMCAVHPELIFDITKLLLSILGPLMVLQASLIRVPYFVRA
QGLIRACTLVRKAAGGHYVQMAFVKLAALTGTYVYDHLTPLQDW
AHVGLRDLAVAVEPVVFSAMETKVITWGADTAACGDIISGLPVSAR
RGKEILLGPADSFEGQGWRLLAPITAYSQQTRGLLGCIITSLTGRDKN
QVEGEVQVVSTATQSFLATCVNGACWTVFHGAGSKTLAGPKGPITQ
MYTNVDQDLVGWPAPPGARSLTPCTCGSSDLYLVTRHADVIPVRRR
GDTRGSLLSPRPISYLKGSSGGPLLCPSGHVVGIFRAAVCTRGVAKA
VDFIPVESMETTMRSPVFTDNSTPPAVPQTFQVAHLHAPTGSGKSTK
VPAAYAAQGYMVLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTI
TTGAPITYSTYGKFLADGGCSGGAYDIIICDECHSTDSTSILGIGTVLD
QAETAGARLVVLATATPPGSVTFPHPNIEEVALGNTGEIPFYGKAIPI
EVIKGGRHLIFCHSKKKCDELAAKLSPLGLNAVAYYRGLDVSVIPAS
GDVVVVATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETTT
VPQDAVSRTQRRGRTGRGRRGIYRFVTPGERPSGMFDSSVLCECYD
AGCAWYELTPAETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTHI
DAHFLSQTKQAGDNFPYLVAYQATVCARAKAPPPSWDQMWKCLIR
LKPTLHGPTPLLYRLGAVQNEVTLTHPITKFIMACMSADLEVVTSTW
VLVGGVLAALAAYCLTTGSVVIVGRIILSGRPAVIPDREVLYQEFDE
MEECASHLPYIEQGMQLAEQFKQKALGLLQTATKQAEAAAPVVES
KWRALETFWAKHMWNFISGIQYLAGLSTLPGNPAIASLMAFTASITS
PLATQYTLLFNILGGWVAAQLAPPSAASAFVGAGIAGAAVGSIGLGK
VLVDILAGYGAGVAGALVAFKVMSGDMPSTEDLVNLLPAILSPGAL
VVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE
SDAAARVTQILSNLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWI
CTVLADFKTWLQSKLLPRLPGVPFFSCQRGYKGVWRGDGIMYTTCP
CGAQITGHVKNGSMRIVGPRTCSNTWHGTFPINAYTTGPCTPSPAPN
YSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFF
TELDGVRLHRYAPACKPLLRDEVTFQVGLNQYTVGSQLPCEPEPDV
TVVTSMLTDPSHITAEAARRRLARGSPPSLAGSSASQLSAPSLKATCT
THHGAPDTDLIEANLLWRQEMGGNITRVESENKIVILDSFEPLRAEE
DEREVSAAAEILRKTRKFPAAMPVWARPDYNPPLLESWKNPDYVPP
VVHGCPLPPTKAPPIPPPRRKRTVVLTESTVSSALAELATKTFGGSGS
SAVDSGTATGPPDQASAEGDAGSDAESYSSMPPLEGEPGDPDLSDGS
WSTVSEEASEDVVCCSMSYTWTGALITPCAAEESKLPINALSNPLLR
HHNMVYSTTSRSASLRQKKVTFDRMQVLDDHYRDVLKEMKAKAS
TVKAKLLSVEEACKLTPPHSAKSKFGYGAKDVRSLSSRAVNHIRSV
WKDLLEDTDTPIQTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRV
CEKMALYDVVSTLPQAVMGSSYGFQYSPKQRVEFLVNTWKAKKCP
MGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQAIRSLTERLYI
GGPMTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAAAACRAAK
LQDCTMLVCGDDLVVICDSAGTQEDAASLRVFTEAMTRYSAPPGDP
PQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQI
YGATYSIEPLDLPQIIQRLHGLSAFSLHSYSPGEINRVASCLRKLGVPP
LRVWRHRARSVRAKLLSQGGRAATCGKYLFNWAVKTKLKLTPIPE
ASQLDLSGWFVAGYSGGDIYHSLSRARPRWFMWCLLLLSVGVGIYL
LPNR).
序列SEQ ID NOS:124-183由上述HCV pBRT703′X株(D89815的突变亚克隆)某基因组蛋白中所含的9个氨基酸残基组成。此外,序列SEQ ID NOS:124-183具有较好的与HLA-A2型分子结合,正如通过假设预测,所述假设由实施方案1中所述的实验设计方法获得。SEQ IDNOS:124-183以结合降低的顺序排列。即,SEQ ID NO:124为预测具有最佳结合的序列。
表1-4显示的氨基酸序列在对HCV D90208株,D89815株,和pBRT703′X株(D89815的突变亚克隆),利用上述预测程序,预测分,和相应结合实验数据而获得的预测结果中具有较好的得分。通过上述合成方法人工合成9个氨基酸肽,可获得全部结合实验数据。
所述的HCV D90208株和D89815株的某些基因组蛋白登记在GenBank,但由其中9个氨基酸残基组成的序列,即HLA-结合肽,现在尚未登记。
此外,pBRT703′X株(D89815的突变亚克隆)为突变株,其与日本人丙型肝炎病人中常见的HCV亚株相似。在本发明实施例中,已发现在该突变株某基因组蛋白中所含的HLA-结合肽,所述突变株与常见于日本人中的亚株相似。该HLA-结合肽可合适地用于开发日本人的丙型肝炎治疗药。
在此,上述HCV D90208株,D89815株,和pBRT703′X株(D89815的突变亚克隆)某些基因组蛋白的氨基酸序列彼此部分不同,并且可以预测,即使在氨基酸序列中通过替换一个或数个氨基酸残基而形成的氨基酸序列将同样显示上述优异的HLA-结合性质。
例如,D90208株的肽SEQ ID NO:1从左计数第6个氨基酸为F,但其对D89815株的肽SEQ ID NO:46和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:85为Y。
此外,D90208株的肽SEQ ID NO:17从左计数第2个氨基酸为L,但其对D89815株的肽SEQ ID NO:49和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:98为F。
而且,D90208株的肽SEQ ID NO:3从左计数第5个氨基酸为V,但其对D89815株的肽SEQ ID NO:71和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:110为A。
此外,D90208株的肽SEQ ID NO:2从左计数第7个氨基酸为D,但其对D89815株的肽SEQ ID NO:45和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:84为E。
而且,D90208株的肽SEQ ID NO:40从左计数第7个氨基酸为C,但其对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:107为G。
此外,D90208株的肽SEQ ID NO:43从左计数第5个氨基酸为E,但其对D89815株的肽SEQ ID NO:59和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:87为D,以及D90208株的肽SEQ ID NO:43从左计数第8个氨基酸为V,但其对D89815株的肽SEQ ID NO:59和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:87为T。
而且,D90208株的肽SEQ ID NO:44从左计数第7个氨基酸为I,但其对D89815株的肽SEQ ID NO:62和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:93为V,以及D90208株的肽SEQ ID NO:44从左计数第9个氨基酸为V,但其对D89815株的肽SEQ ID NO:62和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:93为F。
在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,D90208株的肽SEQ ID NO:17从左计数第2个氨基酸为L,但其对D89815株的肽SEQ ID NO:49和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:98为F,以及D90208株的肽SEQ ID NO:17的结合实验值为6.98038,而其对D89815株的肽SEQ ID NO:49和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:98为7.7344,由此证实它们全部显示良好的结合性质。
此外,在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,D90208株的肽SEQ ID NO:40从左计数第7个氨基酸为C,但其对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:107为G,以及D90208株的肽SEQ ID NO:40的结合实验值为6.97507,而其对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:107为6.37373,由此证实它们全部显示良好的结合性质。
而且,在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,D90208株的肽SEQ ID NO:3从左计数第5个氨基酸为V,但其对D89815株的肽SEQ ID NO:71和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:110为A,以及D90208株的肽SEQ ID NO:3的结合实验值为6.14848,而其对D89815株的肽SEQ ID NO:71和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:110为6.75756,由此证实它们全部显示良好的结合性质。
此外,在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,D90208株的肽SEQ ID NO:2从左计数第7个氨基酸为D,但其对D89815株的肽SEQ ID NO:45和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:84为E,以及D90208株的肽SEQ ID NO:2的结合实验值为5.32417,而其对D89815株的肽SEQ ID NO:45和对pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:84为5.00343,由此证实它们全部显示良好的结合性质。
而且,在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,在pBRT703′X株(D89815的突变亚克隆)的肽SEQ ID NO:90和肽SEQ ID NO:112之间偏移一个的氨基酸序列,以及肽SEQ ID NO:90的结合实验值为6.51874,而肽SEQ ID NO:112的结合实验值为6.63423,由此证实它们全部显示良好的结合性质。
此外,在通过相互替换一个或两个氨基酸残基而形成的肽序列中,例如,在D90208株的肽SEQ ID NO:13和D89815株的肽SEQ ID NO:60与D90208株的肽SEQ ID NO:18和D89815株的肽SEQ ID NO:64之间偏移一个的氨基酸序列,以及肽SEQ ID NOS:13和60的结合实验值为7.89519,而肽SEQ ID NOS:18和64的结合实验值为5.9169,由此证实它们全部显示良好的结合性质。
由此可以预测,通过相互替换一个或两个氨基酸残基而形成的肽序列皆会显示优异的与HLA-A24型分子的结合。总之,由SEQ IDNOS:1-183所示,在与HLA-A型分子结合方面具有优异性质的氨基酸序列中,通过缺失,替换,或添加一个或数个氨基酸残基而形成的氨基酸序列,甚至同样可以预测显示优异的HLA-结合性质。
从另一点来看,正如通过假设预测,所述假设由实施方案1中所述的实验设计方法获得,在与HLA-A型分子结合方面具有优异性质的氨基酸序列中,通过缺失,替换,或添加一个或数个氨基酸残基而形成的氨基酸序列,甚至同样可以说显示优异的HLA-结合性质。被替换的氨基酸残基优选为相互具有相似性质的氨基酸残基,诸如两个皆为疏水氨基酸残基。
本发明参照上述实施例进行了解释。这些实施例仅列举成例子,本领域技术人员将理解,多种修改例是可能的,而这些修改例仍落入本发明的保护范围。
例如,在上述实施例中,采用HCV D90208株,D89815株,和pBRT703′X株(D89815的突变亚克隆),但也可采用另一HCV株。在此情形下,根据本发明所用的预测程序,可以高精度预测HLA结合。
序列表
<110>日本电气株式会社(NEC Corporation)
国立大学法人高知大学(Kochi University)
<120>HLA-结合肽,其前体,编码这些肽序列的DNA片段和重组载体(HLA binding peptides)
<130>SCT064984-47
<160>186
<170>PatentIn version 3.1
<210>1
<211>9
<212>PRT
<213>Hepatitis C virus
<400>1
Ile Leu Pro Cys Ser Phe Thr Thr Leu
1 5
<210>2
<211>9
<212>PRT
<213>Hepatitis C virus
<400>2
Val Ile Leu Asp Ser Phe Asp Pro Ile
1 5
<210>3
<211>9
<212>PRT
<213>Hepatitis C virus
<400>3
Arg Tyr Ala Pro Val Cys Lys Pro Leu
1 5
<210>4
<211>9
<212>PRT
<213>Hepatitis C virus
<400>4
Phe Trp Ala Lys His Met Trp Asn Phe
1 5
<210>5
<211>9
<212>PRT
<213>Hepatitis C virus
<400>5
Ala Leu Tyr Asp Val Val Ser Thr Leu
1 5
<210>6
<211>9
<212>PRT
<213>Hepatitis C virus
<400>6
Thr Val Leu Ser Asp Phe Lys Thr Trp
1 5
<210>7
<211>9
<212>PRT
<213>Hepatitis C virus
<400>7
Pro Tyr Ile Glu Gln Gly Met Gln Leu
1 5
<210>8
<211>9
<212>PRT
<213>Hepatitis C virus
<400>8
Trp His Tyr Pro Cys Thr Val Asn Phe
1 5
<210>9
<211>9
<212>PRT
<213>Hepatitis C virus
<400>9
Lys Phe Pro Pro Ala Leu Pro Ile Trp
1 5
<210>10
<211>9
<212>PRT
<213>Hepatitis C virus
<400>10
Thr Tyr Ser Thr Tyr Cys Lys Phe Leu
1 5
<210>11
<211>9
<212>PRT
<213>Hepatitis C virus
<400>11
Ala Tyr Ser Gln Gln Thr Arg Gly Leu
1 5
<210>12
<211>9
<212>PRT
<213>Hepatitis C virus
<400>12
Ala Gln Pro Gly Tyr Pro Trp Pro Leu
1 5
<210>13
<211>9
<212>PRT
<213>Hepatitis C virus
<400>13
Ile Leu Met Thr His Phe Phe Ser Ile
1 5
<210>14
<211>9
<212>PRT
<213>Hepatitis C virus
<400>14
Ser Tyr Thr Trp Thr Gly Ala Leu Ile
1 5
<210>15
<211>9
<212>PRT
<213>Hepatitis C virus
<400>15
Ser Pro Pro Ala Val Pro Gln Thr Phe
1 5
<210>16
<211>9
<212>PRT
<213>Hepatitis C virus
<400>16
Leu Leu Pro Arg Arg Gly Pro Arg Leu
1 5
<210>17
<211>9
<212>PRT
<213>Hepatitis C virus
<400>17
Ala Leu Tyr Gly Val Trp Pro Leu Leu
1 5
<210>18
<211>9
<212>PRT
<213>Hepatitis C virus
<400>18
Leu Met Thr His Phe Phe Ser Ile Leu
1 5
<210>19
<211>9
<212>PRT
<213>Hepatitis C virus
<400>19
Leu Leu Lys Arg Leu His Gln Trp Ile
1 5
<210>20
<211>9
<212>PRT
<213>Hepatitis C virus
<400>20
Tyr Ile Leu Leu Leu Phe Leu Leu Leu
1 5
<210>21
<211>9
<212>PRT
<213>Hepatitis C virus
<400>21
Ala Arg Pro Asp Tyr Asn Pro Pro Leu
1 5
<210>22
<211>9
<212>PRT
<213>Hepatitis C virus
<400>22
Ala Tyr Tyr Ser Met Val Gly Asn Trp
1 5
<210>23
<211>9
<212>PRT
<213>Hepatitis C virus
<400>23
Phe Leu Ala Arg Leu Ile Trp Trp Leu
1 5
<210>24
<211>9
<212>PRT
<213>Hepatitis C virus
<400>24
Ser Gln Leu Asp Leu Ser Gly Trp Phe
1 5
<210>25
<211>9
<212>PRT
<213>Hepatitis C virus
<400>25
Ser Met Leu Thr Asp Pro Ser His Ile
1 5
<210>26
<211>9
<212>PRT
<213>Hepatitis C virus
<400>26
Glu Tyr Ile Leu Leu Leu Phe Leu Leu
1 5
<210>27
<211>9
<212>PRT
<213>Hepatitis C virus
<400>27
Ile Leu Leu Gly Pro Ala Asp Ser Phe
1 5
<210>28
<211>9
<212>PRT
<213>Hepatitis C virus
<400>28
Leu Asn Pro Ser Val Ala Ala Thr Leu
1 5
<210>29
<211>9
<212>PRT
<213>Hepatitis C virus
<400>29
Gly Leu Leu Ser Phe Leu Val Phe Phe
1 5
<210>30
<211>9
<212>PRT
<213>Hepatitis C virus
<400>30
Tyr Val Tyr Asp His Leu Thr Pro Leu
1 5
<210>31
<211>9
<212>PRT
<213>Hepatitis C virus
<400>31
His Tyr Ala Pro Arg Pro Cys Gly Ile
1 5
<210>32
<211>9
<212>PRT
<213>Hepatitis C virus
<400>32
Gly Leu Ile His Leu His Arg Asn Ile
1 5
<210>33
<211>9
<212>PRT
<213>Hepatitis C virus
<400>33
His Tyr Arg Asp Val Leu Lys Glu Met
1 5
<210>34
<211>9
<212>PRT
<213>Hepatitis C virus
<400>34
Tyr Tyr Lys Val Phe Leu Ala Arg Leu
1 5
<210>35
<211>9
<212>PRT
<213>Hepatitis C virus
<400>35
Cys Met Val Asp Tyr Pro Tyr Arg Leu
1 5
<210>36
<211>9
<212>PRT
<213>Hepatitis C virus
<400>36
Ala Val Ile Pro Asp Arg Glu Val Leu
1 5
<210>37
<211>9
<212>PRT
<213>Hepatitis C virus
<400>37
Asn Phe Ser Arg Cys Trp Val Ala Leu
1 5
<210>38
<211>9
<212>PRT
<213>Hepatitis C virus
<400>38
Val Phe Ser Asp Met Glu Thr Lys Leu
1 5
<210>39
<211>9
<212>PRT
<213>Hepatitis C virus
<400>39
Val Trp Pro Leu Leu Leu Leu Leu Leu
1 5
<210>40
<211>9
<212>PRT
<213>Hepatitis C virus
<400>40
Ile Thr Tyr Ser Thr Tyr Cys Lys Phe
1 5
<210>41
<211>9
<212>PRT
<213>Hepatitis C virus
<400>41
Ile Glu Pro Leu Asp Leu Pro Gln Ile
1 5
<210>42
<211>9
<212>PRT
<213>Hepatitis C virus
<400>42
Leu Leu Ser Thr Thr Glu Trp Gln Ile
1 5
<210>43
<211>9
<212>PRT
<213>Hepatitis C virus
<400>43
Pro Leu Leu Arg Glu Glu Val Val Phe
1 5
<210>44
<211>9
<212>PRT
<213>Hepatitis C virus
<400>44
Ala Thr Pro Pro Gly Ser Ile Thr Val
1 5
<210>45
<211>9
<212>PRT
<213>Hepatitis C virus
<400>45
Val Ile Leu Asp Ser Phe Glu Pro Leu
1 5
<210>46
<211>9
<212>PRT
<213>Hepatitis C virus
<400>46
Ile Leu Pro Cys Ser Tyr Thr Thr Leu
1 5
<210>47
<211>9
<212>PRT
<213>Hepatitis C virus
<400>47
Phe Trp Ala Lys His Met Trp Asn Phe
1 5
<210>48
<211>9
<212>PRT
<213>Hepatitis C virus
<400>48
Ala Leu Tyr Asp Val Val Ser Thr Leu
1 5
<210>49
<211>9
<212>PRT
<213>Hepatitis C virus
<400>49
Ala Phe Tyr Gly Val Trp Pro Leu Leu
1 5
<210>50
<211>9
<212>PRT
<213>Hepatitis C virus
<400>50
Pro Tyr Ile Glu Gln Gly Met Gln Leu
1 5
<210>51
<211>9
<212>PRT
<213>Hepatitis C virus
<400>51
Thr Pro Pro Ala Val Pro Gln Thr Phe
1 5
<210>52
<211>9
<212>PRT
<213>Hepatitis C virus
<400>52
Trp His Tyr Pro Cys Thr Val Asn Phe
1 5
<210>53
<211>9
<212>PRT
<213>Hepatitis C virus
<400>53
Gly Ile Leu Pro Phe Phe Met Phe Phe
1 5
<210>54
<211>9
<212>PRT
<213>Hepatitis C virus
<400>54
Gly Leu Ile His Leu His Gln Asn Ile
1 5
<210>55
<211>9
<212>PRT
<213>Hepatitis C virus
<400>55
Leu Met Cys Ala Val His Pro Glu Leu
1 5
<210>56
<211>9
<212>PRT
<213>Hepatitis C virus
<400>56
Thr Val Leu Ala Asp Phe Lys Thr Trp
1 5
<210>57
<211>9
<212>PRT
<213>Hepatitis C virus
<400>57
Ala Tyr Ser Gln Gln Thr Arg Gly Leu
1 5
<210>58
<211>9
<212>PRT
<213>Hepatitis C virus
<400>58
Ala Gln Pro Gly Tyr Pro Trp Pro Leu
1 5
<210>59
<211>9
<212>PRT
<213>Hepatitis C virus
<400>59
Pro Leu Leu Arg Asp Glu Val Thr Phe
1 5
<210>60
<211>9
<212>PRT
<213>Hepatitis C virus
<400>60
Ile Leu Met Thr His Phe Phe Ser Ile
1 5
<210>61
<211>9
<212>PRT
<213>Hepatitis C virus
<400>61
Ser Tyr Thr Trp Thr Gly Ala Leu Ile
1 5
<210>62
<211>9
<212>PRT
<213>Hepatitis C virus
<400>62
Ala Thr Pro Pro Gly Ser Val Thr Phe
1 5
<210>63
<211>9
<212>PRT
<213>Hepatitis C virus
<400>63
Leu Leu Pro Arg Arg Gly Pro Arg Leu
1 5
<210>64
<211>9
<212>PRT
<213>Hepatitis C virus
<400>64
Leu Met Thr His Phe Phe Ser Ile Leu
1 5
<210>65
<211>9
<212>PRT
<213>Hepatitis C virus
<400>65
Leu Leu Lys Arg Leu His Gln Trp Ile
1 5
<210>66
<211>9
<212>PRT
<213>Hepatitis C virus
<400>66
Ala Arg Pro Asp Tyr Asn Pro Pro Leu
1 5
<210>67
<211>9
<212>PRT
<213>Hepatitis C virus
<400>67
Ala Tyr Tyr Ser Met Val Gly Asn Trp
1 5
<210>68
<211>9
<212>PRT
<213>Hepatitis C virus
<400>68
Lys Phe Pro Ala Ala Met Pro Val Trp
1 5
<210>69
<211>9
<212>PRT
<213>Hepatitis C virus
<400>69
Gln Tyr Thr Leu Leu Phe Asn Ile Leu
1 5
<210>70
<211>9
<212>PRT
<213>Hepatitis C virus
<400>70
Leu Val Pro Gly Ala Ala Tyr Ala Phe
1 5
<210>71
<211>9
<212>PRT
<213>Hepatitis C virus
<400>71
Arg Tyr Ala Pro Ala Cys Lys Pro Leu
1 5
<210>72
<211>9
<212>PRT
<213>Hepatitis C virus
<400>72
Phe Leu Ala Arg Leu Ile Trp Trp Leu
1 5
<210>73
<211>9
<212>PRT
<213>Hepatitis C virus
<400>73
Ser Gln Leu Asp Leu Ser Gly Trp Phe
1 5
<210>74
<211>9
<212>PRT
<213>Hepatitis C virus
<400>74
Ser Met Leu Thr Asp Pro Ser His Ile
1 5
<210>75
<211>9
<212>PRT
<213>Hepatitis C virus
<400>75
Ile Leu Leu Gly Pro Ala Asp Ser Phe
1 5
<210>76
<211>9
<212>PRT
<213>Hepatitis C virus
<400>76
Trp Leu Arg Asp Val Trp Asp Trp Ile
1 5
<210>77
<211>9
<212>PRT
<213>Hepatitis C virus
<400>77
Tyr Val Val Leu Leu Phe Leu Leu Leu
1 5
<210>78
<211>9
<212>PRT
<213>Hepatitis C virus
<400>78
Leu Asn Pro Ser Val Ala Ala Thr Leu
1 5
<210>79
<211>9
<212>PRT
<213>Hepatitis C virus
<400>79
Tyr Val Tyr Asp His Leu Thr Pro Leu
1 5
<210>80
<211>9
<212>PRT
<213>Hepatitis C virus
<400>80
His Tyr Arg Asp Val Leu Lys Glu Met
1 5
<210>81
<211>9
<212>PRT
<213>Hepatitis C virus
<400>81
Thr Leu Arg Arg His Val Asp Leu Leu
1 5
<210>82
<211>9
<212>PRT
<213>Hepatitis C virus
<400>82
Ala Val Ile Pro Asp Arg Glu Val Leu
1 5
<210>83
<211>9
<212>PRT
<213>Hepatitis C virus
<400>83
Phe Leu Ile Ser Gln Leu Phe Thr Phe
1 5
<210>84
<211>9
<212>PRT
<213>Hepatitis C virus
<400>84
Val Ile Leu Asp Ser Phe Glu Pro Leu
1 5
<210>85
<211>9
<212>PRT
<213>Hepatitis C virus
<400>85
Ile Leu Pro Cys Ser Tyr Thr Thr Leu
1 5
<210>86
<211>9
<212>PRT
<213>Hepatitis C virus
<400>86
Gly Ile Leu Pro Phe Phe Met Phe Phe
1 5
<210>87
<211>9
<212>PRT
<213>Hepatitis C virus
<400>87
Pro Leu Leu Arg Asp Glu Val Thr Phe
1 5
<210>88
<211>9
<212>PRT
<213>Hepatitis C virus
<400>88
Thr Pro Pro Ala Val Pro Gln Thr Phe
1 5
<210>89
<211>9
<212>PRT
<213>Hepatitis C virus
<400>89
Phe Trp Ala Lys His Met Trp Asn Phe
1 5
<210>90
<211>9
<212>PRT
<213>Hepatitis C virus
<400>90
Thr Val Leu Ala Asp Phe Lys Thr Trp
1 5
<210>91
<211>9
<212>PRT
<213>Hepatitis C virus
<400>91
Ala Leu Tyr Asp Val Val Ser Thr Leu
1 5
<210>92
<211>9
<212>PRT
<213>Hepatitis C virus
<400>92
Trp His Tyr Pro Cys Thr Val Asn Phe
1 5
<210>93
<211>9
<212>PRT
<213>Hepatitis C virus
<400>93
Ala Thr Pro Pro Gly Ser Val Thr Phe
1 5
<210>94
<211>9
<212>PRT
<213>Hepatitis C virus
<400>94
Gly Leu Ile His Leu His Gln Asn Ile
1 5
<210>95
<211>9
<212>PRT
<213>Hepatitis C virus
<400>95
Ala Tyr Ser Gln Gln Thr Arg Gly Leu
1 5
<210>96
<211>9
<212>PRT
<213>Hepatitis C virus
<400>96
Ile Leu Met Thr His Phe Phe Ser Ile
1 5
<210>97
<211>9
<212>PRT
<213>Hepatitis C virus
<400>97
Phe Leu Ala Arg Leu Ile Trp Trp Leu
1 5
<210>98
<211>9
<212>PRT
<213>Hepatitis C virus
<400>98
Ala Phe Tyr Gly Val Trp Pro Leu Leu
1 5
<210>99
<211>9
<212>PRT
<213>Hepatitis C virus
<400>99
Phe Leu Ile Ser Gln Leu Phe Thr Phe
1 5
<210>100
<211>9
<212>PRT
<213>Hepatitis C virus
<400>100
Ile Leu Leu Gly Pro Ala Asp Ser Phe
1 5
<210>101
<211>9
<212>PRT
<213>Hepatitis C virus
<400>101
Ser Met Leu Thr Asp Pro Ser His Ile
1 5
<210>102
<211>9
<212>PRT
<213>Hepatitis C virus
<400>102
Ala Tyr Tyr Ser Met Val Gly Asn Trp
1 5
<210>103
<211>9
<212>PRT
<213>Hepatitis C virus
<400>103
Gln Tyr Thr Leu Leu Phe Asn Ile Leu
1 5
<210>104
<211>9
<212>PRT
<213>Hepatitis C virus
<400>104
Leu Leu Lys Arg Leu His Gln Trp Ile
1 5
<210>105
<211>9
<212>PRT
<213>Hepatitis C virus
<400>105
Ser Gln Leu Asp Leu Ser Gly Trp Phe
1 5
<210>106
<211>9
<212>PRT
<213>Hepatitis C virus
<400>106
Trp Leu Arg Asp Val Trp Asp Trp Ile
1 5
<210>107
<211>9
<212>PRT
<213>Hepatitis C virus
<400>107
Ile Thr Tyr Ser Thr Tyr Gly Lys Phe
1 5
<210>108
<211>9
<212>PRT
<213>Hepatitis C virus
<400>108
Ser Tyr Thr Trp Thr Gly Ala Leu Ile
1 5
<210>109
<211>9
<212>PRT
<213>Hepatitis C virus
<400>109
Leu Leu Ser Thr Thr Glu Trp Gln Ile
1 5
<210>110
<211>9
<212>PRT
<213>Heatitis C virus
<400>110
Arg Tyr Ala Pro Ala Cys Lys Pro Leu
1 5
<210>111
<211>9
<212>PRT
<213>Hepatitis C virus
<400>111
Arg Leu Ile Trp Trp Leu Gln Tyr Phe
1 5
<210>112
<211>9
<212>PRT
<213>Hepatitis C virus
<400>112
Val Leu Ala Asp Phe Lys Thr Trp Leu
1 5
<210>113
<211>9
<212>PRT
<213>Hepatitis C virus
<400>113
Leu Val Pro Gly Ala Ala Tyr Ala Phe
1 5
<210>114
<211>9
<212>PRT
<213>Hepatitis C virus
<400>114
Asp Leu Pro Gln Ile Ile Gln Arg Leu
1 5
<210>115
<211>9
<212>PRT
<213>Hepatitis C virus
<400>115
Trp Ile Cys Thr Val Leu Ala Asp Phe
1 5
<210>116
<211>9
<212>PRT
<213>Hepatitis C virus
<400>116
Phe Tyr Gly Val Trp Pro Leu Leu Leu
1 5
<210>117
<211>9
<212>PRT
<213>Hepatitis C virus
<400>117
Leu Leu Leu Ser Ile Leu Gly Pro Leu
1 5
<210>118
<211>9
<212>PRT
<213>Hepatitis C virus
<400>118
His Tyr Arg Asp Val Leu Lys Glu Met
1 5
<210>119
<211>9
<212>PRT
<213>Hepatitis C virus
<400>119
Leu Ile Trp Trp Leu Gln Tyr Phe Ile
1 5
<210>120
<211>9
<212>PRT
<213>Hepatitis C virus
<400>120
Ala Val Ile Pro Asp Arg Glu Val Leu
1 5
<210>121
<211>9
<212>PRT
<213>Hepatitis C virus
<400>121
Thr Arg Pro Pro His Gly Asn Trp Phe
1 5
<210>122
<211>9
<212>PRT
<213>Hepatitis C virus
<400>122
Lys Phe Pro Ala Ala Met Pro Val Trp
1 5
<210>123
<211>9
<212>PRT
<213>Hepatitis C virus
<400>123
Val Phe Pro Asp Leu Gly Val Arg Val
1 5
<210>124
<211>9
<212>PRT
<213>Hepatitis C virus
<400>124
Lys Leu Leu Pro Arg Leu Pro Gly Val
1 5
<210>125
<211>9
<212>PRT
<213>Hepatitis C virus
<400>125
Asp Met Pro Ser Thr Glu Asp Leu Val
1 5
<210>126
<211>9
<212>PRT
<213>Hepatitis C virus
<400>126
Tyr Leu Tyr Gly Ile Gly Ser Ala Val
1 5
<210>127
<211>9
<212>PRT
<213>Hepatitis C virus
<400>127
Tyr Leu Asn Thr Pro Gly Leu Pro Val
1 5
<210>128
<211>9
<212>PRT
<213>Hepatitis C virus
<400>128
Cys Leu Leu Leu Leu Ser Val Gly Val
1 5
<210>129
<211>9
<212>PRT
<213>Hepatitis C virus
<400>129
Leu Leu Leu Ser Val Gly Val Gly Ile
1 5
<210>130
<211>9
<212>PRT
<213>Hepatitis C virus
<400>130
Leu Leu Cys Pro Ser Gly His Val Val
1 5
<210>131
<211>9
<212>PRT
<213>Hepatitis C virus
<400>131
Ala Ile Leu Ser Pro Gly Ala Leu Val
1 5
<210>132
<211>9
<212>PRT
<213>Hepatitis C virus
<400>132
Ser Leu Ile Arg Val Pro Tyr Phe Val
1 5
<210>133
<211>9
<212>PRT
<213>Hepatitis C virus
<400>133
Asp Val Trp Asp Trp Ile Cys Thr Val
1 5
<210>134
<211>9
<212>PRT
<213>Heatitis C virus
<400>134
Val Ile Pro Ala Ser Gly Asp Val Val
1 5
<210>135
<211>9
<212>PRT
<213>Hepatitis C virus
<400>135
Arg Ala Leu Ala His Gly Val Arg Val
1 5
<210>136
<211>9
<212>PRT
<213>Hepatitis C virus
<400>136
Leu Ser Asp Gly Ser Trp Ser Thr Val
1 5
<210>137
<211>9
<212>PRT
<213>Hepatitis C virus
<400>137
Lys Leu Gln Asp Cys Thr Met Leu Val
1 5
<210>138
<211>9
<212>PRT
<213>Hepatitis C virus
<400>138
Tyr Cys Leu Thr Thr Gly Ser Val Val
1 5
<210>139
<211>9
<212>PRT
<213>Hepatitis C virus
<400>139
Ser Met Leu Thr Asp Pro Ser His Ile
1 5
<210>140
<211>9
<212>PRT
<213>Hepatitis C virus
<400>140
Ala Ala Phe Cys Ser Ala Met Tyr Val
1 5
<210>141
<211>9
<212>PRT
<213>Hepatitis C virus
<400>141
Tyr Ser Pro Gly Glu Ile Asn Arg Val
1 5
<210>142
<211>9
<212>PRT
<213>Hepatitis C virus
<400>142
Tyr Thr Asn Val Asp Gln Asp Leu Val
1 5
<210>143
<211>9
<212>PRT
<213>Hepatitis C virus
<400>143
Leu Arg Asp Glu Val Thr Phe Gln Val
1 5
<210>144
<211>9
<212>PRT
<213>Hepatitis C virus
<400>144
Leu Ala Ala Leu Thr Gly Thr Tyr Val
1 5
<210>145
<211>9
<212>PRT
<213>Hepatitis C virus
<400>145
Cys Glu Pro Glu Pro Asp Val Thr Val
1 5
<210>146
<211>9
<212>PRT
<213>Hepatitis C virus
<400>146
Cys Met Ser Ala Asp Leu Glu Val Val
1 5
<210>147
<211>9
<212>PRT
<213>Hepatitis C virus
<400>147
Val Phe Pro Asp Leu Gly Val Arg Val
1 5
<210>148
<211>9
<212>PRT
<213>Hepatitis C virus
<400>148
Tyr Cys Phe Thr Pro Ser Pro Val Val
1 5
<210>149
<211>9
<212>PRT
<213>Hepatitis C virus
<400>149
Val Leu Gln Ala Ser Leu Ile Arg Val
1 5
<210>150
<211>9
<212>PRT
<213>Hepatitis C virus
<400>150
Lys Gln Ala Glu Ala Ala Ala Pro Val
1 5
<210>151
<211>9
<212>PRT
<213>Hepatitis C virus
<400>151
Leu Leu Leu Ala Leu Pro Pro Arg Ala
1 5
<210>152
<211>9
<212>PRT
<213>Hepatitis C virus
<400>152
Val Leu As Asp His Tyr Arg As Val
1 5
<210>153
<211>9
<212>PRT
<213>Hepatitis C virus
<400>153
Phe Ser Pro Arg Arg His Glu Thr Val
1 5
<210>154
<211>9
<212>PRT
<213>Heatitis C virus
<400>154
Ser Val Ile Asp Cys Asn Thr Cys Val
1 5
<210>155
<211>9
<212>PRT
<213>Heatitis C virus
<400>155
Gly Leu Ile Arg Ala Cys Thr Leu Val
1 5
<210>156
<211>9
<212>PRT
<213>Hepatitis C virus
<400>156
Thr Val Asn Phe Thr Ile Phe Lys Val
1 5
<210>157
<211>9
<212>PRT
<213>Hepatitis C virus
<400>157
Glu Met Gly Gly Asn Ile Thr Arg Val
1 5
<210>158
<211>9
<212>PRT
<213>Hepatitis C virus
<400>158
Pro Leu Leu Arg His His Asn Met Val
1 5
<210>159
<211>9
<212>PRT
<213>Hepatitis C virus
<400>159
Gln Leu Asp Leu Ser Gly Trp Phe Val
1 5
<210>160
<211>9
<212>PRT
<213>Hepatitis C virus
<400>160
Thr Leu Ala Ala Arg Asn Ala Ser Val
1 5
<210>161
<211>9
<212>PRT
<213>Hepatitis C virus
<400>161
Arg Leu Gly Ala Val Gln Asn Glu Val
1 5
<210>162
<211>9
<212>PRT
<213>Hepatitis C virus
<400>162
Val Ile Leu Asp Ser Phe Glu Pro Leu
1 5
<210>163
<211>9
<212>PRT
<213>Hepatitis C virus
<400>163
Ala Ala Leu Glu Asn Leu Val Val Leu
1 5
<210>164
<211>9
<212>PRT
<213>Hepatitis C virus
<400>164
Leu Leu Glu Asp Thr Asp Thr Pro Ile
1 5
<210>165
<211>9
<212>PRT
<213>Hepatitis C virus
<400>165
Val Val Thr Ser Thr Trp Val Leu Val
1 5
<210>166
<211>9
<212>PRT
<213>Hepatitis C virus
<400>166
Phe Ser Leu Asp Pro Thr Phe Thr Ile
1 5
<210>167
<211>9
<212>PRT
<213>Hepatitis C virus
<400>167
Thr Ile Pro Ala Ser Ala Tyr Glu Val
1 5
<210>168
<211>9
<212>PRT
<213>Hepatitis C virus
<400>168
Asp Leu Leu Glu Asp Thr Asp Thr Pro
1 5
<210>169
<211>9
<212>PRT
<213>Hepatitis C virus
<400>169
Leu Leu Leu Ser Ile Leu Gly Pro Leu
1 5
<210>170
<211>9
<212>PRT
<213>Hepatitis C virus
<400>170
Val Leu Ala Asp Phe Lys Thr Trp Leu
1 5
<210>171
<211>9
<212>PRT
<213>Hepatitis C virus
<400>171
Ser Ile Leu Gly Ile Gly Thr Val Leu
1 5
<210>172
<211>9
<212>PRT
<213>Hepatitis C virus
<400>172
Ala Gly Asp Asn Phe Pro Tyr Leu Val
1 5
<210>173
<211>9
<212>PRT
<213>Hepatitis C virus
<400>173
Ile Leu Pro Cys Ser Tyr Thr Thr Leu
1 5
<210>174
<211>9
<212>PRT
<213>Hepatitis C virus
<400>174
Val Ala Ala Glu Glu Tyr Val Glu Val
1 5
<210>175
<211>9
<212>PRT
<213>Hepatitis C virus
<400>175
Leu Ala Val Ala Val Glu Pro Val Val
1 5
<210>176
<211>9
<212>PRT
<213>Hepatitis C virus
<400>176
Ala Leu Tyr Asp Val Val Ser Thr Leu
1 5
<210>177
<211>9
<212>PRT
<213>Hepatitis C virus
<400>177
Phe Leu Ala Arg Leu Ile Trp Trp Leu
1 5
<210>178
<211>9
<212>PRT
<213>Hepatitis C virus
<400>178
Arg Leu Leu Ala Pro Ile Thr Ala Tyr
1 5
<210>179
<211>9
<212>PRT
<213>Hepatitis C virus
<400>179
Trp Leu Arg Asp Val Trp Asp Trp Ile
1 5
<210>180
<211>9
<212>PRT
<213>Hepatitis C virus
<400>180
Cys Val Asn Gly Ala Cys Trp Thr Val
1 5
<210>181
<211>9
<212>PRT
<213>Hepatitis C virus
<400>181
Tyr Val Tyr Asp His Leu Thr Pro Leu
1 5
<210>182
<211>9
<212>PRT
<213>Hepatitis C virus
<400>182
Thr Val Val Leu Thr Glu Ser Thr Val
1 5
<210>183
<211>9
<212>PRT
<213>Hepatitis C virus
<400>183
Ala Ala Arg Ala Leu Ala His Gly Val
1 5
<210>184
<211>3010
<212>PRT
<213>Hepatitis C virus
<400>184
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr
180 185 190
Glu Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser
195 200 205
Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro
210 215 220
Gly Cys Val Pro Cys Val Arg Glu Ser Asn Phe Ser Arg Cys Trp Val
225 230 235 240
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Thr
245 250 255
Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys
260 265 270
Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser
275 280 285
Gln Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln Asp Cys
290 295 300
Asn Cys Ser Ile Tyr Pro Gly His Val Ser Gly His Arg Met Ala Trp
305 310 315 320
Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln
325 330 335
Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His
340 345 350
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365
Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380
Thr His Val Thr Gly Gly Arg Val Ala Ser Ser Thr Gln Ser Leu Val
385 390 395 400
Ser Trp Leu Ser Gln Gly Pro Ser Gln Lys Ile Gln Leu Val Asn Thr
405 410 415
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430
Leu Gln Thr Gly Phe Ile Ala Ala Leu Phe Tyr Ala His Arg Phe Asn
435 440 445
Ala Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Pro Ile Asp Glu
450 455 460
Phe Ala Gln Gly Trp Gly Pro Ile Thr His Asp Met Pro Glu Ser Ser
465 470 475 480
Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile
485 490 495
Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Ser
515 520 525
Trp Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Ser Asn Thr Arg Pro
530 535 540
Pro Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe
545 550 555 560
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Val Gly Asn
565 570 575
Asn Thr Leu Val Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
580 585 590
Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
595 600 605
Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620
Thr Val Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu
625 630 635 640
Asn Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655
Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670
Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685
Leu Ile His Leu His Arg Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
690 695 700
Ile Gly Ser Ala Val Val Ser Phe Ala Ile Lys Trp Glu Tyr Ile Leu
705 710 715 720
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp
725 730 735
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Thr Leu Glu Asn Leu Val
740 745 750
Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly Leu Leu Ser Phe
755 760 765
Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro
770 775 780
Gly Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800
Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala
805 810 815
Ser Cys Gly Gly Ala Val Phe Val Gly Leu Val Leu Leu Thr Leu Ser
820 825 830
Pro Tyr Tyr Lys Val Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr
835 840 845
Phe Ile Thr Arg Ala Glu Ala His Leu Gln Val Trp Val Pro Pro Leu
850 855 860
Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala Val
865 870 875 880
His Pro Glu Leu Ile Phe Asp Ile Thr Lys Leu Leu Leu Ala Ile Leu
885 890 895
Gly Pro Leu Met Val Leu Gln Ala Gly Ile Thr Arg Val Pro Tyr Phe
900 905 910
Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Val
915 920 925
Ala Gly Gly His Tyr Val Gln Met Ala Phe Met Lys Leu Ala Ala Leu
930 935 940
Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala
945 950 955 960
His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe
965 970 975
Ser Asp Met Glu Thr Lys Leu Ile Thr Trp Gly Ala Asp Thr Ala Ala
980 985 990
Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly Lys
995 1000 1005
Glu Ile Leu Leu Gly Pro Ala Asp Ser Phe Gly Glu Gln Gly Trp
1010 1015 1020
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly
1025 1030 1035
Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn
1040 1045 1050
Gln Val Asp Gly Glu Val Gln Val Leu Ser Thr Ala Thr Gln Ser
1055 1060 1065
Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His
1070 1075 1080
Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr
1085 1090 1095
Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala
1100 1105 1110
Pro Pro Gly Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser
1115 1120 1125
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Val Pro Val Arg
1130 1135 1140
Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile
1145 1150 1155
Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser
1160 1165 1170
Gly His Val Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly
1175 1180 1185
Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Met Glu Thr
1190 1195 1200
Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro Ala
1205 1210 1215
Val Pro Gln Thr Phe Gln Val Ala His Leu His Ala Pro Thr Gly
1220 1225 1230
Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gln Gly
1235 1240 1245
Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly
1250 1255 1260
Phe Gly Ala Tyr Met Ser Lys Ala His Gly Ile Glu Pro Asn Ile
1265 1270 1275
Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Gly Pro Ile Thr Tyr
1280 1285 1290
Ser Thr Tyr Cys Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly
1295 1300 1305
Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ser
1310 1315 1320
Thr Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr
1325 1330 1335
Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly
1340 1345 1350
Ser Ile Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser
1355 1360 1365
Asn Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Ile Glu
1370 1375 1380
Ala Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys
1385 1390 1395
Lys Cys Asp Glu Leu Ala Ala Lys Leu Thr Gly Leu Gly Leu Asn
1400 1405 1410
Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr
1415 1420 1425
Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly
1430 1435 1440
Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val
1445 1450 1455
Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu
1460 1465 1470
Thr Thr Thr Leu Pro Gln Asp Ala Val Ser Arg Ala Gln Arg Arg
1475 1480 1485
Gly Arg Thr Gly Arg Gly Arg Ser Gly Ile Tyr Arg Phe Val Thr
1490 1495 1500
Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys
1505 1510 1515
Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala
1520 1525 1530
Glu Thr Ser Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu
1535 1540 1545
Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr
1550 1555 1560
Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln
1565 1570 1575
Ala Gly Asp Asn Leu Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val
1580 1585 1590
Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp
1595 1600 1605
Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro
1610 1615 1620
Leu Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Val Thr Leu Thr
1625 1630 1635
His Pro Ile Thr Lys Tyr Ile Met Ala Cys Met Ser Ala Asp Leu
1640 1645 1650
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala
1655 1660 1665
Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val Ile Val
1670 1675 1680
Gly Arg Ile Ile Leu Ser Gly Arg Pro Ala Val Ile Pro Asp Arg
1685 1690 1695
Glu Val Leu Tyr Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser
1700 1705 1710
His Leu Pro Tyr Ile Glu Gln Gly Met Gln Leu Ala Glu Gln Phe
1715 1720 1725
Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Thr Lys Gln Ala
1730 1735 1740
Glu Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Ala Leu Glu
1745 1750 1755
Val Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln
1760 1765 1770
Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala
1775 1780 1785
Ser Leu Met Ala Phe Thr Ala Ser Ile Thr Ser Pro Leu Thr Thr
1790 1795 1800
Gln Asn Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala
1805 1810 1815
Gln Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe Val Gly Ala Gly
1820 1825 1830
Ile Ala Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu
1835 1840 1845
Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu
1850 1855 1860
Val Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu Asp
1865 1870 1875
Leu Val Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val
1880 1885 1890
Val Gly Val Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro
1895 1900 1905
Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala
1910 1915 1920
Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser
1925 1930 1935
Asp Ala Ala Ala Arg Val Thr Gln Ile Leu Ser Ser Leu Thr Ile
1940 1945 1950
Thr Gln Leu Leu Lys Arg Leu His Gln Trp Ile Asn Glu Asp Cys
1955 1960 1965
Ser Thr Pro Cys Ser Gly Ser Trp Leu Lys Asp Val Trp Asp Trp
1970 1975 1980
Ile Cys Thr Val Leu Ser Asp Phe Lys Thr Trp Leu Gln Ser Lys
1985 1990 1995
Leu Leu Pro Arg Leu Pro Gly Leu Pro Phe Leu Ser Cys Gln Arg
2000 2005 2010
Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile Met Gln Thr Thr
2015 2020 2025
Cys Pro Cys Gly Ala Gln Ile Thr Gly His Val Lys Asn Gly Ser
2030 2035 2040
Met Arg Ile Val Gly Pro Lys Thr Cys Ser Asn Thr Trp His Gly
2045 2050 2055
Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Ser
2060 2065 2070
Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val Ala Ala Glu
2075 2080 2085
Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr Val Thr
2090 2095 2100
Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gln Val Pro Ala
2105 2110 2115
Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr
2120 2125 2130
Ala Pro Val Cys Lys Pro Leu Leu Arg Glu Glu Val Val Phe Gln
2135 2140 2145
Val Gly Leu Asn Gln Tyr Leu Val Gly Ser Gln Leu Pro Cys Glu
2150 2155 2160
Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro
2165 2170 2175
Ser His Ile Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly
2180 2185 2190
Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gln Leu Ser Ala
2195 2200 2205
Pro Ser Leu Lys Ala Thr Cys Thr Thr His His Asp Ser Pro Asp
2210 2215 2220
Ala Asp Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu Met Gly
2225 2230 2235
Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Val Val Ile Leu
2240 2245 2250
Asp Ser Phe Asp Pro Ile Arg Ala Val Glu Asp Glu Arg Glu Ile
2255 2260 2265
Ser Val Pro Ala Glu Ile Leu Arg Lys Pro Arg Lys Phe Pro Pro
2270 2275 2280
Ala Leu Pro Ile Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu
2285 2290 2295
Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro Pro Val Val His Gly
2300 2305 2310
Cys Pro Leu Pro Ser Thr Lys Ala Pro Pro Ile Pro Pro Pro Arg
2315 2320 2325
Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala
2330 2335 2340
Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Gly Ser Ser
2345 2350 2355
Ala Val Asp Ser Gly Thr Ala Thr Gly Pro Pro Asp Gln Ala Ser
2360 2365 2370
Asp Asp Gly Asp Lys Gly Ser Asp Val Glu Ser Tyr Ser Ser Met
2375 2380 2385
Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly
2390 2395 2400
Ser Trp Ser Thr Val Ser Gly Glu Ala Gly Glu Asp Val Val Cys
2405 2410 2415
Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys
2420 2425 2430
Ala Ala Glu Glu Ser Lys Leu Pro Ile Asn Pro Leu Ser Asn Ser
2435 2440 2445
Leu Leu Arg His His Ser Met Val Tyr Ser Thr Thr Ser Arg Ser
2450 2455 2460
Ala Ser Leu Arg Gln Lys Lys Val Thr Phe Asp Arg Leu Gln Val
2465 2470 2475
Leu Asp Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys
2480 2485 2490
Ala Ser Thr Val Lys Ala Arg Leu Leu Ser Ile Glu Glu Ala Cys
2495 2500 2505
Lys Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly
2510 2515 2520
Ala Lys Asp Val Arg Ser Leu Ser Ser Arg Ala Val Asn His Ile
2525 2530 2535
Arg Ser Val Trp Glu Asp Leu Leu Glu Asp Thr Glu Thr Pro Ile
2540 2545 2550
Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro
2555 2560 2565
Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp
2570 2575 2580
Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
2585 2590 2595
Ser Thr Leu Pro Gln Ala Val Met Gly Pro Ser Tyr Gly Phe Gln
2600 2605 2610
Tyr Ser Pro Gly Gln Arg Val Glu Phe Leu Val Asn Thr Trp Lys
2615 2620 2625
Ser Lys Lys Cys Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe
2630 2635 2640
Asp Ser Thr Val Thr Glu Asn Asp Ile Arg Thr Glu Glu Ser Ile
2645 2650 2655
Tyr Gln Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Arg
2660 2665 2670
Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Leu Thr Asn Ser
2675 2680 2685
Lys Gly Gln Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val
2690 2695 2700
Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala
2705 2710 2715
Thr Ala Ala Cys Arg Ala Ala Lys Leu Gln Asp Cys Thr Met Leu
2720 2725 2730
Val Asn Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly Thr
2735 2740 2745
Gln Glu Asp Ala Ala Ala Leu Arg Ala Phe Thr Glu Ala Met Thr
2750 2755 2760
Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp
2765 2770 2775
Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His
2780 2785 2790
Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr
2795 2800 2805
Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Thr Pro
2810 2815 2820
Val Asn Ser Trp Leu Gly Asn Ile Ile Met Tyr Ala Pro Thr Leu
2825 2830 2835
Trp Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Ile Leu Leu
2840 2845 2850
Ala Gln Glu Gln Leu Glu Lys Ala Leu Asp Cys Gln Ile Tyr Gly
2855 2860 2865
Ala Cys Tyr Ser Ile Glu Pro Leu Asp Leu Pro Gln Ile Ile Glu
2870 2875 2880
Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro
2885 2890 2895
Gly Glu Ile Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val
2900 2905 2910
Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala
2915 2920 2925
Lys Leu Leu Ser Gln Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr
2930 2935 2940
Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Ile
2945 2950 2955
Pro Ala Ala Ser Gln Leu Asp Leu Ser Gly Trp Phe Val Ala Gly
2960 2965 2970
Tyr Ash Gly Gly Asp Ile Tyr His Ser Leu Ser Arg Ala Arg Pro
2975 2980 2985
Arg Trp Phe Met Leu Cys Leu Leu Leu Leu Ser Val Gly Val Gly
2990 2995 3000
Ile Tyr Leu Leu Pro Asn Arg
3005 3010
<210>185
<211>3010
<212>PRT
<213>Hepatitis C virus
<400>185
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala AlaArg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr
180 185 190
Glu Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser
195 200 205
Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Ala Pro
210 215 220
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val
225 230 235 240
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr
245 250 255
Thr Leu Arg Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys
260 265 270
Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Ile Ser
275 280 285
Gln Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300
Asn Cys Ser Ile Tyr Pro Gly His Val Ser Gly His Arg Met Ala Trp
305 310 315 320
Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ser Gln
325 330 335
Leu Leu Arg Ile Pro Gln Ala Val Met Asp Met Val Ala Gly Ala His
340 345 350
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365
Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380
Thr Arg Val Thr Gly Gly Val Gln Gly His Val Thr Ser Thr Leu Thr
385 390 395 400
Ser Leu Phe Arg Pro Gly Ala Ser Gln Lys Ile Gln Leu Val Asn Thr
405 410 415
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430
Leu Lys Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Lys Phe Asn
435 440 445
Ala Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460
Phe Asp Gln Gly Trp Gly Pro Ile Thr Tyr Ala Gln Pro Asp Asn Ser
465 470 475 480
Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Gln Cys Gly Ile
485 490 495
Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn
515 520 525
Trp Gly Asp Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro
530 535 540
Pro His Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe
545 550 555 560
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Arg Gly Val Gly Asn
565 570 575
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala
580 585 590
Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu
595 600 605
Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620
Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu
625 630 635 640
Asp Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655
Arg Asp Arg Ala Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670
Gln Ile Leu Pro Cys Ser Tyr Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685
Leu Ile His Leu His Gln Asn Ile Val Asp Ile Gln Tyr Leu Tyr Gly
690 695 700
Ile Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Val
705 710 715 720
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp
725 730 735
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750
Val Leu Asn Ala Ala Ser Val Val Gly Ala His Gly Met Leu Pro Phe
755 760 765
Phe Met Phe Phe Cys Ala Ala Trp Tyr Met Lys Gly Arg Leu Val Pro
770 775 780
Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800
Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Val Ala
805 810 815
Ser Cys Gly Gly Gly ValPhe Val Gly Leu Ala Leu Leu Thr Leu Ser
820 825 830
Pro Tyr Cys Lys Val Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr
835 840 845
Phe Ile Thr Lys Ala Glu Ala His Leu Gln Val Ser Leu Pro Pro Leu
850 855 860
Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Ala Val
865 870 875 880
His Pro Glu Leu Ile Phe Asp Ile Thr Lys Leu Leu Leu Ser Ile Leu
885 890 895
Gly Pro Leu Met Val Leu Gln Ala Ser Leu Ile Arg Val Pro Tyr Phe
900 905 910
Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Ala
915 920 925
Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys Leu Ala Ala Leu
930 935 940
Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Gln Asp Trp Ala
945 950 955 960
His Val Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe
965 970 975
Ser Ala Met Glu Thr Lys Val Ile Thr Trp Gly Ala Asp Thr Ala Ala
980 985 990
Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly Lys
995 1000 1005
Glu Ile Leu Leu Gly Pro Ala Asp Ser Phe Glu Gly Gln Gly Trp
1010 1015 1020
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly
1025 1030 1035
Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn
1040 1045 1050
Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Lys Gln Ser
1055 1060 1065
Phe Leu Ala Thr Cys Val Asn Gly Ala Cys Trp Thr Val Phe His
1070 1075 1080
Gly Ala Gly Ser Lys Thr Leu Ala Ala Ala Lys Gly Pro Ile Thr
1085 1090 1095
Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala
1100 1105 1110
Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser
1115 1120 1125
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg
1130 1135 1140
Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile
1145 1150 1155
Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser
1160 1165 1170
Gly His Val Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly
1175 1180 1185
Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Met Glu Thr
1190 1195 1200
Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Thr Pro Pro Ala
1205 1210 1215
Val Pro Gln Thr Phe Gln Val Ala His Leu His Ala Pro Thr Gly
1220 1225 1230
Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gln Gly
1235 1240 1245
Tyr Mer Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly
1250 1255 1260
Phe Gly Ala Tyr Met Ser Lys Ala His Gly Ile Asp Pro Asn Ile
1265 1270 1275
Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ala Pro Ile Thr Tyr
1280 1285 1290
Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly
1295 1300 1305
Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ser
1310 1315 1320
Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr
1325 1330 1335
Val Gly Ala Arg Phe Val Val Leu Ala Thr Ala Thr Pro Pro Gly
1340 1345 1350
Ser Ile Thr Phe Pro His Pro Asn Ile Glu Glu Val Pro Leu Ala
1355 1360 1365
Asn Thr Gly Glu Ile Pro Phe Tyr Ala Lys Thr Ile Pro Ile Glu
1370 1375 1380
Val Ile Arg Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys
1385 1390 1395
Lys Cys Asp Glu Leu Pro Ala Lys Leu Ser Ala Leu Gly Leu Asn
1400 1405 1410
Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Ala
1415 1420 1425
Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly
1430 1435 1440
Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val
1445 1450 1455
Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu
1460 1465 1470
Thr Thr Thr Val Pro Gln Asp Ala Val Ser Arg Thr Gln Arg Arg
1475 1480 1485
Gly Arg Thr Gly Arg Gly Arg Arg Gly Ile Tyr Arg Phe Val Thr
1490 1495 1500
Pro Gly Glu Arg Pro Ser Ala Met Phe Asp Ser Ser Val Leu Cys
1505 1510 1515
Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala
1520 1525 1530
Glu Thr Ser Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu
1535 1540 1545
Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr
1550 1555 1560
Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln
1565 1570 1575
Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val
1580 1585 1590
Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Gln Met Trp
1595 1600 1605
Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro
1610 1615 1620
Leu Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Val Thr Leu Thr
1625 1630 1635
His Pro Ile Thr Lys Tyr Ile Met Ala Cys Met Ser Ala Asp Leu
1640 1645 1650
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala
1655 1660 1665
Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val Ile Val
1670 1675 1680
Gly Arg Ile Ile Leu Ser Gly Arg Pro Ala Val Ile Pro Asp Arg
1685 1690 1695
Glu Val Leu Tyr Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser
1700 1705 1710
His Leu Pro Tyr Ile Glu Gln Gly Met Gln Leu Ala Glu Gln Phe
1715 1720 1725
Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Thr Lys Gln Ala
1730 1735 1740
Glu Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Ala Leu Glu
1745 1750 1755
Thr Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln
1760 1765 1770
Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala
1775 1780 1785
Ser Leu Met Ala Phe Thr Ala Ser Ile Thr Ser Pro Leu Ala Thr
1790 1795 1800
Gln Tyr Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala
1805 1810 1815
Gln Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe Val Gly Ala Gly
1820 1825 1830
Ile Ala Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu
1835 1840 1845
Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu
1850 1855 1860
Val Ala Phe Lys Val Met Ser Gly Asp Met Pro Ser Thr Glu Asp
1865 1870 1875
Leu Val Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val
1880 1885 1890
Val Gly Val Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro
1895 1900 1905
Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala
1910 1915 1920
Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser
1925 1930 1935
Asp Ala Ala Ala Arg Val Thr Gln Ile Leu Ser Asn Leu Thr Ile
1940 1945 1950
Thr Gln Leu Leu Lys Arg Leu His Gln Trp Ile Asn Glu Asp Cys
1955 1960 1965
Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp
1970 1975 1980
Ile Cys Thr Val Leu Ala Asp Phe Lys Thr Trp Leu Gln Ser Lys
1985 1990 1995
Leu Leu Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gln Arg
2000 2005 2010
Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile Met Tyr Thr Thr
2015 2020 2025
Cys Pro Cys Gly Ala Gln Ile Thr Gly His Val Lys Asn Gly Ser
2030 2035 2040
Met Arg Ile Val Gly Pro Arg Thr Cys Ser Asn Thr Trp His Gly
2045 2050 2055
Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Ser
2060 2065 2070
Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val Ala Ala Glu
2075 2080 2085
Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr Val Thr
2090 2095 2100
Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gln Val Pro Ala
2105 2110 2115
Pro Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Tyr
2120 2125 2130
Ala Pro Ala Cys Lys Pro Leu Leu Arg Asp Glu Val Thr Phe Gln
2135 2140 2145
Val Gly Leu Asn Gln Tyr Thr Val Gly Ser Gln Leu Pro Cys Glu
2150 2155 2160
Pro Glu Pro Asp Val Thr Val Val Thr Ser Met Leu Thr Asp Pro
2165 2170 2175
Ser His Ile Thr Ala Glu Ala Ala Arg Arg Arg Leu Ala Arg Gly
2180 2185 2190
Ser Pro Pro Ser Leu Ala Gly Ser Ser Ala Ser Gln Leu Ser Ala
2195 2200 2205
Leu Ser Leu Lys Ala Thr Cys Thr Thr His His Gly Ala Pro Asp
2210 2215 2220
Thr Asp Leu lle Glu Ala Asn Leu Leu Trp Arg Gln Glu Met Gly
2225 2230 2235
Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Ile Val Ile Leu
2240 2245 2250
Asp Ser Phe Glu Pro Leu Arg Ala Glu Glu Asp Glu Arg Glu Val
2255 2260 2265
Ser Ala Ala Ala Glu Ile Leu Arg Lys Thr Arg Lys Phe Pro Ala
2270 2275 2280
Ala Met Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu
2285 2290 2295
Glu Ser Trp Lys Asn Pro Asp Tyr Val Pro Pro Val Val His Gly
2300 2305 2310
Cys Pro Leu Pro Pro Thr Lys Ala Pro Pro Ile Pro Pro Pro Arg
2315 2320 2325
Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala
2330 2335 2340
Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Gly Ser Gly Ser Ser
2345 2350 2355
Ala Val Asp Ser Gly Thr Ala Thr Gly Pro Pro Asp Gln Ala Ser
2360 2365 2370
Ala Glu Gly Asp Ala Gly Ser Asp Ala Glu Ser Tyr Ser Ser Met
2375 2380 2385
Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly
2390 2395 2400
Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys
2405 2410 2415
Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys
2420 2425 2430
Ala Ala Glu Glu Ser Lys Leu Pro Ile Asn Ala Leu Ser Asn Pro
2435 2440 2445
Leu Leu Arg His His Asn Met Val Tyr Ser Thr Thr Ser Arg Ser
2450 2455 2460
Ala Ser Leu Arg Gln Lys Lys Val Thr Phe Asp Arg Met Gln Val
2465 2470 2475
Leu Asp Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys
2480 2485 2490
Ala Ser Thr Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys
2495 2500 2505
Lys Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly
2510 2515 2520
Ala Lys Asp Val Arg Ser Leu Ser Ser Arg Ala Val Asn His Ile
2525 2530 2535
Arg Ser Val Trp Lys Asp Leu Leu Glu Asp Thr Asp Thr Pro Ile
2540 2545 2550
Gln Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro
2555 2560 2565
Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp
2570 2575 2580
Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
2585 2590 2595
Ser Thr Leu Pro Gln Ala Val Met Gly Ser Ser Tyr Gly Phe Gln
2600 2605 2610
Tyr Ser Pro Lys Gln Arg Val Glu Phe Leu Val Asn Thr Trp Lys
2615 2620 2625
Ala Lys Lys Cys Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe
2630 2635 2640
Asp Ser Thr Val Thr Glu Asn Asp Ile Arg Val Glu Glu Ser Ile
2645 2650 2655
Tyr Gln Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Arg
2660 2665 2670
Ser Leu Thr Glu Arg Leu Tyr Ile Gly Gly Pro Met Thr Asn Ser
2675 2680 2685
Lys Gly Gln Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val
2690 2695 2700
Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala
2705 2710 2715
Ala Ala Ala Cys Arg Ala Ala Lys Leu Gln Asp Cys Thr Met Leu
2720 2725 2730
Val Cys Gly Asp Asp Leu Val Val Ile Cys Asp Ser Ala Gly Thr
2735 2740 2745
Gln Glu Asp Ala Ala Ser Leu Arg Val Phe Thr Glu Ala Met Thr
2750 2755 2760
Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp
2765 2770 2775
Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His
2780 2785 2790
Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr
2795 2800 2805
Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro
2810 2815 2820
Val Asn Ser Trp Leu Gly Asn Ile Ile Met Tyr Ala Pro Thr Leu
2825 2830 2835
Trp Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Ile Leu Leu
2840 2845 2850
Ala Gln Glu Gln Leu Glu Lys Ala Leu Asp Cys Gln Ile Tyr Gly
2855 2860 2865
Ala Thr Tyr Ser Ile Glu Pro Leu Asp Leu Pro Gln Ile Ile Gln
2870 2875 2880
Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro
2885 2890 2895
Gly Glu Ile Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val
2900 2905 2910
Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala
2915 2920 2925
Lys Leu Leu Ser Gln Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr
2930 2935 2940
Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Ile
2945 2950 2955
Pro Glu Ala Ser Gln Leu Asp Leu Ser Gly Trp Phe Val Ala Gly
2960 2965 2970
Tyr Ser Gly Gly Asp Ile Tyr His Ser Leu Ser Arg Ala Arg Pro
2975 2980 2985
Arg Trp Phe Met Trp Cys Leu Leu Leu Leu Ser Val Gly Val Gly
2990 2995 3000
Ile Tyr Leu Leu Pro Asn Arg
3005 3010
<210>186
<211>3010
<212>PRT
<213>Hepatitis C virus
<400>186
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg His Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr
180 185 190
Glu Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser
195 200 205
Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Ala Pro
210 215 220
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val
225 230 235 240
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr
245 250 255
Thr Leu Arg Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys
260 265 270
Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Ile Ser
275 280 285
Gln Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300
Asn Cys Ser Ile Tyr Pro Gly His Val Ser Gly His Arg Met Ala Trp
305 310 315 320
Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ser Gln
325 330 335
Leu Leu Arg Ile Pro Gln Ala Val Met Asp Met Val Ala Gly Ala His
340 345 350
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365
Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380
Thr Arg Val Thr Gly Gly Val Gln Gly His Val Thr Ser Thr Leu Thr
385 390 395 400
Ser Leu Phe Arg Pro Gly Ala Ser Gln Lys Ile Gln Leu Val Asn Thr
405 410 415
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430
Leu Lys Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Lys Phe Asn
435 440 445
Ala Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460
Phe Asp Gln Gly Trp Gly Pro Ile Thr Tyr Ala Gln Pro Asp Asn Ser
465 470 475 480
Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Gln Cys Gly Ile
485 490 495
Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn
515 520 525
Trp Gly Asp Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro
530 535 540
Pro His Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe
545 550 555 560
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Arg Gly Val Gly Asn
565 570 575
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala
580 585 590
Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu
595 600 605
Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620
Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu
625 630 635 640
Asp Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655
Arg Asp Arg Ala Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670
Gln Ile Leu Pro Cys Ser Tyr Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685
Leu Ile His Leu His Gln Asn Ile Val Asp Ile Gln Tyr Leu Tyr Gly
690 695 700
Ile Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Val
705 710 715 720
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp
725 730 735
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750
Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly Ile Leu Pro Phe
755 760 765
Phe Met Phe Phe Cys Ala Ala Trp Tyr Met Lys Gly Arg Leu Val Pro
770 775 780
Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800
Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala
805 810 815
Ser Cys Gly Gly Gly Val Phe Val Gly Leu Ala Leu Leu Thr Leu Ser
820 825 830
Pro Tyr Cys Lys Val Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr
835 840 845
Phe Ile Thr Lys Ala Glu Ala His Leu Gln Val Trp Val Pro Pro Leu
850 855 860
Asn Val Arg Ala Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Ala Val
865 870 875 880
His Pro Glu Leu Ile Phe Asp Ile Thr Lys Leu Leu Leu Ser Ile Leu
885 890 895
Gly Pro Leu Met Val Leu Gln Ala Ser Leu Ile Arg Val Pro Tyr Phe
900 905 910
Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Thr Leu Val Arg Lys Ala
915 920 925
Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys Leu Ala Ala Leu
930 935 940
Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Gln Asp Trp Ala
945 950 955 960
His Val Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe
965 970 975
Ser Ala Met Glu Thr Lys Val Ile Thr Trp Gly Ala Asp Thr Ala Ala
980 985 990
Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly Lys
995 1000 1005
Glu Ile Leu Leu Gly Pro Ala Asp Ser Phe Glu Gly Gln Gly Trp
1010 1015 1020
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly
1025 1030 1035
Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn
1040 1045 1050
Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser
1055 1060 1065
Phe Leu Ala Thr Cys Val Asn Gly Ala Cys Trp Thr Val Phe His
1070 1075 1080
GIy Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr
1085 1090 1095
Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala
1100 1105 1110
Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser
1115 1120 1125
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg
1130 1135 1140
Arg Arg Gly Asp Thr Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile
1145 1150 1155
Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser
1160 1165 1170
Gly His Val Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly
1175 1180 1185
Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Met Glu Thr
1190 1195 1200
Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Thr Pro Pro Ala
1205 1210 1215
Val Pro Gln Thr Phe Gln Val Ala His Leu His Ala Pro Thr Gly
1220 1225 1230
Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gln Gly
1235 1240 1245
Tyr Met Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly
1250 1255 1260
Phe Gly Ala Tyr Met Ser Lys Ala His Gly Ile Asp Pro Asn Ile
1265 1270 1275
Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ala Pro Ile Thr Tyr
1280 1285 1290
Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly
1295 1300 1305
Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ser
1310 1315 1320
Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr
1325 1330 1335
Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly
1340 1345 1350
Ser Val Thr Phe Pro His Pro Asn Ile Glu Glu Val Ala Leu Gly
1355 1360 1365
Asn Thr Gly Gln Ile Pro Phe Tyr Gly Lys Ala Ile Pro Ile Glu
1370 1375 1380
Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys
1385 1390 1395
Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Pro Leu Gly Leu Asn
1400 1405 1410
Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Ala
1415 1420 1425
Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly
1430 1435 1440
Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val
1445 1450 1455
Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu
1460 1465 1470
Thr Thr Thr Val Pro Gln Asp Ala Val Ser Arg Thr Gln Arg Arg
1475 1480 1485
Gly Arg Thr Gly Arg Gly Arg Arg Gly Ile Tyr Arg Phe Val Thr
1490 1495 1500
Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys
1505 1510 1515
Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala
1520 1525 1530
Glu Thr Ser Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu
1535 1540 1545
Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr
1550 1555 1560
Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln
1565 1570 1575
Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val
1580 1585 1590
Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Gln Met Trp
1595 1600 1605
Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro
1610 1615 1620
Leu Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Val Thr Leu Thr
1625 1630 1635
His Pro Ile Thr Lys Phe Ile Met Ala Cys Met Ser Ala Asp Leu
1640 1645 1650
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala
1655 1660 1665
Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val Ile Val
1670 1675 1680
Gly Arg Ile Ile Leu Ser Gly Arg Pro Ala Val Ile Pro Asp Arg
1685 1690 1695
Glu Val Leu Tyr Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser
1700 1705 1710
His Leu Pro Tyr Ile Glu Gln Gly Met Gln Leu Ala Glu Gln Phe
1715 1720 1725
Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Thr Lys Gln Ala
1730 1735 1740
Glu Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Ala Leu Glu
1745 1750 1755
Thr Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln
1760 1765 1770
Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala
1775 1780 1785
Ser Leu Met Ala Phe Thr Ala Ser Ile Thr Ser Pro Leu Ala Thr
1790 1795 1800
Gln Tyr Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala
1805 1810 1815
Gln Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe Val Gly Ala Gly
1820 1825 1830
Ile Ala Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu
1835 1840 1845
Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu
1850 1855 1860
Val Ala Phe Lys Val Met Ser Gly Asp Met Pro Ser Thr Glu Asp
1865 1870 1875
Leu Val Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val
1880 1885 1890
Val Gly Val Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro
1895 1900 1905
Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala
1910 1915 1920
Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser
1925 1930 1935
Asp Ala Ala Ala Arg Val Thr Gln Ile Leu Ser Asn Leu Thr Ile
1940 1945 1950
Thr Gln Leu Leu Lys Arg Leu His Gln Trp Ile Asn Glu Asp Cys
1955 1960 1965
Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp
1970 1975 1980
Ile Cys Thr Val Leu Ala Asp Phe Lys Thr Trp Leu Gln Ser Lys
1985 1990 1995
Leu Leu Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gln Arg
2000 2005 2010
Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile Met Tyr Thr Thr
2015 2020 2025
Cys Pro Cys Gly Ala Gln Ile Thr Gly His Val Lys Asn Gly Ser
2030 2035 2040
Met Arg Ile Val Gly Pro Arg Thr Cys Ser Asn Thr Trp His Gly
2045 2050 2055
Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Ser
2060 2065 2070
Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val Ala Ala Glu
2075 2080 2085
Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr Val Thr
2090 2095 2100
Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gln Val Pro Ala
2105 2110 2115
Pro Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Tyr
2120 2125 2130
Ala Pro Ala Cys Lys Pro Leu Leu Arg Asp Glu Val Thr Phe Gln
2135 2140 2145
Val Gly Leu Asn Gln Tyr Thr Val Gly Ser Gln Leu Pro Cys Glu
2150 2155 2160
Pro Glu Pro Asp Val Thr Val Val Thr Ser Met Leu Thr Asp Pro
2165 2170 2175
Ser His Ile Thr Ala Glu Ala Ala Arg Arg Arg Leu Ala Arg Gly
2180 2185 2190
Ser Pro Pro Ser Leu Ala Gly Ser Ser Ala Ser Gln Leu Ser Ala
2195 2200 2205
Pro Ser Leu Lys Ala Thr Cys Thr Thr His His Gly Ala Pro Asp
2210 2215 2220
Thr Asp Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu Met Gly
2225 2230 2235
Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Ile Val Ile Leu
2240 2245 2250
Asp Ser Phe Glu Pro Leu Arg Ala Glu Glu Asp Glu Arg Glu Val
2255 2260 2265
Ser Ala Ala Ala Glu Ile Leu Arg Lys Thr Arg Lys Phe Pro Ala
2270 2275 2280
Ala Met Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu
2285 2290 2295
Glu Ser Trp Lys Asn Pro Asp Tyr Val Pro Pro Val Val His Gly
2300 2305 2310
Cys Pro Leu Pro Pro Thr Lys Ala Pro Pro Ile Pro Pro Pro Arg
2315 2320 2325
Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala
2330 2335 2340
Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Gly Ser Gly Ser Ser
2345 2350 2355
Ala Val Asp Ser Gly Thr Ala Thr Gly Pro Pro Asp Gln Ala Ser
2360 2365 2370
Ala Glu Gly Asp Ala Gly Ser Asp Ala Glu Ser Tyr Ser Ser Met
2375 2380 2385
Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly
2390 2395 2400
Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys
2405 2410 2415
Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys
2420 2425 2430
Ala Ala Glu Glu Ser Lys Leu Pro Ile Asn Ala Leu Ser Asn Pro
2435 2440 2445
Leu Leu Arg His His Asn Met Val Tyr Ser Thr Thr Ser Arg Ser
2450 2455 2460
Ala Ser Leu Arg Gln Lys Lys Val Thr Phe Asp Arg Met Gln Val
2465 2470 2475
Leu Asp Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys
2480 2485 2490
Ala Ser Thr Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys
2495 2500 2505
Lys Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly
2510 2515 2520
Ala Lys Asp Val Arg Ser Leu Ser Ser Arg Ala Val Asn His Ile
2525 2530 2535
Arg Ser Val Trp Lys Asp Leu Leu Glu Asp Thr Asp Thr Pro Ile
2540 2545 2550
Gln Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro
2555 2560 2565
Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp
2570 2575 2580
Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
2585 2590 2595
Ser Thr Leu Pro Gln Ala Val Met Gly Ser Ser Tyr Gly Phe Gln
2600 2605 2610
Tyr Ser Pro Lys Gln Arg Val Glu Phe Leu Val Asn Thr Trp Lys
2615 2620 2625
Ala Lys Lys Cys Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe
2630 2635 2640
Asp Ser Thr Val Thr Glu Asn Asp Ile Arg Val Glu Glu Ser Ile
2645 2650 2655
Tyr Gln Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Arg
2660 2665 2670
Ser Leu Thr Glu Arg Leu Tyr Ile Gly Gly Pro Met Thr Asn Ser
2675 2680 2685
Lys Gly Gln Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val
2690 2695 2700
Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala
2705 2710 2715
Ala Ala Ala Cys Arg Ala Ala Lys Leu Gln Asp Cys Thr Met Leu
2720 2725 2730
Val Cys Gly Asp Asp Leu Val Val Ile Cys Asp Ser Ala Gly Thr
2735 2740 2745
Gln Glu Asp Ala Ala Ser Leu Arg Val Phe Thr Glu Ala Met Thr
2750 2755 2760
Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp
2765 2770 2775
Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His
2780 2785 2790
Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr
2795 2800 2805
Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro
2810 2815 2820
Val Asn Ser Trp Leu Gly Asn Ile Ile Met Tyr Ala Pro Thr Leu
2825 2830 2835
Trp Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Ile Leu Leu
2840 2845 2850
Ala Gln Glu Gln Leu Glu Lys Ala Leu Asp Cys Gln Ile Tyr Gly
2855 2860 2865
Ala Thr Tyr Ser Ile Glu Pro Leu Asp Leu Pro Gln Ile Ile Gln
2870 2875 2880
Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro
2885 2890 2895
Gly Glu Ile Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val
2900 2905 2910
Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala
2915 2920 2925
Lys Leu Leu Ser Gln Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr
2930 2935 2940
Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Ile
2945 2950 2955
Pro Glu Ala Ser Gln Leu Asp Leu Ser Gly Trp Phe Val Ala Gly
2960 2965 2970
Tyr Ser Gly Gly Asp Ile Tyr His Ser Leu Ser Arg Ala Arg Pro
2975 2980 2985
Arg Trp Phe Met Trp Cys Leu Leu Leu Leu Ser Val Gly Val Gly
2990 2995 3000
Ile Tyr Leu Leu Pro Asn Arg
3005 3010
Claims (8)
1.与HLA-A型分子结合的HLA-结合肽,所述HLA-结合肽包含
至少一种选自SEQ ID NOS:1-183的氨基酸序列,以及
不少于8个且不多于11个氨基酸残基。
2.权利要求1所述的HLA-结合肽,包含至少一种选自SEQ IDNOS:1,2,3,5,8,12,13,14,16,17,18,19,22,23,25,27,34,37,38,40,42,45,48,49,52,53,54,55,56,58,59,60,61,62,63,64,65,67,71,72,74,75,76,84,86,87,90,91,92,93,94,96,97,98,100,101,102,104,106,107,108,109,110,112,123,124,126,127,131,132,133,134,135,136,137,139,141,142,146,147,149,150,152,162,170,173,176,177,和179的氨基酸序列。
3.与HLA-A型分子结合的HLA-结合肽,所述HLA-结合肽包含
在权利要求1或2所述的HLA-结合肽中所含的所述氨基酸序列的基础上,通过缺失,替换,或添加一个或两个氨基酸残基而形成的氨基酸序列,以及
不少于8个且不多于11个氨基酸残基。
4.权利要求1-3任一项所述的HLA-结合肽,其中所述HLA-结合肽与人HLA-A24型分子结合。
5.权利要求1-3任一项所述的HLA-结合肽,其中所述HLA-结合肽与人HLA-A2型分子结合。
6.含有DNA序列的DNA区段,所述DNA序列编码权利要求1-5任一项所述的HLA-结合肽。
7.含有DNA序列的重组载体,所述DNA序列编码权利要求1-5任一项所述的HLA-结合肽。
8.HLA-结合肽前体,其在哺乳动物体内变成权利要求1-5任一项所述的HLA-结合肽。
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP135652/2004 | 2004-04-30 | ||
JP2004135652 | 2004-04-30 | ||
JP2004272314 | 2004-09-17 | ||
JP272314/2004 | 2004-09-17 | ||
JP050164/2005 | 2005-02-25 | ||
JP2005050164 | 2005-02-25 | ||
PCT/JP2005/007231 WO2005105993A1 (ja) | 2004-04-30 | 2005-04-14 | Hla結合性ペプチド、その前駆体、それをコードするdna断片および組み換えベクター |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1954068A true CN1954068A (zh) | 2007-04-25 |
CN1954068B CN1954068B (zh) | 2011-03-02 |
Family
ID=35241673
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200580013767.7A Expired - Fee Related CN1954068B (zh) | 2004-04-30 | 2005-04-14 | Hla-结合肽,其前体,编码这些肽序列的dna片段和重组载体 |
Country Status (5)
Country | Link |
---|---|
US (3) | US20080249283A1 (zh) |
EP (2) | EP2559763B1 (zh) |
JP (2) | JP4761311B2 (zh) |
CN (1) | CN1954068B (zh) |
WO (1) | WO2005105993A1 (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007094137A1 (ja) * | 2006-02-17 | 2007-08-23 | Nec Corporation | 細胞傷害性t細胞の誘導方法、細胞傷害性t細胞の誘導剤、およびそれを用いた医薬組成物およびワクチン |
KR101323540B1 (ko) * | 2007-02-07 | 2013-10-29 | 사이단호진한다이비세이부쯔뵤우겐큐우카이 | 암의 치료제 |
JP5279063B2 (ja) * | 2007-05-25 | 2013-09-04 | 国立大学法人愛媛大学 | Hla−a2拘束性抗原ペプチドおよびその用途 |
JP6218175B2 (ja) * | 2011-12-14 | 2017-10-25 | 国立大学法人高知大学 | ヘルパーt細胞誘導性ポリペプチドの改変 |
US10995116B2 (en) | 2014-05-09 | 2021-05-04 | University Of Southampton | Peptide-induced NK cell activation |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6709828B1 (en) * | 1992-03-06 | 2004-03-23 | N.V. Innogenetics S.A. | Process for the determination of peptides corresponding to immunologically important epitopes and their use in a process for determination of antibodies or biotinylated peptides corresponding to immunologically important epitopes, a process for preparing them and compositions containing them |
US9340577B2 (en) * | 1992-08-07 | 2016-05-17 | Epimmune Inc. | HLA binding motifs and peptides and their uses |
CA2157510A1 (en) * | 1993-03-05 | 1994-09-15 | Howard M. Grey | Hla-a2.1 binding peptides and their uses |
JPH08151396A (ja) * | 1994-11-28 | 1996-06-11 | Teijin Ltd | Hla結合性オリゴペプチド及びそれを含有する免疫調節剤 |
KR100190910B1 (ko) * | 1995-12-29 | 1999-06-01 | 허영섭 | C형 간염 바이러스에 대한 인체 면역조절기능을 가진펩타이드 |
CA2323632A1 (en) * | 1998-03-13 | 1999-09-16 | Epimmune Inc. | Hla-binding peptides and their uses |
JPH11316754A (ja) * | 1998-05-06 | 1999-11-16 | Nec Corp | 実験計画法及び実験計画プログラムを記録した記録媒体 |
CA2377525A1 (en) * | 1999-07-19 | 2001-03-29 | Epimmune, Inc. | Inducing cellular immune responses to hepatitis c virus using peptide and nucleic acid compositions |
CA2425648A1 (en) * | 2000-10-19 | 2002-04-19 | Epimmune Inc. | Hla class i and ii binding peptides and their uses |
JP4667578B2 (ja) * | 2000-10-24 | 2011-04-13 | 和貴 黒河内 | C型肝炎ウイルスの新規なctlエピトープ |
WO2002089731A2 (en) * | 2001-05-03 | 2002-11-14 | Stanford University | Agents for treatment of hcv and methods of use |
FR2839722A1 (fr) * | 2002-05-17 | 2003-11-21 | Bio Merieux | Nouvelles compositions peptidiques et leur utilisation notamment dans la preparation de compositions pharmaceutiques actives contre le virus de l'hepatite c |
CN1249240C (zh) * | 2002-06-25 | 2006-04-05 | 中国人民解放军军事医学科学院基础医学研究所 | 表达载体pBVTB及其构建方法和在HCV疫苗研究中的应用 |
FR2855758B1 (fr) * | 2003-06-05 | 2005-07-22 | Biomerieux Sa | Composition comprenant la polyproteine ns3/ns4 et le polypeptide ns5b du vhc, vecteurs d'expression incluant les sequences nucleiques correspondantes et leur utilisation en therapeutique |
EA200701870A1 (ru) * | 2003-09-22 | 2008-02-28 | Грин Пептайд Ко., Лтд. | Пептид, происходящий из вируса гепатита с |
WO2005042698A2 (en) * | 2003-10-23 | 2005-05-12 | Pecos Labs, Inc. | T cell epitopes useful in a hepatitis c virus vaccine and as diagnostic tools and methods for identifying same |
JP2008509654A (ja) * | 2004-06-01 | 2008-04-03 | イノジェネティックス・ナムローゼ・フェンノートシャップ | C型肝炎ウイルスに対するctlおよび/またはhtl応答を誘導するためのペプチド |
-
2005
- 2005-04-14 JP JP2006519491A patent/JP4761311B2/ja not_active Expired - Fee Related
- 2005-04-14 EP EP12176945.9A patent/EP2559763B1/en not_active Not-in-force
- 2005-04-14 US US11/587,973 patent/US20080249283A1/en not_active Abandoned
- 2005-04-14 WO PCT/JP2005/007231 patent/WO2005105993A1/ja active Application Filing
- 2005-04-14 CN CN200580013767.7A patent/CN1954068B/zh not_active Expired - Fee Related
- 2005-04-14 EP EP05730475A patent/EP1757687A4/en not_active Withdrawn
-
2010
- 2010-10-12 US US12/903,000 patent/US20110060124A1/en not_active Abandoned
-
2011
- 2011-01-24 JP JP2011012175A patent/JP2011120598A/ja active Pending
-
2014
- 2014-12-08 US US14/563,502 patent/US20150087809A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
JP4761311B2 (ja) | 2011-08-31 |
CN1954068B (zh) | 2011-03-02 |
WO2005105993A1 (ja) | 2005-11-10 |
US20110060124A1 (en) | 2011-03-10 |
JPWO2005105993A1 (ja) | 2008-03-13 |
EP2559763B1 (en) | 2017-02-15 |
EP1757687A4 (en) | 2008-01-16 |
US20150087809A1 (en) | 2015-03-26 |
EP2559763A1 (en) | 2013-02-20 |
US20080249283A1 (en) | 2008-10-09 |
JP2011120598A (ja) | 2011-06-23 |
EP1757687A1 (en) | 2007-02-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230141371A1 (en) | Coronavirus vaccines and methods of use | |
Trent et al. | Partial nucleotide sequence of St. Louis encephalitis virus RNA: structural proteins, NS1, ns2a, and ns2b | |
CN1954068B (zh) | Hla-结合肽,其前体,编码这些肽序列的dna片段和重组载体 | |
US9353151B2 (en) | HLA-binding peptide, and DNA fragment and recombinant vector coding for said HLA-binding peptide | |
US9045531B2 (en) | HLA-binding peptide, precursor thereof, and DNA fragment and recombinant vector coding for said HLA-binding peptide | |
EP2636679B1 (en) | HLA-binding peptide, and DNA fragment and recombinant vector coding for said HLA-binding peptide | |
US8324345B2 (en) | HLA-binding peptide, precursor thereof, DNA fragment and recombinant vector encoding the same | |
Ruiz-Gayo et al. | Unequivocal synthesis and characterization of a parallel and an antiparallel bis-cystine peptide | |
CN113150111A (zh) | 一种HLA-A*0201限制性CMVpp65特异性T细胞受体及其应用 | |
Ferchichi et al. | Chemical synthesis, molecular modeling, and antimicrobial activity of a novel bacteriocin, MMFII | |
US20080306243A1 (en) | Hla-Binding Peptide, and Dna Fragment and Recombinant Vector Coding for Said Hla-Binding Peptide | |
JP2004509071A (ja) | 収束コンビナトリーペプチドライブラリおよびc型肝炎ウイルスに対してのワクチン化のためのその使用 | |
Platan et al. | T-immunogenic peptides are constituted of rare sequence patterns. Use in the identification of T epitopes in the human immunodeficiency virus gag protein | |
Feller et al. | Dipeptide backbone conformation and antibody recognition of a viral octapeptide epitope |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20110302 Termination date: 20200414 |
|
CF01 | Termination of patent right due to non-payment of annual fee |