奈瑟球菌蛋白质的杂交表达
本案是2001.02.28提交的申请号为01808738.8,名为奈瑟球菌蛋白质的杂交表达的分案申请。本文将所引用的全部文献都纳入作为参考。
技术领域
本发明涉及蛋白质表达的领域。具体说,本发明涉及奈瑟球菌(如淋病奈瑟球菌或较佳地为脑膜炎奈瑟球菌)的蛋白质的异源表达。
背景技术
国际专利申请WO 99/24578、WO 99/36544、WO 99/57280和WO 00/22430公开了脑膜炎奈瑟球菌(Neisseria meningitidis)和淋病奈瑟球菌(Neisseriagonorrhoeae)的蛋白质。这些蛋白质一般是以N-未端GST-融合体或C-未端His-标记融合体在大肠杆菌中表达的(即异源表达),虽然也公开了其它表达系统(包括在天然的奈瑟球菌中的表达)。
本发明的目的是提供这些蛋白质的异源表达的其它或改进方法。这些方法通常影响表达的水平、纯化的简易程度、表达的细胞内定位和/或表达的蛋白质的免疫学特性。
发明的公开
在本发明中,以单杂交蛋白表达本发明的两种或多种(如3、4、5、6或更多)蛋白质。较佳地,不使用非奈瑟球菌的融合配体(如GST或聚-His)。
这有两个优点。其一,可能不稳定或本身表达很差的蛋白质可以通过加入合适的克服该问题的杂交配体予以帮助。其二,简化工业生产,即制备两种分别有用的蛋白质只需要一次表达和纯化。
因此,本发明提供了同时异源表达本发明的两种或多种蛋白质的方法,其中所述的本发明的两种或多种蛋白质是融合的(即,它们是作为单多肽链翻译的)。
该方法通常包括如下步骤:获得编码本发明第一种蛋白质的第一核酸;获得编码本发明第二种蛋白质的第二核酸;连接第一和第二核酸。将得到的核酸插入表达载体中,或已作为表达载体的一部分。
当仅连接两种蛋白质时,可以式NH2-A-B-COOH简单地表示杂交蛋白。A和B各选自任何奈瑟球菌蛋白质,尤其是那些以SEQ#1-4326表示的。该方法非常适合表达蛋白质orf1、orf4、orf25、orf40、Orf46/46.1、orf83、233、287、292L、564、687、741、907、919、953、961和983。
由式NH2-A-B-COOH在下表中以‘X′表示的42个杂交体是优选的:
↓A B→ |
ORF46.1 |
287 |
741 |
919 |
953 |
961 |
983 |
ORF46.1 |
|
X |
X |
X |
X |
X |
X |
287 |
X |
|
X |
X |
X |
X |
X |
741 |
X |
X |
|
X |
X |
X |
X |
919 |
X |
X |
X |
|
X |
X |
X |
953 |
X |
X |
X |
X |
|
X |
X |
961 |
X |
X |
X |
X |
X |
|
X |
983 |
X |
X |
X |
X |
X |
X |
|
因此,优选的以杂交体表达的蛋白质是ORF46.1,287,741,919,953,961和983。它们可以全长形式或多-甘氨酸缺失(ΔG)形式使用(如ΔG-287、ΔGTbp2、ΔG741、ΔG983等)、或以截短形式使用(如Δ1-287、Δ2-287等),或以结构域缺失的形式使用(如287B、287C、287BC、ORF461-433、ORF46433-608、ORF46、961c等)等。
特别优选的是:(a)含919和287的杂交蛋白;(b)含953和287的杂交蛋白;(c)含287和ORF46.1的杂交蛋白;(d)含ORF1和ORF46.1的杂交蛋白;(e)含91 9和ORF46.1的杂交蛋白;(f)含ORF46.1和919的杂交蛋白;(g)含ORF46.1、287和919的杂交蛋白;(h)含919和519的杂交蛋白;和(i)含ORF97和225的杂交蛋白。
附图中显示了其它优选例,它们包括ΔG287-919、ΔG287-953、ΔG287-961、ΔG983-ORF46.1、ΔG983-741、ΔG983-961、ΔG983-961C、ΔG741-961、ΔG741-961C、ΔG741-983、ΔG741-ORF46.1、ORF46.1-741、ORF46.1-961、ORF46.1-961C、961-ORF46.1、961-741、961-983、961C-ORF46.1、961C-741、961C-983、961CL-ORF46.1、961CL-741和961CL-983。
当使用287时,其优先位于杂交体的C-未端;如果在N-未端使用它,则优先使用287的ΔG形式(如与ORF46.1、919、953或961杂交的杂交体的N-未端)。
当使用287时,其优先为菌株2996或菌株394/98的。
当使用961时,其优先在N-未端。可使用961的结构域形式。
WO 99/66741公开了ORF46、287、919和953的多态形式的排序。本发明可以使用这些多态形式中的任一形式。
较佳地,本发明杂交蛋白中的组成蛋白质(A和B)来源于是同一菌株。
杂交体中的融合蛋白可以是直接相连的,或者是通过接头肽相连的,如通过聚-甘氨酸接头(即,Gn,其中n=3、4、5、6、7、8、9、10或更多)或通过协助克隆的短肽序列相连。显然不宜将ΔG蛋白质连接于聚-甘氨酸接头的C-未端。
融合蛋白可以缺失天然前导肽或可以包含N-未端融合配体的前导肽序列。
宿主
较佳地,利用异源宿主。异源宿主可以是原核的或真核的。优先为大肠杆菌,但其它合适的宿主包括枯草芽孢杆菌(Bacillus subtilis)、霍乱弧菌(Vibriocholerae)、伤寒沙门氏菌(Salmonella typhi)、鼠伤寒沙门氏菌(salmonellatyphimurium)、脑膜炎奈瑟球菌、淋病奈瑟球菌、乳糖奈瑟球菌(Neisserialactamica)、灰色奈瑟球菌(Neisseria cinerea)、分枝杆菌(如结核分枝杆菌(M.tuberculosis)、酵母等。
载体、宿主等
如上所述的方法,本发明提供了(a)用于这些方法的核酸和载体;(b)含所述载体的宿主细胞;(c)可用这些方法表达的或可表达的蛋白质;(d)包含这些蛋白质的组合物,其可能适合作为疫苗、或例如诊断剂或免疫原性的组合物;(e)用作药物(如疫苗)或诊断剂的组合物;(f)这些组合物在制备以下物质中的用途(1)用于治疗或预防奈瑟球菌引起的感染的药物(2)检测奈瑟球菌或由奈瑟球菌引起的抗体存在与否的诊断剂,和/或(3)能产生抗奈瑟球菌抗体的药物;和(g)治疗患者的方法,其包括对该患者施用治疗有效量的这些组合物。
序列
本发明还提供了具有以下实施例中所列出的任何的蛋白质或核酸。本发明还提供了具有与这些序列是序列相同性的蛋白质和核酸。如上所述,“序列相同性”的程度最好大于50%(如60%、70%、80%、90%、95%、99%或更大)。
本文的命名
本文参考在WO 99/24578、WO 99/36544和WO 99/57280中公开的2166个蛋白质序列,并将它们编号为如下的SEQ#:
申请 |
蛋白质序列 |
本文的SEQ# |
WO 99/24578 |
偶SEQ ID 2-892 |
SEQ#1-446 |
WO 99/36544 |
偶SEQ ID 2-90 |
SEQ#447-491 |
WO 99/57280 |
偶SEQ ID 2-3020偶SEQ ID 3040-3114SEQ ID 3115-3241 |
SEQ#492-2001SEQ#2002-2039SEQ#2040-2166 |
除了这种SEQ#编号外,本文还使用了WO 99/24578、WO 99/36544和WO99/27280中的命名约定(如WO99/24578和WO 99/36544中用的‘ORF4′、‘ORF40′、‘OFR40-1′等;WO 99/57280中用的‘m919′、‘g919′和‘a919′等)。
在本文中将Tettelin等[Science(2000)287:1809-1815]中的从NMB0001到NMB2160的2160个蛋白质称为SEQ#2167-4326[参见WO 00/66791]。
本文采用的术语“本发明的蛋白质”指包含以下的蛋白质:
(a)SEQ#1-4326中的一个序列;或
(b)与SEQ#1-4326中的一个序列相同的序列;或
(c)SEQ#1-4326中的一个序列的片段。
(b)中的“序列相同性”的程度最好大于50%(如60%、70%、80%、90%、95%、99%或更大)。其包括突变体和等位基因变体[如,参见WO 00/66741]。相同性最好用Smith-Waterman同源性搜寻算法确定,如在MPSRCH程序(OxfordMolecular)中执行的,采用参数“缺口罚分(gap open penalty)”为12,“缺口延伸罚分(gap extension penalty)”为1进行缺口仿射搜索。通常,将两种蛋白质之间50%或更高的相同性视为功能等效的指示。
(c)中的“片段”应包含SEQ#1-4326中一个序列的至少n个连续的氨基酸,且根据具体的序列n为7或更高(如8、10、12、14、16、18、20、25、30、35、40、50、60、70、80、90、100或更高)。较佳地,片段包含SEQ#1-4326中一个序列的表位。优选的片段是在WO 00/71574和WO 01/04316中公开的那些。
本发明优选的蛋白质是在脑膜炎奈瑟球菌血清群B中发现的。
根据本发明使用的优选蛋白质是血清群B脑膜炎奈瑟球菌菌株2996或菌株394/98(新西兰菌株)。除非特别指出,本文所述的蛋白质是脑膜炎奈瑟球菌菌株2996的蛋白质。但是,应该理解通常本发明并不受菌株的限制。参考具体的蛋白质(如‘287′、‘919′等)可以包括任何菌株的该蛋白质。
应该理解术语“核酸”包括DNA和RNA,以及它们的类似物,如含有修饰骨架的那些类似物,还包括肽核酸(PNA)等。
附图简述
图1-26显示本发明的杂交蛋白。
本发明的进行模式
实施例1-ORF46的杂交体
脑膜炎奈瑟球菌(血清群B,菌株2996)的完整ORF46蛋白质具有如下序列:
1 LGISRKISLI LSILAVCLPM HAHASDLAND SFIRQVLDRQ HFEPDGKYHL
51 FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY IVRFSDHGHE
101 VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD GYDGPQGGGY
151 PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH NAGSMLTQGV
201 GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE IVGAGDAVQG
251 ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA AIRDWAVQNP
301 NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK RSQMGAIALP
351 KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI TSSTVPPSNG
401 KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTKLDIQEL SGGGIPKAKP
451 VSDAKPRWEV DRKLNKLTTR EQVEKNVQEI RNGNKNSNFS QHAQLEREIN
501 KLKSADEINF ADGMGKFTDS MNDKAFSRLV KSVKENGFTN PVVEYVEING
551 KAYIVRGNNR VFAAEYLGRI HELKFKKVDF PVPNTSWKNP TDVLNESGNV
601 KRPRYRSK*
在前导肽下加下划线。
可以在WO 00/66741中发现其它菌株的ORF46的序列。
ORF46在其C-未端和N-未端与287、919和ORF1融合。该杂交蛋白通常是不溶解的,但产生一些良好的ELISA和杀菌结果(针对同源2996菌株):
蛋白质 |
ELISA |
杀菌Ab |
Orf1-Orf46.1-His |
850 |
256 |
919-Orf46.1-His |
12900 |
512 |
919-287-Orf46-His |
n.d. |
n.d. |
Orf46.1-287His |
150 |
8192 |
Orf46.1-919His |
2800 |
2048 |
Orf46.1-287-919His |
3200 |
16384 |
为了比较,构建了ORF46.1、287(以GST融合体或ΔG287的形式)和919的“三”杂交体,并针对各种菌株(包括同源2996菌株)将其与三种抗原的简单混合物相比。FCA用作佐剂:
|
2996 |
BZ232 |
MC58 |
NGH38 |
F6124 |
BZ133 |
混合物 |
8192 |
256 |
512 |
1024 |
>2048 |
>2048 |
ORF46.1-287-919his |
16384 |
256 |
4096 |
8192 |
8192 |
8192 |
ΔG287-919-ORF46.1his |
8192 |
64 |
4096 |
8192 |
8192 |
16384 |
ΔG287-ORF46.1-919his |
4096 |
128 |
256 |
8192 |
512 |
1024 |
同样,这些杂交体显示相当等的或更佳的免疫活性。
针对各种异源菌株,将两种蛋白质(菌株2996)的杂交体与单种蛋白质相比:
|
1000 |
MC58 |
F6124(MenA) |
ORF46.1-His |
<4 |
4096 |
<4 |
ORF1-His |
8 |
256 |
128 |
ORF1-ORF46.1-His |
1024 |
512 |
1024 |
再次,这些杂交体显示相等的或更佳的免疫活性。
实施例2-ΔG287的杂交体
发现287中(Gly)6序列的缺失对蛋白质的表达有显著影响。将缺失N-未端氨基酸多达GGGGGG的蛋白质称为‘ΔG287′。在菌株MC58中,它的基本序列(前导肽有下划线)为:
SPDVKS ADTLSKPAAP VVSEKETEAK EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT
GNGGAVTADN PKNEDEVAQN DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP
ANQPDMANAA DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS
NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS DADKISNYKK
DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS ARSRRSLPAE MPLIPVNQAD
TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY
NGEVLHFHTE NGRPYPTRGR FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG
TWTENGSGDV SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD*
与‘287-His′或‘287未标记的′相比,有或无His-标记的ΔG287(分别为‘ΔG287-His′和‘ΔG287K′)以很好的水平表达。
在基因变异性数据的基础上,从许多MenB菌株(尤其是从菌株2996、MC58、100和BZ232)的大肠杆菌中表达ΔG287-His的变体。结果也好-它们都具有很高的ELISA滴定度,且血清杀菌滴定度>8192。由pET-24b表达的ΔG287K在ELISA和血清杀菌实验中有极佳滴定度。
还将聚-Gly序列的缺失应用于Tbp2(NMB0460)、741(NMB1870)和983(NMB1969)。在不编码其前导肽的序列且没有聚-Gly(即,以“ΔG形式”),在pET载体中克隆并在大肠杆菌中表达时,观察到相同的作用-在携带聚-甘氨酸段缺失的克隆中表达很好,若在表达的蛋白中存在甘氨酸时则表达差或不表达。
将ΔG287直接融合于919、953、961(如下所示的序列)和ORF46.1的符合读框的上游:
ΔG287-919
ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG
GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA
GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT
GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC
GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGKG
CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG
TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATAAGAAAGATGGGAAAAGCGATAAATTT
ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA
TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT
CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGG
AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA
AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC
CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGAT
TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGG AAATGGCGGC
GGGGATGTTTCCGGAAGTTTTACGGCCCGGCCGGCGAGGAAGTGGGCGGGAAAATACAGCTATCGCCCGACAGATGCG
GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGATGCCAAAGCAAGAGC
ATCCAAACCTTTCCGCAACCCGACACATCCGTCATCAACGGCCCGGACCGGCCGGTCGGCATCCCCGACCCCGCCGGA
ACGACGGTCGGCGGCGGCGGGGCCGTCTATACCGTTGTACCGCACCTGTCCCTGCCCCACTGGGCGGCGCAGGATTTC
GCCAAAAGCCTGCAATCCTTCCGCCTCGGCTGCGCCAATTTGAAAAACCGCCAAGGCTGGCAGGATGTGTGCGCCCAA
GCCTTTCAAACCCCCGTCCATTCCTTTCAGGCAAAACAGTTTTTTGAACGCTATTTCACGCCGTGGCAGGTTGCAGGC
AACGGAAGCCTTGCCGGTACGGTTACCGGCTATTACGAGCCGGTGCTGAAGGGCGACGACAGGCGGACGGCACAAGCC
CGCTTCCCGATTTACGGTATTCCCGACGATTTTATCTCCGTCCCCCTGCCTGCCGGTTTGCGGAGCGGAAAAGCCCTT
GTCCGCATCAGGCAGACGGGAAAAAACAGCGGCACAATCGACAATACCGGCGGCACACATACCGCCGACCTCTCCCGA
TTCCCCATCACCGCGCGCACAACGGCAATCAAAGGCAGGTTTGAAGGAAGCCGCTTCCTCCCCTACCACACGCGCAAC
CAAATCAACGGCGGCGCGCTTGACGGCAAAGCCCCGATACTCGGTTACGCCGAAGACCCCGTCGAACTTTTTTTTATG
CACATCCAAGGCTCGGGCCGTCTGAAAACCCCGTCCGGCAAATACATCCGCATCGGCTATGCCGACAAAAACGAACAT
CCCTACGTTTCCATCGGACGCTATATGGCGGACAAAGGCTACCTCAAGCTCGGGCAGACCTCGATGCAGGGCATCAAA
GCCTATATGCGGCAAAATCCGCAACGCCTCGCCGAAGTTTTGGGTCAAAACCCCAGCTATATCTTTTTCCGCGAGCTT
GCCGGAAGCAGCAATGACGGTCCCGTCGGCGCACTGGGCACGCCGTTGATGGGGGAATATGCCGGCGCAGTCGACCGG
CACTACATTACCTTGGGCGCGCCCTTATTTGTCGCCACCGCCCATCCGGTTACCCGCAAAGCCCTCAACCGCCTGATT
ATGGCGCAGGATACCGGCAGCGCGATTAAAGGCGCGGTGCGCGTGGATTATTTTTGGGGATACGGCGACGAAGCCGGC
GAACTTGCCGGCAAACAGAAAACCACGGGTTACGTCTGGCAGCTCCTACCCAACGGTATGAAGCCCGAATACCGCCCG
TAACTCGAG
1 MASPDVKSAD TLSKPAAPVV AEKETEVKED APQAGSQGQG APSTQGSQDM
51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS
101 DSAPASNPAP ANGGSNFGEV DLANGVLIDG PSQNITLTHC KGDSCNGDNL
151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI
201 IYKKDSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG
251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE
301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI
351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG
401 KKEQDGSGGG GCQSKSIQTF PQPDTSVING PDRPVGIPDP AGTTVGGGGA
451 VYTVVPHLSL PHWAAQDFAK SLQSFRLGCA NLKNRQGWQD VCAQAFQTPV
501 HSFQAKQFFE RYFTPWQVAG NGSLAGTVTG YYEPVLKGDD RRTAQARFPI
551 YGIPDDFISV PLPAGLRSGK ALVRIRQTGK NSGTIDNTGG THTADLSRFP
601 ITARTTAIKG RFEGSRFLPY HTRNQINGGA LDGKAPILGY AEDPVELFFM
651 HIQGSGRLKT PSGKYIRIGY ADKNEHPTVS IGRYMADKGY LKLGQTSMQG
701 IKAYMRQNPQ RLAEVLGQNP SYIFFRELAG SSNDGPVGAL GTPLMGEYAG
751 AVDRHYITLG APLFVATAHP VTRKALNRLI MAQDTGSAIK GAVRVDYFWG
801 YGDEAGELAG KQKTTGYVWQ LLPNGMKPEY RP*
ΔG287-953
ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG
GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA
GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT
GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC
GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGGG
CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG
TCAAAATCAGAATTTGAAAATTTAATGAGTCTGAACGAATTGAGAAATATAAGAAAAGATGGGAAAAGCGATAAATTT
ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA
TCTTCATCTGCGCGATTCAGGCGTTGTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT
CAGGCGGATACGCTGATTGTCGATGGGGAAAGCGGTCAGCCTGACGGGGCTTCCGGCAATATCTTCGCGCCCGAAGGG
AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA
AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC
CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGAT
TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGC
GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGGAAAATACAGCTATCGCCCGACAGATGCG
GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGAGCCACCTACAAAGTG
GACGAATATCACGCCAACGCCCGTTTCGCCATCGACCATTTCAACACCAGCACCAACGTCGGCGGTTTTTACGGTCTG
ACCGGTTCCGTCGAGTTCGACCAAGCAAAACGCGACGGTAAAATCGACATCACCATCCCCGTTGCCAACCTGCAAAGC
GGTTCGCAACACTTTACCGACCACCTGAAATCAGCCGACATCTTCGATGCCGCCCAATATCCGGACATCCGCTTTGTT
TCCACCAAATTCAACTTCAACGGCAAAAAACTGGTTTCCGTTGACGGCAACCTGACCATGCACGGCAAAACCGCCCCC
GTCAAACTCAAAGCCGAAAAATTCAACTGCTACCAAAGCCCGATGGCGAAAACCGAAGTTTGCGGCGGCGACTTCAGC
ACCACCATCGACCGCACCAAATGGGGCGTGGACTACCTCGTTAACGTTGGTATGACCAAAAGCGTCCGCATCGACATC
CAAATCGAGGCAGCCAAACAATAACTCGAG
1 MASPDVKSAD TLSKPAAPVV AEKETEVKED APQAGSQGQG APSTQGSQDM
51 AAVSANETGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS
101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL
1S1 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI
201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG
251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE
301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI
351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG
401 KKEQDGSGGG GATYKVDEYH ANARFAIDHF NTSTNVGGFY GLTGSVEFDQ
451 AKRDGKIDIT IPVANLQSGS QHFTDHLKSA DIFDAAQYPD IRFVSTKFNF
501 NGKKLVSVDG NLTMHGKTAP VKLKAEKFNC YQSPMAKTEV CGGDFSTTID
551 RTKWGVDYLV NVGMTKSVRI DIQIEAAKQ*
ΔG287-961
ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG
GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA
GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT
GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC
GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGGG
CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG
TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATAAGAAAGATGGGAAAAGCGATAAATTT
ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA
TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT
CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGG
AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA
AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC
CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGAT
TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGC
GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGGAAAATACAGCTATCGCCCGACAGATGCG
GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGAGCCACAAACGACGAC
GATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGA
GAGACCATCTACGACATTGATGAAGACGGCAACAATTACCAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGAC
TTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCC
AAAGTAAAAGCTGCAGAATCTGAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTTAGCAGATACT
GATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAG
ACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGAT
ATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCC
GAAGAAACCAAAACAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGC
ACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAAC
AAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATT
GATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACT
CGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTC
TCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCA
GTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGT
TCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGTAACTCGAG
1 MASPDVKSAD TLSKPAAPVV AEKETEVKED APQAGSQGQG APSTQGSQDM
51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS
101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL
151 LDEEAPSKSE FNELNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI
201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG
251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGKM LAGTAVYNGE
301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI
351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG
401 KKEQDGSGGG GATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE
451 DGTITKKDAT AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE
501 SEIKELTTKL ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV
551 KIDEKLEAVA DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE
601 TKQNVDAKVK AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN
651 KDNIAKKANS ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH
701 DTELNGLDET VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK
751 SESAVAIGTG FRFTENFAAK AGVAVGTSSG SSAATHVGVN YEW*
|
ELISA |
杀菌 |
ΔG287-953-His |
3834 |
65536 |
ΔG287-961-His |
108627 |
65536 |
对919和ORF46.1而言,将针对杂交蛋白产生的抗体的杀菌效力(同源菌株)与针对组分抗原(用287-GST)的简单混合物产生的抗体相比:
|
与287的混合物 |
与ΔG287的杂交体 |
919 |
32000 |
128000 |
ORF46.1 |
128 |
16000 |
还获得了针对异源MenB菌株和针对血清型A和C的杀菌活性的数据:
|
919 |
ORF46.1 |
菌株 |
混合物 |
杂交体 |
混合物 |
杂交体 |
NGH38 |
1024 |
32000 |
- |
16384 |
MC58 |
512 |
8192 |
- |
512 |
B2232 |
512 |
512 |
- |
- |
MenA (F6124) |
512 |
32000 |
- |
8192 |
MenC(C11) |
>2048 |
>2048 |
- |
- |
MenC(BZ133) |
>4096 |
64000 |
- |
8192 |
因此,在N-未端与ΔG287的杂交蛋白在免疫学上优于与ΔG287-ORF46.1的简单混合物,即使针对异源菌株也特别有效。可以在pET-24b中表达ΔG287-ORF46.1。
用新西兰菌株394/98,而非2996,制备相同的杂交蛋白:
ΔG287NZ-919
ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG
GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT
GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT
ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA
ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC
AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGTGGTGATTTTGGAAGG
ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT
AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAATAAGT
AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG
GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG
CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC
CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCC
GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC
GAAGTGCTGCATTTTCATACGGAAAAGGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGC
AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT
GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG
GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAACAG
CAGGATGGATCCGGAGGAGGAGGATGCCAAAGCAAGAGCATCCAAACCTTTCCGCAACCCGACACATCCGTCATCAAC
GGCCCGGACCGGCCGGTCGGCATCCCCGACCCCGCCGGAACGACGGTCGGCGGCGGCGGGGCCGTCTATACCGTTGTA
CCGCACCTGTCCCTGCCCCACTGGGCGGCGCAGGATTTCGCCAAAAGCCTGCAATCCTTCCGCCTCGGCTGCGCCAAT
TTGAAAAACCGCCAAGGCTGGCAGGATGTGTGCGCCCAAGCCTTTCAAACCCCCGTCCATTCCTTTCAGGCAAAACAG
TTTTTTGAACGCTATTTCACGCCGTGGCAGGTTGCAGGCAACGGAAGCCTTGCCGGTACGGTTACCGGCTATTACGAG
CCGGTGCTGAAGGGCGACGACAGGCGGACGGCACAAGCCCGCTTCCCGATTTACGGTATTCCCGACGATTTTATCTCC
GTCCCCCTGCCTGCCGGTTTGCGGAGCGGAAAAGCCCTTGTCCGCATCAGGCAGACGGGAAAAAACAGCGGCACAATC
GACAATACCGGCGGCACACATACCGCCGACCTCTCCCGATTCCCCATCACCGCGCGCACAACGGCAATCAAAGGCAGG
TTTGAAGGAAGCCGCTTCCTCCCCTACCACACGCGCAACCAAATCAACGGCGGCGCGCTTGACGGCAAAGCCCCGATA
CTCGGTTACGCCGAAGACCCCGTCGAACTTTTTTTTATGCACATCCAAGGCTCGGGCCGTCTGAAAACCCCGTCCGGC
AAATACATCCGCATCGGCTATGCCGACAAAAACGAACATCCCTACGTTTCCATCGGACGCTATATGGCGGACAAAGGC
TACCTCAAGCTCGGGCAGACCTCGATGCAGGGCATCAAAGCCTATATGCGGCAAAATCCGCAACGCCTCGCCGAAGTT
TTGGGTCAAAACCCCAGCTATATCTTTTTCCGCGAGCTTGCCGGAAGCAGCAATGACGGTCCCGTCGGCGCACTGGGC
ACGCCGTTGATGGGGGAATATGCCGGCGCAGTCGACCGGCACTACATTACCTTGGGCGCGCCCTTATTTGTCGCCACC
GCCCATCCGGTTACCCGCAAAGCCCTCAACCGCCTGATTATGGCGCAGGATACCGGCAGCGCGATTAAAGGCGCGGTG
CGCGTGGATTATTTTTGGGGATACGGCGACGAAGCCGGCGAACTTGCCGGCAAACAGAAAACCACGGGTTACGTCTGG
CAGCTCCTACCCAACGGTATGAAGCCCGAATACCGCCCGTAAAAGCTT
1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM
51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM
101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG
151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL
201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV
251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ
301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP
351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS
401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY
451 RPTDAEKGGF GVFAGKKEQD GSGGGGCQSK SIQTFPQPDT SVINGPDRFV
501 GIPDPAGTTV GGGGAVYTVV PHLSLPHWAA QDFAKSLQSF RLGCANLKNR
551 QGWQDVCAQA FQTPVHSFQA KQFFERYFTP WQVAGNGSLA GTVTGYYEPV
601 LKGDDRRTAQ ARFPIYGIPD DFISVPLPAG LRSGKALVRI RQTGKNSGTI
651 DNTGGTHTAD LSRFPITART TAIKGRFEGS RFLPYHTRNQ INGGALDGKA
701 PILGYAEDPV ELFFMHIQGS GRLKTPSGKY IRIGYADKNE HPYVSIGRYM
751 ADKGYLKLGQ TSMQGIKAYM RQNPQRLAEV LGQNPSYIFF RELAGSSNDG
801 PVGALGTPLM GEYAGAVDRH YITLGAPLFV ATAHPVTRKA LNRLIMAQDT
851 GSAIKGAVRV DYFWGYGDEA GELAGKQKTT GYVWQLLPNG MKPEYRP*
ΔG287NZ-953
ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG
GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT
GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT
ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA
ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC
AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGTGGTGATTTTGGAAGG
ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT
AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAATAAGT
AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG
GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG
CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC
CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGATTACCGGTATCTGACTTACGGGGCGGAAAAATTTGCCC
GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC
GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGAGTTCGGC
AGCAAATCTGTGCACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT
GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG
GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAG
CAGGATGGATCCGGAGGAGGAGGAGCCACCTACAAAGTGGACGAATATCACGCCAACGCCCGTTTCGCCATCGACCAT
TTCAACACCAGCACCAACGTCGGCGGTTTTTACGGTCTGACCGGTTCCGTCGAGTTCGACCAAGCAAAACGCGACGGT
AAAATCGACATCACCATCCCCGTTGCCAACCTGCAAAGCGGTTCGCAACACTTTACCGACCACCTGAAATCAGCCGAC
ATCTTCGATGCCGCCCAATATCCGGACATCCGCTTTGTTTCCACCAAATTCAACTTCAACGGCAAAAAACTGGTTTCC
GTTGACGGCAACCTGACCATGCACGGCAAAACCGCCCCCGTCAAACTCAAAGCCGAAAAATTCAACTGCTACCAAAGC
CCGATGGCGAAAACCGAAGTTTGCGGCGGCGACTTCAGCACCACCATCGACCGCACCAAATGGGGCGTGGACTACCTC
GTTAACGTTGGTATGACCAAAAGCGTCCGCATCGACATCCAAATCGAGGCAGCCAAACAATAAAAGCTT
1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM
51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM
101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG
151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL
201 THCKGDSCSG NMFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV
251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ
301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP
351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS
401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY
451 RPTDAEKGGF GVFAGKKEQD GSGGGGATYK VDEYHANARF AIDHFNTSTN
501 VGGFYGLTGS VEFDQAKRDG KIDITIPVAN LQSGSQHFTD HLKSADIFDA
551 AQYPDIRFVS TKFNFNGKKL VSVDGNLTMH GKTAPVKLKA EKFNCYQSPM
601 AKTEVCGGDF STTIDRTKWG VDYLVNVGMT KSVRIDIQIE AAKQ*
ΔG287NZ-961
ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG
GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT
GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT
ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA
ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC
AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGTGGTGATTTTGGAAGG
ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT
AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAAATAAGT
AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG
GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG
CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC
CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCC
GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC
GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGC
AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT
GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG
GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAG
CAGGATGGATCCGGAGGAGAGGAGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCC
TACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACC
AAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTG
ACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACA
ACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAA
TTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAATTAAGAAGCCGTG
GCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGAC
GAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCT
GCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCT
GCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTG
TACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGC
TTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGC
AAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAAT
GTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTT
GCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGG
TAAAAGCTT
1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM
51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM
101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG
151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL
201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV
251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ
301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP
351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS
401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY
451 RPTDAEKGGF GVFAGKKEQD GSGGGGATND DDKKAATVAL IAAAYNNGQE
501 INGFKAGETI YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV
551 NENKQNVDAK VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG
601 ENITTFAEET KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD
651 EAVKTANEAK QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA
701 VAAKVTDIKA DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK
751 LDTRLASAEK SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY
801 NVGRFNVTAA VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY
851 HVGVNYEW*
实施例3-ΔG983的杂交体
蛋白质983具有如下序列:
1 MRTTPTFPTK TFKPTAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS
51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN
101 LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE
151 LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT
201 DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM
251 MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL
301 DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA
351 LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA
401 MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN
451 LRTTLLTTAQ DIGAVGVDSK FGWGLLDAGK AMNGPASFPF GDFTADTKGT
501 SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN
551 KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ
601 LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS
651 AAKIGQDYSF FTNIETDGGL LASLDSVEKT AGSEGDTLSY YVRRGNAART
701 ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT
751 DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG
801 RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA
851 KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIEHD AGDIGYLKGL
901 FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE
951 GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL
1001 FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE
1051 FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF*
因此ΔG983具有如下基本序列:
TSAPD FNAGGTGIGS
NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN
LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE
LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT
DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM
MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL
DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA
LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA
MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN
LRTTLLTTAQ DIGAVGVDSK FGWGLLnAGK AMNGPASFPF GDFTADTKGT
SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN
KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ
LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS
AAKIGQDYSF FTNIETDGGL LASLDSVEKT AGSEGDTLSY YVRRGNAART
ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT
DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG
RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA
KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIRHD AGDIGYLKGL
FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE
GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL
FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE
FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF*
将ΔG983作为在其C-未端与ORF46.1、741、961或961C的杂交体表达:
ΔG983-ORF46.1
ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA
GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG
GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA
TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC
ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA
AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG
GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT
ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG
AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC
ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG
TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC
GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC
AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT
GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT
ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA
ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC
CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG
GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT
ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC
GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT
CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG
CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC
GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA
GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC
GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC
GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC
CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT
GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC
ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGCAAAAATGCGC
GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC
AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC
GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAAGCCGCAGCACCGGTGCGACGAA
CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG
GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG
GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT
AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC
GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT
GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA
CGAGTCGGCGTAGGCTACCGGTTCCTCGACGGTGGCGGAGGCACTGGATCCTCAGATTTGGCAAACGATTCTTTTATC
CGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGCAGCAGGGGGGAACTTGCCGAG
CGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAACAGGCGGCCATTAAA
GGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGACAACCATGCCTCACAT
TCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGACGGATACGAACACCAT
CCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGATATATACAGCTACGAC
ATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGACAACGGCTTGCCGACCGTTTC
CACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATACAGCCCCGAGCTGGAC
AGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGCGCGGCAGGAGAAATT
GTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGCTTGGGTCTGCTTTCC
ACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCCGCAGCAGCCATCCGC
GATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATGGCAGCCATCCCCATC
AAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCTATCAAGCGGTCGCAGATGGGC
GCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCCAAATACCCGTCCCCT
TACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCCTCAACCGTGCCGCCG
TCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGACGGTAAAGGGTTTCCG
AATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA
1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD
51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV
101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE
151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD
201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD
251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF
301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP
351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG
401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG
451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT
501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT
551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK
601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE
651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE
701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA
751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG
801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF
851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL
901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG
951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR
1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLD
1051 GGGGTGSSDL ANDSFIRQVL DRQHTEPDGK YHLFGSRGEL AERSGHIGLG
1101 KIQSHQLGNL MIQQAAIKGN IGYIVRFSDH GHEVHSPFDN HASHSDSDEA
1151 GSPVDGFSLY RIHWDGYEHH PADGYDGPQG GGYPAPKGAR DIYSYDIKGV
1201 AQNIRLNLTD NRSTGQRLAD RFHNAGSMLT QGVGDGFKRA TRYSPELDRS
1251 GNAAEAFNGT ADIVKNIIGA AGEIVGAGDA VQGISEGSNI AVMHGLGLLS
1301 TENKMARIND LADMAQLKDY AAAAIRDWAV QNPNAAQGIE AVSNIFMAAI
1351 PIKGIGAVRG KYGLGGITAH PIKRSQMGAI ALPKGKSAVS DNFADAAYAK
1401 YPSPYHSRNI RSNLEQRYGK ENITSSTVPP SNGKNVKLAD QRHPKTGVPF
1451 DGKGFPNFEK HVKYDTLEHH HHHH*
ΔG983-741
ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA
GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG
GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA
TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC
ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA
AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG
GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCCATTTGGTCTCCCAT
ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG
AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC
ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG
TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC
GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC
AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT
GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT
ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA
ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC
CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG
GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT
ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGAKCAAAAAAGGCGGCAGCCAACTGCAACTGCAC
GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT
CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG
CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC
GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA
GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC
GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC
GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC
CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT
GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC
ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGC
GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC
AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC
GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAA
CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG
GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG
GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT
AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC
GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT
GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA
CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGATCCGGAGGGGGTGGTGTCGCCGCCGACATCGGTGCGGGGCTTGCC
GATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGATCAGTCCGTCAGGAAAAAC
GAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTCAATACGGGCAAATTGAAG
AACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGGCAGCTCATTACCTTGGAGAGTGGAGAG
TTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAAGATTCGGAGCATTCCGGG
AAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTTGACAAGCTTCCCGAAGGC
GGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAACTGACCTACACCATAGATTTCGCC
GCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTGGCCGCCGCCGATATCAAG
CCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAAGGCAGTTACTCCCTCGGT
ATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGCATACGCCATATCGGCCTT
GCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA
1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD
51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV
101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE
151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD
201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD
251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF
301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP
351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG
401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG
451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT
501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT
551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK
601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE
651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE
701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA
751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVLAQTQ QDGGTWEQGG
801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF
851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL
90 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG
951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR
1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE
1051 GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ
1101 GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ
1151 SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT
1201 YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD
1251 GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL
1301 AAKQLEHHHH HH*
ΔG983-961
ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA
GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG
GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA
TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC
ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA
AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG
GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT
ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG
AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC
ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG
TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC
GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC
AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT
GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT
ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA
ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC
CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG
GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT
ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC
GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT
CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG
CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC
GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA
GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC
GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC
CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG
GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG
GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT
AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC
GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT
GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA
CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGTGGCGGAGGCACTGGATCCGCCACAAACGACGACGATGTTAAAAAA
GCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTAC
GACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTG
GGTCTGAAAAAATCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAAGCT
GCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTG
GATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTA
AAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCA
TTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAA
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACT
GCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATT
GCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAAC
GCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGT
TTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTC
CAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCAGTCGCCATCGGT
ACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGTTCTTCCGCAGCC
TACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCACTGA
1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD
51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK MLINLKPAIE AGYTGRGVEV
101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE
151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD
201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD
251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF
301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP
351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG
401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG
451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT
501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT
551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK
601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE
651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE
701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA
751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG
801 VEGKMRGSTQ TVGIAAKTGE MTTAAATLGM GRSTWSENSA NAKTDSISLF
851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL
901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG
951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR
1001 NKPHTRLVAG LGADVEFCNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE
1051 GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDIDEDGTI
1101 TKKDATAADV EADDFKGLGL KKVVTNLTKT VNENKQNVDA KVKAAESEIE
1151 KLTTKLADTD AALADTDAAL DATTNALNKL GENITTFAEE TKTNIVKIDE
1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN
1251 VDAKVKAAET AAGKAEAAAG TANTAADKAE AVAAKVTDIK ADIATNKDNI
1301 AKKANSADVY TREESDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTRL
1351 NGLDKTVSDL RKETRQGLAE QAALSGIFQP YNVGRFNVTA AVGGYKSESA
1401 VAIGTGFRFT ENFAAKAGVA VGTSSGSSAA YHVGVNYEWL EHHHHHH*
ΔG983-961c
ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA
GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG
GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA
TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC
ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA
AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG
GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT
ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG
AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC
ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG
TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC
GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC
AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT
GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT
ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA
ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC
CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG
GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT
ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC
GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT
CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG
CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC
GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA
GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC
GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCCAATCATCC
GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC
CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT
GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC
ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGC
GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC
AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC
GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAA
CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG
GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG
GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT
AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC
GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT
GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA
CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGTGGCGGAGGCACTGGATCCGCCACAAACGACGACGATGTTAAAAAA
GCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTAC
GACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTG
GGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAGCT
GCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTG
GATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTA
AAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCA
TTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAA
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACT
GCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATT
GCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAAC
GCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGT
TTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTC
CAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA
1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD
51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRVGEV
101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE
151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD
201 ATLHIMNNTD ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD
251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF
301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP
351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG
401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG
451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT
501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT
551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK
601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE
651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE
701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHNMAA DGVRIFNSLA
751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG
801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF
851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL
901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG
951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR
1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE
1051 GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDIDEDGTI
1101 TKKDATAADV EADDFKGLGL KKVVTNLTKT VNENKQNVDA KVKAAESEIE
1151 KLTTKLADTD AALADTDAAL DATTNALNKL GENITTFAEE TKTNIVKIDE
1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN
1251 VDAKVKAAET AAGKAEAAAG TANTADKAE AVAAKVTDIK ADIATNKDNI
1301 AKKANSADVY TREESDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTRL
1351 NGLDKTVSDL RKETRQGLAE QAALSGLFQP YNVGLEHHHH HH*
实施例4-ΔG741的杂交体
蛋白质741具有如下序列:
1 VNRTAFCCLS LTTALILTAC SSGGGGVAAD IGAGLADALT APLDHKDKGL
51 QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ
101 IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMAKRQFRI
151 GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI
201 EHLKSPELN DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA
251 QEVAGSAEVK TVNGIRHIGL AAKQ*
因此,ΔG741具有如下基本序列:
VAAD IGAGLADALT APLDHKDKGL
QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ
IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI
GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI
EHLKSPELNV DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA
QEVAGSAEVK TVNGIRHIGL AAKQ*
将ΔG741直接融合于蛋白质961、961c、983和ORF46.1的符合读框的上游:
AG741-961
ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA
AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG
ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC
GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC
AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG
AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGTGGCGGAGGCACTGGATCCGCCACA
AACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTC
AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAA
GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAAC
GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTA
GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAA
GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCA
TTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAA
CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCT
GCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATC
GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTT
GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGAT
CACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAA
GCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCC
GAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACT
TCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCACTGA
1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT
51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL
101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA
151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DICPDGKRHA
201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL
251 EGGGGTGSAT NDDDVKKAAT VAIAAAYNNG QEINGFKAGE TIYDIDEDGT
301 ITKKDATAAD VEADDFKGLG LKKVVTNLTK TVNENKQNVD AKVKAAESEI
351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID
401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKQTAEETKQ
451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN
501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR
551 LNGLDKTVSD LRKETRQGLA EQAALSGLFQ PYNVGRFNVT AAVGGYKSES
601 AVAIGTGFRF TENFAAKAGV AVGTSSGSSA AYHVGVNYEW LEHHHHHH*
ΔG741-961c
ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA
AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG
ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC
GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC
AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG
AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGTGGCGGAGGCACTGGATCCGCCACA
AACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTC
AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAA
GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAAC
GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTA
GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAA
GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCA
TTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAA
CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCT
GCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATC
GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTT
GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGAT
CACGATACTCGCCTGAACGGTTTGCATAAAACAGTGTCACACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAA
GCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA
1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT
51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL
101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA
151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA
201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL
251 EGGGGTGSAT NDDDVKKAAT VAIAAAYNNG QEINGFKAGE TIYDIDEDGT
301 ITKKDATAAD VEADDFKGKG LKKVVTNLTK TVNENKQNVD AKVKAAESEI
351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID
401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKQTAEETKQ
451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN
501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR
551 LNGLDKTVSD LRKETRQGLA EQAALSGLFQ PYNVGLEHHH HHH*
ΔG741-983
ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA
AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG
ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC
GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC
AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG
AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGATCCGGCGGAGGCGGCACTTCTGCG
CCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTAC
GCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGG
GATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTG
ATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCC
GTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCG
TATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAG
ACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATTTTATTGGCGGG
CGTTCCGTGGACGGCATACCTGCATGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACC
AAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAAC
AGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCG
TTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCC
TACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCC
CTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTC
AAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGG
TGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCC
GCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACG
TTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAG
GCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCC
TTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAACAACACC
TATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACC
AAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCAGATACC
GACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGT
TTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCA
GGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACA
AACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTG
TCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACAC
GCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAG
ACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCA
GCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACC
GCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGC
GTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAA
ACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGC
GAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTAT
CTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGC
AGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACG
GTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGC
AACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTG
TTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCA
GCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAATTCGGC
AACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTA
GGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA
1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT
51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL
101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA
151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA
201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL
251 EGSGGGGTSA PDFNAGGTGI GSNSRATTAK SAAVSYAGIK NEMCKDRSML
301 CAGRDDVAVT DRDAKINAPP PNLHTGDFPN PNDAYKNLIN LKPAIEAGYT
351 GRGVEVGIVD TGESVGSISF PELYGRKEHG YNENYKNYTA YMRKEAPEDG
401 GGKDIEASFD DEAVIETEAK PTDIRHVKEI GHIDLVSHII GGRSVDGRPA
451 GGIAPDATLH IMNTNDETKN EMMVAAIRNA WVKLGERGVR IVNNSFGTTS
501 RAGTADLFQI ANSEEQYRQA LLDYSGGDKT DEGIRLMQQS DYGNLSYHIR
551 NKNMLFIFST GNDAQAQPNT YALLPFYEKD AQKGIITVAG VDRSGEKFKR
601 ENYGEPGTEP LEYGSNHCGI TAMWCLSAPY EASVRFTRTN PIQIAGTSFS
651 APIVTGTAAL LLQKYPWMSN DNLRTTLLTT AQDIGAVGVD SKFGWGLLDA
701 GKAMNGPASF PFGDFTADTK GTSDIAYSFR NDISGTGGLI KKGGSQLQLH
751 GNNTYTGKTI IEGGSLVLYG NNKSDMRVET KGALIYNGAA SGGSLNSDGI
801 VYLADTDQSG ANETVHIKGS LQLDGKGTLY TRLGKLLKVD GTAIIGGKLY
851 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE
901 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV
951 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR
1001 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG
1051 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT
1101 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL
1151 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAAEKGS ALGWSGNSLT
1201 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT
1251 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV
1301 GYRFLEHHHH HH*
ΔG741-ORF46.1
ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA
AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG
ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC
GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC
AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG
AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGACGGTGGCGGAGGCACTGGATCCTCAGAT
TTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGC
AGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATG
ATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCC
TTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCAT
TGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCG
AGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGA
CAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACC
CGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATC
ATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATG
CACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGAC
TATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATC
TTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCT
ATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCA
TACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATC
ACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCG
TTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA
1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT
51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLRSGEF QVYKQSHSAL
101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA
151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA
201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL
251 DGGGGTGSSD LANDSFIRQV LDRQHFEPDG KYHLFGSRGE LAERSGHIGL
301 GKIQSHQLGN LMIQQAAIKG NIGYIVRFSD HGHEVHSPFD NHASHSDSDE
351 AGSPVDGFSL YRIHWDGYEH HPADGYDGPQ GGGYPAPKGA RDIYSYDIKG
401 VAQNIRLNLT DNRSTGQRLA DRFHNAGSML TQGVGDGFKR ATRYSPELDR
451 SGNAAEAFNG TADIVKNIIG AAGEIVGAGD AVQGISEGSN IAVMHGLGLL
501 STENKMARIN DLADMAQLKD YAAAAIRDWA VQNPNAAQGI EAVSNIFMAA
551 IPIKGIGAVR GKYGLGGITA HPIKRSQMGA IALPKGKSAV SDNFADAAYA
601 KYPSPYHSRN IRSNLEQRYG KENITSSTVP PSNGKNVKLA DQRHPKTGVP
651 FDGKGFPNFE KHVKYDTLEH HHHHH*
实施例5-287的杂交体
以全长、具有C-未端His-标记或无其前导肽但具有C-未端His-标记的287的表达水平非常低。用N-未端GST-融合实现了较好的表达。将GST用作N-未端融合配体的替代方法,将287置于蛋白质919的C-未端(‘919-287′)、蛋白质953的C-端(‘953-287′)和蛋白质ORF46.1的C-未端(‘ORF46.1-287′)。在这两种方法中,前导肽都是缺失的,且杂交体是直接的符合读杠的融合。
为了制备953-287杂交体,通过设计正向引物自各序列前导区的下游,除去两种蛋白质的前导肽;在953反向引物中除去终止密码子序列,但在287反向引物中包含终止密码子序列。对953基因而言,用于扩增的5′和3′引物分别包括NdeI和BamHI限制酶切位点,而对287基因的扩增而言,5′和3′引物分别包括BamHI和XhoI限制酶切位点。用这种方法,用NdeI-BamHI(克隆第一个基因)且随后用BamHI-XhoI(克隆第二个基因),可以实现pET21b+中两个基因的顺序定向克隆。
通过将编码287的成熟部分的序列克隆入pET21b+中的919-His克隆的3′-端的XhoI位点,可以得到919-287杂交体。设计用于扩增287基因的引物,从而在PCR片段的5′-引入SalI限制酶切位点并在3′-引入XhoI位点。因为由SalI和XhoI限制酶产生的粘性末端是相容的,因此可以将由SalI-XhoI消化的287PCR产物插入到由XhoI切割的pET21b-919克隆中。
类似地得到ORF46.1-287杂交体。
将针对杂交蛋白产生的抗体的杀菌效力(同源菌株)与针对组分抗原的简单混合物产生的抗体相比:
|
与287的混合物 |
与287的杂交体 |
919 |
32000 |
16000 |
953 |
8192 |
8192 |
ORF46.1 |
128 |
8192 |
对919-287和953-287而言,还获得了针对异源MenB菌株和针对血清型A和C的杀菌活性的数据:
|
919 |
953 |
ORF46.1 |
菌株 |
混合物 |
杂交体 |
混合物 |
杂交体 |
混合物 |
杂交体 |
MC58 |
512 |
1024 |
512 |
1024 |
- |
1024 |
NGH38 |
1024 |
2048 |
2048 |
4096 |
- |
4096 |
BZ232 |
512 |
128 |
1024 |
16 |
- |
- |
MenA(F6124) |
512 |
2048 |
2048 |
32 |
- |
1024 |
MenC(C11) |
>2048 |
n.d. |
>2048 |
n.d. |
- |
n.d. |
MenC(BZ133) |
>4096 |
>8192 |
>4096 |
<16 |
- |
2048 |
还构建了ORF46.1和919的杂交体。在N-未端用919得到了最佳结果(高4倍的滴定度)。
还测试了杂交体919-519His、ORF97-225His和225-ORF97His。它们的ELISA滴定度和杀菌抗体应答结果中等。
两种蛋白质A和B的杂交体可以是NH2-A-B-COOH或NH2-B-A-COOH,用ΔG287还制得了在N-未端与287“反向”杂交的杂交体。使用一系列菌株,包括同源菌株2996。FCA用作佐剂:
|
287&919 |
287&953 |
287&ORR46.1 |
菌株 |
ΔG287-919 |
919-287 |
ΔG287-953 |
953-287 |
ΔG287-46.1 |
46.1-287 |
2996 |
128000 |
16000 |
65536 |
8192 |
16384 |
8192 |
BZ232 |
256 |
128 |
128 |
<4 |
<4 |
<4 |
1000 |
2048 |
<4 |
<4 |
<4 |
<4 |
<4 |
MC58 |
8192 |
1024 |
16384 |
1024 |
512 |
128 |
NGH38 |
32000 |
2048 |
>2048 |
4096 |
16384 |
4096 |
394/98 |
4096 |
32 |
256 |
128 |
128 |
16 |
MenA(F6124) |
32000 |
2048 |
>2048 |
32 |
8192 |
1024 |
MenC(BZ133) |
64000 |
>8192 |
>8192 |
<16 |
8192 |
2048 |
通常在N-未端用287观察到较好的杀菌滴定度。
当融合于蛋白质961[如上所示的NH2-ΔG287-961-COOH-序列]时,得到的蛋白质是不溶解的,纯化必须将其变性和复性。在复性后,发现约50%蛋白质仍不溶解。比较可溶和不溶蛋白质,用可溶蛋白质得到更好的杀菌滴定度(FCA作为佐剂):
|
2996 |
BZ232 |
MC58 |
NGH38 |
F6124 |
BZ133 |
可溶的 |
65536 |
128 |
4096 |
>2048 |
>2048 |
4096 |
不溶的 |
8192 |
<4 |
<4 |
16 |
n.d. |
n.d. |
但用明矾佐剂替代,可以改善不溶形式的蛋白质的滴定度:
不溶的 |
32768 |
128 |
4096 |
>2048 |
>2048 |
2048 |
还在杂交蛋白中使用961c(见上)。由于961及其结构域变体指导有效的表达,它们非常适合作为杂交蛋白的N-未端部分。
实施例23-其它杂交体
附图中显示本发明的其它杂交蛋白,它们具有如下所示的序列。当与单独蛋白质相比时,这此杂交蛋白是有利的:
ORF46.1-741
ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC
CTATTCGGCAGCAGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAAATACAAAGCCATCAGTTGGGC
AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCADGAAGTC
CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC
CGCATCCATTGGGACGGATACGAACACCATCCCGCCGADGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC
AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC
AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA
CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT
AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT
GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA
CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC
AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG
GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC
GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA
GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA
GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGGGGTGGT
GTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCT
TTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAAC
GGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGAC
GGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACC
GAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAA
CATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACMATCGCGGGACGGCGTTCGGTTCAGAACGATGCCGGC
GGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTC
AATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAAC
CAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAA
ACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA
1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ
51 LGNLMIQQAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG
101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL
151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA
201 FNGTADIVKN IIGAAGEIVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA
251 RINDLADMAQ LKDYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG
301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH
351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP
401 NFEKHVKYDT GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR
451 KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL
501 ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF
551 DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV
601 DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK
651 TVNGIRHIGL AAKQLEHHHH HH*
ORF46.1-961
ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC
CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGC
AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTC
CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC
CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC
AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC
AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA
CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT
AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT
GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA
CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC
AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG
GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC
GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA
GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA
GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGAGGAGGA
GCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAAC
GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGAT
GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAA
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCC
GCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTT
GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCC
GAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAA
GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCC
GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCT
GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGC
AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTAGCTTCTGCTGAAAAATCCATT
GCCGATCACATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGAACCTGCGCAAAGAAACCCGCCAAGGCCTTGCA
GAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTAC
AAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTC
GGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCAC
TGA
1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ
51 LGNLMIQQAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG
101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL
151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA
201 FNGTADIVKN IIGAAGEIVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA
251 RINDLADMAQ LKDYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG
301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH
351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP
401 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGQE INGFKAGETI
451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV NENKQNVDAK
501 VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET
551 KTNIVKIDEK LEAVADTVDK EAEAFNDIAD SLDETNTKAD EAVKTANEAK
601 QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA VAAKVTDIKA
651 DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK LDTRLASAEK
701 SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY NVGRFNVTAA
751 VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY HVGVNYEWLE
801 HHHHHH*
ORF46.1-961c
ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC
CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGC
AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTC
CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC
CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC
AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC
AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA
CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT
AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT
GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA
CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC
AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGCAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG
GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC
GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA
GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA
GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGAGGAGGA
GCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAAC
GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGAT
GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAA
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCC
GCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTT
GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCC
GAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAA
GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCC
GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCT
GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGC
AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATT
GCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCA
GAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA
1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ
51 LGNLMIQQAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG
101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL
151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA
201 FNGTADIVKN IIGAAGETVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA
251 RINDLADAAQ LKKYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG
301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH
351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP
401 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGQE INGFKAGETI
451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV NENKQNVDAK
501 VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET
551 KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD EAVKTANEAK
601 QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA VAAKVTDIKA
651 DIATNKDNIA KKANSADVYT REESDSKEVR IDGLNATTEK LDTRLASAEK
701 SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY NVGLEHHHHH
751 H*
961-ORF46.1
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA
TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC
TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCA
GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGAGGAGGAGGATCA
GATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTC
GGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTG
ATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCC
CCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATC
CATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGC
GCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACC
GGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCC
ACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAAC
ATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTC
ATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAA
GACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAAT
ATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACAT
CCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCG
GCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAAC
ATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTA
CCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCAC
TGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK SESAVAIGTG
351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGS DLANDSFIRQ
401 VLDRQHFEPD GKYHLFGSRG ELAERSGHIG LGKIQSHQLG NLMIQQAAIK
451 GNIGYIVRFS DHGHEVHSPF DNHASHSDSD EAGSPVDGFS LYRIHWDGYE
501 HHPADGYDGP QGGGYPAPKG ARDIYSYDIK GVAQNIRLNL TDNRSTGQRL
551 ADRFHNAGSM LTQGVGDGFK RATRYSPELD RSGNAAEAFN GTADTVKNII
601 GAAGEIVGAG DAVQGISEGS NIAVMHGLGL LSTENKMARI NDLADMAQLK
651 DYAAAAIRDW AVQNPNAAQG IEAVSNIFMA AIPIKGIGAV RGKYGLGGIT
701 AHPIKRSQMG AIALPKGKSA VSDNFADAAY AKYPSPYHSR NIRSNLEQRY
751 GKENITSSTV PPSNGKNVKL ADQRPKTTGV PFDGKGFPNF EKHVKYDTLE
801 HHHHHH*
961-741
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTAGACAATGGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAG
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA
TTTGCTGAAGAGAACTAAGACAATATTCGTAAAAATTGATAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACAGAACCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACCACACGCTTGGCTTCTGCTAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC
TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCA
GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGAGGGGGTGGTGTC
GCCGCCGACATCGGTGCGGGGCTTGCCGATGCACATACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTG
ACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGT
GACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGG
CAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAG
CAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACAT
ACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGA
AAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAAT
GTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAA
GCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACC
GTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK SESAVAIGTG
351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGV AADIGAGLAD
401 ALTAPLDHKD KGLQSLTLDQ SVRKNEKLKL AAQGAEKTYG NGDSLNTGKL
451 KNDKVSRFDF IRQIEVDGQL ITLESGEFQV YKQSHSALTA FQTEQIQDSE
501 HSGKMVAKRQ FRIGDIAGEH TSFDKLPEGG RATYRGTAFG SDDAGGKLTY
551 TIDFAAKQGN GKIEHLKSPE LNVDLAAADI KPDGKRHAVI SGSVLYNQAE
601 KGSYSLGIFG GKAQEVAGSA EVKTVNGIRH IGLAAKQLEH HHHHH*
961-983
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTACTAACCTGACCAAAACCGTCAATGAAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA
TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC
TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCA
GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGCGGAGGCGGCACT
TCTGCGCCCGACTTCAATGCAGGCGCGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAACAGCAGCAGTA
TCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACA
GACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAG
AATTTGATCAACCTCAACCTGCAATTGAAGCAGGCTATACAGGACGCGGCGGTAGAGGTAGGTATCGTCGACACAGGC
GAATCCGTCGGCAGCATTCCTTTCCCGAACTGTATGGCAAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTAT
ACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTT
ATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATT
GGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGAT
GAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTC
AATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGC
CAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAAC
CTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACA
TATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAA
AAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCC
ATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCC
TTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGT
ACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCG
GGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATAATGCC
TACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAAC
AACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTC
GAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCA
GATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTAC
ACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAG
GGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTC
TTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGAC
ACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTG
AAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACA
CCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCA
GCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGAC
AGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGT
CTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGT
ACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACA
TGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATC
GGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCG
AAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGAT
TTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGG
AGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCC
GTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCG
ACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAA
TTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTC
GGCGTAGGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK SESAVAIGTG
351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGT SAPDFNAGGT
401 GIGSNSRATT AKSAAVSYAG IKNEMCKDRS MLCAGRDDVA VTDRDAKINA
451 PPPNLHTGDF PNPNDAYKKL INLKPAIEAG YTGRGVEVGI VDTGESVGSI
501 SFPELYGRKE HGYNENYKNY TAYMRKEAPE DGGGKDIEAS FDDEAVIETE
551 AKPTDIRHVK EIGHIDLVSH IIGGRSVDGR PAGGIAPDAT LHIMNTNDET
601 KNEMMVAAIR NAWVKLGERG VRIVNNSFGT TSRAGTADLF QIANSEEQYR
651 QALLDYSGGD KTDEGIRLMQ QSDYGNLSYH IRNKNMLFIF STGNDAQAQP
701 NTYALLPFYE KDAQKGIITV AGVDRSGEKT KREMYGEPGT EPLEYGSNHC
751 GITAMWCLSA PYEASVRFTR TNPIQIAGTS FSAPIVTGTA ALLLQKYPWM
801 SNDNLRTTLL TTAQDIGAVG VDSKFGWGLL DAGKAMNGPA SFPFGDFTAD
851 TKGTSDIAYS FRNDISGTGG LIKKGGSQLQ LHGNNTYTGK TIIEGGSLVL
901 YGNNKSDMRV ETKGALIYNG AASGGSLNSD GIVYLADTDQ SGANETVHIK
951 GSLQLDGKGT LYTRLGKLLK VDGTAIIGGK LYMSARGKGA GYLNSTGRRV
1001 PFLSAAKIGQ DYSFFTNIET DGGLLASLDS VEKTAGSEGD TLSYYVRRGN
1051 AARTASAAAH SAPAGLKHAV EQGGSNLENL MVELDASESS ATPETVETAA
1101 ADRTDMPGIR PYGATFRAAA AVQHANAADG VRIFNSLAAT VYADSTAAHA
1151 DMQGRRLKAV SDGLDHNGTG LRVIAQTQQD GGTWEQGGVE GKMRGSTQTV
1201 GIAAKTGENT TAAATLGMGR STWSENSANA KTDSISLFAG IRHDAGDIGY
1251 LKGLFSYGRY KNSISRSTGA DEHAEGSVNG TLMQLGALGG VNVPFAATGD
1301 LTVEGGLRYD LLKQDAFAEK GSALGWSGNS LTEGTLVGLA GLKLSQPLSD
1351 KAVLFATAGV ERDLNGRDYT VTGGFTGATA ATGKTGARNM PHTRLVAGLG
1401 ADVEFGNGWN GLARYSYAGS KQYGNHSGRV GVGYRFLEHH HHHH*
961c-ORF46.1
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAAGAATCTGAAATAGAAAGTTAACAACCAAGTTAGCAGACACTGAT
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAAATGGGAGAAAATATAACGACA
TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGAGAGGATCAGAATTTGGCA
AACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCTACCTATTCGGAGCAGG
GGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAA
CAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGAC
AACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGAC
GGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGAT
ATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGACAACGG
CTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATAC
AGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGC
GCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGC
TTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCC
GCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATG
GCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCTATCAAG
CGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCC
AAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCC
TCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGAC
GGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGSDLAND SFIRQVLDRQ
351 HFEPDGKYHL FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY
401 IVRFSDHGHE VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD
451 GYDGPQGGGY PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH
501 NAGSMLTQGV GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE
551 IVGAGDAVQG ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA
601 AIRDWAVQNP NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK
651 RSQMGAIALP KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI
701 TSSTVPPSNG KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTLEHHHHH
751 H*
961c-741
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA
TTTGCTGAAGAGACTAAGACAAATATCGTAAAATTGATGAAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGTCGCCAAAGTAAATGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTCGACACACGCTTGGCTTCTGCTGAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGGGGTGGTGTCGCCGCCGAC
ATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGAT
CAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTC
AATACGGGCAAATTGAAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAATCGAAGTGGACGGGCAGCTCATT
ACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAA
GATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTT
GACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAACTGACC
TACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTG
GCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAA
GGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGC
ATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNTV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGVAADIG AGLADALTAP
351 LDHKDKGLQS LTLDQSVRKN EKLKLAAQGA EKTYGNGDSL NTGKLKNDKV
401 SRFDFIRQIE VDGQLITLES GEFQVYKQSH SALTAFQTEQ IQDSEHSGKM
451 VAKRQFRIGD IAGEHTSFDK LPEGGRATYR GTAFGSDDAG GKLTYTIDFA
501 AKQGNGKIEH LKSPELNVDL AAADIKPDGK RHAVISGSVL TNQAEKGSYS
551 LGIFGGKAQE VAGSAEVKTV NGIRHIGLAA KQLEHHHHHH*
961c-983
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA
TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT
GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA
GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT
GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGCGGAGGCGGCACTTCTGCGCCC
GACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTACGCC
GGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGGGAT
GCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTGATC
AACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCCGTC
GGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCGTAT
ATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAGACT
GAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATTGGCGGGCGT
TCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACCAAG
AACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAACAGT
TTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCGTTG
CTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCCTAC
CACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCCCTA
TTGCCATTTTATGAAAAACACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTCAAA
CGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGGTGC
CTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCCGCA
CCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACGTTG
CTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAGGCC
ATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCCTTC
CGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAACAACACCTAT
ACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACCAAA
GGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCAGATACCGAC
CAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGTTTG
GGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCAGGC
TATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACAAAC
ATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTGTCC
TATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACACGCC
GTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAGACG
GTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCAGCC
GTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACCGCC
GCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGCGTC
ATCGCGCAAACCCAACAGGACGGTGCAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAAACC
GTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGCGAA
AACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTATCTC
AAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGCAGC
GTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACGGTC
GAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGCAAC
AGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTGTTT
GCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCAGCA
ACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAATTCGGCAAC
GGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTAGGC
TACCGGTTCCTCGAGCACCACCACCACCACCACTGA
1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT
51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL
101 ADTDAALADT DAALDATTNA LNKMGENITT FAEETKTNIV KIDEKLEAVA
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK
201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS
251 ADVYTREESD SKPVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGTSAPDF NAGGTGIGSN
351 SRATTAKSAA VSYAGIKNEM CKDRSMLCAG RDDVAVTDRD AKINAPPPNL
401 HTGDFPNPND AYKNLINLKP AIEAGYTGRG VEVGIVDTGE SVGSISFPEL
451 YGRKEHGYNE NYKNYTAYMR KEAPEDGGGK DIEASFDDEA VIETEAKPTD
501 IRHVKEIGHI DLVSHIIGGR SVDGRPAGGI APDATLHIMN TNDETKNEMM
551 VAAIRNAWVK LGERGVRIVN NSFGTTSRAG TADLFQIANS EEQYRQALLD
601 YSGGDKTDEG IRLMQQSDYG NLSYHIRNKN MLFIFSTGND AQAQPNTYAL
651 LPFYEKDAQK GIITVAGVDR SGEKFKREMY GEPGTEPLEY GSNHCGITAM
701 WCLSAPYEAS VRFTRTNPIQ IAGTSFSAPI VTGTAALLLQ KYPWMSNDNL
751 RTTLLTTAQD IGAVGVDSKF GWGLLDAGKA MNGPASFPFG DFTADTKGTS
801 DIAYSFRNDI SGTGGLIKKG GSQLQLHGNN TYTGKTIIEG GSLVLYGNNK
851 SDMRVETKGA LIYNGAASGG SLNSDGIVYL ADTDQSGANE TVHIKGSLQL
901 DGKGTLYTRL GKLLKVDGTA IIGGKLYMSA RGKGAGYLNS TGRRVPFLSA
951 AKIGQDYSFF TNIETDGGLL ASLDSVEKTA GSEGDTLSYY VRRGNAARTA
1001 SAAAHSAPAG LKHAVEQGGS NLENLMVELD ASESSATPET VETAAADRTD
1051 MPGIRPYGAT FRAAAAVQHA NAADGVRIFN SLAATVYADS TAAHADMQGR
1101 RLKAVSDGLD HNGTGLRVIA QTQQDGGTWE QGGVEGKMRG STQTVGIAAK
1151 TGENTTAAAT LGMGRSTWSE NSANAKTDSI SLFAGIRHDA GDIGYLKGLF
1201 SYGRYKNSIS RSTGADEHAE GSVNGTLMQL GALGGVNVPF AATGDLTVEG
1251 GLRYDLLKQD AFAEKGSALG WSGNSLTEGT LVGLAGLKLS QPLSDKAVLF
1301 ATAGVERDLN GRDYTVTGGF TGATAATGKT GARNMPHTRL VAGLGADVEF
1351 GNGWNGLARY SYAGSKQYGN HSGRVGVGYR FLEHHHHHH*
961cL-ORF46.1
ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC
GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA
GCTGGAGAGACCATCTACGACATTCATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC
GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC
GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA
GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG
ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC
AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG
ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC
GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT
ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC
AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC
GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC
GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGAGGAGGATCAGATTTGGCAAACGATTCTTTT
ATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGCAGCAGGGGGGAACTTGCC
GAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAACAGGCGGCCATT
AAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGACAACCATGCCTCA
CATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGACGGATACGAACAC
CATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGATATATACAGCTAC
GACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTCACCGACAACCGCAGCACCGGACAACGGCTTGCCGACCGT
TTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATACAGCCCCGAGCTG
GACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGCGCGGCAGGAGAA
ATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGCTTGGGTCTGCTT
TCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCCGCAGCAGCCATC
CGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATGGCAGCCATCCCC
ATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCTATCAAGCGGTCGCAGATG
GGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCCAAATACCCGTCC
CCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCCTCAACCGTGCCG
CCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGACGGTAAAGGGTTT
CCGAATTTTGAGAAGCACGTGAAATATGATACGTAACTCGAG
1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING
51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN
101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI
151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV
201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA
251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT
301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQPYNVG
351 GSGGGGSDLA NDSFIRQVLD RQHFEPDGKY HLFGSRGELA ERSGHIGLGK
401 IQSHQLGNLM IQQAAIKGNI GYIVRFSDHG HEVHSPFDNH ASHSDSDEAG
451 SPVDGFSLYR IHWDGYEHHP ADGYDGPQGG GYPAPKGARD IYSYDIKGVA
501 QNIRLNLTDN RSTGQRLADR FHNAGSMLTQ GVGDGFKRAT RYSPELDRSG
551 NAAEAFNGTA DIVKNIIGAA GEIVGAGDAV QGISEGSNIA VMHGLGLLST
601 ENKMARINDL ADMAQLKDYA AAAIRDWAVQ NPNAAQGIEA VSNIFMAAIP
651 IKGIGAVRGK YGLGGITAHP IKRSQMGAIA LPKGKSAVSD NFADAAYAKY
701 PSPYHSRNIR SNLEQRYGKE NITSSTVPPS NGKNVKLADQ RHPKTGVPFD
751 GKGFPNFEKH VKYDT*
961cL-741
ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC
GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA
GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC
GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC
GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA
GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG
ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC
AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG
ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC
GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT
ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC
AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC
GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC
GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGGGGTGGTGTCGCCGCCGACATCGGTGCGGGG
CTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGATCAGTCCGTCAGG
AAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTCAATACGGGCAAA
TTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGGCAGCTCATTACCTTGGAGAGT
GGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAAGATTCGGAGCAT
TCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTTGACAAGCTTCCC
GAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGGACGATGCCGGCGGAAAACTACCTACACCATAGAT
TTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTGGCCGCCGCCGAT
ATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAAGGCAGTTACTCC
CTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGCATACGCCATATC
GGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA
1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING
51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN
101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI
151 TTFAEETKTN IVKIDEELEA VADTVDHKAE AFNDIADSLD ETNTKADEAV
201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA
251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT
301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQFYNVG
351 GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ
401 GAEKTYGNGD SLNYGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ
451 SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT
501 YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD
551 GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL
601 AAKQLEHHHH HH*
961cL-983
ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC
GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA
GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC
GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC
GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA
GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG
ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC
AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG
ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC
GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT
ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC
AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC
GATACTCGCCTGAACGGTTTGGATAAAAGTGTCAGAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC
GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGCGGAGGCGGCACTTCTGCGCCCGACTTCAATGCA
GGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTACGCCGGTATCAAGAAC
GAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGGGATGCCAAAATCAAT
GCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTGATCAACCTCAAACCT
GCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCCGTCGGCAGCATATCC
TTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCGTATATGCGGAAGGAA
GCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAGACTGAAGCAAAGCCG
ACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATTGGCGGGCGTTCCGTGGACGGC
AGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACCAAGAACGAAATGATG
GTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAACAGTTTTGGAACAACA
TCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCGTTGCTCGACTATTCC
GGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCCTACCACATCCGTAAT
AAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCCCTATTGCCATTTTAT
GAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTCAAACGGGAAATGTAT
GGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGGTGCCTGTCGGCACCC
TATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCCGCACCCATCGTAACC
GGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACGTTGCTGACGACGGCT
CAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAGGCCATGAACGGACCC
GCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCCTTCCGTAACGACATT
TCAGGCACGGGCGGCCTGATCAAAAAAGQCGGCAGCCAACTGCAACTGCACGGCAACAACACCTATACGGGCAAAACC
ATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACCAAAGGTGCGCTGATT
TATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCAGATACCGACCAATCCGGCGCA
AACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGTTTGGGCAAACTGCTG
AAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCAGGCTATCTCAACAGT
ACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACAAACATCGAAACCGAC
GGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTGTCCTATTATGTCCGT
CGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACACGCCGTAGAACAGGGC
GGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAGACCGTTGAAACTGCG
GCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCAGCCGTACAGCATGCG
AATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACCGCCGCCCATGCCGAT
ATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGCGTCATCGCGCAAACC
CAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAAACCGTCGGCATTGCC
GCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGCGAAAACAGTGCAAAT
GCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTATCTCAAAGGCCTGTTC
TCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGCAGCGTCAACGGCACG
CTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACGGTCGAAGGCGGTCTG
CGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGCAACAGCCTCACTGAA
GGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTGTTTGCAACGGCGGGC
GTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCAGCAACCGGCAAGACG
GGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAATTCGGCAACGGCTGGAACGGC
TTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTAGGCTACCGGTTCTGA
CTCGAG
1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING
51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN
101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI
151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV
201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA
251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT
301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQFYNVG
351 GSGGGGTSAP DFNAGGTGIG SNSRATTAKS AAVSYAGIKN EMCKDRSMLC
401 AGRDDVAVTD RDAKINAPPP NLHTGDFPNP NDAYKNLINL KPAIEAGYTG
451 RGVEVGIVDT GESVGSISFP ELYGRKEHGY NENYKNYTAY MRKEAPEDGG
501 GKDIEASFDD EAVIETEAKP TDIRHVKEIG HIDLVSHIIG GRSVDGRPAG
551 GIAPDATLHI MNTNDETKNE MMVAAIRNAW VKLGERGVRI VNNSFGTTSR
601 AGTADLFQIA NSEEQYRQAL LDYSGGDKTD EGIRLMQQSD YGNLSYHIRN
651 KNMLFIFSTG NDAQAQPNTY ALLPFYEKDA QKGIITVAGV DRSGEKFKRE
701 MYGEPGTEPL EYGSNHCGIT AMWCLSAPYE ASVRFTRTNP IQIAGTSFSA
751 PIVTGTAALL LQKYPWMSND NLRTTLLTTA QDIGAVGVDS KFGWGLLDAG
801 KAMNGPASFP FGDFTADTKG TSDIAYSFRN DISGTGGLIK KGGSQLQLHG
851 NNTYTGKTII EGGSLVLYGN NKSDMRVETK GALIYNGAAS GGSLNSDGIV
901 YLADTDQSGA NETVHIKGSL QLDGKGTLYT RLGKLLKVDG TAIIGGKLYM
951 SARGKGAGYL NSTGRRVPFL SAAKIGQDYS FFTNIETDGG LLASLDSVEK
1001 TAGSEGDTLS YYVRRGNAAR TASAAAHSAP AGLKHAVEQG GSNLENLMVE
1051 LDASESSATP ETVETAAADR TDMPGIRPYG ATFRAAAAVQ HANAADGVRI
1101 FNSLAATVYA DSTAAHADMQ GRRLKAVSDG LDHNGTGLRV IAQTQQDGGT
1151 WEQGGVEGKM RGSTQTVGIA AKTGENTTAA ATLGMGRSTW SENSANAKTD
1201 SISLFAGIRH DAGDIGYLKG LFSYGRYKNS ISRSTGADEH AEGSVNGTLM
1251 QLGALGGVNV PFAATGDLTV EGGLRYDLLK QDAFAEKGSA LGWSGNSLTE
1301 GTLVGLAGLK LSQPLSDKAV LFATAGVERD LNGRDYTVTG GFTGATAATG
1351 KTGARNMPHT RLVAGLGADV EFGNGWNGLA RYSYAGSKQY GNHSGRVGVG
1401 YRF*
可以理解本发明仅以实施例的方式进行描述,在本发明的范围和精神内还可进行改变。例如,设想可以使用其它菌株的蛋白质[如,参见WO 00/66741,ORF4、ORF40、ORF46、225、235、287、519、726、919和953的多态序列]。
实验详述
克隆策略和寡核苷酸设计
用以脑膜炎奈瑟球菌B MC58的基因组序列为基础设计的寡核苷酸,通过PCR扩增编码感兴趣的抗原的基因。除非特别指出,通常将菌株2996的基因组DNA用作PCR反应的模板,将扩增的片段克隆入表达载体pET21b+(Novagen),从而以C-未端His标记的产物形式表达该蛋白,或将其克隆入pET-24b+(Novagen)以‘未标记的′形式(如ΔG287K)表达该蛋白。
不用融合配体和用其自身前导肽(如果存在时)表达蛋白质,进行开放读框(ATG到终止密码子)的扩增。
当蛋白质以‘未标记的′的形式表达时,通过从预定的前导序列设计5′-端扩增引物下游除去前导肽。
用于PCR的引物的解链温度取决于整个引物中杂交核苷酸的数量和类型,并用以下公式确定:
Tm1=4(G+C)+2(A+T) (除去尾部的)
Tm2=64.9+041(%GC)-600/N (完整的引物)
对整个低聚物而言,所选寡核苷酸的解链温度通常为65-70℃,仅针对杂交区域,解链温度为50-60℃。
用Perkin Elmer 394 DNA/RNA合成仪合成寡核苷酸,将其从柱上洗脱到2.0ml NH4OH中,在56℃培育5小时以去保护。加入0.3M乙酸钠和2体积的乙醇沉淀寡聚物。将样品离心,将沉淀物重悬浮于水中。
|
|
序列 |
限制酶切位点 |
fu(961)- |
Fwd |
CGCGGATCC-GGACGGGGTGGTGTCG |
BamHI |
741(MC58)-His |
|
|
|
Rev |
CCCGCTCGAG-TTGCTTGGCGGCAAGGC |
XhoI |
fu(961)-983-His |
Fwd |
CGCGGATCC-GGCGGAGGCGGCACTT |
BamHI |
Rev |
CCCGCTCGAG-GAACCGGTAGCCTACG |
XhoI |
fu(961)-Orf46.1-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC |
BamHI |
Rev |
CCCGCTCGAG-CGTATCATATTTCACGTGC |
XhoI |
fu(961 c-L)-741(MC58) |
Fwd |
CGCGGATCC-GGAGGGGGTGGTGTCG |
BamHI |
|
Rev |
CCCGCTCGAG-TTATTGCTTGGCGGCAAG |
XhoI |
fu(961c-L)-983 |
Fwd |
CGCGGATCC-GGCGGAGGCGGCACTT |
BamHI |
Rev |
CCCGCTCGAG-TCAGAACCGGTAGCCTAC |
XhoI |
fu(961c-L)-Orf46.1 |
Fwd |
CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC |
BamHI |
Rev |
CCCGCTCGAG-TTACGTATCATATTTCACGTGC |
XhoI |
fu-(ΔG287)-919-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-CAAAGCAAGAGCATCCAAACC |
BamHI |
Rev |
CCCAAGCTT-TTCGGGCGGTATTCGGGCTTC |
HindIII |
fu-(ΔG287)-953-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-GCCACCTACAAAGTGGAC |
BamHI |
Rev |
GCCCAAGCTT-TTGTTTGGCTGCCTCGAT |
HindIII |
fu-(ΔG287)-961-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-ACAAGCGACGACG |
BamHI |
Rev |
GCCCAAGCTT-CCACTCGTAATTGACGCC |
HindIII |
fu-(ΔG287)-Orf46.1-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC |
BamHI |
Rev |
CCCAAGCTT-CGTATCATATTTCACGTGC |
HindIII |
fu-(ΔG287-919)-Orf46.1-His |
Fwd |
CCCAAGCTTGGTGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC |
HindIII |
Rev |
CCCGCTCGAG-CGTATCATATTTCACGTGC |
XhoI |
fu-(ΔG287-Orf46.1)-919-His |
Fwd |
CCCAAGCTTGGTGCTGGTGGTGGT-CAAAGCAAGAGCATCCAAACC |
HindIII |
Rev |
CCCGCTCGAG-CGGGCGGTATTCGGGCTT |
XhoI |
fuΔG287(394.98)- |
Fwd |
CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC |
NheI |
Rev |
CGGGGATCC-ATCCTGCTCTTTTTTGCCGG |
BamHI |
fu Orf1-(Orf46.1)-His |
Fwd |
CGCGGATCCGCTAGC-GGACACACTTATTTCGGCATC |
NheI |
Rev |
CGCGGATCC-CCAGCGGTAGCCTAATTTGAT |
|
fu(Orf1)-Orf46.1-His |
Fwd |
CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC |
BamHI |
Rev |
CCCAAGCTT-CGTATCATATTTCACGTGC |
HindIII |
fu(919)-Orf46.1-His |
Fwd1 |
GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAG |
SalI |
Fwd2 |
GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC |
|
Rev |
CCCGCTCGAG-CGTATCTATTTCACGTGC |
XhoI |
Fu(orf46)-287-His |
Fwd |
CGGGGATCCGGGGGCGGCGGTGGCG |
BamHI |
Rev |
CCCAAGCTTATCCTGCTCTTTTTTTGCCGGC |
HindIII |
Fu(orf46)-919-His |
Fwd |
CGCGGATCCGGTGGTGGTGGTCAAAGCAAGAGCATCCAAACC |
BamHI |
Rev |
CCCAAGCTTCGGGCGGTATTTCGGGCTTC |
HindIII |
Fu(orf46-919)-287-His |
Fwd |
CCCCAAGCTTGGGGGCGGCGGTGGCG |
HindIII |
Rev |
CCCGCTCGAGATCCTGCTCTTTTTTCCCGGC |
XhoI |
Fu(orf46-287)-919-His |
Fwd |
CCCAAGCTTGGTGGTGGTGGTGGTCAAAGCAAGAGCATCCAAACC |
HindIII |
Rev |
CCCGCTCGAGCGGGCGGTATTCGGGCTT |
XhoI |
(ΔG741)-961c-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCGCAGCCACAAACGACGACGAGCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
XhoI |
Rev |
CCCGCTCGAG-ACCCAGCTTGTAAGGTTG |
XhoI |
(ΔG741)-961-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCGCAGCCACAAACGACGACGAGCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
XhoI |
Rev |
CCCGCTCGAG-CCACTCGTAATTGACGCC |
XhoI |
(ΔG741)-983-His |
Fwd |
GCGGCCTCGAG-GGATCCGGCGGAGGCGGCACTTCTGCG |
XhoI |
Rev |
CCCGCTCGAG-GAACCGGTAGCCTACG |
XhoI |
(ΔG741)-orf46.1-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCTCAGATTTGGCAAACGATTCGCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA |
SalI |
Rev |
CCCGCTCGAG-CGTATCATATTTCACGTGC |
XhoI |
(ΔG983)-741(MC58)-His |
Fwd |
GCGGCCTCGAG-GGATCCGGAGGGGGTGGTGTCGCC |
XhoI |
Rev |
CCCGCTCGAG-TTGCTTGGCGGCAAG |
XhoI |
(ΔG983)-961c-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCGCAGCCACAAACGACGACGAGCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
XhoI |
|
Rev |
CCCGCTCGAG-ACCCAGCTTGTAAGGTTG |
XhoI |
(ΔG983)-961-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCGCAGCCACAAACGACGACGAGCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
XhoI |
Rev |
CCCGCTCGAG-CCACTCGTAATTGACGCC |
XhoI |
(ΔG983)-Orf46.1-His |
Fwd1Fwd2 |
GGAGGCACTGGATCCTCAGATTTGCCAAACGATTCGCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA |
SalI |
Rev |
CCCGCTCGAG-CGTATCATATTTCACGTGC |
XhoI |
*将该引物用作反向引物,将所有287的C未端融合于His-标记。
§与287-His反向引物联用的正向引物。
NB-所有PCR反应使用菌株2996,除非特别指出(如菌株MC58)。
在所有以ATG起始不跟随唯一NheI位点的构建物中,ATG密码子是用于克隆的NdeI位点的一部分。在5′端用NheI作为克隆位点制备的构建物(如所有那些在N-未端包含287的)另有两个融合于抗原的编码序列的密码子(GCTAGC)。
染色体DNA模板的制备
在100ml GC培养基中使脑膜炎奈瑟球菌前株2996、MC58、394.98、1000和BZ232(及其它)生长至指数期,离心收获,并重悬浮于5ml缓冲液(20%w/v蔗糖、50mM Tris-HCl、50mM EDTA、pH 8)中。在冰上培育10分钟后,加入10ml裂解液(50mM NaCl、1%十二烷基肌氨酸钠(Na-Sarkosyl)、50μg/ml蛋白酶K)裂解细菌,悬浮液在37℃培育2小时。进行2次苯酚提取(平衡至pH8)和一次CHCl3/异戊醇(24∶1)提取。加入0.3M乙酸钠和2体积乙醇沉降DNA,并离心收集。用70%(v/v)乙醇洗涤沉淀物1次,并重溶解于4.0ml TE缓冲液(10mMTris-HCl、1mM EDTA、pH 8.0)。读取OD260测定DNA浓度。
PCR扩增
标准的PCR过程:进行如下在40μM各寡核苷酸引物、400-800μM dNTP溶液、1x PCR缓冲液(包含1.5mM MgCl2)、2.5单位TaqI DNA聚合酶(用Perkin-Elmer AmpliTaQ,Boerhingher Mannheim ExpandTM长模板)存在时,将200ng 2996、MC581000、或BZ232菌株的基因组DNA或10ng重组克隆的质粒DNA制备物用作模板。
将整个混合物在95℃初步培育3分钟后,每份样品进行2-步的扩增:用除去引物(Tm1)的限制酶尾部的杂交温度进行前5轮。然后按全长低聚物(Tm2)计算的杂交温度进行30轮。根据要扩增的Orf的长度,在68℃或72℃进行的延伸时间各不相同。对Orf1而言,自3分钟开始的延伸时间每轮递增15秒。以在72℃10分钟延伸步骤完成循环。
将扩增的DNA直接加到1%琼脂糖凝胶上。按制造商的说明,用Qiagen凝胶提取试剂盒纯化相应于正确大小条带的DNA片段。
PCR片段和克隆载体的消化
用适合的限制酶消化相应于扩增片段的纯化DNA,从而克隆入pET-21b+、pET22b+、或pET-24b+。用QIAquick PCR纯化试剂盒(按制造商的说明)纯化消化的片段,用H2O或10mM Tris(pH 8.5)洗脱。用适合的限制酶消化质粒载体,加到1.0%琼脂糖凝胶上,用Qiagen AIQquick凝胶提取试剂盒纯化相应于消化的载体的条带。
克隆
将预先消化和纯化的、相应于各基因的片段连接到pET21b+、pET22b+或pET-24b+中。将在连接缓冲液(由制造商提供)中的T4 DNA连接酶用于摩尔比为3∶1的片段/载体。
通过在冰上培育连接酶反应溶液和细菌40分钟,然后在37℃培育3分钟,将重组质粒转化入感受态大肠杆菌DH5或HB101中。
然后添加800μl LB肉汤,并在37℃培育20分钟。在Eppendorf微型离心机中以最高速度离心这些细胞,并重悬浮于约200μl的上清液中,并涂布在LB氨苄青霉素(100mg/ml)琼脂上。
在4.0ml LB肉汤+100μg/ml氨苄青霉素中培育随机选择的集落过夜,进行重组集落的筛选。使细胞沉淀,并按制造商的说明用Qiagen QIAprep SpinMiniprep试剂盒提取质粒DNA。用适合的限制酶消化约1μg的各微型制备物,将消化物加到1-1.5%琼脂糖凝胶(取决于预计的插入大小)与分子量标记(1kbDNA Ladder,GIBCO)平行。根据插入的大小选择阳性克隆。
表达
各基因克隆入表达载体后,将重组质粒转化入适合表达重组蛋白的大肠杆菌菌株中。如上所述,用1μl各构建物转化大肠杆菌BL21-DE3。将单重组集落接种入2ml LB+Amp(100μg/ml)中,在37℃培育过夜,然后在100ml烧瓶中用20ml LB+Amp(100μg/ml)以1∶30稀释,使OD600在0.1-0.2之间。将烧瓶置于旋转式水浴摇床中于30℃或37℃培育,直到OD600显示适合诱导表达的指数生长期(0.4-0.8OD)。加入1.0mM IPTG诱导蛋白质表达。在30℃或37℃培育3小时后,测定OD600并检测表达。用微型离心机离心1.0ml各样品,将沉淀物重悬浮于PBS中,用SDS-PAGE和考马斯蓝染色分析。
His-标记的蛋白质的纯化
从菌株2996和MC58中克隆了287的各种形式。用C-未端His标记的融合体进行构建,其包括成熟形式(aal8-427)、含缺失(Δ1、Δ2、Δ3和Δ4)的构建物和由B或C结构域组成的克隆。对以His-融合体纯化的各克隆而言,划线接种单集落,并且在37℃LB/Amp(100μg/ml)琼脂板上培育过夜。将从该平板上分离的集落接种到20ml LB/Amp(100μg/ml)的液体培养基中,并在37℃振荡生长过夜。以1∶30将过夜培养物稀释到1.0L LB/Amp(100μg/ml)液体培养基中,让其在最佳温度(30或37℃)生长,直到OD550达到0.6-0.8。添加IPTG(终浓度为1.0mM)诱导重组蛋白的表达,再培育培养物3小时。于4℃,以8000g离心15分钟收获细菌。将细菌沉淀物重悬浮于7.5ml(i)冷缓冲液A(300mM NaCl、50mM磷酸盐缓冲液、10mM咪唑、pH 8.0),用于可溶性蛋白质;或(ii)缓冲液B(10mM Tirs-HCl、100mM磷酸盐缓冲液,pH 8.8和任选地8M尿素),用于不溶性蛋白质。以可溶形式纯化的蛋白质包括287-His、Δ1、Δ2、Δ3和Δ4287-His、Δ4287MC58-His、287c-His和287cMC58-His。蛋白质287bMC58-His是不溶的并相应地纯化。用Branson Sonifier 450,在冰上以40W、30秒超声处理破坏细胞4次,并于4℃以13000xg离心30分钟。对于不溶蛋白质,将沉淀物重悬浮于2.0ml缓冲液C(6M盐酸胍、100mM磷酸盐缓冲液、10mM Tris-HCl、pH 7.5),并用Dounce匀浆器处理10次。以13000g离心匀浆30分钟并保留上清液。将可溶的和不溶制备物的上清液与150μl Ni2+-树脂(预先用缓冲液A或缓冲液B平衡)混合,并在室温温和振荡培育30分钟。树脂是按制造商说明制备的ChelatingSepharose Fast Flow(Pharmcia)。于4℃,以700g离心分批制备物5分钟,弃去上清液。用10ml缓冲液A或B洗涤树脂2次(分批)10分钟,重悬浮于1.0ml缓冲液A或B中,加到一次性柱上。用(i)缓冲液A(4℃)或(ii)缓冲液B(室温)持续洗涤树脂,直到流出物的OD280达到0.02-0.01。再用(i)冷缓冲液C(300mMNaCl、50mM磷酸盐缓冲液、20mM咪唑、pH 8.0)或(ii)缓冲液D(10mM Tris-HCl、100mM磷酸盐缓冲液、pH 6.3和任选地8M尿素)进一步洗涤树脂,直到流出物的OD280达到0.02-0.01。加入700μl(i)冷洗脱缓冲液A(300mM NaCl、50mM磷酸盐缓冲液、250mM咪唑、pH 8.0)或(ii)洗脱缓冲液B(10mM Tris-HCl、100mM磷酸盐缓冲液、pH 4.5和任选地8M尿素)洗脱His-融合蛋白,收集组分直到OD280显示获得了所有重组蛋白。用SDS-PAGE分析20μl量的各洗脱组分。用Bradford试验法计算蛋白质浓度。
变性的His-融合蛋白的复性
需要变性以稳定287bMC8,因此在免疫接种前需进行复性步骤。将甘油加到上述得到的变性组分中,使终浓度为10%v/v。用透析缓冲液I(10%v/v甘油,0.5M精氨酸、50mM磷酸盐缓冲液、5.0mM还原的谷胱甘肽、0.5mM氧化的谷胱甘肽、2.0M尿素,pH8.8)将蛋白质稀释至200μg/ml,用相同的缓冲液在4℃透析12-14小时。于4℃,用缓冲液II(10%v/v甘油,0.5M精氨酸、50mM磷酸盐缓冲液、5.0mM还原的谷胱甘肽、0.5mM氧化的谷胱甘肽、pH 8.8)再进行透析12-14小时。用以下公式计算蛋白质的浓度:
蛋白质(mg/ml)=(1.55×OD280)-(0.76×OD260)
免疫接种
在第0、21和35天,用抗原免疫接种Balb/C小鼠,在第49天分析血清。
血清分析-ELISA
将不包囊的MenB M7和包囊的菌株置于巧克力琼脂板上,在37℃、5%CO2中培育过夜。用无菌Dracon刷从琼脂板上收集细菌菌落,并接种到含有0.25%葡萄糖的Mueller-Hinton肉汤(Difco)中。每30分钟监测一次细菌的生长,随后测定OD620。让细菌生长直到OD达到0.4-0.5。将培养物以4000rpm离心10分钟。弃去上清液,用PBS洗涤细菌2次,重悬浮于含有0.025%甲醛的PBS中,并在37℃培育1小时,然后在4℃搅拌培育过夜。在96孔Greiner板的各孔中添加100μl细菌细胞,并在4℃培育过夜。然后用PBT洗涤缓冲液(0.1%Tween-20的PBS溶液)冲洗这些孔3次。在各孔中加入200μl饱和缓冲液(2.7%聚乙烯吡咯烷酮10的水溶液),将这些平板在37℃培育2小时。用PBT冲洗这些孔3次。在各孔中加入200μl稀释的血清(稀释缓冲液:1%BSA、0.1%Tween-20、0.1%NaN3的PBS溶液),将这些平板在37℃培育2小时。用PBT冲洗这些孔3次。在各孔中加入100μl HRP-缀合的兔抗-小鼠(Dako)血清(用稀释缓冲液以1∶2000稀释),将这些平板置于37℃培育90分钟。用PBT缓冲液洗涤这些孔3次。在各孔中加入100μl HRP的底物缓冲液(25ml柠檬酸缓冲液pH 5,10mg邻-苯二胺(phenildiamine)和10μl H2O2),并将平板在室温中静置20分钟。在各孔中加入100μl 12.5%H2SO4,随后测定OD490。计算ELISA滴定度,即高于预先免疫血清稀释度的、OD490值为0.4的血清稀释度。当OD490值为0.4的血清稀释度高于1∶400时,将ELISA视为阳性。
血清分析-FACS扫描细菌结合分析
将不包囊的MenB M7菌株置于巧克力琼脂板上,在37℃、5%CO2中培育过夜。用无菌Dracon刷从琼脂板上收集菌落,并接种到4支装有0.25%葡萄糖的8ml Mueller-Hinton肉汤(Difco)试管中。每30分钟监测细菌的生长,然后测定OD620。让细菌生长直到OD达到0.35-0.5。将培养物以4000rpm离心10分钟。弃去上清液,用封阻缓冲液(1%BSA的PBS溶液,0.4%NaN3)重悬浮沉淀物,并以4000rpm离心5分钟。将细胞重悬浮于封阻缓冲液,使OD620达到0.05。在96孔Costar板的各孔中添加100μl细菌细胞。在各孔中加入100μl稀释的(1∶100、1∶200、1∶400)血清(以封阻缓冲液配制),并将这些平板在4℃培育2小时。以4000rpm离心细胞5分钟,吸出上清液,在各孔中加入200μl/孔封阻缓冲液洗少涤细胞。在各孔中加入100μl R-Phicoerytrin缀合的F(ab)2山羊抗-小鼠(以1∶100稀释),将平板置于4℃培育1小时。以4000rpm离心5分钟以沉淀细胞,添加200μl/孔封阻缓冲洗涤细胞。吸出上清液,细胞重悬浮于200μl/孔PBS、0.25%甲醛。将样品转移到FACScan管中并读数。FACScan(Laser Power15mW)的条件设定为:FL2开;FSC-H阈值:92;FSC PMT电压:E01;SSCPMT:474;Amp.Gains 6.1;FL-2 PMT:586;补偿值:0。
血清分析-杀菌试验
于37℃,在5%CO2、在巧克力琼脂板(以冷冻原液起始)上培育脑膜炎奈瑟球菌菌株2996过夜。收集菌落,并将其接种到7ml含有0.25%葡萄糖的Mueller-Hinton肉汤中,使OD620达到0.05-0.08。在37℃振荡培育约1.5小时,直到OD620达到0.23-0.24。用50mM磷酸盐缓冲液(pH7.2,含10mM MgCl2、10mMCaCl2和0.5%(w/v)BSA(分析缓冲液))以105CFU/ml工作稀释度稀释细菌。最终反应混合物的总体积为50μl,其中25μl连续2倍稀释的测试血清,12.5μl工作稀释度的细菌,12.5μl幼兔补体(终浓度25%)。
对照包括:用补体血清培育的细菌、用细菌培育并在56℃加热30分钟补充灭活的免疫血清。在加入幼兔补体后,用斜置方法立即将10μl对照置于Mueller-Hinton琼脂板上(0时间)。37℃旋转培育96孔板1小时。将每份样品的7μl涂布在Mueller-Hinton琼脂板上作为斑点,而用斜置方法将10μl对照涂布在Mueller-Hinton琼脂板上(1时间)。37℃培育琼脂板18小时,计算相应于0时间和1时间的菌落数量。
血清分析-Western印迹法
将纯化的蛋白质(500ng/泳道)、外膜小泡(5μg)和MenB菌株2996衍生的全细胞提取物(25μg)加到12%SDS-聚丙烯酰胺凝胶上,并转移到硝基纤维素膜上。用转化缓冲液(0.3%Tris碱,1.14%甘氨酸、20%(v/v)甲醇)在4℃、150mA进行转化2小时。于4℃,在饱和缓冲液(10%脱脂乳、0.1%Triton X100的PBS溶液)中培育过夜使膜饱和。用冲洗缓冲液(3%脱脂乳、0.1%Triton X100的PBS溶液)冲洗膜2次,并在37℃与用洗涤缓冲液以1∶200稀释的小鼠血清一起培育2小时。冲洗膜2次,与1∶2000稀释的辣根过氧化物酶标记的抗-小鼠Ig一起培育90分钟。用0.1%Triton X100的PBS溶液冲洗膜2次,用Opti-4CN底物试剂盒(Bio-Rad)显色。加入水终止反应。
如下制备OMV:于37℃、5%CO2,在5个GC板上让脑膜炎奈瑟球菌2996生长过夜,用接种环收获,并重悬浮于10ml 20mM Tris-HCl pH 7.5、2mM EDTA中。在56℃热灭活45分钟,在冰上超声处理破碎细胞5分钟(50%负载循环(dutycycle)、50%输出,Branson超声仪3mm微型针头)。以5000g离心10分钟除去未被破坏的细胞,回收含有完整细胞包膜组分的上清液,于4℃以50000g进一步离心过夜。将含有膜的沉淀物重悬浮于2%二烷基肌氨酸钠、20mM Tris-HClpH 7.5、2mM EDTA中,在室温培育20分钟以溶解内膜。以10000g离心上清液10分钟除去聚集体,以50000g进一步离心上清液3小时。用PBS冲洗含有外膜的沉淀物,并重悬浮于相同的缓冲液中。用BSA作为标准,由D.C.Bio-Rad蛋白分析(改进的Lowry方法)测定蛋白质的浓度。
如下制备全细胞提取物:使脑膜炎奈瑟球菌在GC板上培育过夜,用接种环收获,并重悬浮于1ml 20mM Tris-HCl中。在56℃热灭活30分钟。
序列表
<110>启龙股份公司(Chiron SpA)
<120>奈瑟球菌蛋白质的杂交表达
<130>024941F2
<150>GB 0004695.3
<151>2000-02-28
<150>GB 0027675.8
<151>2000-11-13
<160>121
<170>SeqWin99,version 1.02
<210>1
<211>608
<212>PRT
<213>脑膜炎奈瑟氏菌(Neisseria meningitidis)
<400>1
Leu Gly Ile Ser Arg Lys Ile Ser Leu Ile Leu Ser Ile Leu Ala Val
1 5 10 15
Cys Leu Pro Met His Ala His Ala Ser Asp Leu Ala Asn Asp Ser Phe
20 25 30
Ile Arg Gln Val Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr
35 40 45
His Leu Phe Gly Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile
50 55 60
Gly Leu Gly Lys Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln
65 70 75 80
Gln Ala Ala Ile Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp
85 90 95
His Gly His Glu Val His Ser Pro Phe Asp Asn His Ala Ser His Ser
100 105 110
Asp Ser Asp Glu Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg
115 120 125
Ile His Trp Asp Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly
130 135 140
Pro Gln Gly Gly Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr
145 150 155 160
Ser Tyr Asp Ile Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr
165 170 175
Asp Asn Arg Ser Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala
180 185 190
Gly Ser Met Leu Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr
195 200 205
Arg Tyr Ser Pro Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe
210 215 220
Asn Gly Thr Ala Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu
225 230 235 240
Ile Val Gly Ala Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn
245 250 255
Ile Ala Val Met His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met
260 265 270
Ala Arg Ile Asn Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala
275 280 285
Ala Ala Ala Ile Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln
290 295 300
Gly Ile Glu Ala Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys
305 310 315 320
Gly Ile Gly Ala Val Arg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala
325 330 335
His Pro Ile Lys Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly
340 345 350
Lys Ser Ala Val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr
355 360 365
Pro Ser Pro Tyr His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg
370 375 380
Tyr Gly Lys Glu Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly
385 390 395 400
Lys Asn Val Lys Leu Ala Asp Gln Arg His Pro Lys Thr Gly Val Pro
405 410 415
Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp
420 425 430
Thr Lys Leu Asp Ile Gln Glu Leu Ser Gly Gly Gly Ile Pro Lys Ala
435 440 445
Lys Pro Val Ser Asp Ala Lys Pro Arg Trp Glu Val Asp Arg Lys Leu
450 455 460
Asn Lys Leu Thr Thr Arg Glu Gln Val Glu Lys Asn Val Gln Glu Ile
465 470 475 480
Arg Asn Gly Asn Lys Asn Ser Asn Phe Ser Gln His Ala Gln Leu Glu
485 490 495
Arg Glu Ile Asn Lys Leu Lys Ser Ala Asp Glu Ile Asn Phe Ala Asp
500 505 510
Gly Met Gly Lys Phe Thr Asp Ser Met Asn Asp Lys Ala Phe Ser Arg
515 520 525
Leu Val Lys Ser Val Lys Glu Asn Gly Phe Thr Asn Pro Val Val Glu
530 535 540
Tyr Val Glu Ile Asn Gly Lys Ala Tyr Ile Val Arg Gly Asn Asn Arg
545 550 555 560
Val Phe Ala Ala Glu Tyr Leu Gly Arg Ile His Glu Leu Lys Phe Lys
565 570 575
Lys Val Asp Phe Pro Val Pro Asn Thr Ser Trp Lys Asn Pro Thr Asp
580 585 590
Val Leu Asn Glu Ser Gly Asn Val Lys Arg Pro Arg Tyr Arg Ser Lys
595 600 605
<210>2
<211>464
<212>PRT
<213>人工序列
<220>
<223>ΔG287
<400>2
Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala Ala Pro
1 5 10 15
Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro Gln Ala
20 25 30
Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Ser Gln Asp Met
35 40 45
Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Val Thr Ala
50 55 60
Asp Asn Pro Lys Asn Glu Asp Glu Val Ala Gln Asn Asp Met Pro Gln
65 70 75 80
Asn Ala Ala Gly Thr Asp Ser Ser Thr Pro Asn His Thr Pro Asp Pro
85 90 95
Asn Met Leu Ala Gly Asn Met Glu Asn Gln Ala Thr Asp Ala Gly Glu
100 105 110
Ser Ser Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Ala Ala Asp Gly
115 120 125
Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Gln Asn Ala Gly Asn Thr
130 135 140
Ala Ala Gln Gly Ala Asn Gln Ala Gly Asn Asn Gln Ala Ala Gly Ser
145 150 155 160
Ser Asp Pro Ile Pro Ala Ser Asn Pro Ala Pro Ala Asn Gly Gly Ser
165 170 175
Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile Asp Gly Pro
180 185 190
Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys Ser Gly
195 200 205
Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe Glu Lys
210 215 220
Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly Lys Asn
225 230 235 240
Asp Lys Phe Val Gly Leu Val Ala Asp Ser Val Gln Met Lys Gly Ile
245 250 255
Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys Pro Thr Ser Phe Ala Arg
260 265 270
Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu Pro Ala Glu Met Pro
275 280 285
Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile Val Asp Gly Glu Ala
290 295 300
Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe Ala Pro Glu Gly Asn
305 310 315 320
Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu Pro Gly Gly Ser Tyr
325 330 335
Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly Glu Met Leu Ala Gly
340 345 350
Ala Ala Val Tyr Asn Gly Glu Val Leu His Phe His Thr Glu Asn Gly
355 360 365
Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala Lys Val Asp Phe Gly
370 375 380
Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly Asp Asp Leu His Met
385 390 395 400
Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly Asn Gly Phe Lys Gly
405 410 415
Thr Trp Thr Glu Asn Gly Ser Gly Asp Val Ser Gly Lys Phe Tyr Gly
420 425 430
Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser Tyr Arg Pro Thr Asp
435 440 445
Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly Lys Lys Glu Gln Asp
450 455 460
<210>3
<211>2505
<212>DNA
<213>人工序列
<220>
<223>ΔG287-919
<400>3
atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60
gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacagggc 120
gcgccatcca cacaaggcag ccaagatatg gcggcagttt cggcagaaaa tacaggcaat 180
ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240
ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300
gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360
gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420
aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480
tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540
aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600
atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 660
aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 720
gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780
aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840
gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900
gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960
aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020
atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080
gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140
ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200
aaaaaagagc aggatggatc cggaggagga ggatgccaaa gcaagagcat ccaaaccttt 1260
ccgcaacccg acacatccgt catcaacggc ccggaccggc cggtcggcat ccccgacccc 1320
gccggaacga cggtcggcgg cggcggggcc gtctataccg ttgtaccgca cctgtccctg 1380
ccccactggg cggcgcagga tttcgccaaa agcctgcaat ccttccgcct cggctgcgcc 1440
aatttgaaaa accgccaagg ctggcaggat gtgtgcgccc aagcctttca aacccccgtc 1500
cattcctttc aggcaaaaca gttttttgaa cgctatttca cgccgtggca ggttgcaggc 1560
aacggaagcc ttgccggtac ggttaccggc tattacgagc cggtgctgaa gggcgacgac 1620
aggcggacgg cacaagcccg cttcccgatt tacggtattc ccgacgattt tatctccgtc 1680
cccctgcctg ccggtttgcg gagcggaaaa gcccttgtcc gcatcaggca gacgggaaaa 1740
aacagcggca caatcgacaa taccggcggc acacataccg ccgacctctc ccgattcccc 1800
atcaccgcgc gcacaacggc aatcaaaggc aggtttgaag gaagccgctt cctcccctac 1860
cacacgcgca accaaatcaa cggcggcgcg cttgacggca aagccccgat actcggttac 1920
gccgaagacc ccgtcgaact tttttttatg cacatccaag gctcgggccg tctgaaaacc 1980
ccgtccggca aatacatccg catcggctat gccgacaaaa acgaacatcc ctacgtttcc 2040
atcggacgct atatggcgga caaaggctac ctcaagctcg ggcagacctc gatgcagggc 2100
atcaaagcct atatgcggca aaatccgcaa cgcctcgccg aagttttggg tcaaaacccc 2160
agctatatct ttttccgcga gcttgccgga agcagcaatg acggtcccgt cggcgcactg 2220
ggcacgccgt tgatggggga atatgccggc gcagtcgacc ggcactacat taccttgggc 2280
gcgcccttat ttgtcgccac cgcccatccg gttacccgca aagccctcaa ccgcctgatt 2340
atggcgcagg ataccggcag cgcgattaaa ggcgcggtgc gcgtggatta tttttgggga 2400
tacggcgacg aagccggcga acttgccggc aaacagaaaa ccacgggtta cgtctggcag 2460
ctcctaccca acggtatgaa gcccgaatac cgcccgtaac tcgag 2505
<210>4
<211>832
<212>PRT
<213>人工序列
<220>
<223>ΔG287-919
<400>4
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln
35 40 45
Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro
85 90 95
Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn
100 105 110
Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile
115 120 125
Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser
130 135 140
Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu
145 150 155 160
Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp
165 170 175
Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala
180 185 190
Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser
195 200 205
Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu
210 215 220
Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile
225 230 235 240
Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe
245 250 255
Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu
260 265 270
Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly
275 280 285
Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe
290 295 300
His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala
305 310 315 320
Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly
325 330 335
Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly
340 345 350
Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser
355 360 365
Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser
370 375 380
Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly
385 390 395 400
Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Cys Gln Ser Lys Ser
405 410 415
Ile Gln Thr Phe Pro Gln Pro Asp Thr Ser Val Ile Asn Gly Pro Asp
420 425 430
Arg Pro Val Gly Ile Pro Asp Pro Ala Gly Thr Thr Val Gly Gly Gly
435 440 445
Gly Ala Val Tyr Thr Val Val Pro His Leu Ser Leu Pro His Trp Ala
450 455 460
Ala Gln Asp Phe Ala Lys Ser Leu Gln Ser Phe Arg Leu Gly Cys Ala
465 470 475 480
Asn Leu Lys Asn Arg Gln Gly Trp Gln Asp Val Cys Ala Gln Ala Phe
485 490 495
Gln Thr Pro Val His Ser Phe Gln Ala Lys Gln Phe Phe Glu Arg Tyr
500 505 510
Phe Thr Pro Trp Gln Val Ala Gly Asn Gly Ser Leu Ala Gly Thr Val
515 520 525
Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr Ala
530 535 540
Gln Ala Arg Phe Pro Ile Tyr Gly Ile Pro Asp Asp Phe Ile Ser Val
545 550 555 560
Pro Leu Pro Ala Gly Leu Arg Ser Gly Lys Ala Leu Val Arg Ile Arg
565 570 575
Gln Thr Gly Lys Asn Ser Gly Thr Ile Asp Asn Thr Gly Gly Thr His
580 585 590
Thr Ala Asp Leu Ser Arg Phe Pro Ile Thr Ala Arg Thr Thr Ala Ile
595 600 605
Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr His Thr Arg Asn
610 615 620
Gln Ile Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro Ile Leu Gly Tyr
625 630 635 640
Ala Glu Asp Pro Val Glu Leu Phe Phe Met His Ile Gln Gly Ser Gly
645 650 655
Arg Leu Lys Thr Pro Ser Gly Lys Tyr Ile Arg Ile Gly Tyr Ala Asp
660 665 670
Lys Asn Glu His Pro Tyr Val Ser Ile Gly Arg Tyr Met Ala Asp Lys
675 680 685
Gly Tyr Leu Lys Leu Gly Gln Thr Ser Met Gln Gly Ile Lys Ala Tyr
690 695 700
Met Arg Gln Asn Pro Gln Arg Leu Ala Glu Val Leu Gly Gln Asn Pro
705 710 715 720
Ser Tyr Ile Phe Phe Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly Pro
725 730 735
Val Gly Ala Leu Gly Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala Val
740 745 750
Asp Arg His Tyr Ile Thr Leu Gly Ala Pro Leu Phe Val Ala Thr Ala
755 760 765
His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu Ile Met Ala Gln Asp
770 775 780
Thr Gly Ser Ala Ile Lys Gly Ala Val Arg Val Asp Tyr Phe Trp Gly
785 790 795 800
Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gln Lys Thr Thr Gly
805 810 815
Tyr Val Trp Gln Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg Pro
820 825 830
<210>5
<211>1746
<212>DNA
<213>人工序列
<220>
<223>ΔG287-953
<400>5
atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60
gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacagggc 120
gcgccatcca cacaaggcag ccaagaratg gcggcagttt cggcagaaaa tacaggcaat 180
ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240
ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300
gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360
gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420
aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480
tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540
aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600
atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 660
aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 720
gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780
aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840
gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900
gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960
aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020
atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080
gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140
ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200
aaaaaagagc aggatggatc cggaggagga ggagccacct acaaagtgga cgaatatcac 1260
gccaacgccc gtttcgccat cgaccatttc aacaccagca ccaacgtcgg cggtttttac 1320
ggtctgaccg gttccgtcga gttcgaccaa gcaaaacgcg acggtaaaat cgacatcacc 1380
atccccgttg ccaacctgca aagcggttcg caacacttta ccgaccacct gaaatcagcc 1440
gacatcttcg atgccgccca atatccggac atccgctttg tttccaccaa attcaacttc 1500
aacggcaaaa aactggtttc cgttgacggc aacctgacca tgcacggcaa aaccgccccc 1560
gtcaaactca aagccgaaaa attcaactgc taccaaagcc cgatggcgaa aaccgaagtt 1620
tgcggcggcg acttcagcac caccatcgac cgcaccaaat ggggcgtgga ctacctcgtt 1680
aacgttggta tgaccaaaag cgtccgcatc gacatccaaa tcgaggcagc caaacaataa 1740
ctcgag 1746
<210>6
<211>579
<212>PRT
<213>人工序列
<220>
<223>ΔG287-953
<400> 6
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln
35 40 45
Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro
85 90 95
Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn
100 105 110
Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile
115 120 125
Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser
130 135 140
Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu
145 150 155 160
Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp
165 170 175
Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala
180 185 190
Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser
195 200 205
Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu
210 215 220
Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile
225 230 235 240
Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe
245 250 255
Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu
260 265 270
Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly
275 280 285
Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe
290 295 300
His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala
305 310 315 320
Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly
325 330 335
Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly
340 345 350
Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser
355 360 365
Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser
370 375 380
Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly
385 390 395 400
Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Tyr Lys Val
405 410 415
Asp Glu Tyr His Ala Asn Ala Arg Phe Ala Ile Asp His Phe Asn Thr
420 425 430
Ser Thr Asn Val Gly Gly Phe Tyr Gly Leu Thr Gly Ser Val Glu Phe
435 440 445
Asp Gln Ala Lys Arg Asp Gly Lys Ile Asp Ile Thr Ile Pro Val Ala
450 455 460
Asn Leu Gln Ser Gly Ser Gln His Phe Thr Asp His Leu Lys Ser Ala
465 470 475 480
Asp Ile Phe Asp Ala Ala Gln Tyr Pro Asp Ile Arg Phe Val Ser Thr
485 490 495
Lys Phe Asn Phe Asn Gly Lys Lys Leu Val Ser Val Asp Gly Asn Leu
500 505 510
Thr Met His Gly Lys Thr Ala Pro Val Lys Leu Lys Ala Glu Lys Phe
515 520 525
Asn Cys Tyr Gln Ser Pro Met Ala Lys Thr Glu Val Cys Gly Gly Asp
530 535 540
Phe Ser Thr Thr Ile Asp Arg Thr Lys Trp Gly Val Asp Tyr Leu Val
545 550 555 560
Asn Val Gly Met Thr Lys Ser Val Arg Ile Asp Ile Gln Ile Glu Ala
565 570 575
Ala Lys Gln
<210>7
<211>2388
<212>DNA
<213>人工序列
<220>
<223>ΔG287-961
<400>7
atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60
gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacaggge 120
gcgccatcca cacaaggcag ccaagatatg gcggcagttt cggcagaaaa tacaggcaat 180
ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240
ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300
gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360
gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420
aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480
tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540
aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600
atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 660
aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 720
gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780
aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840
gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900
gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960
aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020
atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080
gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140
ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200
aaaaaagagc aggatggatc cggaggagga ggagccacaa acgacgacga tgttaaaaaa 1260
gctgccactg tggccattgc tgctgcctac aacaatggcc aagaaatcaa cggtttcaaa 1320
gctggagaga ccatctacga cattgatgaa gacggcacaa ttaccaaaaa agacgcaact 1380
gcagccgatg ttgaagccga cgactttaaa ggtctgggtc tgaaaaaagt cgtgactaae 1440
ctgaccaaaa ccgtcaatga aaacaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1500
tctgaaatag aaaagttaac aaccaagtta gcagacactg atgccgcttt agcagatact 1560
gatgccgctc tggatgcaac caccaacgcc ttgaataaat tgggagaaaa tataacgaca 1620
tttgctgaag agactaagac aaatatcgta aaaattgatg aaaaattaga agccgtggct 1680
gataccgtcg acaagcatgc cgaagcattc aacgatatcg ccgattcatt ggatgaaacc 1740
aacactaagg cagacgaagc cgtcaaaacc gccaatgaag ccaaacagac ggccgaagaa 1800
accaaacaaa acgtcgatgc caaagtaaaa gctgcagaaa ctgcagcagg caaagccgaa 1860
gctgccgctg gcacagctaa tactgcagcc gacaaggccg aagctgtcgc tgcaaaagtt 1920
accgacatca aagctgatat cgctacgaac aaagataata ttgctaaaaa agcaaacagt 1980
gccgacgtgt acaccagaga agagtctgac agcaaatttg tcagaattga tggtctgaac 2040
gctactaccg aaaaattgga cacacgcttg gcttctgctg aaaaatccat tgccgatcac 2100
gatactcgcc tgaacggttt ggataaaaca gtgtcagacc tgcgcaaaga aacccgccaa 2160
ggccttgcag aacaagccgc gctctccggt ctgttccaac cttacaacgt gggtcggttc 2220
aatgtaacgg ctgcagtcgg cggctacaaa tccgaatcgg cagtcgccat cggtaccggc 2280
ttccgcttta ccgaaaactt tgccgccaaa gcaggcgtgg cagtcggcac ttcgtccggt 2340
tcttccgcag cctaccatgt cggcgtcaat tacgagtggt aactcgag 2388
<210>8
<211>793
<212>PRT
<213>人工序列
<220>
<223>ΔG287-961
<400> 8
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln
35 40 45
Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro
85 90 95
Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn
100 105 110
Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile
115 120 125
Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser
130 135 140
Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu
145 150 155 160
Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp
165 170 175
Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala
180 185 190
Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser
195 200 205
Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu
210 215 220
Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile
225 230 235 240
Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe
245 250 255
Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu
260 265 270
Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly
275 280 285
Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe
290 295 300
His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala
305 310 315 320
Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly
325 330 335
Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly
340 345 350
Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser
355 360 365
Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser
370 375 380
Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly
385 390 395 400
Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp Asp
405 410 415
Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn
420 425 430
Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile
435 440 445
Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val
450 455 4 60
Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn
465 470 475 480
Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val
485 490 495
Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp
500 505 510
Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr
515 520 525
Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu
530 535 540
Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala
545 550 555 560
Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn AspIle Ala Asp Ser
565 570 575
Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn
580 585 590
Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys
595 600 605
Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly
610 615 620
Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val
625 630 635 640
Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys
645 650 655
Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys
660 665 670
Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr
675 680 685
Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu
690 695 700
Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln
705 710 715 720
Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn
725 730 735
Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly Gly Tyr Lys Ser Glu
740 745 750
Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe Ala
755 760 765
Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala Ala
770 775 780
Tyr His Val Gly Val Asn Tyr Glu Trp
785 790
<210>9
<211>2700
<212>DNA
<213>人工序列
<220>
<223>ΔG287NZ-919
<400>9
atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc cectgttgtt 60
tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120
gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180
ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240
ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300
ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360
caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420
gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480
ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540
ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600
acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660
aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720
aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780
ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840
cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900
gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 960
ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020
tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080
gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140
ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200
ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260
aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320
ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380
ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggatg ccaaagcaag 1440
agcatccaaa cctttccgca acccgacaca tccgtcatca acggcccgga ccggccggtc 1500
ggcatccccg accccgccgg aacgacggtc ggcggcggcg gggccgtcta taccgttgta 1560
ccgcacctgt ccctgcccca ctgggcggcg caggatttcg ccaaaagcct gcaatccttc 1620
cgcctcggct gcgccaattt gaaaaaccgc caaggctggc aggatgtgtg cgcccaagcc 1680
tttcaaaccc ccgtccattc ctttcaggca aaacagtttt ttgaacgcta tttcacgccg 1740
tggcaggttg caggcaacgg aagccttgcc ggtacggtta ccggctatta cgagccggtg 1800
ctgaagggcg acgacaggcg gacggcacaa gcccgcttcc cgatttacgg tattcccgac 1860
gattttatct ccgtccccct gcctgccggt ttgcggagcg gaaaagccct tgtccgcatc 1920
aggcagacgg gaaaaaacag cggcacaatc gacaataccg gcggcacaca taccgccgac 1980
ctctcccgat tccccatcac cgcgcgcaca acggcaatca aaggcaggtt tgaaggaagc 2040
cgcttcctcc cctaccacac gcgcaaccaa atcaacggcg gcgcgcttga cggcaaagcc 2100
ccgatactcg gttacgccga agaccccgtc gaactttttt ttatgcacat ccaaggctcg 2160
ggccgtctga aaaccccgtc cggcaaatac atccgcatcg gctatgccga caaaaacgaa 2220
catccctacg tttccatcgg acgctatatg gcggacaaag gctacctcaa gctcgggcag 2280
acctcgatgc agggcatcaa agcctatatg cggcaaaatc cgcaacgcct cgccgaagtt 2340
ttgggtcaaa accccagcta tatctttttc cgcgagcttg ccggaagcag caatgacggt 2400
cccgtcggcg cactgggcac gccgttgatg ggggaatatg ccggcgcagt cgaccggcac 2460
tacattacct tgggcgcgcc cttatttgtc gccaccgccc atccggttac ccgcaaagcc 2520
ctcaaccgcc tgattatggc gcaggatacc ggcagcgcga ttaaaggcgc ggtgcgcgtg 2580
gattattttt ggggatacgg cgacgaagcc ggcgaacttg ccggcaaaca gaaaaccacg 2640
ggttacgtct ggcagctcct acccaacggt atgaagcccg aataccgccc gtaaaagctt 2700
<210>10
<211>897
<212>PRT
<213>人工序列
<220>
<223>ΔG287NZ-919
<400>10
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln
35 40 45
Asp Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro
85 90 95
Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala
100 105 110
Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala
115 120 125
Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly
130 135 140
Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala
145 150 155 160
Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser
165 170 175
Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp
180 185 190
Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys
195 200 205
Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe
210 215 220
Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly
225 230 235 240
Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser
245 250 255
Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys
260 265 270
Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser
275 280 285
Leu Pro Ala Glu Met Pro Leu Ile Pro Val Ash Gln Ala Asp Thr Leu
290 295 300
Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile
305 310 315 320
Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys
325 330 335
Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys
340 345 350
Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His
355 360 365
Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala
370 375 380
Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser
385 390 395 400
Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp
405 410 415
Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val
420 425 430
Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr
435 440 445
Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala
450 455 460
Gly Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Cys Gln Ser Lys
465 470 475 480
Ser Ile Gln Thr Phe Pro Gln Pro Asp Thr Ser Val Ile Asn Gly Pro
485 490 495
Asp Arg Pro Val Gly Ile Pro Asp Pro Ala Gly Thr Thr Val Gly Gly
500 505 510
Gly Gly Ala Val Tyr Thr Val Val Pro His Leu Ser Leu Pro His Trp
515 520 525
Ala Ala Gln Asp Phe Ala Lys Ser Leu Gln Ser Phe Arg Leu Gly Cys
530 535 540
Ala Asn Leu Lys Asn Arg Gln Gly Trp Gln Asp Val Cys Ala Gln Ala
545 550 555 560
Phe Gln Thr Pro Val His Ser Phe Gln Ala Lys Gln Phe Phe Glu Arg
565 570 575
Tyr Phe Thr Pro Trp Gln Val Ala Gly Asn Gly Ser Leu Ala Gly Thr
580 585 590
Val Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr
595 600 605
Ala Gln Ala Arg Phe Pro Ile Tyr Gly Ile Pro Asp Asp Phe Ile Ser
610 615 620
Val Pro Leu Pro Ala Gly Leu Arg Ser Gly Lys Ala Leu Val Arg Ile
625 630 635 640
Arg Gln Thr Gly Lys Asn Ser Gly Thr Ile Asp Asn Thr Gly Gly Thr
645 650 655
His Thr Ala Asp Leu Ser Arg Phe Pro Ile Thr Ala Arg Thr Thr Ala
660 665 670
Ile Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr His Thr Arg
675 680 685
Asn Gln Ile Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro Ile Leu Gly
690 695 700
Tyr Ala Glu Asp Pro Val Glu Leu Phe Phe Met His Ile Gln Gly Ser
705 710 715 720
Gly Arg Leu Lys Thr Pro Ser Gly Lys Tyr Ile Arg Ile Gly Tyr Ala
725 730 735
Asp Lys Asn Glu His Pro Tyr Val Ser Ile Gly Arg Tyr Met Ala Asp
740 745 750
Lys Gly Tyr Leu Lys Leu Gly Gln Thr Ser Met Gln Gly Ile Lys Ala
755 760 765
Tyr Met Arg Gln Asn Pro Gln Arg Leu Ala Glu Val Leu Gly Gln Asn
770 775 780
Pro Ser Tyr Ile Phe Phe Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly
785 790 795 800
Pro Val Gly Ala Leu Gly Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala
805 810 815
Val Asp Arg His Tyr Ile Thr Leu Gly Ala Pro Leu Phe Val Ala Thr
820 825 830
Ala His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu Ile Met Ala Gln
835 840 845
Asp Thr Gly Ser Ala Ile Lys Gly Ala Val Arg Val Asp Tyr Phe Trp
850 855 860
Gly Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gln Lys Thr Thr
865 870 875 880
Gly Tyr Val Trp Gln Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg
885 890 895
Pro
<210>11
<211>1941
<212>DNA
<213>人工序列
<220>
<223>ΔG287NZ-953
<400>11
atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 60
tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120
gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180
ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240
ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300
ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360
caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420
gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480
ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540
ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600
acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660
aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720
aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780
ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840
cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900
gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 960
ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020
tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080
gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140
ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200
ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260
aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320
ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380
ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggagc cacctacaaa 1440
gtggacgaat atcacgccaa cgcccgtttc gccatcgacc atttcaacac cagcaccaac 1500
gtcggcggtt tttacggtct gaccggttcc gtcgagttcg accaagcaaa acgcgacggt 1560
aaaatcgaca tcaccatccc cgttgccaac ctgcaaagcg gttcgcaaca ctttaccgac 1620
cacctgaaat cagccgacat cttcgatgcc gcccaatatc cggacatccg ctttgtttcc 1680
accaaattca acttcaacgg caaaaaactg gtttccgttg acggcaacct gaccatgcac 1740
ggcaaaaccg cccccgtcaa actcaaagcc gaaaaattca actgctacca aagcccgatg 1800
gcgaaaaccg aagtttgcgg cggcgacttc agcaccacca tcgaccgcac caaatggggc 1860
gtggactacc tcgttaacgt tggtatgacc aaaagcgtcc gcatcgacat ccaaatcgag 1920
gcagccaaac aataaaagct t 1941
<210>12
<211>644
<212>PRT
<213>人工序列
<220>
<223>ΔG287NZ-953
<400>12
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln
35 40 45
Asp Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro
85 90 95
Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala
100 105 110
Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala
115 120 125
Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly
130 135 140
Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala
145 150 155 160
Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser
165 170 175
Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp
180 185 190
Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys
195 200 205
Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe
210 215 220
Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly
225 230 235 240
Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser
245 250 255
Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys
260 265 270
Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser
275 280 285
Leu Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu
290 295 300
Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile
305 310 315 320
Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys
325 330 335
Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys
340 345 350
Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His
355 360 365
Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala
370 375 380
Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser
385 390 395 400
Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp
405 410 415
Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val
420 425 430
Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr
435 440 445
Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala
450 455 460
Gly Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Tyr Lys
465 470 475 480
Val Asp Glu Tyr His Ala Asn Ala Arg Phe Ala Ile Asp His Phe Asn
485 490 495
Thr Ser Thr Asn Val Gly Gly Phe Tyr Gly Leu Thr Gly Ser Val Glu
500 505 510
Phe Asp Gln Ala Lys Arg Asp Gly Lys Ile Asp Ile Thr Ile Pro Val
515 520 525
Ala Asn Leu Gln Ser Gly Ser Gln His Phe Thr Asp His Leu Lys Ser
530 535 540
Ala Asp Ile Phe Asp Ala Ala Gln Tyr Pro Asp Ile Arg Phe Val Ser
545 550 555 560
Thr Lys Phe Asn Phe Asn Gly Lys Lys Leu Val Ser Val Asp Gly Asn
565 570 575
Leu Thr Met His Gly Lys Thr Ala Pro Val Lys Leu Lys Ala Glu Lys
580 585 590
Phe Asn Cys Tyr Gln Ser Pro Met Ala Lys Thr Glu Val Cys Gly Gly
595 600 605
Asp Phe Ser Thr Thr Ile Asp Arg Thr Lys Trp Gly Val Asp Tyr Leu
610 615 620
Val Asn Val Gly Met Thr Lys Ser Val Arg Ile Asp Ile Gln Ile Glu
625 630 635 640
Ala Ala Lys Gln
<210>13
<211>2583
<212>DNA
<213>人工序列
<220>
<223>ΔG287NZ-961
<400>13
atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 60
tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120
gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180
ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240
ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300
ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360
caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420
gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480
ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540
ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600
acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660
aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720
aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780
ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840
cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900
gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 960
ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020
tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080
gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140
ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200
ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260
aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320
ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380
ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggagc cacaaacgac 1440
gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1500
atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1560
aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1620
aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1680
gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1740
gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1800
gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1860
ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1920
tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1980
cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 2040
gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 2100
gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 2160
aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2220
attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2280
tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2340
aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2400
aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2460
gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2520
ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggtaaaag 2580
ctt 2583
<210>14
<211>858
<212>PRT
<213>人工序列
<220>
<223>ΔG287NZ-961
<400>14
Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala
1 5 10 15
Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro
20 25 30
Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln
35 40 45
Asp Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala
50 55 60
Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met
65 70 75 80
Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro
85 90 95
Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala
100 105 110
Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala
115 120 125
Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly
130 135 140
Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala
145 150 155 160
Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser
165 170 175
Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp
180 185 190
Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys
195 200 205
Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe
210 215 220
Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly
225 230 235 240
Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser
245 250 255
Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys
260 265 270
Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser
275 280 285
Leu Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu
290 295 300
Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile
305 310 315 320
Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys
325 330 335
Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys
340 345 350
Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His
355 360 365
Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala
370 375 380
Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser
385 390 395 400
Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp
405 410 415
Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val
420 425 430
Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr
435 440 445
Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala
450 455 460
Gly Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp
465 470 475 480
Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn
485 490 495
Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp
500 505 510
Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp
515 520 525
Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr
530 535 540
Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys
545 550 555 560
Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala
565 570 575
Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr
580 585 590
Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu
595 600 605
Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val
610 615 620
Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp
625 630 635 640
Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala
645 650 655
Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala
660 665 670
Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala
675 680 685
Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys
690 695 700
Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala
705 710 715 720
Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser
725 730 735
Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp
740 745 750
Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg
755 760 765
Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg
770 775 780
Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr
785 790 795 800
Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly Gly Tyr Lys Ser
805 810 815
Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe
820 825 830
Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala
835 840 845
Ala Tyr His Val Gly Val Asn Tyr Glu Trp
850 855
<210>15
<211>1082
<212>PRT
<213>人工序列
<220>
<223>983
<400>15
Met Arg Thr Thr Pro Thr Phe Pro Thr Lys Thr Phe Lys Pro Thr Ala
1 5 10 15
Met Ala Leu Ala Val Ala Thr Thr Leu Ser Ala Cys Leu Gly Gly Gly
20 25 30
Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile
35 40 45
Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr
50 55 60
Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala
65 70 75 80
Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala
85 90 95
Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala
100 105 110
Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr
115 120 125
Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly
130 135 140
Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn
145 150 155 160
Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu
165 170 175
Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val
180 185 190
Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile
195 200 205
Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp
210 215 220
Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met
225 230 235 240
Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg
245 250 255
Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn
260 265 270
Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile
275 280 285
Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly
290 295 300
Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr
305 310 315 320
Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe
325 330 335
Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu
340 345 350
Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly
355 360 365
Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro
370 375 380
Gly Thr Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala
385 390 395 400
Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg
405 410 415
Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val
420 425 430
Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn
435 440 445
Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala
450 455 460
Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys
465 470 475 480
Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp
485 490 495
Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser
500 505 510
Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His
515 520 525
Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu
530 535 540
Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly
545 550 555 560
Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp
565 570 575
Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr
580 585 590
Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr
595 600 605
Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly
610 615 620
Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn
625 630 635 640
Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln
645 650 655
Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala
660 665 670
Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu
675 680 685
Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala
690 695 700
Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly
705 710 715 720
Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser
725 730 735
Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met
740 745 750
Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val
755 760 765
Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala
770 775 780
Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly
785 790 795 800
Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly
805 810 815
Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln
820 825 830
Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile
835 840 845
Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met
850 855 860
Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser
865 870 875 880
Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr
885 890 895
Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg
900 905 910
Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu
915 920 925
Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr
930 935 940
Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln
945 950 955 960
Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser
965 970 975
Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln
980 985 990
Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg
995 1000 1005
Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala
1010 1015 1020
Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg
1025 1030 1035 1040
Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn
1045 1050 1055
Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His
1060 1065 1070
Ser Gly Arg Val Gly Val Gly Tyr Arg Phe
1075 1080
<210>16
<211>1047
<212>PRT
<213>人工序列
<220>
<223>ΔG983
<400>16
Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser Asn
1 5 10 15
Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly Ile
20 25 30
Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg Asp
35 40 45
Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro Pro
50 55 60
Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys Asn
65 70 75 80
Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg Gly
85 90 95
Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile Ser
100 105 110
Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn Tyr
115 120 125
Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly Gly
130 135 140
Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu Thr
145 150 155 160
Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His Ile
165 170 175
Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg Pro
180 185 190
Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr Asn
195 200 205
Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala Trp
210 215 220
Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe Gly
225 230 235 240
Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn Ser
245 250 255
Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp Lys
260 265 270
Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn Leu
275 280 285
Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr Gly
290 295 300
Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe Tyr
305 310 315 320
Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp Arg
325 330 335
Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr Glu
340 345 350
Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp Cys
355 360 365
Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn Pro
370 375 380
Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly Thr
385 390 395 400
Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn Leu
405 410 415
Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly Val
420 425 430
Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met Asn
435 440 445
Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys Gly
450 455 460
Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr Gly
465 470 475 480
Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn Asn
485 490 495
Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu Tyr
500 505 510
Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu Ile
515 520 525
Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile Val
530 535 540
Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His Ile
545 550 555 560
Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg Leu
565 570 575
Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys Leu
580 585 590
Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr Gly
595 600 605
Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr Ser
610 615 620
Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu Asp
625 630 635 640
Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr Tyr
645 650 655
Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His Ser
660 665 670
Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn Leu
675 680 685
Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr Pro
690 695 700
Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly Ile
705 710 715 720
Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His Ala
725 730 735
Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr Val
740 745 750
Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg Leu
755 760 765
Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg Val
770 775 780
Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly Val
785 790 795 800
Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala Lys
805 810 815
Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg Ser
820 825 830
Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser Leu
835 840 845
Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys Gly
850 855 860
Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr Gly
865 870 875 880
Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln Leu
885 890 895
Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp Leu
900 905 9l0
Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala Phe
915 920 925
Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr Glu
930 935 940
Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu Ser
945 950 955 960
Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu Asn
965 970 975
Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala Ala
980 985 990
Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val Ala
995 1000 1005
Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu Ala
1010 1015 1020
Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly Arg
1025 1030 1035 1040
Val Gly Val Gly Tyr Arg Phe
1045
<210>17
<211>4425
<212>DNA
<213>人工序列
<220>
<223>ΔG983-ORF46.1
<400>17
atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60
acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120
agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180
aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240
aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300
ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360
aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420
cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480
actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540
tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600
gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660
atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720
ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780
taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840
atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900
atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960
tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020
aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080
cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140
acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200
acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260
ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320
ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380
gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440
ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500
ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560
cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620
agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680
atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740
ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800
ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860
gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920
gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980
ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040
gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100
tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160
atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220
gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280
catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340
acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400
gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460
aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520
aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580
ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640
ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700
ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760
tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820
aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880
agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940
tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000
aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060
tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120
cgagtcggcg taggctaccg gttcctcgac ggtggcggag gcactggatc ctcagatttg 3180
gcaaacgatt cttttatccg gcaggttctc gaccgtcagc atttcgaacc cgacgggaaa 3240
taccacctat tcggcagcag gggggaactt gccgagcgca gcggccatat cggattggga 3300
aaaatacaaa gccatcagtt gggcaacctg atgattcaac aggcggccat taaaggaaat 3360
atcggctaca ttgtccgctt ttccgatcac gggcacgaag tccattcccc cttcgacaac 3420
catgcctcac attccgattc tgatgaagcc ggtagtcccg ttgacggatt tagcctttac 3480
cgcatccatt gggacggata cgaacaccat cccgccgacg gctatgacgg gccacagggc 3540
ggcggctatc ccgctcccaa aggcgcgagg gatatataca gctacgacat aaaaggcgtt 3600
gcccaaaata tccgcctcaa cctgaccgac aaccgcagca ccggacaacg gcttgccgac 3660
cgtttccaca atgccggtag tatgctgacg caaggagtag gcgacggatt caaacgcgcc 3720
acccgataca gccccgagct ggacagatcg ggcaatgccg ccgaagcctt caacggcact 3780
gcagatatcg ttaaaaacat catcggcgcg gcaggagaaa ttgtcggcgc aggcgatgcc 3840
gtgcagggca taagcgaagg ctcaaacatt gctgtcatgc acggcttggg tctgctttcc 3900
accgaaaaca agatggcgcg catcaacgat ttggcagata tggcgcaact caaagactat 3960
gccgcagcag ccatccgcga ttgggcagtc caaaacccca atgccgcaca aggcatagaa 4020
gccgtcagca atatctttat ggcagccatc cccatcaaag ggattggagc tgttcgggga 4080
aaatacggct tgggcggcat cacggcacat cctatcaagc ggtcgcagat gggcgcgatc 4140
gcattgccga aagggaaatc cgccgtcagc gacaattttg ccgatgcggc atacgccaaa 4200
tacccgtccc cttaccattc ccgaaatatc cgttcaaact tggagcagcg ttacggcaaa 4260
gaaaacatca cctcctcaac cgtgccgccg tcaaacggca aaaatgtcaa actggcagac 4320
caacgccacc cgaagacagg cgtaccgttt gacggtaaag ggtttccgaa ttttgagaag 4380
cacgtgaaat atgatacgct cgagcaccac caccaccacc actga 4425
<210>18
<211>1474
<212>PRT
<213>人工序列
<220>
<223>ΔG983-ORF46.1
<400>18
Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser
1 5 10 15
Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly
20 25 30
Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg
35 40 45
Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro
50 55 60
Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys
65 70 75 80
Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg
85 90 95
Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile
100 105 110
Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn
115 120 125
Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly
130 135 140
Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu
145 150 155 160
Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His
165 170 175
Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg
180 185 190
Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr
195 200 205
Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala
210 215 220
Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe
225 230 235 240
Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn
245 250 255
Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp
260 265 270
Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn
275 280 285
Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr
290 295 300
Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe
305 310 315 320
Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp
325 330 335
Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr
340 345 350
Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp
355 360 365
Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn
370 375 380
Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly
385 390 395 400
Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn
405 410 415
Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly
420 425 430
Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met
435 440 445
Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys
450 455 460
Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr
465 470 475 480
Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn
485 490 495
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu
500 505 510
Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu
515 520 525
Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile
530 535 540
Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His
545 550 555 560
Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg
565 570 575
Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys
580 585 590
Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr
595 600 605
Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr
610 615 620
Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu
625 630 635 640
Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr
645 650 655
Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His
660 665 670
Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn
675 680 685
Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr
690 695 700
Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly
705 710 715 720
Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His
725 730 735
Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr
740 745 750
Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg
755 760 765
Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg
770 775 780
Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly
785 790 795 800
Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala
805 810 815
Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg
820 825 830
Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser
835 840 845
Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys
850 855 860
Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr
865 870 875 880
Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln
885 890 895
Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp
900 905 910
Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala
9l5 920 925
Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr
930 935 940
Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu
945 950 955 960
Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu
965 970 975
Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala
980 985 990
Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val
995 1000 1005
Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu
1010 1015 1020
Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly
1025 1030 1035 1040
Arg Val Gly Val Gly Tyr Arg Phe Leu Asp Gly Gly Gly Gly Thr Gly
1045 1050 1055
Ser Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg
1060 1065 1070
Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly
1075 1080 1085
Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser
1090 1095 1100
His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn
1105 1110 1115 1120
Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser
1125 1130 1135
Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser
1140 1145 1150
Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu
1155 1160 1165
His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro
1170 1175 1180
Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val
1185 1190 1195 1200
Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln
1205 1210 1215
Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly
1220 1225 1230
Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp
1235 1240 1245
Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val
1250 1255 1260
Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala
1265 1270 1275 1280
Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu
1285 1290 1295
Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala
1300 1305 1310
Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp
1315 1320 1325
Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn
1330 1335 1340
Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly
1345 1350 1355 1360
Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln
1365 1370 1375
Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn
1380 1385 1390
Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg
1395 1400 1405
Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr
1410 1415 1420
Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp
1425 1430 1435 1440
Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro
1445 1450 1455
Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu His His His His
1460 1465 1470
His His
<210>19
<211>3939
<212>DNA
<213>人工序列
<220>
<223>ΔG983-741
<400>19
atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60
acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120
agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180
aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240
aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300
ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360
aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420
cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480
actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540
tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600
gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660
atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720
ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780
taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840
atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900
atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960
tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020
aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080
cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140
acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200
acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260
ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320
ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380
gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440
ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500
ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560
cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620
agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680
atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740
ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800
ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860
gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920
gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980
ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040
gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100
tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160
atccgcccct acggcgcaac tttccgcgca gcggcagccg tgcagcatgc gaatgccgcc 2220
gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280
catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340
acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400
gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460
aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520
aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580
ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640
ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700
ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760
tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820
aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880
agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940
tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000
aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060
tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120
cgagtcggcg taggctaccg gttcctcgag ggatccggag ggggtggtgt cgccgccgac 3180
atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 3240
cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 3300
ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 3360
aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 3420
gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 3480
gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 3540
ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 3600
tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 3660
ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 3720
gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 3780
gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 3840
caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 3900
gccgccaagc aactcgagca ccaccaccac caccactga 3939
<210>20
<211>1312
<212>PRT
<213>人工序列
<220>
<223>ΔG983-741
<400>20
Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser
1 5 10 15
Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly
20 25 30
Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg
35 40 45
Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro
50 55 60
Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys
65 70 75 80
Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg
85 90 95
Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile
100 105 110
Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn
115 120 125
Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly
130 135 140
Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu
145 150 155 160
Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His
165 170 175
Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg
180 185 190
Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr
195 200 205
Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala
210 215 220
Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe
225 230 235 240
Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn
245 250 255
Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp
260 265 270
Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn
275 280 285
Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr
290 295 300
Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe
305 310 315 320
Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp
325 330 335
Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr
340 345 350
Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp
355 360 365
Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn
370 375 380
Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly
385 390 395 400
Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn
405 410 415
Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly
420 425 430
Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met
435 440 445
Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys
450 455 460
Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr
465 470 475 480
Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn
485 490 495
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu
500 505 510
Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu
515 520 525
Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile
530 535 540
Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His
545 550 555 560
Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg
565 570 575
Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys
580 585 590
Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr
595 600 605
Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr
610 615 620
Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu
625 630 635 640
Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr
645 650 655
Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His
660 665 670
Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn
675 680 685
Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr
690 695 700
Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly
705 710 715 720
Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His
725 730 735
Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr
740 745 750
Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg
755 760 765
Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg
770 775 780
Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly
785 790 795 800
Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala
805 810 815
Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg
820 825 830
Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser
835 840 845
Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys
850 855 860
Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr
865 870 875 880
Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln
885 890 895
Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp
900 905 910
Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala
915 920 925
Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr
930 935 940
Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu
945 950 955 960
Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu
965 970 975
Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala
980 985 990
Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val
995 1000 1005
Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu
1010 1015 1020
Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly
1025 1030 1035 1040
Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Ser Gly Gly Gly Gly
1045 1050 1055
Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro
1060 1065 1070
Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser
1075 1080 1085
Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys
1090 1095 1100
Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp
1105 1110 1115 1120
Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln
1125 1130 1135
Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His
1140 1145 1150
Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His
1155 1160 1165
Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly AspIle Ala
1170 1175 1180
Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr
1185 1190 1195 1200
Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr
1205 1210 1215
Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His
1220 1225 1230
Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys
1235 1240 1245
Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn
1250 1255 1260
Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala
1265 1270 1275 1280
Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg
1285 1290 1295
His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His
1300 1305 1310
<210>21
<211>4344
<212>DNA
<213>人工序列
<220>
<223>ΔG983-961
<400>21
atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60
acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120
agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180
aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240
aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300
ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360
aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420
cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480
actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540
tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600
gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660
atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720
ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780
taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840
atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900
atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960
tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020
aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080
cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140
acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200
acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260
ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320
ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380
gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440
ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500
ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560
cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620
agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680
atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740
ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800
ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860
gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920
gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980
ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040
gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100
tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160
atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220
gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280
catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340
acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400
gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460
aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520
aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580
ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640
ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700
ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760
tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820
aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880
agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940
tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000
aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060
tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120
cgagtcggcg taggctaccg gttcctcgag ggtggcggag gcactggatc cgccacaaac 3180
gacgacgatg ttaaaaaagc tgccactgtg gccattgctg ctgcctacaa caatggccaa 3240
gaaatcaacg gtttcaaagc tggagagacc atctacgaca ttgatgaaga cggcacaatt 3300
accaaaaaag acgcaactgc agccgatgtt gaagccgacg actttaaagg tctgggtctg 3360
aaaaaagtcg tgactaacct gaccaaaacc gtcaatgaaa acaaacaaaa cgtcgatgcc 3420
aaagtaaaag ctgcagaatc tgaaatagaa aagttaacaa ccaagttagc agacactgat 3480
gccgctttag cagatactga tgccgctctg gatgcaacca ccaacgcctt gaataaattg 3540
ggagaaaata taacgacatt tgctgaagag actaagacaa atatcgtaaa aattgatgaa 3600
aaattagaag ccgtggctga taccgtcgac aagcatgccg aagcattcaa cgatatcgcc 3660
gattcattgg atgaaaccaa cactaaggca gacgaagccg tcaaaaccgc caatgaagcc 3720
aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca aagtaaaagc tgcagaaact 3780
gcagcaggca aagccgaagc tgccgctggc acagctaata ctgcagccga caaggccgaa 3840
gctgtcgctg caaaagttac cgacatcaaa gctgatatcg ctacgaacaa agataatatt 3900
gctaaaaaag caaacagtgc cgacgtgtac accagagaag agtctgacag caaatttgtc 3960
agaattgatg gtctgaacgc tactaccgaa aaattggaca cacgcttggc ttctgctgaa 4020
aaatccattg ccgatcacga tactcgcctg aacggtttgg ataaaacagt gtcagacctg 4080
cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc tctccggtct gttccaacct 4140
tacaacgtgg gtcggttcaa tgtaacggct gcagtcggcg gctacaaatc cgaatcggca 4200
gtcgccatcg gtaccggctt ccgctttacc gaaaactttg ccgccaaagc aggcgtggca 4260
gtcggcactt cgtccggttc ttccgcagcc taccatgtcg gcgtcaatta cgagtggctc 4320
gagcaccacc accaccacca ctga 4344
<210>22
<211>1447
<212>PRT
<213>人工序列
<220>
<223>ΔG983-961
<400> 22
Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser
1 5 10 15
Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly
20 25 30
Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg
35 40 45
Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro
50 55 60
Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys
65 70 75 80
Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg
85 90 95
Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile
100 105 110
Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn
115 120 125
Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly
130 135 140
Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu
145 150 155 160
Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His
165 170 175
Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg
180 185 190
Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr
195 200 205
Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala
210 215 220
Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe
225 230 235 240
Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn
245 250 255
Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp
260 265 270
Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn
275 280 285
Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr
290 295 300
Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe
305 310 315 320
Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp
325 330 335
Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr
340 345 350
Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp
355 360 365
Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn
370 375 380
Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly
385 390 395 400
Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn
405 410 415
Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly
420 425 430
Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met
435 440 445
Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys
450 455 460
Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr
465 470 475 480
Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn
485 490 495
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu
500 505 510
Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu
515 520 525
Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile
530 535 540
Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His
545 550 555 560
Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg
565 570 575
Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys
580 585 590
Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr
595 600 605
Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr
610 615 620
Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu
625 630 635 640
Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr
645 650 655
Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His
660 665 670
Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn
675 680 685
Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr
690 695 700
Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly
705 710 715 720
Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His
725 730 735
Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr
740 745 750
Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg
755 760 765
Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg
770 775 780
Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly
785 790 795 800
Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala
805 810 815
Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg
820 825 830
Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser
835 840 845
Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys
850 855 860
Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr
865 870 875 880
Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln
885 890 895
Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp
900 905 910
Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala
915 920 925
Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr
930 935 940
Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu
945 950 955 960
Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu
965 970 975
Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala
980 985 990
Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val
995 1000 1005
Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu
1010 1015 1020
Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly
1025 1030 1035 1040
Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly
1045 1050 1055
Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1060 1065 1070
AlaAla Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
1075 1080 1085
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
1090 1095 1100
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
1105 1110 1115 1120
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
1125 1130 1135
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
1140 1145 1150
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
1155 1160 1165
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
1170 1175 1180
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
1185 1190 1195 1200
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
1205 1210 1215
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
1220 1225 1230
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
1235 1240 1245
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
1250 1255 1260
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
1265 1270 1275 1280
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
1285 1290 1295
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
1300 1305 1310
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
1315 1320 1325
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
1330 1335 1340
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr ValSer Asp Leu
1345 1350 1355 1360
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
1365 1370 1375
Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val
1380 1385 1390
Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg
1395 1400 1405
Phe Thr Glo Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser
1410 1415 1420
Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Leu
1425 1430 1435 1440
Glu His His His His His His
1445
<210>23
<211>4179
<212>DNA
<213>人工序列
<220>
<223>ΔG983-961c
<400>23
atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60
acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120
agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180
aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240
aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300
ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360
aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420
cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480
actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540
tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600
gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660
atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720
ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780
taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840
atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900
atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960
tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020
aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080
cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140
acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200
acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260
ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320
ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380
gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440
ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500
ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560
cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620
agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680
atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740
ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800
ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860
gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920
gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980
ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040
gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100
tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160
atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220
gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280
catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340
acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400
gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460
aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520
aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580
ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640
ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700
ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760
tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820
aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880
agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940
tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000
aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060
tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120
cgagtcggcg taggctaccg gttcctcgag ggtggcggag gcactggatc cgccacaaac 3180
gacgacgatg ttaaaaaagc tgccactgtg gccattgctg ctgcctacaa caatggccaa 3240
gaaatcaacg gtttcaaagc tggagagacc atctacgaca ttgatgaaga cggcacaatt 3300
accaaaaaag acgcaactgc agccgatgtt gaagccgacg actttaaagg tctgggtctg 3360
aaaaaagtcg tgactaacct gaccaaaacc gtcaatgaaa acaaacaaaa cgtcgatgcc 3420
aaagtaaaag ctgcagaatc tgaaatagaa aagttaacaa ccaagttagc agacactgat 3480
gccgctttag cagatactga tgccgctctg gatgcaacca ccaacgcctt gaataaattg 3540
ggagaaaata taacgacatt tgctgaagag actaagacaa atatcgtaaa aattgatgaa 3600
aaattagaag ccgtggctga taccgtcgac aagcatgccg aagcattcaa cgatatcgcc 3660
gattcattgg atgaaaccaa cactaaggca gacgaagccg tcaaaaccgc caatgaagcc 3720
aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca aagtaaaagc tgcagaaact 3780
gcagcaggca aagccgaagc tgccgctggc acagctaata ctgcagccga caaggccgaa 3840
gctgtcgctg caaaagttac cgacatcaaa gctgatatcg ctacgaacaa agataatatt 3900
gctaaaaaag caaacagtgc cgacgtgtac accagagaag agtctgacag caaatttgtc 3960
agaattgatg gtctgaacgc tactaccgaa aaattggaca cacgcttggc ttctgctgaa 4020
aaatccattg ccgatcacga tactcgcctg aacggtttgg ataaaacagt gtcagacctg 4080
cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc tctccggtct gttccaacct 4140
tacaacgtgg gtctcgagca ccaccaccac caccactga 4179
<210>24
<211>1392
<212>PRT
<213>人工序列
<220>
<223>ΔG983-961c
<400>24
Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser
1 5 10 15
Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly
20 25 30
Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg
35 40 45
Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro
50 55 60
Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys
65 70 75 80
Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg
85 90 95
Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile
100 105 110
Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn
115 120 125
Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly
130 135 140
Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu
145 150 155 160
Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His
165 170 175
Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg
180 185 190
Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr
195 200 205
Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala
210 215 220
Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe
225 230 235 240
Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn
245 250 255
Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp
260 265 270
Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn
275 280 285
Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr
290 295 300
Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe
305 310 315 320
Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp
325 330 335
Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr
340 345 350
Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp
355 360 365
Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn
370 375 380
Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly
385 390 395 400
Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn
405 410 415
Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly
420 425 430
Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met
435 440 445
Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys
450 455 460
Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr
465 470 475 480
Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn
485 490 495
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu
500 505 510
Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu
515 520 525
Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile
530 535 540
Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His
545 550 555 560
Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg
565 570 575
Leu Gly Lys Leu Leu Lys Val Asp Gly Thr AlaIle Ile Gly Gly Lys
580 585 590
Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr
595 600 605
Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr
610 615 620
Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu
625 630 635 640
Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr
645 650 655
Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His
660 665 670
Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn
675 680 685
Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr
690 695 700
Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly
705 710 715 720
Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His
725 730 735
Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr
740 745 750
Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg
755 760 765
Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg
770 775 780
Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly
785 790 795 800
Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala
805 810 815
Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg
820 825 830
Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser
835 840 845
Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys
850 855 860
Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr
865 870 875 880
Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln
885 890 895
Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp
900 905 9l0
Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala
915 920 925
Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr
930 935 940
Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu
945 950 955 960
Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu
965 970 975
Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala
980 985 990
Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val
995 1000 1005
Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu
10l0 1015 1020
Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly
1025 1030 1035 1040
Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly
1045 1050 1055
Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1060 1065 1070
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
1075 1080 1085
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
1090 1095 1100
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
1105 1110 1115 1120
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
1125 1130 1135
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
1140 1145 1150
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
1155 1160 1165
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
1170 1175 1180
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
1185 1190 1195 1200
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
1205 1210 1215
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
1220 1225 1230
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
1235 1240 1245
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
1250 1255 1260
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
1265 1270 1275 1280
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
1285 1290 1295
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
1300 1305 1310
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
1315 1320 1325
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
1330 1335 1340
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
1345 1350 1355 1360
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
1365 1370 1375
Leu Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His His
1380 1385 1390
<210>25
<211>274
<212>PRT
<213>人工序列
<220>
<223>741
<400>25
Val Asn Arg Thr Ala Phe Cys Cys Leu Ser Leu Thr Thr Ala Leu Ile
1 5 10 15
Leu Thr Ala Cys Ser Ser Gly Gly Gly Gly Val Ala Ala Asp Ile Gly
20 25 30
Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp His Lys Asp Lys
35 40 45
Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser Val Arg Lys Asn Glu Lys
50 55 60
Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp
65 70 75 80
Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys Val Ser Arg Phe Asp
85 90 95
Phe Ile Arg Gln Ile Glu Val Asp Gly Gln Leu Ile Thr Leu Glu Ser
100 105 110
Gly Glu Phe Gln Val Tyr Lys Gln Ser His Ser Ala Leu Thr Ala Phe
115 120 125
Gln Thr Glu Gln Ile Gln Asp Ser Glu His Ser Gly Lys Met Val Ala
130 135 140
Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala Gly Glu His Thr Ser Phe
145 150 155 160
Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe
165 170 175
Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala
180 185 190
Ala Lys Gln Gly Asn Gly Lys Ile Glu His Leu Lys Ser Pro Glu Leu
195 200 205
Asn Val Asp Leu Ala Ala Ala Asp Ile Lys Pro Asp Gly Lys Arg His
210 215 220
Ala Val Ile Ser Gly Ser Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser
225 230 235 240
Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala Gln Glu Val Ala Gly Ser
245 250 255
Ala Glu Val Lys Thr Val Asn Gly Ile Arg His Ile Gly Leu Ala Ala
260 265 270
Lys Gln
<210>26
<211>248
<212>PRT
<213>人工序列
<220>
<223>ΔG741
<400>26
Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro
1 5 10 15
Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser
20 25 30
Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys
35 40 45
Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp
50 55 60
Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln
65 70 75 80
Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His
85 90 95
Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His
100 105 110
Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala
115 120 125
Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr
130 135 140
Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr
145 150 155 160
Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His
165 170 175
Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys
180 185 190
Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn
195 200 205
Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala
210 215 220
Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg
225 230 235 240
His Ile Gly Leu Ala Ala Lys Gln
245
<210>27
<211>1947
<212>DNA
<213>人工序列
<220>
<223>ΔG741-961
<400>27
atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60
aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120
aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180
aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240
cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300
accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360
cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420
ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480
acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540
ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600
gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660
tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720
cgccatatcg gccttgccgc caagcaactc gagggtggcg gaggcactgg atccgccaca 780
aacgacgacg atgttaaaaa agctgccact gtggccattg ctgctgccta caacaatggc 840
caagaaatca acggtttcaa agctggagag accatctacg acattgatga agacggcaca 900
attaccaaaa aagacgcaac tgcagccgat gttgaagccg acgactttaa aggtctgggt 960
ctgaaaaaag tcgtgactaa cctgaccaaa accgtcaatg aaaacaaaca aaacgtcgat 1020
gccaaagtaa aagctgcaga atctgaaata gaaaagttaa caaccaagtt agcagacact 1080
gatgccgctt tagcagatac tgatgccgct ctggatgcaa ccaccaacgc cttgaataaa 1140
ttgggagaaa atataacgac atttgctgaa gagactaaga caaatatcgt aaaaattgat 1200
gaaaaattag aagccgtggc tgataccgtc gacaagcatg ccgaagcatt caacgatatc 1260
gccgattcat tggatgaaac caacactaag gcagacgaag ccgtcaaaac cgccaatgaa 1320
gccaaacaga cggccgaaga aaccaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1380
actgcagcag gcaaagccga agctgccgct ggcacagcta atactgcagc cgacaaggcc 1440
gaagctgtcg ctgcaaaagt taccgacatc aaagctgata tcgctacgaa caaagataat 1500
attgctaaaa aagcaaacag tgccgacgtg tacaccagag aagagtctga cagcaaattt 1560
gtcagaattg atggtctgaa cgctactacc gaaaaattgg acacacgctt ggcttctgct 1620
gaaaaatcca ttgccgatca cgatactcgc ctgaacggtt tggataaaac agtgtcagac 1680
ctgcgcaaag aaacccgcca aggccttgca gaacaagccg cgctctccgg tctgttccaa 1740
ccttacaacg tgggtcggtt caatgtaacg gctgcagtcg gcggctacaa atccgaatcg 1800
gcagtcgcca tcggtaccgg cttccgcttt accgaaaact ttgccgccaa agcaggcgtg 1860
gcagtcggca cttcgtccgg ttcttccgca gcctaccatg tcggcgtcaa ttacgagtgg 1920
ctcgagcacc accaccacca ccactga 1947
<210>28
<211>648
<212>PRT
<213>人工序列
<220>
<223>ΔG741-961
<400>28
Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala
1 5 10 15
Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln
20 25 30
Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu
35 40 45
Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn
50 55 60
Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly
65 70 75 80
Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser
85 90 95
His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu
100 105 110
His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile
115 120 125
Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala
130 135 140
Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu
145 150 155 160
Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu
165 170 175
Hi s Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile
180 185 190
Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr
195 200 205
Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys
210 215 220
Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile
225 230 235 240
Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Gly Gly Gly Thr
245 250 255
Gly Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala
260 265 270
Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala
275 280 285
Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys
290 295 300
Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly
305 310 315 320
Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys
325 330 335
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys
340 345 350
Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp
355 360 365
Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn
370 375 380
Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp
385 390 395 400
Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala
405 410 415
Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp
420 425 430
Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr
435 440 445
Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly
450 455 460
Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala
465 470 475 480
Glu Ala Val Ala Ala Lys Val Thr AspIle Lys Ala AspIle Ala Thr
485 490 495
Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr
500 505 510
Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala
515 520 525
Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile
530 535 540
Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp
545 550 555 560
Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser
565 570 575
Gly Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala
580 585 590
Val Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe
595 600 605
Arg Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr
610 615 620
Ser Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp
625 630 635 640
Leu Glu His His His His His His
645
<210>29
<211>1782
<212>DNA
<213>人工序列
<220>
<223>ΔG741-961c
<400>29
atggtcgccg ccgacatcgg tgcggggctt gccgatgcae taaccgcacc gctcgaccat 60
aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120
aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180
aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240
cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300
accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360
cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420
ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480
acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540
ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600
gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660
tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720
cgccatatcg gccttgccgc caagcaactc gagggtggcg gaggcactgg atccgccaca 780
aacgacgacg atgttaaaaa agctgccact gtggccattg ctgctgccta caacaatggc 840
caagaaatca acggtttcaa agctggagag accatctacg acattgatga agacggcaca 900
attaccaaaa aagacgcaac tgcagccgat gttgaagccg acgactttaa aggtctgggt 960
ctgaaaaaag tcgtgactaa cctgaccaaa accgtcaatg aaaacaaaca aaacgtcgat 1020
gccaaagtaa aagctgcaga atctgaaata gaaaagttaa caaccaagtt agcagacact 1080
gatgccgctt tagcagatac tgatgccgct ctggatgcaa ccaccaacgc cttgaataaa 1140
ttgggagaaa atataacgac atttgctgaa gagactaaga caaatatcgt aaaaattgat 1200
gaaaaattag aagccgtggc tgataccgtc gacaagcatg ccgaagcatt caacgatatc 1260
gccgattcat tggatgaaac caacactaag gcagacgaag ccgtcaaaac cgccaatgaa 1320
gccaaacaga cggccgaaga aaccaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1380
actgcagcag gcaaagccga agctgccgct ggcacagcta atactgcagc cgacaaggcc 1440
gaagctgtcg ctgcaaaagt taccgacatc aaagctgata tcgctacgaa caaagataat 1500
attgctaaaa aagcaaacag tgccgacgtg tacaccagag aagagtctga cagcaaattt 1560
gtcagaattg atggtctgaa cgctactacc gaaaaattgg acacacgctt ggcttctgct 1620
gaaaaatcca ttgccgatca cgatactcgc ctgaacggtt tggataaaac agtgtcagac 1680
ctgcgcaaag aaacccgcca aggccttgca gaacaagccg cgctctccgg tctgttccaa 1740
ccttacaacg tgggtctcga gcaccaccac caccaccact ga 1782
<210>30
<211>593
<212>PRT
<213>人工序列
<220>
<223>ΔG741-961c
<400>30
Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala
1 5 10 15
Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln
20 25 30
Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu
35 40 45
Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn
50 55 60
Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly
65 70 75 80
Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser
85 90 95
His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu
100 105 110
His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile
115 120 125
Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala
130 135 140
Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu
145 150 155 160
Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu
165 170 175
His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile
180 185 190
Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr
195 200 205
Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu GlyIle Phe Gly Gly Lys
2l0 215 220
Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile
225 230 235 240
Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Gly Gly Gly Thr
245 250 255
Gly Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala
260 265 270
Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala
275 280 285
Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys
290 295 300
Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly
305 310 315 320
Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys
325 330 335
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys
340 345 350
Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp
355 360 365
Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn
370 375 380
Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp
385 390 395 400
Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala
405 410 415
Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp
420 425 430
Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr
435 440 445
Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly
450 455 460
Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala
465 470 475 480
Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr
485 490 495
Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr
500 505 510
Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala
515 520 525
Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile
530 535 540
Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp
545 550 555 560
Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser
565 570 575
Gly Leu Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His
580 585 590
His
<210>31
<211>3939
<212>DNA
<213>人工序列
<220>
<223>ΔG741-983
<400>31
atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60
aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120
aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180
aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240
cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300
accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360
cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420
ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480
acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540
ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600
gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660
tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720
cgccatatcg gccttgccgc caagcaactc gagggatccg gcggaggcgg cacttctgcg 780
cccgacttca atgcaggcgg taccggtatc ggcagcaaca gcagagcaac aacagcgaaa 840
tcagcagcag tatcttacgc cggtatcaag aacgaaatgt gcaaagacag aagcatgctc 900
tgtgccggtc gggatgacgt tgcggttaca gacagggatg ccaaaatcaa tgcccccccc 960
ccgaatctgc ataccggaga ctttccaaac ccaaatgacg catacaagaa tttgatcaac 1020
ctcaaacctg caattgaagc aggctataca ggacgcgggg tagaggtagg tatcgtcgac 1080
acaggcgaat ccgtcggcag catatccttt cccgaactgt atggcagaaa agaacacggc 1140
tataacgaaa attacaaaaa ctatacggcg tatatgcgga aggaagcgcc tgaagacgga 1200
ggcggtaaag acattgaagc ttctttcgac gatgaggccg ttatagagac tgaagcaaag 1260
ccgacggata tccgccacgt aaaagaaatc ggacacatcg atttggtctc ccatattatt 1320
ggcgggcgtt ccgtggacgg cagacctgca ggcggtattg cgcccgatgc gacgctacac 1380
ataatgaata cgaatgatga aaccaagaac gaaatgatgg ttgcagccat ccgcaatgca 1440
tgggtcaagc tgggcgaacg tggcgtgcgc atcgtcaata acagttttgg aacaacatcg 1500
agggcaggca ctgccgacct tttccaaata gccaattcgg aggagcagta ccgccaagcg 1560
ttgctcgact attccggcgg tgataaaaca gacgagggta tccgcctgat gcaacagagc 1620
gattacggca acctgtccta ccacatccgt aataaaaaca tgcttttcat cttttcgaca 1680
ggcaatgacg cacaagctca gcccaacaca tatgccctat tgccatttta tgaaaaagac 1740
gctcaaaaag gcattatcac agtcgcaggc gtagaccgca gtggagaaaa gttcaaacgg 1800
gaaatgtatg gagaaccggg tacagaaccg cttgagtatg gctccaacca ttgcggaatt 1860
actgccatgt ggtgcctgtc ggcaccctat gaagcaagcg tccgtttcac ccgtacaaac 1920
ccgattcaaa ttgccggaac atccttttcc gcacccatcg taaccggcac ggcggctctg 1980
ctgctgcaga aatacccgtg gatgagcaac gacaacctgc gtaccacgtt gctgacgacg 2040
gctcaggaca tcggtgcagt cggcgtggac agcaagttcg gctggggact gctggatgcg 2100
ggtaaggcca tgaacggacc cgcgtccttt ccgttcggcg actttaccgc cgatacgaaa 2160
ggtacatccg atattgccta ctccttccgt aacgacattt caggcacggg cggcctgatc 2220
aaaaaaggcg gcagccaact gcaactgcac ggcaacaaca cctatacggg caaaaccatt 2280
atcgaaggcg gttcgctggt gttgtacggc aacaacaaat cggatatgcg cgtcgaaacc 2340
aaaggtgcgc tgatttataa cggggcggca tccggcggca gcctgaacag cgacggcatt 2400
gtctatctgg cagataccga ccaatccggc gcaaacgaaa ccgtacacat caaaggcagt 2460
ctgcagctgg acggcaaagg tacgctgtac acacgtttgg gcaaactgct gaaagtggac 2520
ggtacggcga ttatcggcgg caagctgtac atgtcggcac gcggcaaggg ggcaggctat 2580
ctcaacagta ccggacgacg tgttcccttc ctgagtgccg ccaaaatcgg gcaggattat 2640
tctttcttca caaacatcga aaccgacggc ggcctgctgg cttccctcga cagcgtcgaa 2700
aaaacagcgg gcagtgaagg cgacacgctg tcctattatg tccgtcgcgg caatgcggca 2760
cggactgctt cggcagcggc acattccgcg cccgccggtc tgaaacacgc cgtagaacag 2820
ggcggcagca atctggaaaa cctgatggtc gaactggatg cctccgaatc atccgcaaca 2880
cccgagacgg ttgaaactgc ggcagccgac cgcacagata tgccgggcat ccgcccctac 2940
ggcgcaactt tccgcgcagc ggcagccgta cagcatgcga atgccgccga cggtgtacgc 3000
atcttcaaca gtctcgccgc taccgtctat gccgacagta ccgccgccca tgccgatatg 3060
cagggacgcc gcctgaaagc cgtatcggac gggttggacc acaacggcac gggtctgcgc 3120
gtcatcgcgc aaacccaaca ggacggtgga acgtgggaac agggcggtgt tgaaggcaaa 3180
atgcgcggca gtacccaaac cgtcggcatt gccgcgaaaa ccggcgaaaa tacgacagca 3240
gccgccacac tgggcatggg acgcagcaca tggagcgaaa acagtgcaaa tgcaaaaacc 3300
gacagcatta gtctgtttgc aggcatacgg cacgatgcgg gcgatatcgg ctatctcaaa 3360
ggcctgttct cctacggacg ctacaaaaac agcatcagcc gcagcaccgg tgcggacgaa 3420
catgcggaag gcagcgtcaa cggcacgctg atgcagctgg gcgcactggg cggtgtcaac 3480
gttccgtttg ccgcaacggg agatttgacg gtcgaaggcg gtctgcgcta cgacctgctc 3540
aaacaggatg cattcgccga aaaaggcagt gctttgggct ggagcggcaa cagcctcact 3600
gaaggcacgc tggtcggact cgcgggtctg aagctgtcgc aacccttgag cgataaagcc 3660
gtcctgtttg caacggcggg cgtggaacgc gacctgaacg gacgcgacta cacggtaacg 3720
ggcggcttta ccggcgcgac tgcagcaacc ggcaagacgg gggcacgcaa tatgccgcac 3780
acccgtctgg ttgccggcct gggcgcggat gtcgaattcg gcaacggctg gaacggcttg 3840
gcacgttaca gctacgccgg ttccaaacag tacggcaacc acagcggacg agtcggcgta 3900
ggctaccggt tcctcgagca ccaccaccac caccactga 3939
<210>32
<211>1312
<212>PRT
<213>人工序列
<220>
<223>ΔG741-983
<400>32
Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala
1 5 10 15
Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln
20 25 30
Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu
35 40 45
Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn
50 55 60
Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly
65 70 75 80
Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser
85 90 95
His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu
100 105 110
His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile
115 120 125
Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala
130 135 140
Thr Tyr Arg Gly Thr Ala Phe G1y Ser Asp Asp Ala Gly Gly Lys Leu
145 150 155 160
Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu
165 170 175
His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile
180 185 190
Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr
195 200 205
Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys
210 215 220
Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile
225 230 235 240
Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Ser Gly Gly Gly
245 250 255
Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser
260 265 270
Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly
275 280 285
Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg
290 295 V 300
Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro
305 310 315 320
Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys
325 330 335
Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg
340 345 350
Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile
355 360 365
Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn
370 375 380
Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly
385 390 395 400
Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu
405 410 415
Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His
420 425 430
Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg
435 440 445
Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr
450 455 460
Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala
465 470 475 480
Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe
485 490 495
Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn
500 505 510
Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp
515 520 525
Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn
530 535 540
Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr
545 550 555 560
Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe
565 570 575
Tyr Glu Lys Asp Ala Gln Lys Gly IleIle Thr Val Ala Gly Val Asp
580 585 590
Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr
595 600 605
Glu Pro Leu Glu Tyr Gly Ser Asn His Cys GlyIle Thr Ala Met Trp
610 615 620
Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn
625 630 635 640
Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly
645 650 655
Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn
660 665 670
Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly
675 680 685
Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met
690 695 700
Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys
705 710 715 720
Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr
725 730 735
Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn
740 745 750
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu
755 760 765
Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu
770 775 780
Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile
785 790 795 800
Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His
805 8l0 815
Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg
820 825 830
Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys
835 840 845
Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr
850 855 860
Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr
865 870 875 880
Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu
885 890 895
Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr
900 905 910
Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His
915 920 925
Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn
930 935 940
Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr
945 950 955 960
Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly
965 970 975
Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His
980 985 990
Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr
995 1000 1005
Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg
1010 1015 1020
Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg
1025 1030 1035 1040
Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly
1045 1050 1055
Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala
1060 1065 1070
Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg
1075 1080 1085
Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser
1090 1095 1100
Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys
1105 1110 1115 1120
Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr
1125 1130 1135
Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln
1140 1145 1150
Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp
1155 1160 1165
Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala
1170 1175 1180
Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr
1185 1190 1195 1200
Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu
1205 1210 1215
Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu
1220 1225 1230
Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala
1235 1240 1245
Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val
1250 1255 1260
Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu
1265 1270 1275 1280
Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly
1285 1290 1295
Arg Val Gly Val Gly Tyr Arg Phe Leu Glu His His His His His His
1300 1305 1310
<210>33
<211>2028
<212>DNA
<213>人工序列
<220>
<223>ΔG741-ORF46.1
<400>33
atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60
aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120
aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180
aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240
cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300
accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360
cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420
ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480
acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540
ccagaactca atgtcgacct ggccgccgcc gatatcaage cggatggaaa acgccatgcc 600
gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660
tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720
cgccatatcg gccttgccgc caagcaactc gacggtggcg gaggcactgg atcctcagat 780
ttggcaaacg attcttttat ccggcaggtt ctcgaccgtc agcatttcga acccgacggg 840
aaataccacc tattcggcag caggggggaa cttgccgagc gcagcggcca tatcggattg 900
ggaaaaatac aaagccatca gttgggcaac ctgatgattc aacaggcggc cattaaagga 960
aatatcggct acattgtccg cttttccgat cacgggcacg aagtccattc ccccttcgac 1020
aaccatgcct cacattccga ttctgatgaa gccggtagtc ccgttgacgg atttagcctt 1080
taccgcatcc attgggacgg atacgaacac catcccgccg acggctatga cgggccacag 1140
ggcggcggct atcccgctcc caaaggcgcg agggatatat acagctacga cataaaaggc 1200
gttgcccaaa atatccgcct caacctgacc gacaaccgca gcaccggaca acggcttgcc 1260
gaccgtttcc acaatgccgg tagtatgctg acgcaaggag taggcgacgg attcaaacgc 1320
gccacccgat acagccccga gctggacaga tcgggcaatg ccgccgaagc cttcaacggc 1380
actgcagata tcgttaaaaa catcatcggc gcggcaggag aaattgtcgg cgcaggcgat 1440
gccgtgcagg gcataagcga aggctcaaac attgctgtca tgcacggctt gggtctgctt 1500
tccaccgaaa acaagatggc gcgcatcaac gatttggcag atatggcgca actcaaagac 1560
tatgccgcag cagccatccg cgattgggca gtccaaaacc ccaatgccgc acaaggcata 1620
gaagccgtca gcaatatctt tatggcagcc atccccatca aagggattgg agctgttcgg 1680
ggaaaatacg gcttgggcgg catcacggca catcctatca agcggtcgca gatgggcgcg 1740
atcgcattgc cgaaagggaa atccgccgtc agcgacaatt ttgccgatgc ggcatacgcc 1800
aaatacccgt ccccttacca ttcccgaaat atccgttcaa acttggagca gcgttacggc 1860
aaagaaaaca tcacctcctc aaccgtgccg ccgtcaaacg gcaaaaatgt caaactggca 1920
gaccaacgcc acccgaagac aggcgtaccg tttgacggta aagggtttcc gaattttgag 1980
aagcacgtga aatatgatac gctcgagcac caccaccacc accactga 2028
<210>34
<211>675
<212>PRT
<213>人工序列
<220>
<223>ΔG741-ORF46.1
<400>34
Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala
1 5 10 15
Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln
20 25 30
Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu
35 40 45
Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn
50 55 60
Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly
65 70 75 80
Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser
85 90 95
His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu
100 105 110
His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile
115 120 125
Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala
130 135 140
Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu
145 150 155 160
Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu
165 170 175
His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile
180 185 190
Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr
195 200 205
Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys
210 215 220
Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile
225 230 235 240
Arg His Ile Gly Leu Ala Ala Lys Gln Leu Asp Gly Gly Gly Gly Thr
245 250 255
Gly Ser Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp
260 265 270
Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg
275 280 285
Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln
290 295 300
Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly
305 310 315 320
Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His
325 330 335
Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly
340 345 350
Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr
355 360 365
Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr
370 375 380
Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly
385 390 395 400
Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly
405 410 415
Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln
420 425 430
Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu
435 440 445
Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile
450 455 460
Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp
465 470 475 480
Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly
485 490 495
Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu
500 505 510
Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp
515 520 525
Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser
530 535 540
Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg
545 550 555 560
Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser
565 570 575
Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp
580 585 590
Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser
595 600 605
Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile
610 615 620
Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala
625 630 635 640
Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe
645 650 655
Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu His His His
660 665 670
His His His
675
<210>35
<211>2019
<212>DNA
<213>人工序列
<220>
<223>ORF46.1-741
<400>35
atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 60
cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120
atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180
attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240
cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300
tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 360
gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420
ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480
cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540
ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600
ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660
gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720
ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780
ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840
caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900
gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960
atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020
gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080
cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140
aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200
aattttgaga agcacgtgaa atatgatacg ggatccggag ggggtggtgt cgccgccgac 1260
atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 1320
cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 1380
ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 1440
aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 1500
gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 1560
gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 1620
ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 1680
tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 1740
ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 1800
gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 1860
gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 1920
caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 1980
gccgccaagc aactcgagca ccaccaccac caccactga 2019
<210>36
<211>672
<212>PRT
<213>人工序列
<220>
<223>ORF46.1-741
<400>36
Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg
1 5 10 15
Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly
20 25 30
Glu Leu Ala Glu Arg Ser Gly His lle Gly Leu Gly Lys Ile Gln Ser
35 40 45
His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn
50 55 60
Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser
65 70 75 80
Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser
85 90 95
Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu
100 105 110
His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro
115 120 125
Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val
130 135 140
Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln
145 150 155 160
Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly
165 170 175
Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp
180 185 190
Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val
195 200 205
Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala
210 215 220
Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu
225 230 235 240
Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala
245 250 255
Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp
260 265 270
Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn
275 280 285
Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly
290 295 300
Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln
305 310 315 320
Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn
325 330 335
Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg
340 345 350
Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr
355 360 365
Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp
370 375 380
Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro
385 390 395 400
Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly
405 410 415
Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro
420 425 430
Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser
435 440 445
Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys
450 455 460
Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp
465 470 475 480
Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln
485 490 495
Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His
500 505 510
Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His
5l5 520 525
Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly AspIle Ala
530 535 540
Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr
545 550 555 560
Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr
565 570 575
Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His
580 585 590
Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp lle Lys
595 600 605
Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn
610 615 620
Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala
625 630 635 640
Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg
645 650 655
His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His
660 665 670
<210>37
<211>2421
<212>DNA
<213>人工序列
<220>
<223>ORF46.1-961
<400>37
atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 60
cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120
atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180
attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240
cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300
tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 360
gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420
ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480
cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540
ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600
ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660
gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720
ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780
ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840
caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900
gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960
atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020
gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080
cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140
aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200
aattttgaga agcacgtgaa atatgatacg ggatccggag gaggaggagc cacaaacgac 1260
gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1320
atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1380
aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1440
aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1500
gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1560
gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1620
gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1680
ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1740
tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1800
cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 1860
gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 1920
gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 1980
aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2040
attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2100
tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2160
aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2220
aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2280
gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2340
ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggctcgag 2400
caccaccacc accaccactg a 2421
<210>38
<211>806
<212>PRT
<213>人工序列
<220>
<223>ORF46.1-961
<400>38
Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg
1 5 10 15
Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly
20 25 30
Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser
35 40 45
His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn
50 55 60
Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser
65 70 75 80
Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser
85 90 95
Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu
100 105 110
His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro
115 120 125
Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val
130 135 140
Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln
145 150 155 160
Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly
165 170 175
Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp
180 185 190
Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val
195 200 205
Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala
210 215 220
Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu
225 230 235 240
Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala
245 250 255
Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp
260 265 270
Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn
275 280 285
Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly
290 295 300
Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln
305 310 315 320
Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn
325 330 335
Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg
340 345 350
Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr
355 360 365
Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp
370 375 380
Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro
385 390 395 400
Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly
405 410 415
Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala
420 425 430
Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu
435 440 445
Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala
450 455 460
Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys
465 470 475 480
Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn
485 490 495
Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr
500 505 510
Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala
515 520 525
Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr
530 535 540
Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys
545 550 555 560
Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn
565 570 575
Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala
580 585 590
Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln
595 600 605
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala
610 615 620
Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala
625 630 635 640
Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys
645 650 655
Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu
660 665 670
Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr
675 680 685
Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp
690 695 700
His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg
705 710 715 720
Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu
725 730 735
Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly
740 745 750
Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe
755 760 765
Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser
770 775 780
Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Leu Glu
785 790 795 800
His His His His His His
805
<210>39
<211>2256
<212> DNA
<213>人工序列
<220>
<223>ORF46.1-96lc
<400>39
atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 60
cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120
atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180
attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240
cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300
tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 360
gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420
ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480
cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540
ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600
ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660
gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720
ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780
ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840
caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900
gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960
atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020
gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080
cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140
aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200
aattttgaga agcacgtgaa atatgatacg ggatccggag gaggaggagc cacaaacgac 1260
gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1320
atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1380
aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1440
aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1500
gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1560
gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1620
gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1680
ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1740
tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1800
cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 1860
gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 1920
gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 1980
aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2040
attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2100
tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2160
aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2220
aacgtgggtc tcgagcacca ccaccaccac cactga 2256
<210>40
<211>751
<212> PRT
<213>人工序列
<220>
<223>ORF46.1-961c
<400>40
Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg
1 5 10 15
Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly
20 25 30
Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser
35 40 45
His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn
50 55 60
Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser
65 70 75 80
Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser
85 90 95
Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu
100 105 110
His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro
115 120 125
Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val
130 135 140
Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln
145 150 155 160
Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly
165 170 175
Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp
180 185 190
Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val
195 200 205
Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala
210 215 220
Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu
225 230 235 240
Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala
245 250 255
Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp
260 265 270
Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn
275 280 285
Ile Phe Met Ala Ala Ile Pro Ile Lys GlyIle Gly Ala Val Arg Gly
290 295 300
Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln
305 310 315 320
Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn
325 330 335
Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg
340 345 350
Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr
355 360 365
Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp
370 375 380
Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro
385 390 395 400
Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly
405 410 415
Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala
420 425 430
Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu
435 440 445
Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala
450 455 460
Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys
465 470 475 480
Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn
485 490 495
Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr
500 505 510
Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala
515 520 525
Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr
530 535 540
Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys
545 550 555 560
Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn
565 570 575
Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala
580 585 590
Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln
595 600 605
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala
610 615 620
Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala
625 630 635 640
Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys
645 650 655
Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu
660 665 670
Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr
675 680 685
Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp
690 695 700
His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg
705 710 715 720
Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu
725 730 735
Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His His
740 745 750
<210>41
<211>2421
<212>DNA
<213>人工序列
<220>
<223>961-ORF46.1
<400>41
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020
tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080
gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140
tacgagtggg gatccggagg aggaggatca gatttggcaa acgattcttt tatccggcag 1200
gttctcgacc gtcagcattt cgaacccgac gggaaatacc acctattcgg cagcaggggg 1260
gaacttgccg agcgcagcgg ccatatcgga ttgggaaaaa tacaaagcca tcagttgggc 1320
aacctgatga ttcaacaggc ggccattaaa ggaaatatcg gctacattgt ccgcttttcc 1380
gatcacgggc acgaagtcca ttcccccttc gacaaccatg cctcacattc cgattctgat 1440
gaagccggta gtcccgttga cggatttagc ctttaccgca tccattggga cggatacgaa 1500
caccatcccg ccgacggcta tgacgggcca cagggcggcg gctatcccgc tcccaaaggc 1560
gcgagggata tatacagcta cgacataaaa ggcgttgccc aaaatatccg cctcaacctg 1620
accgacaacc gcagcaccgg acaacggctt gccgaccgtt tccacaatgc cggtagtatg 1680
ctgacgcaag gagtaggcga cggattcaaa cgcgccaccc gatacagccc cgagctggac 1740
agatcgggca atgccgccga agccttcaac ggcactgcag atatcgttaa aaacatcatc 1800
ggcgcggcag gagaaattgt cggcgcaggc gatgccgtgc agggcataag cgaaggctca 1860
aacattgctg tcatgcacgg cttgggtctg ctttccaccg aaaacaagat ggcgcgcatc 1920
aacgatttgg cagatatggc gcaactcaaa gactatgccg cagcagccat ccgcgattgg 1980
gcagtccaaa accccaatgc cgcacaaggc atagaagccg tcagcaatat ctttatggca 2040
gccatcccca tcaaagggat tggagctgtt cggggaaaat acggcttggg cggcatcacg 2100
gcacatccta tcaagcggtc gcagatgggc gcgatcgcat tgccgaaagg gaaatccgcc 2160
gtcagcgaca attttgccga tgcggcatac gccaaatacc cgtcccctta ccattcccga 2220
aatatccgtt caaacttgga gcagcgttac ggcaaagaaa acatcacctc ctcaaccgtg 2280
ccgccgtcaa acggcaaaaa tgtcaaactg gcagaccaac gccacccgaa gacaggcgta 2340
ccgtttgacg gtaaagggtt tccgaatttt gagaagcacg tgaaatatga tacgctcgag 2400
caccaccacc accaccactg a 2421
<210>42
<211>806
<212>PRT
<213>人工序列
<220>
<223>961-ORF46.1
<400>42
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 105 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val
325 330 335
Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg
340 345 350
Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser
355 360 365
Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Gly
370 375 380
Ser Gly Gly Gly Gly Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln
385 390 395 400
Val Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe
405 410 415
Gly Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly
420 425 430
Lys Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala
435 440 445
Ile Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His
450 455 460
Glu Val His Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp
465 470 475 480
Glu Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp
485 490 495
Asp Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly
500 505 510
Gly Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp
515 520 525
Ile Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg
530 535 540
Ser Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met
545 550 555 560
Leu Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser
565 570 575
Pro Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr
580 585 590
Ala Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly
595 600 605
Ala Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val
610 615 620
Met His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile
625 630 635 640
Asn Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala
645 650 655
Ile Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu
660 665 670
Ala Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly
675 680 685
Ala Val Arg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile
690 695 700
Lys Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala
705 710 715 720
Val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro
725 730 735
Tyr His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys
740 745 750
Glu Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val
755 760 765
Lys Leu Ala Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly
770 775 780
Lys Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu
785 790 795 800
His His His His His His
805
<210>43
<211>1938
<212>DNA
<213>人工序列
<220>
<223>961-741
<400>43
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaaec aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacaogcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020
tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080
gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140
tacgagtggg gatccggagg gggtggtgtc gccgccgaca tcggtgcggg gcttgccgat 1200
gcactaaccg caccgctcga ccataaagac aaaggtttgc agtctttgac gctggatcag 1260
tccgtcagga aaaacgagaa actgaagctg gcggcacaag gtgcggaaaa aacttatgga 1320
aacggtgaca gcctcaatac gggcaaattg aagaacgaca aggtcagccg tttcgacttt 1380
atccgccaaa tcgaagtgga cgggcagctc attaccttgg agagtggaga gttccaagta 1440
tacaaacaaa gccattccgc cttaaccgcc tttcagaccg agcaaataca agattcggag 1500
cattccggga agatggttgc gaaacgccag ttcagaatcg gcgacatagc gggcgaacat 1560
acatcttttg acaagcttcc cgaaggcggc agggcgacat atcgcgggac ggcgttcggt 1620
tcagacgatg ccggcggaaa actgacctac accatagatt tcgccgccaa gcagggaaac 1680
ggcaaaatcg aacatttgaa atcgccagaa ctcaatgtcg acctggccgc cgccgatatc 1740
aagccggatg gaaaacgcca tgccgtcatc agcggttccg tcctttacaa ccaagccgag 1800
aaaggcagtt actccctcgg tatctttggc ggaaaagccc aggaagttgc cggcagcgcg 1860
gaagtgaaaa ccgtaaacgg catacgccat atcggccttg ccgccaagca actcgagcac 1920
caccaccacc accactga 1938
<210>44
<211>645
<212>PRT
<213>人工序列
<220>
<223>961741
<400>44
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 105 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His A1a Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp G1u Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val
325 330 335
Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg
340 345 350
Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser
355 360 365
Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Gly
370 375 380
Ser Gly Gly Gly Gly Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp
385 390 395 400
Ala Leu Thr Ala Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu
405 410 415
Thr Leu Asp Gln Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala
420 425 430
Gln Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly
435 440 445
Lys Leu Lys Asn Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile
450 455 460
Glu Val Asp Gly Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val
465 470 475 480
Tyr Lys Gln Ser His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile
485 490 495
Gln Asp Ser Glu His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg
500 505 510
Ile Gly Asp Ile Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu
515 520 525
Gly Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala
530 535 540
Gly Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn
545 550 555 560
Gly Lys Ile Glu His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala
565 570 575
Ala Ala Asp Ile Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly
580 585 590
Ser Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile
595 600 605
Phe Gly Gly Lys Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr
610 615 620
Val Asn Gly Ile Arg His Ile Gly Leu Ala A1a Lys Gln Leu Glu His
625 630 635 640
His His His His His
645
<210>45
<211>4335
<212>DNA
<213>人工序列
<220>
<223>961-983
<400>45
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020
tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080
gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140
tacgagtggg gatccggcgg aggcggcact tctgcgcccg acttcaatgc aggcggtacc 1200
ggtatcggca gcaacagcag agcaacaaca gcgaaatcag cagcagtatc ttacgccggt 1260
atcaagaacg aaatgtgcaa agacagaagc atgctctgtg ccggtcggga tgacgttgcg 1320
gttacagaca gggatgccaa aatcaatgcc ccccccccga atctgcatac cggagacttt 1380
ccaaacccaa atgacgcata caagaatttg atcaacctca aacctgcaat tgaagcaggc 1440
tatacaggac gcggggtaga ggtaggtatc gtcgacacag gcgaatccgt cggcagcata 1500
tcctttcccg aactgtatgg cagaaaagaa cacggctata acgaaaatta caaaaactat 1560
acggcgtata tgcggaagga agcgcctgaa gacggaggcg gtaaagacat tgaagcttct 1620
ttcgacgatg aggccgttat agagactgaa gcaaagccga cggatatccg ccacgtaaaa 1680
gaaatcggac acatcgattt ggtctcccat attattggcg ggcgttccgt ggacggcaga 1740
cctgcaggcg gtattgcgcc cgatgcgacg ctacacataa tgaatacgaa tgatgaaacc 1800
aagaacgaaa tgatggttgc agccatccgc aatgcatggg tcaagctggg cgaacgtggc 1860
gtgcgcatcg tcaataacag ttttggaaca acatcgaggg caggcactgc cgaccttttc 1920
caaatagcca attcggagga gcagtaccgc caagcgttgc tcgactattc cggcggtgat 1980
aaaacagacg agggtatccg cctgatgcaa cagagcgatt acggcaacct gtcctaccac 2040
atccgtaata aaaacatgct tttcatcttt tcgacaggca atgacgcaca agctcagccc 2100
aacacatatg ccctattgcc attttatgaa aaagacgctc aaaaaggcat tatcacagtc 2160
gcaggcgtag accgcagtgg agaaaagttc aaacgggaaa tgtatggaga accgggtaca 2220
gaaccgcttg agtatggctc caaccattgc ggaattactg ccatgtggtg cctgtcggca 2280
ccctatgaag caagcgtccg tttcacccgt acaaacccga ttcaaattgc cggaacatcc 2340
ttttccgcac ccatcgtaac cggcacggcg gctctgctgc tgcagaaata cccgtggatg 2400
agcaacgaca acctgcgtac cacgttgctg acgacggctc aggacatcgg tgcagtcggc 2460
gtggacagca agttcggctg gggactgctg gatgcgggta aggccatgaa cggacccgcg 2520
tcctttccgt tcggcgactt taccgccgat acgaaaggta catccgarat tgcctactcc 2580
ttccgtaacg acatttcagg cacgggcggc ctgatcaaaa aaggcggcag ccaactgcaa 2640
ctgcacggca acaacaccta tacgggcaaa accattatcg aaggcggttc gctggtgttg 2700
tacggcaaca acaaatcgga tatgcgcgtc gaaaccaaag gtgcgctgat ttataacggg 2760
gcggcatccg gcggcagcct gaacagcgac ggcattgtct atctggcaga taccgaccaa 2820
tccggcgcaa acgaaaccgt acacatcaaa ggcagtctgc agctggacgg caaaggtacg 2880
ctgtacacac gtttgggcaa actgctgaaa gtggacggta cggcgattat cggcggcaag 2940
ctgtacatgt cggcacgcgg caagggggca ggctatctca acagtaccgg acgacgtgtt 3000
cccttcctga gtgccgccaa aatcgggcag gattattctt tcttcacaaa catcgaaacc 3060
gacggcggcc tgctggcttc cctcgacagc gtcgaaaaaa cagcgggcag tgaaggcgac 3120
acgctgtcct attatgtccg tcgcggcaat gcggcacgga ctgcttcggc agcggcacat 3180
tccgcgcccg ccggtctgaa acacgccgta gaacagggcg gcagcaatct ggaaaacctg 3240
atggtcgaac tggatgcctc cgaatcatcc gcaacacccg agacggttga aactgcggca 3300
gccgaccgca cagatatgcc gggcatccgc ccctacggcg caactttccg cgcagcggca 3360
gccgtacagc atgcgaatgc cgccgacggt gtacgcatct tcaacagtct cgccgctacc 3420
gtctatgccg acagtaccgc cgcccatgcc gatatgcagg gacgccgcct gaaagccgta 3480
tcggacgggt tggaccacaa cggcacgggt ctgcgcgtca tcgcgcaaac ccaacaggac 3540
ggtggaacgt gggaacaggg cggtgttgaa ggcaaaatgc gcggcagtac ccaaaccgtc 3600
ggcattgccg cgaaaaccgg cgaaaatacg acagcagccg ccacactggg catgggacgc 3660
agcacatgga gcgaaaacag tgcaaatgca aaaaccgaca gcattagtct gtttgcaggc 3720
atacggcacg atgcgggcga tatcggctat ctcaaaggcc tgttctccta cggacgctac 3780
aaaaacagca tcagccgcag caccggtgcg gacgaacatg cggaaggcag cgtcaacggc 3840
acgctgatgc agctgggcgc actgggcggt gtcaacgttc cgtttgccgc aacgggagat 3900
ttgacggtcg aaggcggtct gcgctacgac ctgctcaaac aggatgcatt cgccgaaaaa 3960
ggcagtgctt tgggctggag cggcaacagc ctcactgaag gcacgctggt cggactcgcg 4020
ggtctgaagc tgtcgcaacc cttgagcgat aaagccgtcc tgtttgcaac ggcgggcgtg 4080
gaacgcgacc tgaacggacg cgactacacg gtaacgggcg gctttaccgg cgcgactgca 4140
gcaaccggca agacgggggc acgcaatatg ccgcacaccc gtctggttgc cggcctgggc 4200
gcggatgtcg aattcggcaa cggctggaac ggcttggcac gttacagcta cgccggttcc 4260
aaacagtacg gcaaccacag cggacgagtc ggcgtaggct accggttcct cgagcaccac 4320
caccaccacc actga 4335
<210>46
<211>1444
<212>PRT
<213>人工序列
<220>
<223>961-983
<400>46
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 105 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val
325 330 335
Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg
340 345 350
Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser
355 360 365
Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Gly
370 375 380
Ser Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr
385 390 395 400
Gly Ile Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val
405 410 415
Ser Tyr Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu
420 425 430
Cys Ala Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile
435 440 445
Asn Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn
450 455 460
Asp Ala Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly
465 470 475 480
Tyr Thr Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser
485 490 495
Val Gly Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly
500 505 510
Tyr Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala
515 520 525
Pro Glu Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu
530 535 540
Ala Val Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys
545 550 555 560
Glu Ile Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser
565 570 575
Val Asp Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His
580 585 590
Ile Met Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala
595 600 605
Ile Arg Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val
610 615 620
Asn Asn Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe
625 630 635 640
Gln Ile Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr
645 650 655
Ser Gly Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser
660 665 670
Asp Tyr Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe
675 680 685
Ile Phe Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala
690 695 700
Leu Leu Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val
705 710 715 720
Ala Gly Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly
725 730 735
Glu Pro Gly Thr Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile
740 745 750
Thr Ala Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe
755 760 765
Thr Arg Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro
770 775 780
Ile Val Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met
785 790 795 800
Ser Asn Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile
805 810 815
Gly Ala Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala
820 825 830
Gly Lys Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr
835 840 845
Ala Asp Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp
850 855 860
Ile Ser Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln
865 870 875 880
Leu His Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly
885 890 895
Ser Leu Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr
900 905 910
Lys Gly Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn
915 920 925
Ser Asp Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn
930 935 940
Glu Thr Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr
945 950 955 960
Leu Tyr Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile
965 970 975
Ile Gly Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr
980 985 990
Leu Asn Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile
995 1000 1005
Gly Gln Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu
1010 1015 1020
Leu Ala Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp
1025 1030 1035 1040
Thr Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser
1045 1050 1055
Ala Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln
1060 1065 1070
Gly Gly Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu
1075 1080 1085
Ser Ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr
1090 1095 1100
Asp Met Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala
1105 1110 1115 1120
Ala Val Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser
1125 1130 1135
Leu Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met
1140 1145 1150
Gln Gly Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly
1155 1160 1165
Thr Gly Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp
1170 1175 1180
Glu Gln Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val
1185 1190 1195 1200
Gly Ile Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu
1205 1210 1215
Gly Met Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr
1220 1225 1230
Asp Ser Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile
1235 1240 1245
Gly Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile
1250 1255 1260
Ser Arg Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly
1265 1270 1275 1280
Thr Leu Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala
1285 1290 1295
Ala Thr Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu
1300 1305 1310
Lys Gln Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly
1315 1320 1325
Asn Ser Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu
1330 1335 1340
Ser Gln Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val
1345 1350 1355 1360
Glu Arg Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr
1365 1370 1375
Gly Ala Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His
1380 1385 1390
Thr Arg Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly
1395 1400 1405
Trp Asn Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly
1410 1415 1420
Asn His Ser Gly Arg Val Gly Val Gly Tyr Arg Phe Leu Glu His His
1425 1430 1435 1440
His His His His
<210>47
<211>2256
<212>DNA
<213>人工序列
<220>
<223>961c-0RF46.1
<400>47
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtggatcc ggaggaggag gatcagattt ggcaaacgat 1020
tcttttatcc ggcaggttct cgaccgtcag catttcgaac ccgacgggaa ataccaccta 1080
ttcggcagca ggggggaact tgccgagcgc agcggccata tcggattggg aaaaatacaa 1140
agccatcagt tgggcaacct gatgattcaa caggcggcca ttaaaggaaa tatcggctac 1200
attgtccgct tttccgatca cgggcacgaa gtccattccc ccttcgacaa ccatgcctca 1260
cattccgatt ctgatgaagc cggtagtccc gttgacggat ttagccttta ccgcatccat 1320
tgggacggat acgaacacca tcccgccgac ggctatgacg ggccacaggg cggcggctat 1380
cccgctccca aaggcgcgag ggatatatac agctacgaca taaaaggcgt tgcccaaaat 1440
atccgcctca acctgaccga caaccgcagc accggacaac ggcttgccga ccgtttccac 1500
aatgccggta gtatgctgac gcaaggagta ggcgacggat tcaaacgcgc cacccgatac 1560
agccccgagc tggacagatc gggcaatgcc gccgaagcct tcaacggcac tgcagatatc 1620
gttaaaaaca tcatcggcgc ggcaggagaa attgtcggcg caggcgatgc cgtgcagggc 1680
ataagcgaag gctcaaacat tgctgtcatg cacggcttgg gtctgctttc caccgaaaac 1740
aagatggcgc gcatcaacga tttggcagat atggcgcaac tcaaagacta tgccgcagca 1800
gccatccgcg attgggcagt ccaaaacccc aatgccgcac aaggcataga agccgtcagc 1860
aatatcttta tggcagccat ccccatcaaa gggattggag ctgttcgggg aaaatacggc 1920
ttgggcggca tcacggcaca tcctatcaag cggtcgcaga tgggcgcgat cgcattgccg 1980
aaagggaaat ccgccgtcag cgacaatttt gccgatgcgg catacgccaa atacccgtcc 2040
ccttaccatt cccgaaatat ccgttcaaac ttggagcagc gttacggcaa agaaaacatc 2100
acctcctcaa ccgtgccgcc gtcaaacggc aaaaatgtca aactggcaga ccaacgccac 2160
ccgaagacag gcgtaccgtt tgacggtaaa gggtttccga attttgagaa gcacgtgaaa 2220
tatgatacgc tcgagcacca ccaccaccac cactga 2256
<210>48
<211>751
<212>PRT
<213>人工序列
<220>
<223>961c-ORF46.1
<400>48
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 105 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Ser Asp
325 330 335
Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg Gln His Phe
340 345 350
Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly Glu Leu Ala
355 360 365
Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser His Gln Leu
370 375 380
Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn I1e Gly Tyr
385 390 395 400
Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser Pro Phe Asp
405 410 415
Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser Pro Val Asp
420 425 430
Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu His His Pro
435 440 445
Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro Ala Pro Lys
450 455 460
Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val Ala Gln Asn
465 470 475 480
Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln Arg Leu Ala
485 490 495
Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly Val Gly Asp
500 505 510
Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp Arg Ser Gly
515 520 525
Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val Lys Asn Ile
530 535 540
Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala Val Gln Gly
545 550 555 560
Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu Gly Leu Leu
565 570 575
Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala Asp Met Ala
580 585 590
Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp Ala Val Gln
595 600 605
Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn Ile Phe Met
610 615 620
Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly Lys Tyr Gly
625 630 635 640
Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln Met Gly Ala
645 650 655
Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn Phe Ala Asp
660 665 670
Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg Asn Ile Arg
675 680 685
Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr Ser Ser Thr
690 695 700
Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp Gln Arg His
705 710 715 720
Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu
725 730 735
Lys His Val Lys Tyr Asp Thr Leu Glu His His His His His His
740 745 750
<210>49
<211>1773
<212>DNA
<213>人工序列
<220>
<223>961c-741
<400>49
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtggatcc ggagggggtg gtgtcgccgc cgacatcggt 1020
gcggggcttg ccgatgcact aaccgcaccg ctcgaccata aagacaaagg tttgcagtct 1080
ttgacgctgg atcagtccgt caggaaaaac gagaaactga agctggcggc acaaggtgcg 1140
gaaaaaactt atggaaacgg tgacagcctc aatacgggca aattgaagaa cgacaaggtc 1200
agccgtttcg actttatccg ccaaatcgaa gtggacgggc agctcattac cttggagagt 1260
ggagagttcc aagtatacaa acaaagccat tccgccttaa ccgcctttca gaccgagcaa 1320
atacaagatt cggagcattc cgggaagatg gttgcgaaac gccagttcag aatcggcgac 1380
atagcgggcg aacatacatc ttttgacaag cttcccgaag gcggcagggc gacatatcgc 1440
gggacggcgt tcggttcaga cgatgccggc ggaaaactga cctacaccat agatttcgcc 1500
gccaagcagg gaaacggcaa aatcgaacat ttgaaatcgc cagaactcaa tgtcgacctg 1560
gccgccgccg atatcaagcc ggatggaaaa cgccatgccg tcatcagcgg ttccgtcctt 1620
tacaaccaag ccgagaaagg cagttactcc ctcggtatct ttggcggaaa agcccaggaa 1680
gttgccggca gcgcggaagt gaaaaccgta aacggcatac gccatatcgg ccttgccgcc 1740
aagcaactcg agcaccacca ccaccaccac tga 1773
<210>50
<211>590
<212>PRT
<213>人工序列
<220>
<223>961c-741
<400> 50
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 105 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu A1a Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Val Ala
325 330 335
Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp
340 345 350
His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser Val Arg
355 360 365
Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys Thr Tyr
370 375 380
Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys Val
385 390 395 400
Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln Leu Ile
405 410 415
Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His Ser Ala
420 425 430
Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His Ser Gly
435 440 445
Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala Gly Glu
450 455 460
His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr Tyr Arg
465 470 475 480
Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr
485 490 495
Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His Leu Lys
500 505 510
Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys Pro Asp
515 520 525
Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn Gln Ala
530 535 540
Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala Gln Glu
545 550 555 560
Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg His Ile
565 570 575
Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His
580 585 590
<210>51
<211>4170
<212>DNA
<213>人工序列
<220>
<223>961c-983
<400>51
atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960
ctgttccaac cttacaacgt gggtggatcc ggcggaggcg gcacttctgc gcccgacttc 1020
aatgcaggcg gtaccggtat cggcagcaac agcagagcaa caacagcgaa atcagcagca 1080
gtatcttacg ccggtatcaa gaacgaaatg tgcaaagaca gaagcatgct ctgtgccggt 1140
cgggatgacg ttgcggttac agacagggat gccaaaatca atgccccccc cccgaatctg 1200
cataccggag actttccaaa cccaaatgac gcatacaaga atttgatcaa cctcaaacct 1260
gcaattgaag caggctatac aggacgcggg gtagaggtag gtatcgtcga cacaggcgaa 1320
tccgtcggca gcatatcctt tcccgaactg tatggcagaa aagaacacgg ctataacgaa 1380
aattacaaaa actatacggc gtatatgcgg aaggaagcgc ctgaagacgg aggcggtaaa 1440
gacattgaag cttctttcga cgatgaggcc gttatagaga ctgaagcaaa gccgacggat 1500
atccgccacg taaaagaaat cggacacatc gatttggtct cccatattat tggcgggcgt 1560
tccgtggacg gcagacctgc aggcggtatt gcgcccgatg cgacgctaca cataatgaat 1620
acgaatgatg aaaccaagaa cgaaatgatg gttgcagcca tccgcaatgc atgggtcaag 1680
ctgggcgaac gtggcgtgcg catcgtcaat aacagttttg gaacaacatc gagggcaggc 1740
actgccgacc ttttccaaat agccaattcg gaggagcagt accgccaagc gttgctcgac 1800
tattccggcg gtgataaaac agacgagggt atccgcctga tgcaacagag cgattacggc 1860
aacctgtcct accacatccg taataaaaac atgcttttca tcttttcgac aggcaatgac 1920
gcacaagctc agcccaacac atatgcccta ttgccatttt atgaaaaaga cgctcaaaaa 1980
ggcattatca cagtcgcagg cgtagaccgc agtggagaaa agttcaaacg ggaaatgtat 2040
ggagaaccgg gtacagaacc gcttgagtat ggctccaacc attgcggaat tactgccatg 2100
tggtgcctgt cggcacccta tgaagcaagc gtccgtttca cccgtacaaa cccgattcaa 2160
attgccggaa catccttttc cgcacccatc gtaaccggca cggcggctct gctgctgcag 2220
aaatacccgt ggatgagcaa cgacaacctg cgtaccacgt tgctgacgac ggctcaggac 2280
atcggtgcag tcggcgtgga cagcaagttc ggctggggac tgctggatgc gggtaaggcc 2340
atgaacggac ccgcgtcctt tccgttcggc gactttaccg ccgatacgaa aggtacatcc 2400
gatattgcct actccttccg taacgacatt tcaggcacgg gcggcctgat caaaaaaggc 2460
ggcagccaac tgcaactgca cggcaacaac acctatacgg gcaaaaccat tatcgaaggc 2520
ggttcgctgg tgttgtacgg caacaacaaa tcggatatgc gcgtcgaaac caaaggtgcg 2580
ctgatttata acggggcggc atccggcggc agcctgaaca gcgacggcat tgtctatctg 2640
gcagataccg accaatccgg cgcaaacgaa accgtacaca tcaaaggcag tctgcagctg 2700
gacggcaaag gtacgctgta cacacgtttg ggcaaactgc tgaaagtgga cggtacggcg 2760
attatcggcg gcaagctgta catgtcggca cgcggcaagg gggcaggcta tctcaacagt 2820
accggacgac gtgttccctt cctgagtgcc gccaaaatcg ggcaggatta ttctttcttc 2880
acaaacatcg aaaccgacgg cggcctgctg gcttccctcg acagcgtcga aaaaacagcg 2940
ggcagtgaag gcgacacgct gtcctattat gtccgtcgcg gcaatgcggc acggactgct 3000
tcggcagcgg cacattccgc gcccgccggt ctgaaacacg ccgtagaaca gggcggcagc 3060
aatctggaaa acctgatggt cgaactggat gcctccgaat catccgcaac acccgagacg 3120
gttgaaactg cggcagccga ccgcacagat atgccgggca tccgccccta cggcgcaact 3180
ttccgcgcag cggcagccgt acagcatgcg aatgccgccg acggtgtacg catcttcaac 3240
agtctcgccg ctaccgtcta tgccgacagt accgccgccc atgccgatat gcagggacgc 3300
cgcctgaaag ccgtatcgga cgggttggac cacaacggca cgggtctgcg cgtcatcgcg 3360
caaacccaac aggacggtgg aacgtgggaa cagggcggtg ttgaaggcaa aatgcgcggc 3420
agtacccaaa ccgtcggcat tgccgcgaaa accggcgaaa atacgacagc agccgccaca 3480
ctgggcatgg gacgcagcac atggagcgaa aacagtgcaa atgcaaaaac cgacagcatt 3540
agtctgtttg caggcatacg gcacgatgcg ggcgatatcg gctatctcaa aggcctgttc 3600
tcctacggac gctacaaaaa cagcatcagc cgcagcaccg gtgcggacga acatgcggaa 3660
ggcagcgtca acggcacgct gatgcagctg ggcgcactgg gcggtgtcaa cgttccgttt 3720
gccgcaacgg gagatttgac ggtcgaaggc ggtctgcgct acgacctgct caaacaggat 3780
gcattcgccg aaaaaggcag tgctttgggc tggagcggca acagcctcac tgaaggcacg 3840
ctggtcggac tcgcgggtct gaagctgtcg caacccttga gcgataaagc cgtcctgttt 3900
gcaacggcgg gcgtggaacg cgacctgaac ggacgcgact acacggtaac gggcggcttt 3960
accggcgcga ctgcagcaac cggcaagacg ggggcacgca atatgccgca cacccgtctg 4020
gttgccggcc tgggcgcgga tgtcgaattc ggcaacggct ggaacggctt ggcacgttac 4080
agctacgccg gttccaaaca gtacggcaac cacagcggac gagtcggcgt aggctaccgg 4140
ttcctcgagc accaccacca ccaccactga 4170
<210>52
<211>1389
<212>PRT
<213>人工序列
<220>
<223>961c-983
<400>52
Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile
1 5 10 15
Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly
20 25 30
Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp
35 40 45
Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu
50 55 60
Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln
65 70 75 80
Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu
85 90 95
Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala
100 l05 110
Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile
115 120 125
Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu
130 135 140
Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe
145 150 155 160
Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu
165 170 175
Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys
180 185 190
Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys
195 200 205
Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu
210 215 220
Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn
225 230 235 240
Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg
245 250 255
Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr
260 265 270
Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala
275 280 285
Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu
290 295 300
Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly
305 310 315 320
Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Thr Ser
325 330 335
Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser Asn Ser Arg
340 345 350
Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly Ile Lys Asn
355 360 365
Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg Asp Asp Val
370 375 380
Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro Pro Asn Leu
385 390 395 400
His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys Asn Leu Ile
405 410 415
Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg Gly Val Glu
420 425 430
Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile Ser Phe Pro
435 440 445
Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn Tyr Lys Asn
450 455 460
Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly Gly Gly Lys
465 470 475 480
Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu Thr Glu Ala
485 490 495
Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His Ile Asp Leu
500 505 510
Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg Pro Ala Gly
515 520 525
Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr Asn Asp Glu
530 535 540
Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala Trp Val Lys
545 550 555 560
Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe Gly Thr Thr
565 570 575
Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn Ser Glu Glu
580 585 590
Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp Lys Thr Asp
595 600 605
Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn Leu Ser Tyr
610 615 620
His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr Gly Asn Asp
625 630 635 640
Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe Tyr Glu Lys
645 650 655
Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp Arg Ser Gly
660 665 670
Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr Glu Pro Leu
675 680 685
Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp Cys Leu Ser
690 695 700
Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn Pro Ile Gln
705 7l0 7l5 720
Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly Thr Ala Ala
725 730 735
Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn Leu Arg Thr
740 745 750
Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly Val Asp Ser
755 760 765
Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met Asn Gly Pro
770 775 780
Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys Gly Thr Ser
785 790 795 800
Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr Gly Gly Leu
805 810 815
Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn Asn Thr Tyr
820 825 830
Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu Tyr Gly Asn
835 840 845
Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu Ile Tyr Asn
850 855 860
Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile Val Tyr Leu
865 870 875 880
Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His Ile Lys Gly
885 890 895
Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg Leu Gly Lys
900 905 910
Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys Leu Tyr Met
915 920 925
Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr Gly Arg Arg
930 935 940
Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr Ser Phe Phe
945 950 955 960
Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu Asp Ser Val
965 970 975
Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr Tyr Val Arg
980 985 990
Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His Ser Ala Pro
995 1000 1005
Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn Leu Glu Asn
1010 1015 1020
Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr Pro Glu Thr
1025 1030 1035 1040
Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly Ile Arg Pro
1045 1050 1055
Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His Ala Asn Ala
1060 1065 1070
Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr Val Tyr Ala
1075 1080 1085
Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg Leu Lys Ala
1090 1095 1100
Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg Val Ile Ala
1105 1110 1115 1120
Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly Val Glu Gly
1125 1130 1135
Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala Lys Thr Gly
1140 1145 1150
Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg Ser Thr Trp
1155 1160 1165
Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser Leu Phe Ala
1170 1175 1180
Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys Gly Leu Phe
1185 1190 1195 1200
Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr Gly Ala Asp
1205 1210 1215
Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln Leu Gly Ala
1220 1225 1230
Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp Leu Thr Val
1235 1240 1245
Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala Phe Ala Glu
1250 1255 1260
Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr Glu Gly Thr
1265 1270 1275 1280
Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu Ser Asp Lys
1285 1290 1295
Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu Asn Gly Arg
1300 1305 1310
Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala Ala Thr Gly
1315 1320 1325
Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val Ala Gly Leu
1330 1335 1340
Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu Ala Arg Tyr
1345 1350 1355 1360
Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly Arg Val Gly
1365 1370 1375
Val Gly Tyr Arg Phe Leu Glu His His His His His His
1380 1385
<210>53
<211>2304
<212>DNA
<213>人工序列
<220>
<223>961cL-0RF46.1
<400>53
atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60
gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120
gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180
gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240
tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300
aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360
aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420
aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480
atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540
gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600
aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660
gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720
gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780
acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840
tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900
cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960
aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 1020
tccggtctgt tccaacctta caacgtgggt ggatccggag gaggaggatc agatttggca 1080
aacgattctt ttatccggca ggttctcgac cgtcagcatt tcgaacccga cgggaaatac 1140
cacctattcg gcagcagggg ggaacttgcc gagcgcagcg gccatatcgg attgggaaaa 1200
atacaaagcc atcagttggg caacctgatg attcaacagg cggccattaa aggaaatatc 1260
ggctacattg tccgcttttc cgatcacggg cacgaagtcc attccccctt cgacaaccat 1320
gcctcacatt ccgattctga tgaagccggt agtcccgttg acggatttag cctttaccgc 1380
atccattggg acggatacga acaccatccc gccgacggct atgacgggcc acagggcggc 1440
ggctatcccg ctcccaaagg cgcgagggat atatacagct acgacataaa aggcgttgcc 1500
caaaatatcc gcctcaacct gaccgacaac cgcagcaccg gacaacggct tgccgaccgt 1560
ttccacaatg ccggtagtat gctgacgcaa ggagtaggcg acggattcaa acgcgccacc 1620
cgatacagcc ccgagctgga cagatcgggc aatgccgccg aagccttcaa cggcactgca 1680
gatatcgtta aaaacatcat cggcgcggca ggagaaattg tcggcgcagg cgatgccgtg 1740
cagggcataa gcgaaggctc aaacattgct gtcatgcacg gcttgggtct gctttccacc 1800
gaaaacaaga tggcgcgcat caacgatttg gcagatatgg cgcaactcaa agactatgcc 1860
gcagcagcca tccgcgattg ggcagtccaa aaccccaatg ccgcacaagg catagaagcc 1920
gtcagcaata tctttatggc agccatcccc atcaaaggga ttggagctgt tcggggaaaa 1980
tacggcttgg gcggcatcac ggcacatcct atcaagcggt cgcagatggg cgcgatcgca 2040
ttgccgaaag ggaaatccgc cgtcagcgac aattttgccg atgcggcata cgccaaatac 2100
ccgtcccctt accattcccg aaatatccgt tcaaacttgg agcagcgtta cggcaaagaa 2160
aacatcacct cctcaaccgt gccgccgtca aacggcaaaa atgtcaaact ggcagaccaa 2220
cgccacccga agacaggcgt accgtttgac ggtaaagggt ttccgaattt tgagaagcac 2280
gtgaaatatg atacgtaact cgag 2304
<210>54
<211>765
<212>PRT
<213>人工序列
<220>
<223>961cL-ORF46.1
<400>54
Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala Ile Leu Ala Thr
1 5 10 15
Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys
20 25 30
Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile
35 40 45
Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly
50 55 60
Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp
65 70 75 80
Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr
85 90 95
Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu
100 105 110
Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala
115 120 125
Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn
130 135 140
Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn
145 150 155 160
Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp
165 170 175
Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr
180 185 190
Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln
195 200 205
Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala
210 215 220
Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr
225 230 235 240
Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys
245 250 255
Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser
260 265 270
Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile
275 280 285
Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser
290 295 300
Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp
305 310 315 320
Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu
325 330 335
Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser
340 345 350
Gly Gly Gly Gly Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val
355 360 365
Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly
370 375 380
Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys
385 390 395 400
Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile
405 410 415
Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu
420 425 430
Val His Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu
435 440 445
Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp
450 455 460
Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly
465 470 475 480
Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile
485 490 495
Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser
500 505 510
Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu
515 520 525
Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro
530 535 540
Glu Leu Asp Arg Ser Gly Asn Ala Ala G1u Ala Phe Asn Gly Thr Ala
545 550 555 560
Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala
565 570 575
Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met
580 585 590
His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn
595 600 605
Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile
610 615 620
Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala
625 630 635 640
Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala
645 650 655
ValArg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys
660 665 670
Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val
675 680 685
Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr
690 695 700
His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu
705 710 715 720
Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys
725 730 735
Leu Ala Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys
740 745 750
Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr
755 760 765
<210>55
<211>1839
<212>DNA
<213>人工序列
<220>
<223>961cL-741
<400>55
atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60
gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120
gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180
gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240
tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300
aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360
aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420
aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480
atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540
gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600
aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660
gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720
gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780
acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840
tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900
cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960
aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 1020
tccggtctgt tccaacctta caacgtgggt ggatccggag ggggtggtgt cgccgccgac 1080
atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 1140
cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 1200
ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 1260
aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 1320
gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 1380
gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 1440
ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 1500
tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 1560
ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 1620
gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 1680
gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 1740
caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 1800
gccgccaagc aactcgagca ccaccaccac caccactga 1839
<210>56
<211>612
<212>PRT
<213>人工序列
<220>
<223>961cL-74l
<400>56
Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala Ile Leu Ala Thr
1 5 10 15
Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys
20 25 30
Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile
35 40 45
Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly
50 55 60
Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp
65 70 75 80
Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr
85 90 95
Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu
100 105 110
Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala
115 120 125
Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn
130 135 140
Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn
145 150 155 160
Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp
165 170 175
Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr
180 185 190
Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln
195 200 205
Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala
210 215 220
Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr
225 230 235 240
Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys
245 250 255
Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser
260 265 270
Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile
275 280 285
Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser
290 295 300
Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp
305 310 315 320
Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu
325 330 335
Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser
340 345 350
Gly Gly Gly Gly Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala
355 360 365
Leu Thr Ala Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr
370 375 380
Leu Asp Gln Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln
385 390 395 400
Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys
405 410 415
Leu Lys Asn Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu
420 425 430
Val Asp Gly Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr
435 440 445
Lys Gln Ser His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln
450 455 460
Asp Ser Glu His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile
465 470 475 480
Gly Asp Ile Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly
485 490 495
Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly
500 505 510
Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly
515 520 525
Lys Ile Glu His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala
530 535 540
Ala Asp Ile Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser
545 550 555 560
Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe
565 570 575
Gly Gly Lys Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val
580 585 590
Asn Gly Ile Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His
595 600 605
His His His His
610
<210>57
<211>4218
<212>DNA
<213>人工序列
<220>
<223>961cL-983
<400>57
atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60
gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120
gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180
gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240
tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300
aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360
aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420
aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480
atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540
gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600
aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660
gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720
gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780
acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840
tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900
cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960
aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 1020
tccggtctgt tccaacctta caacgtgggt ggatccggcg gaggcggcac ttctgcgccc 1080
gacttcaatg caggcggtac cggtatcggc agcaacagca gagcaacaac agcgaaatca 1140
gcagcagtat cttacgccgg tatcaagaac gaaatgtgca aagacagaag catgctctgt 1200
gccggtcggg atgacgttgc ggttacagac agggatgcca aaatcaatgc cccccccccg 1260
aatctgcata ccggagactt tccaaaccca aatgacgcat acaagaattt gatcaacctc 1320
aaacctgcaa ttgaagcagg ctatacagga cgcggggtag aggtaggtat cgtcgacaca 1380
ggcgaatccg tcggcagcat atcctttccc gaactgtatg gcagaaaaga acacggctat 1440
aacgaaaatt acaaaaacta tacggcgtat atgcggaagg aagcgcctga agacggaggc 1500
ggtaaagaca ttgaagcttc tttcgacgat gaggccgtta tagagactga agcaaagccg 1560
acggatatcc gccacgtaaa agaaatcgga cacatcgatt tggtctccca tattattggc 1620
gggcgttccg tggacggcag acctgcaggc ggtattgcgc ccgatgcgac gctacacata 1680
atgaatacga atgatgaaac caagaacgaa atgatggttg cagccatccg caatgcatgg 1740
gtcaagctgg gcgaacgtgg cgtgcgcatc gtcaataaca gttttggaac aacatcgagg 1800
gcaggcactg ccgacctttt ccaaatagcc aattcggagg agcagtaccg ccaagcgttg 1860
ctcgactatt ccggcggtga taaaacagac gagggtatcc gcctgatgca acagagcgat 1920
tacggcaacc tgtcctacca catccgtaat aaaaacatgc ttttcatctt ttcgacaggc 1980
aatgacgcac aagctcagcc caacacatat gccctattgc cattttatga aaaagacgct 2040
caaaaaggca ttatcacagt cgcaggcgta gaccgcagtg gagaaaagtt caaacgggaa 2100
atgtatggag aaccgggtac agaaccgctt gagtatggct ccaaccattg cggaattact 2160
gccatgtggt gcctgtcggc accctatgaa gcaagcgtcc gtttcacccg tacaaacccg 2220
attcaaattg ccggaacatc cttttccgca cccatcgtaa ccggcacggc ggctctgctg 2280
ctgcagaaat acccgtggat gagcaacgac aacctgcgta ccacgttgct gacgacggct 2340
caggacatcg gtgcagtcgg cgtggacagc aagttcggct ggggactgct ggatgcgggt 2400
aaggccatga acggacccgc gtcctttccg ttcggcgact ttaccgccga tacgaaaggt 2460
acatccgata ttgcctactc cttccgtaac gacatttcag gcacgggcgg cctgatcaaa 2520
aaaggcggca gccaactgca actgcacggc aacaacacct atacgggcaa aaccattatc 2580
gaaggcggtt cgctggtgtt gtacggcaac aacaaatcgg atatgcgcgt cgaaaccaaa 2640
ggtgcgctga tttataacgg ggcggcatcc ggcggcagcc tgaacagcga cggcattgtc 2700
tatctggcag ataccgacca atccggcgca aacgaaaccg tacacatcaa aggcagtctg 2760
cagctggacg gcaaaggtac gctgtacaca cgtttgggca aactgctgaa agtggacggt 2820
acggcgatta tcggcggcaa gctgtacatg tcggcacgcg gcaagggggc aggctatctc 2880
aacagtaccg gacgacgtgt tcccttcctg agtgccgcca aaatcgggca ggattattct 2940
ttcttcacaa acatcgaaac cgacggcggc ctgctggctt ccctcgacag cgtcgaaaaa 3000
acagcgggca gtgaaggcga cacgctgtcc tattatgtcc gtcgcggcaa tgcggcacgg 3060
actgcttcgg cagcggcaca ttccgcgccc gccggtctga aacacgccgt agaacagggc 3120
ggcagcaatc tggaaaacct gatggtcgaa ctggatgcct ccgaatcatc cgcaacaccc 3180
gagacggttg aaactgcggc agccgaccgc acagatatgc cgggcatccg cccctacggc 3240
gcaactttcc gcgcagcggc agccgtacag catgcgaatg ccgccgacgg tgtacgcatc 3300
ttcaacagtc tcgccgctac cgtctatgcc gacagtaccg ccgcccatgc cgatatgcag 3360
ggacgccgcc tgaaagccgt atcggacggg ttggaccaca acggcacggg tctgcgcgtc 3420
atcgcgcaaa cccaacagga cggtggaacg tgggaacagg gcggtgttga aggcaaaatg 3480
cgcggcagta cccaaaccgt cggcattgcc gcgaaaaccg gcgaaaatac gacagcagcc 3540
gccacactgg gcatgggacg cagcacatgg agcgaaaaca gtgcaaatgc aaaaaccgac 3600
agcattagtc tgtttgcagg catacggcac gatgcgggcg atatcggcta tctcaaaggc 3660
ctgttctcct acggacgcta caaaaacagc atcagccgca gcaccggtgc ggacgaacat 3720
gcggaaggca gcgtcaacgg cacgctgatg cagctgggcg cactgggcgg tgtcaacgtt 3780
ccgtttgccg caacgggaga tttgacggtc gaaggcggtc tgcgctacga cctgctcaaa 3840
caggatgcat tcgccgaaaa aggcagtgct ttgggctgga gcggcaacag cctcactgaa 3900
ggcacgctgg tcggactcgc gggtctgaag ctgtcgcaac ccttgagcga taaagccgtc 3960
ctgtttgcaa cggcgggcgt ggaacgcgac ctgaacggac gcgactacac ggtaacgggc 4020
ggctttaccg gcgcgactgc agcaaccggc aagacggggg cacgcaatat gccgcacacc 4080
cgtctggttg ccggcctggg cgcggatgtc gaattcggca acggctggaa cggcttggca 4140
cgttacagct acgccggttc caaacagtac ggcaaccaca gcggacgagt cggcgtaggc 4200
taccggttct gactcgag 4218
<210>58
<211>1403
<212>PRT
<213>人工序列
<220>
<223>961cL-983
<400>58
Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala I1e Leu Ala Thr
1 5 10 15
Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys
20 25 30
Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile
35 40 45
Asn Gly Phe Lys Ala Gly Glu ThrIle Tyr Asp Ile Asp Glu Asp Gly
50 55 60
Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp
65 70 75 80
Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr
85 90 95
Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu
100 105 110
Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala
115 120 125
Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn
130 135 140
Lys Leu Gly Glu AsnIle Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn
145 150 155 160
Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp
165 170 175
Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr
180 185 190
Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln
195 200 205
Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala
210 215 220
Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr
225 230 235 240
Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys
245 250 255
Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser
260 265 270
Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile
275 280 285
Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser
290 295 300
Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp
305 310 315 320
Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu
325 330 335
Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser
340 345 350
Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly
355 360 365
Ile Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser
370 375 380
Tyr Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys
385 390 395 400
Ala Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn
405 410 415
Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp
420 425 430
Ala Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr
435 440 445
Thr Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val
450 455 460
Gly Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr
465 470 475 480
Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro
485 490 495
Glu Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala
500 505 510
Val Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu
515 520 525
Ile Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val
530 535 540
Asp Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile
545 550 555 560
Met Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile
565 570 575
Arg Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn
580 585 590
Asn Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln
595 600 605
Ile Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser
610 615 620
Gly Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp
625 630 635 640
Tyr Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile
645 650 655
Phe Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu
660 665 670
Leu Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala
675 680 685
Gly Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu
690 695 700
Pro Gly Thr Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr
705 710 715 720
Ala Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr
725 730 735
Arg Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile
740 745 750
Val Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser
755 760 765
Asn Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly
770 775 780
Ala Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly
785 790 795 800
Lys Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala
805 8l0 815
Asp Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile
820 825 830
Ser Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu
835 840 845
His Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser
850 855 8 60
Leu Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys
865 870 875 880
Gly Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser
885 890 895
Asp Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu
900 905 910
Thr Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu
915 920 925
Tyr Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile
930 935 940
Gly Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu
945 950 955 960
Asn Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly
965 970 975
Gln Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu
980 985 990
Ala Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr
995 1000 1005
Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala
1010 1015 1020
Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly
1025 1030 1035 1040
Gly Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser
1045 1050 1055
Ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp
1060 1065 1070
Met Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala
1075 1080 1085
Val Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu
1090 1095 1100
Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln
1105 1110 1115 1120
Gly Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr
1125 1130 1135
Gly Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu
1140 1145 1150
Gln Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly
1155 1160 1165
Ile Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly
1170 1175 1180
Met Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp
1185 1190 1195 1200
Ser Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly
1205 1210 1215
Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser
1220 1225 1230
Arg Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr
1235 1240 1245
Leu Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala
1250 1255 1260
Thr Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys
1265 1270 1275 1280
Gln Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn
1285 1290 1295
Ser Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser
1300 1305 1310
Gln Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu
1315 1320 1325
Arg Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly
1330 1335 1340
Ala Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr
1345 1350 1355 1360
Arg Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp
1365 1370 1375
Asn Gly Leu Ala Arg Tyr Ser Tyr Ala G1y Ser Lys Gln Tyr Gly Asn
1380 1385 1390
His Ser Gly Arg Val Gly Val Gly Tyr Arg Phe
1395 1400
<210>59
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>59
cgcggatccg gagggggtgg tgtcg 25
<210>60
<211>27
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>60
cccgctcgag ttgcttggcg gcaaggc 27
<210>61
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>61
cgcggatccg gcggaggcgg cactt 25
<210>62
<211>26
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>62\
cccgctcgag gaaccggtag cctacg 26
<210>63
<211>41
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>63
cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41
<210>64
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>64
cccgctcgag cgtatcatat ttcacgtgc 29
<210>65
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>65
cgcggatccg gagggggtgg tgtcg 25
<210>66
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>66
cccgctcgag ttattgcttg gcggcaag 28
<210>67
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>67
cgcggatccg gcggaggcgg cactt 25
<210>68
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>68
cccgctcgag tcagaaccgg tagcctac 28
<210>69
<211>41
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>69
cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41
<210>70
<211>32
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>70
cccgctcgag ttacgtatca tatttcacgt gc 32
<210>71
<211>42
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>71
cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa cc42
<210>72
<211>30
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>72
cccaagcttt tcgggcggta ttcgggcttc 30
<210>73
<211>39
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>73
cgcggatccg gtggtggtgg tgccacctac aaagtggac 39
<210>74
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>74
gcccaagctt ttgtttggct gcctcgat 28
<210>75
<211>34
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>75
cgcggatccg gtggtggtgg tacaagcgac gacg 34
<210>76
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>76
gcccaagctt ccactcgtaa ttgacgcc 28
<210>77
<211>41
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>77
cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41
<210>78
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>78
cccaagcttc gtatcatatt tcacgtgc 28
<210>79
<211>44
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>79
cccaagcttg gtggtggtgg tggttcagat ttggcaaacg attc 44
<210>80
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>80
cccgctcgag cgtatcatat ttcacgtgc 29
<210>81
<211>45
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>81
cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc aaacc 45
<210>82
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>82
cccgctcgag cgggcggtat tcgggctt 28
<210>83
<211>32
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>83
cgcggatccg ctagccccga tgttaaatcg gc 32
<210>84
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>84
cggggatcca tcctgctctt ttttgccgg 29
<210>85
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>85
cgcggatccg ctagcggaca cacttatttc ggcatc 36
<210>86
<211>30
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>86
cgcggatccc cagcggtagc ctaatttgat 30
<210>87
<211>41
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>87
cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41
<210>88
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>88
cccaagcttc gtatcatatt tcacgtgc 28
<210>89
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>89
gcggcgtcga cggtggcgga ggcactggat cctcag 36
<210>90
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>90
ggaggcactg gatcctcaga tttggcaaac gattc 35
<210>91
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>91
cccgctcgag cgtatcatat ttcacgtgc 29
<210>92
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>92
cggggatccg ggggcggcgg tggcg 25
<210>93
<211>30
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>93
cccaagctta tcctgctctt ttttgccggc 30
<210>94
<211>42
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>94
cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa cc 42
<210>95
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>95
cccaagcttc gggcggtatt cgggcttc 28
<210>96
<211>26
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>96
ccccaagctt gggggcggcg gtggcg 26
<210>97
<211>31
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>97
cccgctcgag atcctgctct tttttgccgg c 31
<210>98
<211>45
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>98
cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc aaacc 45
<210>99
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>99
cccgctcgag cgggcggtat tcgggctt 28
<210>100
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>100
ggaggcactg gatccgcagc cacaaacgac gacga 35
<210>101
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>101
gcggcctcga gggtggcgga ggcactggat ccgcag 36
<210>102
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>102
cccgctcgag acccagcttg taaggttg 28
<210>103
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>103
ggaggcactg gatccgcagc cacaaacgac gacga 35
<210>104
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>104
gcggcctcga gggtggcgga ggcactggat ccgcag 36
<210>105
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>105
cccgctcgag ccactcgtaa ttgacgcc 28
<210>106
<211>38
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>106
gcggcctcga gggatccggc ggaggcggca cttctgcg 38
<210>107
<211>26
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>107
cccgctcgag gaaccggtag cctacg 26
<210>108
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>108
ggaggcactg gatcctcaga tttggcaaac gattc 35
<210>109
<211>37
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>109
gcggcgtcga cggtggcgga ggcactggat cctcaga 37
<210>110
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>110
cccgctcgag cgtatcatat ttcacgtgc 29
<210>111
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>111
gcggcctcga gggatccgga gggggtggtg tcgcc 35
<210>112
<211>25
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>112
cccgctcgag ttgcttggcg gcaag 25
<210>113
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>113
ggaggcactg gatccgcagc cacaaacgac gacga 35
<210>114
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>114
gcggcctcga gggtggcgga ggcactggat ccgcag 36
<210>115
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>115
cccgctcgag acccagcttg taaggttg 28
<210>116
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>116
ggaggcactg gatccgcagc cacaaacgac gacga 35
<210>117
<211>36
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>117
gcggcctcga gggtggcgga ggcactggat ccgcag 36
<210>118
<211>28
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>118
cccgctcgag ccactcgtaa ttgacgcc 28
<210>119
<211>35
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>119
ggaggcactg gatcctcaga tttggcaaac gattc 35
<210>120
<211>37
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>120
gcggcgtcga cggtggcgga ggcactggat cctcaga 37
<210>121
<211>29
<212>DNA
<213>人工序列
<220>
<223>寡核苷酸
<400>121
cccgctcgag cgtatcatat ttcacgtgc 29