一种抗病毒的融合蛋白及其编码基因与应用
技术领域
本发明涉及一种抗病毒的融合蛋白及其编码基因与应用。
背景技术
核糖核酸(RNA)是生命活动中不可缺少的重要分子,具有传递遗传信息、编码蛋白质等许多功能。有害RNA的产生导致多种疾病的发生。
HIV病毒是一种RNA病毒,其基因组RNA是病毒遗传信息的携带者,病毒RNA也是翻译病毒蛋白的模板。HIV所引起的艾滋病是当今世界上对人类健康威胁最大的流行性传染病。在过去的十几年中,各国投入了巨大的努力和巨额资金致力于研制开发抗艾滋病的药物,目前世界上批准用于临床的抗艾滋病药物已有近二十种。但是,已有的任何一种药物都不能完全清除人体内的HIV病毒,几种药物组合使用的效果会优于单一药物治疗,但也不能完全清除HIV病毒。HIV在复制过程中有很高的突变率,病人体内的病毒呈异源性,其中一些病毒具有抗药性,当治疗进行到一定阶段时,这些抗药病毒的数量会逐渐占据优势,使得现有的治疗无效。而且这些抗病毒药物的长期毒性已在临床上显示出来。开发一种无副作用而且不会产生抗药性的抗艾滋病药物是本领域每一个科学家的梦想。抗艾滋病药物的副作用来源于药物作用的不专一性;抗药性的产生则是因为HIV病毒具有非常高的突变率,使作为药物靶点的病毒蛋白可以通过少至一个氨基酸的改变而逃避药物的抑制作用,但不影响病毒蛋白自身的功能。因此,理想的抗艾滋病药物应对HIV病毒具有极高的特异性,对作用靶点的突变又有一定的包容性。
HIV是一种逆转录病毒,在感染过程中首先通过病毒表面包膜蛋白(envelope)与其受体的相互作用吸附到细胞表面,然后病毒质膜与细胞质膜融合,裸露的病毒颗粒进入到细胞中。在细胞质中病毒的RNA被逆转录成双链DNA,这段DNA存在于由病毒蛋白及宿主蛋白组成的整合前复合体(Pre-Integration Complex,简称PIC)中。
PIC由细胞质膜附近迁移到核膜附近并进入细胞核。在细胞核中病毒DNA由病毒的整合酶整合到宿主细胞的染色体中。上述过程度被称为逆转录病毒生命周期的前期。整合的病毒DNA被转录成RNA,其中大部分经过剪切后作为模板翻译产生病毒蛋白,小部分全长RNA作为病毒遗传物质被病毒蛋白包装成病毒颗粒出芽到细胞外成为未成熟病毒。在未成熟病毒中一些病毒蛋白以融合蛋白形式存在,经过病毒蛋白酶剪切后病毒颗粒结构重组成为有感染性的成熟病毒,此过程称为逆转录病毒生命周期的后期。
在HIV复制过程中,病毒自身编码的Tat蛋白发挥着重要作用。Tat是一种RNA结合蛋白,能够特异性结合到HIV病毒RNA上的TAR序列:gggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagct(ROBERT A.MARCINIAK,MARIANO A.GARCIA-BLANCO,AND PHILLIP A.SHARP.(1990)Identification and characterization of a HeLa nuclear protein thatspecifically binds to the trans-activation-response(TAR)element of humanimmunodeficiency virus.PNAS,Vol.87,pp.3624-3628.)。Tat通过结合TAR序列实现HIV基因组的持续转录,是HIV生命周期中不可缺少的一环。
ZAP是一个抗病毒因子,通过在胞质中特异性降解病毒RNA从而抑制某些病毒的复制,如鼠白血病病毒(MLV)、辛德比斯病毒(SIN)。但是ZAP对HIV病毒作用并不明显。ZAP作用机理的研究结果表明,ZAP本身不具有RNA酶的活性,但是能特异性结合到病毒RNA的特定序列上,并招募细胞内具有降解RNA功能的蛋白复合物Exosome而特异性地降解所结合的RNA。
发明内容
本发明的目的是提供一种抗病毒的融合蛋白及其编码基因与应用。
本发明所提供的抗病毒的融合蛋白,名称为ZAP-Tat,是具有下述氨基酸残基序列之一的蛋白质:
1)序列表中的SEQ ID №:1;
2)将序列表中SEQ ID №:1的氨基酸残基序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且可特异性降解HIV病毒RNA的蛋白质。
所述一个或几个氨基酸残基的取代和/或缺失和/或添加是指不多于十个氨基酸残基的取代和/或缺失和/或添加。
序列表中的SEQ ID №:1由904个氨基酸残基组成。自氨基端的第1位至776位氨基酸残基为ZAP,自氨基端的第777位至805位氨基酸残基为linker,自氨基端的第806位至881位氨基酸残基为Tat,自氨基端的第884位至893位氨基酸残基为myc,自氨基端的第899位至904位氨基酸残基为6×HIS。
上述抗病毒的融合蛋白(ZAP-Tat)的编码基因也属于本发明的保护范围。
抗病毒的融合蛋白(ZAP-Tat)的编码基因,可具有下述核苷酸序列之一:
1)序列表中SEQ ID №:2的核苷酸序列;
2)编码序列表中SEQ ID №:1蛋白质序列的DNA;
3)在高严谨条件下可与序列表中的SEQ ID №:2限定的DNA序列杂交的核苷酸序列。
序列表中的SEQ ID №:2由2712个核苷酸组成。自5′端的第1位至2328位核苷酸为ZAP编码序列,自5′端的第2329位至2415位核苷酸为linker的编码序列,自5′端的第2416位至2643位核苷酸为Tat的编码序列,自5′端的第2650位至2679位核苷酸为myc的编码序列,自5′端的第2695位至2712位核苷酸为myc的编码序列。
所述高严谨条件可为用0.1×SSPE(或0.1×SSC),0.1%SDS的溶液,在65℃下杂交并洗膜。
含有本发明基因的表达载体、细胞系及宿主菌均属于本发明的保护范围。
本发明的融合蛋白包括可与病毒TAR序列特异结合的RNA结合蛋白亚基(Tat)和具有降解RNA功能的蛋白亚基(ZAP),该融合蛋白通过RNA结合蛋白亚基结合到特定的靶RNA上,RNA降解亚基招募细胞内的RNA酶,降解融合蛋白所结合的特定RNA。本发明的融合蛋白对HIV病毒具有明显的抑制效果可用于制备抗病毒的药物,如制备预防和/或治疗艾滋病的药物。
附图说明
图1为ZAP-Tat等质粒酶切电泳图。
图2为利用Western免疫杂交方法检测ZAP-Tat等在细胞内的表达。
图3为分析ZAP-Tat对HIV载体表达的抑制效果。
具体实施方式
下述实施例中的实验方法,如无特别说明,均为常规方法。
实施例1、融合蛋白ZAP-Tat的表达
本实施例中,首先在ZAP的C-端接上联接肽序列(命名为ZAP-linker),并证明了该联接肽序列不影响ZAP的功能。然后,将Tat通过联接肽序列(linker)融合到ZAP的C-端,成为ZAP-Tat。使用联接肽序列的目的是将ZAP和Tat两个结构域分开,使之不互相干扰彼此的活性。作为对照,还构建了N-端含有69个来自ZAP的氨基酸残基的Tat(命名为dTat)。
1、linker的制备
利用化学合成方法合成5′→3′链具有以下序列的一段双链DNA:5’-GGT GGAGGC TCT GGG GGA GGT ACA GGA GGT GGC TCA GGT GGG GGC ACT GGT GGA GGC AGT GGAGGT GGC-3’。该序列编码一段联接肽序列。然后对其用如下的引物进行PCR扩增,所用引物引入克隆所需的酶切位点:
上游引物:linker sp:
5’-ATATAG
GCGGCCG TGGTGGAGGCTCTGG-3’(下划线部分核苷酸为Not I酶切识别位点,方框内的核苷酸序列为Xho I酶切识别位点)
下游引物:linker ap:
5’-ATATAG
GGGCCCTTGGCGCC
GCCACCTCCACTGCCTC-3’(下划线部分核苷酸为Apa I酶切识别位点,方框内的核苷酸序列为EcoR V酶切识别位点,GGCGCC为Kas I酶切识别位点)
PCR的反应体系为:
1μl模板DNA(5pM/μl),1μl上游引物(50μM),1μl下游引物(50μM),1μl Tag酶,4μl dNTP混合物(2.5mM),5μl PCR反应缓冲液(10×),37μl dH2O。
PCR的反应程序为:94℃退火2min,然后94℃,30sec;55℃,30sec;72℃,30sec共25个循环。
得到的PCR产物进行凝胶电泳,结果表明得到124bp的条带,结果如图1中的泳道2所示,回收该条带的DNA,对其测序表明所获得的PCR片段即包含Not I-Xho I—联接序列-Eco RV-Kas I-Apa I的DNA片段。把该DNA片段命名为linker。
图1中,泳道1和9是DL2000 plus的marker;泳道2是linker的PCR产物;泳道3是Tat的PCR产物;泳道4是pNZAP-Tat用Xba I和Nhe I双酶切的片段;泳道5是pZAP-Tat用Nhe I和Not I双酶切的片段;泳道6是pNZAP linker用EcoR I和Not I双酶切的片段;泳道7是pZAPlinker用Nhe I和Not I双酶切的片段;泳道8是pdTat用Xho I和EcoR I双酶切的片段。
2、在质粒pcDNA4/TO/myc-His B中插入linker:
pcDNA4/TO/myc-His B(从Invitrogen购买),用Not I和Apa I双酶切pcDNA4/TO/myc-His B,切胶回收得到5133bp的大片段,将步骤1所获得的PCR片段也用Not I和Apa I双酶切,切胶回收得到109bp片段,二者连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养。提取质粒,用引物linker sp和linker ap进行PCR验证,得到124bp片段,结果如图1所示。对质粒进行测序,结果表明该质粒含有该段linker序列,将该质粒命名为pcDNA4/TO/linker/myc-His B。
3、在质粒pcDNA4/TO/linker/myc-His B中插入NZAP序列:
将pcDNA4/TO/linker/myc-His B用EcoRI和Not I双酶切,切胶回收得到5215bp的大片段。质粒pNZAP-myc(pNZAP-myc的构建方法见:Guangxia Gao,Xuemin Guo,Stephen P.Goff1.(2002)Inhibition of Retroviral RNA Production by ZAP,aCCCH-Type Zinc Finger Protein.SCIENCE VOL 297,1703-1706),含有序列表中序列14(编码序列表中序列13的NZAPmyc)的NZAPmyc编码基因,也用EcoRI和NotI双酶切,切胶回收得到1088的片段,二者连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养。提取质粒,用EcoRI和NotI双酶切得到了5215bp的pcDNA4/TO/linker/myc-His B片段和1088bp的NZAP片段,结果如图1中泳道6所示。对质粒进行测序,结果表明该质粒含有序列表中序列6的核苷酸序列,编码序列表中序列5的NZAP-linker-myc,将该质粒命名为pNZAPlinker。
4、在载体pNZAPlinker上插入Tat蛋白编码序列
以质粒pCMVTat(pCMVTat的构建方法见文献:NARASIMHACHAR SRINIVASAKUMAR,NATHALIE CHAZAL,C.HELGA-MARIA,SUSAN PRASAD,MARIE-LOUISE HAMMARSKJOLD,ANDDAVID REKOSH.(1997)The Effect of Viral Regulatory Protein Expression on GeneDelivery bv Human Immunodeficiency Virus Type 1 Vectors Produced in StablePackaging Cell Lines.JVI,Vol.71,No.8,5841-5848)作为模板,然后对其用如下的引物进行PCR扩增,所用引物引入克隆所需的酶切位点:
上游引物:TAT SP:
5’-GGCCGCGCTGCTGAGCC-3’
下游引物:TAT AP:
5’-ATATAG
TTCGAACCGCGGCTGCTTTGATAGA-3’(下划线部分核苷酸为BstBI酶切识别位点)
PCR的反应体系为:
1μl模板(0.1μg/μl),1μl上游引物(50μM),1μl下游引物(50μM),0.25μl Taq酶,4μl dNTP混合物(2.5mM),5μl PCR buffer(10×),37.75μl dH20。
PCR的反应程序为:
94℃退火2min,然后94℃,30sec;55℃,30sec;72℃,30sec共25个循环。
得到的PCR产物进行凝胶电泳,结果表明得到243bp的条带,结果如图1中的泳道3所示,回收该条带的DNA,用BstB I酶切,切胶回收含有Tat基因的小片段233bp;载体pNZAPlinker用EcoR V和BstB I酶切,切胶回收含有NZAP和linker编码基因的大片段6271bp,二者连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养,提取质粒,用Xba I和Nhe I双酶切,得到6027bp和477bp的片段,结果如图1中泳道4所示。该质粒命名为pNZAP-Tat,含有序列表中SEQ ID №:10的核苷酸序列,编码序列表中序列9的NZAP-Tat/myc。
5、ZAP-Tat的表达载体pZAP-Tat的构建
将pNZAP-Tat用Afl II和Not I双酶切,凝胶电泳回收5372bp大片段作为载体,pZAP-myc(pZAP-myc的构建方法见:Guangxia Gao,Xuemin Guo,Stephen P.Goff1.(2002)Inhibition of Retroviral RNA Production by ZAP,a CCCH-Type ZincFinger Protein.SCIENCE VOL 297,1703-1706)用Afl II和Not I双酶切,回收2701bp的ZAP片段,与上面制备的载体连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养,提取质粒用NheI和Not I双酶切,得到6141bp的载体片段和1932bp片段,结果如图1中泳道5所示,该质粒命名为pZAP-Tat,含有序列表中序列2的融合蛋白ZAP-Tat的编码基因。
6、ZAP、ZAP-linker、dTat的表达质粒的构建
含有序列表中序列4(编码序列表中序列3的ZAP-myc)的ZAP-myc编码基因的质粒pZAP-myc的构建方法见文献:Guangxia Gao,Xuemin Guo,Stephen P.Goff1.(2002)Inhibition of Retroviral RNA Production by ZAP,a CCCH-Type Zinc FingerProtein.SCIENCE VOL 297.1703-1706。
将质粒pNZAP linker用Afl II和Not I双酶切,凝胶电泳回收5162bp大片段作为载体,质粒pZAP-myc也用Afl II和Not I双酶切,回收2701bp的ZAP片段,二者连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养,提取质粒用Nhe I和Not I双酶切,得到5931bp的载体片段和1932bp的片段,结果如图1中泳道7所示。该质粒命名为pZAPlinker,含有序列表中序列8的核苷酸序列,编码序列表中序列7的ZAPlinker-myc。将质粒pZAP-Tat用Xho I单酶切,回收5651bp的片段,用Klenow(New EnglandBiolab)补平,自身连接,转化大肠杆菌TOP10,在含有50μg/ml氨苄青霉素的LB固体培养基上培养12小时,挑取单克隆进行培养,提取质粒,用Xho I和Not I双酶切验证,结果如图1中泳道8所示。对用Xho I和EcoR I双酶切为单一片段的质粒测序,结果表明该质粒含有该段dTat-myc序列,将该质粒命名为pdTat,含有序列表中序列12的核苷酸序列,编码序列表中序列11的dTat-myc。
7、ZAP、ZAP-linker、dTat、ZAP-Tat的表达
将pZAP-myc、pZAP linker、pdTat、pZAP-Tat分别用磷酸钙法转染入293T细胞(购自ATCC),质粒用量为5μg/3.5cm培养皿,8小时后换液,用PBS缓冲液洗一次,加入新鲜培养基(DMEM∶FBS(胎牛血清)∶PS(青霉素和链霉素)=100∶10∶1(体积比)),48小时后收集细胞,裂解后进行SDS-PAGE电泳,然后将蛋白转到PVDF膜上,用myc抗体(9E10)(购自Sant Cruz)进行Western免疫检测,检测到ZAP、ZAP-linker、dTat、ZAP-Tat的蛋白表达,结果如图2所示。
实施例2、抗HIV功能检测
实施例1中,利用Western免疫杂交方法,首先证明了ZAP、ZAP-linker、dTat、ZAP-Tat在细胞内的表达,本实施例分析了上述蛋白对HIV RNA的抑制情况,方法如下:
将表达ZAP、ZAP linker、dTat、ZAP-Tat的质粒与携带荧光素酶报告基因的HIV载体pHR’-CMV-luc共表达(pHR’-CMV-luc含有HIV的大部分骨架序列,包括5’和3’LTR,5’剪切位点,包装信号,部分的gag基因编码序列,3’剪切位点等,pHR’-CMV-luc的构建方法见David Derse,Shawn A.Hill,Patricia A.Lloyd,Hye-kyung Chung,and Barry A.Morse.Examining Human T-Lymphotropic VirusType 1 Infection and Replication by Cell-Free Infection with RecombinantVirus Vectors.J Virol.2001 September;75(18):8461-8468),分析上述蛋白对HIV载体表达的抑制情况。为了检测转染效率,使用了表达海肾(Renilla)萤光素酶的质粒pRL-TK(购自Promega)作为转染效率的内参。pHR’-CMV-luc和pRL-TK分别与下述四种质粒中的一种用磷酸钙法共转染293A细胞(购自ATCC):
pcDNA4/TO/myc-His B,pZAP-myc,pZAP linker和pZAP-Tat。所用质粒的质量比为:pcDNA4/TO/myc-His B或pZAP-myc或pZAP-linker或pZAP-Tat∶pHR’-CMV-luc∶pRL-TK=100∶10∶1。先用胰蛋白酶消化以便收获生长到对数期的293A细胞,以5×105个细胞/3.5cm培养皿的数量重新种入3.5cm培养皿,加入新鲜DMEM培养基于37℃、5%CO2的培养箱中培养16-24小时。然后用质粒pcDNA4/TO/myc-His B或pZAP或pZAP-linker或pZAP-Tat:5μg,pHR’-CMV-luc:0.5μg,pRL-TK:0.05μg的量,磷酸钙法分别转染细胞。转染后8小时,吸出培养皿中的培养基,用PBS缓冲液洗一次,重新加入新鲜培养基,培养48小时后,用Promega公司的Dual-luciferase试剂盒检测荧光素酶活性,分别得到萤火虫荧光素酶和海肾萤光素酶的荧光值。海肾萤光素酶的数值用来作为内参,表明转染效率的高低,萤火虫荧光素酶的数值作为报告参数,表明在某一转染效率下报告基因表达量的高低。萤火虫荧光素酶和海肾(Renilla)萤光素酶的荧光值比值是将每组数值做归一化处理,便于与某一标准值比较。以空质粒pcDNA4/TO/myc-His B共转染组的萤火虫荧光素酶和海肾萤光素酶的荧光值的比值作为对照,其它组的比值去除该值,所得比值即为抑制倍数,即:抑制倍数=空质粒共转染组比值/功能蛋白共转染组比值。结果如图3所示,表明pZAP-Tat表达的ZAP-Tat对HIV的抑制作用达到16倍以上,其它质粒表达的蛋白对HIV的抑制作用不足4倍。图3中,pcDNA4表示pcDNA4/TO/myc-HisB和pHR’-CMV-luc和pRL-TK共转染组;ZAP表示pZAP-myc和pHR’-CMV-luc和pRL-TK共转染组;ZAPlinker表示pZAPlinker和pHR’-CMV-luc和pRL-TK共转染组;ZAP-Tat表示pZAP-Tat和pHR’-CMV-luc和pRL-TK共转染组。
序列表
<160>14
<210>1
<211>904
<212>PRT
<213>人工序列
<220>
<223>
<400>1
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp Val Val Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Phe Leu His Asn
245 250 255
Ser Leu Glu Phe Leu Ser Pro Val Val Ser Pro Leu Gly Ser Gly Pro
260 265 270
Pro Ser Pro Asp Val Thr Ser Cys Lys Asp Ser Leu Glu Asp Val Ser
275 280 285
Val Asp Val Thr Gln Lys Phe Lys Tyr Leu Gly Thr His Asp Arg Ala
290 295 300
Gln Leu Ser Pro Val Ser Ser Lys Ala Ala Gly Val Gln Gly Pro Ser
305 310 315 320
Gln Met Arg Ala Ser Gln Glu Phe Ser Glu Asp Gly Asn Leu Asp Asp
325 330 335
Ile Phe Ser Arg Asn Arg Ser Asp Ser Ser Ser Ser Arg Ala Ser Ala
340 345 350
Ala Lys Val Ala Gln Arg Asn Glu Ala Val Ala Met Lys Met Gly Met
355 360 365
Glu Val Lys Gly Lys Lys Glu Ala Pro Asp Ile Asp Arg Val Pro Phe
370 375 380
Leu Asn Ser Tyr Ile Asp Gly Val Thr Met Glu Lys Ala Ser Val Ser
385 390 395 400
Gly Ile Pro Gly Lys Lys Phe Thr Ala Asn Asp Leu Glu Asn Leu Leu
405 410 415
Leu Leu Asn Asp Thr Trp Lys Asn Val Ala Lys Pro Gln Asp Leu Gln
420 425 430
Thr Thr Gly Arg Ile Thr Asp Ser Gly Gln Asp Lys Ala Phe Leu Gln
435 440 445
Asn Lys Tyr Gly Gly Asn Pro Val Trp Ala Ser Ala Ser Thr His Asn
450 455 460
Ala Pro Asn Gly Ser Ser Gln Ile Met Asp Glu Thr Pro Asn Val Ser
465 470 475 480
Lys Ser Ser Thr Ser Gly Phe Ala Ile Lys Pro Ala Ile Ala Gly Gly
485 490 495
Lys Glu Ala Val Tyr Ser Gly Val Gln Ser Pro Arg Ser Gln Val Leu
500 505 510
Ala Val Pro Gly Glu Ala Thr Thr Pro Val Gln Ser Asn Arg Leu Pro
515 520 525
Gln Ser Pro Leu Ser Ser Ser Ser His Arg Ala Ala Ala Ser Gly Ser
530 535 540
Pro Gly Lys Asn Ser Thr His Thr Ser Val Ser Pro Ala Ile Glu Ser
545 550 555 560
Ser Arg Met Thr Ser Asp Pro Asp Glu Tyr Leu Leu Arg Tyr Ile Leu
565 570 575
Asn Pro Leu Phe Arg Met Asp Asn His Gly Pro Lys Glu Ile Cys Gln
580 585 590
Asp His Leu Tyr Lys Gly Cys Gln Gln Ser His Cys Asp Arg Ser His
595 600 605
Phe His Leu Pro Tyr Arg Trp Gln Met Phe Val Tyr Thr Thr Trp Arg
610 615 620
Asp Phe Gln Asp Met Glu Ser Ile Glu Gln Ala Tyr Cys Asp Pro His
625 630 635 640
Val Glu Leu Ile Leu Ile Glu Asn His Gln Ile Asn Phe Gln Lys Met
645 650 655
Thr Cys Asp Ser Tyr Pro Ile Arg Arg Leu Ser Thr Pro Ser Tyr Glu
660 665 670
Glu Lys Pro Leu Ser Ala Val Phe Ala Thr Lys Trp Ile Trp Tyr Trp
675 680 685
Lys Asn Glu Phe Asn Glu Tyr Ile Gln Tyr Gly Asn Glu Ser Pro Gly
690 695 700
His Thr Ser Ser Asp Ile Asn Ser Ala Tyr Leu Glu Ser Phe Phe Gln
705 710 715 720
Ser Cys Pro Arg Gly Val Leu Pro Phe Gln Ala Gly Ser Gln Lys Tyr
725 730 735
Glu Leu Ser Phe Gln Gly Met Ile Gln Thr Asn Ile Ala Ser Lys Thr
740 745 750
Gln Arg His Val Val Arg Arg Pro Val Phe Val Ser Ser Asn Asp Val
755 760 765
Glu Gln Lys Arg Arg Gly Pro Glu Gly Gly Arg Ser Ser Gly Gly Gly
770 775 780
Ser Gly Gly Gly Thr Gly Gly Gly Ser Gly Gly Gly Thr Gly Gly Gly
785 790 795 800
Ser Gly Gly Gly Asp Gly Arg Ala Ala Glu Pro Val Asp Pro Ser Leu
805 810 815
Glu Pro Trp Lys His Pro Gly Ser Gln Pro Arg Thr Ala Cys Asn Asn
820 825 830
Cys Tyr Cys Lys Lys Cys Cys Phe His Cys Tyr Ala Cys Phe Thr Arg
835 840 845
Lys Gly Leu Gly Ile Ser Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg
850 855 860
Arg Ala His Gln Asn Ser Gln Thr His Gln Ala Ser Leu Ser Lys Gln
865 870 875 880
Pro Arg Phe Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Met His
885 890 895
Thr Gly His His His His His His
900
<210>2
<211>2712
<212>DNA
<213>人工序列
<220>
<223>
<400>2
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgcttcc ttcacaacag tctagaattt 780
ctctcacctg ttgtctcacc tctgggatct ggtccgccta gcccagatgt caccagctgt 840
aaagattccc tggaggatgt gtctgtggat gtcacccaga agttcaagta cttggggacg 900
catgaccgtg cgcagctctc cccagtctca tctaaggctg ctggtgttca aggacccagt 960
caaatgagag caagccaaga gttttcagag gatgggaatc tagatgacat attttctagg 1020
aatcgttctg attcatcatc aagtcgagcc tccgctgcca aggtggcaca aagaaatgaa 1080
gctgtggcca tgaaaatggg catggaggtc aagggcaaga aggaggctcc agacatcgat 1140
cgggtcccat ttttaaatag ttatattgat ggggtgacca tggaaaaagc atcggtctca 1200
ggaattccag gcaaaaagtt cacagccaat gatctggaaa atttgctatt acttaacgac 1260
acttggaaga atgtggctaa gccccaggat ctgcagacca caggcagaat cactgacagt 1320
ggccaagaca aggcattcct gcagaataaa tatggaggaa acccagtgtg ggcaagtgca 1380
tccacccata atgccccaaa tggctctagt caaattatgg atgaaactcc taatgtctct 1440
aaaagtagta ccagtggttt tgccataaaa ccagcaattg ctggaggaaa agaagcagtc 1500
tattctggag ttcagagtcc gagaagccag gtcctagctg tgcctgggga ggctactacc 1560
cctgtacaga gcaacaggct gcctcagtcg cctctgtctt cctcaagcca cagagctgca 1620
gcctctggga gccctggcaa gaactccacc catacctctg tgagcccagc catcgagtct 1680
tcaaggatga catcagaccc cgatgagtat ctcctacgct acatcctaaa tcctttattt 1740
aggatggata atcatggccc gaaggaaatc tgtcaggacc atctgtacaa gggctgtcaa 1800
cagagccact gcgacaggag tcacttccat ctgccctacc ggtggcagat gttcgtatat 1860
accacttgga gggacttcca ggacatggag tctatcgaac aggcctattg tgatccccac 1920
gttgaactca ttttgataga aaaccatcag atcaatttcc agaaaatgac ctgtgactcc 1980
taccccatcc gacgcctctc cactccctca tatgaggaaa agccacttag tgctgtcttc 2040
gccaccaagt ggatttggta ttggaagaat gaatttaatg aatatatcca gtatgggaat 2100
gagagcccag gccacaccag ctctgacatc aactctgcgt acctggagtc tttcttccag 2160
tcttgtccca ggggagtttt gccattccag gctggttcac agaagtacga gttaagcttc 2220
caagggatga ttcagacaaa tatagcttcc aagactcaaa ggcatgttgt cagaaggcca 2280
gtatttgttt cttcgaacga tgtggagcag aagagaagag gtccagaggg cggccgctcg 2340
agtggtggag gctctggggg aggtacagga ggtggctcag gtgggggcac tggtggaggc 2400
agtggaggtg gcgatggccg cgctgctgag ccagtagatc ctagtctaga gccctggaag 2460
catccaggaa gtcagcctag gactgcttgt aacaattgct attgtaaaaa gtgttgcttt 2520
cattgctacg cgtgtttcac aagaaaaggc ttaggcatct cctatggcag gaagaagcgg 2580
agacagcgac gaagagctca tcagaacagt cagactcatc aagcttctct atcaaagcag 2640
ccgcggttcg aacaaaaact catctcagaa gaggatctga atatgcatac cggtcatcat 2700
caccatcacc at 2712
<210>3
<211>808
<212>PRT
<213>人工序列
<220>
<223>
<400>3
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp Val Val Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Phe Leu His Asn
245 250 255
Ser Leu Glu Phe Leu Ser Pro Val Val Ser Pro Leu Gly Ser Gly Pro
260 265 270
Pro Ser Pro Asp Val Thr Ser Cys Lys Asp Ser Leu Glu Asp Val Ser
275 280 285
Val Asp Val Thr Gln Lys Phe Lys Tyr Leu Gly Thr His Asp Arg Ala
290 295 300
Gln Leu Ser Pro Val Ser Ser Lys Ala Ala Gly Val Gln Gly Pro Ser
305 310 315 320
Gln Met Arg Ala Ser Gln Glu Phe Ser Glu Asp Gly Asn Leu Asp Asp
325 330 335
Ile Phe Ser Arg Asn Arg Ser Asp Ser Ser Ser Ser Arg Ala Ser Ala
340 345 350
Ala Lys Val Ala Gln Arg Asn Glu Ala Val Ala Met Lys Met Gly Met
355 360 365
Glu Val Lys Gly Lys Lys Glu Ala Pro Asp Ile Asp Arg Val Pro Phe
370 375 380
Leu Asn Ser Tyr Ile Asp Gly Val Thr Met Glu Lys Ala Ser Val Ser
385 390 395 400
Gly Ile Pro Gly Lys Lys Phe Thr Ala Asn Asp Leu Glu Asn Leu Leu
405 410 415
Leu Leu Asn Asp Thr Trp Lys Asn Val Ala Lys Pro Gln Asp Leu Gln
420 425 430
Thr Thr Gly Arg Ile Thr Asp Ser Gly Gln Asp Lys Ala Phe Leu Gln
435 440 445
Asn Lys Tyr Gly Gly Asn Pro Val Trp Ala Ser Ala Ser Thr His Asn
450 455 460
Ala Pro Asn Gly Ser Ser Gln Ile Met Asp Glu Thr Pro Asn Val Ser
465 470 475 480
Lys Ser Ser Thr Ser Gly Phe Ala Ile Lys Pro Ala Ile Ala Gly Gly
485 490 495
Lys Glu Ala Val Tyr Ser Gly Val Gln Ser Pro Arg Ser Gln Val Leu
500 505 510
Ala Val Pro Gly Glu Ala Thr Thr Pro Val Gln Ser Asn Arg Leu Pro
515 520 525
Gln Ser Pro Leu Ser Ser Ser Ser His Arg Ala Ala Ala Ser Gly Ser
530 535 540
Pro Gly Lys Asn Ser Thr His Thr Ser Val Ser Pro Ala Ile Glu Ser
545 550 555 560
Ser Arg Met Thr Ser Asp Pro Asp Glu Tyr Leu Leu Arg Tyr Ile Leu
565 570 575
Asn Pro Leu Phe Arg Met Asp Asn His Gly Pro Lys Glu Ile Cys Gln
580 585 590
Asp His Leu Tyr Lys Gly Cys Gln Gln Ser His Cys Asp Arg Ser His
595 600 605
Phe His Leu Pro Tyr Arg Trp Gln Met Phe Val Tyr Thr Thr Trp Arg
610 615 620
Asp Phe Gln Asp Met Glu Ser Ile Glu Gln Ala Tyr Cys Asp Pro His
625 630 635 640
Val Glu Leu Ile Leu Ile Glu Asn His Gln Ile Asn Phe Gln Lys Met
645 650 655
Thr Cys Asp Ser Tyr Pro Ile Arg Arg Leu Ser Thr Pro Ser Tyr Glu
660 665 670
Glu Lys Pro Leu Ser Ala Val Phe Ala Thr Lys Trp Ile Trp Tyr Trp
675 680 685
Lys Asn Glu Phe Asn Glu Tyr Ile Gln Tyr Gly Asn Glu Ser Pro Gly
690 695 700
His Thr Ser Ser Asp Ile Asn Ser Ala Tyr Leu Glu Ser Phe Phe Gln
705 710 715 720
Ser Cys Pro Arg Gly Val Leu Pro Phe Gln Ala Gly Ser Gln Lys Tyr
725 730 735
Glu Leu Ser Phe Gln Gly Met Ile Gln Thr Asn Ile Ala Ser Lys Thr
740 745 750
Gln Arg His Val Val Arg Arg Pro Val Phe Val Ser Ser Asn Asp Val
755 760 765
Glu Gln Lys Arg Arg Gly Pro Glu Gly Gly Arg Ser Ser Leu Glu Gly
770 775 780
Pro Arg Phe Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Met His
785 790 795 800
Thr Gly His His His His His His
805
<210>4
<211>2424
<212>DNA
<213>人工序列
<220>
<223>
<400>4
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgcttcc ttcacaacag tctagaattt 780
ctctcacctg ttgtctcacc tctgggatct ggtccgccta gcccagatgt caccagctgt 840
aaagattccc tggaggatgt gtctgtggat gtcacccaga agttcaagta cttggggacg 900
catgaccgtg cgcagctctc cccagtctca tctaaggctg ctggtgttca aggacccagt 960
caaatgagag caagccaaga gttttcagag gatgggaatc tagatgacat attttctagg 1020
aatcgttctg attcatcatc aagtcgagcc tccgctgcca aggtggcaca aagaaatgaa 1080
gctgtggcca tgaaaatggg catggaggtc aagggcaaga aggaggctcc agacatcgat 1140
cgggtcccat ttttaaatag ttatattgat ggggtgacca tggaaaaagc atcggtctca 1200
ggaattccag gcaaaaagtt cacagccaat gatctggaaa atttgctatt acttaacgac 1260
acttggaaga atgtggctaa gccccaggat ctgcagacca caggcagaat cactgacagt 1320
ggccaagaca aggcattcct gcagaataaa tatggaggaa acccagtgtg ggcaagtgca 1380
tccacccata atgccccaaa tggctctagt caaattatgg atgaaactcc taatgtctct 1440
aaaagtagta ccagtggttt tgccataaaa ccagcaattg ctggaggaaa agaagcagtc 1500
tattctggag ttcagagtcc gagaagccag gtcctagctg tgcctgggga ggctactacc 1560
cctgtacaga gcaacaggct gcctcagtcg cctctgtctt cctcaagcca cagagctgca 1620
gcctctggga gccctggcaa gaactccacc catacctctg tgagcccagc catcgagtct 1680
tcaaggatga catcagaccc cgatgagtat ctcctacgct acatcctaaa tcctttattt 1740
aggatggata atcatggccc gaaggaaatc tgtcaggacc atctgtacaa gggctgtcaa 1800
cagagccact gcgacaggag tcacttccat ctgccctacc ggtggcagat gttcgtatat 1860
accacttgga gggacttcca ggacatggag tctatcgaac aggcctattg tgatccccac 1920
gttgaactca ttttgataga aaaccatcag atcaatttcc agaaaatgac ctgtgactcc 1980
taccccatcc gacgcctctc cactccctca tatgaggaaa agccacttag tgctgtcttc 2040
gccaccaagt ggatttggta ttggaagaat gaatttaatg aatatatcca gtatgggaat 2100
gagagcccag gccacaccag ctctgacatc aactctgcgt acctggagtc tttcttccag 2160
tcttgtccca ggggagtttt gccattccag gctggttcac agaagtacga gttaagcttc 2220
caagggatga ttcagacaaa tatagcttcc aagactcaaa ggcatgttgt cagaaggcca 2280
gtatttgttt cttcgaacga tgtggagcag aagagaagag gtccagaggg cggccgctcg 2340
agtctagagg gcccgcggtt cgaacaaaaa ctcatctcag aagaggatct gaatatgcat 2400
accggtcatc atcaccatca ccat 2424
<210>5
<211>311
<212>PRT
<213>人工序列
<220>
<223>
<400>5
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp Val Val Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Ser Cys Gly Arg
245 250 255
Ser Ser Gly Gly Gly Ser Gly Gly Gly Thr Gly Gly Gly Ser Gly Gly
260 265 270
Gly Thr Gly Gly Gly Ser Gly Gly Gly Asp Ile Gly Ala Lys Gly Pro
275 280 285
Arg Phe Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Met His Thr
290 295 300
Gly His His His His His His
305 310
<210>6
<211>933
<212>DNA
<213>人工序列
<220>
<223>
<400>6
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgttcct gcggccgctc gagtggtgga 780
ggctctgggg gaggtacagg aggtggctca ggtgggggca ctggtggagg cagtggaggt 840
ggcgatatcg gcgccaaggg cccgcggttc gaacaaaaac tcatctcaga agaggatctg 900
aatatgcata ccggtcatca tcaccatcac cat 933
<210>7
<211>834
<212>PRT
<213>人工序列
<220>
<223>
<400>7
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp Val Val Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Phe Leu His Asn
245 250 255
Ser Leu Glu Phe Leu Ser Pro Val Val Ser Pro Leu Gly Ser Gly Pro
260 265 270
Pro Ser Pro Asp Val Thr Ser Cys Lys Asp Ser Leu Glu Asp Val Ser
275 280 285
Val Asp Val Thr Gln Lys Phe Lys Tyr Leu Gly Thr His Asp Arg Ala
290 295 300
Gln Leu Ser Pro Val Ser Ser Lys Ala Ala Gly Val Gln Gly Pro Ser
305 310 315 320
Gln Met Arg Ala Ser Gln Glu Phe Ser Glu Asp Gly Asn Leu Asp Asp
325 330 335
Ile Phe Ser Arg Asn Arg Ser Asp Ser Ser Ser Ser Arg Ala Ser Ala
340 345 350
Ala Lys Val Ala Gln Arg Asn Glu Ala Val Ala Met Lys Met Gly Met
355 360 365
Glu Val Lys Gly Lys Lys Glu Ala Pro Asp Ile Asp Arg Val Pro Phe
370 375 380
Leu Asn Ser Tyr Ile Asp Gly Val Thr Met Glu Lys Ala Ser Val Ser
385 390 395 400
Gly Ile Pro Gly Lys Lys Phe Thr Ala Asn Asp Leu Glu Asn Leu Leu
405 410 415
Leu Leu Asn Asp Thr Trp Lys Asn Val Ala Lys Pro Gln Asp Leu Gln
420 425 430
Thr Thr Gly Arg Ile Thr Asp Ser Gly Gln Asp Lys Ala Phe Leu Gln
435 440 445
Asn Lys Tyr Gly Gly Asn Pro Val Trp Ala Ser Ala Ser Thr His Asn
450 455 460
Ala Pro Asn Gly Ser Ser Gln Ile Met Asp Glu Thr Pro Asn Val Ser
465 470 475 480
Lys Ser Ser Thr Ser Gly Phe Ala Ile Lys Pro Ala Ile Ala Gly Gly
485 490 495
Lys Glu Ala Val Tyr Ser Gly Val Gln Ser Pro Arg Ser Gln Val Leu
500 505 510
Ala Val Pro Gly Glu Ala Thr Thr Pro Val Gln Ser Asn Arg Leu Pro
515 520 525
Gln Ser Pro Leu Ser Ser Ser Ser His Arg Ala Ala Ala Ser Gly Ser
530 535 540
Pro Gly Lys Asn Ser Thr His Thr Ser Val Ser Pro Ala Ile Glu Ser
545 550 555 560
Ser Arg Met Thr Ser Asp Pro Asp Glu Tyr Leu Leu Arg Tyr Ile Leu
565 570 575
Asn Pro Leu Phe Arg Met Asp Asn His Gly Pro Lys Glu Ile Cys Gln
580 585 590
Asp His Leu Tyr Lys Gly Cys Gln Gln Ser His Cys Asp Arg Ser His
595 600 605
Phe His Leu Pro Tyr Arg Trp Gln Met Phe Val Tyr Thr Thr Trp Arg
610 615 620
Asp Phe Gln Asp Met Glu Ser Ile Glu Gln Ala Tyr Cys Asp Pro His
625 630 635 640
Val Glu Leu Ile Leu Ile Glu Asn His Gln Ile Asn Phe Gln Lys Met
645 650 655
Thr Cys Asp Ser Tyr Pro Ile Arg Arg Leu Ser Thr Pro Ser Tyr Glu
660 665 670
Glu Lys Pro Leu Ser Ala Val Phe Ala Thr Lys Trp Ile Trp Tyr Trp
675 680 685
Lys Asn Glu Phe Asn Glu Tyr Ile Gln Tyr Gly Asn Glu Ser Pro Gly
690 695 700
His Thr Ser Ser Asp Ile Asn Ser Ala Tyr Leu Glu Ser Phe Phe Gln
705 710 715 720
Ser Cys Pro Arg Gly Val Leu Pro Phe Gln Ala Gly Ser Gln Lys Tyr
725 730 735
Glu Leu Ser Phe Gln Gly Met Ile Gln Thr Asn Ile Ala Ser Lys Thr
740 745 750
Gln Arg His Val Val Arg Arg Pro Val Phe Val Ser Ser Asn Asp Val
755 760 765
Glu Gln Lys Arg Arg Gly Pro Glu Gly Gly Arg Ser Ser Gly Gly Gly
770 775 780
Ser Gly Gly Gly Thr Gly Gly Gly Ser Gly Gly Gly Thr Gly Gly Gly
785 790 795 800
Ser Gly Gly Gly Asp Ile Gly Ala Lys Gly Pro Arg Phe Glu Gln Lys
805 810 815
Leu Ile Ser Glu Glu Asp Leu Asn Met His Thr Gly His His His His
820 825 830
His His
<210>8
<211>2502
<212>DNA
<213>人工序列
<220>
<223>
<400>8
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgcttcc ttcacaacag tctagaattt 780
ctctcacctg ttgtctcacc tctgggatct ggtccgccta gcccagatgt caccagctgt 840
aaagattccc tggaggatgt gtctgtggat gtcacccaga agttcaagta cttggggacg 900
catgaccgtg cgcagctctc cccagtctca tctaaggctg ctggtgttca aggacccagt 960
caaatgagag caagccaaga gttttcagag gatgggaatc tagatgacat attttctagg 1020
aatcgttctg attcatcatc aagtcgagcc tccgctgcca aggtggcaca aagaaatgaa 1080
gctgtggcca tgaaaatggg catggaggtc aagggcaaga aggaggctcc agacatcgat 1140
cgggtcccat ttttaaatag ttatattgat ggggtgacca tggaaaaagc atcggtctca 1200
ggaattccag gcaaaaagtt cacagccaat gatctggaaa atttgctatt acttaacgac 1260
acttggaaga atgtggctaa gccccaggat ctgcagacca caggcagaat cactgacagt 1320
ggccaagaca aggcattcct gcagaataaa tatggaggaa acccagtgtg ggcaagtgca 1380
tccacccata atgccccaaa tggctctagt caaattatgg atgaaactcc taatgtctct 1440
aaaagtagta ccagtggttt tgccataaaa ccagcaattg ctggaggaaa agaagcagtc 1500
tattctggag ttcagagtcc gagaagccag gtcctagctg tgcctgggga ggctactacc 1560
cctgtacaga gcaacaggct gcctcagtcg cctctgtctt cctcaagcca cagagctgca 1620
gcctctggga gccctggcaa gaactccacc catacctctg tgagcccagc catcgagtct 1680
tcaaggatga catcagaccc cgatgagtat ctcctacgct acatcctaaa tcctttattt 1740
aggatggata atcatggccc gaaggaaatc tgtcaggacc atctgtacaa gggctgtcaa 1800
cagagccact gcgacaggag tcacttccat ctgccctacc ggtggcagat gttcgtatat 1860
accacttgga gggacttcca ggacatggag tctatcgaac aggcctattg tgatccccac 1920
gttgaactca ttttgataga aaaccatcag atcaatttcc agaaaatgac ctgtgactcc 1980
taccccatcc gacgcctctc cactccctca tatgaggaaa agccacttag tgctgtcttc 2040
gccaccaagt ggatttggta ttggaagaat gaatttaatg aatatatcca gtatgggaat 2100
gagagcccag gccacaccag ctctgacatc aactctgcgt acctggagtc tttcttccag 2160
tcttgtccca ggggagtttt gccattccag gctggttcac agaagtacga gttaagcttc 2220
caagggatga ttcagacaaa tatagcttcc aagactcaaa ggcatgttgt cagaaggcca 2280
gtatttgttt cttcgaacga tgtggagcag aagagaagag gtccagaggg cggccgctcg 2340
agtggtggag gctctggggg aggtacagga ggtggctcag gtgggggcac tggtggaggc 2400
agtggaggtg gcgatatcgg cgccaagggc ccgcggttcg aacaaaaact catctcagaa 2460
gaggatctga atatgcatac cggtcatcat caccatcacc at 2502
<210>9
<211>381
<212>PRT
<213>人工序列
<220>
<223>
<400>9
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp Val Val Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Ser Cys Gly Arg
245 250 255
Ser Ser Gly Gly Gly Ser Gly Gly Gly Thr Gly Gly Gly Ser Gly Gly
260 265 270
Gly Thr Gly Gly Gly Ser Gly Gly Gly Asp Gly Arg Ala Ala Glu Pro
275 280 285
Val Asp Pro Ser Leu Glu Pro Trp Lys His Pro Gly Ser Gln Pro Arg
290 295 300
Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys Cys Phe His Cys Tyr
305 310 315 320
Ala Cys Phe Thr Arg Lys Gly Leu Gly Ile Ser Tyr Gly Arg Lys Lys
325 330 335
Arg Arg Gln Arg Arg Arg Ala Pro Gln Asp Ser Gln Thr His Gln Ala
340 345 350
Ser Leu Ser Lys Gln Pro Arg Phe Glu Gln Lys Leu Ile Ser Glu Glu
355 360 365
Asp Leu Asn Met His Thr Gly His His His His His His
370 375 380
<210>10
<211>1143
<212>DNA
<213>人工序列
<220>
<223>
<400>10
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgttcct gcggccgctc gagtggtgga 780
ggctctgggg gaggtacagg aggtggctca ggtgggggca ctggtggagg cagtggaggt 840
ggcgatggcc gcgctgctga gccagtagat cctagtctag agccctggaa gcatccagga 900
agtcagccta ggactgcttg taacaattgc tattgtaaaa agtgttgctt tcattgctac 960
gcgtgtttca caagaaaagg cttaggcatc tcctatggca ggaagaagcg gagacagcga 1020
cgaagagctc ctcaggacag tcagactcat caagcttctc tatcaaagca gccgcggttc 1080
gaacaaaaac tcatctcaga agaggatctg aatatgcata ccggtcatca tcaccatcac 1140
cat 1143
<210>11
<211>194
<212>PRT
<213>人工序列
<220>
<223>
<400>11
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ser Ser Gly Gly Gly Ser Gly Gly Gly Thr Gly
65 70 75 80
Gly Gly Ser Gly Gly Gly Thr Gly Gly Gly Ser Gly Gly Gly Asp Gly
85 90 95
Arg Ala Ala Glu Pro Val Asp Pro Ser Leu Glu Pro Trp Lys His Pro
100 105 110
Gly Ser Gln Pro Arg Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys
115 120 125
Cys Phe His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly Ile Ser
130 135 140
Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro Gln Asp Ser
145 150 155 160
Gln Thr His Gln Ala Ser Leu Ser Lys Gln Pro Arg Phe Glu Gln Lys
165 170 175
Leu Ile Ser Glu Glu Asp Leu Asn Met His Thr Gly His His His His
180 185 190
His His
<210>12
<211>582
<212>DNA
<213>人工序列
<220>
<223>
<400>12
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgatcg agtggtggag gctctggggg aggtacagga 240
ggtggctcag gtgggggcac tggtggaggc agtggaggtg gcgatggccg cgctgctgag 300
ccagtagatc ctagtctaga gccctggaag catccaggaa gtcagcctag gactgcttgt 360
aacaattgct attgtaaaaa gtgttgcttt cattgctacg cgtgtttcac aagaaaaggc 420
ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaggacagt 480
cagactcatc aagcttctct atcaaagcag ccgcggttcg aacaaaaact catctcagaa 540
gaggatctga atatgcatac cggtcatcat caccatcacc at 582
<210>13
<211>285
<212>PRT
<213>人工序列
<220>
<223>
<400>13
Met Ala Asp Pro Gly Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Thr Leu Glu Glu Leu Leu Gly Glu Ile Arg Leu
20 25 30
Pro Glu Ala Gln Leu Tyr Glu Leu Leu Glu Thr Ala Gly Pro Asp Arg
35 40 45
Phe Val Leu Leu Glu Thr Gly Gly Gln Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Ser Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
His Tyr Ala Gln Ser Gln Arg Asn Leu Cys Lys Tyr Ser His Asp Val
100 105 110
Leu Ser Glu Gln Asn Phe Gln Ile Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Gln Glu Glu Leu Ala Cys Leu Leu Val Gln Ser Asp Pro Phe
130 135 140
Phe Leu Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Lys Gln
145 150 155 160
Thr Cys Gly Gln Pro Gln Pro Cys Glu Arg Leu His Ile Cys Glu His
165 170 175
Phe Thr Arg Gly Asn Cys Ser Tyr Leu Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Thr Ile Met Arg Glu His Gly Leu Ser
195 200 205
Pro Asp ValVal Gln Asn Ile Gln Asp Ile Cys Asn Asn Lys His Ala
210 215 220
Arg Arg Asn Pro Pro Gly Thr Arg Ala Ala His Pro His Arg Arg Gly
225 230 235 240
Gly Ala His Arg Asp Arg Ser Lys Ser Arg Asp Arg Ser Cys Gly Arg
245 250 255
Ser Ser Leu Glu Gly Pro Arg Phe Glu Gln Lys Leu Ile Ser Glu Glu
260 265 270
Asp Leu Asn Met His Thr Gly His His His His His His
275 280 285
<210>14
<211>855
<212>DNA
<213>人工序列
<220>
<223>
<400>14
atggcagatc ccggggtatg ctgtttcatc accaagatcc tgtgcgccca cgggggccgt 60
atgaccctgg aggaactgct gggtgagatc aggctccccg aggcgcagct ctacgagctg 120
ctggagacgg cggggcccga tcgcttcgtg ctattggaga ctggaggcca ggccgggatc 180
actcggtctg tagtggctac tactcgagcc cgcgtctgcc gtcggaagta ctgccagaga 240
ccctgcgaca gcctgcacct ctgcaagctt aatctgctcg gccggtgcca ctatgcacag 300
tctcagcgga acctctgcaa atattctcac gatgttctct cggaacagaa cttccagatc 360
ctgaagaatc atgagctctc tgggcttaac caagaggagc tagcttgcct cctggtccaa 420
agcgaccctt ttttcctgcc cgagatatgc aagagttaca aaggagaggg ccgaaaacag 480
acctgtgggc agccacagcc atgcgagaga ctccacatct gtgagcactt cacccggggc 540
aactgcagtt acctcaactg tctcaggtct cacaacctga tggacagaaa ggtgttgacc 600
atcatgaggg agcacgggct gagtcctgat gtggtccaga acatccagga catctgcaac 660
aacaaacacg ccaggaggaa cccgcctggc acgagagctg cccatccaca ccgcagaggc 720
ggcgcacaca gagacagaag caaaagcaga gaccgttcct gcggccgctc gagtctagag 780
ggcccgcggt tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat 840
catcaccatc accat 855