实施例
以下给出实施例更具体地说明本发明。但这些实施例只是用于说明,并不限制本发明的技术范围。
[实施例1]
使用RNA聚合酶I启动子/终止子系的HCV表达载体的构建
作为HCV基因组RNA的cDNA,使用来自基因型2a的JFH1株的基因组RNA的cDNA(GenBank登录号AB047639、Kato,T.等人,Gastroenterology,125(2003)1808-1817页)、来自发生了DNA序列变换的JFH1株的基因组RNA的cDNA,其中JFH1株的NS5B中的GDD氨基酸序列变异为GND(Wakita等人,Nat Med.,11(2005)791-796页)、来自J6CF株的基因组RNA的cDNA(GenBank登录号AF177036、Yanagi,M.等人.,Virology,262(1999)250-263页)、来自基因型1a的H77c株的基因组RNA的cDNA(GenBank登录号AF011751、Yanagi,M.等人.,Proc.Natl.Acad.Sci.USA,94(1997)8738-8743页)、以及来自基因型1b的J1株的基因组RNA的cDNA(GenBank登录号D89815、Aizaki,H.等人.Hepatology,27(1998)621-627页)。
使用PCR,分别在上述HCV基因组cDNA(JFH1、JFH1/GND和H77c)的5’端和3’端附加限制酶BsmBI识别序列。利用BsmBI的切断位点与该酶的识别位点有一定距离的性质,在启动子/终止子与HCV基因组之间没有多余碱基序列地将HCV基因组插入到具有人RNA聚合酶I启动子和小鼠RNA聚合酶I终止子序列的pHH21载体(Neumann,G.等人,Proc.Natl.Acad.Sci.USA,96(1999)9345-9350页)的BsmBI位点(图1A)。另外,各克隆的图谱如图1B所示。所得的含有HCV基因组cDNA的载体分别称为pHH JFH1、pHH JFH1/GND和pHH H77c。
并且,分别来自J6CF和JFH1、H77c和JFH1、J1和JFH1的基因组cDNA的嵌合HCV表达载体的制备按以下方法进行。
PHH J6(C-p7)/JFH1的制备
将与来自JFH1株的基因组RNA全部区域对应的cDNA克隆到pUC19质粒中,由此构建作为质粒DNA的pJFH1(Wakita,T.等人.,Nat.Med.,11(2005),791-796页、国际公开WO2004/104198),将其用AgeI消化后,再用BclI进行部分消化,纯化由AgeI位点至最初的BclI位点的片段(2672bp)被除去的质粒DNA片段。另外,将来自J6CF株的基因组cDNA用AgeI和BclI部分消化,将所得的2672bp片段与上述片段连接,得到pUC J6/JFH1。
接着,将pHH JFH1用NocI消化,得到约4.3kb的片段;将pUCJ6/JFH1用NocI消化,得到约8.2kb的片段,将它们用连接酶连接,得到表达载体pHH J6(C-p7)/JFH1。该pHH J6(C-p7)/JFH1的插入片段(J6(C-p7)JFH1,SEQ ID NO.29,图10A-D)含有分别编码来自JFH1株和J6CF株的嵌合的5’非翻译区、来自J6CF株的核心蛋白、E1蛋白、E2蛋白、以及p7蛋白的DNA序列,以及分别编码来自JFH1株的NS2、NS3、NS4A、NS4B、NS5A和NS5B蛋白以及3’非翻译区的DNA序列。
pHH H77c(C-p7)/JFH1的制备
以上述JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物5-JFH-S(SEQ ID NO.10:GCCATGGCGTTAGTATGAGTGTCGT)和5-JFH-A(SEQ ID NO.11:TCGTGCTCATGGTGCACGGTCTACGAGACC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行30个循环。将所得PCR产物作为PCR产物no.1。接着,以JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ制备)所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-JFH-S(SEQ ID NO.12:GGCATACGCATATGACGCACCTGTGCACGG)和3-JFH-A(SEQ ID NO.13:GCTCTGACGAAGTACGGCACATGTGTC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ公司),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.2。再以上述H77c基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物5-H77-S(SEQ ID NO.14:ACCGTGCACCATGAGCACGAATCCTAAACC)和5-H77-A(SEQ ID NO.15:GAAGCCGCACGTAAGGGTATCGATG:)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.3。再以上述H77c基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-H77-S(SEQ ID NO.16:CATTGTGCCCGCAAAGAGCGTGTGT)和3-H77-A(SEQ ID NO.17:GTGCGTCATATGCGTATGCCCGCTGAGGCA)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.4。
接着,将各PCR产物进行纯化,溶解于50μL水中。将PCR产物no.1和PCR产物no.3的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加引物5-JFH-S(SEQ ID NO.10)和5-H77-A(SEQ ID NO.15),再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR5HJ。接着,将PCR产物no.2和PCR产物no.4的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加引物3-H77-S(SEQ ID NO.16)和3-JFH-A(SEQ ID NO.13),再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR3HJ。
接着,将pCR5HJ用限制酶AgeI和KpnI、将H77c基因组cDNA用限制酶KpnI和AscI、将pCR3HJ用限制酶AscI和NotI消化,将各HCVcDNA片段通过琼脂糖凝胶电泳进行分级,纯化。将该三个DNA片段与将pHH JFH1用AgeI和NotI消化得到的载体连接。将该载体命名为pHH H77c(C-p7)/JFH。该表达载体pHH H77c(C-p7)/JFH的插入片段(H77c(C-p7)/JFH;SEQ ID NO.30,图11A-D)含有分别编码来自JFH1株和H77c株的嵌合的5’非翻译区、来自H77c株的核心蛋白、E1蛋白、E2蛋白和p7蛋白的DNA序列,以及分别编码来自JFH1株的NS2、NS3、NS4A、NS4B、NS5A和NS5B蛋白和3’非翻译区的DNA序列。
PHH J1(C-p7)/JFH1的制备
以上述JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物5-JFH-S(SEQ ID NO.10:GCCATGGCGTTAGTATGAGTGTCGT)和5-JFH-A2(SEQ ID NO.18:TTGTGCTCATGGTGCACGGTCTACGAGACC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.5。
接着,以JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-JFH-S2(SEQ ID NO.19:AGCTTACGCCTATGACGCACCTGTGCACGG)和3-JFH-A(SEQ ID NO.13:GCTCTGACGAAGTACGGCACATGTGTC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.6。
接着,以与来自基因型1b的J1株的基因组RNA对应的cDNA(GenBank登录号D89815、Aizaki,H.等人,Hepatology,27(1998)621-627页)为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-J1-S(SEQ IDNO.20:ACCGTGCACCATGAGCACAAATCCTAAACC)和5-J1-A(SEQ IDNO.21:AAGCGGGATGTACCCCATGAG)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.7。
接着,以来自上述J1株的基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-J1-S(SEQ ID NO.22:CGGCTGTACATGGATGAATAGCACTGGGTT)和3-J1-A(SEQ ID NO.23:GTGCGTCATAGGCGTAAGCTCGTGGTGGTA)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.8。
接着,将各PCR产物进行纯化,溶解于50μL水中。将PCR产物no.5和PCR产物no.7的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加引物5-JFH-S(SEQ ID NO.10)和5-J1-A(SEQ ID NO.21),再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR5JJ。
再将PCR产物no.6和PCR产物no.8的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加引物3-J1-S(SEQ ID NO.22)和3-JFH-A(SEQ ID NO.13),再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR3JJ。
接着,将pCR5JJ用限制酶AgeI和ClaI、将J1基因组cDNA用限制酶ClaI和AvrII、将pCR3JJ用限制酶AvII和KpnI消化,将各HCVcDNA片段通过琼脂糖凝胶电泳进行分级,纯化。将该三个DNA片段与将pHH JFH1用AgeI和KpnI消化得到的载体连接。将该载体命名为pHH J1(C-p7)/JFH。该表达载体pHH J1(C-p7)/JFH的插入片段(J1(C-p7)/JFH;SEQ ID NO.31,图12A-D)含有分别编码来自JFH1株和J1株的嵌合的5’非翻译区、来自J1株的核心蛋白、E1蛋白、E2蛋白和p7蛋白的DNA序列,以及分别编码来自JFH1株的NS2、NS3、NS4A、NS4B、NS5A和NS5B蛋白和3’非翻译区的DNA序列。
pHH J1(C-NS2)/JFH1的制备
以上述JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物5-JFH-S(SEQ ID NO.10:GCCATGGCGTTAGTATGAGTGTCGT)和5-JFH-A2(SEQ ID NO.18:TTGTGCTCATGGTGCACGGTCTACGAGACC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.9。
接着,以JFH1基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-JFH-NS3-S(SEQ ID NO.24:GCGACTCCTTGCTCCCATCACTGCTTATGC)和3-JFH-NS3-A(SEQ ID NO.25:TGGGAGACCTTGTAACAACGTCGAGTGT)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.10。
接着,以来自基因型1b的J1株的基因组RNA的cDNA(GenBank登录号D89815、Aizaki,H.等人.Hepatology,27(1998)621-627页)为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物5-J1-S(SEQ ID NO.20:ACCGTGCACCATGAGCACAAATCCTAA ACC)和5-J1-A(SEQ ID NO.21:AAGCGGGATGTACCCCATGAG)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.11。
接着,以来自J1株的基因组cDNA为模板,分别加入LA-PCR试剂盒(タカラバイオ社)中所附带的10×缓冲液5μL、2.5mM dNTP混合液5μL、10μM引物3-J1-S(SEQ ID NO.22:CGGCTGTACATGGATGAATAGCACTGGGTT)和3-J1-NS3-A(SEQ ID NO.26:CATAAGCAGTGATGGGAGCAAGGAGTCGCC)各1μL,最后加入去离子水,使总量为49μL。接着加入1μL Takara LA Taq(タカラバイオ社),然后进行PCR反应。PCR反应是以含有94℃1分钟、64℃2分钟、72℃3分钟的步骤作为一个循环,进行25个循环。将所得PCR产物作为PCR产物no.12。
将各PCR产物进行纯化,溶解于50μL水中。将PCR产物no.9和PCR产物no.11的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加5-JFH-S(SEQ ID NO.10)和5-J1-A(SEQ ID NO.21)作为引物,再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR5JJ。
再将PCR产物no.10和PCR产物no.12的DNA稀释100倍,各取1μL混合。以该混合液为模板,不加入引物,在上述条件下进行5个循环的PCR。然后添加3-J1-S(SEQ ID NO.22)和3-JFH-NS3-A(SEQ IDNO.25)作为引物,再进行25个循环的PCR,由此纯化扩增的嵌合DNA片段。将该片段克隆到质粒载体pCRII上,确定该DNA片段的碱基序列,确认碱基序列正确。将嵌合DNA片段克隆到pCRII上,将所得的该质粒命名为pCR3JJNS3。
接着,将pCR5JJ用限制酶AgeI和ClaI、将J1基因组cDNA用限制酶ClaI和AvrII、将pCR3JJNS3用限制酶AvII和BspDI消化,将各HCV cDNA片段通过琼脂糖凝胶电泳进行分级,纯化。将该三个DNA片段与将pHH JFH1用AgeI和BspDI消化得到的载体连接。将该载体命名为pHH J1(C-NS2)/JFH。该表达载体pHH J1(C-NS2)/JFH的插入片段(J1(C-NS2)/JFH;SEQ ID NO.32,图13A-D)含有:分别编码来自JFH1株和J1株的嵌合的5’非翻译区、来自J1株的核心蛋白、E1蛋白、E2蛋白、p7蛋白和NS2蛋白的DNA序列,以及分别编码来自JFH1株的NS3、NS4A、NS4B、NS5A和NS5B蛋白以及3’非翻译区的DNA序列。
各表达载体克隆的图谱如图1C所示。
[实施例2]
细胞内HCV RNA合成的确认
使用Fugene6(ロシユ社),按照所附说明书,将实施例1中制备的pHH JFH1和pHH JFH1/GND导入到Huh7和HepG2细胞中。24小时后,通过Isogen(日本ジ一ン社),按照所附说明书,由各细胞制备RNA。
以这些RNA为模板,如下通过RACE法确认是否由pHH JFH1和pHH JFH1/GND合成了HCV RNA。
通过T4RNA连接酶(Takara社),使用所附缓冲液,将2μg上述制备的RNA和2.5μM RNA接头(SEQ ID NO.1:GCUGAUGGCGAUGAAUGAACACUGCGUUUGCUGGCUUUGAUGAAA)连接。接着,以连接有RNA接头的RNA为模板,使用与HCV RNA互补的引物(SEQID NO.2:gtaccccatgaggtcggcaaag),通过反转录酶SuperScript III(インビトロジエン社),按照所附说明书合成cDNA。
接着,以合成的上述cDNA为模板,使用5’末端RNA接头的正义引物(SEQ ID NO.3:GCTGATGGCGATGAATGAACACTG)和3’末端的反义引物(SEQ ID NO.4:gaccgctccgaagttttccttg)两种引物,通过PCR扩增DNA。再以扩增的DNA为模板,使用含有扩增的DNA内侧序列的5’末端的引物(SEQ ID NO.5:GAACACTGCGTTTGCTGGCTTTGATG)和3’末端的引物(SEQID NO.6:cgccctatcaggcagtaccacaag)的引物对进行第二次PCR。该PCR是使用市售的试剂盒Ex Taq(Takara),在96℃加热处理5分钟,然后以96℃1分钟、55℃1分钟、72℃2分钟的反应循环进行35个循环,之后在4℃下保存。接着,为了确认是否得到了上述第二次的PCR产物,将反应液的一部分通过琼脂糖凝胶电泳确认。结果(图2),在Huh7细胞和HepG2细胞中均发生了由pHH JFH1和pHH JFH1/GND转录为HCV RNA的情况。并且,将第二次的PCR产物克隆到pGEM-T Easy(Promega社)载体中。在所得质粒DNA中克隆的DNA碱基序列用DNA测序仪(ABI PRISM377)确定。结果,由pHH JFH1和pHH JFH1/GND转录的HCV RNA的5’末端序列与HCV基因组的5’末端相同。图3中给出了由pHH JFH1转录的HCV RNA的5’末端和接头序列。
[实施例3]
细胞内的HCV蛋白质表达的确认
在导入了pHH JFH1、pHH H77c和pHH JFH1/GND的细胞中,确认转录之后是否发生了HCV蛋白质的翻译。
使用Fugene6(ロシユ社),按照所附说明书,将pHH JFH1、pHH H77c和pHH JFH1/GND导入到Huh7细胞中。培养4天后,按照常规方法制备细胞提取液。接着,通过SDS-PAGE和蛋白质印迹法对这些提取液中的蛋白质进行分析。分析时,将含有核心基因的表达质粒DNA瞬时转染Huh7细胞,以所得细胞提取液作为阳性对照。并且,将由未转染的Huh7细胞得到的细胞提取液作为阴性对照。将由各细胞克隆提取的试样进行SDS-PAGE后,转印PVDF膜(Immobilon-P,Millipore社制),使用抗核心特异性抗体(兔多克隆抗体)、以及小鼠单克隆抗体(Austral社)和识别这些抗体的HRP标记二次抗体作为抗NS5A抗体,通过ECL(アマシヤムフアルマシア社)检测在细胞内被翻译的核心蛋白和NS5A蛋白。
结果如图4所示,在导入了pHH JFH1的细胞中,可见核心和NS5A蛋白的表达。但在导入pHH H77c和pHH JFH1/GND的细胞中未检测出这些蛋白质的表达。由该结果可认为,在导入了pHH JFH1的细胞中,由pHH JFH1转录为RNA之后,该RNA在细胞内复制,然后发生蛋白质的翻译,达到了蛋白质可检测的水平,而在导入了pHH JFH1/GND的细胞中,由pHH JFH1/GND转录RNA后,该RNA变异为NS5B,因此在细胞内无法复制,未达到可检测蛋白质的水平。另一方面,HCV基因组RNA虽然在培养细胞中自主复制(Yi,M.& Lemon ST.,J.Virol.,78(2004)7904-7915页),但在pHH H77c中未能检测HCV蛋白的表达的原因尚不明确。
下面,对于在什么样的细胞株中表达HCV蛋白进行研究。使用Fugen6(ロシユ社),按照所附说明书,将pHH JFH1和pHH JFH1/GND与GFP表达载体(pGreenLantern:Life Technologies Inc.)一起导入到Huh7、RCYM1RC、5-15RC、HepG2和293T细胞中。导入4天后,通过荧光显微镜观察GFP的表达,确认导入到各载体中后,由各细胞制备细胞提取液。接着,通过SDS-PAGE和蛋白质印迹法对这些提取液中的核心蛋白进行分析。核心蛋白的检测是使用抗核心特异性兔多克隆抗体和识别该抗体的HRP标记二次抗体,通过ECL(アマシヤムフアルマシア社)进行。结果如图5所示,在Huh7、RCYM1RC、5-15RC、HepG2细胞中检测出了核心蛋白。
[实施例4]
HCV颗粒的生成
接着,为了确认是否由pHH JFH1导入细胞生成了HCV颗粒,对培养上清中的核心蛋白进行分析。作为pHH JFH1和对照,使用Fugene6,向HepG2细胞中导入表达核心、E1、E2和p7的载体pCAG327JFH1,2天后除去培养上清,加入新鲜的培养基,再培养两天。接着回收细胞和培养液,由细胞提取蛋白质,按照实施例3所示的方法对细胞中的核心蛋白的表达进行分析。结果确认了核心蛋白的表达(图6A)。为了确认培养液中HCV颗粒的存在,通过蔗糖密度梯度对培养液进行分级。在10-60%(重量/重量)的密度梯度蔗糖溶液(溶解于50mM Tris pH 7.5/0.1M NaCl/1mM EDTA中)再铺0.2mL样品的培养上清。将其用ベツクマンロ一タ一S W41E、以35,000RPM、4℃离心16小时,离心结束后,从离心管底将各级分回收0.5mL。对各级分的密度、HCV核心蛋白浓度进行定量。HCV核心蛋白的测定是使用オ一ソHCV抗原IRMA测试仪进行(Aoyagi等人.,J.Clin.Microbiol.,37(1999)1802-1808页)。
如图6B所示,在1.15~1.18mg/mL的级分中可见核心蛋白的峰。与此相对,在表达核心、E1、E2和p7的细胞中未观察到核心蛋白的峰。
存在于1.15~1.18mg/mL级分中的核心蛋白和HCV RNA如果形成HCV颗粒,则应该对表面活性NP40的处理具有感受性。因此,将pHHJFH1导入HepG2中,将2~4天的培养液用0.2%的NP40处理20分钟,然后通过蔗糖密度梯度分级。如图7所示,通过进行NP40处理,在1.20mg/mL的级分中可见核心蛋白的峰。即,含有脂质的比重轻的表面膜由于NP40而从病毒颗粒上剥离,形成未保持类病毒结构的核酸和只有核心蛋白的核心颗粒,比重变重。
由以上结果可以确认,通过将pHH JFH1导入HepG2细胞,由HCV基因组cDNA转录病毒RNA后,发生病毒蛋白的合成和病毒RNA的复制,形成病毒颗粒,分泌到培养液中。
[实施例5]
HCV颗粒的感染性确认
对实施例4所得的HCV颗粒是否具有感染性进行研究。使用Fugene6将pHH JFH1导入到Huh7细胞和HepG2细胞中,将培养3~5天的培养上清用超滤膜(截留分子量1×105Da)浓缩30倍。接着,在100μL浓缩的含有HCV颗粒的培养液中将Huh7.5.1细胞在15mm盖玻片上培养,4天后,将细胞用抗NS5A抗体荧光染色。记数抗NS5A抗体染色阳性,即感染细胞数,结果如图8所示,可见部分感染细胞。由这些结果可知,通过将pHH JFH1导入到Huh7或HepG2细胞中,分泌到培养液中的HCV颗粒具有感染的能力。
[实施例6]
感染性HCV颗粒产生细胞株的获得
利用上述实施例所示的系统,尝试建立持续生产感染性HCV颗粒的细胞株。
将pHH JFH1用NheI消化,将pSV40/Zeo2(Invitrogen)用Nhe I和Xba I消化,获得SV40启动子,在SV40启动子下游连接Zeocin抗性基因,再在其下游连接附加有SV40polyA的信号,将所得的表达单元(SV40启动子/Zeo/polyA)整合到上述pHH JFH1用NheI消化的位点,制备载体。将该载体命名为pHH/ZeoJFH1(图9)。
通过Fugene6,将pHH/ZeoJFH1导入到HepG2细胞中。用含有Zeocin的培养基培养。然后每周更换两次培养液,用含有Zeocin的培养基继续培养。上述培养21天后,由培养皿克隆存活细胞的细胞集落,继续培养。通过上述集落的克隆获得了100个细胞克隆。这些克隆的培养上清中的核心蛋白质量是使用オ一ソHCV抗原IRMA测试仪进行(Aoyagi等人.,J.Clin.Microbiol.,37(1999)1802-1808页),筛选出核心蛋白表达量多的克隆HepG2/No59细胞。
使用10cm的培养皿,用8mL的培养基培养2×106个HepG2/No59细胞。48小时后回收培养上清。
测定培养上清中的核心蛋白,结果可以确认:HepG2/No59细胞产生了1,400fmol/L的核心蛋白。
由培养上清制备RNA,由该RNA进行HCV基因组RNA量的测定。HCV RNA的定量RT-PCR检测是按照Takeuchi等人的方法(Takeuchi T等人.,Gastroenterology,116(1999)636-642页),检测HCV RNA的5’非翻译区的RNA来进行。具体来说,使用以下的合成引物和EZ rTth RNAPCR试剂盒(Applied Biosystems),对由细胞提取的RNA中所含的HCVRNA进行PCR扩增,通过ABI Prism 7700测序系统(Applied Biosystems)检测。探针
R6-130-S17:5’-CGGGAGAGCCATAGTGG-3’(SEQ ID No.7)
R6-290-R19:5’-AGTACCACAAGGCCTTTCG-3’(SEQ ID No.8)
TaqMan探针,R6-148-S21FT:5’-CTGCGGAACCGGTGAGTACAC-3’
(SEQ ID No.9)
结果可知,HepG2/No59细胞培养上清中的HCV RNA为6.1×108拷贝/mL。该产生量比目前报道的量高约60倍。
进一步使用超滤膜(截留分子量1×105Da),将HepG2/No59细胞培养上清浓缩30倍,用100μL浓缩的含有HCV颗粒的培养液,在15mm盖玻片上培养Huh7.5.1细胞,4天后,将细胞用抗NS5A抗体免疫染色。结果,可检测出抗NS5A抗体染色阳性、即感染细胞。由以上可以确认,分泌到HepG2/No59细胞培养液中的HCV颗粒具有感染能力。
[实施例7]
通过嵌合HCV表达载体制备HCV颗粒
使用Fugene6(ロシユ社),按照所附说明书,将实施例1中制备的嵌合HCV表达载体pHH H77c(C-p7)/JFH1、pHH J6(C-p7)/JFH1、pHHJ1(C-p7)/JFH1和pHH J1(C-NS2)/JFH1导入到由Huh7细胞派生的细胞株Huh7.5.1细胞(Zhong,J.等人.,Proc.Natl.Acad.Sci USA,102,9294-9299页(2005))中。导入3天后回收这些培养上清,测定培养上清中所含的核心蛋白的量。核心蛋白质量的测定是使用オ一ソHCV抗原IRMA测试仪(Aoyagi等人.,J.Clin.Microbiol.,37(1999)1802-1808页)。使用未导入载体的细胞的培养上清作为阴性对照,使用导入了pHH JFH1的细胞的培养上清作为阳性对照。其实验结果的一个例子如表1所示。导入了各嵌合HCV表达载体的细胞上清中可见核心蛋白,由此判断生成了病毒颗粒。还证实pHH J6(C-p7)/JFH1具有比pHH JFH1高10倍以上的HCV产生能力。
[表1]
表达载体 |
导入载体3天后的核心蛋白质量(fmol/L) |
未导入 |
0 |
pHH JFH1 |
252.164 |
pHH J6(C-p7)/JFH1 |
2272.878 |
pHH H77c(C-p7)/JFH1 |
29.555 |
pHH J1(C-p7)/JFH1 |
0.403 |
pHH J1(C-NS2)/JFH1 |
11.004 |
[实施例8]
将pHH J6(C-p7)/JFH1和pHH J1(C-p7)/JFH1按照实施例7所示的方法导入到Huh7.5.1细胞中,将导入3天后的培养上清用超滤膜(截留分子量1×105Da)浓缩。加入100μL浓缩的含有HCV颗粒的培养上清,将Huh7.5.1细胞在15mm盖玻片上培养,4天后用抗NS5A抗体进行免疫染色。结果,用由导入了pHH J6(C-p7)/JFH1的细胞得到的培养上清处理的细胞中可见被抗NS5A抗体强烈染色的细胞。另一方面,用由导入了pHH J1(C-p7)/JFH1的细胞得到的培养上清处理的细胞中,与pHHJ6(C-p7)/JFH1比较,抗NS5A抗体染色强度弱,但与对照细胞比较则明显被染色。以上表明,导入了pHH J6(C-p7)/JFH1和pHH J1(C-p7)/JFH1的细胞中产生了感染性HCV。
按照实施例6所示的方法,向pHH J1(C-p7)/JFH1中插入Zeocin抗性基因表达单元,制备pHH/Zeo J1(C-p7)/JFH1。接着,将pHH/ZeoJ1(C-p7)/JFH1导入Huh7.5.1,得到在含有Zeocin的培养基中增殖、可稳定表达病毒颗粒的细胞。将8mL该细胞的培养上清用超滤膜(截留分子量1×105Da)浓缩。该浓缩的培养上清中的HCV核心蛋白质量是2365fmol/L。使用100μL该浓缩培养上清,使Huh7.5.1感染,4天后将细胞用抗NS5A抗体免疫染色。结果,由导入了pHH/Zeo J1(C-p7)/JFH1的稳定表达性细胞得到培养上清,在用该培养细胞处理的细胞中可见被抗NS5A抗体强烈染色的细胞。由此表明由本细胞产生了感染性HCV。
上述嵌合HCV表达载体显示可在其所感染的细胞内产生感染性HCV。
序列表自由文本
SEQ ID NO.1的序列表示合成RNA。
SEQ ID NO.2~SEQ ID NO.9的序列表示合成DNA。
SEQ ID NO.10~26的序列表示引物。
SEQ ID NO.29~32的序列表示嵌合DNA。
SEQ ID NO.33~45的序列表示合成DNA。
序列表
<110>Director General of National Institute of Infectious Diseases
Tokyo Metropol itan Organization for Medical Research
TORAY Industries,Inc.
<120>感染性丙型肝炎病毒颗粒高效生产体系
<130>PH-2922-PCT
<150>JP 2005-287646
<151>2005-09-30
<160>45
<170>PatentIn Ver.2.1
<210>1
<211>45
<212>RNA
<213>人工序列(Artificial Sequence)
<220>
<223>人工序列说明:合成的(synthetic)
<220>
<223>发明人:Tanabe,Jun-ichi;Sone,Saburo;Wakita,Takaji
发明人:Ishii,Takashi;Suzuki,Ryosuke;Suzuki,Tetsurou
发明人:Miyamura Tatsuo
<400>1
gcugauggcg augaaugaac acugcguuug cuggcuuuga ugaaa 45
<210>2
<211>22
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>2
gtaccccatg aggtcggcaa ag 22
<210>3
<211>24
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>3
gctgatggcg atgaatgaac actg 24
<210>4
<211>22
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>4
gaccgctccg aagttttcct tg 22
<210>5
<211>26
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>5
gaacactgcg tttgctggct ttgatg 26
<210>6
<211>24
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>6
cgccctatca ggcagtacca caag 24
<210>7
<211>17
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>7
cgggagagcc atagtgg 17
<210>8
<211>19
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>8
agtaccacaa ggcctttcg 19
<210>9
<211>21
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>9
ctgcggaacc ggtgagtaca c 21
<210>10
<211>25
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>10
gccatggcgt tagtatgagt gtcgt 25
<210>11
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>11
tcgtgctcat ggtgcacggt ctacgagacc 30
<210>12
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>12
ggcatacgca tatgacgcac ctgtgcacgg 30
<210>13
<211>27
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>13
gctctgacga agtacggcac atgtgtc 27
<210>14
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>14
accgtgcacc atgagcacga atcctaaacc 30
<210>15
<211>25
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>15
gaagccgcac gtaagggtat cgatg 25
<210>16
<211>25
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>16
cattgtgccc gcaaagagcg tgtgt 25
<210>17
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>17
gtgcgtcata tgcgtatgcc cgctgaggca 30
<210>18
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>18
ttgtgctcat ggtgcacggt ctacgagacc 30
<210>19
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>19
agcttacgcc tatgacgcac ctgtgcacgg 30
<210>20
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>20
accgtgcacc atgagcacaa atcctaaacc 30
<210>21
<211>21
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>21
aagcgggatg taccccatga g 21
<210>22
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>22
cggctgtaca tggatgaata gcactgggtt 30
<210>23
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>23
gtgcgtcata ggcgtaagct cgtggtggta 30
<210>24
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>24
gcgactcctt gctcccatca ctgcttatgc 30
<210>25
<211>28
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>25
tgggagacct tgtaacaacg tcgagtgt 28
<210>26
<211>30
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:引物
<400>26
cataagcagt gatgggagca aggagtcgcc 30
<210>27
<211>9678
<212>DNA
<213>丙型肝炎病毒
<220>
<221>CDS
<222>(341)..(9442)
<400>27
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat cct 355
Met Ser Thr Asn Pro
1 5
aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Glu Asp
10 15 20
gtt aag ttc ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
25 30 35
ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser
40 45 50
gag cgg tcc cag cca cgt ggg aga cgc cag ccc atc ccc aaa gat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Asp Arg
55 60 65
cgc tcc act ggc aag gcc tgg gga aaa cca ggt cgc ccc tgg ccc cta 595
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu
70 75 80 85
tat ggg aat gag gga ctc ggc tgg gca gga tgg ctc ctg tcc ccc cga 643
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100
ggc tct cgc ccc tcc tgg ggc ccc act gac ccc cgg cat agg tcg cgc 691
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg
105 110 115
aac gtg ggt aaa gtc atc gac acc cta acg tgt ggc ttt gcc gac ctc 739
Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130
atg ggg tac atc ccc gtc gta ggc gcc ccg ctt agt ggc gcc gcc aga 787
Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg
135 140 145
gct gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 835
Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
150 155 160 165
aca ggg aac cta ccc ggt ttc ccc ttt tct atc ttc ttg ctg gcc ctg 883
Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180
ttg tcc tgc atc acc gtt ccg gtc tct gct gcc cag gtg aag aat acc 931
Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala Gln Val Lys Asn Thr
185 190 195
agt agc agc tac atg gtg acc aat gac tgc tcc aat gac agc atc act 979
Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr
200 205 210
tgg cag ctc gag gct gcg gtt ctc cac gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225
gag aga gtg ggg aat acg tca cgg tgt tgg gtg cca gtc tcg cca aac 1075
Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn
230 235 240 245
atg gct gtg cgg cag ccc ggt gcc ctc acg cag ggt ctg cgg acg cac 1123
Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln Gly Leu Arg Thr His
250 255 260
atc gat atg gtt gtg atg tcc gcc acc ttc tgc tct gct ctc tac gtg 1171
Ile Asp Met Val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val
265 270 275
ggg gac ctc tgt ggc ggg gtg atg ctc gcg gcc cag gtg ttc atc gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala Gln Val Phe Ile Val
280 285 290
tcg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305
cct ggc acc atc act gga cac cgc atg gca tgg gac atg atg atg aac 1315
Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325
tgg tcg ccc acg gcc acc atg atc ctg gcg tac gtg atg cgc gtc ccc 1363
Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro
330 335 340
gag gtc atc ata gac atc gtt agc ggg gct cac tgg ggc gtc atg ttc 1411
Glu Val Ile Ile Asp Ile Val Ser Gly Ala His Trp Gly Val Met Phe
345 350 355
ggc ttg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Ile Val
360 365 370
atc ctt ctg ctg gcc gct ggg gtg gac gcg ggc acc acc acc gtt gga 1507
Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly
375 380 385
ggc gct gtt gca cgt tcc acc aac gtg att gcc ggc gtg ttc agc cat 1555
Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala Gly Val Phe Ser His
390 395 400 405
ggc cct cag cag aac att cag ctc att aac acc aac ggc agt tgg cac 1603
Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420
atc aac cgt act gcc ttg aat tgc aat gac tcc ttg aac acc ggc ttt 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe
425 430 435
ctc gcg gcc ttg ttc tac acc aac cgc ttt aac tcg tca ggg tgt cca 1699
Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro
440 445 450
ggg cgc ctg tcc gcc tgc cgc aac atc gag gct ttc cgg ata ggg tgg 1747
Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala Phe Arg Ile Gly Trp
455 460 465
ggc acc cta cag tac gag gat aat gtc acc aat cca gag gat atg agg 1795
Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485
ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 1843
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala
490 495 500
agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc agc ccg gta gta 1891
Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
505 510 515
gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 1939
Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu
520 525 530
aat gag aca gat gtc ttc cta ctg aac agc acc cga ccg ccg cag ggc 1987
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gln Gly
535 540 545
tca tgg ttc ggc tgc acg tgg atg aac tcc act ggt ttc acc aag act 2035
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565
tgt ggc gcg cca cct tgc cgc acc aga gct gac ttc aac gcc agc acg 2083
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr
570 575 580
gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gcc act 2131
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr
585 590 595
tat att aag tgt ggt tct ggg ccc tgg ctc aca cca aag tgc ctg gtc 2179
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610
cac tac cct tac aga ctc tgg cat tac ccc tgc aca gtc aat ttt acc 2227
His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
615 620 625
atc ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg ctc acg 2275
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
630 635 640 645
gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asp Leu Glu Asp Arg
650 655 660
gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371
Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675
atc ctg ccc tgc acc tac tca gac tta ccc gct ttg tca act ggt ctt 2419
Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690
ctc cac ctt cac cag aac atc gtg gac gta caa tac atg tat ggc ctc 2467
Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705
tca cct gct atc aca aaa tac gtc gtt cga tgg gag tgg gtg gta ctc 2515
Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu
710 715 720 725
tta ttc ctg ctc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
730 735 740
ctc atc ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 2611
Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val
745 750 755
ttg cac gct gcg agt gcg gct aac tgc cat ggc ctc cta tat ttt gcc 2659
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala
760 765 770
atc ttc ttc gtg gca gct tgg cac atc agg ggt cgg gtg gtc ccc ttg 2707
Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly Arg Val Val Pro Leu
775 780 785
acc acc tat tgc ctc act ggc cta tgg ccc ttc tgc cta ctg ctc atg 2755
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met
790 795 800 805
gca ctg ccc cgg cag gct tat gcc tat gac gca cct gtg cac gga cag 2803
Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gln
810 815 820
ata ggc gtg ggt ttg ttg ata ttg atc acc ctc ttc aca ctc acc ccg 2851
Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu Phe Thr Leu Thr Pro
825 830 835
ggg tat aag acc ctc ctc ggc cag tgt ctg tgg tgg ttg tgc tat ctc 2899
Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp Trp Leu Cys Tyr Leu
840 845 850
ctg acc ctg ggg gaa gcc atg att cag gag tgg gta cca ccc atg cag 2947
Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp Val Pro Pro Met Gln
855 860 865
gtg cgc ggc ggc cgc gat ggc atc gcg tgg gcc gtc act ata ttc tgc 2995
Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala Val Thr Ile Phe Cys
870 875 880 885
ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043
Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Leu Leu Gly
890 895 900
cct gct tac ctc tta agg gcc gct ttg aca cat gtg ccg tac ttc gtc 3091
Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val
905 910 915
aga gct cac gct ctg ata agg gta tgc gct ttg gtg aag cag ctc gcg 3139
Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu Val Lys Gln Leu Ala
920 925 930
ggg ggt agg tat gtt cag gtg gcg cta ttg gcc ctt ggc agg tgg act 3187
Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala Leu Gly Arg Trp Thr
935 940 945
ggc acc tac atc tat gac cac ctc aca cct atg tcg gac tgg gcc gct 3235
Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala
950 955 960 965
agc ggc ctg cgc gac tta gcg gtc gcc gtg gaa ccc atc atc ttc agt 3283
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Ile Ile Phe Ser
970 975 980
ccg atg gag aag aag gtc atc gtc tgg gga gcg gag acg gct gca tgt 3331
Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala Glu Thr Ala Ala Cys
985 990 995
ggg gac att cta cat gga ctt ccc gtg tcc gcc cga ctc ggc cag gag 3379
Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gln Glu
1000 1005 1010
atc ctc ctc ggc cca gct gat ggc tac acc tcc aag ggg tgg aag ctc 3427
Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu
1015 1020 1025
ctt gct ccc atc act gct tat gcc cag caa aca cga ggc ctc ctg ggc 3475
Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1030 1035 1040 1045
gcc ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gcc ggg 3523
Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gln Ala Gly
1050 1055 1060
gaa gtc caa atc ctg tcc aca gtc tct cag tcc ttc ctc gga aca acc 3571
Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser Phe Leu Gly Thr Thr
1065 1070 1075
atc tcg ggg gtt ttg tgg act gtt tac cac gga gct ggc aac aag act 3619
Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr
1080 1085 1090
cta gcc ggc tta cgg ggt ccg gtc acg cag atg tac tcg agt gct gag 3667
Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu
1095 1100 1105
ggg gac ttg gta ggc tgg ccc agc ccc cct ggg acc aag tct ttg gag 3715
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu
1110 1115 1120 1125
ccg tgc aag tgt gga gcc gtc gac cta tat ctg gtc acg cgg aac gct 3763
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala
1130 1135 1140
gat gtc atc ccg gct cgg aga cgc ggg gac aag cgg gga gca ttg ctc 3811
Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu
1145 1150 1155
tcc ccg aga ccc att tcg acc ttg aag ggg tcc tcg ggg ggg ccg gtg 3859
Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val
1160 1165 1170
ctc tgc cct agg ggc cac gtc gtt ggg ctc ttc cga gca gct gtg tgc 3907
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys
1175 1180 1185
tct cgg ggc gtg gcc aaa tcc atc gat ttc atc ccc gtt gag aca ctc 3955
Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu
1190 1195 1200 1205
gac gtt gtt aca agg tct ccc act ttc agt gac aac agc acg cca ccg 4003
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro
1210 1215 1220
gct gtg ccc cag acc tat cag gtc ggg tac ttg cat gct cca act ggc 4051
Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly
1225 1230 1235
agt gga aag agc acc aag gtc cct gtc gcg tat gcc gcc cag ggg tac 4099
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly Tyr
1240 1245 1250
aaa gta cta gtg ctt aac ccc tcg gta gct gcc acc ctg ggg ttt ggg 4147
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly
1255 1260 1265
gcg tac cta tcc aag gca cat ggc atc aat ccc aac att agg act gga 4195
Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile Arg Thr Gly
1270 1275 1280 1285
gtc agg acc gtg atg acc ggg gag gcc atc acg tac tcc aca tat ggc 4243
Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr Tyr Ser Thr Tyr Gly
1290 1295 1300
aaa ttt ctc gcc gat ggg ggc tgc gct agc ggc gcc tat gac atc atc 4291
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp Ile Ile
1305 1310 1315
ata tgc gat gaa tgc cac gct gtg gat gct acc tcc att ctc ggc atc 4339
Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser Ile Leu Gly Ile
1320 1325 1330
gga acg gtc ctt gat caa gca gag aca gcc ggg gtc aga cta act gtg 4387
Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val
1335 1340 1345
ctg gct acg gcc aca ccc ccc ggg tca gtg aca acc ccc cat ccc gat 4435
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp
1350 1355 1360 1365
ata gaa gag gta ggc ctc ggg cgg gag ggt gag atc ccc ttc tat ggg 4483
Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu Ile Pro Phe Tyr Gly
1370 1375 1380
agg gcg att ccc cta tcc tgc atc aag gga ggg aga cac ctg att ttc 4531
Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly Arg His Leu Ile Phe
1385 1390 1395
tgc cac tca aag aaa aag tgt gac gag ctc gcg gcg gcc ctt cgg ggc 4579
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly
1400 1405 1410
atg ggc ttg aat gcc gtg gca tac tat aga ggg ttg gac gtc tcc ata 4627
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile
1415 1420 1425
ata cca gct cag gga gat gtg gtg gtc gtc gcc acc gac gcc ctc atg 4675
Ile Pro Ala Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met
1430 1435 1440 1445
acg ggg tac act gga gac ttt gac tcc gtg atc gac tgc aat gta gcg 4723
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala
1450 1455 1460
gtc acc caa gct gtc gac ttc agc ctg gac ccc acc ttc act ata acc 4771
Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr
1465 1470 1475
aca cag act gtc cca caa gac gct gtc tca cgc agt cag cgc cgc ggg 4819
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly
1480 1485 1490
cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tcc act ggt 4867
Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg Tyr Val Ser Thr Gly
1495 1500 1505
gaa cga gcc tca gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr
1510 1515 1520 1525
gac gca ggg gct gcg tgg tac gat ctc aca cca gcg gag acc acc gtc 4963
Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val
1530 1535 1540
agg ctt aga gcg tat ttc aac acg ccc ggc cta ccc gtg tgt caa gac 5011
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
1545 1550 1555
cat ctt gaa ttt tgg gag gca gtt ttc acc ggc ctc aca cac ata gac 5059
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp
1560 1565 1570
gcc cac ttc ctc tcc caa aca aag caa gcg ggg gag aac ttc gcg tac 5107
Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly Glu Asn Phe Ala Tyr
1575 1580 1585
cta gta gcc tac caa gct acg gtg tgc gcc aga gcc aag gcc cct ccc 5155
Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro
1590 1595 1600 1605
ccg tcc tgg gac gcc atg tgg aag tgc ctg gcc cga ctc aag cct acg 5203
Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr
1610 1615 1620
ctt gcg ggc ccc aca cct ctc ctg tac cgt ttg ggc cct att acc aat 5251
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Ile Thr Asn
1625 1630 1635
gag gtc acc ctc aca cac cct ggg acg aag tac atc gcc aca tgc atg 5299
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr Ile Ala Thr Cys Met
1640 1645 1650
caa gct gac ctt gag gtc atg acc agc acg tgg gtc cta gct gga gga 5347
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly
1655 1660 1665
gtc ctg gca gcc gtc gcc gca tat tgc ctg gcg act gga tgc gtt tcc 5395
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser
1670 1675 1680 1685
atc atc ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 5443
Ile Ile Gly Arg Leu His Val Asn Gln Arg Val Val Val Ala Pro Asp
1690 1695 1700
aag gag gtc ctg tat gag gct ttt gat gag atg gag gaa tgc gcc tct 5491
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser
1705 1710 1715
agg gcg gct ctc atc gaa gag ggg cag cgg ata gcc gag atg ttg aag 5539
Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile Ala Glu Met Leu Lys
1720 1725 1730
tcc aag atc caa ggc ttg ctg cag cag gcc tct aag cag gcc cag gac 5587
Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala Gln Asp
1735 1740 1745
ata caa ccc gct atg cag gct tca tgg ccc aaa gtg gaa caa ttt tgg 5635
Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys Val Glu Gln Phe Trp
1750 1755 1760 1765
gcc aga cac atg tgg aac ttc att agc ggc atc caa tac ctc gca gga 5683
Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly
1770 1775 1780
ttg tca aca ctg cca ggg aac ccc gcg gtg gct tcc atg atg gca ttc 5731
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe
1785 1790 1795
agt gcc gcc ctc acc agt ccg ttg tcg acc agt acc acc atc ctt ctc 5779
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu
1800 1805 1810
aac atc atg gga ggc tgg tta gcg tcc cag atc gca cca ccc gcg ggg 5827
Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly
1815 1820 1825
gcc acc ggc ttt gtc gtc agt ggc ctg gtg ggg gct gcc gtg ggc agc 5875
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1830 1835 1840 1845
ata ggc ctg ggt aag gtg ctg gtg gac atc ctg gca gga tat ggt gcg 5923
Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala
1850 1855 1860
ggc att tcg ggg gcc ctc gtc gca ttc aag atc atg tct ggc gag aag 5971
Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys
1865 1870 1875
ccc tct atg gaa gat gtc atc aat cta ctg cct ggg atc ctg tct ccg 6019
Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro
1880 1885 1890
gga gcc ctg gtg gtg ggg gtc atc tgc gcg gcc att ctg cgc cgc cac 6067
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His
1895 1900 1905
gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc 6115
Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala
1910 1915 1920 1925
ttt gct tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu
1930 1935 1940
tcg gat gcg tcg cag cgt gtg acc caa cta ctt ggc tct ctt act ata 6211
Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile
1945 1950 1955
acc agc cta ctc aga aga ctc cac aat tgg ata act gag gac tgc ccc 6259
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys Pro
1960 1965 1970
atc cca tgc tcc gga tcc tgg ctc cgc gac gtg tgg gac tgg gtt tgc 6307
Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys
1975 1980 1985
acc atc ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 6355
Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro
1990 1995 2000 2005
aag ctg ccc ggc ctc ccc ttc atc tct tgt caa aag ggg tac aag ggt 6403
Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly
2010 2015 2020
gtg tgg gcc ggc act ggc atc atg acc acg cgc tgc cct tgc ggc gcc 6451
Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala
2025 2030 2035
aac atc tct ggc aat gtc cgc ctg ggc tct atg agg atc aca ggg cct 6499
Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro
2040 2045 2050
aaa acc tgc atg aac acc tgg cag ggg acc ttt cct atc aat tgc tac 6547
Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr
2055 2060 2065
acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gcc 6595
Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala
2070 2075 2080 2085
atc tgg agg gtg gcg gcc tcg gag tac gcg gag gtg acg cag cat ggg 6643
Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly
2090 2095 2100
tcg tac tcc tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys Ile Pro
2105 2110 2115
tgc caa cta cct tct cca gag ttt ttc tcc tgg gtg gac ggt gtg cag 6739
Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln
2120 2125 2130
atc cat agg ttt gca ccc aca cca aag ccg ttt ttc cgg gat gag gtc 6787
Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val
2135 2140 2145
tcg ttc tgc gtt ggg ctt aat tcc tat gct gtc ggg tcc cag ctt ccc 6835
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gln Leu Pro
2150 2155 2160 2165
tgt gaa cct gag ccc gac gca gac gta ttg agg tcc atg cta aca gat 6883
Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp
2170 2175 2180
ccg ccc cac atc acg gcg gag act gcg gcg cgg cgc ttg gca cgg gga 6931
Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly
2185 2190 2195
tca cct cca tct gag gcg agc tcc tca gtg agc cag cta tca gca ccg 6979
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gln Leu Ser Ala Pro
2200 2205 2210
tcg ctg cgg gcc acc tgc acc acc cac agc aac acc tat gac gtg gac 7027
Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp
2215 2220 2225
atg gtc gat gcc aac ctg ctc atg gag ggc ggt gtg gct cag aca gag 7075
Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gln Thr Glu
2230 2235 2240 2245
cct gag tcc agg gtg ccc gtt ctg gac ttt ctc gag cca atg gcc gag 7123
Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu
2250 2255 2260
gaa gag agc gac ctt gag ccc tca ata cca tcg gag tgc atg ctc ccc 7171
Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser Glu Cys Met Leu Pro
2265 2270 2275
agg agc ggg ttt cca cgg gcc tta ccg gct tgg gca cgg cct gac tac 7219
Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr
2280 2285 2290
aac ccg ccg ctc gtg gaa tcg tgg agg agg cca gat tac caa ccg ccc 7267
Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gln Pro Pro
2295 2300 2305
acc gtt gct ggt tgt gct ctc ccc ccc ccc aag aag gcc ccg acg cct 7315
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro
2310 2315 2320 2325
ccc cca agg aga cgc cgg aca gtg ggt ctg agc gag agc acc ata tca 7363
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Ser
2330 2335 2340
gaa gcc ctc cag caa ctg gcc atc aag acc ttt ggc cag ccc ccc tcg 7411
Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe Gly Gln Pro Pro Ser
2345 2350 2355
agc ggt gat gca ggc tcg tcc acg ggg gcg ggc gcc gcc gaa tcc ggc 7459
Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly
2360 2365 2370
ggt ccg acg tcc cct ggt gag ccg gcc ccc tca gag aca ggt tcc gcc 7507
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala
2375 2380 2385
tcc tct atg ccc ccc ctc gag ggg gag cct gga gat ccg gac ctg gag 7555
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu
2390 2395 2400 2405
tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta gct 7603
Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Gly Val Ala
2410 2415 2420
ccc ggt tcg ggc tcg ggg tct tgg tct act tgc tcc gag gag gac gat 7651
Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp
2425 2430 2435
acc acc gtg tgc tgc tcc atg tca tac tcc tgg acc ggg gct cta ata 7699
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Ile
2440 2445 2450
act ccc tgt agc ccc gaa gag gaa aag ttg cca atc aac cct ttg agt 7747
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Pro Leu Ser
2455 2460 2465
aac tcg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tca aag 7795
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys
2470 2475 2480 2485
agc gcc tca cag agg gct aaa aag gta act ttt gac agg acg caa gtg 7843
Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gln Val
2490 2495 2500
ctc gac gcc cat tat gac tca gtc tta aag gac atc aag cta gcg gct 7891
Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala
2505 2510 2515
tcc aag gtc agc gca agg ctc ctc acc ttg gag gag gcg tgc cag ttg 7939
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu
2520 2525 2530
act cca ccc cat tct gca aga tcc aag tat gga ttc ggg gcc aag gag 7987
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu
2535 2540 2545
gtc cgc agc ttg tcc ggg agg gcc gtt aac cac atc aag tcc gtg tgg 8035
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp
2550 2555 2560 2565
aag gac ctc ctg gaa gac cca caa aca cca att ccc aca acc atc atg 8083
Lys Asp Leu Leu Glu Asp Pro Gln Thr Pro Ile Pro Thr Thr Ile Met
2570 2575 2580
gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aag aaa 8131
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys
2585 2590 2595
cca gct cgc ctc atc gtt tac cct gac ctc ggc gtc cgg gtc tgc gag 8179
Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu
2600 2605 2610
aaa atg gcc ctc tat gac att aca caa aag ctt cct cag gcg gta atg 8227
Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu Pro Gln Ala Val Met
2615 2620 2625
gga gct tcc tat ggc ttc cag tac tcc cct gcc caa cgg gtg gag tat 8275
Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Tyr
2630 2635 2640 2645
ctc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt tcg tat 8323
Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr
2650 2655 2660
gat acc cga tgc ttc gac tca acc gtc act gag aga gac atc agg acc 8371
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr
2665 2670 2675
gag gag tcc ata tac cag gcc tgc tcc ctg ccc gag gag gcc cgc act 8419
Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr
2680 2685 2690
gcc ata cac tcg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 8467
Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe
2695 2700 2705
aac agc aag ggt caa acc tgc ggt tac aga cgt tgc cgc gcc agc ggg 8515
Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly
2710 2715 2720 2725
gtg cta acc act agc atg ggt aac acc atc aca tgc tat gtg aaa gcc 8563
Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala
2730 2735 2740
cta gcg gcc tgc aag gct gcg ggg ata gtt gcg ccc aca atg ctg gta 8611
Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala Pro Thr Met Leu Val
2745 2750 2755
tgc ggc gat gac cta gta gtc atc tca gaa agc cag ggg act gag gag 8659
Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu
2760 2765 2770
gac gag cgg aac ctg aga gcc ttc acg gag gcc atg acc agg tac tct 8707
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser
2775 2780 2785
gcc cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag cta ata 8755
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile
2790 2795 2800 2805
aca tcc tgt tcc tca aat gtg tct gtg gcg ttg ggc ccg cgg ggc cgc 8803
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg
2810 2815 2820
cgc aga tac tac ctg acc aga gac cca acc act cca ctc gcc cgg gct 8851
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala
2825 2830 2835
gcc tgg gaa aca gtt aga cac tcc cct atc aat tca tgg ctg gga aac 8899
Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn Ser Trp Leu Gly Asn
2840 2845 2850
atc atc cag tat gct cca acc ata tgg gtt cgc atg gtc cta atg aca 8947
Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr
2855 2860 2865
cac ttc ttc tcc att ctc atg gtc caa gac acc ctg gac cag aac ctc 8995
His Phe Phe Ser Ile Leu Met Val Gln Asp Thr Leu Asp Gln Asn Leu
2870 2875 2880 2885
aac ttt gag atg tat gga tca gta tac tcc gtg aat cct ttg gac ctt 9043
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu
2890 2895 2900
cca gcc ata att gag agg tta cac ggg ctt gac gcc ttt tct atg cac 9091
Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His
2905 2910 2915
aca tac tct cac cac gaa ctg acg cgg gtg gct tca gcc ctc aga aaa 9139
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys
2920 2925 2930
ctt ggg gcg cca ccc ctc agg gtg tgg aag agt cgg gct cgc gca gtc 9187
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val
2935 2940 2945
agg gcg tcc ctc atc tcc cgt gga ggg aaa gcg gcc gtt tgc ggc cga 9235
Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala Ala Val Cys Gly Arg
2950 2955 2960 2965
tat ctc ttc aat tgg gcg gtg aag acc aag ctc aaa ctc act cca ttg 9283
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu
2970 2975 2980
ccg gag gcg cgc cta ctg gac tta tcc agt tgg ttc acc gtc ggc gcc 9331
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala
2985 2990 2995
ggc ggg ggc gac att ttt cac agc gtg tcg cgc gcc cga ccc cgc tca 9379
Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser
3000 3005 3010
tta ctc ttc ggc cta ctc cta ctt ttc gta ggg gta ggc ctc ttc cta 9427
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu
3015 3020 3025
ctc ccc gct cgg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482
Leu Pro Ala Arg
3030
tttttttttt tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc 9542
tctttcttcc cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag 9602
tcacggctag ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc 9662
tctctgcaga tcatgt 9678
<210>28
<211>3033
<212>PRT
<213>丙型肝炎病毒
<400>28
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly
65 70 75 80
Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu
130 135 140
Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala
180 185 190
Gln Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser
195 200 205
Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro
210 215 220
Gly Cys Val Pro Cys Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val
225 230 235 240
Pro Val Ser Pro Asn Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln
245 250 255
Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Phe Cys
260 265 270
Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala
275 280 285
Gln Val Phe Ile Val Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys
290 295 300
Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp
305 310 315 320
Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr
325 330 335
Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Val Ser Gly Ala His
340 345 350
Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp
355 360 365
Ala Lys Val Ile Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly
370 375 380
Thr Thr Thr Val Gly Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala
385 390 395 400
Gly Val Phe Ser His Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr
405 410 415
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430
Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn
435 440 445
Ser Ser Gly Cys Pro Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala
450 455 460
Phe Arg Ile Gly Trp Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn
465 470 475 480
Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys
485 490 495
Gly Val Val Pro Ala Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr
500 505 510
Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr
515 520 525
Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr
530 535 540
Arg Pro Pro Gln Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
545 550 555 560
Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp
565 570 575
Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys
580 585 590
His Pro Asp Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr
595 600 605
Pro Lys Cys Leu Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
610 615 620
Thr Val Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640
Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys
645 650 655
Asp Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser
660 665 670
Thr Thr Glu Trp Ala Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala
675 680 685
Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln
690 695 700
Tyr Met Tyr Gly Leu Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp
705 710 715 720
Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
725 730 735
Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu
740 745 750
Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Asn Cys His Gly
755 760 765
Leu Leu Tyr Phe Ala Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly
770 775 780
Arg Val Val Pro Leu Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe
785 790 795 800
Cys Leu Leu Leu Met Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala
805 810 815
Pro Val His Gly Gln Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu
820 825 830
Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp
835 840 845
Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp
850 855 860
Val Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala
865 870 875 880
Val Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu
885 890 895
Leu Ala Leu Leu Gly Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His
900 905 910
Val Pro Tyr Phe Val Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu
915 920 925
Val Lys Gln Leu Ala Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala
930 935 940
Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
945 950 955 960
Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu
965 970 975
Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala
980 985 990
Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala
995 1000 1005
Arg Leu Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser
1010 1015 1020
Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040
Arg Gly Leu Leu Gly Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg
1045 1050 1055
Thr Glu Gln Ala Gly Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser
1060 1065 1070
Phe Leu Gly Thr Thr Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly
1075 1080 1085
Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met
1090 1095 1100
Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly
1105 1110 1115 1120
Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu
1125 1130 1135
Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys
1140 1145 1150
Arg Gly Ala Leu Leu Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser
1155 1160 1165
Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe
1170 1175 1180
Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile
1185 1190 1195 1200
Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp
1205 1210 1215
Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu
1220 1225 1230
His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr
1235 1240 1245
Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala
1250 1255 1260
Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro
1265 1270 1275 1280
Asn Ile Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr
1285 1290 1295
Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly
1300 1305 1310
Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr
1315 1320 1325
Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly
1330 1335 1340
Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr
1345 1350 1355 1360
Thr Pro His Pro Asp Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu
1365 1370 1375
Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly
1380 1385 1390
Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala
1395 1400 1405
Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly
1410 1415 1420
Leu Asp Val Ser Ile Ile Pro Ala Gln Gly Asp Val Val Val Val Ala
1425 1430 1435 1440
Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile
1445 1450 1455
Asp Cys Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro
1460 1465 1470
Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg
1475 1480 1485
Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg
1490 1495 1500
Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val
1505 1510 1515 1520
Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro
1525 1530 1535
Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu
1540 1545 1550
Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly
1555 1560 1565
Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly
1570 1575 1580
Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600
Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala
1605 1610 1615
Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu
1620 1625 1630
Gly Pro Ile Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr
1635 1640 1645
Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp
1650 1655 1660
Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala
1665 1670 1675 1680
Thr Gly Cys Val Ser Ile Ile Gly Arg Leu His Val Asn Gln Arg Val
1685 1690 1695
Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met
1700 1705 1710
Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile
1715 1720 1725
Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser
1730 1735 1740
Lys Gln Ala Gln Asp Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys
1745 1750 1755 1760
Val Glu Gln Phe Trp Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile
1765 1770 1775
Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala
1780 1785 1790
Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser
1795 1800 1805
Thr Thr Ile Leu Leu Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile
1810 1815 1820
Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly
1825 1830 1835 1840
Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu
1845 1850 1855
Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile
1860 1865 1870
Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro
1875 1880 1885
Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala
1890 1895 1900
Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920
Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr
1925 1930 1935
His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu
1940 1945 1950
Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile
1955 1960 1965
Thr Glu Asp Cys Pro Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val
1970 1975 1980
Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr
1985 1990 1995 2000
Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln
2005 2010 2015
Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg
2020 2025 2030
Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met
2035 2040 2045
Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe
2050 2055 2060
Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr
2065 2070 2075 2080
Asn Tyr Lys Thr Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu
2085 2090 2095
Val Thr Gln His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp
2100 2105 2110
Asn Leu Lys Ile Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp
2115 2120 2125
Val Asp Gly Val Gln Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe
2130 2135 2140
Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val
2145 2150 2155 2160
Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg
2165 2170 2175
Ser Met Leu Thr Asp Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg
2180 2185 2190
Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser
2195 2200 2205
Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn
2210 2215 2220
Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly
2225 2230 2235 2240
Val Ala Gln Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu
2245 2250 2255
Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser
2260 2265 2270
Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp
2275 2280 2285
Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro
2290 2295 2300
Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys
2305 2310 2315 2320
Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser
2325 2330 2335
Glu Ser Thr Ile Ser Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe
2340 2345 2350
Gly Gln Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly
2355 2360 2365
Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser
2370 2375 2380
Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly
2385 2390 2395 2400
Asp Pro Asp Leu Glu Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln
2405 2410 2415
Gly Gly Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys
2420 2425 2430
Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp
2435 2440 2445
Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro
2450 2455 2460
Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr
2465 2470 2475 2480
Cys Thr Thr Ser Lys Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe
2485 2490 2495
Asp Arg Thr Gln Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp
2500 2505 2510
Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu
2515 2520 2525
Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly
2530 2535 2540
Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His
2545 2550 2555 2560
Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gln Thr ProIle
2565 2570 2575
Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala
2580 2585 2590
Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly
2595 2600 2605
Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu
2610 2615 2620
Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala
2625 2630 2635 2640
Gln Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro
2645 2650 2655
Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu
2660 2665 2670
Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro
2675 2680 2685
Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val
2690 2695 2700
Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg
2705 2710 2715 2720
Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr
2725 2730 2735
Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala
2740 2745 2750
Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser
2755 2760 2765
Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala
2770 2775 2780
Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr
2785 2790 2795 2800
Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu
2805 2810 2815
Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr
2820 2825 2830
Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn
2835 2840 2845
Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg
2850 2855 2860
Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Val Gln Asp Thr
2865 2870 2875 2880
Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val
2885 2890 2895
Asn Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp
2900 2905 2910
Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala
2915 2920 2925
Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser
2930 2935 2940
Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala
2945 2950 2955 2960
Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu
2965 2970 2975
Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp
2980 2985 2990
Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg
2995 3000 3005
Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly
3010 3015 3020
Val Gly Leu Phe Leu Leu Pro Ala Arg
3025 3030
<210>29
<211>9678
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:嵌合DNA
<400>29
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 420
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540
agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660
gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 720
gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780
cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840
gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900
cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960
caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020
cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080
cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140
gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200
agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260
catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380
cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500
tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560
caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620
gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680
caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740
gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800
ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860
agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920
cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980
gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040
cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100
ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160
cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220
ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280
atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340
tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400
cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460
tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520
cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580
ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640
tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700
ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760
gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820
ggtaatgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcaggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
tgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acacatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cagctcgcgg ggggtaggta tgttcaggtg gcgctattgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg tcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acggggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtca tcccggctcg 3780
gagacgcggg gacaagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccactt tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgaggg ggagcctgga gatccggacc tggagtctga 7560
tcaggtagag cttcaacctc ccccccaggg ggggggggta gctcccggtt cgggctcggg 7620
gtcttggtct acttgctccg aggaggacga taccaccgtg tgctgctcca tgtcatactc 7680
ctggaccggg gctctaataa ctccctgtag ccccgaagag gaaaagttgc caatcaaccc 7740
tttgagtaac tcgctgttgc gataccataa caaggtgtac tgtacaacat caaagagcgc 7800
ctcacagagg gctaaaaagg taacttttga caggacgcaa gtgctcgacg cccattatga 7860
ctcagtctta aaggacatca agctagcggc ttccaaggtc agcgcaaggc tcctcacctt 7920
ggaggaggcg tgccagttga ctccacccca ttctgcaaga tccaagtatg gattcggggc 7980
caaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040
cctcctggaa gacccacaaa caccaattcc cacaaccatc atggccaaaa atgaggtgtt 8100
ctgcgtggac cccgccaagg ggggtaagaa accagctcgc ctcatcgttt accctgacct 8160
cggcgtccgg gtctgcgaga aaatggccct ctatgacatt acacaaaagc ttcctcaggc 8220
ggtaatggga gcttcctatg gcttccagta ctcccctgcc caacgggtgg agtatctctt 8280
gaaagcatgg gcggaaaaga aggaccccat gggtttttcg tatgataccc gatgcttcga 8340
ctcaaccgtc actgagagag acatcaggac cgaggagtcc atataccagg cctgctccct 8400
gcccgaggag gcccgcactg ccatacactc gctgactgag agactttacg taggagggcc 8460
catgttcaac agcaagggtc aaacctgcgg ttacagacgt tgccgcgcca gcggggtgct 8520
aaccactagc atgggtaaca ccatcacatg ctatgtgaaa gccctagcgg cctgcaaggc 8580
tgcggggata gttgcgccca caatgctggt atgcggcgat gacctagtag tcatctcaga 8640
aagccagggg actgaggagg acgagcggaa cctgagagcc ttcacggagg ccatgaccag 8700
gtactctgcc cctcctggtg atccccccag accggaatat gacctggagc taataacatc 8760
ctgttcctca aatgtgtctg tggcgttggg cccgcggggc cgccgcagat actacctgac 8820
cagagaccca accactccac tcgcccgggc tgcctgggaa acagttagac actcccctat 8880
caattcatgg ctgggaaaca tcatccagta tgctccaacc atatgggttc gcatggtcct 8940
aatgacacac ttcttctcca ttctcatggt ccaagacacc ctggaccaga acctcaactt 9000
tgagatgtat ggatcagtat actccgtgaa tcctttggac cttccagcca taattgagag 9060
gttacacggg cttgacgcct tttctatgca cacatactct caccacgaac tgacgcgggt 9120
ggcttcagcc ctcagaaaac ttggggcgcc acccctcagg gtgtggaaga gtcgggctcg 9180
cgcagtcagg gcgtccctca tctcccgtgg agggaaagcg gccgtttgcg gccgatatct 9240
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggagg cgcgcctact 9300
ggacttatcc agttggttca ccgtcggcgc cggcgggggc gacatttttc acagcgtgtc 9360
gcgcgcccga ccccgctcat tactcttcgg cctactccta cttttcgtag gggtaggcct 9420
cttcctactc cccgctcggt agagcggcac acactaggta cactccatag ctaactgttc 9480
cttttttttt tttttttttt tttttttttt tttttttttt ttttcttttt tttttttttc 9540
cctctttctt cccttctcat cttattctac tttctttctt ggtggctcca tcttagccct 9600
agtcacggct agctgtgaaa ggtccgtgag ccgcatgact gcagagagtg ccgtaactgg 9660
tctctctgca gatcatgt 9678
<210>30
<211>9666
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:嵌合DNA
<400>30
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccagg 180
acgaccgggt cctttcttgg ataaacccgc tcaatgcctg gagatttggg cgtgcccccg 240
caagactgct agccgagtag tgttgggtcg cgaaaggcct tgtggtactg cctgataggg 300
tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacga atcctaaacc 360
tcaaagaaaa accaaacgta acaccaaccg tcgcccacag gacgtcaagt tcccgggtgg 420
cggtcagatc gttggtggag tttacttgtt gccgcgcagg ggccctagat tgggtgtgcg 480
cgcgacgagg aagacttccg agcggtcgca acctcgaggt agacgtcagc ctatccccaa 540
ggcacgtcgg cccgagggca ggacctgggc tcagcccggg tacccttggc ccctctatgg 600
caatgagggt tgcgggtggg cgggatggct cctgtctccc cgtggctctc ggcctagctg 660
gggccccaca gacccccggc gtaggtcgcg caatttgggt aaggtcatcg atacccttac 720
gtgcggcttc gccgacctca tggggtacat accgctcgtc ggcgcccctc ttggaggcgc 780
tgccagggcc ctggcgcatg gcgtccgggt tctggaagac ggcgtgaact atgcaacagg 840
gaaccttcct ggttgctctt tctctatctt ccttctggcc ctgctctctt gcctgactgt 900
gcccgcttca gcctaccaag tgcgcaattc ctcggggctt taccatgtca ccaatgattg 960
ccctaactcg agtattgtgt acgaggcggc cgatgccatc ctgcacactc cggggtgtgt 1020
cccttgcgtt cgcgagggta acgcctcgag gtgttgggtg gcggtgaccc ccacggtggc 1080
caccagggac ggcaaactcc ccacaacgca gcttcgacgt catatcgatc tgcttgtcgg 1140
gagcgccacc ctctgctcgg ccctctacgt gggggacctg tgcgggtctg tctttcttgt 1200
tggtcaactg tttaccttct ctcccaggcg ccactggacg acgcaagact gcaattgttc 1260
tatctatccc ggccatataa cgggtcatcg catggcatgg gatatgatga tgaactggtc 1320
ccctacggca gcgttggtgg tagctcagct gctccggatc ccacaagcca tcatggacat 1380
gatcgctggt gctcactggg gagtcctggc gggcatagcg tatttctcca tggtggggaa 1440
ctgggcgaag gtcctggtag tgctgctgct atttgccggc gtcgacgcgg aaacccacgt 1500
caccggggga aatgccggcc gcaccacggc tgggcttgtt ggtctcctta caccaggcgc 1560
caagcagaac atccaactga tcaacaccaa cggcagttgg cacatcaata gcacggcctt 1620
gaattgcaat gaaagcctta acaccggctg gttagcaggg ctcttctatc aacacaaatt 1680
caactcttca ggctgtcctg agaggttggc cagctgccga cgccttaccg attttgccca 1740
gggctggggt cctatcagtt atgccaacgg aagcggcctc gacgaacgcc cctactgctg 1800
gcactaccct ccaagacctt gtggcattgt gcccgcaaag agcgtgtgtg gcccggtata 1860
ttgcttcact cccagccccg tggtggtggg aacgaccgac aggtcgggcg cgcctaccta 1920
cagctggggt gcaaatgata cggatgtctt cgtccttaac aacaccaggc caccgctggg 1980
caattggttc ggttgtacct ggatgaactc aactggattc accaaagtgt gcggagcgcc 2040
cccttgtgtc atcggagggg tgggcaacaa caccttgctc tgccccactg attgcttccg 2100
caaacatccg gaagccacat actctcggtg cggctccggt ccctggatta cacccaggtg 2160
catggtcgac tacccgtata ggctttggca ctatccttgt accatcaatt acaccatatt 2220
caaagtcagg atgtacgtgg gaggggtcga gcacaggctg gaagcggcct gcaactggac 2280
gcggggcgaa cgctgtgatc tggaagacag ggacaggtcc gagctcagcc cgttgctgct 2340
gtccaccaca cagtggcagg tccttccgtg ttctttcacg accctgccag ccttgtccac 2400
cggcctcatc cacctccacc agaacattgt ggacgtgcag tacttgtacg gggtagggtc 2460
aagcatcgcg tcctgggcca ttaagtggga gtacgtcgtt ctcctgttcc ttctgcttgc 2520
agacgcgcgc gtctgctcct gcttgtggat gatgttactc atatcccaag cggaggcggc 2580
tttggagaac ctcgtaatac tcaatgcagc atccctggcc gggacgcacg gtcttgtgtc 2640
cttcctcgtg ttcttctgct ttgcgtggta tctgaagggt aggtgggtgc ccggagcggt 2700
ctacgccctc tacgggatgt ggcctctcct cctgctcctg ctggcgttgc ctcagcgggc 2760
atacgcatat gacgcacctg tgcacggaca gataggcgtg ggtttgttga tattgatcac 2820
cctcttcaca ctcaccccgg ggtataagac cctcctcggc cagtgtctgt ggtggttgtg 2880
ctatctcctg accctggggg aagccatgat tcaggagtgg gtaccaccca tgcaggtgcg 2940
cggcggccgc gatggcatcg cgtgggccgt cactatattc tgcccgggtg tggtgtttga 3000
cattaccaaa tggcttttgg cgttgcttgg gcctgcttac ctcttaaggg ccgctttgac 3060
acatgtgccg tacttcgtca gagctcacgc tctgataagg gtatgcgctt tggtgaagca 3120
gctcgcgggg ggtaggtatg ttcaggtggc gctattggcc cttggcaggt ggactggcac 3180
ctacatctat gaccacctca cacctatgtc ggactgggcc gctagcggcc tgcgcgactt 3240
agcggtcgcc gtggaaccca tcatcttcag tccgatggag aagaaggtca tcgtctgggg 3300
agcggagacg gctgcatgtg gggacattct acatggactt cccgtgtccg cccgactcgg 3360
ccaggagatc ctcctcggcc cagctgatgg ctacacctcc aaggggtgga agctccttgc 3420
tcccatcact gcttatgccc agcaaacacg aggcctcctg ggcgccatag tggtgagtat 3480
gacggggcgt gacaggacag aacaggccgg ggaagtccaa atcctgtcca cagtctctca 3540
gtccttcctc ggaacaacca tctcgggggt tttgtggact gtttaccacg gagctggcaa 3600
caagactcta gccggcttac ggggtccggt cacgcagatg tactcgagtg ctgaggggga 3660
cttggtaggc tggcccagcc cccctgggac caagtctttg gagccgtgca agtgtggagc 3720
cgtcgaccta tatctggtca cgcggaacgc tgatgtcatc ccggctcgga gacgcgggga 3780
caagcgggga gcattgctct ccccgagacc catttcgacc ttgaaggggt cctcgggggg 3840
gccggtgctc tgccctaggg gccacgtcgt tgggctcttc cgagcagctg tgtgctctcg 3900
gggcgtggcc aaatccatcg atttcatccc cgttgagaca ctcgacgttg ttacaaggtc 3960
tcccactttc agtgacaaca gcacgccacc ggctgtgccc cagacctatc aggtcgggta 4020
cttgcatgct ccaactggca gtggaaagag caccaaggtc cctgtcgcgt atgccgccca 4080
ggggtacaaa gtactagtgc ttaacccctc ggtagctgcc accctggggt ttggggcgta 4140
cctatccaag gcacatggca tcaatcccaa cattaggact ggagtcagga ccgtgatgac 4200
cggggaggcc atcacgtact ccacatatgg caaatttctc gccgatgggg gctgcgctag 4260
cggcgcctat gacatcatca tatgcgatga atgccacgct gtggatgcta cctccattct 4320
cggcatcgga acggtccttg atcaagcaga gacagccggg gtcagactaa ctgtgctggc 4380
tacggccaca ccccccgggt cagtgacaac cccccatccc gatatagaag aggtaggcct 4440
cgggcgggag ggtgagatcc ccttctatgg gagggcgatt cccctatcct gcatcaaggg 4500
agggagacac ctgattttct gccactcaaa gaaaaagtgt gacgagctcg cggcggccct 4560
tcggggcatg ggcttgaatg ccgtggcata ctatagaggg ttggacgtct ccataatacc 4620
agctcaggga gatgtggtgg tcgtcgccac cgacgccctc atgacggggt acactggaga 4680
ctttgactcc gtgatcgact gcaatgtagc ggtcacccaa gctgtcgact tcagcctgga 4740
ccccaccttc actataacca cacagactgt cccacaagac gctgtctcac gcagtcagcg 4800
ccgcgggcgc acaggtagag gaagacaggg cacttatagg tatgtttcca ctggtgaacg 4860
agcctcagga atgtttgaca gtgtagtgct ttgtgagtgc tacgacgcag gggctgcgtg 4920
gtacgatctc acaccagcgg agaccaccgt caggcttaga gcgtatttca acacgcccgg 4980
cctacccgtg tgtcaagacc atcttgaatt ttgggaggca gttttcaccg gcctcacaca 5040
catagacgcc cacttcctct cccaaacaaa gcaagcgggg gagaacttcg cgtacctagt 5100
agcctaccaa gctacggtgt gcgccagagc caaggcccct cccccgtcct gggacgccat 5160
gtggaagtgc ctggcccgac tcaagcctac gcttgcgggc cccacacctc tcctgtaccg 5220
tttgggccct attaccaatg aggtcaccct cacacaccct gggacgaagt acatcgccac 5280
atgcatgcaa gctgaccttg aggtcatgac cagcacgtgg gtcctagctg gaggagtcct 5340
ggcagccgtc gccgcatatt gcctggcgac tggatgcgtt tccatcatcg gccgcttgca 5400
cgtcaaccag cgagtcgtcg ttgcgccgga taaggaggtc ctgtatgagg cttttgatga 5460
gatggaggaa tgcgcctcta gggcggctct catcgaagag gggcagcgga tagccgagat 5520
gttgaagtcc aagatccaag gcttgctgca gcaggcctct aagcaggccc aggacataca 5580
acccgctatg caggcttcat ggcccaaagt ggaacaattt tgggccagac acatgtggaa 5640
cttcattagc ggcatccaat acctcgcagg attgtcaaca ctgccaggga accccgcggt 5700
ggcttccatg atggcattca gtgccgccct caccagtccg ttgtcgacca gtaccaccat 5760
ccttctcaac atcatgggag gctggttagc gtcccagatc gcaccacccg cgggggccac 5820
cggctttgtc gtcagtggcc tggtgggggc tgccgtgggc agcataggcc tgggtaaggt 5880
gctggtggac atcctggcag gatatggtgc gggcatttcg ggggccctcg tcgcattcaa 5940
gatcatgtct ggcgagaagc cctctatgga agatgtcatc aatctactgc ctgggatcct 6000
gtctccggga gccctggtgg tgggggtcat ctgcgcggcc attctgcgcc gccacgtggg 6060
accgggggag ggcgcggtcc aatggatgaa caggcttatt gcctttgctt ccagaggaaa 6120
ccacgtcgcc cctactcact acgtgacgga gtcggatgcg tcgcagcgtg tgacccaact 6180
acttggctct cttactataa ccagcctact cagaagactc cacaattgga taactgagga 6240
ctgccccatc ccatgctccg gatcctggct ccgcgacgtg tgggactggg tttgcaccat 6300
cttgacagac ttcaaaaatt ggctgacctc taaattgttc cccaagctgc ccggcctccc 6360
cttcatctct tgtcaaaagg ggtacaaggg tgtgtgggcc ggcactggca tcatgaccac 6420
gcgctgccct tgcggcgcca acatctctgg caatgtccgc ctgggctcta tgaggatcac 6480
agggcctaaa acctgcatga acacctggca ggggaccttt cctatcaatt gctacacgga 6540
gggccagtgc gcgccgaaac cccccacgaa ctacaagacc gccatctgga gggtggcggc 6600
ctcggagtac gcggaggtga cgcagcatgg gtcgtactcc tatgtaacag gactgaccac 6660
tgacaatctg aaaattcctt gccaactacc ttctccagag tttttctcct gggtggacgg 6720
tgtgcagatc cataggtttg cacccacacc aaagccgttt ttccgggatg aggtctcgtt 6780
ctgcgttggg cttaattcct atgctgtcgg gtcccagctt ccctgtgaac ctgagcccga 6840
cgcagacgta ttgaggtcca tgctaacaga tccgccccac atcacggcgg agactgcggc 6900
gcggcgcttg gcacggggat cacctccatc tgaggcgagc tcctcagtga gccagctatc 6960
agcaccgtcg ctgcgggcca cctgcaccac ccacagcaac acctatgacg tggacatggt 7020
cgatgccaac ctgctcatgg agggcggtgt ggctcagaca gagcctgagt ccagggtgcc 7080
cgttctggac tttctcgagc caatggccga ggaagagagc gaccttgagc cctcaatacc 7140
atcggagtgc atgctcccca ggagcgggtt tccacgggcc ttaccggctt gggcacggcc 7200
tgactacaac ccgccgctcg tggaatcgtg gaggaggcca gattaccaac cgcccaccgt 7260
tgctggttgt gctctccccc cccccaagaa ggccccgacg cctcccccaa ggagacgccg 7320
gacagtgggt ctgagcgaga gcaccatatc agaagccctc cagcaactgg ccatcaagac 7380
ctttggccag cccccctcga gcggtgatgc aggctcgtcc acgggggcgg gcgccgccga 7440
atccggcggt ccgacgtccc ctggtgagcc ggccccctca gagacaggtt ccgcctcctc 7500
tatgcccccc ctcgaggggg agcctggaga tccggacctg gagtctgatc aggtagagct 7560
tcaacctccc ccccaggggg ggggggtagc tcccggttcg ggctcggggt cttggtctac 7620
ttgctccgag gaggacgata ccaccgtgtg ctgctccatg tcatactcct ggaccggggc 7680
tctaataact ccctgtagcc ccgaagagga aaagttgcca atcaaccctt tgagtaactc 7740
gctgttgcga taccataaca aggtgtactg tacaacatca aagagcgcct cacagagggc 7800
taaaaaggta acttttgaca ggacgcaagt gctcgacgcc cattatgact cagtcttaaa 7860
ggacatcaag ctagcggctt ccaaggtcag cgcaaggctc ctcaccttgg aggaggcgtg 7920
ccagttgact ccaccccatt ctgcaagatc caagtatgga ttcggggcca aggaggtccg 7980
cagcttgtcc gggagggccg ttaaccacat caagtccgtg tggaaggacc tcctggaaga 8040
cccacaaaca ccaattccca caaccatcat ggccaaaaat gaggtgttct gcgtggaccc 8100
cgccaagggg ggtaagaaac cagctcgcct catcgtttac cctgacctcg gcgtccgggt 8160
ctgcgagaaa atggccctct atgacattac acaaaagctt cctcaggcgg taatgggagc 8220
ttcctatggc ttccagtact cccctgccca acgggtggag tatctcttga aagcatgggc 8280
ggaaaagaag gaccccatgg gtttttcgta tgatacccga tgcttcgact caaccgtcac 8340
tgagagagac atcaggaccg aggagtccat ataccaggcc tgctccctgc ccgaggaggc 8400
ccgcactgcc atacactcgc tgactgagag actttacgta ggagggccca tgttcaacag 8460
caagggtcaa acctgcggtt acagacgttg ccgcgccagc ggggtgctaa ccactagcat 8520
gggtaacacc atcacatgct atgtgaaagc cctagcggcc tgcaaggctg cggggatagt 8580
tgcgcccaca atgctggtat gcggcgatga cctagtagtc atctcagaaa gccaggggac 8640
tgaggaggac gagcggaacc tgagagcctt cacggaggcc atgaccaggt actctgcccc 8700
tcctggtgat ccccccagac cggaatatga cctggagcta ataacatcct gttcctcaaa 8760
tgtgtctgtg gcgttgggcc cgcggggccg ccgcagatac tacctgacca gagacccaac 8820
cactccactc gcccgggctg cctgggaaac agttagacac tcccctatca attcatggct 8880
gggaaacatc atccagtatg ctccaaccat atgggttcgc atggtcctaa tgacacactt 8940
cttctccatt ctcatggtcc aagacaccct ggaccagaac ctcaactttg agatgtatgg 9000
atcagtatac tccgtgaatc ctttggacct tccagccata attgagaggt tacacgggct 9060
tgacgccttt tctatgcaca catactctca ccacgaactg acgcgggtgg cttcagccct 9120
cagaaaactt ggggcgccac ccctcagggt gtggaagagt cgggctcgcg cagtcagggc 9180
gtccctcatc tcccgtggag ggaaagcggc cgtttgcggc cgatatctct tcaattgggc 9240
ggtgaagacc aagctcaaac tcactccatt gccggaggcg cgcctactgg acttatccag 9300
ttggttcacc gtcggcgccg gcgggggcga catttttcac agcgtgtcgc gcgcccgacc 9360
ccgctcatta ctcttcggcc tactcctact tttcgtaggg gtaggcctct tcctactccc 9420
cgctcggtag agcggcacac actaggtaca ctccatagct aactgttcct tttttttttt 9480
tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc tctttcttcc 9540
cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag tcacggctag 9600
ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc tctctgcaga 9660
tcatgt 9666
<210>31
<211>9666
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成DNA
<400>31
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccagg 180
acgaccgggt cctttcttgg atcaacccgc tcaatgcctg gagatttggg cgtgcccccg 240
cgagactgct agccgagtag tgttgggtcg cgaaaggcct tgtggtactg cctgataggg 300
tgcttgcgag tgccccggga ggtctcgtag accgtgcatc atgagcacaa atcctaaacc 360
ccaaagaaaa accaaacgta acaccaaccg tcgcccacag gacgttaagt tcccgggcgg 420
tggtcagatc gtcggtggag tttacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
tgcgactagg aagacttccg agcggtcgca acctcgtgga aggcgacaac ctatccccaa 540
ggctcgccgg cccgagggca ggacctgggc tcagcctggg tatccttggc ccctctatgg 600
caatgagggc ttggggtggg caggatggct cctgtcaccc cgcggctctc ggcctagttg 660
gggccctaat gacccccggc gtaggtcgcg taatttgggt aaggtcatcg atacccttac 720
atgcggcttc gccgacctca tggggtacat cccgcttgtc ggcgccccct tagggggcgc 780
tgccagggcc ctggcacatg gtgtccgggt tctggaggac ggcgtgaact atgcaacagg 840
gaatttgccc ggttgctctt tctctatctt cctcttagct ctgctgtcct gtttgaccat 900
cccagcttcc gcttatgaag tgcgcaacgt gtccgggata taccatgtca caaacgactg 960
ctccaactca agcattgtgt atgaggcggc ggacgtgatc atgcatgccc ccgggtgcgt 1020
gccctgcgtt cgggagaaca attcctcccg ttgctgggta gcgctcactc ccacgctcgc 1080
ggccaggaat gccagcgtcc ccactacgac attacgacgc cacgtcgact tgctcgttgg 1140
gacggctgct ttctgctccg ctatgtacgt gggggatctc tgcggatctg ttttcctcat 1200
ctcccagctg ttcaccttct cgcctcgccg gcatgagaca gtacaggact gcaactgctc 1260
aatctatccc ggccacgtat caggccatcg tatggcttgg gatatgatga tgaactggtc 1320
gcccacggca gccttagtgg tgtcgcagtt actccggatc ccacaagctg tcatggacat 1380
ggtggcgggg gcccactggg gagtcctagc gggccttgcc tactattcca tggtggggaa 1440
ctgggctaag gttttgattg tgatgctact ctttgccggc gttgacgggc atacccgcgt 1500
gacggggggg gtgcaaggcc atgtcacctc tacactcacg tccctcttta gacctggggc 1560
gtcccagaaa attcagcttg taaacaccaa tggcagttgg catatcaaca ggactgccct 1620
gaactgcaat gactccctca aaactgggtt tcttgccgcg ctgttctaca cacacaagtt 1680
caacgcgtcc ggatgcccgg agcgcatggc cagctgtcgc tccattgaca agttcgacca 1740
gggatggggt cccatcacct atgctcaacc tgacaactcg gaccagaggc cgtattgctg 1800
gcactacgca cctcgacagt gtggtatcgt acccgcgtcg caggtgtgcg gtccagtgta 1860
ttgcttcacc ccaagccctg ttgtagtggg gacgaccgat cgtttcggcg cccctacgta 1920
taactggggg gacaatgaga cggacgtgct gctcctaaac aacacgcggc cgccgcatgg 1980
caactggttc ggctgtacat ggatgaatag cactgggttc accaagacgt gcggaggccc 2040
cccgtgtaac atcagggggg tcggcaacaa caccttgacc tgccccacgg actgcttccg 2100
gaagcacccc gacgccactt acacaaaatg tggttcgggc ccttggttga cacctaggtg 2160
cttggttgac tacccataca ggctctggca ctacccctgc actgtcaact ttaccatctt 2220
caaggttagg atgtatgtgg ggggcgtgga gcacaggctt gatgctgcat gcaactggac 2280
tcgaggagag cgttgcgact tggaggacag ggatagagca gagctcagcc cgctattgct 2340
gtctacaaca gagtggcaga tactgccctg ttcctacacc accctaccgg ctctgtccac 2400
tggtttaatc cacctccacc agaacatcgt ggacatacaa tacctgtacg gtatagggtc 2460
ggcggtcgtc tccattgcca tcaagtggga gtatgtcgtg ctgctcttcc ttctcctggc 2520
ggacgcgcgc gtctgtgcct gcttgtggat gatgctgctg atagcccagg ccgaggctgc 2580
cttagagaac ttggtggtcc tcaatgcggc gtccgtggtc ggagcgcatg gcatgctccc 2640
cttctttatg ttcttctgtg ccgcctggta catgaagggc aggctggtcc ctggagcggc 2700
atacgctttc tacggtgtat ggccgctgct cctgctcctg ctagcattac caccacgagc 2760
ttacgcctat gacgcacctg tgcacggaca gataggcgtg ggtttgttga tattgatcac 2820
cctcttcaca ctcaccccgg ggtataagac cctcctcggc cagtgtctgt ggtggttgtg 2880
ctatctcctg accctggggg aagccatgat tcaggagtgg gtaccaccca tgcaggtgcg 2940
cggcggccgc gatggcatcg cgtgggccgt cactatattc tgcccgggtg tggtgtttga 3000
cattaccaaa tggcttttgg cgttgcttgg gcctgcttac ctcttaaggg ccgctttgac 3060
acatgtgccg tacttcgtca gagctcacgc tctgataagg gtatgcgctt tggtgaagca 3120
gctcgcgggg ggtaggtatg ttcaggtggc gctattggcc cttggcaggt ggactggcac 3180
ctacatctat gaccacctca cacctatgtc ggactgggcc gctagcggcc tgcgcgactt 3240
agcggtcgcc gtggaaccca tcatcttcag tccgatggag aagaaggtca tcgtctgggg 3300
agcggagacg gctgcatgtg gggacattct acatggactt cccgtgtccg cccgactcgg 3360
ccaggagatc ctcctcggcc cagctgatgg ctacacctcc aaggggtgga agctccttgc 3420
tcccatcact gcttatgccc agcaaacacg aggcctcctg ggcgccatag tggtgagtat 3480
gacggggcgt gacaggacag aacaggccgg ggaagtccaa atcctgtcca cagtctctca 3540
gtccttcctc ggaacaacca tctcgggggt tttgtggact gtttaccacg gagctggcaa 3600
caagactcta gccggcttac ggggtccggt cacgcagatg tactcgagtg ctgaggggga 3660
cttggtaggc tggcccagcc cccctgggac caagtctttg gagccgtgca agtgtggagc 3720
cgtcgaccta tatctggtca cgcggaacgc tgatgtcatc ccggctcgga gacgcgggga 3780
caagcgggga gcattgctct ccccgagacc catttcgacc ttgaaggggt cctcgggggg 3840
gccggtgctc tgccctaggg gccacgtcgt tgggctcttc cgagcagctg tgtgctctcg 3900
gggcgtggcc aaatccatcg atttcatccc cgttgagaca ctcgacgttg ttacaaggtc 3960
tcccactttc agtgacaaca gcacgccacc ggctgtgccc cagacctatc aggtcgggta 4020
cttgcatgct ccaactggca gtggaaagag caccaaggtc cctgtcgcgt atgccgccca 4080
ggggtacaaa gtactagtgc ttaacccctc ggtagctgcc accctggggt ttggggcgta 4140
cctatccaag gcacatggca tcaatcccaa cattaggact ggagtcagga ccgtgatgac 4200
cggggaggcc atcacgtact ccacatatgg caaatttctc gccgatgggg gctgcgctag 4260
cggcgcctat gacatcatca tatgcgatga atgccacgct gtggatgcta cctccattct 4320
cggcatcgga acggtccttg atcaagcaga gacagccggg gtcagactaa ctgtgctggc 4380
tacggccaca ccccccgggt cagtgacaac cccccatccc gatatagaag aggtaggcct 4440
cgggcgggag ggtgagatcc ccttctatgg gagggcgatt cccctatcct gcatcaaggg 4500
agggagacac ctgattttct gccactcaaa gaaaaagtgt gacgagctcg cggcggccct 4560
tcggggcatg ggcttgaatg ccgtggcata ctatagaggg ttggacgtct ccataatacc 4620
agctcaggga gatgtggtgg tcgtcgccac cgacgccctc atgacggggt acactggaga 4680
ctttgactcc gtgatcgact gcaatgtagc ggtcacccaa gctgtcgact tcagcctgga 4740
ccccaccttc actataacca cacagactgt cccacaagac gctgtctcac gcagtcagcg 4800
ccgcgggcgc acaggtagag gaagacaggg cacttatagg tatgtttcca ctggtgaacg 4860
agcctcagga atgtttgaca gtgtagtgct ttgtgagtgc tacgacgcag gggctgcgtg 4920
gtacgatctc acaccagcgg agaccaccgt caggcttaga gcgtatttca acacgcccgg 4980
cctacccgtg tgtcaagacc atcttgaatt ttgggaggca gttttcaccg gcctcacaca 5040
catagacgcc cacttcctct cccaaacaaa gcaagcgggg gagaacttcg cgtacctagt 5100
agcctaccaa gctacggtgt gcgccagagc caaggcccct cccccgtcct gggacgccat 5160
gtggaagtgc ctggcccgac tcaagcctac gcttgcgggc cccacacctc tcctgtaccg 5220
tttgggccct attaccaatg aggtcaccct cacacaccct gggacgaagt acatcgccac 5280
atgcatgcaa gctgaccttg aggtcatgac cagcacgtgg gtcctagctg gaggagtcct 5340
ggcagccgtc gccgcatatt gcctggcgac tggatgcgtt tccatcatcg gccgcttgca 5400
cgtcaaccag cgagtcgtcg ttgcgccgga taaggaggtc ctgtatgagg cttttgatga 5460
gatggaggaa tgcgcctcta gggcggctct catcgaagag gggcagcgga tagccgagat 5520
gttgaagtcc aagatccaag gcttgctgca gcaggcctct aagcaggccc aggacataca 5580
acccgctatg caggcttcat ggcccaaagt ggaacaattt tgggccagac acatgtggaa 5640
cttcattagc ggcatccaat acctcgcagg attgtcaaca ctgccaggga accccgcggt 5700
ggcttccatg atggcattca gtgccgccct caccagtccg ttgtcgacca gtaccaccat 5760
ccttctcaac atcatgggag gctggttagc gtcccagatc gcaccacccg cgggggccac 5820
cggctttgtc gtcagtggcc tggtgggggc tgccgtgggc agcataggcc tgggtaaggt 5880
gctggtggac atcctggcag gatatggtgc gggcatttcg ggggccctcg tcgcattcaa 5940
gatcatgtct ggcgagaagc cctctatgga agatgtcatc aatctactgc ctgggatcct 6000
gtctccggga gccctggtgg tgggggtcat ctgcgcggcc attctgcgcc gccacgtggg 6060
accgggggag ggcgcggtcc aatggatgaa caggcttatt gcctttgctt ccagaggaaa 6120
ccacgtcgcc cctactcact acgtgacgga gtcggatgcg tcgcagcgtg tgacccaact 6180
acttggctct cttactataa ccagcctact cagaagactc cacaattgga taactgagga 6240
ctgccccatc ccatgctccg gatcctggct ccgcgacgtg tgggactggg tttgcaccat 6300
cttgacagac ttcaaaaatt ggctgacctc taaattgttc cccaagctgc ccggcctccc 6360
cttcatctct tgtcaaaagg ggtacaaggg tgtgtgggcc ggcactggca tcatgaccac 6420
gcgctgccct tgcggcgcca acatctctgg caatgtccgc ctgggctcta tgaggatcac 6480
agggcctaaa acctgcatga acacctggca ggggaccttt cctatcaatt gctacacgga 6540
gggccagtgc gcgccgaaac cccccacgaa ctacaagacc gccatctgga gggtggcggc 6600
ctcggagtac gcggaggtga cgcagcatgg gtcgtactcc tatgtaacag gactgaccac 6660
tgacaatctg aaaattcctt gccaactacc ttctccagag tttttctcct gggtggacgg 6720
tgtgcagatc cataggtttg cacccacacc aaagccgttt ttccgggatg aggtctcgtt 6780
ctgcgttggg cttaattcct atgctgtcgg gtcccagctt ccctgtgaac ctgagcccga 6840
cgcagacgta ttgaggtcca tgctaacaga tccgccccac atcacggcgg agactgcggc 6900
gcggcgcttg gcacggggat cacctccatc tgaggcgagc tcctcagtga gccagctatc 6960
agcaccgtcg ctgcgggcca cctgcaccac ccacagcaac acctatgacg tggacatggt 7020
cgatgccaac ctgctcatgg agggcggtgt ggctcagaca gagcctgagt ccagggtgcc 7080
cgttctggac tttctcgagc caatggccga ggaagagagc gaccttgagc cctcaatacc 7140
atcggagtgc atgctcccca ggagcgggtt tccacgggcc ttaccggctt gggcacggcc 7200
tgactacaac ccgccgctcg tggaatcgtg gaggaggcca gattaccaac cgcccaccgt 7260
tgctggttgt gctctccccc cccccaagaa ggccccgacg cctcccccaa ggagacgccg 7320
gacagtgggt ctgagcgaga gcaccatatc agaagccctc cagcaactgg ccatcaagac 7380
ctttggccag cccccctcga gcggtgatgc aggctcgtcc acgggggcgg gcgccgccga 7440
atccggcggt ccgacgtccc ctggtgagcc ggccccctca gagacaggtt ccgcctcctc 7500
tatgcccccc ctcgaggggg agcctggaga tccggacctg gagtctgatc aggtagagct 7560
tcaacctccc ccccaggggg ggggggtagc tcccggttcg ggctcggggt cttggtctac 7620
ttgctccgag gaggacgata ccaccgtgtg ctgctccatg tcatactcct ggaccggggc 7680
tctaataact ccctgtagcc ccgaagagga aaagttgcca atcaaccctt tgagtaactc 7740
gctgttgcga taccataaca aggtgtactg tacaacatca aagagcgcct cacagagggc 7800
taaaaaggta acttttgaca ggacgcaagt gctcgacgcc cattatgact cagtcttaaa 7860
ggacatcaag ctagcggctt ccaaggtcag cgcaaggctc ctcaccttgg aggaggcgtg 7920
ccagttgact ccaccccatt ctgcaagatc caagtatgga ttcggggcca aggaggtccg 7980
cagcttgtcc gggagggccg ttaaccacat caagtccgtg tggaaggacc tcctggaaga 8040
cccacaaaca ccaattccca caaccatcat ggccaaaaat gaggtgttct gcgtggaccc 8100
cgccaagggg ggtaagaaac cagctcgcct catcgtttac cctgacctcg gcgtccgggt 8160
ctgcgagaaa atggccctct atgacattac acaaaagctt cctcaggcgg taatgggagc 8220
ttcctatggc ttccagtact cccctgccca acgggtggag tatctcttga aagcatgggc 8280
ggaaaagaag gaccccatgg gtttttcgta tgatacccga tgcttcgact caaccgtcac 8340
tgagagagac atcaggaccg aggagtccat ataccaggcc tgctccctgc ccgaggaggc 8400
ccgcactgcc atacactcgc tgactgagag actttacgta ggagggccca tgttcaacag 8460
caagggtcaa acctgcggtt acagacgttg ccgcgccagc ggggtgctaa ccactagcat 8520
gggtaacacc atcacatgct atgtgaaagc cctagcggcc tgcaaggctg cggggatagt 8580
tgcgcccaca atgctggtat gcggcgatga cctagtagtc atctcagaaa gccaggggac 8640
tgaggaggac gagcggaacc tgagagcctt cacggaggcc atgaccaggt actctgcccc 8700
tcctggtgat ccccccagac cggaatatga cctggagcta ataacatcct gttcctcaaa 8760
tgtgtctgtg gcgttgggcc cgcggggccg ccgcagatac tacctgacca gagacccaac 8820
cactccactc gcccgggctg cctgggaaac agttagacac tcccctatca attcatggct 8880
gggaaacatc atccagtatg ctccaaccat atgggttcgc atggtcctaa tgacacactt 8940
cttctccatt ctcatggtcc aagacaccct ggaccagaac ctcaactttg agatgtatgg 9000
atcagtatac tccgtgaatc ctttggacct tccagccata attgagaggt tacacgggct 9060
tgacgccttt tctatgcaca catactctca ccacgaactg acgcgggtgg cttcagccct 9120
cagaaaactt ggggcgccac ccctcagggt gtggaagagt cgggctcgcg cagtcagggc 9180
gtccctcatc tcccgtggag ggaaagcggc cgtttgcggc cgatatctct tcaattgggc 9240
ggtgaagacc aagctcaaac tcactccatt gccggaggcg cgcctactgg acttatccag 9300
ttggttcacc gtcggcgccg gcgggggcga catttttcac agcgtgtcgc gcgcccgacc 9360
ccgctcatta ctcttcggcc tactcctact tttcgtaggg gtaggcctct tcctactccc 9420
cgctcggtag agcggcacac actaggtaca ctccatagct aactgttcct tttttttttt 9480
tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc tctttcttcc 9540
cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag tcacggctag 9600
ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc tctctgcaga 9660
tcatgt 9666
<210>32
<211>9666
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成DNA
<400>32
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccagg 180
acgaccgggt cctttcttgg atcaacccgc tcaatgcctg gagatttggg cgtgcccccg 240
cgagactgct agccgagtag tgttgggtcg cgaaaggcct tgtggtactg cctgataggg 300
tgcttgcgag tgccccggga ggtctcgtag accgtgcatc atgagcacaa atcctaaacc 360
ccaaagaaaa accaaacgta acaccaaccg tcgcccacag gacgttaagt tcccgggcgg 420
tggtcagatc gtcggtggag tttacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
tgcgactagg aagacttccg agcggtcgca acctcgtgga aggcgacaac ctatccccaa 540
ggctcgccgg cccgagggca ggacctgggc tcagcctggg tatccttggc ccctctatgg 600
caatgagggc ttggggtggg caggatggct cctgtcaccc cgcggctctc ggcctagttg 660
gggccctaat gacccccggc gtaggtcgcg taatttgggt aaggtcatcg atacccttac 720
atgcggcttc gccgacctca tggggtacat cccgcttgtc ggcgccccct tagggggcgc 780
tgccagggcc ctggcacatg gtgtccgggt tctggaggac ggcgtgaact atgcaacagg 840
gaatttgccc ggttgctctt tctctatctt cctcttagct ctgctgtcct gtttgaccat 900
cccagcttcc gcttatgaag tgcgcaacgt gtccgggata taccatgtca caaacgactg 960
ctccaactca agcattgtgt atgaggcggc ggacgtgatc atgcatgccc ccgggtgcgt 1020
gccctgcgtt cgggagaaca attcctcccg ttgctgggta gcgctcactc ccacgctcgc 1080
ggccaggaat gccagcgtcc ccactacgac attacgacgc cacgtcgact tgctcgttgg 1140
gacggctgct ttctgctccg ctatgtacgt gggggatctc tgcggatctg ttttcctcat 1200
ctcccagctg ttcaccttct cgcctcgccg gcatgagaca gtacaggact gcaactgctc 1260
aatctatccc ggccacgtat caggccatcg tatggcttgg gatatgatga tgaactggtc 1320
gcccacggca gccttagtgg tgtcgcagtt actccggatc ccacaagctg tcatggacat 1380
ggtggcgggg gcccactggg gagtcctagc gggccttgcc tactattcca tggtggggaa 1440
ctgggctaag gttttgattg tgatgctact ctttgccggc gttgacgggc atacccgcgt 1500
gacggggggg gtgcaaggcc atgtcacctc tacactcacg tccctcttta gacctggggc 1560
gtcccagaaa attcagcttg taaacaccaa tggcagttgg catatcaaca ggactgccct 1620
gaactgcaat gactccctca aaactgggtt tcttgccgcg ctgttctaca cacacaagtt 1680
caacgcgtcc ggatgcccgg agcgcatggc cagctgtcgc tccattgaca agttcgacca 1740
gggatggggt cccatcacct atgctcaacc tgacaactcg gaccagaggc cgtattgctg 1800
gcactacgca cctcgacagt gtggtatcgt acccgcgtcg caggtgtgcg gtccagtgta 1860
ttgcttcacc ccaagccctg ttgtagtggg gacgaccgat cgtttcggcg cccctacgta 1920
taactggggg gacaatgaga cggacgtgct gctcctaaac aacacgcggc cgccgcatgg 1980
caactggttc ggctgtacat ggatgaatag cactgggttc accaagacgt gcggaggccc 2040
cccgtgtaac atcagggggg tcggcaacaa caccttgacc tgccccacgg actgcttccg 2100
gaagcacccc gacgccactt acacaaaatg tggttcgggc ccttggttga cacctaggtg 2160
cttggttgac tacccataca ggctctggca ctacccctgc actgtcaact ttaccatctt 2220
caaggttagg atgtatgtgg ggggcgtgga gcacaggctt gatgctgcat gcaactggac 2280
tcgaggagag cgttgcgact tggaggacag ggatagagca gagctcagcc cgctattgct 2340
gtctacaaca gagtggcaga tactgccctg ttcctacacc accctaccgg ctctgtccac 2400
tggtttaatc cacctccacc agaacatcgt ggacatacaa tacctgtacg gtatagggtc 2460
ggcggtcgtc tccattgcca tcaagtggga gtatgtcgtg ctgctcttcc ttctcctggc 2520
ggacgcgcgc gtctgtgcct gcttgtggat gatgctgctg atagcccagg ccgaggctgc 2580
cttagagaac ttggtggtcc tcaatgcggc gtccgtggtc ggagcgcatg gcatgctccc 2640
cttctttatg ttcttctgtg ccgcctggta catgaagggc aggctggtcc ctggagcggc 2700
atacgctttc tacggtgtat ggccgctgct cctgctcctg ctagcattac caccacgagc 2760
ttacgccatg gaccgggaga tggttgcatc ttgcggaggc ggggtttttg taggtctagc 2820
actcctgacc ttgtcaccat actgtaaagt gttcctcgct aggctcatat ggtggttaca 2880
atattttatc accaaagccg aggcgcattt gcaagtgtcg ctcccccccc tcaacgttcg 2940
aggcggacgc gatgccatca tcctcctcat gtgcgcggtc cacccagagc taatctttga 3000
catcaccaaa cttctgctct ccatactcgg tccgctcatg gtgctccaag ctagtttaat 3060
ccgagtgccg tacttcgtgc gcgctcaagg gctcattcgc gcatgcatgt tggtgcggaa 3120
agctgccggg ggccattatg tccaaatggc cttcgtgaag ctagctgcgc tgacaggcac 3180
gtacgtttat gaccacctca ctccactgca ggattgggcc catgtgggcc tacgagacct 3240
tgcggtggca gtagagcccg ttgtcttttc tgccatggag accaaggtca tcacctgggg 3300
ggcagacacc gcggcgtgtg gggacattat ctcaggtcta cccgtctccg cccgaagggg 3360
gaaggagata cttttgggac cggccgatag ttttgaaggg caggggtggc gactccttgc 3420
tcccatcact gcttatgccc agcaaacacg aggcctcctg ggcgccatag tggtgagtat 3480
gacggggcgt gacaggacag aacaggccgg ggaagtccaa atcctgtcca cagtctctca 3540
gtccttcctc ggaacaacca tctcgggggt tttgtggact gtttaccacg gagctggcaa 3600
caagactcta gccggcttac ggggtccggt cacgcagatg tactcgagtg ctgaggggga 3660
cttggtaggc tggcccagcc cccctgggac caagtctttg gagccgtgca agtgtggagc 3720
cgtcgaccta tatctggtca cgcggaacgc tgatgtcatc ccggctcgga gacgcgggga 3780
caagcgggga gcattgctct ccccgagacc catttcgacc ttgaaggggt cctcgggggg 3840
gccggtgctc tgccctaggg gccacgtcgt tgggctcttc cgagcagctg tgtgctctcg 3900
gggcgtggcc aaatccatcg atttcatccc cgttgagaca ctcgacgttg ttacaaggtc 3960
tcccactttc agtgacaaca gcacgccacc ggctgtgccc cagacctatc aggtcgggta 4020
cttgcatgct ccaactggca gtggaaagag caccaaggtc cctgtcgcgt atgccgccca 4080
ggggtacaaa gtactagtgc ttaacccctc ggtagctgcc accctggggt ttggggcgta 4140
cctatccaag gcacatggca tcaatcccaa cattaggact ggagtcagga ccgtgatgac 4200
cggggaggcc atcacgtact ccacatatgg caaatttctc gccgatgggg gctgcgctag 4260
cggcgcctat gacatcatca tatgcgatga atgccacgct gtggatgcta cctccattct 4320
cggcatcgga acggtccttg atcaagcaga gacagccggg gtcagactaa ctgtgctggc 4380
tacggccaca ccccccgggt cagtgacaac cccccatccc gatatagaag aggtaggcct 4440
cgggcgggag ggtgagatcc ccttctatgg gagggcgatt cccctatcct gcatcaaggg 4500
agggagacac ctgattttct gccactcaaa gaaaaagtgt gacgagctcg cggcggccct 4560
tcggggcatg ggcttgaatg ccgtggcata ctatagaggg ttggacgtct ccataatacc 4620
agctcaggga gatgtggtgg tcgtcgccac cgacgccctc atgacggggt acactggaga 4680
ctttgactcc gtgatcgact gcaatgtagc ggtcacccaa gctgtcgact tcagcctgga 4740
ccccaccttc actataacca cacagactgt cccacaagac gctgtctcac gcagtcagcg 4800
ccgcgggcgc acaggtagag gaagacaggg cacttatagg tatgtttcca ctggtgaacg 4860
agcctcagga atgtttgaca gtgtagtgct ttgtgagtgc tacgacgcag gggctgcgtg 4920
gtacgatctc acaccagcgg agaccaccgt caggcttaga gcgtatttca acacgcccgg 4980
cctacccgtg tgtcaagacc atcttgaatt ttgggaggca gttttcaccg gcctcacaca 5040
catagacgcc cacttcctct cccaaacaaa gcaagcgggg gagaacttcg cgtacctagt 5100
agcctaccaa gctacggtgt gcgccagagc caaggcccct cccccgtcct gggacgccat 5160
gtggaagtgc ctggcccgac tcaagcctac gcttgcgggc cccacacctc tcctgtaccg 5220
tttgggccct attaccaatg aggtcaccct cacacaccct gggacgaagt acatcgccac 5280
atgcatgcaa gctgaccttg aggtcatgac cagcacgtgg gtcctagctg gaggagtcct 5340
ggcagccgtc gccgcatatt gcctggcgac tggatgcgtt tccatcatcg gccgcttgca 5400
cgtcaaccag cgagtcgtcg ttgcgccgga taaggaggtc ctgtatgagg cttttgatga 5460
gatggaggaa tgcgcctcta gggcggctct catcgaagag gggcagcgga tagccgagat 5520
gttgaagtcc aagatccaag gcttgctgca gcaggcctct aagcaggccc aggacataca 5580
acccgctatg caggcttcat ggcccaaagt ggaacaattt tgggccagac acatgtggaa 5640
cttcattagc ggcatccaat acctcgcagg attgtcaaca ctgccaggga accccgcggt 5700
ggcttccatg atggcattca gtgccgccct caccagtccg ttgtcgacca gtaccaccat 5760
ccttctcaac atcatgggag gctggttagc gtcccagatc gcaccacccg cgggggccac 5820
cggctttgtc gtcagtggcc tggtgggggc tgccgtgggc agcataggcc tgggtaaggt 5880
gctggtggac atcctggcag gatatggtgc gggcatttcg ggggccctcg tcgcattcaa 5940
gatcatgtct ggcgagaagc cctctatgga agatgtcatc aatctactgc ctgggatcct 6000
gtctccggga gccctggtgg tgggggtcat ctgcgcggcc attctgcgcc gccacgtggg 6060
accgggggag ggcgcggtcc aatggatgaa caggcttatt gcctttgctt ccagaggaaa 6120
ccacgtcgcc cctactcact acgtgacgga gtcggatgcg tcgcagcgtg tgacccaact 6180
acttggctct cttactataa ccagcctact cagaagactc cacaattgga taactgagga 6240
ctgccccatc ccatgctccg gatcctggct ccgcgacgtg tgggactggg tttgcaccat 6300
cttgacagac ttcaaaaatt ggctgacctc taaattgttc cccaagctgc ccggcctccc 6360
cttcatctct tgtcaaaagg ggtacaaggg tgtgtgggcc ggcactggca tcatgaccac 6420
gcgctgccct tgcggcgcca acatctctgg caatgtccgc ctgggctcta tgaggatcac 6480
agggcctaaa acctgcatga acacctggca ggggaccttt cctatcaatt gctacacgga 6540
gggccagtgc gcgccgaaac cccccacgaa ctacaagacc gccatctgga gggtggcggc 6600
ctcggagtac gcggaggtga cgcagcatgg gtcgtactcc tatgtaacag gactgaccac 6660
tgacaatctg aaaattcctt gccaactacc ttctccagag tttttctcct gggtggacgg 6720
tgtgcagatc cataggtttg cacccacacc aaagccgttt ttccgggatg aggtctcgtt 6780
ctgcgttggg cttaattcct atgctgtcgg gtcccagctt ccctgtgaac ctgagcccga 6840
cgcagacgta ttgaggtcca tgctaacaga tccgccccac atcacggcgg agactgcggc 6900
gcggcgcttg gcacggggat cacctccatc tgaggcgagc tcctcagtga gccagctatc 6960
agcaccgtcg ctgcgggcca cctgcaccac ccacagcaac acctatgacg tggacatggt 7020
cgatgccaac ctgctcatgg agggcggtgt ggctcagaca gagcctgagt ccagggtgcc 7080
cgttctggac tttctcgagc caatggccga ggaagagagc gaccttgagc cctcaatacc 7140
atcggagtgc atgctcccca ggagcgggtt tccacgggcc ttaccggctt gggcacggcc 7200
tgactacaac ccgccgctcg tggaatcgtg gaggaggcca gattaccaac cgcccaccgt 7260
tgctggttgt gctctccccc cccccaagaa ggccccgacg cctcccccaa ggagacgccg 7320
gacagtgggt ctgagcgaga gcaccatatc agaagccctc cagcaactgg ccatcaagac 7380
ctttggccag cccccctcga gcggtgatgc aggctcgtcc acgggggcgg gcgccgccga 7440
atccggcggt ccgacgtccc ctggtgagcc ggccccctca gagacaggtt ccgcctcctc 7500
tatgcccccc ctcgaggggg agcctggaga tccggacctg gagtctgatc aggtagagct 7560
tcaacctccc ccccaggggg ggggggtagc tcccggttcg ggctcggggt cttggtctac 7620
ttgctccgag gaggacgata ccaccgtgtg ctgctccatg tcatactcct ggaccggggc 7680
tctaataact ccctgtagcc ccgaagagga aaagttgcca atcaaccctt tgagtaactc 7740
gctgttgcga taccataaca aggtgtactg tacaacatca aagagcgcct cacagagggc 7800
taaaaaggta acttttgaca ggacgcaagt gctcgacgcc cattatgact cagtcttaaa 7860
ggacatcaag ctagcggctt ccaaggtcag cgcaaggctc ctcaccttgg aggaggcgtg 7920
ccagttgact ccaccccatt ctgcaagatc caagtatgga ttcggggcca aggaggtccg 7980
cagcttgtcc gggagggccg ttaaccacat caagtccgtg tggaaggacc tcctggaaga 8040
cccacaaaca ccaattccca caaccatcat ggccaaaaat gaggtgttct gcgtggaccc 8100
cgccaagggg ggtaagaaac cagctcgcct catcgtttac cctgacctcg gcgtccgggt 8160
ctgcgagaaa atggccctct atgacattac acaaaagctt cctcaggcgg taatgggagc 8220
ttcctatggc ttccagtact cccctgccca acgggtggag tatctcttga aagcatgggc 8280
ggaaaagaag gaccccatgg gtttttcgta tgatacccga tgcttcgact caaccgtcac 8340
tgagagagac atcaggaccg aggagtccat ataccaggcc tgctccctgc ccgaggaggc 8400
ccgcactgcc atacactcgc tgactgagag actttacgta ggagggccca tgttcaacag 8460
caagggtcaa acctgcggtt acagacgttg ccgcgccagc ggggtgctaa ccactagcat 8520
gggtaacacc atcacatgct atgtgaaagc cctagcggcc tgcaaggctg cggggatagt 8580
tgcgcccaca atgctggtat gcggcgatga cctagtagtc atctcagaaa gccaggggac 8640
tgaggaggac gagcggaacc tgagagcctt cacggaggcc atgaccaggt actctgcccc 8700
tcctggtgat ccccccagac cggaatatga cctggagcta ataacatcct gttcctcaaa 8760
tgtgtctgtg gcgttgggcc cgcggggccg ccgcagatac tacctgacca gagacccaac 8820
cactccactc gcccgggctg cctgggaaac agttagacac tcccctatca attcatggct 8880
gggaaacatc atccagtatg ctccaaccat atgggttcgc atggtcctaa tgacacactt 8940
cttctccatt ctcatggtcc aagacaccct ggaccagaac ctcaactttg agatgtatgg 9000
atcagtatac tccgtgaatc ctttggacct tccagccata attgagaggt tacacgggct 9060
tgacgccttt tctatgcaca catactctca ccacgaactg acgcgggtgg cttcagccct 9120
cagaaaactt ggggcgccac ccctcagggt gtggaagagt cgggctcgcg cagtcagggc 9180
gtccctcatc tcccgtggag ggaaagcggc cgtttgcggc cgatatctct tcaattgggc 9240
ggtgaagacc aagctcaaac tcactccatt gccggaggcg cgcctactgg acttatccag 9300
ttggttcacc gtcggcgccg gcgggggcga catttttcac agcgtgtcgc gcgcccgacc 9360
ccgctcatta ctcttcggcc tactcctact tttcgtaggg gtaggcctct tcctactccc 9420
cgctcggtag agcggcacac actaggtaca ctccatagct aactgttcct tttttttttt 9480
tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc tctttcttcc 9540
cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag tcacggctag 9600
ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc tctctgcaga 9660
tcatgt 9666
<210>33
<211>18
<212>DNA
<213>人工序列
<220>
<221>n is A,G,C or T
<222>(7)
<220>
<223>人工序列说明:合成的
<400>33
cgtctcntat tacctgcc 18
<210>34
<211>18
<212>DNA
<213>人工序列
<220>
<221>n is A,G,C or T
<222>(12)
<220>
<223>人工序列说明:合成的
<400>34
gatcatgtcc cngagacg 18
<210>35
<211>18
<212>DNA
<213>人工序列
<220>
<221>n is A,G,C or T
<222>(7)
<220>
<223>人工序列说明:合成的
<400>35
gcagagnata atggacgg 18
<210>36
<211>18
<212>DNA
<213>人工序列
<220>
<221>n is A,G,C or T
<222>(12)
<220>
<223>人工序列说明:合成的
<400>36
ctagtacagg gnctctgc 18
<210>37
<211>33
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>37
gggttattgg agacggtacc gtctcctccc ccc 33
<210>38
<211>33
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>38
cccaataacc tctgccatgg cagaggaggg ggg 33
<210>39
<211>11
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>39
tattacctgc c 11
<210>40
<211>11
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>40
ctagtacagg g 11
<210>41
<211>15
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>41
gggttattac ctgcc 15
<210>42
<211>14
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>42
gatcatgtcc cccc 14
<210>43
<211>15
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>43
cccaataatg gacgg 15
<210>44
<211>14
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>44
ctagtacagg gggg 14
<210>45
<211>32
<212>DNA
<213>人工序列
<220>
<223>人工序列说明:合成的
<400>45
tgctggcttt gatgaaaacc tgcccctaat ag 32