CN1173036C - 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 - Google Patents
马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 Download PDFInfo
- Publication number
- CN1173036C CN1173036C CNB991058526A CN99105852A CN1173036C CN 1173036 C CN1173036 C CN 1173036C CN B991058526 A CNB991058526 A CN B991058526A CN 99105852 A CN99105852 A CN 99105852A CN 1173036 C CN1173036 C CN 1173036C
- Authority
- CN
- China
- Prior art keywords
- leu
- lys
- gly
- ile
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/15011—Lentivirus, not HIV, e.g. FIV, SIV
- C12N2740/15022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Abstract
本发明提供了马传染性贫血病毒(EIAV)驴白细胞弱毒疫苗株含有8258个碱基的前病毒DNA全长基因序列及其结构,其所编码的全部蛋白质的基因序列和氨基酸序列和蛋白质的二级结构,以及该疫苗毒株的调控序列和非必需区序列。这些序列和结构的数据可应用于包括艾滋病毒在内的所有慢病毒属病毒的疫苗研制和使用该属病毒作为载体进行的基因治疗,还可应用于对EIAV的血清学及分子生物学诊断方法及其所需试剂的研制。
Description
技术领域
本发明属于病毒学和分子生物学专业领域,涉及对一种慢病毒疫苗株的全长基因序列的发现,确切地说是阐明我国成功研制的用于控制马传染性贫血病的减毒活疫苗毒株的全基因结构和序列、其编码蛋白质的序列以及它们的应用范围。
背景技术
马传染性贫血病毒(Equine Infectious Anemia Virus,EIAV)是引起马属动物发生传染性贫血症病原体,对畜牧业具有巨大危害,是兽医界颇受重视的病原体之一。EIAV属逆转录病毒科慢病毒属(lentivirus),与同属于慢病毒属的人类免疫缺陷病毒(Human ImmunodificiencyVirus,HIV)在基因组结构,基因编码蛋白以及基因调控方式等方面有许多相似之处(J.M.Coffin,The Structure And Classification of Retroviruses,in:TheRetroviridae,Vol.1,p19,edited by J.A.Levy,Plenum press)。由于EIAV是最早发现的动物病毒之一和最先发现的慢病毒,其基因高度变异引起慢性迁延性的疾病和自身免疫性病理,从而成为研究慢病毒感染、免疫病理、病毒酶功能以及免疫保护机理的重要动物模型(R.C.Montelaro et,al Equine Retroviruses,In:vol.2,P.257)。
我国自六十年代起投入巨资对该病毒的生物学性状进行了研究,分离并培育出与国外EIAV毒株有明显生物学差异的强毒株并进行了体外驴白细胞传代。经过多年努力,逐步驯化了该病毒并使之成为无致病力,但可以使动物在接种之后产生免疫保护的疫苗株。该疫苗株自1976年开始生产,1978年在全国大规模应用(沈荣显等,马传染性贫血免疫的研究。中国农业科学,第4期P1-15,1979)。至今已接种7000万匹次的马、骡、驴,完全控制了该病在我国的流行。
对病毒基因组的研究是在70年代分子生物学技术发展并得到广泛应用之后才兴起的。现在GenBank中已发表的马传染性贫血病毒的基因组序列均来源于美国强毒株(Wyoming株)和日本强毒株(Goshun株),以及由它们衍生得到的细胞培养适应株的基因序列。然而这些毒株均不是疫苗毒株。我国研制成功的EIAV弱毒疫苗至今还是目前世界上唯一经过大规模应用、长时间检验而被证明是安全和有效的慢病毒疫苗(R.C.Montelaro,et al.in:Vaccines against Retroviruses,Vol.4,P605,R.C.Montelaro et,al Equine Retroviruses,in:vol.2,P.257)。由于我国的马传染性贫血疫苗毒株是由经典路线制造出的,其基因组序列尚未被阐明。这一方面不能从基因水平上保护我国的EIAV疫苗的知识产权,另一方面也限制了该疫苗模型对其他慢病毒疫苗研制的指导作用。
发明内容
本发明的任务在于阐明一种能有效保护马属动物免于患马传染性贫血病的EIAV弱毒活疫苗株的全基因的结构和核苷酸序列及由其编码蛋白的结构和氨基酸序列,保护我国自主知识产权。同时经过将EIAV弱毒疫苗基因序列和蛋白结构与EIAV强毒株进行对比,揭示该疫苗毒株在传代过程中毒力减弱的分子机理和其诱导保护性免疫的组分和机理,从而为至今尚未突破的包括HIV在内的其他慢病毒疫苗的研究提供重要的参考。该发明可直接指导艾滋病疫苗和其它慢病毒疫苗的研制,进一步研究该疫苗毒株及其各主要基因和所编码的蛋白质可分别用于EIAV疫苗株和EIAV强毒株的核酸鉴别诊断和血清学鉴别诊断试剂的研制,并且用其构建的载体还可用于基因治疗。
本发明是通过以下技术手段实现的:首先用PCR方法扩增EIAV疫苗株的基因,分别克隆到质粒载体中再进行DNA序列的分析,得到该病毒的全长基因序列。
经传统生物学手段研制的EIAV驴白细胞弱毒疫苗毒株(第125代),来源于农业部授权该毒种保存单位中国农业科学院哈尔滨兽医研究所马传染性贫血研究室。该疫苗毒株在体外驴白细胞复制过程中以前病毒DNA的形式整合到驴白细胞的染色体上,本发明取此前病毒DNA作为扩增病毒基因的材料。本发明首先利用基因组DNA提取试剂盒从病毒感染的驴白细胞中提取染色体DNA,并以此为模板用PCR方法扩增EIAV疫苗毒株的前病毒DNA。扩增引物的设计是根据国际EIAV强毒株序列,先用其各区段EIAV基因进行预扩增。经过多次摸索,并根据对得到的扩增片段进行测序中获得的EIAV疫苗株的部分序列资料,设计出EIAV疫苗弱毒株特异性引物,分段扩增病毒基因并克隆到质粒载体上,进而对全部基因进行克隆和序列测定,获得了病毒基因组全长序列(见说明书附图1)和其主要结构基因(gag基因,pol基因和env基因)及主要调控基因(5’LTR,3’LTR,rev基因,S2基因,tat基因等)的序列(分别见说明书附图4,6,8,2,3,12,14,10)。
利用GCG软件对全长基因序列的开放读码框架进行分析,得到各个结构基因及调控基因所编码的蛋白质的氨基酸序列(详细序列分别见说明书附图5,7,9,11,13,15)。各基因在全长基因序列中的组合方式以及它们的相对位置见说明书附图1,各基因在全长基因序列中的具体位置见说明书附图17。
将得到的疫苗株序列与GenBank所发表的国际标准株序列(Wyoming株,GeneBankAccession Number:AF028232)进行核苷酸和氨基酸同源性的比较,结果发现,各基因与国外标准野毒株的核苷酸同源性在73.46-90.06%之间,其中env基因、rev基因和S2基因与国际标准株的差异较大,同源性分别为73.46%,73.54%和75.76%。氨基酸序列与国际标准株相应序列的同源性比较结果发现,外膜蛋白(Env蛋白)及Rev蛋白和S2蛋白的变异均较大,氨基酸同源性分别为67.41%,64.85%和54.54%(详见说明书附图18)。
另外,还利用GCG软件对该疫苗株各结构基因和调控基因所编码蛋白质的二级结构进行预测,分析结果见说明附图19,20,21,22,22,23,24。疫苗株Env和Tat蛋白二级结构与国外标准株的相应蛋白的二级结构的比较发现有显著差异(见说明书附图25,26)。马传贫驴白细胞疫苗株的Env蛋白与国外标准强毒株(AF028232)的Env蛋白在多个区域的α螺旋、β片层和转角等结构都有不同,其中转角结构的数量和位置的不同,可能是导致了两者间二级结构有明显差异的主要原因。马传贫驴白细胞疫苗株的Tat蛋白的二级结构图羧基端有一明显的疏水基团(菱形框所示),其临近区域为β片层结构并形成较集中的亲水基团,其氨基端有四个转角结构;国外标准强毒株(AF028232)的Tat蛋白的二级结构图的羧基端无疏水基团,其临近区域为松散的无规则卷曲结构,并有两个独立的亲水基团,其氨基端有丰富的转角结构。
通过对氨基酸序列分析发现,马传贫疫苗株的Env蛋白包含19个潜在的糖基化位点,国外标准强毒株(AF028232)的Env蛋白包含23个潜在的糖基化位点,各个位点的位置见见说明书附图27。
通过对各个基因编码的蛋白组成及等电点(PI)进行分析,得到各个蛋白的分子量和等电点,具体数据见说明书附图28。
本发明是在国内外首次阐明我国研制的EIAV弱毒疫苗株的全基因序列,将从基因水平上提供对我国自主研制的该疫苗的知识产权保护。鉴于我国的EIAV弱毒疫苗是目前世界上唯一经受过长时间和大规模现场应用验证的安全而有效的慢病毒疫苗,本发明阐明的该疫苗弱毒株全基因序列和结构的特征、各结构基因和调控基因的特征及其编码的蛋白质的特征和功能将为其他慢病毒疫苗的研制提供重要的指导,并将大大推动这些疫苗研究的进程。当前艾滋病毒(HIV)疫苗的研究正处于进退两难之中,一方面现有各类基因工程HIV疫苗均未显示明显有效的迹象,而另一方面显示出一定保护作用的HIV弱毒活疫苗又因不安全而无法推进(邵一鸣,艾滋病疫苗研究现状及其发展方向,中国科学发展报告,1999,94-101,科学出版社)。历史上在人及动物最有效的疫苗都是弱毒活疫苗(R.C.Montelaro,etal.in:Vaccines against Retroviruses,Vol.4,P605,)。由于HIV和EIAV均属慢病毒,其基因结构和编码蛋白的功能均很相似,因而EIAV弱毒疫苗的成功经验为艾滋病毒弱毒疫苗的研制提供了很好的借鉴。本发明使得这种对HIV疫苗具有重要意义的借鉴由可能变为现实。该发明除可直接指导艾滋病疫苗和其它慢病毒疫苗的研制外,通过进一步研究该疫苗毒株及其各主要基因和所编码的蛋白质可分别用于对EIAV疫苗株和强毒株的核酸诊断和血清学诊断。使用EIAV弱毒疫苗株构建携带外源基因的载体还有望用于基因治疗。
具体实施方式
以下实施例对于本发明马传染性贫血病毒(EIAV)驴白细胞弱毒疫苗全长基因序列的应用进行详细的说明,但不意味着限制本发明的内容。
实施例1 马传染性贫血病毒(EIAV)驴白细胞弱毒疫苗全长基因结构及序列分析可用于阐明该疫苗毒株的致弱机制和诱导保护性免疫的机理
将EIAV弱毒疫苗全长基因序列与国外EIAV强毒株、中国EIAV强毒株及部分致弱株的序列进行比较,从基因和蛋白水平来阐明强弱毒株之间在结构和功能上的差异,从中确定与EIAV的致病性和免疫原性相关的基因及其蛋白的组分或基因及蛋白的调控机制,最终阐明EIAV驴白细胞弱毒疫苗毒力致弱和诱导保护性免疫的基因和蛋白结构及其调控的机理,还可为研究逆转录病毒的基因调控和基因功能提供重要的理论基础。
实施例2 马传染性贫血病毒驴白细胞弱毒疫苗全长基因结构和序列可用于艾滋病毒(HIV)及其他慢病毒减毒活疫苗的构建。
根据实施例1的设计方案,参照马传染性贫血病毒弱毒疫苗的致弱和免疫保护机制对艾滋病毒(HIV)及其它已发现的动物慢病毒,包括猴免疫缺损病毒(Simian ImmunodeficiencyVirus,SIV)、猫免疫缺损病毒(Feline Immunodeficiency Virus,FIV)、牛免疫缺损病毒(BovineImmunodeficiency Virus,BIV)、维斯纳慢病毒(Visna Lentivirus)、和山羊关节炎脑炎病毒(Caprine Arthritis-Encephalitis Virus,CAEV)进行基因改造,从而构建相应的减毒活疫苗,进行实验室和临床研究。
实施例3 马传染性贫血病毒驴白细胞弱毒疫苗全长基因结构及序列和其编码蛋白的氨基酸及其二、三级结构分析可用于艾滋病毒(HIV)及其他慢病毒基因工程疫苗的构建。
根据实施例1的设计方案,将艾滋病毒(HIV)及其他慢病毒(SIV、FIV、BIV、Visna和CAEV等)的基因组避免或去除与毒力有关的基因,或根据基因序列及蛋白二级结构分析结果(图25、26、27)对有关蛋白进行结构改造,将能诱导保护性免疫的蛋白的基因分别克隆到各类表达载体中,构建成各种形式(多肽、亚单位、病毒样颗粒和活载体等)的基因工程疫苗,进行实验室和临床研究。
实施例4 马传染性贫血病毒驴白细胞弱毒疫苗全长基因序列及其编码的蛋白质的氨基酸序列可用于构建基因重组抗原,用于马传染性贫血病毒感染的血清学诊断。
根据马传染性贫血病毒疫苗株全基因序列及其编码的氨基酸序列分析的研究结果,可以分别选取包含主要抗原表位的基因,构建原核和真核表达载体,在大肠杆菌或真核细胞中表达EIAV蛋白,经层析方法纯化后可用于血清学诊断和制备诊断试剂。
实施例5 马传染性贫血病毒驴白细胞弱毒疫苗全长基因序列可用于马传染性贫血病毒疫苗株和野毒株感染的PCR鉴别诊断。
根据实施例2的研究结果,选择EIAV野毒株和疫苗株的基因序列差异最大的区域设计PCR引物,根据PCR扩增产物的差异来对强弱毒株的感染进行鉴别诊断。
实施例6 马传染性贫血病毒驴白细胞弱毒疫苗全长基因序列可用于构建进行基因治疗的基因转移载体。
利用中国株马传染性贫血病毒疫苗株构建用于基因治疗的基因转移载体。马传染性贫血病毒不能引起人类发病,采用文献报道的方法构建源于该疫苗株的基因转移载体,不但可以克服以往所用的鼠白血病病毒来源的基因转移载体的基因转移效率较低和不能转染未分裂细胞等缺陷,而且有较好的安全保障。
实施例8 利用GCG软件包对马传染性贫血病毒驴白细胞疫苗株(chb101)的Env、Tat蛋白的二级结构进行预测分析,并与国外强毒株(GenBank Accession Number:AF028232)的Env、Tat蛋白的二级结构进行比较(如图25、图26所示),可以发现EIAV强毒株与疫苗毒株的蛋白在二级结构水平上有显著差异,其中α螺旋、β片层和转角结构的数量和位置均有不同。强烈提示这种蛋白质高级结构的差异可能是它们功能差异的基础,是基因工程、疫苗的重要候选抗原。根据此原理,也可对艾滋病毒和其他慢病毒的相应蛋白基因进行改造,以研究将其作为疫苗抗原的可能性。
附图说明
以下为说明书附图的简要说明:
图1为马传染性贫血病毒驴白细胞弱毒疫苗株全长基因序列(8258个核苷酸),5’-3’方向。
图2为5’LTR(位于全长基因的第1-325位核苷酸)的全部DNA序列(5’-3’),包括U3区(核苷酸1-207位)、R区(核苷酸208-285位)和U5区(核苷酸286-325位),全长325个核苷酸。
图3为3’LTR(位于全长基因的第7922-8258位核苷酸)的DNA序列(5’-3’),包括U3区(核苷酸7922-8140位)、R区(核苷酸8141-8218位)和U5区(核苷酸8219-8258位),全长337个核苷酸。
图4为gag基因(位于全长基因的第466-1926位核苷酸)的全部DNA序列(5’-3’),全长1461个核苷酸。
图5为gag基因编码的氨基酸序列,全长486个核苷酸。
图6为pol基因(位于全长基因的第1689-5120位核苷酸)的全部DNA序列(5’-3’)。
图7为pol基因编码的氨基酸序列,全长1143个核苷酸。
图8为Env基因(位于全长基因的第5313-7904位核苷酸)的全部DNA序列(5’-3’)。
图9为env基因编码的氨基酸序列,全长863个核苷酸。
图10为tat基因(包括第一外显子位于全长基因的第365-462位核苷酸,第二外显子位于全长基因的第5138-5276位核苷酸)的全部DNA序列(5’-3’),全长237个核苷酸。
图11为tat基因编码的氨基酸序列,全长78个核苷酸。
图12为rev基因(包括第一外显子位于全长基因的第5454-5546位核苷酸,第二外显子位于全长基因的第7250-7651位核苷酸)的全部DNA序列(5’-3’),全长495个核苷酸。
图13为rev基因编码的氨基酸序列,全长164个核苷酸。
图14为S2基因(位于全长基因的第5287-5493位核苷酸)的全部DNA序列(5’-3’)。
图15为S2基因编码的氨基酸序列,全长68个核苷酸。
图16为EIAV驴白细胞弱毒疫苗株的基因结构图,图中可见5’LTR,3’LTR,gag,pol,env,S2,rev,tat基因的相对位置和长度。
图17为马传染性贫血病毒驴白细胞弱毒疫苗株各功能基因在全长基因序列中的位置。
图18为马传染性贫血病毒驴白细胞弱毒疫苗株各功能基因的核苷酸序列及其编码的氨基酸序列与国际标准株(Wyoming株,GeneBank Accession Number:AF028232)相应序列的同源性比较结果。
图19为马传染性贫血病毒驴白细胞弱毒疫苗株env基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图20为马传染性贫血病毒驴白细胞弱毒疫苗株gag基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图21为马传染性贫血病毒驴白细胞弱毒疫苗株pol基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图22为马传染性贫血病毒驴白细胞弱毒疫苗株rev基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图23为马传染性贫血病毒驴白细胞弱毒疫苗株tat基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图24为马传染性贫血病毒驴白细胞弱毒疫苗株S2基因编码蛋白的二级结构示意图,图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。
图25为马传贫驴白细胞疫苗株的Env蛋白与国外强毒株(GenBank Accession Number:AF028232)的Env蛋白的二级结构的比较示意图。图中chb101为疫苗株Env蛋白的二级结构图,af028232为国外强毒Env蛋白的二级结构图。图标1所指多边形框代表亲水基因,图标2所指菱形框代表疏水基因。比较可见二者在多个区域的α螺旋、β片层和转角等结构都有不同,其中转角结构的数量和位置的不同,可能是导致了两者间二级结构有明显差异的主要原因。
图26为马传贫驴白细胞疫苗株的Tat蛋白与国外强毒株(GenBank Accession Number:AF028232)的Tat蛋白的二级结构的比较示意图。图中chb101为疫苗株Tat蛋白的二级结构图,af028232为国外强毒株Tat蛋白的二级结构图。二者比较可见chb101的羧基端有一明显的疏水基团(菱形框所示),其临近区域为β片层结构并形成较集中的亲水基团,其氨基端有四个转角结构;af028232的羧基端无疏水基团,其临近区域为松散的无规则卷曲结构,并有两个独立的亲水基团,其氨基端有丰富的转角结构。
图27为马传贫弱毒疫苗株Env蛋白与国外强毒株Env蛋白潜在的糖基化位点图。图中chb101-env为为马传贫弱毒疫苗株Env蛋白的氨基酸序列,28232-env为国外强毒株Env蛋白的氨基酸序列,方框内为潜在的糖基化位点的序列。
图28为马传贫弱毒疫苗株各功能蛋白的分子量及等电点的理论值,其中env基因编码的前体蛋白包含19个糖基化位点,完全糖化后分子量视糖基化程度而定,表中所列为未经糖基化的蛋白分子量。
序列表
<110> 卫生部艾滋病预防与控制中心
中国农业科学院哈尔滨兽医研究所
<120> 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列
<130> P2000121C
<140> PCT/CN00/00096
<141> 2000-04-21
<150> CN99105852.6
<151> 1999-04-21
<160> 13
<170> PatentIn version 3.1
<210> 1
<211> 8258
<212> DNA
<213> 马传染性贫血病毒驴白细胞弱毒疫苗株(EIAV)
<400> 1
tgtggggttt ttatgagggg ttttataaat gattataaga gtaaaaagaa gggggctgat 60
gctctcataa ccttgtataa cccaaaggac tagctcatgt tgctaggcaa ctaaaccgca 120
atatcctgta gttcctcttg cgttccgcat ttgtgacgtt ttaagttcct gtttttacag 180
tatataagtg cttgtattct gacaattggg cactcagatt ctgcggtctg agtcccttct 240
ctgctgggct agactagcct ttgtaataaa tataattctc tgctaagtcc ctgtctctag 300
tttgtcttgt tttcaagatc taacagctgg cgcccgaaca gggacctgag ggcgcagacc 360
ctgcctgctg aacctggctg atcataggat ccctaggaca gcagaggaga acttacagaa 420
gtcttctgga ggtgttcctg gccacaacac aggaagacag gtaagatggg agactctttg 480
acatggagca aagcgctcaa gaagttagag aaggtgacgg tacaagggtc tcaaaagcta 540
actagtggta actgtaattg ggcgctgaat ttggtggact tattccatga caccaatttt 600
ggtaaagaaa aagactggca attaagggac gtcattccat tgttagagga cgtttcccag 660
acgttgtcag gacaagagag agaggcattt gaaaaaactt ggtgggcaat agctgccgtt 720
aagatgggct tacaaattaa tactgtgaat gatgcaaaaa caacattttc tatattaaaa 780
gccaagtttg aaagaaagac tgcaaataat accaaaaagc agtctgagcc cgaggaagaa 840
tacccaataa tgattgatgg ggctggaaac agaaactttc ggccattaac acccagagga 900
tatactacct gggtaaatac tatacagcaa aacaatctct taaatgaagc tagtgtgaat 960
ttatttggta ttttatcagt agactgtact tctgaggaaa tgaatgcatt tttggatgta 1020
gtaccaggac aagcaggaca aaaacaagta ctattggata atcttgataa gattgcagaa 1080
gaatgggatc gtaggcaccc gttgccaaat cctccattag tggcaccacc acaagggcct 1140
attcccatga cagcaaggtt cattagggga ttgggagttc ctagagaaag acagatgaaa 1200
cctgcttttg atcagtttag acaaacttat agacaatgga taatagaagc aatgacagaa 1260
gggataaaaa taatgattgg gaaacccaaa gcgcaaaata ttaggcaagg acccaaagaa 1320
ccctatccag agtttataga cagattgctg tctcagataa aaagtgaggg acatccggct 1380
gatataacta aattcctgac agacacttta actattcaga atgctaatga tgaatgcaaa 1440
aatgctatga gacatttgag gccagaagat actttagaag agaaaatgta tgcatgtaga 1500
gatattggca ctatgagaca aaaaatggca ttattagcca aggcacttca agcaggatta 1560
gctggtccta tgaagggagg aatatttaaa gggggaccct taggggcgaa gcagacatgt 1620
tataattgtg gaaaaccagg acatttttct agtcaatgta aagcacctaa aatatgtttt 1680
aagtgcaaac agccaggaca tttctcaaaa caatgtataa atgctccaaa aaacgggaaa 1740
caaggggctc aggggaggcc ccagaaacaa actttccctg tgcagaagga gtcaatgaac 1800
aaaacacaaa aagaggagaa acagcaaggg accttatatc cagatttaag tcagatgaaa 1860
caggaataca agatcaagga agaggaaaat caagaggatc tcaatctgaa cagtttgtgg 1920
gagtaactta taatttagaa aagagaccaa ctacaatagt cttgattaat gacacaccct 1980
taaatgtatt gttggacaca ggagcagaca catcagtact aactattgca cattgtaata 2040
ggttaaagta tggaggaaga aaatatcaag gtacaggtat tgttggggtt ggaggtaatg 2100
tagaaacatt ttccactcct gttacagtga aaaagaaagg aaaacaaatt aaaactagaa 2160
tgttagtagc agatatccca gttactattt tggggcgaga tattcttcaa gaattaggcg 2220
cacaattact aatggctcaa ctttcaaaag aaataacccc aagagaaatt aaattaaaaa 2280
caggcacagt agggcctaag gttccccaat ggccacttac taaagagaag ttgttaggtg 2340
ctaaagaaat agtcaaaaaa ttgttggatg aaggtaaaat atcagaagcc agtgatgata 2400
atccttataa ttctcctata tttgtaataa aaaagaaatc tggaaagtgg agattattgc 2460
aagatttaag agagttaatt aagggtggta caagtagaac tgaaatatcc agaggattac 2520
ctcatccagg gggattaatt aaatgtaatc atatgacagt attagatatt ggagatgcat 2580
atttcactat accattagat ccaaagttta gacaatatac agcatttact gtgccatcca 2640
ttaatcatca ggaaccagat aaaagatatg tgtggaattg cttgccacaa ggttttgtgt 2700
taagtccata catatatcaa aaaacattac aggacatatt acaagctttt agagaaaggc 2760
atccagatgt acaattatat caatatatgg atgatttatt cattgggagt aatgaatcta 2820
aaagacaaca taaggaacta gtagaagaat taagagctat tcttttagaa aagggctttg 2880
agacgcctgg ggataaattg caggaagaag caccctataa ttggctggga tatcaactta 2940
gtccaggcaa ttggaaagta caaaagatgc aattagaatt ggtaaaagag ccaacattaa 3000
atgatgtgca aaaatcaaag ggaaatataa catggatgag ctcaggggtt cctggattaa 3060
cagtgaagca aatagctgct accactaaag gttgcttaga tttaaatcat aaaggtagta 3120
ggaccagaga agcccaaaaa gacttagagg aaattattaa aagtttcaga agctcaggat 3180
tcccatatta taacccagaa gaagaagtaa tctgtgagat tgaaattact aaaaattatg 3240
aggctactta tataataaaa cagtctcaag gaatattgtg ggcaggaaag aaaattatga 3300
gggctaataa aggatggtcc gcagcaaaaa atctaatgtt attgttacaa catgtagcca 3360
cagaaagtat tgttagaatt ggaacatgtc caaaatttaa agtacctttt actaaagaac 3420
aagtcaaatg ggaaatggaa aagggatggt attattcatg gctaccagac atggtatatt 3480
cacatcaagt tgttcatgat gattggagac tgaaattagt agagcaacca acatctggta 3540
taacaattta tactgatggg ggtaaacaga atgaagaagg agttgcagct tatgtgacta 3600
gtaatgggaa aactaaacaa aaaaggttag ggcctgttac tcatcaaact gctgagagga 3660
tagcaataca aatggcatta gaagatactg aagagacatt ggtaaatata gtaactgata 3720
gttactactg ttggaaaaat attacagaag gattagggtt agaaggacca gacagcccct 3780
ggtggccaat aattcaaaat attagggcta aagaaatggt ttattttgct tgggtaccag 3840
gtcacaaagg aatatatggc aatcaattgg cagatgaggc tactaaaata acagaggaaa 3900
ttatgttagc atatcaaggc acacagatta gggaaaaaag agatgaagat gcagggtatg 3960
atttgtgtat tccttatgac ataatgatac ctgtctctga gacaaaagtt atacccacag 4020
atgtaaaaat acaggtacct cacaaatgtt ttggatgggt aactggtaag tcatcaatgg 4080
ctaagcaagg attattaatc aatgggggaa taattgatga aggatacaca ggtgaaatac 4140
aggtaatttg tactaatatt ggaaagagta acatgaaact cagggaagga caaaagtttg 4200
cacaattaat catattacag catcgatcaa atgataaaca aatctgggat gaaaataaaa 4260
catctcaaag gggagataaa gggtttggaa gcacaggtat attttgggta gagaatatcc 4320
aagaggcgca agatgaacat gaaaattggc atacatctcc aaagatattg gcaaaaagat 4380
atgggttacc attgactgta gctaaacaga taactcaaga atgccctcat tgtactaaac 4440
aaggatctgg accagcaggt tgtgtaatga gatctcctaa tcattggcag gctgattgta 4500
cacatttaga aaacagggta ataatgacat ttgtagagtc taattcagga tacattcatg 4560
ctactctatt gtccaaagaa aatgccttgt gtccttcatt ggctattttg gaatgggtga 4620
ggttattttc tcctaaatct ttacatacag acaatggtac taattttgtg gcagagtcag 4680
tagcaaatct gttgaaattc ctgaaggtga cacatactac aggaatacct tatcacccag 4740
agagccaagg cattgtggaa agagcaaaca ggacattaaa agaaagaatt aaaagtcata 4800
gaggaaatac tcagacactt gaagcagcat tacaacttgc tctcattact tgtaacaaag 4860
ggagggaaag tatgggagga caaactccat gggaagtatt tattactaat caggctcaaa 4920
caatacatga agaactttta ttacaacaag cacaatcttc taaaaaattt tgtttttata 4980
aaattcctgg tgagcataat tggaaggggc ccaccagagt gttgtggaaa ggtgatggag 5040
cagtagtggt caatgatgag gaaaaaggaa taattgctgt gcctttaacc aggactaaat 5100
tattaataag accaaattga gcattgtttc aggaatcacc accagtcagc tatcattgtc 5160
aactgtgttt cctgagatca ttgggaattg actaccttga cagctcgctg aagaagaaga 5220
acaaacaaag acagaaggcc atcagggagg aagacaacct cagtatcttg ttataaggtt 5280
tggtgtatgg gattatttgg taaaggggta acatggtcag cattacattc tatgggggta 5340
tcccaggggg aatatcaacc cctatcaccc aacaaacaga atcaacagac acacagaaag 5400
gggatcatat ggtatatcaa ccctattgtt ataatgatag ccataaagaa gaaatggcag 5460
agacaagaga cacaagatac caagaagaaa tgaaccggaa agaagataaa gaagataaaa 5520
gaaagaataa ctggtggaag ataggtatgt tcttattgtg tctgttagag atcactggag 5580
gattcctctg gtggtatgag aggcaacaac attcatatta tataagattg gttacaatag 5640
gaggtagact gaatggttca ggaatgacta gtgccataaa atgttggggt tcatttcctg 5700
ggtgtaggcc atttactaac tatttcagtt atgagactaa taggactgtt agtagagata 5760
ataatactgc tactctgtta gatacttatc aaagagaaat aacaaacata tacaggacat 5820
cttgtgtgga tagtgatcac tgtcaagaat ataaatgtaa gcaagtacag ttgaaaaaga 5880
acagcaataa cattataatg aataattgta gtaacaatag gtgtgaagag ttttgggggt 5940
ttagctggtt agaatgtaat cagacagaaa atgcaataac tatattggtc ccagaaatag 6000
aaatacagca aagaaagaac acttggattc caaaaaggtg tgagaaaact tgggctaagg 6060
taaaacattg tccaatggat ttattatatg gtataaataa aataagaatg tgtgtccaac 6120
ctccattctt tttgtttaaa cagaatgata cttctaataa tactaatatt ctcagtaatt 6180
gtggaccttt agtatttctt ggaatatttg aggacaataa ggcagcaatc cagaatggga 6240
gttgcactct tcacaggaca aatattaaca ggccagatta tagtggattt taccaagtgc 6300
ctatatttta tatatgcacc ttgacaggat ttcaaagttg taataatgga tcaataatta 6360
gtataattat gtatgagtct aataatgttc aatacttgtt atgcaatact agtaatacta 6420
atagtaccaa taatgctaat gtctcttgtg tggtacaaag ttttggagtg ataggacagg 6480
cacatgtggc attgcccaga aaaaataaga ggttacaatc tccaaagttt gctcactata 6540
attgcaccat aaataataaa acagagttaa ggcgatggca attggtaaaa acatcaggca 6600
tcactccttt acccatttcc tctacagcta atactggatt agtcagacac aagagagact 6660
ttggtatatc tgctataata gctgccattg tagctgctag tgctattgct gctagtgcta 6720
ctatgtctta tatcgctttg acagaagtca acaaattaga tagtgtacaa aatcatactt 6780
ttgaagtaga gaacaatact atcaataaca tagagttaac agaagagcaa attcatatat 6840
tatatgctat ggttctccaa acacatgcag atgttcaatt gttaaaagaa caacaaaaga 6900
ttgaggaaac atttaattta attggatgta tagaaagatc acatacattt tgtcatactg 6960
gacatccctg gaatgaatca tggggtcagt taaatgattc tacacagtgg gatgactggg 7020
tagataagat ggaaaattta aatcatgata tattaacaac acttcatact gctagaaata 7080
atctagaaca atctatgata actttcaata cacctgacag tgtagcacaa tttggaaaaa 7140
atatttggag tcatattgca aattggattc ctagattagg agcttccata attaaatata 7200
tagtgttgat attacttata tatgtgttac taacctctgc acctaagatc ctcagaggcc 7260
tcttgacaac gatgagtggt gcaggatcct ccgccagtcg ctacctgaag aaaagatacc 7320
atcacaaaca tgcatcgcga ggagacatct gggcccaggt ccaatatcat gcgtacctgg 7380
cagacgagac tcatggctca ggggacaagt ccaacatgcg gaagctctcc aggaacaact 7440
ggaatggcga atcagaggag tacaacagac gacaaaaaaa ttggaaaaag ttattaaaga 7500
gatctggaga gaattacaat acacacgaag acaacatggg gactatgggt cgtttggtga 7560
ctaccgccgc cgagaagaag aacgtcgggg tgaatcctca ccaagggtcc ttaaccctgg 7620
agattcaaag caaaggagga aacatctatg actgttgcat taaggctcaa gaaggaactc 7680
ttgctattcc ttgctgtggc ttcccactat ggccgttttg gggacttata atcatattag 7740
aacgcttgtt gggatatggg cttcgggaaa ttgcaaaaat tataatgatt ctagggaaag 7800
gactaagtat aataattaca ggattaagaa aattatgtga ttatattggg aaaatgctaa 7860
atccagctac atctcatgta acaatgcctc aatatgatgt ttagaaaaac aaggggggaa 7920
ctgtgggatt aatataagat tcttataagt gaatatgaaa gttgctgatg ctctcaagtt 7980
gctgatgctc tcataacctt atgactagct catgttgcca ggcaactgaa ctgtgataac 8040
cttttgttcc tcattatagt tccgcttttg tatagttccg cttttgtgac gcgttaagtt 8100
cctgttttta cagtatataa gtgcttatat tctgacattt ggtcactcag attctgcggt 8160
ctgagtccct tctctgctgg gctagactag cctttgtaat aaatataatt ctctgctaag 8220
tccctgtctc tagtttgtct tgttttcaag atctaaca 8258
<210> 2
<211> 1461
<212> DNA
<213> EIAV
<400> 2
atgggagact ctttgacatg gagcaaagcg ctcaagaagt tagagaaggt gacggtacaa 60
gggtctcaaa agctaactag tggtaactgt aattgggcgc tgaatttggt ggacttattc 120
catgacacca attttggtaa agaaaaagac tggcaattaa gggacgtcat tccattgtta 180
gaggacgttt cccagacgtt gtcaggacaa gagagagagg catttgaaaa aacttggtgg 240
gcaatagctg ccgttaagat gggcttacaa attaatactg tgaatgatgc aaaaacaaca 300
ttttctatat taaaagccaa gtttgaaaga aagactgcaa ataataccaa aaagcagtct 360
gagcccgagg aagaataccc aataatgatt gatggggctg gaaacagaaa ctttcggcca 420
ttaacaccca gaggatatac tacctgggta aatactatac agcaaaacaa tctcttaaat 480
gaagctagtg tgaatttatt tggtatttta tcagtagact gtacttctga ggaaatgaat 540
gcatttttgg atgtagtacc aggacaagca ggacaaaaac aagtactatt ggataatctt 600
gataagattg cagaagaatg ggatcgtagg cacccgttgc caaatcctcc attagtggca 660
ccaccacaag ggcctattcc catgacagca aggttcatta ggggattggg agttcctaga 720
gaaagacaga tgaaacctgc ttttgatcag tttagacaaa cttatagaca atggataata 780
gaagcaatga cagaagggat aaaaataatg attgggaaac ccaaagcgca aaatattagg 840
caaggaccca aagaacccta tccagagttt atagacagat tgctgtctca gataaaaagt 900
gagggacatc cggctgatat aactaaattc ctgacagaca ctttaactat tcagaatgct 960
aatgatgaat gcaaaaatgc tatgagacat ttgaggccag aagatacttt agaagagaaa 1020
atgtatgcat gtagagatat tggcactatg agacaaaaaa tggcattatt agccaaggca 1080
cttcaagcag gattagctgg tcctatgaag ggaggaatat ttaaaggggg acccttaggg 1140
gcgaagcaga catgttataa ttgtggaaaa ccaggacatt tttctagtca atgtaaagca 1200
cctaaaatat gttttaagtg caaacagcca ggacatttct caaaacaatg tagaaatgct 1260
ccaaaaaacg ggaaacaagg ggctcagggg aggccccaga aacaaacttt ccctgtgcag 1320
aaggagtcaa tgaacaaaac acaaaaagag gagaaacagc aagggacctt atatccagat 1380
ttaagtcaga tgaaacagga atacaagatc aaggaagagg aaaatcaaga ggatctcaat 1440
ctgaacagtt tgtgggagta a 1461
<210> 3
<211> 3432
<212> DNA
<213> EIAV
<400> 3
acagccagga catttctcaa aacaatgtat aaatgctcca aaaaacggga aacaaggggc 60
tcaggggagg ccccagaaac aaactttccc tgtgcagaag gagtcaatga acaaaacaca 120
aaaagaggag aaacagcaag ggaccttata tccagattta agtcagatga aacaggaata 180
caagatcaag gaagaggaaa atcaagagga tctcaatctg aacagtttgt gggagtaact 240
tataatttag aaaagagacc aactacaata gtcttgatta atgacacacc cttaaatgta 300
ttgttggaca caggagcaga cacatcagta ctaactattg cacattgtaa taggttaaag 360
tatggaggaa gaaaatatca aggtacaggt attgttgggg ttggaggtaa tgtagaaaca 420
ttttccactc ctgttacagt gaaaaagaaa ggaaaacaaa ttaaaactag aatgttagta 480
gcagatatcc cagttactat tttggggcga gatattcttc aagaattagg cgcacaatta 540
ctaatggctc aactttcaaa agaaataacc ccaagagaaa ttaaattaaa aacaggcaca 600
gtagggccta aggttcccca atggccactt actaaagaga agttgttagg tgctaaagaa 660
atagtcaaaa aattgttgga tgaaggtaaa atatcagaag ccagtgatga taatccttat 720
aattctccta tatttgtaat aaaaaagaaa tctggaaagt ggagattatt gcaagattta 780
agagagttaa ttaagggtgg tacaagtaga actgaaatat ccagaggatt acctcatcca 840
gggggattaa ttaaatgtaa tcatatgaca gtattagata ttggagatgc atatttcact 900
ataccattag atccaaagtt tagacaatat acagcattta ctgtgccatc cattaatcat 960
caggaaccag ataaaagata tgtgtggaat tgcttgccac aaggttttgt gttaagtcca 1020
tacatatatc aaaaaacatt acaggacata ttacaagctt ttagagaaag gcatccagat 1080
gtacaattat atcaatatat ggatgattta ttcattggga gtaatgaatc taaaagacaa 1140
cataaggaac tagtagaaga attaagagct attcttttag aaaagggctt tgagacgcct 1200
ggggataaat tgcaggaaga agcaccctat aattggctgg gatatcaact tagtccaggc 1260
aattggaaag tacaaaagat gcaattagaa ttggtaaaag agccaacatt aaatgatgtg 1320
caaaaatcaa agggaaatat aacatggatg agctcagggg ttcctggatt aacagtgaag 1380
caaatagctg ctaccactaa aggttgctta gatttaaatc ataaaggtag taggaccaga 1440
gaagcccaaa aagacttaga ggaaattatt aaaagtttca gaagctcagg attcccatat 1500
tataacccag aagaagaagt aatctgtgag attgaaatta ctaaaaatta tgaggctact 1560
tatataataa aacagtctca aggaatattg tgggcaggaa agaaaattat gagggctaat 1620
aaaggatggt ccgcagcaaa aaatctaatg ttattgttac aacatgtagc cacagaaagt 1680
attgttagaa ttggaacatg tccaaaattt aaagtacctt ttactaaaga acaagtcaaa 1740
tgggaaatgg aaaagggatg gtattattca tggctaccag acatggtata ttcacatcaa 1800
gttgttcatg atgattggag actgaaatta gtagagcaac caacatctgg tataacaatt 1860
tatactgatg ggggtaaaca gaatgaagaa ggagttgcag cttatgtgac tagtaatggg 1920
aaaactaaac aaaaaaggtt agggcctgtt actcatcaaa ctgctgagag gatagcaata 1980
caaatggcat tagaagatac tgaagagaca ttggtaaata tagtaactga tagttactac 2040
tgttggaaaa atattacaga aggattaggg ttagaaggac cagacagccc ctggtggcca 2100
ataattcaaa atattagggc taaagaaatg gtttattttg cttgggtacc aggtcacaaa 2160
ggaatatatg gcaatcaatt ggcagatgag gctactaaaa taacagagga aattatgtta 2220
gcatatcaag gcacacagat tagggaaaaa agagatgaag atgcagggta tgatttgtgt 2280
attccttatg acataatgat acctgtctct gagacaaaag ttatacccac agatgtaaaa 2340
atacaggtac ctcacaaatg ttttggatgg gtaactggta agtcatcaat ggctaagcaa 2400
ggattattaa tcaatggggg aataattgat gaaggataca caggtgaaat acaggtaatt 2460
tgtactaata ttggaaagag taacatgaaa ctcagggaag gacaaaagtt tgcacaatta 2520
atcatattac agcatcgatc aaatgataaa caaatctggg atgaaaataa aacatctcaa 2580
aggggagata aagggtttgg aagcacaggt atattttggg tagagaatat ccaagaggcg 2640
caagatgaac atgaaaattg gcatacatct ccaaagatat tggcaaaaag atatgggtta 2700
ccattgactg tagctaaaca gataactcaa gaatgccctc attgtactaa acaaggatct 2760
ggaccagcag gttgtgtaat gagatctcct aatcattggc aggctgattg tacacattta 2820
gaaaacaggg taataatgac atttgtagag tctaattcag gatacattca tgctactcta 2880
ttgtccaaag aaaatgcctt gtgtccttca ttggctattt tggaatgggt gaggttattt 2940
tctcctaaat ctttacatac agacaatggt actaattttg tggcagagtc agtagcaaat 3000
ctgttgaaat tcctgaaggt gacacatact acaggaatac cttatcaccc agagagccaa 3060
ggcattgtgg aaagagcaaa caggacatta aaagaaagaa ttaaaagtca tagaggaaat 3120
actcagacac ttgaagcagc attacaactt gctctcatta cttgtaacaa agggagggaa 3180
agtatgggag gacaaactcc atgggaagta tttattacta atcaggctca aacaatacat 3240
gaagaacttt tattacaaca agcacaatct tctaaaaaat tttgttttta taaaattcct 3300
ggtgagcata attggaaggg gcccaccaga gtgttgtgga aaggtgatgg agcagtagtg 3360
gtcaatgatg aggaaaaagg aataattgct gtgcctttaa ccaggactaa attattaata 3420
agaccaaatt ga 3432
<210> 4
<211> 2592
<212> DNA
<213> EIAV
<400> 4
atggtcagca ttacattcta tgggggtatc ccagggggaa tatcaacccc tatcacccaa 60
caaacagaat caacagacac acagaaaggg gatcatatgg tatatcaacc ctattgttat 120
aatgatagcc ataaagaaga aatggcagag acaagagaca caagatacca agaagaaatg 180
aaccggaaag aagataaaga agataaaaga aagaataact ggtggaagat aggtatgttc 240
ttattgtgtc tgttagagat cactggagga ttcctctggt ggtatgagag gcaacaacat 300
tcatattata taagattggt tacaatagga ggtagactga atggttcagg aatgactagt 360
gccataaaat gttggggttc atttcctggg tgtaggccat ttactaacta tttcagttat 420
gagactaata ggactgttag tagagataat aatactgcta ctctgttaga tacttatcaa 480
agagaaataa caaacatata caggacatct tgtgtggata gtgatcactg tcaagaatat 540
aaatgtaagc aagtacagtt gaaaaagaac agcaataaca ttataatgaa taattgtagt 600
aacaataggt gtgaagagtt ttgggggttt agctggttag aatgtaatca gacagaaaat 660
gcaataacta tattggtccc agaaatagaa atacagcaaa gaaagaacac ttggattcca 720
aaaaggtgtg agaaaacttg ggctaaggta aaacattgtc caatggattt attatatggt 780
ataaataaaa taagaatgtg tgtccaacct ccattctttt tgtttaaaca gaatgatact 840
tctaataata ctaatattct cagtaattgt ggacctttag tatttcttgg aatatttgag 900
gacaataagg cagcaatcca gaatgggagt tgcactcttc acaggacaaa tattaacagg 960
ccagattata gtggatttta ccaagtgcct atattttata tatgcacctt gacaggattt 1020
caaagttgta ataatggatc aataattagt ataattatgt atgagtctaa taatgttcaa 1080
tacttgttat gcaatactag taatactaat agtaccaata atgctaatgt ctcttgtgtg 1140
gtacaaagtt ttggagtgat aggacaggca catgtggcat tgcccagaaa aaataagagg 1200
ttacaatctc caaagtttgc tcactataat tgcaccataa ataataaaac agagttaagg 1260
cgatggcaat tggtaaaaac atcaggcatc actcctttac ccatttcctc tacagctaat 1320
actggattag tcagacacaa gagagacttt ggtatatctg ctataatagc tgccattgta 1380
gctgctagtg ctattgctgc tagtgctact atgtcttata tcgctttgac agaagtcaac 1440
aaattagata gtgtacaaaa tcatactttt gaagtagaga acaatactat caataacata 1500
gagttaacag aagagcaaat tcatatatta tatgctatgg ttctccaaac acatgcagat 1560
gttcaattgt taaaagaaca acaaaagatt gaggaaacat ttaatttaat tggatgtata 1620
gaaagatcac atacattttg tcatactgga catccctgga atgaatcatg gggtcagtta 1680
aatgattcta cacagtggga tgactgggta gataagatgg aaaatttaaa tcatgatata 1740
ttaacaacac ttcatactgc tagaaataat ctagaacaat ctatgataac tttcaataca 1800
cctgacagtg tagcacaatt tggaaaaaat atttggagtc atattgcaaa ttggattcct 1860
agattaggag cttccataat taaatatata gtgttgatat tacttatata tgtgttacta 1920
acctctgcac ctaagatcct cagaggcctc ttgacaacga tgagtggtgc aggatcctcc 1980
gccagtcgct acctgaagaa aagataccat cacaaacatg catcgcgagg agacatctgg 2040
gcccaggtcc aatatcatgc gtacctggca gacgagactc atggctcagg ggacaagtcc 2100
aacatgcgga agctctccag gaacaactgg aatggcgaat cagaggagta caacagacga 2160
caaaaaaatt ggaaaaagtt attaaagaga tctggagaga attacaatac acacgaagac 2220
aacatgggga ctatgggtcg tttggtgact accgccgccg agaagaagaa cgtcggggtg 2280
aatcctcacc aagggtcctt aaccctggag attcaaagca aaggaggaaa catctatgac 2340
tgttgcatta aggctcaaga aggaactctt gctattcctt gctgtggctt cccactatgg 2400
ccgttttggg gacttataat catattagaa cgcttgttgg gatatgggct tcgggaaatt 2460
gcaaaaatta taatgattct agggaaagga ctaagtataa taattacagg attaagaaaa 2520
ttatgtgatt atattgggaa aatgctaaat ccagctacat ctcatgtaac aatgcctcaa 2580
tatgatgttt ag 2592
<210> 5
<211> 495
<212> DNA
<213> EIAV
<400> 5
atggcagaga caagagacac aagataccaa gaagaaatga accggaaaga agataaagaa 60
gataaaagaa agaataactg gtggaagata ggtcctcaga ggcctcttga caacgatgag 120
tggtgcagga tcctccgcca gtcgctacct gaagaaaaga taccatcaca aacatgcatc 180
gcgaggagac atctgggccc aggtccaata tcatgcgtac ctggcagacg agactcatgg 240
ctcaggggac aagtccaaca tgcggaagct ctccaggaac aactggaatg gcgaatcaga 300
ggagtacaac agacgacaaa aaaattggaa aaagttatta aagagatctg gagagaatta 360
caatacacac gaagacaaca tggggactat gggtcgtttg gtgactaccg ccgccgagaa 420
gaagaacgtc ggggtgaatc ctcaccaagg gtccttaacc ctggagattc aaagcaaagg 480
aggaaacatc tatga 495
<210> 6
<211> 237
<212> DNA
<213> EIAV
<400> 6
ctgctgaacc tggctgatca taggatccct aggacagcag aggagaactt acagaagtct 60
tctggaggtg ttcctggcca caacacagga agacaggtac caccagtcag ctatcattgt 120
caactgtgtt tcctgagatc attgggaatt gactaccttg acagctcgct gaagaagaag 180
aacaaacaaa gacagaaggc catcagggag gaagacaacc tcagtatctt gttataa 237
<210> 7
<211> 207
<212> DNA
<213> EIAV
<400> 7
atgggattat ttggtaaagg ggtaacatgg tcagcattac attctatggg ggtatcccag 60
ggggaatatc aacccctatc acccaacaaa cagaatcaac agacacacag aaaggggatc 120
atatggtata tcaaccctat tgttataatg atagccataa agaagaaatg gcagagacaa 180
gagacacaag ataccaagaa gaaatga 207
<210> 8
<211> 486
<212> PRT
<213> EIAV
<400> 8
Met Gly Asp Ser Leu Thr Trp Ser Lys Ala Leu Lys Lys Leu Glu Lys
1 5 10 15
Val Thr Val Gln Gly Ser Gln Lys Leu Thr Ser Gly Asn Cys Asn Trp
20 25 30
Ala Leu Asn Leu Val Asp Leu Phe His Asp Thr Asn Phe Gly Lys Glu
35 40 45
Lys Asp Trp Gln Leu Arg Asp Val Ile Pro Leu Leu Glu Asp Val Ser
50 55 60
Gln Thr Leu Ser Gly Gln Glu Arg Glu Ala Phe Glu Lys Thr Trp Trp
65 70 75 80
Ala Ile Ala Ala Val Lys Met Gly Leu Gln Ile Asn Thr Val Asn Asp
85 90 95
Ala Lys Thr Thr Phe Ser Ile Leu Lys Ala Lys Phe Glu Arg Lys Thr
100 105 110
Ala Asn Asn Thr Lys Lys Gln Ser Glu Pro Glu Glu Glu Tyr Pro Ile
115 120 125
Met Ile Asp Gly Ala Gly Asn Arg Asn Phe Arg Pro Leu Thr Pro Arg
130 135 140
Gly Tyr Thr Thr Trp Val Asn Thr Ile Gln Gln Asn Asn Leu Leu Asn
145 150 155 160
Glu Ala Ser Val Asn Leu Phe Gly Ile Leu Ser Val Asp Cys Thr Ser
165 170 175
Glu Glu Met Asn Ala Phe Leu Asp Val Val Pro Gly Gln Ala Gly Gln
180 185 190
Lys Gln Val Leu Leu Asp Asn Leu Asp Lys Ile Ala Glu Glu Trp Asp
195 200 205
Arg Arg His Pro Leu Pro Asn Pro Pro Leu Val Ala Pro Pro Gln Gly
210 215 220
Pro Ile Pro Met Thr Ala Arg Phe Ile Arg Gly Leu Gly Val Pro Arg
225 230 235 240
Glu Arg Gln Met Lys Pro Ala Phe Asp Gln Phe Arg Gln Thr Tyr Arg
245 250 255
Gln Trp Ile Ile Glu Ala Met Thr Glu Gly Ile Lys Ile Met Ile Gly
260 265 270
Lys Pro Lys Ala Gln Asn Ile Arg Gln Gly Pro Lys Glu Pro Tyr Pro
275 280 285
Glu Phe Ile Asp Arg Leu Leu Ser Gln Ile Lys Ser Glu Gly His Pro
290 295 300
Ala Asp Ile Thr Lys Phe Leu Thr Asp Thr Leu Thr Ile Gln Asn Ala
305 310 315 320
Asn Asp Glu Cys Lys Asn Ala Met Arg His Leu Arg Pro Glu Asp Thr
325 330 335
Leu Glu Glu Lys Met Tyr Ala Cys Arg Asp Ile Gly Thr Met Arg Gln
340 345 350
Lys Met Ala Leu Leu Ala Lys Ala Leu Gln Ala Gly Leu Ala Gly Pro
355 360 365
Met Lys Gly Gly Ile Phe Lys Gly Gly Pro Leu Gly Ala Lys Gln Thr
370 375 380
Cys Tyr Asn Cys Gly Lys Pro Gly His Phe Ser Ser Gln Cys Lys Ala
385 390 395 400
Pro Lys Ile Cys Phe Lys Cys Lys Gln Pro Gly His Phe Ser Lys Gln
405 410 415
Cys Arg Asn Ala Pro Lys Asn Gly Lys Gln Gly Ala Gln Gly Arg Pro
420 425 430
Gln Lys Gln Thr Phe Pro Val Gln Lys Glu Ser Met Asn Lys Thr Gln
435 440 445
Lys Glu Glu Lys Gln Gln Gly Thr Leu Tyr Pro Asp Leu Ser Gln Met
450 455 460
Lys Gln Glu Tyr Lys Ile Lys Glu Glu Glu Asn Gln Glu Asp Leu Asn
465 470 475 480
Leu Asn Ser Leu Trp Glu
485
<210> 9
<211> 1143
<212> PRT
<213> EIAV
<400> 9
Thr Ala Arg Thr Phe Leu Lys Thr Met Tyr Lys Cys Ser Lys Lys Arg
1 5 10 15
Glu Thr Arg Gly Ser Gly Glu Ala Pro Glu Thr Asn Phe Pro Cys Ala
20 25 30
Glu Gly Val Asn Glu Gln Asn Thr Lys Arg Gly Glu Thr Ala Arg Asp
35 40 45
Leu Ile Ser Arg Phe Lys Ser Asp Glu Thr Gly Ile Gln Asp Gln Gly
50 55 60
Arg Gly Lys Ser Arg Gly Ser Gln Ser Glu Gln Phe Val Gly Val Thr
65 70 75 80
Tyr Asn Leu Glu Lys Arg Pro Thr Thr Ile Val Leu Ile Asn Asp Thr
85 90 95
Pro Leu Asn Val Leu Leu Asp Thr Gly Ala Asp Thr Ser Val Leu Thr
100 105 110
Ile Ala His Cys Asn Arg Leu Lys Tyr Gly Gly Arg Lys Tyr Gln Gly
115 120 125
Thr Gly Ile Val Gly Val Gly Gly Asn Val Glu Thr Phe Ser Thr Pro
130 135 140
Val Thr Val Lys Lys Lys Gly Lys Gln Ile Lys Thr Arg Met Leu Val
145 150 155 160
Ala Asp Ile Pro Val Thr Ile Leu Gly Arg Asp Ile Leu Gln Glu Leu
165 170 175
Gly Ala Gln Leu Leu Met Ala Gln Leu Ser Lys Glu Ile Thr Pro Arg
180 185 190
Glu Ile Lys Leu Lys Thr Gly Thr Val Gly Pro Lys Val Pro Gln Trp
195 200 205
Pro Leu Thr Lys Glu Lys Leu Leu Gly Ala Lys Glu Ile Val Lys Lys
210 215 220
Leu Leu Asp Glu Gly Lys Ile Ser Glu Ala Ser Asp Asp Asn Pro Tyr
225 230 235 240
Asn Ser Pro Ile Phe Val Ile Lys Lys Lys Ser Gly Lys Trp Arg Leu
245 250 255
Leu Gln Asp Leu Arg Glu Leu Ile Lys Gly Gly Thr Ser Arg Thr Glu
260 265 270
Ile Ser Arg Gly Leu Pro His Pro Gly Gly Leu Ile Lys Cys Asn His
275 280 285
Met Thr Val Leu Asp Ile Gly Asp Ala Tyr Phe Thr Ile Pro Leu Asp
290 295 300
Pro Lys Phe Arg Gln Tyr Thr Ala Phe Thr Val Pro Ser Ile Asn His
305 310 315 320
Gln Glu Pro Asp Lys Arg Tyr Val Trp Asn Cys Leu Pro Gln Gly Phe
325 330 335
Val Leu Ser Pro Tyr Ile Tyr Gln Lys Thr Leu Gln Asp Ile Leu Gln
340 345 350
Ala Phe Arg Glu Arg His Pro Asp Val Gln Leu Tyr Gln Tyr Met Asp
355 360 365
Asp Leu Phe Ile Gly Ser Asn Glu Ser Lys Arg Gln His Lys Glu Leu
370 375 380
Val Glu Glu Leu Arg Ala Ile Leu Leu Glu Lys Gly Phe Glu Thr Pro
385 390 395 400
Gly Asp Lys Leu Gln Glu Glu Ala Pro Tyr Asn Trp Leu Gly Tyr Gln
405 410 415
Leu Ser Pro Gly Asn Trp Lys Val Gln Lys Met Gln Leu Glu Leu Val
420 425 430
Lys Glu Pro Thr Leu Asn Asp Val Gln Lys Ser Lys Gly Asn Ile Thr
435 440 445
Trp Met Ser Ser Gly Val Pro Gly Leu Thr Val Lys Gln Ile Ala Ala
450 455 460
Thr Thr Lys Gly Cys Leu Asp Leu Asn His Lys Gly Ser Arg Thr Arg
465 470 475 480
Glu Ala Gln Lys Asp Leu Glu Glu Ile Ile Lys Ser Phe Arg Ser Ser
485 490 495
Gly Phe Pro Tyr Tyr Asn Pro Glu Glu Glu Val Ile Cys Glu Ile Glu
500 505 510
Ile Thr Lys Asn Tyr Glu Ala Thr Tyr Ile Ile Lys Gln Ser Gln Gly
515 520 525
Ile Leu Trp Ala Gly Lys Lys Ile Met Arg Ala Asn Lys Gly Trp Ser
530 535 540
Ala Ala Lys Asn Leu Met Leu Leu Leu Gln His Val Ala Thr Glu Ser
545 550 555 560
Ile Val Arg Ile Gly Thr Cys Pro Lys Phe Lys Val Pro Phe Thr Lys
565 570 575
Glu Gln Val Lys Trp Glu Met Glu Lys Gly Trp Tyr Tyr Ser Trp Leu
580 585 590
Pro Asp Met Val Tyr Ser His Gln Val Val His Asp Asp Trp Arg Leu
595 600 605
Lys Leu Val Glu Gln Pro Thr Ser Gly Ile Thr Ile Tyr Thr Asp Gly
610 615 620
Gly Lys Gln Asn Glu Glu Gly Val Ala Ala Tyr Val Thr Ser Asn Gly
625 630 635 640
Lys Thr Lys Gln Lys Arg Leu Gly Pro Val Thr His Gln Thr Ala Glu
645 650 655
Arg Ile Ala Ile Gln Met Ala Leu Glu Asp Thr Glu Glu Thr Leu Val
660 665 670
Asn Ile Val Thr Asp Ser Tyr Tyr Cys Trp Lys Asn Ile Thr Glu Gly
675 680 685
Leu Gly Leu Glu Gly Pro Asp Ser Pro Trp Trp Pro Ile Ile Gln Asn
690 695 700
Ile Arg Ala Lys Glu Met Val Tyr Phe Ala Trp Val Pro Gly His Lys
705 710 715 720
Gly Ile Tyr Gly Asn Gln Leu Ala Asp Glu Ala Thr Lys Ile Thr Glu
725 730 735
Glu Ile Met Leu Ala Tyr Gln Gly Thr Gln Ile Arg Glu Lys Arg Asp
740 745 750
Glu Asp Ala Gly Tyr Asp Leu Cys Ile Pro Tyr Asp Ile Met Ile Pro
755 760 765
Val Ser Glu Thr Lys Val Ile Pro Thr Asp Val Lys Ile Gln Val Pro
770 775 780
His Lys Cys Phe Gly Trp Val Thr Gly Lys Ser Ser Met Ala Lys Gln
785 790 795 800
Gly Leu Leu Ile Asn Gly Gly Ile Ile Asp Glu Gly Tyr Thr Gly Glu
805 810 815
Ile Gln Val Ile Cys Thr Asn Ile Gly Lys Ser Asn Met Lys Leu Arg
820 825 830
Glu Gly Gln Lys Phe Ala Gln Leu Ile Ile Leu Gln His Arg Ser Asn
835 840 845
Asp Lys Gln Ile Trp Asp Glu Asn Lys Thr Ser Gln Arg Gly Asp Lys
850 855 860
Gly Phe Gly Ser Thr Gly Ile Phe Trp Val Glu Asn Ile Gln Glu Ala
865 870 875 880
Gln Asp Glu His Glu Asn Trp His Thr Ser Pro Lys Ile Leu Ala Lys
885 890 895
Arg Tyr Gly Leu Pro Leu Thr Val Ala Lys Gln Ile Thr Gln Glu Cys
900 905 910
Pro His Cys Thr Lys Gln Gly Ser Gly Pro Ala Gly Cys Val Met Arg
915 920 925
Ser Pro Asn His Trp Gln Ala Asp Cys Thr His Leu Glu Asn Arg Val
930 935 940
Ile Met Thr Phe Val Glu Ser Asn Ser Gly Tyr Ile His Ala Thr Leu
945 950 955 960
Leu Ser Lys Glu Asn Ala Leu Cys Pro Ser Leu Ala Ile Leu Glu Trp
965 970 975
Val Arg Leu Phe Ser Pro Lys Ser Leu His Thr Asp Asn Gly Thr Asn
980 985 990
Phe Val Ala Glu Ser Val Ala Asn Leu Leu Lys Phe Leu Lys Val Thr
995 1000 1005
His Thr Thr Gly Ile Pro Tyr His Pro Glu Ser Gln Gly Ile Val
1010 1015 1020
Glu Arg Ala Asn Arg Thr Leu Lys Glu Arg Ile Lys Ser His Arg
1025 1030 1035
Gly Asn Thr Gln Thr Leu Glu Ala Ala Leu Gln Leu Ala Leu Ile
1040 1045 1050
Thr Cys Asn Lys Gly Arg Glu Ser Met Gly Gly Gln Thr Pro Trp
1055 1060 1065
Glu Val Phe Ile Thr Asn Gln Ala Gln Thr Ile His Glu Glu Leu
1070 1075 1080
Leu Leu Gln Gln Ala Gln Ser Ser Lys Lys Phe Cys Phe Tyr Lys
1085 1090 1095
Ile Pro Gly Glu His Asn Trp Lys Gly Pro Thr Arg Val Leu Trp
1100 1105 1110
Lys Gly Asp Gly Ala Val Val Val Asn Asp Glu Glu Lys Gly Ile
1115 1120 1125
Ile Ala Val Pro Leu Thr Arg Thr Lys Leu Leu Ile Arg Pro Asn
1130 1135 1140
<210> 10
<211> 863
<212> PRT
<213> EIAV
<400> 10
Met Val Ser Ile Thr Phe Tyr Gly Gly Ile Pro Gly Gly Ile Ser Thr
1 5 10 15
Pro Ile Thr Gln Gln Thr Glu Ser Thr Asp Thr Gln Lys Gly Asp His
20 25 30
Met Val Tyr Gln Pro Tyr Cys Tyr Asn Asp Ser His Lys Glu Glu Met
35 40 45
Ala Glu Thr Arg Asp Thr Arg Tyr Gln Glu Glu Met Asn Arg Lys Glu
50 55 60
Asp Lys Glu Asp Lys Arg Lys Asn Asn Trp Trp Lys Ile Gly Met Phe
65 70 75 80
Leu Leu Cys Leu Leu Glu Ile Thr Gly Gly Phe Leu Trp Trp Tyr Glu
85 90 95
Arg Gln Gln His Ser Tyr Tyr Ile Arg Leu Val Thr Ile Gly Gly Arg
100 105 110
Leu Asn Gly Ser Gly Met Thr Ser Ala Ile Lys Cys Trp Gly Ser Phe
115 120 125
Pro Gly Cys Arg Pro Phe Thr Asn Tyr Phe Ser Tyr Glu Thr Asn Arg
130 135 140
Thr Val Ser Arg Asp Asn Asn Thr Ala Thr Leu Leu Asp Thr Tyr Gln
145 150 155 160
Arg Glu Ile Thr Asn Ile Tyr Arg Thr Ser Cys Val Asp Ser Asp His
165 170 175
Cys Gln Glu Tyr Lys Cys Lys Gln Val Gln Leu Lys Lys Asn Ser Asn
180 185 190
Asn Ile Ile Met Asn Asn Cys Ser Asn Asn Arg Cys Glu Glu Phe Trp
195 200 205
Gly Phe Ser Trp Leu Glu Cys Asn Gln Thr Glu Asn Ala Ile Thr Ile
210 215 220
Leu Val Pro Glu Ile Glu Ile Gln Gln Arg Lys Asn Thr Trp Ile Pro
225 230 235 240
Lys Arg Cys Glu Lys Thr Trp Ala Lys Val Lys His Cys Pro Met Asp
245 250 255
Leu Leu Tyr Gly Ile Asn Lys Ile Arg Met Cys Val Gln Pro Pro Phe
260 265 270
Phe Leu Phe Lys Gln Asn Asp Thr Ser Asn Asn Thr Asn Ile Leu Ser
275 280 285
Asn Cys Gly Pro Leu Val Phe Leu Gly Ile Phe Glu Asp Asn Lys Ala
290 295 300
Ala Ile Gln Asn Gly Ser Cys Thr Leu His Arg Thr Asn Ile Asn Arg
305 310 315 320
Pro Asp Tyr Ser Gly Phe Tyr Gln Val Pro Ile Phe Tyr Ile Cys Thr
325 330 335
Leu Thr Gly Phe Gln Ser Cys Asn Asn Gly Ser Ile Ile Ser Ile Ile
340 345 350
Met Tyr Glu Ser Asn Asn Val Gln Tyr Leu Leu Cys Asn Thr Ser Asn
355 360 365
Thr Asn Ser Thr Asn Asn Ala Asn Val Ser Cys Val Val Gln Ser Phe
370 375 380
Gly Val Ile Gly Gln Ala His Val Ala Leu Pro Arg Lys Asn Lys Arg
385 390 395 400
Leu Gln Ser Pro Lys Phe Ala His Tyr Asn Cys Thr Ile Asn Asn Lys
405 410 415
Thr Glu Leu Arg Arg Trp Gln Leu Val Lys Thr Ser Gly Ile Thr Pro
420 425 430
Leu Pro Ile Ser Ser Thr Ala Asn Thr Gly Leu Val Arg His Lys Arg
435 440 445
Asp Phe Gly Ile Ser Ala Ile Ile Ala Ala Ile Val Ala Ala Ser Ala
450 455 460
Ile Ala Ala Ser Ala Thr Met Ser Tyr Ile Ala Leu Thr Glu Val Asn
465 470 475 480
Lys Leu Asp Ser Val Gln Asn His Thr Phe Glu Val Glu Asn Asn Thr
485 490 495
Ile Asn Asn Ile Glu Leu Thr Glu Glu Gln Ile His Ile Leu Tyr Ala
500 505 510
Met Val Leu Gln Thr His Ala Asp Val Gln Leu Leu Lys Glu Gln Gln
515 520 525
Lys Ile Glu Glu Thr Phe Asn Leu Ile Gly Cys Ile Glu Arg Ser His
530 535 540
Thr Phe Cys His Thr Gly His Pro Trp Asn Glu Ser Trp Gly Gln Leu
545 550 555 560
Asn Asp Ser Thr Gln Trp Asp Asp Trp Val Asp Lys Met Glu Asn Leu
565 570 575
Asn His Asp Ile Leu Thr Thr Leu His Thr Ala Arg Asn Asn Leu Glu
580 585 590
Gln Ser Met Ile Thr Phe Asn Thr Pro Asp Ser Val Ala Gln Phe Gly
595 600 605
Lys Asn Ile Trp Ser His Ile Ala Asn Trp Ile Pro Arg Leu Gly Ala
610 615 620
Ser Ile Ile Lys Tyr Ile Val Leu Ile Leu Leu Ile Tyr Val Leu Leu
625 630 635 640
Thr Ser Ala Pro Lys Ile Leu Arg Gly Leu Leu Thr Thr Met Ser Gly
645 650 655
Ala Gly Ser Ser Ala Ser Arg Tyr Leu Lys Lys Arg Tyr His His Lys
660 665 670
His Ala Ser Arg Gly Asp Ile Trp Ala Gln Val Gln Tyr His Ala Tyr
675 680 685
Leu Ala Asp Glu Thr His Gly Ser Gly Asp Lys Ser Asn Met Arg Lys
690 695 700
Leu Ser Arg Asn Asn Trp Asn Gly Glu Ser Glu Glu Tyr Asn Arg Arg
705 710 715 720
Gln Lys Asn Trp Lys Lys Leu Leu Lys Arg Ser Gly Glu Asn Tyr Asn
725 730 735
Thr His Glu Asp Asn Met Gly Thr Met Gly Arg Leu Val Thr Thr Ala
740 745 750
Ala Glu Lys Lys Asn Val Gly Val Asn Pro His Gln Gly Ser Leu Thr
755 760 765
Leu Glu Ile Gln Ser Lys Gly Gly Asn Ile Tyr Asp Cys Cys Ile Lys
770 775 780
Ala Gln Glu Gly Thr Leu Ala Ile Pro Cys Cys Gly Phe Pro Leu Trp
785 790 795 800
Pro Phe Trp Gly Leu Ile Ile Ile Leu Glu Arg Leu Leu Gly Tyr Gly
805 810 815
Leu Arg Glu Ile Ala Lys Ile Ile Met Ile Leu Gly Lys Gly Leu Ser
820 825 830
Ile Ile Ile Thr Gly Leu Arg Lys Leu Cys Asp Tyr Ile Gly Lys Met
835 840 845
Leu Asn Pro Ala Thr Ser His Val Thr Met Pro Gln Tyr Asp Val
850 855 860
<210> 11
<211> 164
<212> PRT
<213> EIAV
<400> 11
Met Ala Glu Thr Arg Asp Thr Arg Tyr Gln Glu Glu Met Asn Arg Lys
1 5 10 15
Glu Asp Lys Glu Asp Lys Arg Lys Asn Asn Trp Trp Lys Ile Gly Pro
20 25 30
Gln Arg Pro Leu Asp Asn Asp Glu Trp Cys Arg Ile Leu Arg Gln Ser
35 40 45
Leu Pro Glu Glu Lys Ile Pro Ser Gln Thr Cys Ile Ala Arg Arg His
50 55 60
Leu Gly Pro Gly Pro Ile Ser Cys Val Pro Gly Arg Arg Asp Ser Trp
65 70 75 80
Leu Arg Gly Gln Val Gln His Ala Glu Ala Leu Gln Glu Gln Leu Glu
85 90 95
Trp Arg Ile Arg Gly Val Gln Gln Thr Thr Lys Lys Leu Glu Lys Val
100 105 110
Ile Lys Glu Ile Trp Arg Glu Leu Gln Tyr Thr Arg Arg Gln His Gly
115 120 125
Asp Tyr Gly Ser Phe Gly Asp Tyr Arg Arg Arg Glu Glu Glu Arg Arg
130 135 140
Gly Glu Ser Ser Pro Arg Val Leu Asn Pro Gly Asp Ser Lys Gln Arg
145 150 155 160
Arg Lys His Leu
<210> 12
<211> 78
<212> PRT
<213> EIAV
<400> 12
Leu Leu Asn Leu Ala Asp His Arg Ile Pro Arg Thr Ala Glu Glu Asn
1 5 10 15
Leu Gln Lys Ser Ser Gly Gly Val Pro Gly His Asn Thr Gly Arg Gln
20 25 30
Val Pro Pro Val Ser Tyr His Cys Gln Leu Cys Phe Leu Arg Ser Leu
35 40 45
Gly Ile Asp Tyr Leu Asp Ser Ser Leu Lys Lys Lys Asn Lys Gln Arg
50 55 60
Gln Lys Ala Ile Arg Glu Glu Asp Asn Leu Ser Ile Leu Leu
65 70 75
<210> 13
<211> 68
<212> PRT
<213> EIAV
<400> 13
Met Gly Leu Phe Gly Lys Gly Val Thr Trp Ser Ala Leu His Ser Met
1 5 10 15
Gly Val Ser Gln Gly Glu Tyr Gln Pro Leu Ser Pro Asn Lys Gln Asn
20 25 30
Gln Gln Thr His Arg Lys Gly Ile Ile Trp Tyr Ile Asn Pro Ile Val
35 40 45
Ile Met Ile Ala Ile Lys Lys Lys Trp Gln Arg Gln Glu Thr Gln Asp
50 55 60
Thr Lys Lys Lys
65
Claims (17)
1、马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组序列,包含SEQ ID NO:1的序列。
2、一种DNA序列,其包含SEQ ID NO:1的1-325位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组的5’LTR。
3、一种DNA序列,其包含SEQ ID NO:1的7922-8258位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组的3’LTR。
4、一种DNA序列,其包含SEQ ID NO:1的466-1926位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的gag基因。
5、一种DNA序列,其包含SEQ ID NO:1的1689-5120位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的pol基因。
6、一种DNA序列,其包含SEQ ID NO:1的5313-7904位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的env基因。
7、一种DNA序列,其包含SEQ ID NO:1的5454-5546及7250-7651位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的rev基因。
8、一种DNA序列,其包含SEQ ID NO:1的365-462及5138-5276位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的tat基因。
9、一种DNA序列,其包含SEQ ID NO:1的5287-5493位核苷酸序列,其为马传染性贫血病毒驴白细胞弱毒疫苗株的全长前病毒基因组中的S2基因。
10、由权利要求4的DNA序列编码的氨基酸序列,其包含SEQID NO:8的序列。
11、由权利要求5的DNA序列编码的氨基酸序列,其包含SEQID NO:9的序列。
12、由权利要求6的DNA序列编码的氨基酸序列,其包含SEQID NO:10的序列。
13、由权利要求7的DNA序列编码的氨基酸序列,其包含SEQID NO:11的序列。
14、由权利要求8的DNA序列编码的氨基酸序列,其包含SEQID NO:12的序列。
15、由权利要求9的DNA序列编码的氨基酸序列,其包含SEQID NO:13的序列。
16、权利要求1-15任一项的序列在制备多肽疫苗、基因工程亚单位疫苗、基因缺失疫苗、DNA疫苗、活载体疫苗和诊断试剂中的应用。
17、权利要求1-9任一项的序列在制备基因治疗用基因转移载体或感染性分子克隆中的应用。
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB991058526A CN1173036C (zh) | 1999-04-21 | 1999-04-21 | 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 |
ES00918668T ES2288848T3 (es) | 1999-04-21 | 2000-04-21 | Secuencia genetica completa de la cepa de vacuna de leucocitos de asno frente al virus de la anemia infecciosa equina y su aplicacion. |
PCT/CN2000/000096 WO2000063387A1 (fr) | 1999-04-21 | 2000-04-21 | Sequence genetique complete de la souche d'un vaccin contre le virus de l'anemie infectieuse equine et son application |
DE60035680T DE60035680T2 (de) | 1999-04-21 | 2000-04-21 | Komplette gensequenz des esel-leukozyten-impfstoffstammes des equinen infektiösen anämievirus und ihre anwendung |
AT00918668T ATE368117T1 (de) | 1999-04-21 | 2000-04-21 | Komplette gensequenz des esel-leukozyten- impfstoffstammes des equinen infektiösen anemievirus und ihre anwendung |
AU39554/00A AU3955400A (en) | 1999-04-21 | 2000-04-21 | The full gene sequence of the donkey leukocyte vaccine strain of the equine infectious anemia virus and their application |
EP00918668A EP1174507B1 (en) | 1999-04-21 | 2000-04-21 | The full gene sequence of the donkey leukocyte vaccine strain of the equine infectious anemia virus and their application |
US09/959,120 US6987020B1 (en) | 1999-04-21 | 2000-04-21 | Full-gene sequence of the donkey leukocyte vaccine strain of the equine infectious anemia virus and their application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB991058526A CN1173036C (zh) | 1999-04-21 | 1999-04-21 | 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1271015A CN1271015A (zh) | 2000-10-25 |
CN1173036C true CN1173036C (zh) | 2004-10-27 |
Family
ID=5272155
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB991058526A Expired - Lifetime CN1173036C (zh) | 1999-04-21 | 1999-04-21 | 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 |
Country Status (8)
Country | Link |
---|---|
US (1) | US6987020B1 (zh) |
EP (1) | EP1174507B1 (zh) |
CN (1) | CN1173036C (zh) |
AT (1) | ATE368117T1 (zh) |
AU (1) | AU3955400A (zh) |
DE (1) | DE60035680T2 (zh) |
ES (1) | ES2288848T3 (zh) |
WO (1) | WO2000063387A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU8710301A (en) * | 2000-09-09 | 2002-03-22 | Akzo Nobel Nv | Eiav chimeric vaccine and diagnostic |
US6461616B1 (en) * | 2000-09-09 | 2002-10-08 | Akzo Nobel Nv | EIAV p26 deletion vaccine and diagnostic |
CN101705246B (zh) * | 2007-04-29 | 2012-06-27 | 中国农业科学院哈尔滨兽医研究所 | 慢病毒基因转移载体、其制备方法和应用 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0491930B1 (en) | 1990-07-12 | 1997-01-15 | The President And Fellows Of Harvard College | Primate lentivirus vaccines |
CN1059701C (zh) * | 1996-04-03 | 2000-12-20 | 中国农业科学院哈尔滨兽医研究所 | 马传染性贫血病驴白细胞弱毒株及其培育方法 |
EP0914422B1 (en) * | 1996-04-15 | 2009-11-18 | Vanderbilt University | Mammalian genes involved in viral infection |
CA2288328A1 (en) | 1997-05-13 | 1998-11-19 | University Of North Carolina At Chapel Hill | Lentivirus-based gene transfer vectors |
DE69738969D1 (de) * | 1997-12-30 | 2008-10-16 | Univ Minas Gerais | Verfahren zur expression und herstellung des rekombinanten proteins gp90 der glykoproteinhülle des infektiösen pferdeanämievirus eiav |
-
1999
- 1999-04-21 CN CNB991058526A patent/CN1173036C/zh not_active Expired - Lifetime
-
2000
- 2000-04-21 AU AU39554/00A patent/AU3955400A/en not_active Abandoned
- 2000-04-21 ES ES00918668T patent/ES2288848T3/es not_active Expired - Lifetime
- 2000-04-21 WO PCT/CN2000/000096 patent/WO2000063387A1/zh active IP Right Grant
- 2000-04-21 US US09/959,120 patent/US6987020B1/en not_active Expired - Lifetime
- 2000-04-21 DE DE60035680T patent/DE60035680T2/de not_active Expired - Lifetime
- 2000-04-21 AT AT00918668T patent/ATE368117T1/de active
- 2000-04-21 EP EP00918668A patent/EP1174507B1/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
ATE368117T1 (de) | 2007-08-15 |
EP1174507B1 (en) | 2007-07-25 |
DE60035680D1 (de) | 2007-09-06 |
AU3955400A (en) | 2000-11-02 |
US6987020B1 (en) | 2006-01-17 |
WO2000063387A1 (fr) | 2000-10-26 |
ES2288848T3 (es) | 2008-02-01 |
CN1271015A (zh) | 2000-10-25 |
EP1174507A1 (en) | 2002-01-23 |
DE60035680T2 (de) | 2008-04-30 |
EP1174507A4 (en) | 2004-03-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1282656C (zh) | 来自gagp17和p24保守区域的HIV肽以及它们在例如疫苗中的应用 | |
CN1650012A (zh) | 来自致病病毒的备选阅读框所编码的抗原 | |
CN1636071A (zh) | 细小病毒b19的诊断测试 | |
CN1122108C (zh) | 口蹄疫病毒的免疫原性肽 | |
CN1458936A (zh) | 来自Tat、Rev和Nef保守区的HIV肽以及它们作为例如疫苗组分的应用 | |
CN1255540C (zh) | 一株疫苗诱导的乙型肝炎病毒株及其应用 | |
CN1313491C (zh) | 猫ω干扰素及其编码基因与应用 | |
CN1173036C (zh) | 马传染性贫血病毒驴白细胞弱毒疫苗株的全长基因序列 | |
CN1812810A (zh) | 针对fiv感染的免疫接种材料和方法 | |
CN100339396C (zh) | 抑制i型人免疫缺陷病毒感染的三螺旋蛋白及其编码基因与应用 | |
CN1516738A (zh) | gp41片段的乙酰化 | |
CN1216902C (zh) | 非典型性肺炎病毒特异蛋白质和临床检测的方法及试剂盒 | |
CN101057975A (zh) | 一种抗免疫耐受性和免疫缺陷性病毒的鸡尾酒疫苗及其应用 | |
CN1273593C (zh) | 丙型肝炎病毒多表位抗原的基因克隆及其编码序列 | |
CN1570115A (zh) | 优化的sars冠状病毒刺突蛋白基因 | |
CN1763203A (zh) | 一种基因重组人巨细胞病毒融合蛋白pp150/MDBP及其制备方法与应用 | |
CN1495262A (zh) | gp350/220的非剪接变异体 | |
CN1244698C (zh) | 一种重组sars病毒基因在多形汉逊酵母中的表达及用途 | |
CN1163604C (zh) | 减毒水痘病毒冈株的基因62和利用基因62的减毒水痘活疫苗用病毒株的鉴定方法 | |
CN1268392C (zh) | 预防和治疗人丙型肝炎病毒感染的重组蛋白疫苗及其用途 | |
CN1882690A (zh) | 从新hcv株衍生的核酸和基因以及使用所述基因的复制子-复制细胞 | |
CN1606622A (zh) | 通过全面检测和定量食物中毒细菌的热稳定直接溶血素相关的基因 ( t d h ) 相关溶血素基因)对产溶血素细菌进行检测和定量的方法 | |
CN1604911A (zh) | Hiv转录阻遏物及其方法 | |
CN1206356C (zh) | 甲肝病毒中国株和减毒株的培育及互补脱氧核糖核酸序列 | |
CN1304580C (zh) | 重组表达人获得性免疫缺陷病毒I型的表面糖蛋白gp160 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CX01 | Expiry of patent term | ||
CX01 | Expiry of patent term |
Granted publication date: 20041027 |