CN1954071A - Cell cycle genes and related methods of use

Info

Publication number
CN1954071A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA200480042006XA
Inventor
理查德·L·福斯特
玛丽·B·康内特
萨拉·简·埃默森
默里·罗伯特·格里戈尔
科琳·M·希金斯
史蒂文·特洛伊·伦德
安德烈亚斯·马古辛
罗伯特·J·科德奇基
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ArborGen LLC
Original Assignee
ArborGen LLC
Classifications

    • C: CHEMISTRY; METALLURGY
    • C07: ORGANIC CHEMISTRY
    • C07K: PEPTIDES
    • C07K14/00: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • C: CHEMISTRY; METALLURGY
    • C07: ORGANIC CHEMISTRY
    • C07K: PEPTIDES
    • C07K14/00: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4738: Cell cycle regulated proteins, e.g. cyclin, CDC, INK-CCR
    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00: Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09: Recombinant DNA-technology
    • C12N15/63: Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79: Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82: Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241: Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261: Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02A: TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146: Genetically Modified [GMO] plants, e.g. transgenic plants

Abstract

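The present invention provides novel plant cell cycle genes and the polypeptides encoded by such genes. These gene and polypeptide sequences are useful for regulating the cell cycle and plant phenotype. In addition, these genes are useful for expression profiling of plant cell cycle genes. In particular, the invention provides cell cycle polynucleotide and polypeptide sequences isolated from Eucalyptus and Pinus.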

Description

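Cell cycle genes and related methods of use
Cross-reference to related applications
This application claims priority to U.S. Provisional Application No. 60/533,036, filed December 30, 2003, which is expressly incorporated herein by reference in its entirety.
Technical field
The present invention relates generally to the field of plant cell cycle genes and the polypeptides encoded by such genes, and to the use of such polynucleotide and polypeptide sequences for regulating the plant cell cycle. In particular, the invention provides cell cycle polynucleotide and polypeptide sequences isolated from Eucalyptus and Pinus, and sequences related thereto.
Background
Cell growth and division are controlled by the temporal expression of different sets of genes, which allows a dividing cell to progress through the different phases of the cell cycle. Continuous growth and organogenesis in plants require precise functioning of the cell cycle machinery. Plant development, which is directly affected by the rate and pattern of cell division, is also influenced by environmental factors such as temperature, nutrient availability, and light. See Gastal and Nelson, Plant Physiol. 105:191-7 (1994); Ben-Haj-Salah and Tardieu, Plant Physiol. 109:861-7 (1995); and Sacks et al., Plant Physiol. 114:519-27 (1997). Plant development and phenotype are linked to the cell cycle, and altering the expression of genes involved in the cell cycle can be a useful way to modify plant development and alter plant phenotype.
Because the cell cycle drives plant development (including growth rate), responses to environmental factors, and the resulting plant phenotype, the ability to alter cell cycle gene expression is extremely powerful. In particular, control of the plant cell cycle and phenotype through altered cell cycle gene expression in the vascular cambium is especially useful for altering wood properties, particularly lumber and wood pulp properties. For example, improvements to wood pulp that can be achieved by altering cell cycle gene expression include increases or decreases in lignin and cellulose content, and changes in cell length, diameter, and lumen diameter. By manipulating the plant cell cycle, especially the cambial cell cycle (i.e., the rate and angle of cell division), it is also possible to engineer better lumber having increased dimensional stability, increased tensile strength, increased shear strength, increased compression strength, increased shock resistance, increased stiffness, increased or decreased hardness, decreased spirality, decreased shrinkage, and desirable characteristics with respect to weight, density, and specific gravity.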
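A. Cell cycle genes and proteins
1. Cyclin-dependent protein kinases
Cell cycle progression is regulated primarily by cyclin-dependent kinases (CDKs). CDKs are a conserved family of eukaryotic serine/threonine protein kinases that require heterodimerization with a cyclin subunit for activity. For reviews see, e.g., Joubes et al., Plant Mol. Biol. 43:607-20 (2000); Stals and Inze, Trends Plant Sci. 6:359-64 (2001); and John et al., Protoplasma 216:119-42 (2001).
There are five subclasses of CDKs, each with a distinct cyclin-binding consensus sequence. In A-type CDKs, the cyclin-binding consensus sequence is PSTAIRE. Id. In B-1, B-2, and C-type CDKs, the cyclin-binding consensus sequences are PPTTLRE, PPTALRE, and PITAIRE, respectively. Joubes et al., Plant Physiol. 126:1403-15 (2001).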
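As an illustration only, the four cyclin-binding consensus sequences named above can be used as a simple substring lookup. The sketch below is not part of the disclosed methods; the function name `classify_cdk` and the toy fragment are hypothetical, and a real classification would rely on full sequence alignment rather than an exact motif match.

```python
# Cyclin-binding consensus motifs named in the text, mapped to CDK subclass.
CYCLIN_BINDING_MOTIFS = {
    "PSTAIRE": "A-type",
    "PPTTLRE": "B-1 type",
    "PPTALRE": "B-2 type",
    "PITAIRE": "C-type",
}


def classify_cdk(protein_sequence):
    """Return the CDK subclass whose motif occurs in the sequence, or None."""
    sequence = protein_sequence.upper()
    for motif, subclass in CYCLIN_BINDING_MOTIFS.items():
        if motif in sequence:
            return subclass
    return None


# Hypothetical toy fragment (not one of SEQ ID NOs: 261-497).
toy_fragment = "GEGTYGVVYKARDEGVPSTAIREISLLKEM"
print(classify_cdk(toy_fragment))
```

An exact match is deliberately strict; a sequence with a single substitution in the motif would return `None` and would need alignment-based inspection.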
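Cell cycle progression is directed in part by changes in CDK activity. CDK activity is modulated by many different cell cycle protein components, for example by changes in the abundance of individual cyclins resulting from altered rates of biosynthesis and proteolysis. Fluctuations in cyclin concentration lead to corresponding fluctuations in CDK activity. Cyclin accumulation is especially important for terminating the G1 phase of the cell cycle, because DNA replication is triggered by an increase in CDK activity.
Activation of CDKs also requires phosphorylation of a threonine residue within the T-loop of the CDK by a CDK-activating kinase (CAK). Umeda et al., Proc. Nat'l Acad. Sci. U.S.A. 97:13396-400 (2000). Yamaguchi et al., Plant J. 24:11-20 (2000), proposed that cyclin H is a regulatory subunit of CAK. CDK activity is further regulated by interaction with the CDK regulatory subunit, a small (70-100 amino acid) protein involved in cell cycle regulation.
Cells must exit the cell cycle in order to undergo differentiation, senescence, or apoptosis. This process involves down-regulation of CDK activity. CDK inhibitors (CKIs) are low molecular weight proteins that are important for cell cycle regulation and development. CKIs bind stoichiometrically to CDKs and down-regulate CDK activity.
Many biochemical properties are known for ICK1, the first plant CKI to be identified, which was found in Arabidopsis thaliana. Wang et al., Nature 386:451-2 (1997); Wang et al., Plant J. 24:613-23 (2000). ICK1 is expressed at low levels in many tissue types, and there may be a threshold level of ICK1 that a cell must overcome before it can enter the cell cycle. Wang et al., Plant J. 24:613-23 (2000). ICK1 is induced by the plant growth regulator abscisic acid (ABA), which inhibits cell division by blocking DNA replication. When ICK1 expression increases, there is a corresponding decrease in Cdc2-like histone H1 kinase activity. ICK1 has been shown to bind in vitro to Cdc2a and CycD3, and deletion experiments have identified distinct domains for these two interactions.
Altering the expression of CDK regulatory proteins or their subunits is known to cause changes in plant phenotype. Overexpression of the Arabidopsis CDK regulatory subunit CKS1At leads to reductions in leaf size, root growth rate, and meristem size. In addition, overexpression of CKS1At inhibits cell cycle progression and extends the duration of both the G1 and G2 phases of the cell cycle.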
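2. Cyclins
Cyclins are positive regulatory subunits of cyclin-dependent kinases (CDKs) and are required for CDK activity. Fowler et al., Mol. Biotech. 10, 123, 126. Cyclin-CDK complexes provide temporal regulation of cell cycle transitions. There is also evidence that cyclins provide spatial regulation of specific CDK activities, targeting them differentially to the cytoskeleton, spindle, phragmoplast, nuclear envelope, and chromosomes.
Plant cyclins are divided into five major groups: A, B, C, D, and H. Renaudin et al., Plant Mol. Biol. 32:1003-18 (1996), and Yamaguchi et al. (supra, 2000). Cyclins can be classified as mitotic cyclins (A and B) and G1 cyclins.
Mitotic cyclins have a consensus sequence (R-x-x-L-x-x-I-x-N) located in the N-terminal region, termed the destruction box, adjacent to a lysine-rich region. The destruction box and lysine-rich region target mitotic cyclins for ubiquitin-dependent proteolysis during mitosis. Stals, supra, at 361, and Fowler, supra, at 126. The destruction boxes of A and B cyclins differ slightly, and this difference is thought to account for the different degradation timing of A and B cyclins. Fowler, supra, at 126. A-type cyclins accumulate during S phase, G2 phase, and early M phase of the cell cycle, whereas B-type cyclins accumulate during late G2 and early M phase. Mironov et al., Plant Cell 11:509-22 (1999). Three subgroups of A-type cyclins are known in plants, but only one in animals. Cyclin A1 (cycA1;zm;1 from Zea mays) is most concentrated at the microtubule-containing phragmoplast during cytokinesis. Expression of cyclin A2 is up-regulated by auxin in roots and by cytokinin in the shoot apex. Abrahams et al., Biochim. Biophys. Acta 28:1-2 (2001).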
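For illustration, the destruction box consensus R-x-x-L-x-x-I-x-N can be written as a regular expression in which each x matches any residue. The sketch below is a minimal example under stated assumptions: the 100-residue N-terminal window, the function name, and the toy sequence are hypothetical and are not taken from the specification.

```python
import re

# R-x-x-L-x-x-I-x-N, where x stands for any residue.
DESTRUCTION_BOX = re.compile(r"R..L..I.N")


def find_destruction_boxes(protein_sequence, n_terminal_window=100):
    """Return (0-based start, matched text) pairs within the N-terminal window.

    The window size is an assumed parameter; the text only states that the
    destruction box lies in the N-terminal region.
    """
    region = protein_sequence.upper()[:n_terminal_window]
    return [(m.start(), m.group()) for m in DESTRUCTION_BOX.finditer(region)]


# Hypothetical toy sequence containing one motif instance.
print(find_destruction_boxes("MARAALGDITNQKKK"))
```

A pattern match only flags a candidate; whether a given protein is actually targeted for proteolysis is an experimental question.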
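D-type cyclins, of which five subgroups are known, are thought to control progression through G1 in response to growth factors and nutrients. Riou-Khamlichi et al., Mol. Cell. Biol. 20:4513-21 (2000). For example, expression of D-type cyclins is up-regulated by sucrose, as shown by an increase in cycD2 mRNA 30 minutes after sucrose exposure and an increase in cycD3 4 hours after sucrose exposure. These times correspond to early G1 and late G1, respectively. Cockcroft et al., Nature 405:575-9 (2000). In addition, in Arabidopsis, cyclin D3 has been shown to be up-regulated by the brassinosteroid epi-brassinolide.
Cyclin D2 binds CDKA to produce an active complex, which binds to and phosphorylates the retinoblastoma-related protein (Rb). This process is found in actively proliferating tissues, suggesting that it plays an important role in late G1 and early S phase. Three different D3-type cyclins are active during tomato fruit development. These proteins all contain a retinoblastoma-binding motif and a PEST destruction motif. The spatial and temporal expression of these D3 cyclins differs, suggesting that they play different roles during fruit development.
Overexpression of cyclin D increases overall growth rate. Overexpression of cyclin D2 in tobacco shortens G1 phase and accelerates cell cycling.
C-type and H-type cyclins have been identified in poplar (Populus tremula x tremuloides) and rice (Oryza sativa), but their exact functions are not yet known. Putative cyclins with a lower degree of peptide sequence conservation have also been identified. For example, Arabidopsis CycJ18 shares only 20% identity with its homologs in the cyclin box domain. CycJ18 is expressed predominantly in young seedlings. The Arabidopsis F3O9.13 protein also has similarity to the cyclin family.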
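3. Histone acetyltransferases/deacetylases
Histone acetyltransferases (HA) and histone deacetylases (HAD) control the net level of histone acetylation. Histone acetylation and deacetylation are thought to exert their regulatory effects on gene expression by altering the accessibility of nucleosomal DNA to DNA-binding transcriptional activators, other chromatin-modifying enzymes, or multi-subunit chromatin remodeling complexes capable of displacing nucleosomes. Lusser et al., Nucleic Acids Res. 27:4427-35 (1999). In general, therefore, HAD is involved in the repression of gene expression, whereas HA is associated with gene activation.
HA acetylates the ε-amino groups of conserved lysine residues clustered near the amino terminus of core histones, thereby up-regulating gene expression.
HAD removes acetyl groups from the core histones of the nucleosome. The HAD group has many family members, many of which are evolutionarily conserved. Lechner et al., Biochim. Biophys. Acta 5:181-8 (1996). HAD acts as part of multiprotein complexes that promote chromatin condensation.
HAD and HA recognize highly distinct acetylation patterns on the nucleosome. Different types of HAD are thought to interact with specific regions of the genome to influence gene silencing.
Schultz et al., Genes Dev. 15:428-43 (2001), demonstrated that the superfamily of Kruppel-associated box zinc finger proteins (KRAB-ZFPs) is linked to the nucleosome remodeling and histone deacetylation complex via the PHD (plant homeodomain) and bromodomain of the corepressor KAP-1, forming a cooperative unit required for transcriptional repression. A maize HDAC (HD2) has been identified that shares no sequence homology with other eukaryotic HDACs but instead has sequence similarity to peptidyl-prolyl cis-trans isomerases (PPIases).
The effects of perturbing histone deacetylation are discussed in, for example, Tian and Chen, Proc. Nat'l Acad. Sci. USA 98:200-5 (2001).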
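4. Peptidyl-prolyl cis-trans isomerases
Peptidyl-prolyl isomerases (also referred to as peptidylprolyl cis-trans isomerases, peptidyl-prolyl cis-trans isomerases, PPIases, rotamases, and cyclophilins) catalyze the interconversion of peptide bonds between the cis and trans conformations at proline residues. Sheldon and Venis, Biochem. J. 315:965-70 (1996). This interconversion is thought to be the rate-limiting step in protein folding. PPIases belong to a conserved family of proteins found in animals, fungi, bacteria, and plants. PPIases are involved in many processes, including responses to environmental stress, calcium signaling, transcriptional repression, and cell cycle control. Viaud et al., Plant Cell 14:917-30 (2002).
5. Retinoblastoma-related proteins
Retinoblastoma-related proteins are presumed to regulate progression of the cell cycle through G1 and into S phase. Xie et al., EMBO J. 15:4900-8 (1996), and Ach et al., Mol. Cell. Biol. 17:5077-86 (1997).
Although Rb has been well characterized in mammalian systems, the role of Rb-related proteins in regulating G1 progression and entry into S phase has not been well characterized in plants. It is known, however, that Rb-related proteins act through their association with various other cellular proteins involved in cell cycle regulation (e.g., cyclins and WD40 proteins). Soni et al., Plant Cell 7:85-103 (1995); Grafi et al., Proc. Nat'l Acad. Sci. U.S.A. 93:8962 (1996); Ach et al., Plant Cell 9:1595-606 (1997); Umen and Goodenough, Genes Dev. 15:1652-61 (2001); Mariconti et al., J. Biol. Chem. 277:9911-9 (2002).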
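6. WD40 repeat proteins
WD40 is a common repeat motif involved in many different protein-protein interactions. WD40 domains are found in proteins with a variety of functions, including adaptor/regulatory modules in signal transduction, pre-mRNA processing, and cytoskeleton assembly. Goh et al., Eur. J. Biochem. 267:434-49 (2000).
The 40-residue WD40 domain typically contains a GH dipeptide 11-24 residues from the N-terminus and a WD dipeptide at the C-terminus. Id. Between the GH and WD dipeptides lies a conserved core, which serves as a stable platform to which proteins can bind stably or reversibly. The core forms a propeller-like structure with several blades. Each blade consists of a four-stranded antiparallel β-sheet. Each WD40 sequence repeat forms the first three strands of one blade, with the last strand in the next blade. The last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create a closed-ring propeller structure. Residues on the top and bottom surfaces of the propeller have been proposed to coordinate interactions with other proteins and/or small ligands.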
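The positional description above (a GH dipeptide 11-24 residues from the N-terminus and a WD dipeptide at the C-terminus) can be turned into a rough screening heuristic. The sketch below is illustrative only: it treats "11-24 residues from the N-terminus" as a 0-based offset of 11 to 24, which is an assumption, and real WD40 repeats are degenerate enough that profile-based tools are normally used instead.

```python
def looks_like_wd40_repeat(segment):
    """Heuristic check of the GH...WD pattern for one candidate repeat.

    Assumes `segment` is a single candidate repeat of roughly 40 residues.
    """
    seg = segment.upper()
    # WD dipeptide at the C-terminus.
    if not seg.endswith("WD"):
        return False
    # GH dipeptide beginning 11-24 residues after the first residue
    # (0-based offsets 11..24; this reading of the range is an assumption).
    return any(seg.startswith("GH", offset) for offset in range(11, 25))


# Hypothetical 40-residue toy segment built to satisfy the pattern.
toy_repeat = "A" * 15 + "GH" + "S" * 21 + "WD"
print(looks_like_wd40_repeat(toy_repeat))
```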
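Studies in yeast have demonstrated that Cdc20, which contains WD40 motifs, is required for the proteolysis of mitotic cyclins. This process is mediated by a ubiquitin-protein ligase called the anaphase-promoting complex (APC) or cyclosome. Following ubiquitination and proteolysis by the 26S proteasome, the cell can segregate its chromatin and exit mitosis. Cdc20 also contains a destruction box domain.
7. WEE1-like proteins
WEE1 controls the activity of cyclin-dependent kinases. WEE1 is itself a serine/threonine kinase. Sorrell et al., Planta 215:518-22 (2002). The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes in conjunction with reversible conformational changes in a C-terminal autoregulatory tail. This process is conserved in eukaryotic cells from fungi to animals and plants. Similarly, there is a high degree of homology among the WEE1 proteins of various organisms. For example, the protein kinase domains of the human and maize WEE1 proteins share 50% identity.
WEE1 has been shown to be expressed only in actively dividing tissues and is believed to inhibit cell division by acting as a negative regulator of mitosis. WEE1 is believed to prevent entry from G2 into M by protecting the nucleus from cytoplasmically activated cyclin B1-CDC2 complexes before mitosis. For example, overexpression of AtWEE1 (from Arabidopsis) and ZmWEE1 (from Zea mays) in fission yeast inhibits cell division, resulting in elongated cells. Sun et al., Proc. Nat'l Acad. Sci. U.S.A. 96:4180-5 (1999).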
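B. Expression profiling and microarray analysis in plant development
The multigenic control of plant phenotype makes it difficult to determine which genes are responsible for a given phenotype. One major obstacle to identifying the genes and gene expression differences that affect plant phenotype is the difficulty of studying the expression of multiple genes simultaneously. A further difficulty in identifying and understanding gene expression and the interrelationships among genes that affect plant phenotype is the high degree of sensitivity that plants display toward environmental factors.
Recent progress has been made using genome-wide expression profiling. In particular, DNA microarrays can be used to study the expression of many genes in a single experiment. Several studies of plant gene responses to developmental and environmental stimuli have been carried out using expression profiling. For example, microarray analysis has been used to study gene expression during fruit ripening in strawberry, Aharoni et al., Plant Physiol. 129:1019-1031 (2002); wounding responses in Arabidopsis, Cheong et al., Plant Physiol. 129:661-7 (2002); pathogen responses in Arabidopsis, Schenk et al., Proc. Nat'l Acad. Sci. 97:11655-60 (2000); and auxin responses in soybean, Thibaud-Nissen et al., Plant Physiol. 132:118. Whetten et al., Plant Mol. Biol. 47:275-91 (2001), disclose expression profiling of cell wall biosynthesis genes in loblolly pine (Pinus taeda L.) using cDNA probes. Whetten et al. examined genes differentially expressed between differentiating juvenile and mature secondary xylem. In addition, to determine the effect of certain environmental stimuli on gene expression, gene expression in compression wood was compared with that in normal wood. Of the 2300 elements examined, 156 showed differential expression. Whetten, supra, at 285. Comparison of juvenile and mature wood revealed 188 differentially expressed elements. Id. at 286.
Although expression profiling, and DNA microarrays in particular, provide a convenient tool for genome-wide expression analysis, their use has been limited to organisms for which a complete genome sequence or a large cDNA collection is available. See Hertzberg et al., Proc. Nat'l Acad. Sci. 98:14732-7 (2001a); Hertzberg et al., Plant J. 25:585 (2001b). For example, Whetten, supra, states, "A more complete analysis of this interesting question awaits the completion of a larger set of both pine and poplar ESTs." Whetten et al., at 286. In addition, microarrays comprising cDNA or EST probes may be unable to distinguish genes of the same family because of sequence similarity among those genes. That is, a cDNA or EST used as a microarray probe may bind to more than one gene of the same family.
A better understanding of cell cycle gene expression in various types of plant tissue, at different stages of plant development, and following stimulation by different environmental factors would aid methods of producing plants with more desirable phenotypes through manipulation of gene expression. The ability to control plant architecture and agronomically important traits could be improved by a better understanding of how cell cycle gene expression gives rise to plant tissue formation, how cell cycle gene expression causes plant cells to enter or exit cell division, and how plant growth and the cell cycle are linked. Of the many genes whose expression changes during plant development, only a fraction are likely to cause phenotypic changes at any given stage of plant development.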
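Summary of the invention
Accordingly, there is a need for tools and methods that can be used to determine the changes in cell cycle gene expression that occur during the plant cell cycle. There is also a need for polynucleotides useful in such methods. There is a further need for methods that can correlate changes in cell cycle gene expression with a phenotype or a stage of plant development. There is a further need to identify cell cycle genes and gene products that affect plant phenotype and that can be manipulated to obtain a desired phenotype.
In one aspect, the invention provides an isolated polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof.
In another aspect, the invention provides a DNA construct comprising at least one polynucleotide having the sequence of any one of SEQ ID NOs: 1-237 and conservative variants thereof.
Another aspect of the invention relates to a plant cell transformed with a DNA construct comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof.
Another aspect of the invention relates to a transgenic plant comprising a plant cell transformed with a DNA construct comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof.
Another aspect of the invention relates to an isolated polynucleotide comprising a sequence encoding the catalytic domain or substrate-binding domain of a polypeptide selected from any one of SEQ ID NOs: 261-497, wherein the polynucleotide encodes a polypeptide having the activity of said polypeptide selected from any one of SEQ ID NOs: 261-497.
Another aspect of the invention relates to a method of making a transformed plant, comprising: transforming a plant cell with a DNA construct comprising at least one polynucleotide having the sequence of any one of SEQ ID NOs: 1-237; and culturing the transformed plant cell under conditions that promote growth of a plant.
In another aspect, the invention provides wood obtained from a transgenic tree.
In another aspect, the invention provides wood pulp obtained from a transgenic tree transformed with a DNA construct of the invention.
Another aspect of the invention relates to a method of making wood, comprising: transforming a plant with a DNA construct comprising a polynucleotide having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; culturing the transformed plant cell under conditions that promote growth of a plant; and obtaining wood from the plant.
The invention further provides a method of making wood pulp, comprising: transforming a plant with a DNA construct comprising a polynucleotide having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; culturing the transformed plant cell under conditions that promote growth of a plant; and obtaining wood pulp from the plant.
In another aspect, the invention provides an isolated polypeptide comprising an amino acid sequence encoded by an isolated polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof.
The invention also provides an isolated polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 261-497.
The invention further provides a method of altering a plant phenotype of a plant, comprising: altering expression in the plant of a polypeptide encoded by any one of SEQ ID NOs: 1-237.
In another aspect, the invention provides a polynucleotide comprising a nucleic acid selected from the group consisting of SEQ ID NOs: 471-697.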
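One aspect of the invention relates to a method of correlating gene expression in two different samples, comprising: detecting, in a first sample, the level of expression of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting the level of expression of the one or more genes in a second sample; comparing the level of expression of the one or more genes in the first sample with the level of expression of the one or more genes in the second sample; and correlating a difference in the level of expression of the one or more genes between the first and second samples.
Another aspect of the invention relates to a method of correlating the possession of a plant phenotype with the level of gene expression of one or more genes in the plant, comprising: detecting, in a first plant possessing a phenotype, the level of expression of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting the level of expression of the one or more genes in a second plant lacking the phenotype; comparing the level of expression of the one or more genes in the first plant with the level of expression of the one or more genes in the second plant; and correlating a difference in the level of expression of the one or more genes between the first and second plants with possession of the phenotype.
In another aspect, the invention provides a method of correlating gene expression with a stage of the cell cycle, comprising: detecting, in a first plant cell in a first stage of the cell cycle, the level of expression of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting the level of expression of the one or more genes in a second plant cell in a second, different stage of the cell cycle; comparing the level of expression of the one or more genes in the first plant cell with the level of expression of the one or more genes in the second plant cell; and correlating a difference in the level of expression of the one or more genes between the first and second samples with the first or second stage of the cell cycle.
One aspect of the invention relates to a combination for detecting expression of one or more genes, comprising two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
One aspect of the invention relates to a combination for detecting expression of one or more genes, comprising two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing to a nucleic acid sequence encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
The invention further provides a microarray comprising a combination for detecting expression of one or more genes, the combination comprising two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237, or wherein each oligonucleotide is capable of hybridizing to a nucleic acid sequence encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237, and wherein each of the two or more oligonucleotides occupies a unique location on a solid support.
In another aspect, the invention provides a method of detecting one or more genes in a sample, comprising: contacting the sample with two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing under standard hybridization conditions to a gene comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237; and detecting the one or more genes of interest that hybridize to the one or more oligonucleotides.
The invention also provides a method of detecting, in a sample, one or more nucleic acid sequences encoded by one or more genes, comprising: contacting the sample with two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing under standard hybridization conditions to a nucleic acid sequence encoded by a gene comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237; and detecting the one or more nucleic acid sequences that hybridize to the one or more oligonucleotides.
The invention further provides a kit for detecting gene expression, comprising a microarray of the invention and one or more buffers or reagents for a nucleotide hybridization reaction.
Other features, objects, and advantages of the invention will be apparent from the detailed description that follows. It should be understood, however, that the detailed description, while indicating preferred embodiments of the invention, is given by way of illustration only and not limitation. Various changes and modifications within the spirit and scope of the invention will be apparent to those skilled in the art from the detailed description.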
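Brief description of the drawings
Figure 1 shows exemplary microarray sampling parameters.
Figure 2 is a plasmid map of pWVK202.
Figure 3 is a plasmid map of pGrowth14.
Figure 4 is a plasmid map of pGrowth15.
Figure 5 is a plasmid map of pGrowth16.
Figure 6 is a plasmid map of pGrowth18.
Figure 7 is a plasmid map of pGrowth19.
Figure 8 is a plasmid map of pGrowth20.
List of tables
Table 1: Genes having a signal more than two-fold greater in any one sample than the average signal of the other three samples.
Table 2: Plasmids, genes, and Genesis ID numbers for the constructs described in Example 17.
Table 3: Rooting medium for Populus deltoides.
Table 4: pGrowth information.
Table 5: Genes having a signal more than two-fold greater in any one sample than the average signal of the other three samples.
Table 6: Differentially expressed cDNAs.
Table 7: Consensus ID information.
Table 8: pGrowth information.
Table 9: Eucalyptus grandis cell cycle genes and proteins.
Table 10: Pinus radiata cell cycle genes and proteins.
Table 11: Annotated peptide sequences of the invention.
Table 12: Eucalyptus in silico data.
Table 13: Pine in silico data.
Table 14: Oligonucleotide table.
Table 15: Peptide table.
Table 16: BLAST sequence alignment table.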
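The selection rule stated for Tables 1 and 5 (a signal in any one sample more than two-fold greater than the average signal of the other three samples) can be sketched as a small filter. This is an illustrative reading of that rule only; the function name, the dictionary layout, and the example values are hypothetical and do not come from the tables themselves.

```python
def genes_over_twofold(signals_by_gene, fold=2.0):
    """Return genes where any one sample exceeds `fold` x the mean of the others."""
    selected = []
    for gene, signals in signals_by_gene.items():
        for i, value in enumerate(signals):
            others = signals[:i] + signals[i + 1:]
            if value > fold * (sum(others) / len(others)):
                selected.append(gene)
                break  # one qualifying sample is enough for this gene
    return selected


# Hypothetical signals for four samples per gene (illustrative values only).
example = {
    "gene_a": [100.0, 30.0, 40.0, 20.0],
    "gene_b": [50.0, 50.0, 50.0, 50.0],
}
print(genes_over_twofold(example))
```

The comparison is strict, so a signal exactly twice the mean of the other samples is not selected.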
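Detailed description
The present inventors have discovered novel isolated cell cycle genes and polynucleotides that are useful for identifying the multigenic factors that contribute to a phenotype and for affecting plant phenotype through manipulation of gene expression. These genes, which are derived from plants of the commercially important forestry genera Pinus and Eucalyptus, are involved in the plant cell cycle and are responsible, at least in part, for the expression of phenotypic characteristics important in commercial wood, such as stiffness, strength, density, fiber dimensions, coarseness, cellulose and lignin content, and extractives content. In general, the genes and polynucleotides encode a protein that can be a cyclin, cyclin-dependent kinase, cyclin-dependent kinase inhibitor, histone acetyltransferase, histone deacetylase, peptidyl-prolyl cis-trans isomerase, retinoblastoma-related protein, WEE1-like protein, or WD40 repeat protein, or a catalytic domain thereof, or a polypeptide having the same function, and the invention further includes such proteins and polypeptides.
The methods of the invention for selecting cell cycle gene sequences for manipulation allow better design and control of transgenic plants with more highly engineered phenotypes. The ability to control the architecture and agronomically important traits of commercially important forestry genera can be improved by the information obtained from these methods, such as which genes influence which phenotypes, which genes affect entry into which phase of the cell cycle, which genes are active at which stage of plant development, and which genes are expressed in which tissue at a given point in the cell cycle or in plant development.
Unless otherwise indicated, all technical and scientific terms used herein are used in a manner consistent with common technical usage. In general, the nomenclature and laboratory procedures described herein, including cell culture, molecular genetics, and nucleic acid chemistry and hybridization, are well known and commonly used in the art. Standard techniques are used for recombinant nucleic acid methods, oligonucleotide synthesis, cell culture, tissue culture, transformation, transfection, transduction, analytical chemistry, organic synthetic chemistry, chemical synthesis, chemical analysis, and pharmaceutical formulation and delivery. In general, enzymatic reactions and purification and/or isolation steps are performed according to the manufacturers' specifications. Absent an indication to the contrary, the techniques and procedures are carried out according to conventional methods as disclosed, for example, in Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d ed. (Cold Spring Harbor Laboratory Press, 1989), and CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (John Wiley & Sons, 1989). Specific scientific methods relevant to the invention are discussed in more detail below. This discussion is provided only by way of example, however, and does not limit the ways in which the methods of the invention can be carried out.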
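A. Plant cell cycle genes and proteins
1. Cell cycle gene, polynucleotide, and polypeptide sequences
One aspect of the invention relates to novel plant cell cycle genes and the polypeptides encoded by such genes. As used herein, the term "plant cell cycle gene" refers to a gene encoding a protein that functions in the plant cell cycle, and the term "plant cell cycle protein" refers to a protein that functions in the plant cell cycle. There are several known families of plant cell cycle proteins, including cyclins, cyclin-dependent kinases, cyclin-dependent kinase inhibitors, histone acetyltransferases, histone deacetylases, peptidyl-prolyl cis-trans isomerases, retinoblastoma-related proteins, WEE1-like proteins, and WD40 repeat proteins. Although there is significant sequence homology within each gene and protein family, each member of each family can display different biochemical properties, and altering the expression of at least one of these genes can result in a different plant phenotype.
The invention provides novel plant cell cycle genes and polynucleotides, and novel cell cycle proteins and polypeptides. According to one embodiment of the invention, the novel plant cell cycle genes are the same as those expressed in a wild-type plant of a species of Pinus or Eucalyptus. Exemplary novel plant cell cycle gene sequences of the invention are set forth in Tables 9 and 10, which describe Eucalyptus grandis sequences and Pinus radiata sequences, respectively. The corresponding gene products, i.e., oligonucleotides and polypeptides, are also listed in Tables 14, 15, and 16. The sequence listing in Appendix 1 provides the sequences of these aspects of the invention.
The sequences of the invention have cell cycle activity and encode proteins that are active in the cell cycle, such as proteins of the cell cycle families described above. As detailed below, manipulating the expression of the cell cycle genes and polynucleotides, or manipulating the activity of the encoded proteins and polypeptides, can result in a transgenic plant having a desired phenotype that differs from the phenotype of a wild-type plant of the same species.
Throughout this specification, reference is made to cell cycle gene products. As used herein, a "cell cycle gene product" is a product encoded by a cell cycle gene, and includes both nucleotide products, such as RNA, and amino acid products, such as proteins and polypeptides. Examples of specific cell cycle genes of the invention include SEQ ID NOs: 1-237. Examples of specific cell cycle gene products of the invention include products encoded by any one of SEQ ID NOs: 1-237. Reference is also made herein to cell cycle proteins and cell cycle polypeptides. Examples of specific cell cycle proteins and polypeptides of the invention include polypeptides encoded by any of SEQ ID NOs: 1-237, or polypeptides comprising the amino acid sequence of any of SEQ ID NOs: 261-497. One aspect of the invention is directed to a subset of these cell cycle genes and cell cycle gene products, namely SEQ ID NOs: 1-12, 14-58, 60-62, 64-70, 72-75, 77-83, 85-86, 88-91, 93-119, 121-130, 132-148, 150-156, 158-191, 193-207, 209-218, 220-221, 223-231, and 233-237, their respective conservative variants (as that term is defined below), and the nucleotide and amino acid products encoded thereby. Another aspect of the invention is directed to a subset of these cell cycle genes and cell cycle gene products, namely SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-76, 78-103, 106, 108-113, 116-121, 124-125, 128-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 208-213, and 215-234, their respective conservative variants, and the nucleotide and amino acid products encoded thereby. Another aspect of the invention is directed to a subset of these cell cycle genes and cell cycle gene products, namely SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-58, 60-62, 64-70, 72-75, 78-83, 85-86, 88-91, 93-103, 106, 108-113, 116-119, 121, 124-125, 128-130, 132-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 209-213, 215-218, 220-221, 223-231, and 233-234, their respective conservative variants, and the nucleotide and amino acid products encoded thereby.
The invention also includes the complement, reverse sequence, or reverse complement of the nucleotide sequences disclosed herein.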
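For readers less familiar with the three terms, the following minimal sketch shows how the complement, reverse sequence, and reverse complement of a DNA string relate to one another. The helper names are illustrative, and only the unambiguous bases A, C, G, and T are handled.

```python
# Translation table for Watson-Crick complements (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq):
    """Complement each base, keeping the original orientation."""
    return seq.translate(_COMPLEMENT)


def reverse(seq):
    """Reverse the order of bases without complementing them."""
    return seq[::-1]


def reverse_complement(seq):
    """Complement each base and reverse the order."""
    return complement(seq)[::-1]


print(complement("ATGC"), reverse("ATGC"), reverse_complement("ATGC"))
```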
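The invention also includes conservative variants of the sequences disclosed herein. As used herein, the term "variant" refers to a nucleotide or amino acid sequence that differs by one or more nucleotide bases or amino acid residues from the reference sequence of which it is a variant.
Thus, in one aspect, the invention includes conservative variant polynucleotides. As used herein, the term "conservative variant polynucleotide" refers to a polynucleotide that hybridizes under stringent conditions to an oligonucleotide probe that, under comparable conditions, binds to the reference gene of which the conservative variant is a variant. Thus, for example, a conservative variant of SEQ ID NO: 1 hybridizes under stringent conditions to an oligonucleotide probe that, under comparable conditions, binds to SEQ ID NO: 1. One aspect of the invention provides conservative variant polynucleotides that exhibit at least about 75% sequence identity to their respective reference sequences.
"Sequence identity" has an art-recognized meaning and can be calculated using published techniques. See COMPUTATIONAL MOLECULAR BIOLOGY, Lesk, ed. (Oxford University Press, 1988); BIOCOMPUTING: INFORMATICS AND GENOME PROJECTS, Smith, ed. (Academic Press, 1993); COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, Griffin & Griffin, eds. (Humana Press, 1994); SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, Von Heinje, ed., Academic Press (1987); SEQUENCE ANALYSIS PRIMER, Gribskov & Devereux, eds. (Macmillan Stockton Press, 1991); and Carillo & Lipman, SIAM J. Applied Math. 48:1073 (1988). Methods commonly used to determine identity or similarity between two sequences include, but are not limited to, those disclosed in GUIDE TO HUGE COMPUTERS, Bishop, ed. (Academic Press, 1994), and Carillo & Lipman, supra. Methods for determining identity and similarity are codified in computer programs. Preferred computer programs for determining identity and similarity between two sequences include, but are not limited to, the GCG program package (Devereux et al., Nucleic Acids Research 12:387 (1984)), BLASTP, BLASTN, FASTA (Altschul et al., J. Mol. Biol. 215:403 (1990)), and FASTDB (Brutlag et al., Comp. App. Biosci. 6:237 (1990)).
The invention includes conservative variant polynucleotides having a sequence identity greater than or equal to 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, or 60% to any one of SEQ ID NOs: 1 to 237. In such variants, differences between the variant and the reference sequence can occur at the 5' or 3' terminal positions of the reference nucleotide sequence or anywhere between those terminal positions, interspersed either individually among nucleotides in the reference sequence or in one or more contiguous groups within the reference sequence.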
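To make the identity thresholds concrete, the sketch below computes percent identity over two sequences that have already been aligned (gaps written as "-"). It is a simplified illustration, not the method of any of the programs cited above: BLAST, FASTA, and the GCG package each compute and report identity in their own way, and the function names and the default 75% threshold parameter are assumptions for this example.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=75.0):
    """True if the pair reaches the given percent identity (assumed default 75)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


print(percent_identity("ATGCATGC", "ATGCATGA"))
```

The denominator here is the alignment length; other conventions (for example, dividing by the length of the shorter sequence) give different values for the same pair.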
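Other conservative variant polynucleotides contemplated and encompassed by the invention include polynucleotides comprising sequences that differ from the polynucleotide sequences of SEQ ID NOs: 1-237, or their complements, reverse complements, or reverse sequences, as a result of deletions and/or insertions totaling less than 10% of the total sequence length.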
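One way to read the 10% criterion is to count inserted and deleted positions in a pairwise alignment and divide by the length of the reference sequence. The sketch below takes that reading, which is an assumption: the specification does not state how "total sequence length" is measured, and the function names are hypothetical.

```python
def indel_fraction(aligned_ref, aligned_var):
    """Fraction of inserted or deleted positions relative to the reference length.

    Both inputs are rows of one pairwise alignment, with gaps written as "-".
    """
    if len(aligned_ref) != len(aligned_var):
        raise ValueError("aligned sequences must have equal length")
    ref_length = sum(1 for base in aligned_ref if base != "-")
    if ref_length == 0:
        raise ValueError("reference sequence is empty")
    # A column is an indel when exactly one of the two rows has a gap.
    indels = sum(
        1 for r, v in zip(aligned_ref, aligned_var) if (r == "-") != (v == "-")
    )
    return indels / ref_length


def within_indel_limit(aligned_ref, aligned_var, limit=0.10):
    """True if insertions plus deletions total less than `limit` of the reference."""
    return indel_fraction(aligned_ref, aligned_var) < limit


print(indel_fraction("ATGCATGCATGCATGCATGC", "ATGCATGCAT-CATGCATGC"))
```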
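The invention also includes conservative variant polynucleotides that, in addition to sharing a high degree of similarity in their primary structure (sequence) to SEQ ID NOs: 1 to 237, have at least one of the following features: (i) they contain an open reading frame or partial open reading frame encoding a polypeptide having substantially the same functional properties in the cell cycle as the polypeptide encoded by the reference polynucleotide, or (ii) they have nucleotide domains or encoded protein domains in common. The invention includes conservative variants of SEQ ID NOs: 1-237 that encode proteins having the enzymatic or biological activity or binding properties of the protein encoded by the reference polynucleotide. Because such conservative variants have the enzymatic or binding activity of the protein encoded by the reference polynucleotide, they are functional variants.
According to the invention, polynucleotide variants can include "shuffled genes," such as those described in U.S. Patent Nos. 6,500,639, 6,500,617, 6,436,675, 6,379,964, 6,352,859, 6,335,198, 6,326,204, and 6,287,862. Variants of the nucleotide sequences of the invention can also be modified polynucleotides as disclosed in U.S. Patent No. 6,132,970, which is incorporated herein by reference.
According to one embodiment, the invention provides a polynucleotide encoding a cell cycle protein from one of the following families: cyclins, cyclin-dependent kinases, cyclin-dependent kinase inhibitors, histone acetyltransferases, histone deacetylases, peptidyl-prolyl cis-trans isomerases, retinoblastoma-related proteins, WEE1-like proteins, or WD40 repeat proteins. SEQ ID NOs: 1-237 provide examples of such polynucleotides.
According to another embodiment, a polynucleotide of the invention encodes the catalytic domain or protein-binding domain of a polypeptide encoded by any of SEQ ID NOs: 1-237 or of a polypeptide comprising any of SEQ ID NOs: 261-497. The catalytic and protein-binding domains of the cell cycle proteins of the invention are known in the art. The conserved sequences of these proteins are shown in entries 1-195 as underlined, bold, and/or italicized text.
The invention also encompasses, as conservative variants, polynucleotides that differ from the sequences discussed above but that, as a consequence of the degeneracy of the genetic code, encode a polypeptide identical to that encoded by a polynucleotide of the invention. The invention also includes conservative variant polynucleotides comprising sequences that differ from the polynucleotides discussed above as a result of substitutions that either do not affect the amino acid sequence of the encoded polypeptide or result in conservative substitutions in the encoded polypeptide sequence.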
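Degeneracy of the genetic code means that two different DNA sequences can encode the same polypeptide. The sketch below illustrates this with the standard genetic code, built from the conventional TCAG codon ordering; the function names and the short example sequences are hypothetical and are not sequences of the invention.

```python
from itertools import product

# Standard genetic code in TCAG order (first, second, third base); "*" is a stop.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encode_same_polypeptide(dna_a, dna_b):
    """True if both sequences translate to the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


# GCT and GCC both encode alanine, so these toy sequences are synonymous.
print(translate("ATGGCTTAA"), encode_same_polypeptide("ATGGCTTAA", "ATGGCCTAA"))
```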
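The invention also includes isolated polypeptides encoded by a polynucleotide comprising any of SEQ ID NOs: 1-237 or any of the conservative variants thereof discussed above. The invention also includes polypeptides comprising SEQ ID NOs: 261-497 and 495-497, and conservative variants of these polypeptides. Another aspect of the invention includes polypeptides comprising SEQ ID NOs: 261-272, 274-318, 320-322, 324-330, 332-335, 337-343, 345-346, 348-351, 353-379, 381-390, 392-408, 410-416, 418-451, 453-467, 469-478, 480-481, 483-491, and 493-494, and conservative variants thereof. Another aspect of the invention includes polypeptides comprising SEQ ID NOs: 261-272, 274, 276-286, 289, 290-297, 300-301, 303-345, 347-363, 366, 368-373, 376-381, 384-385, 388-407, 410-412, 414-415, 20-422, 424-432, 434, 37-443, 445-451, 453-457, 460-464, 468-473, and 475-494, and conservative variants thereof. Another aspect of the invention includes polypeptides comprising SEQ ID NOs: 261-272, 274, 276-286, 290-297, 300-301, 303-318, 320-322, 324-330, 332-335, 337-343, 345, 348-351, 353-363, 366, 368-373, 376-381, 384-385, 88-390, 392-407, 410-412, 414-415, 421-422, 424-432, 434, 437-443, 445-451, 453-457, 460-464, 469-473, 475-478, 480-481, 483-491, and 493-494, and conservative variants thereof.
According to the invention, a variant polypeptide or protein refers to an amino acid sequence that is altered by the addition, deletion, or substitution of one or more amino acids.
The invention includes conservative variant polypeptides. As used herein, the term "conservative variant polypeptide" refers to a polypeptide that has similar structural, chemical, or biological properties to the protein of which it is a conservative variant. Computer programs well known in the art, such as Vector NTI Suite (InforMax, MD) software, can be used to determine which amino acid residues can be substituted, inserted, or deleted. In one embodiment of the invention, conservative variant polypeptides exhibit at least about 75% sequence identity to their respective reference sequences.
Conservative variant proteins include "isoforms" or "analogs" of the polypeptide. Polypeptide isoforms and analogs refer to proteins having the same physical and physiological properties and the same biological function, but whose amino acid sequences differ by one or more amino acids or whose sequences include a non-natural amino acid.
The invention contemplates and encompasses polypeptides comprising sequences that differ from the polypeptide sequences of SEQ ID NOs: 261-497 as a result of amino acid substitutions, insertions, and/or deletions totaling less than 10% of the total sequence length.
One aspect of the invention provides conservative variant polypeptides that have the same function in the cell cycle as the protein of which they are a variant, as determined by one or more appropriate assays, such as those described below. The invention includes variant polypeptides that function as cell cycle proteins, such as those having the biological activity of a cyclin, cyclin-dependent kinase, cyclin-dependent kinase inhibitor, histone acetyltransferase, histone deacetylase, peptidyl-prolyl cis-trans isomerase, retinoblastoma-related protein, WEE1-like protein, or WD40 repeat protein, and that are therefore capable of modulating the cell cycle in a plant. As discussed above, the invention includes variant polynucleotides that encode polypeptides that function as cell cycle proteins.
The activity and physical properties of cell cycle proteins can be examined by any method known in the art. The following examples of assay methods are not exhaustive and are provided to give some guidance in examining the activity and distinguishing the protein characteristics of cell cycle protein variants.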
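CDK activity can be evaluated using roscovitine, as described in Yamaguchi et al., Proc. Natl. Acad. Sci. U.S.A. 100:8019 (2003). CDK histone kinase activity can be assayed by using autoradiography to detect phosphorylation of histone H1 by the CDK, as described in Joubes et al., Plant Physiol. 121:857 (1999).
CKI activity can be assayed using a variation of the method described in Zhou et al., Planta 6:604 (2003). The modified method can use co-transformation or sequential transformation to identify in vivo interactions between a CKI and a cyclin. For example, in a first transformation, pine tissue can be transformed with geneticin selection, using the methods described in U.S. Patent Publication No. 2002/0100083, to obtain transgenic plants carrying cycD3 and cdc2a homologs. A second transformation can be performed using α-methyltryptophan as a selectable marker, as described in U.S. Provisional Patent Application No. 60/476,189, to obtain transformants carrying an ICK1 homolog. Tissue capable of growing on both geneticin and α-methyltryptophan contains the ICK1 homolog as well as the cycD3 and cdc2a homologs. CKI activity is determined by comparing the phenotype of transformants carrying the cycD3 and cdc2a homologs with that of transformants carrying the ICK1 homolog together with the cycD3 and cdc2a homologs.
Histone deacetylase activity can be evaluated by complementation of Arabidopsis mutants, as described in Tian et al., Genetics 165:399 (2003). Histone acetyltransferase activity can be evaluated using anacardic acid, as described in Balasubramanyam et al., J. Biol. Chem. 278:19134 (2003). Histone acetyltransferase activity can also be evaluated using plant lines treated with trichostatin A, as described in Bhat et al., Plant J. 33:455 (2003). Retinoblastoma-related proteins can also be assayed in the plant lines described in Bhat et al., supra, using the co-precipitation method described in Rossi et al., Plant Mol. Biol. 51:401 (2003).
Peptidyl-prolyl isomerases can be assayed as described in Edvardsson et al., FEBS Lett. 542:137 (2003). WD40 proteins can be assessed on the basis of the presence of WD40 motifs and their ability to interact with cdc2. WEE1 can be assayed using any kinase activity assay known in the art.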
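2. Methods of using cell cycle gene, polynucleotide, and polypeptide sequences
The invention provides methods of using plant cell cycle genes and conservative variants thereof. The invention includes methods and constructs for altering the expression of plant cell cycle genes and/or gene products for purposes including, but not limited to, (i) investigating function within the cell cycle and the ultimate effect on plant phenotype, and (ii) effecting a change in plant phenotype. For example, the invention includes methods and tools for modifying wood quality, fiber development, cell wall polysaccharide content, fruit ripening, and plant growth and yield by altering the expression of one or more plant cell cycle genes.
The invention comprises methods of altering the expression of any of the cell cycle genes and variants discussed above. Thus, for example, the invention comprises altering the expression of a cell cycle gene present in the genome of a wild-type plant of a species of Eucalyptus or Pinus. In one embodiment, the cell cycle gene comprises a nucleotide sequence selected from: SEQ ID NOs: 1-237; the subset consisting of SEQ ID NOs: 1-12, 14-58, 60-62, 64-70, 72-75, 77-83, 85-86, 88-91, 93-119, 121-130, 132-148, 150-156, 158-191, 193-207, 209-218, 220-221, 223-231, and 233-237; the subset consisting of SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-76, 78-103, 106, 108-113, 116-121, 124-125, 128-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 208-213, and 215-234; the subset consisting of SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-58, 60-62, 64-70, 72-75, 78-83, 85-86, 88-91, 93-103, 106, 108-113, 116-119, 121, 124-125, 128-130, 132-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 209-213, 215-218, 220-221, 223-231, and 233-234; or the conservative variants thereof discussed above.
Techniques that can be used according to the invention to alter gene expression include, but are not limited to: (i) overexpressing a gene product, (ii) disrupting a gene's transcript, such as disrupting a gene's mRNA transcript, (iii) disrupting the function of a polypeptide encoded by a gene, or (iv) disrupting the gene itself. Overexpression of a gene product, the use of antisense RNAs and ribozymes, and the use of double-stranded RNA interference (dsRNAi) are valuable techniques for discovering the functional effects of a gene and for generating plants with a phenotype different from that of a wild-type plant of the same species.
Overexpression of a target gene is often accomplished by cloning the gene or cDNA into an expression vector and introducing the vector into recipient cells. Alternatively, overexpression can be accomplished by introducing an exogenous promoter into cells to drive expression of a gene residing in the genome. The effect of overexpression of a given gene on cell function, biochemistry, and/or physiological properties can then be evaluated by comparing plants transformed to overexpress the gene with plants that have not been transformed to overexpress the gene.
Antisense RNA, ribozyme, and dsRNAi technologies typically target the RNA transcripts of genes, usually mRNA. Antisense RNA technology involves expressing in, or introducing into, a cell an RNA molecule (or RNA derivative) that is complementary to, or antisense to, sequences found in a particular mRNA in the cell. By associating with the mRNA, the antisense RNA can inhibit translation of the encoded gene product. The use of antisense technology to reduce or inhibit the expression of specific plant genes has been described, for example, in European Patent Publication No. 271988; Smith et al., Nature 334:724-726 (1988); and Smith et al., Plant Mol. Biol. 14:369-379 (1990).
A ribozyme is an RNA that has both a catalytic domain and a sequence complementary to a particular mRNA. The ribozyme functions by associating with the mRNA (through the complementary domain of the ribozyme) and then cleaving (degrading) the message using the catalytic domain.
RNA interference (RNAi) involves a post-transcriptional gene silencing (PTGS) regulatory process in which the steady-state level of a specific mRNA is reduced by sequence-specific degradation of the transcribed, usually fully processed, mRNA without an alteration in the rate of de novo transcription of the target gene itself. RNAi technology is discussed, for example, in Elbashir et al., Methods Enzymol. 26:199 (2002); McManus & Sharp, Nature Rev. Genetics 3:737 (2002); PCT Application No. WO 01/75164; Martinez et al., Cell 110:563 (2002); Elbashir et al., supra; Lagos-Quintana et al., Curr. Biol. 12:735 (2002); Tuschl et al., Nat. Biotechnol. 20:446 (2002); Tuschl, Chembiochem. 2:239 (2001); Harborth et al., J. Cell Sci. 114:4557 (2001); et al., EMBO J. 20:6877 (2001); Lagos-Quintana et al., Science 294:8538 (2001); Hutvagner et al., loc cit, 834; Elbashir et al., Nature 411:494 (2001).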
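The invention provides a DNA construct comprising at least one polynucleotide of SEQ ID NOs: 1-235 or conservative variants thereof, such as the conservative variants discussed above. Any method known in the art can be used to generate the DNA constructs of the invention. See, e.g., Sambrook et al., supra.
The invention includes DNA constructs that optionally comprise a promoter. Any suitable promoter known in the art can be used. A promoter is a nucleic acid, preferably DNA, that binds RNA polymerase and/or other transcription regulatory elements. As with any promoter, the promoters of the invention facilitate or control the transcription of DNA or RNA to generate an mRNA molecule from a nucleic acid molecule that is operably linked to the promoter. The RNA can encode a protein or polypeptide, or can encode an antisense RNA molecule or a molecule useful in RNAi. Promoters useful in the invention include constitutive promoters, inducible promoters, temporally regulated promoters, and tissue-preferred promoters.
Examples of useful constitutive plant promoters include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers constitutive, high-level expression in most plant tissues (Odell et al., Nature 313:810 (1985)); the nopaline synthase promoter (An et al., Plant Physiol. 88:547 (1988)); and the octopine synthase promoter (Fromm et al., Plant Cell 1:977 (1989)). It should be noted that, although the CaMV 35S promoter is commonly referred to as a constitutive promoter, some tissue preference may be observed. The use of CaMV 35S is envisioned by the invention, regardless of any tissue preference that may be exhibited during use in the invention.
Inducible promoters regulate gene expression in response to environmental, hormonal, or chemical signals. Examples of hormone-inducible promoters include auxin-inducible promoters (Baumann et al., Plant Cell 11:323-334 (1999)), cytokinin-inducible promoters (Guevara-Garcia, Plant Mol. Biol. 38:743-753 (1998)), and gibberellin-responsive promoters (Shi et al., Plant Mol. Biol. 38:1053-1060 (1998)). In addition, promoters responsive to heat, light, wounding, pathogen resistance, and chemicals such as methyl jasmonate or salicylic acid can be used in the DNA constructs and methods of the invention.
Tissue-preferred promoters allow for preferential expression of polynucleotides of the invention in certain plant tissues. Tissue-preferred promoters are also useful for directing the expression of antisense RNA or siRNA in certain plant tissues, which can be useful for inhibiting or completely blocking the expression of targeted genes as discussed above. As used herein, vascular plant tissue refers to xylem, phloem, or vascular cambium tissue. Other preferred tissues include apical meristem, root, seed, and flower. In one aspect, the tissue-preferred promoters of the invention are "xylem-preferred," "cambium-preferred," or "phloem-preferred," and preferentially direct expression of an operably linked nucleic acid sequence in the xylem, cambium, or phloem, respectively. In another aspect, the DNA constructs of the invention comprise promoters that are tissue-specific for xylem, cambium, or phloem, wherein the promoters are active only in the xylem, cambium, or phloem.
A vascular-preferred promoter is preferentially active in any one of the xylem, cambium, or phloem tissues, or in at least two of the three tissue types. A vascular-specific promoter is specifically active in any one of the xylem, cambium, or phloem, or in at least two of the three. In other words, the promoter is active only in the xylem, cambium, or phloem tissue of the plant. Note, however, that because of solute transport in plants, a product that is specifically or preferentially expressed in one tissue may be found elsewhere in the plant after expression has occurred.
In another embodiment, the promoter is under temporal regulation, wherein the ability of the promoter to initiate expression is linked to factors such as the stage of the cell cycle or the stage of plant development. For example, the promoter of a cyclin D2 gene may be expressed only during G1 and early S phase, and the promoter of a particular cyclin gene may be expressed only within the primary vascular poles of developing seedlings.
In addition, the promoter of a particular cell cycle gene may be expressed only within the cambium of the developing secondary vasculature. Within the cambium, a particular cell cycle gene promoter may be expressed only in the stem or in the root. Moreover, a cell cycle promoter may be expressed only in the spring (for earlywood formation) or only in the autumn.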
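A promoter can be operably linked to the polynucleotide. As used herein, operably linked refers to linking a polynucleotide encoding a structural gene to a promoter such that the promoter controls transcription of the structural gene. If the desired polynucleotide comprises a sequence encoding a protein product, the coding region can be operably linked to regulatory elements, such as a promoter and a terminator, that bring about expression of an associated messenger RNA transcript and/or a protein product encoded by the desired polynucleotide. In this instance, the polynucleotide is operably linked in the 5' to 3' orientation to a promoter and, optionally, a terminator sequence.
Alternatively, the invention provides DNA constructs comprising a polynucleotide in an "antisense" orientation, the transcription of which produces nucleic acids that can form secondary structures that affect expression of an endogenous cell cycle gene in the plant cell. In another variation, the DNA construct can comprise a polynucleotide that yields a double-stranded RNA product upon transcription, which initiates RNA interference of the cell cycle gene with which the polynucleotide is associated. A polynucleotide of the invention can be positioned within a T-DNA such that the left and right T-DNA border sequences flank, or are on either side of, the polynucleotide.
It should be understood that the invention includes DNA constructs comprising one or more of any of the polynucleotides discussed above. Thus, for example, a construct can comprise a T-DNA containing 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more polynucleotides.
The invention also includes DNA constructs comprising a promoter that includes one or more regulatory elements. Alternatively, the invention includes DNA constructs comprising a regulatory element that is separate from a promoter. Regulatory elements confer a number of important characteristics upon a promoter region. Some elements bind transcription factors that enhance the rate of transcription of the operably linked nucleic acid. Other elements bind repressors that inhibit transcriptional activity. The effect of transcription factors on promoter activity can determine whether promoter activity is high or low, i.e., whether the promoter is "strong" or "weak."
A DNA construct of the invention can include a nucleotide sequence that serves as a selectable marker useful in identifying and selecting transformed plant cells or plants. Examples of such markers include, but are not limited to, the neomycin phosphotransferase (nptII) gene (Potrykus et al., Mol. Gen. Genet. 199:183-188 (1985)), which confers kanamycin resistance. Cells expressing the nptII gene can be selected using an appropriate antibiotic such as kanamycin or G418. Other commonly used selectable markers include a mutant EPSP synthase gene (Hinchee et al., Bio/Technology 6:915-922 (1988)), which confers glyphosate resistance; and a mutant acetolactate synthase gene (ALS), which confers imidazolinone or sulfonylurea resistance (European Patent Application 154,204, 1985).
The invention also includes vectors comprising the DNA constructs discussed above. The vectors can include an origin of replication (replicon) for a particular host cell. Various prokaryotic replicons are known to those skilled in the art and function to direct autonomous replication and maintenance of a recombinant molecule in a prokaryotic host cell.
In one embodiment, the invention utilizes the pWVR8 vector as described in U.S. Patent Application No. 60/476,222, filed June 6, 2003, or pART27 as described in Gleave, Plant Mol. Biol. 20:1203-27 (1992).
The invention also includes host cells transformed with the DNA constructs of the invention. As used herein, a host cell refers to the cell in which a polynucleotide of the invention is expressed. Accordingly, a host cell can be an individual cell, a cell culture, or a cell that is part of an organism. The host cell can also be part of an embryo, endosperm, sperm or egg cell, or a fertilized egg. In one embodiment, the host cell is a plant cell.
The invention further provides transgenic plants comprising the DNA constructs of the invention. The invention includes transgenic plants that are angiosperms or gymnosperms. The DNA constructs of the invention can be used to transform a variety of plants: monocotyledonous plants (e.g., grasses, corn, grains, oat, wheat, and barley), dicotyledonous plants (e.g., Arabidopsis, tobacco, legumes, alfalfa, oaks, eucalyptus, and maple), and gymnosperms (e.g., Scots pine, see Aronen, Finnish Forest Res. Papers, Vol. 595, 1996; white spruce, Ellis et al., Biotechnology 11:84-89, 1993; and larch, Huang et al., In Vitro Cell 27:201-207, 1991).
Plants also include turfgrass, wheat, maize, rice, sugar beet, potato, tomato, lettuce, carrot, strawberry, cassava, sweet potato, geranium, soybean, and various types of woody plants. Woody plants include trees such as palm, oak, pine, maple, fir, apple, fig, plum, and acacia. Woody plants also include rose and grape vines.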
In one embodiment, the DNA constructs of the invention are used to transform woody plants, i.e., trees or shrubs whose stems live for a number of years and increase in diameter each year by the addition of woody tissue. The invention includes methods of transforming plants, including eucalyptus and pine species of significance in the commercial forestry industry, such as plants selected from the group consisting of Eucalyptus grandis and its hybrids and Pinus taeda, as well as the transformed plants and the wood and wood pulp derived therefrom. Other examples of suitable plants include those selected from the group consisting of: Pinus banksiana, Pinus brutia, Pinus caribaea, Pinus clausa, Pinus contorta, Pinus coulteri, Pinus echinata, Pinus eldarica, Pinus elliottii, Pinus jeffreyi, Pinus lambertiana, Pinus massoniana, Pinus monticola, Pinus nigra, Pinus palustris, Pinus pinaster, Pinus ponderosa, Pinus radiata, Pinus resinosa, Pinus rigida, Pinus serotina, Pinus strobus, Pinus sylvestris, Pinus taeda, Pinus virginiana, Abies amabilis, Abies balsamea, Abies concolor, Abies grandis, Abies lasiocarpa, Abies magnifica, Abies procera, Chamaecyparis lawsoniana, Chamaecyparis nootkatensis, Chamaecyparis thyoides, Juniperus virginiana, Larix decidua, Larix laricina, Larix leptolepis, Larix occidentalis, Larix sibirica, Libocedrus decurrens, Picea abies, Picea engelmannii, Picea glauca, Picea mariana, Picea pungens, Picea rubens, Picea sitchensis, Pseudotsuga menziesii, Sequoia gigantea, Sequoia sempervirens, Taxodium distichum, Tsuga canadensis, Tsuga heterophylla, Tsuga mertensiana, Thuja occidentalis, Thuja plicata, Eucalyptus alba, Eucalyptus bancroftii, Eucalyptus botryoides, Eucalyptus bridgesiana, Eucalyptus calophylla, Eucalyptus camaldulensis, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus coccifera, Eucalyptus curtisii, Eucalyptus dalrympleana, Eucalyptus deglupta, Eucalyptus delegatensis, Eucalyptus diversicolor, Eucalyptus dunnii, Eucalyptus ficifolia, Eucalyptus globulus, Eucalyptus gomphocephala, Eucalyptus gunnii, Eucalyptus henryi, Eucalyptus laevopinea, Eucalyptus macarthurii, Eucalyptus macrorhyncha, Eucalyptus maculata, Eucalyptus marginata, Eucalyptus megacarpa, Eucalyptus melliodora, Eucalyptus nicholii, Eucalyptus nitens, Eucalyptus nova-anglica, Eucalyptus obliqua, Eucalyptus occidentalis, Eucalyptus obtusiflora, Eucalyptus oreades, Eucalyptus pauciflora, Eucalyptus polybractea, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus robusta, Eucalyptus rudis, Eucalyptus saligna, Eucalyptus sideroxylon, Eucalyptus stuartiana, Eucalyptus tereticornis, Eucalyptus torelliana, Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis, Eucalyptus viridis, Eucalyptus wandoo, and Eucalyptus youmanii.
如本文所用术语“植物”也旨在包括植物的果实、种子、花、球果等。本发明的经转化植物可以是直接转染体,意思是例如经由农杆菌(Agrobacterium)将DNA构建体直接引入植物,或者所述植物可以是经转染植物的后代。可以通过有性生殖,即受精来产生第二代或后代植物。此外,植物可以是配子体(单倍体阶段)或孢子体(二倍体阶段)。
如本文所用术语“植物组织”涵盖植物的任何部分,包括植物细胞。植物细胞包括悬浮培养物、胼胝质、胚胎、分生组织区、愈伤组织、叶、根、嫩枝、配子体、孢子体、花粉、种子和小孢子。植物组织可以在液体或固体培养基中生长,或者在罐、温室或大田中的土壤或合适培养基中生长。如本文所用“植物组织”也指无论有性或无性产生的植株、种子、后代或繁殖体的克隆体,和所述任一者的后代,例如插枝(cutting)或种子。
根据本发明的一个方面,经本发明DNA构建体转化的转基因植物具有与未经DNA构建体转化的植物不同的表现型。
如本文所用“表现型”是指植物的辨别特性或特征,其可以根据本发明通过将本发明的一个或一个以上DNA构建体整合进植物至少一个植物细胞的基因组内而得以改变。DNA构建体可以通过改变经转化植物细胞或整个植物的任何一个或一个以上遗传、分子、生物化学、生理、形态或农艺学特征或特性来更改经转化植物的表现型。
在一实施例中,植物经本发明DNA构建体的转化可以产生包括(但不限于)以下表现型中任一种或一种以上的表现型:耐旱性增加、除草剂抗性、高度减小或增加、分枝减少或增加、耐寒耐冻性增强、活力提高、颜色增强、健康与营养特征增强、储存性改善、产量增加、耐盐性增强、木材抗腐烂性增强、真菌疾病抗性增强、对昆虫害虫的吸引力改变、重金属耐性增强、疾病耐性增强、昆虫耐性增强、水胁迫耐性增强、甜度提高、质地改善、磷酸盐含量降低、出芽增加、微量营养素吸收增加、淀粉组成改善、花寿命提高、产生新颖树脂和产生新颖蛋白质或肽。
在另一实施例中,受影响的表现型包括以下特性中的一个或一个以上特性:与未经DNA构建体转化的相同种植物相比,形成应力木的倾向、幼年期缩短、幼年期增长、自体脱落分枝、生殖发育加速或生殖发育延迟。
在另一实施例中,在转基因植物中不同的表现型包括一个或一个以上的以下特性:木质素品质、木质素结构、木材组成、木材外观、木材密度、木材强度、木材刚度、纤维素聚合化、纤维尺寸、内腔尺寸、其他植物组份、植物细胞分裂、植物细胞发育、每单位面积细胞数目、细胞尺寸、细胞形状、细胞壁组成、木材形成速率、木材美学外观、茎缺陷形成、平均微纤丝角度、S2细胞壁层宽度、生长速率、根形成速率、根与枝营养发育比率、叶面积指数和叶形。
可以通过任何合适手段评价表现型。可以基于其整体形态来评估植物。可以用肉眼观测转基因植物,可以对其称重和测量高度。可以通过分离植物组织的个别层(即韧皮部和形成层)来研究植物,其进一步分成分生组织细胞、早期扩张、晚期扩张、次生壁形成和晚期细胞成熟。参看,例如Hertzberg,同上文。也可以用显微镜分析或化学分析来评价植物。
显微镜分析包括研究细胞类型、发育阶段和组织与细胞的染料吸收。举例来说,可以使用透射电子显微椭圆光度法来观测纤维形态,例如木浆纤维的纤维壁厚度和微纤丝角度。参看Ye和Sundström,Tappi J.,80:181(1997)。可以通过测量可见光和近红外光谱数据结合多变量分析来确定湿木和立木的木材强度、密度和纹理斜度。参看美国专利公开案第2002/0107644号和第2002/0113212号。可以用扫描电子显微镜法测量内腔尺寸。可以如Marita等人,J.Chem.Soc.,Perkin Trans.1,2939(2001)所述用核磁共振光谱法观测木质素结构和化学特性。
可以通过任何已知的标准分析法来评估木质素、纤维素、碳水化合物和其他植物提取物的生物化学特征,所述方法包括分光光度法、荧光光谱法、HPLC、质谱法和组织染色法。
如本文所用“转化”是指将核酸插入植物细胞基因组中的过程。所述插入涵盖稳定引入至植物细胞中,并且传递给后代。转化也指瞬时插入核酸,其中所产生的转化体瞬时表达该核酸。可以用所属技术领域各种熟知方法在天然或人工条件下进行转化。可以用将核酸序列插入原核或真核宿主细胞中的任何已知方法完成转化,所述方法包括:农杆菌介导转化法、病毒侵染法、晶须法(whisker)、电穿孔法、微量注射法、聚乙二醇处理法、热休克法、脂质体转染法和粒子轰击法。也可以用如Svab等人,Proc.NatlAcad.Sci.87:8526-30(1990)所述的叶绿体转化法完成转化。
根据本发明一实施例,如美国专利申请案第60/476,222号(同上文)所述进行桉树转化,该申请案全文以引用的方式并入本文。根据本发明另一实施例,用美国专利申请公开案第2002/0100083号中所述的方法进行松树转化。
本发明另一方面提供从经本发明DNA构建体转化的植物获得木材和/或制造木浆的方法。制造转基因植物的方法提供于上文并且为所属技术领域已知。可以在任何合适条件下培养或生长经转化的植物。举例来说,可以如美国专利申请公开案第2002/0100083号所述培养并生长松树。举例来说,可以如在Mechanization in Short Rotation,IntensiveCulture Forestry Conference,Mobile,AL,1994中的Rydelius等人,GROWING EUCALYPTUSFOR PULPAND ENERGY来培养和生长桉树。可以通过所属技术领域中任何手段从植物获得木材和木浆。
如上所述,根据本发明所获的木材或木浆可以展示包括(但不限于)以下任一特征或一个以上特征的改进特征:木质素组成、木质素结构、木材组成、纤维素聚合化、纤维尺寸、纤维与其他植物组份的比率、植物细胞分裂、植物细胞发育、每单位面积细胞数目、细胞尺寸、细胞形状、细胞壁组成、木材形成速率、木材美学外观、茎缺陷形成、生长速率、根形成速率、根与枝营养发育比率、叶面积指数和叶形;所述改进特征包括:木质素含量增加或减少、木质素可化学处理性增加、木质素反应性改善、纤维素含量增加或减少、尺寸稳定性增加、抗拉强度增加、抗剪强度增加、抗压强度增加、抗震强度增加、刚度增加、硬度增加或减少、螺旋性降低、收缩率降低和重量、密度与比重差异减小。
B.细胞周期基因的表达型分析
本发明还提供进行细胞周期基因表达型分析的方法与工具。表达型分析可用于确定基因是转录还是翻译,比较不同组织中特定基因的转录水平,基因分型,估计DNA拷贝数,确定血缘一致性,测量mRNA降解率,识别蛋白质结合部位,确定基因产物的亚细胞定位,使基因表达与表现型或其他现象相关联,和确定操纵特定基因对其他基因的影响。表达型分析尤其可用于鉴别复杂的、多基因事件中的基因表达。为此,表达型分析可用于使基因表达与植物表现型和植物组织形成以及其与细胞周期相互关系相关联。
植物基因组中仅有一小部分基因在给定组织样品的给定时间内表达,并且所有经表达的基因可能不影响植物表现型。为了鉴别能够影响所感兴趣的表现型的基因,本发明提供用于确定(例如)细胞周期中给定时间点的基因表达型、植物发育中给定时间点的基因表达型和给定组织样品的基因表达型的方法与工具。本发明也提供鉴别细胞周期基因的方法与手段,可以通过操纵这些基因的表达来改变植物表现型或者来改变细胞周期基因产物的生物活性。为了支持这些方法,本发明也提供辨别相同家族不同基因的表达的方法与工具。
如本文所用“基因表达”是指将DNA序列转录成RNA序列,接着将RNA翻译成蛋白质的过程,其可能经或可能不经转录后加工。因此,细胞周期阶段和/或发育阶段与基因表达之间的关系可通过定量或定性检测RNA或蛋白质水平的变化而进行观测。如本文所用术语“生物活性”包括(但不限于)蛋白质基因产物的活性,包括酶活性。
本发明提供可用于这些表达型分析法的寡核苷酸。每个寡核苷酸能够在一组给定条件下与细胞周期基因或基因产物杂交。本发明一方面提供多个寡核苷酸,其中每个寡核苷酸能够在一组给定条件下与不同细胞周期基因产物杂交。本发明的寡核苷酸实例包括SEQ ID NO:471-697。SEQ ID NO:471-697的每个寡核苷酸在标准条件下与SEQ ID NO:1-237中一者的不同基因产物杂交。本发明的寡核苷酸可用于以上述任何方法来确定一个或一个以上细胞周期基因的表达。
1.细胞、组织、核酸和蛋白质样品
用于本发明方法的样品可以来源于植物组织。合适植物组织包括(但不限于)体细胞胚、花粉、叶、茎、胼胝质、匍匐茎、微管、嫩枝、木质部、雄球花(male strobili)、花粉球果、维管组织、顶端分生组织、维管形成层、根、花和种子。
根据本发明如前文所述来使用“植物组织”。植物组织可获自上文所述的任何植物类型或物种。
根据本发明的一个方面,样品获自细胞周期不同阶段的植物组织,获自不同发育阶段的植物组织,获自一年不同时间的植物组织(例如,春季对夏季),获自经受不同环境条件(例如,光与温度的变化)的植物组织和/或获自不同类型的植物组织和细胞。根据一实施例,在成熟期的不同阶段内和在一年的不同季节内获得植物组织。举例来说,可以从茎分裂细胞、分化中的木质部、发育早期木材细胞、已分化春季木材细胞和已分化夏季木材细胞收集植物组织。作为另一实例,可将获自具有发育中木材的植物的样品中的基因表达与获自无发育中木材的植物的样品中的基因表达相比较。
分化木质部包括获自应压木、单面木(side-wood)和正常垂直木质部的样品。已知从松树和桉树获得样品以用于表达型分析的方法。参看,例如Allona等人,Proc.Nat′lAcad.Sci.95:9693-8(1998)和Whetton等人,Plant Mol.Biol.47:275-91和Kirst等人,INT′LUNION OF FORESTRY RESEARCH ORGANIZATIONS BIENNIAL CONFERENCE,S6.8(2003年6月,Umea,Sweden)。
在本发明一实施例中,将一种类型组织中的基因表达与一种不同类型组织中的基因表达或与不同发育阶段的相同类型组织中的基因表达进行比较。也可以比较在一年不同时间(不同季节)采样的一种类型组织中的基因表达。举例来说,幼年次生木质部的基因表达可以与成熟次生木质部的基因表达进行比较。相似地,形成层中的基因表达可以与木质部中的基因表达进行比较。此外,顶端分生组织中的基因表达可以与形成层中的基因表达进行比较。
在一替代性实施例中,确定不同组织的细胞在细胞周期进程中的基因表达差异。以此方法使不同组织的细胞同步化,并且分析其基因表达型。已知同步化样品中细胞周期阶段的方法。这些方法包括(例如)低温驯化、光周期和蚜肠霉素(aphidicolin)。参看,例如Nagata等人,Int.Rev.Cytol.132:1-30(1992),Breyne和Zabeau,Curr.Opin.Plant Biol.4:136-42,140(2001)。在细胞周期的一特定阶段内获得样品,并且将这个样品中的基因表达与在细胞周期的不同阶段内获得的样品进行比较。举例来说,可以在细胞周期的任何时期内研究组织,例如有丝分裂、G1、G0、S和G2期。具体来说,可以在G1、G2和分裂中期检测点时研究基因表达的变化。
在本发明另一实施例中,从具有特殊表现型的植物获得样品,并且将这个样品中的基因表达与获自相同种但不具有所述表现型的植物的样品进行比较。举例来说,可以从展示快速生长的植物获得样品,并且将其基因表达与获自展示正常和慢速生长的植物的样品进行比较。从所述比较而鉴别得到的差异表达基因可能与生长速率相关,并且因此可用于操纵生长速率。
在另一实施例中,从无性繁殖的植物获得样品。在一实施例中,所述无性繁殖植物是松树或桉树种。可以在一年的不同时间杀死来自相同基因型的个别无性系分株。因此,对于任何基因型来说,可能在一季节的早期与晚期杀死至少两个遗传因子相同的树木。可以将这些树木各自分为幼年(顶部)和成熟(底部)样品。此外,举例来说,可以在至少5个剥离层中将组织样品划分为韧皮部至木质部。可以评价这些样品中每一个样品的表现型和基因表达。参看条目196。
当细胞组份可能干扰分析技术(例如杂交法、酶法、配体结合法或生物活性分析)时,可能需要从这些细胞组份分离基因产物。可以通过所属技术领域任何已知方法从细胞片段或溶解产物分离包括核酸和氨基酸基因产物在内的基因产物。
可以通过任何可用的方法或过程,或者通过其他已为所属技术领域已知的过程来制备根据本发明使用的核酸。举例来说,用于分离核酸的常规技术在Tijssen,LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY:HYBRIDIZATION WITH NUCLEIC ACID PROBES,第3章(Elsevier Press,1993),Berger和Kimmel,Methods Enzymol.152:1(1987),和GIBCO BRL & LIFE TECHNOLOGIES TRIZOL RNA ISOLATION PROTOCOL,第3786号(2000)中有详述。已知用于制备核酸样品,和为来自松树和桉树的多核苷酸测序的技术。参看,例如Allona等人,同上文和Whetton等人,同上文,和美国申请案第60/476,222号。
合适核酸样品可含有来源于细胞周期基因转录本的任何类型的核酸,即RNA或其亚序列,或者从细胞周期基因转录的mRNA充当模板的核酸。合适核酸包括从转录本反转录的cDNA、从所述cDNA转录的RNA、从所述cDNA扩增的DNA,和从经扩增DNA转录的RNA。这些产物和所衍生产物的检测指示样品中转录本的存在和/或丰度。因此,合适样品包括(但不限于)基因的转录本、从所述转录本反转录的cDNA、从所述cDNA转录的cRNA、从所述基因扩增的DNA和从扩增DNA转录的RNA。如本文所用,“转录本”类型包括(但不限于)前体mRNA初级转录本、转录本加工中间体和成熟mRNA与其降解产物。
并不必需监控所有类型转录本以实施本发明。举例来说,可以通过检测仅一个类型的转录本来执行本发明的表达型分析法,例如仅检测成熟mRNA水平。
本发明一方面中,制作染色体DNA或cDNA文库(例如,包含从总细胞mRNA合成的经荧光标记的cDNA)以根据所属技术领域的公认方法用于杂交法中。参看,Sambrook等人,同上文。
本发明另一方面,用(例如)MessageAmp试剂盒(Ambion)扩增mRNA。另一方面,用可检测标记来标记mRNA。举例来说,可以用荧光发色团,例如CyDye(Amersham Biosciences),来标记mRNA。
在一些应用中,可能需要在用于杂交技术之前抑制或破坏通常存在于匀浆或溶解产物中的RNase。抑制或破坏核酸酶的方法众所周知。在本发明一实施例中,在离液剂存在下匀浆化细胞或组织以抑制核酸酶。在另一实施例中,通过热处理,接着用蛋白酶处理来抑制或破坏RNase。
可以通过所属技术领域中任何已知手段获得蛋白质样品。可用于本发明方法中的蛋白质样品包括粗细胞溶解产物和粗组织匀浆液。或者,可以纯化蛋白质样品。所属技术领域中为人熟知的各种蛋白质纯化方法可见于Marshak等人,STRATEGIES FOR PROTEIN PURIFICATION AND CHARACTERIZATION:A LABORATORY COURSE MANUAL(Cold Spring Harbor Laboratory Press,1996)中。
2.检测基因表达水平
对于包含检测基因表达水平步骤的本发明方法来说,可以不加限制地使用用于观测基因表达的任何方法。此类方法包括传统核酸杂交技术、基于聚合酶链反应(PCR)的方法和蛋白质测定法。本发明包括使用基于固体载体的分析形式的检测方法以及那些使用基于溶液的分析形式的检测方法。
尽管可以进行表达水平的绝对测量,但这并不需要。本发明包括包含比较样品之间表达水平差异的方法。可以目测或手工进行表达水平比较,或者可以使用(例如)光学检测装置由机器自动进行。Subrahmanyam等人,Blood.97:2457(2001);Prashar等人,Methods Enzymol.303:258(1999)。可购得用于分析差异基因表达的硬件和软件,并用其实施本发明。参看,例如GenStat Software and GeneExpress® GX Explorer™ Training Manual(同上文);Baxevanis & Francis-Ouellette(同上文)。
根据本发明一实施例,用核酸杂交技术来观测基因表达。例示性杂交技术包括Northern印迹法、Southern印迹法、溶液杂交法和S1核酸酶保护分析法。
核酸杂交技术通常包括在寡核苷酸探针可以通过互补碱基配对与其互补核酸形成稳定杂交双链的条件下,使该探针与包含核酸的样品接触。举例来说,参看PCT申请案WO99/32660;Berger & Kimmel,Methods Enzymol.152:1(1987)。随后将不形成杂交双链的核酸冲洗掉,留下经杂交核酸以待检测,所述检测通常是通过检测所附着的可检测标记来完成。可检测标记可以存在于探针上,或者存在于核酸样品上。在一实施例中,样品核酸是代表存在于植物组织(例如cDNA文库)中的mRNA转录本的经可检测标记的多核苷酸。可检测标记通常为放射性或荧光标记,但也可以使用任何能够检测到的标记。举例来说,可以用WO99/32660(同上文)所述的几种方法并入标记。一方面可以用MessageAmp试剂盒(Ambion)并添加氨基烯丙基-UTP以及游离UTP来扩增RNA。并入经扩增RNA中的氨基烯丙基可以与荧光发色团(例如CyDye(Amersham Biosciences))反应。
可以通过升高温度或者降低含有核酸的缓冲液的盐浓度来使核酸双链不稳定。在低严谨条件下(例如,低温和/或高盐),即使经退火序列不完全互补,也可以形成杂交双链(例如,DNA:DNA、RNA:RNA或RNA:DNA)。因此,在较低严谨度下杂交特异性降低。相反,在较高严谨度下(例如,较高温度和/或较低盐和/或存在去稳定剂时),杂交允许较少的错配。
通常,对于短探针(例如,10至50个核苷酸碱基)的严谨条件为:pH7.0至8.3时至少约0.01至1.0 M的盐浓度,和至少约30℃的温度。也可以通过加入例如甲酰胺等去稳定剂来达到严谨条件。
在一些情形下,需要在低严谨条件下进行杂交以确保杂交,所述低严谨条件例如37℃下6×SSPE-T(0.9M NaCl,60mM NaH2PO4,pH 7.6,6mM EDTA,0.005%Triton)。随后可以在较高严谨度(例如,37℃下1×SSPE-T)下进行洗涤以消除错配的杂交双链。可以在逐渐增高的严谨条件下(例如,37℃至50℃下低至0.25×SSPE-T)进行连续洗涤直至获得所需水平的杂交特异性。
总体来说,杂交标准条件是严谨度(杂交特异性)与信号强度之间的折衷。因此,在本发明一实施例中,在连续较高严谨条件下洗涤经杂交的核酸,并在每次洗涤之间读取。以这种方式得到的数据组的分析将展现一个洗涤严谨度,在所述严谨度之上时杂交图谱并无可感知的变化,并且提供所感兴趣的特殊寡核苷酸探针的足够信号。举例来说,最终洗涤可以选择为最高严谨度洗涤,其产生一致性结果并且提供高于大约10%背景强度的信号强度。
a.寡核苷酸探针
可用于本发明所用核酸杂交技术中的寡核苷酸探针能够通过一类或多类化学键与互补序列的核酸结合,通常是通过经氢键形成的互补碱基配对来结合。探针可以包括天然碱基(即A、G、U、C或T)或经修饰碱基(7-脱氮杂鸟苷、肌苷等)。此外,探针中的核苷酸碱基可以通过除磷酸二酯键之外的键合而连接,但要求该键合不干扰杂交。因此,探针可以是肽核酸,其中组成碱基是通过肽键而非磷酸二酯键合得以连接。
可以通过所属技术领域中任何已知手段制备寡核苷酸探针。可用于本发明的探针能够与细胞周期基因的核苷酸产物杂交,例如SEQ ID NO:1-237中之一。可以用SEQ IDNO:1-237所揭示的核苷酸序列产生可用于本发明的探针。本发明包括具有SEQ ID NO:1-237中任何一者的对应连续序列的至少2、10、15、20、30、35、40、45、50、55、60、65、70、75、80、85或100个核苷酸片段的寡核苷酸探针。本发明包括长度小于2、1、0.5、0.1或0.05 kb的寡核苷酸。在一实施例中,寡核苷酸为60个核苷酸长。
可以通过所属技术领域中任何已知手段设计寡核苷酸探针。参看,例如Li和Stormo,Bioinformatics 17:1067-76(2001)。可以用软件进行寡核苷酸探针设计。例示性软件包括ArrayDesigner、GeneScan和ProbeSelect。与所定义核酸序列互补的探针可以用化学方法合成,可以使用限制性酶从较长核苷酸产生,或者可以使用例如聚合酶链反应(PCR)的技术获得。PCR技术众所周知,且描述于(例如)Innis等人编的PCR PROTOCOLS:A GUIDE TO METHODS AND APPLICATIONS,Academic Press Inc.,San Diego,Calif.(1990)中。举例来说,可以用放射性标签、生物素标签或荧光标签来标记探针。最佳地,样品中的核酸经标记,而探针未被标记。由上述方法产生的寡核苷酸探针可以用于基于溶液或基于固体载体的方法。
本发明包括与细胞周期基因的编码区或3′非翻译区(3′UTR)的产物杂交的寡核苷酸探针。在一实施例中,寡核苷酸探针与SEQ ID NO:1-237中任一者的3′UTR杂交。甚至在相同家族的成员之间,3′UTR也通常是基因的独特区。因此,能够与3′UTR产物杂交的探针可用于区分一个家族内个别基因的表达,其中所述基因的编码区可能是高度同源的。这允许设计用作多个寡核苷酸的成员的寡核苷酸探针,每个都能够独特地与单个基因结合。在另一实施例中,寡核苷酸探针包含SEQ ID NO:471-697中任一者。在另一实施例中,寡核苷酸探针由SEQ ID NO:471-697中任一者组成。
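下面是一个极简示意,说明如何在3′UTR中挑选能够区分同一家族成员的探针:在目标基因的3′UTR上滑动固定长度的窗口,只保留未出现在家族其他成员序列中的片段。函数名 `unique_probes`、示例序列以及“完全匹配即视为可能交叉杂交”的判据都是为说明而作的假设;实际探针设计软件通常还会考虑错配、解链温度和二级结构。

```python
def unique_probes(target_utr, other_sequences, length=60):
    """返回 target_utr 中未在 other_sequences 任一序列里完整出现的所有 length 聚体。

    返回值为 (起始位置, 片段) 的列表;仅做精确子串比较,属于简化假设。
    """
    probes = []
    for start in range(len(target_utr) - length + 1):
        window = target_utr[start:start + length]
        # 只要窗口在任何其他家族成员中出现,就认为它不具有唯一性
        if not any(window in other for other in other_sequences):
            probes.append((start, window))
    return probes


# 假想的两条3′UTR:前10个碱基相同,其后分化(序列为虚构示例)
gene_a_utr = "ATGCCGTTAGCATTTGACCA"
gene_b_utr = "ATGCCGTTAGGGCATCCAAT"

candidates = unique_probes(gene_a_utr, [gene_b_utr], length=8)
print(candidates[0])
```

实际使用时可将 `length` 设为60以得到60聚体候选探针,并把同一家族的全部已知序列放入 `other_sequences`。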
b.寡核苷酸阵列法
本发明的一个实施例并用两个或两个以上的寡核苷酸探针以检测一个或一个以上细胞周期基因的表达水平,例如SEQ ID NO:1-237的基因。本实施例一方面,检测两个或两个以上不同基因的表达水平。所述两个或两个以上基因可能来自上述相同或不同的细胞周期基因家族。所述两个或两个以上寡核苷酸各自能够与这些基因中不同的一个杂交。
本发明一实施例采用两个或两个以上寡核苷酸探针,其中每个探针都特异性地与来源于由SEQ ID NO:1-237所提供的基因转录本的多核苷酸杂交。另一实施例采用两个或两个以上寡核苷酸探针,其中至少一者包含SEQ ID NO:471-697的核酸序列。另一实施例采用两个或两个以上寡核苷酸探针,其中至少一者由SEQ ID NO:471-697组成。
寡核苷酸探针可以包含约5个至约60个,或约5个至约500个核苷酸碱基,例如约60个至约100个核苷酸碱基,包括约15个至约60个核苷酸碱基。
本发明一实施例使用基于固体载体的寡核苷酸杂交法来检测基因表达。适于实施本发明的基于固体载体的方法众所周知,且描述于(例如)PCT申请案WO95/11755;Huber等人,Anal.Biochem.299:24(2001);Meiyanto等人,Biotechniques 31:406(2001);Relogio等人,Nucleic Acids Res.30:e51(2002)中。可以使用寡核苷酸能够共价或非共价结合的任何固体表面。这些固体载体包括滤纸、聚氯乙烯培养皿、基于硅或玻璃的芯片等。
一实施例使用寡核苷酸阵列,即微阵列,可以用其同时观测许多基因或基因产物的表达。寡核苷酸阵列包含设置于固体载体上的两个或两个以上寡核苷酸探针,其中每个探针占据载体上的独特位置。可以预定每个探针的位置,使得给定位置处可检测信号的检测指示与已知身份的寡核苷酸探针的杂交。每个预定位置可以含有一个以上的探针分子,但是预定位置内的每个分子具有相同序列。这些预定位置称作特征。举例来说,单独一个固体载体上可以存在2、10、100、1,000、2,000或5,000或更多的这些特征。在一实施例中,每个寡核苷酸位于阵列上的独特位置处至少2次、至少3次、至少4次、至少5次、至少6次或至少10次。
可以根据例如Lockhart等人,Nat′l Biotech.14:1675(1996),McGall等人,Proc.Nat′l Acad.Sci.USA 93:13555(1996),和Hughes等人,Nature Biotechnol.19:342(2001)中所述的常规技术来制造和使用用于检测基因表达的寡核苷酸探针阵列。许多寡核苷酸阵列设计适于实施本发明。
在一实施例中,所述一个或一个以上寡核苷酸包括每个与在特定组织类型中表达的不同基因杂交的多个寡核苷酸。举例而言,所述组织可以是发育中的木材。
在一实施例中,可以扩增获自植物的核酸样品,并且视情况用可检测标记进行标记。可以使用任何核酸扩增方法和任何适于此目的的可检测标记。举例来说,可以用(例如)Ambion′s MessageAmp进行扩增反应,其创建“反义”RNA或“aRNA”(核酸序列与从样品组织提取的RNA互补)。视情况可以用CyDye荧光标记来标记RNA。在扩增步骤期间,将aaUTP并入所得aRNA中。在非酶反应中,CyDye荧光标记与aaUTP偶联。在扩增和标记步骤之后,沉淀出经标记的扩增反义RNA,并用适当缓冲液洗涤,且随后测定纯度。举例来说,可以用NanoDrop分光光度计测定纯度。核酸样品随后与具有附着于固体基板(“微阵列载片(microarray slide)”)上的能够与可能存在于样品中的所感兴趣的核酸杂交的寡核苷酸样品探针的寡核苷酸阵列接触。在感兴趣的核酸与存在于阵列上的寡核苷酸探针之间发生杂交的条件下进行所述接触步骤。随后洗涤阵列以移除非特异性结合的核酸,并且检测来自仍然与固体基板上的寡核苷酸探针杂交的经标记分子的信号。可以用适合于所用标记类型的任何方法完成检测步骤。举例来说,可以用激光扫描仪和检测器完成检测步骤。例如,可以使用Axon扫描仪,视情况使用GenePix Pro软件,来分析微阵列载片上信号的位置。
可以通过所属技术领域中已知的任何适当方法来分析来自一个或一个以上微阵列载片的数据。
可以用PCR产生用于本发明方法(包括微阵列技术)的寡核苷酸探针。举例来说,基于SEQ ID NO:1-237的序列选择用于产生探针的PCR引物,以引起细胞周期基因独特片段的扩增(即在标准杂交条件下与SEQ ID NO:1-237中任一者的仅一个多核苷酸杂交的片段)。计算机程序可用于设计具有所需特异性和最佳杂交特性的引物。举例来说,Li和Stormo,上文,第1075页中讨论使用ProbeSelect进行探针选择的方法,ProbeSelect是基于完整基因序列以及准备同时探测的其他基因序列来选择最佳的寡核苷酸探针。
在一个实施例中,使用寡核苷酸对照探针。例示性对照探针可以属于以下所述三类中的至少一类:(1)标准化对照,(2)表达水平对照,和(3)阴性对照。在微阵列方法中,一个或一个以上的这些对照探针可以提供于具有本发明细胞周期基因相关寡核苷酸的阵列上。
标准化对照校正染色偏差、组织偏差、灰尘、载片不规则、畸形载片斑等。标准化对照是与经标记的参照寡核苷酸或添加于准备筛选的核酸样品中的其他核酸序列互补的寡核苷酸或其他核酸探针。杂交之后获自标准化对照的信号提供关于杂交条件、标记强度、读取效率和其他可以引起阵列之间完美杂交信号有差异的因素的变更的对照。在一实施例中,从用于所述方法中的所有其他探针读取的信号(例如荧光强度或放射性)除以来自对照探针的信号,进而使测量标准化。
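上文“将其他探针的信号除以对照探针的信号”的标准化步骤可以粗略示意如下。这里假设阵列上有若干个标准化对照点并取其平均值作为除数;函数名、探针名和读数均为虚构,仅用于说明计算方式。

```python
def normalize_by_control(probe_signals, control_signals):
    """用标准化对照的平均信号去除每个探针的信号。

    probe_signals: {探针名: 信号强度}
    control_signals: 标准化对照点的信号强度列表(至少一个)
    """
    control_mean = sum(control_signals) / len(control_signals)
    if control_mean <= 0:
        raise ValueError("对照信号必须为正值")
    return {name: value / control_mean for name, value in probe_signals.items()}


# 虚构读数:两个测试探针和两个对照点
normalized = normalize_by_control({"CDC2": 500.0, "CYCLIN": 250.0}, [100.0, 150.0])
print(normalized)
```

这样得到的是相对于对照的无量纲比值,便于在不同载片之间比较。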
事实上任何探针均可以充当标准化对照。然而,杂交效率随碱基组成和探针长度而变化。选择优选标准化探针以反映所用其他探针的平均长度,但是也可以选择涵盖一定范围的长度。此外,可以选择标准化对照以反映所用其他探针的平均碱基组成。在一实施例中,仅可以使用一个或几个标准化探针,并且对其进行选择以使其良好杂交(即不形成二级结构),且不与任何测试探针配对。在一实施例中,标准化对照是哺乳动物基因。
表达水平对照探针特异性地与存在于生物样品中的组成型表达基因杂交。事实上任何组成型表达基因提供表达水平对照探针的合适目标。表达水平对照探针通常具有与组成型表达的“管家基因”的亚序列互补的序列,所述“管家基因”包括(但不限于)某些光合作用基因。
“阴性对照”探针不与任何测试寡核苷酸(即本发明的细胞周期基因相关寡核苷酸)、标准化对照或表达对照互补。在一实施例中,阴性对照是不与样品中的任何其他序列互补的哺乳动物基因。
术语“背景”和“背景信号强度”是指由经标记的目标核酸(即存在于生物样品中的mRNA)与寡核苷酸阵列的组份之间的非特异性结合或其他相互作用而产生的杂交信号。也可以通过阵列组份自身的固有荧光性产生背景信号。
可以计算整个阵列的单一背景信号,或者可以计算每个目标核酸的不同背景信号。在一实施例中,将背景计算为最低5至10%所用寡核苷酸探针的平均杂交信号强度,或者当计算每个目标基因的不同背景信号时,其为每个基因最低5至10%探针的平均杂交信号强度。当对应于特定细胞周期基因的寡核苷酸探针良好杂交,并且因此显示特异性地结合目标序列时,不将其用于背景信号计算。或者,可以将背景计算为通过与不与样品中所发现的任何序列互补的探针杂交而产生的平均杂交信号强度(例如,针对反义核酸的探针或者针对未发现于样品中的基因的探针)。在微阵列方法中,可以将背景计算为由完全缺少任何寡核苷酸探针的阵列的区域所产生的平均信号强度。
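“取信号最低的5%至10%探针的平均值作为背景”这一算法可以示意如下。比例参数 `fraction` 的默认值取上文范围的下限;函数名和示例数据是假设的,并且这里没有实现“排除已知良好杂交探针”的步骤。

```python
def background_from_lowest(signals, fraction=0.05):
    """把信号最低的 fraction 比例探针的平均强度作为背景估计。"""
    if not signals:
        raise ValueError("需要至少一个信号值")
    ordered = sorted(signals)
    # 至少取一个探针,避免探针数很少时得到空列表
    count = max(1, int(len(ordered) * fraction))
    lowest = ordered[:count]
    return sum(lowest) / len(lowest)


# 虚构的100个探针读数:1, 2, ..., 100
example_signals = [float(v) for v in range(1, 101)]
print(background_from_lowest(example_signals, 0.05))
print(background_from_lowest(example_signals, 0.10))
```

若要为每个目标基因分别计算背景,可以对该基因的探针子集单独调用同一函数。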
c.基于PCR的方法
在另一实施例中,用基于PCR的方法检测基因表达。这些方法包括反转录酶所介导的聚合酶链反应(RT-PCR),其包括实时与终点定量反转录酶所介导的聚合酶链反应(Q-RTPCR)。这些方法为所属技术领域所熟知。举例来说,可以使用购自(例如)Applied Biosystems和Stratagene®的试剂盒和方法来进行定量PCR方法。也参看Kochanowski,QUANTITATIVE PCR PROTOCOLS(Humana Press,1999);Innis等人,同上文;Vandesompele等人,Genome Biol.3:RESEARCH0034(2002);Stein,Cell Mol.Life Sci.59:1235(2002)。
也可以使用Q-RTPCR在溶液中观察基因表达。Q-RTPCR依赖于在PCR产物扩增期间按比例产生的荧光信号的检测。参看Innis等人,同上文。与传统PCR方法一样,所述技术采用通常15-30个碱基长、分别与两条相对的链杂交并位于感兴趣DNA区两侧的PCR寡核苷酸引物。此外,设计探针(例如TaqMan®,Applied Biosystems)以与传统用于PCR技术中的正向与反向引物之间的目标序列杂交。用例如6-羧基荧光素(6-FAM)的报告荧光团和如6-羧基-四甲基-若丹明(TAMRA)的淬灭剂荧光团在5′末端标记探针。只要探针完整,就发生荧光能量转移,而引起淬灭荧光团对报告荧光团的荧光发射的吸收。然而,随着Taq聚合酶延伸引物,Taq的固有5′至3′核酸酶活性降解探针,以释放报告荧光团。在扩增循环期间所检测到的荧光信号增加与每个循环中所产生的产物量成比例。
设计正向与反向扩增引物和内部杂交探针以特异性地且独特地与来自目标基因转录本的一个核苷酸杂交。在一实施例中,引物和探针序列的选择标准结合关于核苷酸含量与尺寸的限制来考虑以适应TaqMan®要求。
可以将SYBR Green®用作上述TaqMan®型测定的替代方案,进行无探针的Q-RTPCR。ABI PRISM® 7900 SEQUENCE DETECTION SYSTEM USER GUIDE,APPLIED BIOSYSTEMS,第1-8章,附录A-F(2002)。
一个装置测量PCR扩增期间荧光发光强度的变化。“实时”进行测量,即随着扩增产物在反应中积累而测量。可以使用其他方法来测量由探针消化所引起的荧光变化。举例来说,荧光偏振可以根据分子翻转(molecular tumbling)来辨别大分子与小分子(参看美国专利第5,593,867号)。
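实时测量得到的是逐循环的荧光读数。一种常见的汇总方式是求荧光首次越过设定阈值的循环数(通常称为Ct值);这一做法并非出自本文,而是定量PCR中的通用约定。下面的函数名、阈值和读数均为假设,相邻两个循环之间采用线性内插,仅作示意。

```python
def threshold_cycle(fluorescence, threshold):
    """返回荧光首次达到 threshold 时的(线性内插)循环数;从未达到则返回 None。

    fluorescence[i] 视为第 i+1 个循环结束时的读数。
    """
    if not fluorescence:
        return None
    if fluorescence[0] >= threshold:
        return 1.0
    for i in range(1, len(fluorescence)):
        prev, cur = fluorescence[i - 1], fluorescence[i]
        if prev < threshold <= cur:
            # prev 对应第 i 个循环,cur 对应第 i+1 个循环
            return i + (threshold - prev) / (cur - prev)
    return None


# 虚构读数:每个循环荧光加倍
readings = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0]
print(threshold_cycle(readings, 12.0))
```

起始模板越多,越早越过阈值,因此较小的循环数对应较高的起始转录本丰度。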
d.蛋白质检测方法
可以通过所属技术领域中的任何已知手段观测蛋白质,包括免疫学方法、酶检测和蛋白质检测/蛋白质组学技术。
可以根据若干蛋白质方法来进行翻译状态的测量。举例来说,蛋白质的基因组监控--“蛋白质组(proteome)”--可以通过构建微阵列来进行,其中结合位点包含对多个具有任何SEQ ID NO:261-497氨基酸序列的蛋白质或由SEQ ID NO:1-237基因或其保守性变体所编码的蛋白质具有特异性的固定抗体,优选为单克隆抗体。参看Wildt等人,Nature Biotechnol.18:989(2000)。制造多克隆和单克隆抗体的方法为熟知,且描述于(例如)Harlow & Lane,ANTIBODIES:A LABORATORY MANUAL(Cold Spring Harbor Laboratory Press,1988)中。
或者,可以通过二维凝胶电泳系统分离蛋白质。二维凝胶电泳是所属技术领域熟知的,且通常包括沿第一维等电聚焦,接着沿第二维SDS-PAGE电泳。参看,例如,Hames等人,GEL ELECTROPHORESIS OF PROTEINS:A PRACTICAL APPROACH(IRL Press,1990)。可以通过许多技术分析所得电泳图,包括质谱技术、用多克隆和单克隆抗体的Western印迹和免疫印迹分析,和内部与N端微量测序(internal and N-terminal micro-sequencing)。
3.使基因表达与表现型和组织发育相关联
如上所述,本发明提供使基因表达与植物表现型相关联的方法与工具。可以在具有感兴趣的表现型的植物中研究基因表达,并且与不具有所述表现型或具有不同表现型的植物相比较。所述表现型包括(但不限于)耐旱性增加、除草剂抗性、高度减小或增加、分枝减少或增加、耐寒耐冻性增强、活力提高、颜色增强、健康与营养特征增强、储存性改善、产量增加、耐盐性增强、木材抗腐烂性增强、真菌疾病抗性增强、对昆虫害虫的吸引力改变、重金属耐性增强、疾病耐性增强、昆虫耐性增强、水胁迫耐性增强、甜度提高、质地改善、磷酸盐含量降低、出芽增加、微量营养素吸收增加、淀粉组成改善、花寿命提高、产生新颖树脂和产生新颖蛋白质或肽。
在另一实施例中,所述表现型包括一个或一个以上以下特性:形成应力木的倾向、幼年期缩短、幼年期增长、自体脱落分枝、生殖发育加速或生殖发育延迟。
在另一实施例中,在植物比较中不同的表现型包括一个或一个以上的以下特性:木质素品质、木质素结构、木材组成、木材外观、木材密度、木材强度、木材刚度、纤维素聚合化、纤维尺寸、内腔尺寸、其他植物组份、植物细胞分裂、植物细胞发育、每单位面积细胞数目、细胞尺寸、细胞形状、细胞壁组成、木材形成速率、木材美学外观、茎缺陷形成、平均微纤丝角度、S2细胞壁层宽度、生长速率、根形成速率、根与枝营养发育比率、叶面积指数和叶形。
可以通过上述任何合适手段评价表现型。
在另一实施例中,基因表达可以与细胞周期中的给定点、植物发育中的给定点和给定组织样品中的给定点相关联。可以在细胞周期不同阶段研究植物组织,从不同发育阶段的植物组织,从一年不同时间的植物组织(例如,春季对夏季),从经受不同环境条件(例如,光与温度的变化)的植物组织和/或从不同类型的植物组织和细胞。根据一实施例,在成熟期的不同阶段内和在一年的不同季节内获得植物组织。举例来说,可以从茎分裂细胞、分化中木质部、发育早期木材细胞、已分化春季木材细胞、已分化夏季木材细胞收集植物组织。
所属领域技术人员明显了解,在不悖离本发明精神或范围下可以对本发明的方法和组合物进行各种修改和变更。因此,本发明意欲涵盖这些对本发明的修改和变更,只要其处在随附权利要求书和其等同内容的范围内。
给出以下实例来说明本发明。然而,应了解,本发明并不限于这些实例中所述的特定条件或细节。在本说明书全文内,对包括美国专利在内的任何和所有可公开获得的文献的参考明确地以引用的方式并入本文。
实例
实例1
实例1说明RNA提取与纯化的过程,其尤其可用于从针叶树松针、木质部、形成层和韧皮部获得的RNA。
从针叶树松针、木质部、形成层或韧皮部获得组织。在液氮中冷冻组织并研磨。用Concert Plant RNA试剂(Invitrogen)提取总RNA。用苯酚:氯仿萃取所得RNA样品,并用DNase处理。RNA随后在65℃下培养2分钟,接着在4℃下离心30分钟。离心之后,将RNA用苯酚萃取至少10次以移除污染物。
进一步用RNeasy管柱(Qiagen)纯化RNA。用RiboGreen试剂(Molecular Probes)定量经纯化RNA,并且通过凝胶电泳评价纯度。
随后用MessageAmp(Ambion)扩增RNA。以4∶1氨基烯丙基-UTP-比-UTP的比率向经纯化RNA的活体外转录本中加入氨基烯丙基-UTP和游离UTP。随着转录,氨基烯丙基-UTP并入至新RNA链中。随后用经修改以用于RNA的Amersham程序,使氨基-烯丙基与Cy染料反应以将比色标记附着于所得经扩增RNA。通过乙醇沉淀移除未并入的染料。以分光光度计(NanoDrop)定量经标记的RNA。如Hughes等人,Nature Biotechnol19:342(2001)所述,通过加热至95℃来打碎经标记的RNA。
实例2
实例2说明如何确定对于辐射松木材发育来说重要的细胞周期基因,以及如何设计并合成用于微阵列上的独特结合于所述基因的寡核苷酸。
在自然光照条件下生长辐射松种的松树。举例来说,如Sterky等人,Proc.Nat′lAcad.Sci.95:13330(1998)所述制备组织样品。具体来说,从具有5米高的木材树木收集组织样品。通过经茎的形成层区形成弦切面来制备木材树木的组织样品。将茎水平切成从年幼(顶部)至成熟(底部)的截面。通过发育阶段分开的茎截面进一步通过剥离成韧皮部、分化中韧皮部、形成层、分化中木质部、发育中木质部和成熟木质部而被分成5层。也从辐射松种的幼苗制备包括叶、芽、嫩枝和根在内的组织样品。
如实例1或上文Sterky等人所述分离RNA和产生EST。将来源于含有发育中木材的样品的EST的核酸序列与已知涉及植物细胞周期的基因的核酸序列进行比较。也将来源于不含有发育中木材的样品的EST与已知涉及植物细胞周期的基因的序列进行比较。用BLAST(NCBI)进行计算机模拟杂交分析。将展示与来自含有发育中木材的样品的EST计算机模拟杂交,但不与来自不含有发育中木材的样品的EST杂交的来自于已知细胞周期基因的序列选择出来以用于进一步研究。
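上述筛选标准(与含发育中木材样品的EST有计算机模拟杂交命中,而与不含发育中木材样品的EST没有命中)本质上是一个集合判断,可以示意如下。这里假设BLAST比对已经完成,并已整理为“基因到命中文库集合”的映射;文库名和基因名均为虚构。

```python
def wood_preferred(hits_by_gene, wood_libraries):
    """挑出命中全部落在含发育中木材文库内的基因。

    hits_by_gene: {基因名: 该基因有EST命中的文库名集合}
    wood_libraries: 来自含发育中木材样品的文库名集合
    """
    selected = []
    for gene, libraries in hits_by_gene.items():
        # 至少有一次命中,且没有任何命中落在非木材文库中
        if libraries and libraries <= wood_libraries:
            selected.append(gene)
    return sorted(selected)


wood_libs = {"xylem", "cambium"}
hits = {
    "cyclin_like_1": {"xylem", "cambium"},   # 仅在木材文库中命中
    "cdc2_like_1": {"xylem", "needle"},      # 在非木材文库中也有命中
    "wee1_like_1": set(),                    # 没有任何命中
}
print(wood_preferred(hits, wood_libs))
```

在真实数据上,“命中”的判定阈值(例如E值或一致性)需要另行设定,此处未涉及。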
用分子生物学领域技术人员熟知的技术从cDNA文库选择含有与展示木材优选表达的基因杂交的序列的cDNA克隆体。使用所述序列信息设计寡核苷酸,使得每个寡核苷酸仅与文库中一个cDNA序列具有特异性。寡核苷酸序列提供于表14中。用上文Li与Stormo的方法,或用例如ArrayDesigner、GeneScan和ProbeSelect的软件设计60聚体寡核苷酸探针。
随后如Hughes等人,Nature Biotechnol.19:342(2001)或如Kane等人,Nucleic Acids Res.28:4552(2000)所述原位合成寡核苷酸,并且用5′氨基连接子将其附加于活化玻璃载片(Sigma-Genosys,The Woodlands,TX)上。每个寡核苷酸在载片上的位置是已知的。
实例3
实例3说明如何确定对于巨桉木材发育来说重要的细胞周期基因,以及如何设计并合成用于微阵列上的独特结合于所述基因的寡核苷酸。
在自然光照条件下生长巨桉种的桉树。举例来说,如Sterky等人,Proc.Nat′l Acad.Sci.95:13330(1998)所述制备组织样品。具体来说,从具有5米高的木材树木收集组织样品。通过经茎的形成层区形成弦切面来制备木材树木的组织样品。将茎水平切成从年幼(顶部)至成熟(底部)的截面。通过发育阶段分开的茎截面进一步通过剥离成韧皮部、分化中韧皮部、形成层、分化中木质部、发育中木质部和成熟木质部而被分成5层。也从巨桉种的幼苗制备包括叶、芽、嫩枝和根在内的组织样品。
如实例1或上文Sterky等人所述分离RNA和产生EST。将来源于含有发育中木材的样品的EST核酸序列与已知涉及植物细胞周期的基因的核酸序列进行比较。也将来源于不含有发育中木材的样品的EST与已知涉及植物细胞周期的基因的序列进行比较。举例来说,如Audic和Claverie,Genome Res.7:986(1997)所述进行计算机模拟杂交分析。将展示与来自含有发育中木材的样品的EST计算机模拟杂交,但不与来自不含有发育中木材的样品的EST杂交的来自于已知细胞周期基因的序列选择出来以用于进一步研究。
用分子生物学领域技术人员熟知的技术从cDNA文库选择含有与展示木材优选表达的基因杂交的序列的cDNA克隆体。使用所述序列信息设计寡核苷酸,使得每个寡核苷酸仅与文库中一个cDNA序列具有特异性。寡核苷酸序列提供于表14中。用上文Li与Stormo的方法,或用例如ArrayDesigner、GeneScan和ProbeSelect的软件设计60聚体寡核苷酸探针。
随后如Hughes等人,Nature Biotechnol.19:342(2001)或如Kane等人,Nucleic Acids Res.28:4552(2000)所述原位合成寡核苷酸,并且用5′氨基连接子将其附加于活化玻璃载片(Sigma-Genosys,The Woodlands,TX)上。每个寡核苷酸在载片上的位置是已知的。
实例4
实例4说明如何使用如实例2制备的寡核苷酸微阵列来检测对于木材形成重要的辐射松细胞周期基因的表达。这是用从成熟期韧皮部(P)、形成层(C)、发现于形成层下面的层中的展开木质部(X1)和发现于相同生长环中更深处的分化、木质化木质部细胞(X2)制备的aRNA样品所进行的平衡不完全区组设计实验的实例。在这个实例中,比较四个样品,即P、C、X1和X2之间的细胞周期基因表达。
在夏季,砍伐辐射松种植物,并且立即温和扯下主茎的树皮以暴露韧皮部和木质部。随后用解剖刀将韧皮部和木质部剥离,放入分开的液氮容器中。也用解剖刀从树木收集松针(叶)和芽,并放入分开的液氮容器中。随后如实例1所述从冷冻组织样品分离RNA。根据厂商说明书用RNeasy Mini管柱(Qiagen,Valencia,CA)从每份样品纯化等微克量的总RNA。
对每份P、C、X1和X2组织样品进行扩增反应。根据厂商说明书,用Ambion′sMessageAmp试剂盒,即基于T7的扩增程序进行扩增反应,不同之处在于在扩增步骤中向试剂混合物中加入经标记的aaUTP。将aaUTP并入至在此步骤中所形成的所得反义RNA。如实例1所述,在非酶反应中,CyDye荧光标记与aaUTP偶联。沉淀并洗涤经标记的扩增反义RNA,并且随后用NanoDrop分光光度计测定纯度。对应于从P、C、X1和X2组织样品分离的RNA的这些经标记反义RNA构成样品核酸,其称为P、C、X1和X2样品。
以500、200、100、50、25和10pg/μl连续稀释向每份样品中加入已知核酸的标准化对照样品以定量信号。也向植物样品中加入对应于展示在所有松树组织中表达的特异基因的阳性对照,例如管家基因。
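加入已知浓度梯度的对照之后,可以用其读数拟合一条标准曲线,再据此估计未知信号对应的浓度。下面的示意假设信号与浓度在双对数坐标下近似线性,并用最小二乘法拟合;这一模型、函数名以及示例读数都是假设,并非本文规定的做法。

```python
import math


def fit_log_log(concentrations, signals):
    """在 log10-log10 坐标下做最小二乘直线拟合,返回 (斜率, 截距)。"""
    xs = [math.log10(c) for c in concentrations]
    ys = [math.log10(s) for s in signals]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_concentration(signal, slope, intercept):
    """由拟合结果反推某一信号对应的浓度(单位与标准品相同)。"""
    return 10 ** ((math.log10(signal) - intercept) / slope)


# 文中给出的稀释系列(pg/μl);信号为虚构值,假设与浓度严格成正比
dilutions = [500, 200, 100, 50, 25, 10]
control_signals = [20.0 * c for c in dilutions]

slope, intercept = fit_log_log(dilutions, control_signals)
print(slope, estimate_concentration(1500.0, slope, intercept))
```

真实阵列在高浓度端常出现饱和,届时应只在线性区间内拟合。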
在盖玻片下,在42℃下用125μL的P、C、X1或X2样品培养四个微阵列载片中的每一个16-18小时。用1×SSC、0.1%SDS洗涤这些阵列10分钟,接着用0.1×SSC、0.1%SDS洗涤10分钟,并使其干燥。
用Axon激光扫描仪扫描阵列载片,并用GenePix Pro软件分析。来自微阵列载片的数据使用GenStat、SAS或Spotfire软件进行微阵列数据分析。去除离群数据,并用整体标准化使每个数据组的比率计量数据标准化,所述整体标准化采用用于校正差异染料偏差和空间效应的三次样条拟合。进行第二次转化以将对照信号比拟合至平均log2=0(即1∶1比率)。经标准化数据随后经方差分析。
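“将对照信号比拟合至平均log2=0”的第二次转化可以理解为:对所有点的log2比率减去同一个偏移量,使对照点的log2比率平均值为零。下面的示意只实现这一步(不包含三次样条校正);函数名和数据为假设。

```python
import math


def center_on_controls(ratios, control_names):
    """把比率取 log2,并整体平移,使对照点的平均 log2 比率为 0。

    ratios: {点名: 信号比率(正数)}
    control_names: 作为对照的点名列表
    """
    log_ratios = {name: math.log2(value) for name, value in ratios.items()}
    offset = sum(log_ratios[name] for name in control_names) / len(control_names)
    return {name: value - offset for name, value in log_ratios.items()}


# 虚构比率:两个对照点和一个测试基因
centered = center_on_controls(
    {"ctrl1": 2.0, "ctrl2": 8.0, "geneA": 16.0},
    ["ctrl1", "ctrl2"],
)
print(centered)
```

平移之后,log2值为0即对应1∶1比率,正值和负值分别表示相对于对照的升高和降低。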
对P、C、X1和X2样品微阵列载片中三个的每一个确定微阵列载片上任何给定位置处每个信号的平均信号强度。这个平均信号/探针位置与未用于计算平均值的样品载片上相同位置处的信号进行比较。举例来说,确定P、C和X1给定位置处的平均信号,并将X2微阵列载片中该位置处的信号与P、C和X1平均信号值进行比较。
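上述“一个样品对其余三个样品平均值”的比较可以示意如下:对某一探针位置,依次把每个样品的信号与其他样品信号的平均值相除。函数名和读数为假设;这里直接使用原始强度的比值,而未取对数。

```python
def fold_vs_others(signals):
    """对每个样品计算:该样品信号 / 其余样品信号的平均值。

    signals: {样品名: 同一探针位置处的信号强度},至少含两个样品
    """
    result = {}
    for name, value in signals.items():
        others = [v for other, v in signals.items() if other != name]
        result[name] = value / (sum(others) / len(others))
    return result


# 同一探针位置在四张载片上的虚构读数
spot = {"P": 100.0, "C": 400.0, "X1": 100.0, "X2": 100.0}
folds = fold_vs_others(spot)
enriched = [name for name, fold in folds.items() if fold > 2.0]
print(folds, enriched)
```

对阵列上每个探针位置重复这一计算,即可列出在某一组织中信号超过其余组织平均值两倍的基因。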
表1展示以任何一个样品与其他三个样品的平均信号相比,具有大于两倍信号的基因。
表1
基因 PvCX12 PvX12 CvX12
WD40重复蛋白A -1.24 -0.88 -1.07
CDC2 -1.09 -0.78 -0.92
CYCLIN -1.08 -1 -0.26
WD-40重复蛋白B -1.01 -0.87 -0.42
CDC2 -0.83 -0.49 -1.01
P=韧皮部
C=形成层
X1=木质部层-1
X2=木质部层-2
PvCX12=韧皮部目标信号对形成层、木质部1和木质部2目标的平均信号比率
数据显示WD40重复蛋白A所编码的WD40重复蛋白在形成层中的表达水平比发育中木质部中的表达水平低很多,而WD40重复蛋白B所编码的WD40重复蛋白在韧皮部中的表达水平比其他组织的表达水平高很多。
随后用RT-PCR检验信号数据以证实对应于探针中独特寡核苷酸的基因在目标组织中的基因表达。
实例5
实例5证明如何将细胞周期基因表达与农艺学上重要的木材表现型相关联,所述表现型例如密度、刚度、强度、枝条间距和螺旋木理。
从已知母树的后代中选择具有极好生长特征和对重要真菌疾病具有抗性的成熟经无性繁殖的松树。从弦切面去除树皮,并且研究树木的胸高处第五年轮的平均木材密度、木材刚度与强度,和螺旋木理。也表征这些树的高度、主枝间的平均距离、树冠大小和分叉。
为了获得因影响密度、刚度、强度、枝条间距、螺旋木理和其他可能与影响这些特征的任何基因相关的特征的主基因而分异的幼苗家族,根据相互之间展示关于密度、刚度、强度、枝条间距和螺旋木理标准的最宽变化范围的标准,选择缺少普通亲本的树木进行种间杂交。因此,使用来自展示高密度、低平均主枝间距和高螺旋木理的精英树(plustree)的花粉来对来自展示最低密度、最高平均主枝间距和最低螺旋木理的选种中的不相关的精英树的球果授粉。重要的是应注意“精英树”经杂交,例如使得用来自展示高密度的精英树花粉来对展示高密度的另一精英树的发育球果授粉,且将可以用来自展示低平均主枝间距的树木花粉对展示低平均主枝间距的另一精英树发育球果授粉。
由所述受控授粉和生长收集种子,使得对每个种子而言母体特性得以维持,并且用于营养繁殖,从而使每种基因型由多个无性系分株所表现。用微繁殖、树篱法或花束插枝来完成营养繁殖。储存每个基因型的一些无性系分株,同时生长每个基因型的营养繁殖体至足够大小以建立大田种植。以重复设计的方式排列基因型,并且在测量并记录每日温度和降雨的大田条件下生长。
测量不同年龄的树木以确定密度、刚度、强度、枝条间距、螺旋木理和任何其他可能与影响这些特征的任何基因相关的可观测特征的表达与分异。收集样品来表征纤维素含量、木质素含量、纤维素微纤丝角度、密度、强度、刚度、管胞形态、轮宽和类似特征。也如实例4所述研究样品的基因表达。将每种基因型的无性系分株与不同年龄下相同基因型的无性系分株进行比较以建立这些特征的年龄相关性。
实例6
实例6证明如何使用如实例4中所制备的微阵列可将植物发育阶段和对例如光照和季节的环境条件的反应与细胞周期基因表达相关联。具体来说,研究与木材密度相关的基因表达变化。
在具有测量每日温度与降雨的气象站的地区生长三种不同经无性繁殖的巨桉杂交基因型的树木。在春季和随后的夏季中,首先以南北方向的标志对三种不同基因型的遗传因子相同的无性系分株照像,这是使用具有足够分辨率以显示植株幼年和成熟部分的树皮特征的照相机进行,并且随后如实例4砍伐。通过种植记录确定树木年龄,并且通过计数年轮来证实。在这些树木的每个中,将成熟木材定义为胸高以下的树木的最外面的年轮,并且将幼年木材定义为胸高以上的树木的最里面的年轮。因此将每株树木如下划分:
NM-北面成熟
SM-南面成熟
NT-北面过渡
ST-南面过渡
NJ-北面幼年
SJ-南面幼年
从植物树干以及从幼年和成熟形式的叶片收集组织。同时制备用于表现型(包括植物形态学和生物化学特征)分析和基因表达分析的样品。记录划分每个分区的位点处的树木的高度和直径,并且获取树木基部的土壤样品用于化学检定。称重为基因表达分析而制备的样品,并放入液氮中以用于随后制备用于微阵列实验的RNA样品。组织表示如下:
P-韧皮部
C-形成层
X1-展开木质部
X2-分化且木质化的木质部
如Ruzin,Plant Microtechnique and Microscopy,Oxford University Press,Inc.,NewYork,NY(1999)所述固定来自树干各区的弦切面和径切面的薄片,以用于解剖学研究并证实木材发育阶段。研究木材不同发育阶段的微纤丝,例如巨桉木材的幼年、过渡和成熟期。其他所研究的特征是各区内纤维与导管分子(vessel element)的比率和射线组织。此外,研究样品幼年与成熟木材之间和春季木材与夏季木材之间的变化特征,例如纤维形态、内腔尺寸和S2(最厚)细胞壁层的宽度。使用木材检定领域技术人员所熟知的技术进一步研究样品的第五年轮密度的测量,和弹性系数的测定。参看,例如,Wang等人,Non-destructive Evaluations of Trees,EXPERIMENTAL TECHNIQUES,第28-30页(2000)。
为了进行生物化学分析,使用植物生物化学领域技术人员熟知的生物化学检定来冻干并分析50克每个收集样品以定量单糖、氨基酸、脂质、其他提取物、木质素和纤维素。参看,例如,Pettersen & Schwandt,J.Wood Chem.&Technol.11:495(1991)。
在本实例中,选择用于进行比较的表现型是高密度木材、平均密度木材和低密度木材。如实例3所述,从在春季和夏季所收集的树木制备核酸样品。如实例3和4所述执行通过杂交进行的基因表达型分析和数据分析。
使用相似技术和经无性繁殖的个体可以研究与例如强度、刚度和螺旋性的其他复杂木材特征相关的细胞周期基因表达。
实例7
实例7证明本发明的寡核苷酸探针辨别细胞周期基因家族中高度同源成员的能力。与阵列上特定寡核苷酸的杂交鉴别独特WD40基因,所述WD40基因在具有较高密度木材的基因型中比在其他所研究基因型中所观测到的表达更强。WD40基因在成熟木材中比幼年木材中表达更强,并且在夏季木材中比春季木材表达更强。在叶或芽中没有发现这个基因的高表达水平。
通过RT-PCR证实所述基因表达模式。这个推定的“密度相关性”基因用于固定径切面的原位杂交。该密度相关性WD40基因与其中木质部主要由具有极少导管分子和极少木质部射线细胞的纤维组成的茎区域中的维管形成层杂交最强。
这些结果表明WD40基因产物在发生于形成层中并且引起直径生长的射线细胞分裂中而非在例如可能对顶点或叶来说是重要的轴向细胞分裂中起作用。这个基因可能难以通过cDNA微阵列或其他传统杂交方法鉴别,这是因为存在于基因中的高度保守区可能引起其与编码具有相似催化功能但行使轴向或径向分裂的酶的基因相混淆。此外,由RT-PCR和计算机模拟杂交所证实,根据揭示这个基因产物在细胞分裂中的功能的基于序列相似性的注释和这个微阵列杂交模式的观测,这个基因产物在正发育的次生木质部中特异性地起作用以引导纤维的细胞分裂模式,使得与导管分子和射线产生相比,此基因的较高表达引起较多纤维产生。纤维含量与主成分分析法(PCA)变量相关,后者解释至少10%的基本密度差异。
实例8
实例8证明如何可以使用本发明的寡核苷酸探针从同源基因家族中鉴别一个木材“密度相关性”WD40重复蛋白基因和其启动子。此外,本实例证明如何使用本方法所鉴别的启动子序列来转化其他硬木种以产生与相同种野生型植株相比增加的直径生长速率。
将WD40基因的序列用于探测基因组步移(Genome Walker)文库以分离包含启动子区的5′侧翼序列。随后使用美国申请案第60/476,222号所述的方法将启动子区经操作连接于β-葡糖苷酸酶报告基因,并将其克隆至双元载体中以转化至桉树中。随后切开再生转基因烟草和桉树植株并用X-gluc染色,证明微阵列数据引起能够在茎的那些发育出的纤维比导管分子或木质部射线更多的部分中单独高度形成层特异表达的启动子的分离。
使用分子生物学领域技术人员熟知的技术,随后将启动子操作连接于细胞分裂启动基因,并且将这个构建体置于双元载体中以转化至硬木植物中,例如枫香属和杨树属,使得细胞分裂启动基因比在维管形成层中正常时的表达更强。相对于对照硬木植物,这引起转基因硬木植物的直径生长速率增加。
实例9
实例9证明密度相关性多肽如何可以与组织优选启动子连接,且在松树中表达,以产生具有增加木材密度的植物。
通过实例7所述方法鉴别在早春期间内更高表达的密度相关性多肽。将具有经操作连接于启动子的密度相关性多肽的DNA构建体置于适当双元载体中,并且使用Connett等人(美国专利申请第09/973,088号和第09/973,089号)方法转化进松树中。如上文Connett等人所述转化松树植株,并且用转基因松树植株建立森林种植。与未经密度相关性DNA构建体转化的对照松树植株相比,在转基因松树植株中观察到甚至是春季木材(早期木材)的增加密度。
实例10
使用分子生物学领域技术人员熟知的技术,在从紫花苜蓿分离的基因组DNA中分析实例7中所分离的推定密度相关性基因的序列。这使得能够鉴别紫花苜蓿的直系同源性,紫花苜蓿的序列随后用于建立RNAi敲除构建体。随后将这个构建体转化进紫花苜蓿。参看,例如Austin等人,Euphytica 85:381(1995)。再生转基因植株显示较低纤维含量和木质部中增加的射线细胞含量。这些特性改进可消化性,与相同种的野生型紫花苜蓿相比,其引起以这种紫花苜蓿为食的牛的较快生长速率。
实例11
实例11证明基因表达分析如何可以用于发现存在于具有所需表现型的成熟植株中的基因变体。这种变体的存在与否可以用来预测成熟植株的表现型,以允许筛选幼苗阶段的植株。尽管本实例采用桉树,但是本文所用方法也可用于松树和其他树木种的育种过程。
如先前实例所述,将推定密度相关性基因的序列用于探测从密度不同的桉树分离的基因组DNA。研究经非转基因产生的具有不同木材表现型的桉树杂交种。一个杂交种展示高木材密度,并且另一杂交种展示较低木材密度。发现可以从较低密度基因变体辨别高密度基因变体的编码区3′部分的分子标记。
这个分子标记使树木育种家能够测定仍然为幼苗阶段的非转基因桉树杂交种的可能密度分布,而在缺少标记的情形下,树木育种家必须在可以可靠预测收获年龄时的密度之前等待树木生长多年。这使得能够在幼苗阶段选择性外部种植最佳树木,而不需要在间苗年龄时的昂贵精选操作和所产生的腐蚀。这个分子标记进一步可用于育种过程以确定哪些母树可以产生高密度异型杂交后代。
在并不对应于更经常地见于较高或较低木材密度非转基因桉树杂交种树木中的变体的基因的编码区3′部分中所发现的分子标记也是可用的。发现这些标记可用于指纹识别桉树的不同基因型,可用于育种过程和种植园中的身份追踪(identity-tracking)。
实例12
本实例描述用于鉴别影响表现型特征的基因表达差异的微阵列,所述表现型特征对于商业木材来说是重要的,即木材外观、刚度、强度、密度、纤维尺寸、粗糙度、纤维素与木质素含量、提取物含量和类似特征。
如实例2-4中,砍伐不同位置和一年中不同收集时间的生产商业重要木材产品的木材树木属,在此情形下为松树和桉树,并且从发育中的木质部、形成层、韧皮部、叶、芽、根和其他组织分离RNA。也从相同属的幼苗分离RNA。
将所有的重叠群(contigs)与从含有发育中木材的样品所分离的RNA所制得的EST和从不含有发育中木材的各种组织的RNA所制得的EST进行比较。确定与从不含有发育中木材的样品所分离的RNA所制得的EST相比展示与从含有发育中木材的样品所分离的RNA所制得的EST更多计算机模拟杂交的主要含有EST的重叠群,以符合尤其在发育中木材中表达的可能新颖基因。随后将这些重叠群用于针对公共域序列的BLAST搜索。将以高严谨度与未知基因或注释为具有仅“假设蛋白质”的基因杂交的那些重叠群选择出来以用于下一个步骤。认为这些重叠群是展示木材优选表达的推定新颖基因。
用分子生物学领域技术人员熟知技术从cDNA文库选择含有与展示木材优选表达的推定新颖基因杂交的序列的最长cDNA克隆体。为这些cDNA测序,并且在可能时获得全长基因编码序列和非翻译侧翼序列。从展示木材优选表达的推定新颖基因的每个序列选择45-80个核苷酸(或寡核苷酸)的段,使得每个寡核苷酸探针在高严谨度下仅与从相同属树木或幼苗所分离的RNA制得的EST中所呈现的一个序列杂交。
随后化学合成寡聚体,并且如实例3所述置于微阵列载片上。每个寡聚体对应展示木材优选表达的推定新颖基因的特定序列,并且不对应从相同属树木或幼苗所分离RNA制得的EST中所呈现序列的其他基因。
如实例4进行样品制备和杂交。本实例中所使用的技术比使用cDNA探针的微阵列技术更为有效,这是因为信号的存在是特定基因表达的显著证据,而不是由于保守功能域或普通进化史而可能与cDNA具有相似性的任何基因表达的显著证据。因此,可能分化同源基因,例如在相同家族中但在表现型决定中可以具有不同功能的同源基因。
因此,用实例4的方法所获的杂交数据使使用者能够鉴别实际具有以下模式的推定新颖基因:与已知基因协同表达的模式,与特定发育作用一致的表达类模式和/或揭示基因具有以有价值方式驱动表达的启动子的表达模式。
因此使用这个方法的杂交数据可以用于(例如)鉴别展示为具有发育中春季木材(早期木材)中的最低纤维素微纤丝角度的管胞所特有的表达模式的推定新颖基因。也可以如实例8分离这个基因的启动子,并且将其操作连接于如实例9所示的与晚期木材(夏季木材)相关的基因。用实例9的方法产生含有这个构建体的转基因松树植株,并且随后显示这些植株的早期木材展示出若干晚期木材特征,例如较高微纤丝角度、较高密度、较小平均内腔尺寸等。
实例13
实例13证明经功能连接于细胞周期基因的形成层特异性启动子用于增加植物生物量的用途。
如实例4所述,经不同次生维管层的阵列分析来鉴别形成层特异性细胞周期转录本。举例来说,用BD Clontech GenomeWalker试剂盒从松树基因组DNA克隆连接于对应这些转录本的基因的候选启动子,并经报告基因测定法在转基因烟草中测试形成层特异性/优选性。将过表达涉及次生木质部细胞分裂的细胞周期基因的形成层特异性启动子用于增加的木材生物量。构建串联形成层特异性启动子以驱动细胞周期ORF。候选细胞周期基因升高的转录水平引起木质部生物量表现型增加。
实例14
来自巨桉的cDNA克隆体的分离与表征
从成熟嫩枝芽、早期木材韧皮部、花组织、叶组织(两个独立文库)、营养根、结构根、木质部或早期木材木质部制备巨桉cDNA表达文库,并且如下进行构建和筛选。
用Chang等人(Plant Molecular Biology Reporter 11:113-116(1993))实验方案从植物组织提取总RNA。用Poly(A)Quik mRNA Isolation Kit(Stratagene,La Jolla,CA)或Dynal Beads Oligo(dT)25(Dynal,Skogen,Norway)从总RNA制剂分离mRNA。根据厂商说明,从经纯化的mRNA通过反转录酶合成,接着用ZAP Express cDNA Synthesis Kit(Stratagene)将所得cDNA克隆体插入至Lambda ZAP中来构建cDNA表达文库。用Gigapack II Packaging Extract(Stratagene),使用来自于依赖于文库的5μl连接反应的等分试样(1-5μl)包装所得cDNA。用XL1-Blue MRF细胞和XLOLR细胞(Stratagene)以及ExAssist辅助噬菌体(Stratagene)进行文库的大块切除(mass excision)。用NZY肉汤(Gibco BRL,Gaithersburg,MD)稀释经切除的噬粒,并且将其铺于含有X-gal和异丙基硫代-β-半乳糖苷(IPTG)的LB-卡那霉素琼脂板上。
经铺板和选择用于DNA miniprep的菌落中,99%含有适于测序的插入物。在具有卡那霉素的NZY肉汤中培养阳性菌落,并且借助碱性溶解和聚乙二醇(PEG)沉淀来纯化cDNA。用1%琼脂糖凝胶来筛选测序模板是否存在染色体污染。根据厂商说明用Turbo Catalyst 800机器(Perkin Elmer/Applied Biosystems Division,Foster City,CA)制备染料引物序列。
用Perkin Elmer/Applied Biosystems Division Prism377测序仪获得阳性克隆体的DNA序列。首先从5′末端进行cDNA克隆体测序,并且在一些情形下也从3′末端开始测序。对于一些克隆体来说,用核酸外切酶III缺失分析获得内部序列,以在pBK-CMV中产生大小差异亚克隆体的文库,或者通过用为鉴别感兴趣基因区而设计的基因特异引物来直接测序。
用计算机算法FASTA和/或BLASTN在EMBL数据库中将所确定的cDNA序列与已知序列比较。将冗余序列的多重比对用于建立可靠的一致序列。基于与来自其他植物种的已知序列的相似性,如本文所述将所分离的多核苷酸序列鉴别为编码转录因子。本文也阐述对应于寡核苷酸序列的经预测多肽序列。
实例15
来自辐射松的cDNA克隆体的分离与表征
如上文实例14所述构建并筛选辐射松cDNA表达文库(从一种以下组织制备:嫩枝芽组织、悬浮培养细胞、早期木材韧皮部(两个独立文库)、花束分裂组织、雌球花、根(未知世系)、营养根、结构根、雌球花、球果原基、雌受精锥和木质部(两个独立文库))。
在Perkin Elmer/Applied Biosystems Division Prism377测序仪上用正向和反向引物获得阳性克隆体的DNA序列,并且如上文所述将所确定的序列与数据库中的已知序列比较。
基于与来自其他植物种的已知序列的相似性,如本文所述将所分离的多核苷酸序列鉴别为编码转录因子。本文也阐述对应于寡核苷酸序列的经预测多肽序列。
实例16
5′RACE分离
为了鉴别cDNA文库中部分cDNA序列的5′或3′其他序列,用SMART RACE cDNA扩增试剂盒(Clontech Laboratories,Palo Alto,Calif.)进行5′和3′cDNA末端快速扩增(RACE)。一般来说,所述方法必须首先分离poly(A)mRNA,进行第一和第二链cDNA合成以产生双链cDNA,钝化cDNA末端,并且随后SMART RACE连接。用接头连接cDNA形成经接头连接的ds cDNA文库。设计基因特异引物以与接头特异引物一起用于5′和3′RACE反应。使用5′和3′RACE反应,获得5′和3′RACE片段,测序并克隆。可以重复这个过程直至鉴别出全长基因的5′和3′末端。通过使用对基因5′和3′末端特异的引物,通过末端-末端(end-to-end)PCR产生全长cDNA。
举例来说,为了从第一链cDNA扩增基因的缺失5′区,从模板序列的相反链5′→3′并且在模板序列的~100-200bp之间的区设计引物。成功的扩增应该在模板的5′末端与PCR产物之间产生~100bp DNA序列重叠。
用Concert Reagent Protocol(Invitrogen,Carlsbad,CA)以及标准分离和提取过程从四个松树组织,即幼苗、木质部、韧皮部和结构根提取RNA。随后用10U/μl DNase I(Roche Diagnostics,Basel,Switzerland)处理所得RNA。对于100μg RNA来说,使用9μl 10×DNase缓冲液(Invitrogen,Carlsbad,CA)、10μl Roche DNase I和90μl无RNase水。随后在室温下培养RNA 15分钟,并且加入1/10体积的25mM EDTA。根据厂商说明用RNeasy小量试剂盒(Qiagen,Venlo,The Netherlands)进行RNA净化。
为了合成cDNA,使用从木质部、韧皮部、幼苗和根提取的RNA,并且根据厂商说明使用SMART RACE cDNA扩增试剂盒(Clontech Laboratories Inc,Palo Alto,CA)。对于RACE PCR来说,组合四种组织类型的cDNA。通过组合等体积来自木质部、韧皮部、根和幼苗组织的cDNA来建立PCR主混合物(master mix)。在96孔PCR板中进行PCR反应,将来自引物稀释板(10mM)的1μl引物添加至对应的孔位置。将49μl主混合物分成等分试样,加入具有引物的PCR板中。在以下参数下在GeneAmp 9700(Applied Biosystems,Foster City,CA)上开始热循环:
94℃(5sec),
72℃(3min),5次循环;
94℃(5sec),
70℃(10sec),
72℃(3min),5次循环;
94℃(5sec),
68℃(10sec),
72℃(3min),25次循环。
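上面的三段式热循环程序可以写成一个简单的数据结构,便于核对循环数和估算总保温时间。温度、时间和循环数取自上文;变量名和函数名是假设的,并且计算中不包含升降温所需的时间。

```python
# 每一段为 (循环次数, [(温度℃, 保持秒数), ...])
RACE_PROGRAM = [
    (5, [(94, 5), (72, 180)]),
    (5, [(94, 5), (70, 10), (72, 180)]),
    (25, [(94, 5), (68, 10), (72, 180)]),
]


def total_cycles(program):
    """返回程序中的循环总数。"""
    return sum(cycles for cycles, _ in program)


def total_hold_seconds(program):
    """返回所有保温步骤的总时长(秒),不含升降温时间。"""
    return sum(
        cycles * sum(seconds for _, seconds in steps)
        for cycles, steps in program
    )


print(total_cycles(RACE_PROGRAM), total_hold_seconds(RACE_PROGRAM) / 60)
```

退火温度由72℃逐段降至68℃,前几轮循环因此更偏向特异性较高的引物结合。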
按照标准程序在琼脂糖凝胶上分离cDNA。按照厂商说明用Qiagen 96孔Gel Elution试剂盒从凝胶切下并洗脱凝胶片段。
根据以下说明,在96孔板中将PCR产物与pGEMTeasy(Promega,Madison,WI)连接过夜:60-80ng DNA、5μl 2×快速连接缓冲液、0.5μl pGEMT easy载体、0.1μl DNA连接酶,用水补足至10μl,并且培养隔夜。
按照标准程序将每个克隆体转化进大肠杆菌中,并且按照标准实验方案从12个所挑选的克隆体提取DNA。在1%琼脂糖凝胶上检验DNA提取和DNA品质。按照标准实验室程序,通过用限制性内切核酸酶EcoRI限制性消化和凝胶电泳来确定每个克隆体中恰当大小插入物的存在与否。
实例17
EST序列处理(curation)
在制得cDNA文库期间,原初转录本或其DNA对应物可能具有阻止其编码功能性蛋白质的特征。可能存在插入、缺失、碱基替换或未剪接或不适当剪接内含子。如果存在这些特征,那么通常可能对其进行鉴别,以使得可以将其改变。可以对任何其他与公共数据库中序列具有同源性的序列进行类似处理。
在确定DNA序列之后,BLAST分析展示其与公开可用拟南芥基因组序列上的一个拟南芥基因相关。然而,代替编码大约240氨基酸多肽,预测所处理的一致序列编码仅157个氨基酸残基的产物,这暗示DNA序列存在错误。为了鉴别真实编码区的可能位置,翻译三个阅读框中每一个中的每个EST末端的DNA序列,并且将所预测的序列与拟南芥基因的氨基酸序列比对。发现EST一部分中的DNA片段编码与拟南芥基因羧基末端具有相似性的序列。因此,显示EST中存在未经剪切的内含子。
关于使用所克隆序列来过表达感兴趣基因来说,未经剪切的内含子是相对较小的问题。可以预计由cDNA转录所得的RNA经正常加工以去除内含子。也预计反义和RNAi构建体行使抑制感兴趣基因的功能。在其他情形下,可能需要鉴别内含子精确限制以可以将之去除。当所述序列具有高度相似的公开序列时,可能能够通过比对两个序列和鉴别序列一致性降低的位置而得以发现内含子,这是借助内含子是以序列GT开始并以序列AG结束的知识。
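利用“内含子以GT开始并以AG结束”这一知识,可以在序列一致性下降的区域内枚举候选内含子边界,再结合比对结果逐一检验。下面是一个示意,假设已经通过比对得到了可疑区域的起止坐标;函数名、最小长度参数和示例序列均为假设,它只列出候选而不判断哪一个是真实内含子。

```python
def candidate_introns(seq, region_start, region_end, min_len=20):
    """列出 seq[region_start:region_end] 内以 GT 开头、以 AG 结尾的所有区段。

    返回 (起点, 终点) 列表,终点为开区间,即候选内含子为 seq[起点:终点]。
    """
    seq = seq.upper()
    found = []
    for start in range(region_start, region_end - 1):
        if seq[start:start + 2] != "GT":
            continue
        for end in range(start + min_len, region_end + 1):
            if seq[end - 2:end] == "AG":
                found.append((start, end))
    return found


# 虚构序列:外显子1 + 内含子 + 外显子2
exon1, intron, exon2 = "ATGGCC", "GTAAATTTTCCCCAAATTTAG", "GCCTAA"
est = exon1 + intron + exon2

candidates = candidate_introns(est, 0, len(est))
spliced = [est[:s] + est[e:] for s, e in candidates]
print(candidates, spliced)
```

对每个候选,可把切除后的序列重新翻译,并与同源蛋白比对,看是否恢复出预期长度的阅读框。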
当由于无可用高度相似序列而存在一些关于内含子位置的争论时,可以用实验检验内含子位置。举例来说,可以在认为的内含子位置处的区域侧翼合成DNA寡聚体。分离来自松树或桉树源种的RNA,并使用反转录酶将其用作制造cDNA的模板。随后将所选引物用于PCR反应以从cDNA群体扩增出经正确剪接的DNA片段(预测大小为大约350bp,其小于原始一致序列的对应片段)。所扩增的片段随后经序列分析,并与一致序列比较以鉴别差异。
当怀疑存在可变剪接事件(保留部分内含子或部分丢失外显子)时,可以使用相同程序。当EST具有小变化时,例如少数碱基的插入或缺失,当预测错误大小的翻译产物时或如果存在明显移码时,EST序列的计算机分析仍可以指示其位置。通过如上文所述合成引物、产生新cDNA和PCR扩增来检验真实序列。
实例18
用含有细胞周期基因的构建体转化美洲黑杨。
通过标准技术将如上文实例中所述并展示于表2中的构建体各自接种于农杆菌培养物中。
表2鉴别用于实例17中所述构建体的质粒、基因和Genesis ID号。
表2
质粒 基因 Genesis ID
pGrw14 周期素A prga001823
pGrw15 周期素A prpe001264
pGrw16 周期素D prxa004540
pGrw18 周期素D prxl006271
pGrw19 周期素D prpb019661
pGrw20 WEE1样蛋白 prrd041233
室温下,在生长室中在含有2.5μM玉米素的DKW培养基(Driver和Kuniyuki,1984,McGranahan等人1987,购自Sigma/Aldrich)上维持美洲黑杨储备植物培养物16h光周期。对于转化来说,用锋利解剖刀片从储备植株上无菌切下叶柄,将其切成4-6mm长,在收集之后立即置于含有1μg/ml BAP和1μg/ml NAA的DKW培养基上,并在黑暗生长室(28℃)中培养24小时。
由0.8-1.0A之间的OD600所指示,将含有所需构建体的农杆菌培养物生长至对数期,随后离心成小球,并重悬于等体积农杆菌诱导培养基(AIM)中,所述培养基含有木料植物培养基盐(Lloyd,G,and McCown,B.,1981.Woody plant medium.Proc.Intern.Plant Prop.Soc.30:421,购自Sigma/Aldrich)、5g/L葡萄糖和pH 5.8下0.6g/L MES,每毫升AIM中加入1μl的100mM乙酰丁香酮储液。通过涡漩重悬离心小球(pellet)。在100rpm摇晃的环境室中,在28℃下于所述培养基中培养细菌细胞1小时。
培养期后,将美洲黑杨外植体暴露于农杆菌混合物15分钟。随后在无菌纸巾上吸干外植体,再置于相同植物培养基上,并在18-20℃的暗处培养。三天共培养之后,将外植体转移至DKW培养基中,所述DKW培养基中NAA浓度降至0.1μg/ml,并且其中加入400mg/L提门叮(timentin)以根除农杆菌。
在根除培养基上4天之后,将外植体转移至含有相同培养基的小品红盒中,所述培养基中补充有提门叮(400mg/L)以及选择剂遗传霉素(50mg/L)。每两周一次将外植体转移至新鲜选择培养基上。分离在选择培养基存在下所生长的胼胝质,并且每三周将其在新鲜选择培养基中亚培养。观察胼胝质不定根的产生。
通常在开始转化之后两个月内观察到不定根。将这些根簇转移到DKW培养基中用于嫩枝延长通常约14周,所述DKW培养基中未加入NAA,并且其中BAP浓度降至0.5μg/ml。切下经延长的嫩枝,并且转移至pH5.8下含有20g/l蔗糖和5g/l活性炭的BTM培养基中(Chalupa,Communicationes Instituti Forestalis Čechosloveniae 13:7-39,1983,购自Sigma/Aldrich)。参看下表3。
表3.美洲黑杨的生根培养基。
BTM-1培养基组份 mg/l
NH4NO3 412
KNO3 475
Ca(NO3)2·4H2O 640
CaCl2·2H2O 440*
MgSO4·7H2O 370
KH2PO4 170
MnSO4·H2O 2.3
ZnSO4·7H2O 8.6
CuSO4·5H2O 0.25
CoCl2·6H2O 0.02
KI 0.15
H3BO3 6.2
Na2MoO4·2H2O 0.25
FeSO4·7H2O 27.8
Na2EDTA·2H2O 37.3
Myo-肌醇 100
烟酸 0.5
吡哆醇HCl 0.5
维生素B1 HCl 1
甘氨酸 2
蔗糖 20000
活性炭 5000
根发育之后,通常四周,通过切根法(rooted cutting method)在温室中繁殖转基因植株,或者活体外通过在含有11.4μM玉米素的DKW培养基上腋生枝诱导四周,其后分离所繁殖的嫩枝并转移至根诱导培养基上。将生根植株转移至土壤以评价在温室和大田条件下的生长。
实例19
由某些周期素D基因的异常表达所介导的不成比例大叶片的产生
用pGRW16和pGRW19转化每个构建体的大约100株美洲黑杨外植体,其含有通常展示在维管中的优选表达的基因,所述表达是由组成型启动子(辐射松超级泛素启动子)所驱动。再生之后,观察许多转化系(transline)的许多无性系分株具有相对于对照植株不成比例的大叶片。这些叶片比对照植株的叶片更长而且更宽。
不成比例的大叶片可能是生长潜能、大叶片大小和由此带来的高生长潜能的极为有用的早期指标。大叶片大小可能是增加数目的叶细胞或增加的叶细胞大小或两者的函数。
实例20
由周期素D基因的异常表达所介导的不正常维管发育的产生
用pGRW18转化每个构建体的大约100株美洲黑杨外植体。从这个实验中所再生的多个转基因系展示极为独特的基因多效表现型。这些转基因系的叶片在中脉的两侧对称折叠至叶片全部长度。这些系的许多叶柄成螺旋,并且在许多情形下以右手形式朝叶片翻转360度。茎展示在靠近中间处稍许加厚并且略微弯曲。
杀死展示这些表现型的转基因系TDL002534的一株无性系分株用于研究所述组织水平的偏差。经甲苯胺蓝染色的卷曲叶柄的横切面显示维管发育的延迟,但是存在如黑色箭头所指的其他维管柱发育。卷曲叶柄的维管柱内的木质部和韧皮部显示发育相似和正确空间导向。直和卷曲叶柄的纵切面可能提供螺旋现象的解释。卷曲叶柄展示在卷曲外部转角上的更为延长的细胞和叶柄反面上更为压缩的细胞。
最显著的表现型可能是在叶片中所鉴别的。与叶柄一样,发现异常维管发育,其包含较大中脉侧面的另外形成维管柱。在一些切面中,几乎可以在靠近中脉处发现完全形成的叶脉。在所有发现折叠表现型的实例中,这种类型叶片外形与表现型相关。
在发现通常少数维管束或单个中脉的空间中其他维管柱的发育指示在早期维管发育水平上的不正常细胞分裂活性。因此,在维管优选启动子而非组成型启动子控制下所表达的这个基因具有增加后期维管发育中细胞分裂的效用,以产生额外木材。
实例21
本实例说明如何确定对于辐射松木材发育来说重要的多核苷酸,以及如何设计并合成用于微阵列上的独特结合于所述基因的寡核苷酸。
从美国种植园选择大约16年的开花授粉火炬松,并且在新西兰种植园选择大约16年的开花授粉辐射松。在春季和夏季砍伐树木,来比较与木材形成不同发育阶段相关的基因表达。各自砍伐树木,并且从底部区域去除树干部分(trunk section),这个区域是从底部大约1至2米,且在活树冠以下1至2米内。从树干底端去除的部分含有成熟木材。从活树冠以下去除的部分含有幼年木材。将在春季所收集的样品命名为早期木材或春季木材,而将在夏季所收集的样品视为晚期木材或夏季木材(Larson等人,Gen.Tech.Rep.FPL-GTR-129.Madison,WI:U.S.Department of Agriculture,Forest Service,Forest Products Laboratory,第42页)。
从树干部分分离组织,使得去除韧皮部、形成层、发育中木质部和成熟中木质部。仅从当年生长年轮收集这些组织。在每种情形下去除组织之后,立即将材料浸于液氮中以保存核酸和其他组份。剥离所述部分的树皮,并且用剃刀刮削而从树皮里面移除韧皮部组织。通过轻轻刮取表面来从经剥离部分的外表面分离形成层组织。通过连续更有力地刮取剩余组织来分离发育中木质部和木质化木质部。将组织从液氮转移至容器中用于-70℃下长期储存直至进行RNA提取和后续分析。
实例22
本实例说明RNA提取与纯化的过程,其尤其可用于从针叶树松针、木质部、形成层和韧皮部获得的RNA。
从针叶树松针、木质部、形成层或韧皮部获得组织。在液氮中冷冻组织并研磨。用Concert Plant RNA试剂(Invitrogen)提取总RNA。用苯酚:氯仿萃取所得RNA样品,并用DNase处理。RNA随后在65℃下培养2分钟,接着在4℃下离心30分钟。离心之后,将RNA在苯酚中萃取至少10次以移除污染物。
进一步用RNeasy管柱(Qiagen)净化RNA。用RiboGreen试剂(Molecular Probes)定量经纯化RNA,并且通过凝胶电泳评价纯度。
随后用MessageAmp(Ambion)扩增RNA。以4∶1氨基烯丙基-UTP-比-UTP的比率向经纯化RNA的活体外转录本中加入氨基烯丙基-UTP和游离UTP。随着转录,氨基烯丙基-UTP并入至新RNA链中。随后用经修改以用于RNA的Amersham程序,使氨基-烯丙基与Cy染料反应以将比色标记附着于所得经扩增RNA。通过乙醇沉淀移除未并入的染料。以分光光度计(NanoDrop)定量经标记的RNA。如Hughes等人,Nature Biotechnol.19:342(2001)所述,通过加热至95℃来打碎经标记的RNA。
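Example 23
This example illustrates how genes important for wood development in Pinus radiata can be identified, and how oligonucleotides that bind uniquely to those genes can be designed and synthesized for use on a microarray.
Pine trees of the species Pinus radiata were grown under natural light conditions. Tissue samples were prepared as described, for example, in Sterky et al., Proc. Nat'l Acad. Sci. 95:13330 (1998). Specifically, tissue samples were collected from woody trees 5 meters in height. Tissue samples of the woody trees were prepared by taking tangential sections through the cambial region of the stem. The stem was sectioned horizontally into sections ranging from juvenile (top) to mature (bottom). The stem sections, separated by developmental stage, were further divided into 5 layers by peeling them into phloem, differentiating phloem, cambium, differentiating xylem, developing xylem and mature xylem. Tissue samples including leaves, buds, shoots and roots were also prepared from seedlings of Pinus radiata.
RNA was isolated and ESTs were generated as described in the preceding examples or in Sterky et al., supra. The nucleic acid sequences of ESTs derived from samples containing developing wood were compared with the nucleic acid sequences of genes known to be involved in polysaccharide synthesis. ESTs derived from samples not containing developing wood were also compared with the sequences of genes known to be involved in the plant cell cycle. In silico hybridization analysis was performed with BLAST (NCBI) as follows.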
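Example 24
Eucalyptus in silico data
In silico gene expression can be used to determine the membership of the consensus EST libraries. For each library, a consensus value is determined as the number of ESTs in any tissue class divided by the total number of ESTs in that class, multiplied by 1000. These values provide a normalized measure that is not biased by the extent to which a library has been sequenced. Several libraries were sampled for consensus values, including reproductive, bud reproductive, bud vegetative, fruit, leaf, phloem, cambium, xylem, root, stem, sap vegetative and whole-plant libraries.
As shown below, a number of the sequences of the invention exhibit vascular-preferred expression (that is, when the databases are searched at random, more than 50% of the hits are likely to fall in libraries made from developing vascular tissue) and are therefore likely to be involved in wood-related developmental processes. The data are shown in Table 12.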
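The normalization and the 50% criterion described above can be sketched as follows. This is a minimal illustration only: the function names, the library names treated as vascular, and all counts are hypothetical and are not taken from Table 12.

```python
def normalized_est_frequency(est_in_class: int, est_total_in_class: int) -> float:
    """ESTs for one consensus in a tissue class, per 1000 ESTs sequenced in that class."""
    if est_total_in_class <= 0:
        raise ValueError("library must contain at least one EST")
    return est_in_class * 1000 / est_total_in_class


def vascular_fraction(hits_by_library: dict, vascular=("phloem", "cambium", "xylem")) -> float:
    """Fraction of all hits that fall in libraries made from vascular tissue."""
    total = sum(hits_by_library.values())
    if total == 0:
        return 0.0
    return sum(hits_by_library.get(lib, 0) for lib in vascular) / total


# Hypothetical counts for one consensus sequence.
hits = {"xylem": 6, "cambium": 3, "leaf": 1, "root": 2}
print(normalized_est_frequency(12, 4000))  # ESTs per 1000 in one class
print(vascular_fraction(hits) > 0.5)       # vascular-preferred by the 50% criterion
```

Dividing by the library size before comparing tissues is what keeps a deeply sequenced library from dominating the comparison.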
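Example 25
Pine in silico data
In silico gene expression can be used to determine the membership of the consensus EST libraries. For each library, a consensus value is determined as the number of ESTs in any tissue class divided by the total number of ESTs in that class, multiplied by 1000. These values provide a normalized measure that is not biased by the extent to which a library has been sequenced. Several libraries were sampled for consensus values, including needle, phloem, cambium, xylem, root, stem and whole-plant libraries.
As shown below, a number of the sequences of the invention exhibit vascular-preferred expression (that is, when the databases are searched at random, more than 50% of the hits are likely to fall in libraries made from developing vascular tissue) and are therefore likely to be involved in wood-related developmental processes. The data are shown in Table 13.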
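Example 26
Sequences showing in silico hybridization to ESTs made from samples containing developing wood, but not to ESTs from samples not containing developing wood, were selected for further study.
cDNA clones containing sequences that hybridize to the genes showing wood-preferred expression were selected from cDNA libraries using techniques well known to those skilled in molecular biology. The sequence information was used to design oligonucleotides such that each oligonucleotide is specific for only one cDNA sequence in the library. The oligonucleotide sequences are provided in Table 14. 60-mer oligonucleotide probes were designed using the method of Li and Stormo, supra, or using software such as ArrayDesigner, GeneScan and ProbeSelect.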
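The requirement that each oligonucleotide be specific for a single cDNA can be illustrated with a naive exact-match filter. This sketch is not the Li and Stormo method or any of the named software packages; it only keeps subsequences of length k that occur in exactly one input sequence, and it ignores near matches, melting temperature and the reverse strand. The function name and the toy sequences are hypothetical.

```python
from collections import defaultdict


def unique_probe_candidates(cdnas: dict, k: int = 60) -> dict:
    """Map each cDNA name to the k-mers that occur in that cDNA and in no other."""
    owners = defaultdict(set)  # k-mer -> names of the cDNAs containing it
    for name, seq in cdnas.items():
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            owners[seq[i:i + k]].add(name)
    candidates = {name: [] for name in cdnas}
    for kmer, names in owners.items():
        if len(names) == 1:
            candidates[next(iter(names))].append(kmer)
    return candidates


# Toy input with k=4 so the result is easy to inspect by eye.
toy = {"cdnaA": "ACGTAC", "cdnaB": "CGTACC"}
print(unique_probe_candidates(toy, k=4))
```

In practice the surviving candidates would still need screening for cross-hybridization to similar sequences, which this exact-match test does not detect.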
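The oligonucleotides were then synthesized in situ as described in Hughes et al., Nature Biotechnol. 19:342 (2001) or in Kane et al., Nucleic Acids Res. 28:4552 (2000), and attached to activated glass slides (Sigma-Genosis, The Woodlands, TX) using a 5′ amino linker. The position of each oligonucleotide on the slide is known.
Example 27
This example illustrates how oligonucleotide microarrays prepared as in the preceding examples are used to detect expression of the Pinus radiata genes of this application that are important for wood formation. It is an example of a balanced incomplete block design experiment carried out with aRNA samples prepared from mature-phase phloem (P), cambium (C), expanding xylem found in the layer below the cambium (X1), and differentiating, lignifying xylem cells found deeper in the same growth ring (X2). In this example, cell cycle gene expression is compared among the four samples P, C, X1 and X2.
In summer, Pinus radiata plants were felled, and the bark of the main stem was immediately and gently pulled off to expose the phloem and xylem. The phloem and xylem were then stripped with a scalpel and placed in separate containers of liquid nitrogen. Needles (leaves) and buds were also collected from the trees with a scalpel and placed in separate containers of liquid nitrogen. RNA was then isolated from the frozen tissue samples as described in Example 1. Equal microgram amounts of total RNA were purified from each sample with RNeasy Mini columns (Qiagen, Valencia, CA) according to the manufacturer's instructions.
An amplification reaction was performed on each of the P, C, X1 and X2 tissue samples. Amplification was carried out by a T7-based amplification procedure using Ambion's MessageAmp kit according to the manufacturer's instructions, except that labeled aaUTP was added to the reagent mixture during the amplification step. The aaUTP was incorporated into the antisense RNA formed in this step. CyDye fluorescent labels were coupled to the aaUTP in a non-enzymatic reaction as described in Example 1. The labeled amplified antisense RNA was precipitated and washed, and its purity was then determined with a NanoDrop spectrophotometer. These labeled antisense RNAs, corresponding to the RNA isolated from the P, C, X1 and X2 tissue samples, constitute the sample nucleic acids and are referred to as the P, C, X1 and X2 samples.
Normalization control samples of known nucleic acids were added to each sample in a dilution series of 500, 200, 100, 50, 25 and 10 pg/μl to allow signal quantification. Positive controls corresponding to specific genes shown to be expressed in all pine tissues, for example housekeeping genes, were also added to the plant samples.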
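One way such a dilution series can be used for signal quantification is to fit a calibration line on log-log axes and read unknown signals back off it. The sketch below assumes a simple power-law relationship between spike-in concentration and signal; the source does not state which model was used, and the signal values here are hypothetical.

```python
import math


def fit_loglog(concentrations, signals):
    """Least-squares fit of log10(signal) = slope * log10(concentration) + intercept."""
    xs = [math.log10(c) for c in concentrations]
    ys = [math.log10(s) for s in signals]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_concentration(signal, slope, intercept):
    """Invert the calibration line to estimate pg/ul from a measured signal."""
    return 10 ** ((math.log10(signal) - intercept) / slope)


spike_pg_per_ul = [500, 200, 100, 50, 25, 10]       # dilution series from the text
spike_signal = [20.0 * c for c in spike_pg_per_ul]  # hypothetical, perfectly linear readings
slope, intercept = fit_loglog(spike_pg_per_ul, spike_signal)
print(estimate_concentration(2000.0, slope, intercept))
```

Real spike-in signals saturate at the high end and flatten near background at the low end, so the fit would normally be restricted to the linear part of the series.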
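Each of four microarray slides was incubated under a coverslip with 125 μL of the P, C, X1 or X2 sample at 42°C for 16-18 hours. The arrays were washed in 1×SSC, 0.1% SDS for 10 minutes, then in 0.1×SSC, 0.1% SDS for 10 minutes, and allowed to dry.
The array slides were scanned with an Axon laser scanner and analyzed with GenePix Pro software. Data from the microarray slides were subjected to microarray data analysis using GenStat, SAS or Spotfire software. Outlier data were removed, and the ratiometric data for each data set were normalized using a global normalization that employs a cubic spline fit to correct for differential dye bias and spatial effects. A second transformation was performed to fit the control signal ratios to a mean log2 = 0 (i.e., a 1:1 ratio). The normalized data were then subjected to analysis of variance.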
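The second transformation, which shifts the data so that the control ratios average to log2 = 0, can be sketched as a simple re-centering. This covers only that step; the cubic spline normalization is not reproduced here, and the probe names and ratio values are hypothetical.

```python
def center_on_controls(log2_ratios: dict, control_ids) -> dict:
    """Subtract the mean control log2 ratio so that controls average to 0 (a 1:1 ratio)."""
    controls = [log2_ratios[c] for c in control_ids]
    if not controls:
        raise ValueError("at least one control probe is required")
    offset = sum(controls) / len(controls)
    return {probe: value - offset for probe, value in log2_ratios.items()}


# Hypothetical log2 ratios: two control probes and one gene of interest.
raw = {"ctl1": 0.5, "ctl2": 0.3, "geneX": 1.4}
centered = center_on_controls(raw, ["ctl1", "ctl2"])
print(centered)
```

Because the same offset is subtracted from every probe, differences between probes are unchanged and only the zero point moves.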
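For each set of three of the P, C, X1 and X2 sample microarray slides, the mean signal intensity at any given position on the slides was determined. This mean signal per probe position was compared with the signal at the same position on the sample slide that was not used to calculate the mean. For example, the mean signal at a given position was determined for P, C and X1, and the signal at that position on the X2 microarray slide was compared with the mean of the P, C and X1 signals.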
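The one-versus-rest comparison described above can be written compactly for a single probe position. The sketch below reports the comparison as a log2 ratio, which is an assumption made for illustration; the signal values are hypothetical.

```python
import math


def one_vs_rest_log2(signals: dict) -> dict:
    """For each sample, log2 of its signal over the mean signal of the other samples."""
    result = {}
    for name, value in signals.items():
        rest = [v for other, v in signals.items() if other != name]
        result[name] = math.log2(value / (sum(rest) / len(rest)))
    return result


# Hypothetical signals for one probe position on the four slides.
probe = {"P": 100.0, "C": 400.0, "X1": 400.0, "X2": 400.0}
ratios = one_vs_rest_log2(probe)
print(ratios)
# A greater than twofold difference corresponds to |log2 ratio| > 1.
print({name: abs(r) > 1 for name, r in ratios.items()})
```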
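Table 5 shows genes having a greater than twofold signal in any one sample compared with the mean signal of the other three samples.
Table 5
Gene | PvCX12 | PvX12 | CvX12
WD40 repeat protein A | -1.24 | -0.88 | -1.07
CDC2 | -1.09 | -0.78 | -0.92
Cyclin | -1.08 | -1 | -0.26
WD40 repeat protein B | -1.01 | -0.87 | -0.42
CDC2 | -0.83 | -0.49 | -1.01
P = phloem
C = cambium
X1 = xylem layer 1
X2 = xylem layer 2
PvCX12 = ratio of the phloem target signal to the mean signal of the cambium, xylem 1 and xylem 2 targets
The data show that the WD40 repeat protein encoded by WD40 repeat protein A is expressed at a much lower level in cambium than in developing xylem, whereas the WD40 repeat protein encoded by WD40 repeat protein B is expressed at a much higher level in phloem than in the other tissues.
The signal data were then verified by RT-PCR to confirm expression, in the target tissues, of the genes corresponding to the unique oligonucleotides in the probes.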
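Example 28
This example illustrates how a microarray derived from the Pinus radiata cDNA sequences described in Example 4 is used to screen RNA from tissues of multiple pine species (in this case Pinus radiata and Pinus taeda trees) for gene expression patterns associated with the juvenile wood-forming and mature wood-forming parts of the tree.
Open-pollinated Pinus taeda trees approximately 16 years old were selected from a plantation in the United States, and open-pollinated Pinus radiata trees approximately 16 years old were selected from a plantation in New Zealand. Trees were felled in spring and summer to compare gene expression associated with different developmental stages of wood formation. Trees were felled individually, and trunk sections were removed from the bottom region, approximately 1 to 2 meters from the base, and from within 1 to 2 meters below the live crown. The section removed from the base of the trunk contained mature wood. The section removed from below the live crown contained juvenile wood. Samples collected in spring were designated earlywood or springwood, and samples collected in summer were considered latewood or summerwood. Larson et al., Gen. Tech. Rep. FPL-GTR-129. Madison, WI: U.S. Department of Agriculture, Forest Service, Forest Products Laboratory, p. 42.
Tissues were isolated from the trunk sections such that phloem, cambium, developing xylem and maturing xylem were removed. These tissues were collected only from the current year's growth ring. In each case, the material was immersed in liquid nitrogen immediately after removal to preserve nucleic acids and other components. The bark was peeled from the section, and phloem tissue was removed from the inner face of the bark by scraping with a razor blade. Cambium tissue was isolated from the outer surface of the peeled section by gently scraping the surface. Developing xylem and lignified xylem were isolated by successively more vigorous scraping of the remaining tissue. The tissues were transferred from liquid nitrogen into containers for long-term storage at -70°C until RNA extraction and subsequent analysis.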
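Example 29
This example illustrates an alternative to the procedure of the preceding examples for RNA extraction and purification (particularly useful for RNA obtained from a variety of woody plant tissues), and a procedure for hybridization and data analysis using the arrays described in Example 4.
RNA was isolated according to the protocol of Chang et al., Plant Mol. Biol. Rep. 11:113. DNA was removed with DNase I (Invitrogen, Carlsbad, CA) according to the manufacturer's recommendations. The integrity of the RNA samples was determined with an Agilent 2100 Bioanalyzer (Agilent Technologies, USA).
10 μg of total RNA from each tissue was reverse transcribed into cDNA by known methods.
In the case of Pinus radiata phloem tissue, it was difficult to extract enough total RNA for the normal labeling procedure. Total RNA was extracted and treated as described previously, and 100 ng of total RNA was amplified with the Ovation™ Nanosample RNA Amplification system from NuGEN™ (NuGEN, CA, USA). Alternatively, a similar amplification kit, such as those produced by Ambion, can be used. The amplified RNA was reverse transcribed into cDNA and labeled as described above.
Hybridization and stringency washes were performed at 42°C as described in the U.S. patent application "Methods and Kits for Labeling and Hybridizing cDNA for Microarray Analysis" (supra). The arrays (slides) were scanned with a ScanArray 4000 Microarray Analysis System (GSI Lumonics, Ottawa, ON, Canada). Raw, non-normalized intensity values were generated with QUANTARRAY software (GSI Lumonics, Ottawa, ON, Canada).
A fully balanced, incomplete block experimental design (Kerr and Churchill, Gen. Res. 123:123, 2001) was used to design an array experiment that would allow maximum statistical inference from the analyzed data.
Gene expression data were analyzed with the SAS Microarray Solution software package (The SAS Institute, Cary, NC, USA). The resulting data were then visualized with JMP (The SAS Institute, Cary, NC, USA).
The analysis performed for this experiment was an ANOVA approach with a mixed-model specification (Wolfinger et al., J. Comp. Biol. 8:625-637). Two linear mixed models were applied in sequence. The first (the normalization model) was used for global normalization at the slide level. The second (the gene model) was used for rigorous statistical inference on each gene. The two models are given as models (1) and (2).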
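$$\log_2(Y_{ijkls}) = \theta_{ij} + D_k + S_l + DS_{kl} + \omega_{ijkls} \tag{1}$$
$$R_{ijkls}^{(g)} = \mu_{ij}^{(g)} + D_k^{(g)} + S_l^{(g)} + DS_{kl}^{(g)} + SS_{ls}^{(g)} + \epsilon_{ijkls}^{(g)} \tag{2}$$
$Y_{ijkls}$ denotes the intensity of the $s$th spot on the $l$th slide with the $k$th dye, for the $j$th treatment applied to the $i$th cell line. $\theta_{ij}$, $D_k$, $S_l$ and $DS_{kl}$ denote the mean effect of the $j$th treatment in the $i$th cell line, the $k$th dye effect, the random effect of the $l$th slide, and the random interaction effect of the $k$th dye on the $l$th slide. $\omega_{ijkls}$ is the random error term. $R_{ijkls}^{(g)}$ denotes the residual of the $g$th gene from model (1). $\mu_{ij}^{(g)}$, $D_k^{(g)}$, $S_l^{(g)}$ and $DS_{kl}^{(g)}$ denote the same effects as $\theta_{ij}$, $D_k$, $S_l$ and $DS_{kl}$, except that they are specific to the $g$th gene. $SS_{ls}^{(g)}$ denotes the random spot-by-slide effect for the $g$th gene. $\epsilon_{ijkls}^{(g)}$ denotes the random error term. All random terms are assumed to be normally distributed and mutually independent within each model.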
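From the above analysis, certain cDNAs were found to be differentially expressed; some of them are shown in Table 6 below.
Table 6
Corresponding SEQ ID of gene | Oligonucleotide ID | Gene family | Expression
162 | Pra_000171_O_4 | Peptidyl-prolyl isomerase | Steady-state RNA higher in xylem than in cambium
164 | Pra_001480_O_3 | Peptidyl-prolyl isomerase | Steady-state RNA lower in xylem than in cambium
Control | Pra_000218_O_2 | Ribonucleoside-diphosphate reductase large chain (EC 1.17.4.1) | Steady-state RNA lower in xylem than in cambium
Control | Pra_000193_O_2 | Putative surface protein | Steady-state RNA lower in xylem than in cambium
The involvement of these specific genes in wood development is inferred from the correlation of their up-regulation or down-regulation with particular stages of wood development. When relating gene expression to wood development, both the spatial continuity of wood development across a section (phloem, cambium, developing xylem, maturing xylem) at a particular season and trunk position, and the relationship between season and trunk position, should be considered.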
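Example 30
This example demonstrates how polysaccharide gene expression can be correlated with agronomically important wood phenotypes such as density, stiffness, strength, distance between branches and spiral grain.
Mature clonally propagated pine trees having superior growth characteristics and resistance to important fungal diseases were selected from among the progeny of known parent trees. The bark was removed from a tangential section, and the mean wood density, wood stiffness and strength, and spiral grain of the fifth annual ring at breast height were examined. The trees were also characterized for height, mean distance between major branches, crown size and forking.
To obtain seedling families segregating for major genes affecting density, stiffness, strength, distance between branches, spiral grain and other characteristics that may be linked to any genes affecting these characteristics, trees lacking common parents were selected for crossing on the basis of exhibiting, relative to one another, the widest range for the density, stiffness, strength, branch distance and spiral grain criteria. Thus, pollen from a tree exhibiting high density, low mean distance between major branches and high spiral grain was used to pollinate cones of an unrelated elite tree from among the selections exhibiting the lowest density, highest mean distance between major branches and lowest spiral grain. It is important to note that "elite" trees were also crossed with one another, for example such that pollen from an elite tree exhibiting high density was used to pollinate developing cones of another elite tree exhibiting high density, and pollen from a tree exhibiting low mean distance between major branches was used to pollinate developing cones of another elite tree exhibiting low mean distance between major branches.
Seeds were collected from these controlled pollinations and grown such that the maternal identity was maintained for each seed, and were used for vegetative propagation so that each genotype was represented by multiple ramets. Vegetative propagation was accomplished by micropropagation, hedging or fascicle cuttings. Some ramets of each genotype were stored, while vegetative propagules of each genotype were grown to sufficient size for establishment of a field planting. The genotypes were arranged in a replicated design and grown under field conditions in which daily temperature and rainfall were measured and recorded.
The trees were measured at various ages to determine the expression and segregation of density, stiffness, strength, distance between branches, spiral grain and any other observable characteristics that may be linked to any genes affecting these characteristics. Samples were collected to characterize cellulose content, lignin content, cellulose microfibril angle, density, strength, stiffness, tracheid morphology, ring width and similar characteristics. RNA was then collected in spring and autumn from replicate samples of trees showing different stiffness and density or other characteristics, from genotypes otherwise as similar as possible in growth habit, so that early and late wood development could be assayed. Gene expression in these samples was examined as described in the preceding examples.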
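Table 7. Consensus ID information
Patent application | SEQ ID | Gene family | Consensus ID | Expression
Control | | Ribonucleoside-diphosphate reductase | pinusRadiata_000218 | Up-regulated in early-spring xylem versus late-summer xylem
Cell cycle | 168 | Peptidyl-prolyl isomerase | pinusRadiata_001692 | Up-regulated in juvenile developing wood versus mature developing xylem
Control | | Nitrite transporter | pinusRadiata_016801 | Up-regulated in mature developing xylem versus seedling cambium
Ramets of each genotype were compared with ramets of the same genotype at different ages to establish the age dependence of these characteristics.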
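Example 31
This example demonstrates how responses to environmental conditions such as light and season alter plant phenotype and can be correlated with polysaccharide synthesis gene expression using microarrays. In particular, changes in gene expression associated with wood density are examined.
Trees of three different clonally propagated Eucalyptus grandis hybrid genotypes were grown in an area with a weather station measuring daily temperature and rainfall. In spring and the following summer, genetically identical ramets of the three genotypes were first photographed with north-south orientation markers, using a camera with sufficient resolution to show the bark characteristics of the juvenile and mature portions of the plant. Tree age was determined from planting records and confirmed by counting annual rings. In each of these trees, mature wood was defined as the outermost rings of the tree below breast height, and juvenile wood as the innermost rings of the tree above breast height. Each tree was therefore divided as follows:
NM - north side, mature
SM - south side, mature
NT - north side, transition
ST - south side, transition
NJ - north side, juvenile
SJ - south side, juvenile
Tissue was collected from the plant trunk as well as from juvenile and mature form leaves. Samples were prepared simultaneously for phenotype analysis (including plant morphology and biochemical characteristics) and for gene expression analysis. The height and diameter of the tree at the site where each partition was taken were recorded, and a soil sample from the base of the tree was taken for chemical assay. Samples prepared for gene expression analysis were weighed and placed in liquid nitrogen for subsequent preparation of RNA samples for the microarray experiments. The tissues are denoted as follows:
P - phloem
C - cambium
X1 - expanding xylem
X2 - differentiating and lignifying xylem
Thin slices from tangential and radial sections of each region of the trunk were fixed as described in Ruzin, PLANT MICROTECHNIQUE AND MICROSCOPY, Oxford University Press, Inc., New York, NY (1999) for anatomical examination and confirmation of the wood developmental stage. Microfibrils were examined at the different developmental stages of the wood, for example the juvenile, transition and mature phases of Eucalyptus grandis wood. Other characteristics examined were the ratio of fibers to vessel elements and the ray tissue in each region. In addition, the samples were examined for characteristics that change between juvenile and mature wood and between springwood and summerwood, such as fiber morphology, lumen size and the width of the S2 (thickest) cell wall layer. The samples were further examined for fifth-ring density and modulus of elasticity, using techniques well known to those skilled in wood assays. See, e.g., Wang et al., Non-destructive Evaluations of Trees, EXPERIMENTAL TECHNIQUES, pp. 25-30 (2000).
For biochemical analysis, 50 grams of each collected sample was freeze-dried and analyzed using biochemical assays well known to those skilled in plant biochemistry to quantify monosaccharides, amino acids, lipids, other extractives, lignin and cellulose. See, e.g., Pettersen & Schwandt, J. Wood Chem. & Technol. 11:495 (1991).
In this example, the phenotypes chosen for comparison were high-density wood, average-density wood and low-density wood. Nucleic acid samples were prepared as described in Example 3 from trees harvested in spring and summer. Gene expression profiling by hybridization and data analysis were performed as in the preceding examples.
Similar techniques and clonally propagated individuals can be used to examine polysaccharide gene expression associated with other complex wood characteristics such as strength, stiffness and spirality.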
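Example 32
Example 32 demonstrates the use of a vascular-preferred promoter operably linked to a gene of this application.
A vascular-preferred promoter is linked to a gene of this application and used to transform a tree species. Elevated transcript levels of the candidate gene in the xylem of the transformants result in an increased xylem biomass phenotype.
In another example, any vascular-preferred promoter, such as those in ArborGen's patent application of November 2003, is linked to an RNAi construct containing sequence from a gene of this application and used to transform a tree of the genus from which the gene was isolated. Reduced transcript levels of the candidate gene in the xylem of the transformants result in an increased xylem biomass phenotype.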
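Example 33
The pARB476 vector was developed by the following steps. A Bluescript vector (Stratagene, La Jolla, CA) was modified by adding the superubiquitin promoter 3′ UTR and nos 3′ terminator sequences at the KpnI and ClaI sites, producing vector pARB005 (SEQ ID NO. 773). The Pinus radiata superubiquitin promoter with intron was added to this vector. The promoter/intron sequence was first amplified from the Pinus radiata superubiquitin sequence identified in U.S. Patent No. 6,380,459, using standard PCR techniques and the primers of SEQ ID NOs. 774 and 775. The amplified fragment was then ligated into pARB005 using XbaI and PstI restriction digestion to produce vector pARB119 (SEQ ID NO. 776).
The Populus tremuloides UDP-glucose binding domain gene (patent WO0071670; ptCelA, GenBank No. AF072131) was amplified using standard PCR techniques and primers that included an ATG and a ClaI site as part of the 5′ primer and a TGA and a ClaI site as part of the 3′ primer. The amplified fragment was then cloned into the ClaI site of pARB119 to produce vector pARB476 (SEQ ID NO. 777).
The NotI cassette containing the Pinus radiata superubiquitin promoter with intron::UDP-glucose binding domain::3′ UTR::nos terminator was removed from pARB476 and cloned into the NotI site of pART29 to produce vector pARB483. The binary vector pART29 is a modified pART27 vector (Gleave, Plant Mol. Biol. 20:1203-1207, 1992) that contains the Arabidopsis ubiquitin 3 (UBQ3) promoter in place of the nos 5′ promoter and contains no lacZ sequence.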
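The cloning steps above rely on restriction sites carried by the primers and vectors. As a quick consistency check, the sketch below scans a sequence for the recognition sites of the enzymes named in this example and applies it to the two primers listed as SEQ ID 774 and SEQ ID 775. The helper name is hypothetical; the recognition sequences are the standard ones for these enzymes, and because all of them are palindromic, scanning one strand is sufficient.

```python
# Standard recognition sequences of the enzymes named in this example.
SITES = {
    "KpnI": "GGTACC",
    "ClaI": "ATCGAT",
    "XbaI": "TCTAGA",
    "PstI": "CTGCAG",
    "NotI": "GCGGCCGC",
}


def find_sites(sequence: str, sites: dict = SITES) -> dict:
    """Return the 0-based start positions of every occurrence of each site."""
    seq = sequence.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)  # step by 1 so overlaps are found
        hits[enzyme] = positions
    return hits


forward_primer = "AAATCTAGAGGTACCATTTAAATGCGGCCGCAAAACCCCTCACAAATACATAA"  # SEQ ID 774
reverse_primer = "TTTCTGCAGCTTGAAATTGAAATATGACTAACGAAT"  # SEQ ID 775
print(find_sites(forward_primer))
print(find_sites(reverse_primer))
```

The forward primer carries XbaI, KpnI and NotI sites and the reverse primer carries a PstI site, consistent with the XbaI/PstI ligation into pARB005 described above.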
SEQ ID 773
CGATGGGTGTTATTTGTGGATAATAAATTCGGGTGATGTTCAGTGTTTGTCGTATTTCTCACCAATAAA
TTGTGTTTATGTATGTGTTAGTGTTGTTTGTCTGTTTCAGACCCTCTTATGTTATATTTTTCTTTTCGT
CGGTCAGTTGAAGCCAATACTGGTGTCCTGGCCGGCACTGCAATACCATTTCGTTTAATATAAAGACTC
TGTTATCCGTGAGCTCGAATTTCCCCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATC
CTGTTGCCGGTCTTGCGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACA
TGTAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAATACG
CGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAG
ATCGCGGCCGCATTTAAATGGTACCCAATTCGCCCTATAGTGAGTCGTATTACGCGCGCTCACTGGCCG
TCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCC
CTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGA
ATGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGA
CCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCG
CCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACC
TCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTC
GCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACC
CTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGC
TGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTAGGTGGCACTTTTCG
GGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAG
ACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGT
CGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGT
AAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGAT
CCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGC
GGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTT
GGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGC
TGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCT
AACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGA
AGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATT
AACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGC
AGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCG
TGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACAC
GACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAA
GCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATT
TAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTT
CCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAAT
CTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAAC
TCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTA
GTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGT
GGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGC
GCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACT
GAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCC
GGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTA
TAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAG
CCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACAT
GTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGC
TCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAA
ACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGC
GGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTAT
GCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCA
TGATTACGCCAAGCGCGCAATTAACCCTCACTAAAGGGAACAAAAGCTGGGGCCGCTCTAGAACTAGTG
GATCCCCCGGGCTGCAGGAATTCGTCCAGCAGTTGTCTGGAGCTCCACCAGAAATCTGGAAGCTTAT
SEQ ID 774
AAATCTAGAGGTACCATTTAAATGCGGCCGCAAAACCCCTCACAAATACATAA
SEQ ID 775
TTTCTGCAGCTTGAAATTGAAATATGACTAACGAAT
SEQ ID 776
tctagaggtaccatttaaatgcggccgcaaaacccctcacaaatacataaaaaaaattctttatttaat
tatcaaactctccactacctttcccaccaaccgttacaatcctgaatgttggaaaaaactaactacatt
gatataaaaaaactacattacttcctaaatcatatcaaaattgtataaatatatccactcaaaggagtc
tagaagatccacttggacaaattgcccatagttggaaagatgttcaccaagtcaacaagatttatcaat
ggaaaaatccatctaccaaacttactttcaagaaaatccaaggattatagagtaaaaaatctatgtatt
attaagtcaaaaagaaaaccaaagtgaacaaatattgatgtacaagtttgagaggataagacattggaa
tcgtctaaccaggaggcggaggaattccctagacagttaaaagtggccggaatcccggtaaaaaagatt
aaaatttttttgtagagggagtgcttgaatcatgttttttatgatggaaatagattcagcaccatcaaa
aacattcaggacacctaaaattttgaagtttaacaaaaataacttggatctacaaaaatccgtatcgga
ttttctctaaatataactagaattttcataactttcaaagcaactcctcccctaaccgtaaaacttttc
ctacttcaccgttaattacattccttaagagtagataaagaaataaagtaaataaaagtattcacaaac
caacaatttatttcttttatttacttaaaaaaacaaaaagtttatttattttacttaaatggcataatg
acatatcggagatccctcgaacgagaatcttttatctccctggttttgtattaaaaagtaatttattgt
ggggtccacgcggagttggaatcctacagacgcgctttacatacgtctcgagaagcgtgacggatgtgc
gaccggatgaccctgtataacccaccgacacagccagcgcacagtatacacgtgtcatttctctattgg
aaaatgtcgttgttatccccgctggtacgcaaccaccgatggtgacaggtcgtctgttgtcgtgtcgcg
tagcgggagaagggtctcatccaacgctattaaatactcgccttcaccgcgttacttctcatcttttct
cttgcgttgtataatcagtgcgatattctcagagagcttttcattcaaaggtatggagttttgaagggc
tttactcttaacatttgtttttctttgtaaattgttaatggtggtttctgtgggggaagaatcttttgc
caggtccttttgggtttcgcatgtttatttgggttatttttctcgactatggctgacattactagggct
ttcgtgctttcatctgtgttttcttcccttaataggtctgtctctctggaatatttaattttcgtatgt
aagttatgagtagtcgctgtttgtaataggctcttgtctgtaaaggtttcagcaggtgtttgcgtttta
ttgcgtcatgtgtttcagaaggcctttgcagattattgcgttgtactttaatattttgtctccaacctt
gttatagtttccctcctttgatctcacaggaaccctttcttctttgagcattttcttgtggcgttctgt
agtaatattttaattttgggcccgggttctgagggtaggtgattattcacagtgatgtgctttccctat
aaggtcctctatgtgtaagctgttagggtttgtgcgttactattgacatgtcacatgtcacatattttc
ttcctcttatccttcgaactgatggttctttttctaattcgtggattgctggtgccatattttatttct
attgcaactgtattttagggtgtctctttctttttgatttcttgttaatatttgtgttcaggttgtaac
tatgggttgctagggtgtctgccctcttcttttgtgcttctttcgcagaatctgtccgttggtctgtat
ttgggtgatgaattatttattccttgaagtatctgtctaattagcttgtgatgatgtgcaggtatattc
gttagtcatatttcaatttcaagcgatcccccgggctgcaggaattcgtccagcagttgtctggagctc
caccagaaatctggaagcttatcgatgggtgttatttgtggataataaattcgggtgatgttcagtgtt
tgtcgtatttctcacgaataaattgtgtttatgtatgtgttagtgttgtttgtctgtttcagaccctct
tatgttatatttttcttttcgtcggtcagttgaagccaatactggtgtcctggccggcactgcaatacc
atttcgtttaatataaagactctgttatccgtgagctcgaatttccccgatcgttcaaacatttggcaa
taaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattac
gttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtc
ccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgc
gcggtgtcatctatgttactagatcgcggccgcatttaaatggtacccaattcgccctatagtgagtcg
tattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaactt
aatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccct
tcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcaacgggt
gtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttc
ccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttc
cgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggcca
tcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttc
caaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcg
gcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgctt
acaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacatt
caaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagta
tgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctc
acccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaac
tggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcactt
ttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgca
tacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatga
cagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaa
cgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatc
gttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatgg
caacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagact
ggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctg
ataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccct
cccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctg
agataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattg
atttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaa
tcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgag
atcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtt
tgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaata
ctgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcg
ctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaa
gacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttgg
agcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaag
ggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccag
ggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgt
gatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggcct
tttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccg
cctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaag
cggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacg
acaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattagg
caccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttc
acacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagc
tggggccgctctag
SEQ ID 777
TCTAGAGGTACCATTTAAATGCGGCCGCAAAACCCCTCACAAATACATAAAAAAAATTCTTTATTTAAT
TATCAAACTCTCCACTACCTTTCCCACCAACCGTTACAATCCTGAATGTTGGAAAAAACTAACTACATT
GATATAAAAAAACTACATTACTTCCTAAATCATATCAAAATTGTATAAATATATCCACTCAAAGGAGTC
TAGAAGATCCACTTGGACAAATTGCCCATAGTTGGAAAGATGTTCACCAAGTCAACAAGATTTATCAAT
GGAAAAATCCATCTACCAAACTTACTTTCAAGAAAATCCAAGGATTATAGAGTAAAAAATCTATGTATT
ATTAAGTCAAAAAGAAAACCAAAGTGAACAAATATTGATGTACAAGTTTGAGAGGATAAGACATTGGAA
TCGTCTAACCAGGAGGCGGAGGAATTCCCTAGACAGTTAAAAGTGGCCGGAATCCCGGTAAAAAAGATT
AAAATTTTTTTGTAGAGGGAGTGCTTGAATCATGTTTTTTATGATGGAAATAGATTCAGCACCATCAAA
AACATTCAGGACACCTAAAATTTTGAAGTTTAACAAAAATAACTTGGATCTACAAAAATCCGTATCGGA
TTTTCTCTAAATATAACTAGAATTTTCATAACTTTCAAAGCAACTGCTCCCCTAACCGTAAAACTTTTC
CTACTTCACCGTTAATTACATTCCTTAAGAGTAGATAAAGAAATAAAGTAAATAAAAGTATTCACAAAC
CAACAATTTATTTCTTTTATTTACTTAAAAAAACAAAAAGTTTATTTATTTTACTTAAATGGCATAATG
ACATATCGGAGATCCCTCGAACGAGAATCTTTTATCTCCCTGGTTTTGTATTAAAAAGTAATTTATTGT
GGGGTCCACGCGGAGTTGGAATCCTACAGACGCGCTTTACATACGTCTCGAGAAGCGTGACGGATGTGC
GACCGGATGACCCTGTATAACCCACCGACACAGCCAGCGCACAGTATACACGTGTCATTTCTCTATTGG
AAAATGTCGTTGTTATCCCCGCTGGTACGCAACCACCGATGGTGACAGGTCGTCTGTTGTCGTGTCGCG
TAGCGGGAGAAGGGTCTCATCCAACGCTATTAAATACTCGCCTTCACCGCGTTACTTCTCATCTTTTCT
CTTGCGTTGTATAATCAGTGCGATATTCTCAGAGAGCTTTTCATTCAAAGGTATGGAGTTTTGAAGGGC
TTTACTCTTAACATTTGTTTTTCTTTGTAAATTGTTAATGGTGGTTTCTGTGGGGGAAGAATCTTTTGC
CAGGTCCTTTTGGGTTTCGCATGTTTATTTGGGTTATTTTTCTCGACTATGGCTGACATTACTAGGGCT
TTCGTGCTTTCATCTGTGTTTTCTTCCCTTAATAGGTCTGTCTCTCTGGAATATTTAATTTTCGTATGT
AAGTTATGAGTAGTCGCTGTTTGTAATAGGCTCTTGTCTGTAAAGGTTTCAGCAGGTGTTTGCGTTTTA
TTGCGTCATGTGTTTCAGAAGGCCTTTGCAGATTATTGCGTTGTACTTTAATATTTTGTCTCCAACCTT
GTTATAGTTTCCCTCCTTTGATCTCACAGGAACCCTTTCTTCTTTGAGCATTTTCTTGTGGCGTTCTGT
AGTAATATTTTAATTTTGGGCCCGGGTTCTGAGGGTAGGTGATTATTCACAGTGATGTGCTTTCCCTAT
AAGGTCCTCTATGTGTAAGCTGTTAGGGTTTGTGCGTTACTATTGACATGTCACATGTCACATATTTTC
TTCCTCTTATCCTTCGAACTGATGGTTCTTTTTCTAATTCGTGGATTGCTGGTGCCATATTTTATTTCT
ATTGCAACTGTATTTTAGGGTGTCTCTTTCTTTTTGATTTCTTGTTAATATTTGTGTTCAGGTTGTAAC
TATGGGTTGCTAGGGTGTCTGCCCTCTTCTTTTGTGCTTCTTTCGCAGAATCTGTCCGTTGGTCTGTAT
TTGGGTGATGAATTATTTATTCCTTGAAGTATCTGTCTAATTAGCTTGTGATGATGTGCAGGTATATTC
GTTAGTCATATTTCAATTTCAAGCGATCCCCCGGGCTGCAGGAATTCGTCCAGCAGTTGTCTGGAGCTC
CACCAGAAATCTGGAAGCTTATCGATATGGATCAGTTCCCCAAGTGGAATCCTGTCAATAGAGAAACGT
ATATCGAAAGGCTGTCGGCAAGGTATGAAAGAGAGGGTGAGCCTTCTCAGCTTGCTGGTGTGGATTTTT
TCGTGAGTACTGTTGATCCGCTGAAGGAACCGCCATTGATCACTGCCAATACAGTCCTTTCCATCCTTG
CTGTGGACTATCCCGTCGATAAAGTCTCCTGCTACGTGTCTGATGATGGTGCAGCTATGCTTTCATTTG
AATCTCTTGTAGAAACAGCTGAGTTTGCAAGGAAGTGGGTTCCGTTCTGCAAAAAATTCTCAATTGAAC
CAAGAGCACCGGAGTTTTACTTCTCACAGAAAATTGATTACTTGAAAGACAAGGTTCAACCTTCTTTCG
TGAAAGAACGTAGAGCAATGAAAAGGGATTATGAAGAGTACAAAGTCCGAGTTAATGCCCTGGTAGCAA
AGGCTCAGAAAACACCTGAAGAAGGATGGACTATGCAAGATGGAACACCTTGGCCTGGGAATAACACAC
GTGATCACCCTGGCATGATTCAGGTCTTCCTTGGAAATACTGGAGCTCGTGACATTGAAGGAAATGAAC
TACCTCGTCTAGTATATGTCTCCAGGGAGAAGAGACCTGGCTACCAGCACCACAAAAAGGCTGGTGCAG
AAAATGCTCTGGTGAGAGTGTCTGCAGTACTCACAAATGCTCCCTACATCCTCAATGTTGATTGTGATC
ACTATGTAAACAATAGCAAGGCTGTTCGAGAGGCAATGTGCATCCTGATGGACCCACAAGTAGGTCGAG
ATGTATGCTATGTGCAGTTCCCTCAGAGGTTTGATGGCATAGATAAGAGTGATCGCTACGCCAATCGTA
ACGTAGTTTTCTTTGATGTTAACATGAAAGGGTTGGATGGCATTCAAGGACCAGTATACGTAGGAACTG
GTTGTGTTTTCAACAGGCAAGCACTTTACGGCTACGGGCCTCCTTCTATGCCCAGCTTACGCAAGAGAA
AGGATTCTTCATCCTGCTTCTCATGTTGCTGCCCCTCAAAGAAGAAGCCTGCTCAAGATCCAGCTGAGG
TATACAGAGATGCAAAAAGAGAGGATCTCAATGCTGCCATATTTAATCTTACAGAGATTGATAATTATG
ACGAGCATGAAAGGTCAATGCTGATCTCCCAGTTGAGCTTTGAGAAAACTTTTGGCTTATCTTCTGTCT
TCATTGAGTCTACACTAATGGAGAATGGAGGAGTACCCGAGTCTGCCAACTCACCAACACTCATCAAGG
AAGCAATTCATGTCATCGGCTGTGGCTATGAAGAGAAGACTGAATGGGGAAAAGAGATTGGTTGGATAT
ATGGGTCAGTCACTGAGGATATCTTAAGTGGCTTCAAGATGCACTGCCGAGGATGGAGATCAATTTACT
GCATGCCCGTAAGGCCTGCATTCAAAGGATCTGCACCCATCAACCTGTCTGATAGATTGCACCAGGTCC
TCCGATGGGCTCTTGGTTCTGTGGAAATTTTCTTTAGCAGACACTGTCCCCTCTGGTACGGGTTTGGAG
GAGGCCGTCTTAAATGGCTCCAAAGGCTTGCGTATATAAACACCATTGTGTACCCATGAATCGATGGGT
GTTATTTGTGGATAATAAATTCGGGTGATGTTCAGTGTTTGTCGTATTTCTCACGAATAAATTGTGTTT
ATGTATGTGTTAGTGTTGTTTGTCTGTTTCAGACCCTCTTATGTTATATTTTTCTTTTCGTCGGTCAGT
TGAAGCCAATACTGGTGTCCTGGCCGGCACTGCAATACCATTTCGTTTAATATAAAGACTCTGTTATCC
GTGAGCTCGAATTTCCCCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCC
GGTCTTGCGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGC
ATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAATACGCGATAGAA
AACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATCGCGGC
CGCATTTAAATGGTACCCAATTCGCCCTATAGTGAGTCGTATTACGCGCGCTCACTGGCCGTCGTTTTA
CAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCC
AGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAA
TGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACA
CTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTT
CCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCC
AAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTG
ACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCG
GTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAA
CAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTAGGTGGCACTTTTCGGGGAAATG
TGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAAC
CCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTA
TTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATG
CTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGA
GTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTAT
CCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGT
ACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAA
CCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTT
TTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTCGGAACCGGAGCTGAATGAAGCCATAC
CAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCG
AACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCAC
TTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTC
GCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGA
GTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGT
AACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGA
TCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAG
CGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCT
TGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTC
CGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCC
ACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTG
CCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGT
CGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACC
TACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCG
GCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTG
TCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGA
AAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTC
CTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCA
GCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTC
TCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTG
AGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGG
CTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACG
CCAAGCGCGCAATTAACCCTCACTAAAGGGAACAAAAGCTGGGGCCGCTCTAG
Table 8. pGrowth information.
CWAR | Plasmid | Promoter | Gene | Genesis ID
88 | pGrowth14 | SUBIN | Cyclin A | prga001823
88 | pGrowth15 | SUBIN | Cyclin A | prpe001264
88 | pGrowth16 | SUBIN | Cyclin D | prxa004540
88 | pGrowth18 | SUBIN | Cyclin D | prx1006271
88 | pGrowth19 | SUBIN | Cyclin D | prpb019661
88 | pGrowth20 | SUBIN | WEE1-like protein | prrd041233
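To make the growth100 plasmids, a recipient vector (pWVK202) was first constructed by inserting the NotI-SUBIN::UDPGBD::nos terminator-NotI cassette from pARB483a into the NotI site of pWVK147. The UDPGBD gene was then removed using the restriction sites PstI and ClaI. A polylinker containing the restriction sites PstI, NheI, AvrII, ScaI and ClaI was inserted in place of the UDPGBD gene. The AvrII and NheI sites are both compatible with SpeI, a site commonly found in the plasmids supplied by Genesis. ScaI cuts blunt, so that any fragment can be blunted and then inserted into the recipient vector at that position. Plasmids were received from Genesis and analyzed to determine which restriction sites would be most suitable for subcloning into the recipient vector pWVK202. After ligation, the resulting products were checked by extensive restriction digest analysis to ensure that the desired plasmids had been created.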
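The compatibility of AvrII and NheI with SpeI follows from the overhangs these enzymes leave. The sketch below derives the overhang from a palindromic recognition site and the cut position on the top strand, then compares enzymes. The recognition sites and cut positions are the standard ones for these enzymes; the function names are hypothetical.

```python
# (recognition site, cut offset on the top strand); every site here is palindromic.
ENZYMES = {
    "SpeI": ("ACTAGT", 1),   # A^CTAGT
    "NheI": ("GCTAGC", 1),   # G^CTAGC
    "AvrII": ("CCTAGG", 1),  # C^CTAGG
    "PstI": ("CTGCAG", 5),   # CTGCA^G
    "ClaI": ("ATCGAT", 2),   # AT^CGAT
    "ScaI": ("AGTACT", 3),   # AGT^ACT
}


def overhang(enzyme: str):
    """Return (end type, single-stranded overhang sequence) left by the enzyme."""
    site, cut = ENZYMES[enzyme]
    mirror = len(site) - cut  # bottom-strand cut, expressed in top-strand coordinates
    if cut < mirror:
        return ("5'", site[cut:mirror])
    if cut > mirror:
        return ("3'", site[mirror:cut])
    return ("blunt", "")


def compatible_cohesive_ends(a: str, b: str) -> bool:
    """True when both enzymes leave the same non-blunt overhang."""
    end_a, end_b = overhang(a), overhang(b)
    return end_a[0] != "blunt" and end_a == end_b


for name in ("AvrII", "NheI", "PstI", "ScaI"):
    print(name, overhang(name), compatible_cohesive_ends(name, "SpeI"))
```

AvrII, NheI and SpeI all leave a 5′ CTAG overhang, whereas ScaI leaves a blunt end, which is why a blunted fragment of any origin can be inserted at the ScaI position.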
Table 9. Eucalyptus grandis cell cycle genes and proteins.
 DNA SEQ ID NO     Protein SEQ ID NO     Sequence identifier     Patent ORF start     Patent ORF end
    1     236  eucalyptusSpp_003910     387     1820
    2     237  eucalyptusSpp_19213     99     1007
    3     238  eucalyptusSpp_036800     120     1004
    4     239  eucalyptusSpp_040260     23     937
    5     240  eucalyptusSpp_041965     149     1033
    6     241  eucalyptusSpp_002906     199     1116
    7     242  eucalyptusSpp_001518     41     982
    8     243  eucalyptusSpp_008078     291     2042
    9     244  eucalyptusSpp_009826     107     2236
    10     245  eucalyptusSpp_010364     82     1749
    11     246  eucalyptusSpp_011523     151     1560
    12     247  eucalyptusSpp_024358     82     1644
    13     248  eucalyptusSpp_039125     626     2782
    14     249  eucalyptusSpp_005362     13     1467
    15     250  eucalyptusSpp_044857     113     1558
    16     251  eucalyptusSpp_001743     187     1686
    17     252  eucalyptusSpp_012405     238     1653
    18     253  eucalyptusSpp_003739     235     1539
    19     254  eucalyptusSpp_022338     158     1618
    20     255  eucalyptusSpp_028605     205     1530
    21     256  eucalyptusSpp_041006     174     1499
    22     257  eucalyptusSpp_006643     94     1332
    23     258  eucalyptusSpp_045338     176     1342
    24     259  eucalyptusSpp_046486     150     1283
    25     260  eucalyptusSpp_012070     101     367
    26     261  eucalyptusSpp_006617     9     1352
    27     262  eucalyptusSpp_007827     89     1486
    28     263  eucalyptusSpp_008036     80     1477
    29     264  010212EGLA007017HT     160     1062
    30     265  eucalyptusSpp_001596     172     1077
    31     266  eucalyptusSpp_005870     66     989
    32     267  eucalyptusSpp_006901     111     1541
    33     268  eucalyptusSpp_006902     116     1615
    34     269  eucalyptusSpp_007440     155     1453
    35     270  eucalyptusSpp_008994     228     2033
    36     271  eucalyptusSpp_024580     110     1258
    37     272  eucalyptusSpp_037831     50     1462
    38     273  eucalyptusSpp_034958     176     739
    39     274  001209EGXC004488HT     150     1529
    40     275  010310EGXD012820HT     247     1971
    41     276  010310EGXD013036HT     136     1644
    42     277  010316EGXF999037HT     48     836
    43     278  010324EGXF002118HT     49     822
    44     279  011019EGKA001923HT     185     751
    45     280  eucalyptusSpp_000966     103     621
    46     281  eucalyptusSpp_001037     41     559
    47     282  eucalyptusSpp_004603     127     693
    48     283  eucalyptusSpp_005465     28     639
    49     284  eucalyptusSpp_006571     135     812
    50     285  eucalyptusSpp_006786     119     613
    51     286  eucalyptusSpp_007057     38     562
    52     287  eucalyptusSpp_008670     109     1872
    53     288  eucalyptusSpp_009137     74     1159
    54     289  eucalyptusSpp_010285     54     2045
    55     290  eucalyptusSpp_010600     53     1879
    56     291  eucalyptusSpp_011551     7     690
    57     292  eucalyptusSpp_020743     83     601
    58     293  eucalyptusSpp_023739     125     535
    59     294  eucalyptusSpp_024103     55     573
    60     295  eucalyptusSpp_031985     147     842
    61     296  eucalyptusSpp_032025     167     487
    62     297  eucalyptusSpp_032173     195     890
    63     298  eucalyptusSpp_033340     68     586
    64     299  eucalyptusSpp_009143     182     3265
    65     300  eucalyptusSpp_000349     165     1145
    66     301  eucalyptusSpp_000575     529     1569
    67     302  eucalyptusSpp_00804     156     1136
    68     303  eucalyptusSpp_00805     90     1073
    69     304  eucalyptusSpp_000806     66     1049
    70     305  eucalyptusSpp_002248     277     1512
    71     306  eucalyptusSpp_003203     33     1076
    72     307  eucalyptusSpp_003209     65     973
    73     308  eucalyptusSpp_004429     82     1047
    74     309  eucalyptusSpp_004607     43     1101
    75     310  eucalyptusSpp_004682     142     1095
    76     311  eucalyptusSpp_005786     61     1257
    77     312  eucalyptusSpp_005887     193     1527
    78     313  eucalyptusSpp_005981     109     1155
    79     314  eucalyptusSpp_006766     71     1213
    80     315  eucalyptusSpp_006769     109     1785
    81     316  eucalyptusSpp_006907     364     2685
    82     317  eucalyptusSpp_007518     96     1412
    83     318  eucalyptusSpp_007717     116     1702
    84     319  eucalyptusSpp_007718     46     1101
    85     320  eucalyptusSpp_007741     23     1258
    86     321  eucalyptusSpp_007884     404     2644
    87     322  eucalyptusSpp_008258     107     2383
    88     323  eucalyptusSpp_008465     243     1625
    89     324  eucalyptusSpp_008616     126     1127
    90     325  eucalyptusSpp_008690     257     1390
    91     326  eucalyptusSpp_008708     178     1632
    92     327  eucalyptusSpp_008850     290     2917
    93     328  eucalyptusSpp_009072     148     1197
    94     329  eucalyptusSpp_009465     140     1567
    95     330  eucalyptusSpp_009472     376     1737
    96     331  eucalyptusSpp_009550     69     1010
    97     332  eucalyptusSpp_010284     149     1423
    98     333  eucalyptusSpp_010595     365     2677
    99     334  eucalyptusSpp_010657     24     923
    100     335  eucalyptusSpp_012636     221     3598
    101     336  eucalyptusSpp_012748     44     1447
    102     337  eucalyptusSpp_012879     196     1314
    103     338  eucalyptusSpp_015515     193     1668
    104     339  eucalyptusSpp_015724     78     1634
    105     340  eucalyptusSpp_016167     85     2826
    106     341  eucalyptusSpp_016633     74     1246
    107     342  eucalyptusSpp_017485     100     4377
    108     343  eucalyptusSpp_018007     58     2439
    109     344  eucalyptusSpp_020775     159     1064
    110     345  eucalyptusSpp_023132     118     1665
    111     346  eucalyptusSpp_023569     57     1628
    112     347  eucalyptusSpp_023611     250     1566
    113     348  eucalyptusSpp_024934     106     1434
    114     349  eucalyptusSpp_025546     190     1917
    115     350  eucalyptusSpp_030134     102     2942
    116     351  eucalyptusSpp_031787     75     1079
    117     352  eucalyptusSpp_034435     99     1148
    118     353  eucalyptusSpp_034452     232     1806
    119     354  eucalyptusSpp_035789     72     1124
    120     355  eucalyptusSpp_035804     315     2069
    121     356  eucalyptusSpp_043057     145     1968
    122     357  eucalyptusSpp_046741     130     1488
    123     358  eucalyptusSpp_047161     269     1693
    698     718  eucalyptusSpp_008994
    699     719  eucalyptusSpp_009143
    700     720  eucalyptusSpp_006366
    701     721  eucalyptusSpp_006907
    702     722  eucalyptusSpp_12636
    703     723  eucalyptusSpp_015724
    704     724  eucalyptusSpp_016167
    705     725  eucalyptusSpp_017485
    706     726  eucalyptusSpp_030134
    707     727  eucalyptusSpp_046741
    708     728  eucalyptusSpp_047161
    709     729  eucalyptusSpp_17378
Table 10. Pinus radiata cell cycle genes and proteins.
DNA SEQ ID NO  Protein SEQ ID NO     Sequence identifier  Patent ORF start  Patent ORF end
    124     359   pinusRadiata_001766     1163     2545
    125     360   pinusRadiata_002927     152     1582
    126     361   990309PRCA009171HT     389     1297
    127     362   pinusRadiata_013714     38     946
    128     363   pinusRadiata_016332     180     1088
    129     364   pinusRadiata_021677     40     948
    130     365   pinusRadiata_027562     229     1134
    131     366   pinusRadiata_001504     105     2642
    132     367   pinusRadiata_015211     187     2580
    133     368   pinusRadiata_020421     220     1749
    134     369   pinusRadiata_003187     438     1748
    135     370   pinusRadiata_015661     240     1631
    136     371   pinusRadiata_013874     252     1604
    137     372   pinusRadiata_014615     261     1817
    138     373   pinusRadiata_004578     167     1576
    139     374   pinusRadiata_023387     183     1598
    140     375   pinusRadiata_006970     98     1126
    141     376   pinusRadiata_010322     148     894
    142     377   pinusRadiata_022721     287     1363
    143     378   pinusRadiata_023407     251     1348
    144     379   pinusRadiata_001945     229     510
    145     380   pinusRadiata_008233     92     409
    146     381   pinusRadiata_008234     64     381
    147     382   pinusRadiata_022054     68     349
    148     383   pinusRadiata_012137     125     1849
    149     384   pinusRadiata_012582     70     1602
    150     385   pinusRadiata_015285     140     1465
    151     386   pinusRadiata_017229     628     2565
    152     387   pinusRadiata_020724     55     1818
    153     388  pinusRadiata_004555     259     1710
    154     389  pinusRadiata_004556     356     1807
    155     390  pinusRadiata_005729     261     1298
    156     391  pinusRadiata_007395     365     2251
    157     392  pinusRadiata_009503     156     1454
    158     393  pinusRadiata_011283     203     1348
    159     394  pinusRadiata_012322     229     1644
    160     395  pinusRadiata_018671     156     1454
    161     396  pinusRadiata_023236     27     2222
    162     397  pinusRadiata_000171     7     1759
    163     398  pinusRadiata_000172     358     2040
    164     399  pinusRadiata_001480     238     756
    165     400  pinusRadiata_001481     285     803
    166     401  pinusRadiata_001483     190     708
    167     402  pinusRadiata_001484     156     674
    168     403  pinusRadiata_001692     176     1912
    169     404  pinusRadiata_005313     64     765
    170     405  pinusRadiata_006362     93     881
    171     406  pinusRadiata_006493     372     1070
    172     407  pinusRadiata_006983     28     594
    173     408  pinusRadiata_006984     34     648
    174     409  pinusRadiata_007665     481     1611
    175     410  pinusRadiata_012196     93     584
    176     411  pinusRadiata_013382     250     1869
    177     412  pinusRadiata_016461     84     422
    178     413  pinusRadiata_017611     128     1213
    179     414  pinusRadiata_019776     265     837
    180     415  pinusRadiata_020659     38     781
    181     416  pinusRadiata_022559     38     526
    182     417  pinusRadiata_024188     37     1158
    183     418  pinusRadiata_027973     61     768
    184     419  pinusRadiata_001353     421     2172
    185     420  pinusRadiata_001978     163     1647
    186     421  pinusRadiata_002810     192     1172
    187     422  pinusRadiata_002811     131     1111
    188     423  pinusRadiata_002812     149     1726
    189     424  pinusRadiata_003514     948     2228
    190     425  pinusRadiata_004104     332     1465
    191     426  pinusRadiata_005595     232     1590
    192     427  pinusRadiata_005754     207     1550
    193     428  pinusRadiata_006463     221     1171
    194     429  pinusRadiata_006665     221     3679
    195     430  pinusRadiata_006750     269     1252
    196     431  pinusRadiata_007030     214     1242
    197     432  pinusRadiata_007854     119     2065
    198     433  pinusRadiata_007917     186     1550
    199     434  pinusRadiata_007989     244     3671
    200     435  pinusRadiata_008506     163     1431
    201     436  pinusRadiata_008692     155     1081
    202     437  pinusRadiata_008693     537     1463
    203     438  pinusRadiata_009170     284     1909
    204     439  pinusRadiata_009408     610     1659
    205     440  pinusRadiata_009522     241     1452
    206     441  pinusRadiata_009734     223     1173
    207     442  pinusRadiata_009815     251     1777
    208     443  pinusRadiata_010670     367     1419
    209     444  pinusRadiata_011297     284     1303
    210     445  pinusRadiata_013098     684     1784
    211     446  pinusRadiata_013172     336     2738
    212     447  pinusRadiata_013589     81     1622
    213     448  pinusRadiata_013608     399     1460
    214     449  pinusRadiata_014299     207     1673
    215     450  pinusRadiata_014498     263     1309
    216     451  pinusRadiata_014548     232     2529
    217     452  pinusRadiata_014610     56     2950
    218     453  pinusRadiata_015460     56     1234
    219     454  pinusRadiata_016090     193     2577
    220     455  pinusRadiata_016722     187     1233
    221     456  pinusRadiata_016785     51     1436
    222     457  pinusRadiata_017094     525     2351
    223     458  pinusRadiata_017527     152     1099
    224     459  pinusRadiata_017591     470     4114
    225     460  pinusRadiata_017769     196     2007
    226     461  pinusRadiata_018047     214     1323
    227     462  pinusRadiata_018414     68     2146
    228     463  pinusRadiata_018986     874     3705
    229     464  pinusRadiata_019479     360     1754
    230     465  pinusRadiata_020144     185     1384
    231     466  pinusRadiata_022480     241     1533
    232     467  pinusRadiata_023079     230     1435
    233     468  pinusRadiata_026739     101     2857
    234     469  pinusRadiata_026951     43     1548
    235     470  pinusRadiata_026529     206     1657
    710     730  pinusRadiata_000888
    711     731  pinusRadiata_004578
    712     732  pinusRadiata_007989
    713     733  pinusRadiata_009522
    714     734  pinusRadiata_014610
    715     735  pinusRadiata_017591
    716     736  pinusRadiata_017769
    717     737  pinusRadiata_026951
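The ORF columns in the table above can be checked mechanically. The following is a minimal Python sketch, not part of the original disclosure, that parses one whitespace-separated table row and derives the ORF length. It assumes that the ORF start and end positions are 1-based and inclusive, which the table does not state, and it makes no claim about whether the end position includes a stop codon. The names `OrfRow`, `parse_row`, `orf_length_nt` and `codon_count` are hypothetical helpers introduced only for illustration.

```python
from typing import NamedTuple, Optional


class OrfRow(NamedTuple):
    """One table row: SEQ ID numbers, sequence identifier, optional ORF bounds."""
    dna_seq_id: int
    protein_seq_id: int
    identifier: str
    orf_start: Optional[int]
    orf_end: Optional[int]


def parse_row(line: str) -> OrfRow:
    """Parse a whitespace-separated row with either 3 or 5 fields."""
    fields = line.split()
    if len(fields) == 5:
        return OrfRow(int(fields[0]), int(fields[1]), fields[2],
                      int(fields[3]), int(fields[4]))
    if len(fields) == 3:
        # Some rows list no ORF coordinates at all.
        return OrfRow(int(fields[0]), int(fields[1]), fields[2], None, None)
    raise ValueError(f"unexpected row layout: {line!r}")


def orf_length_nt(row: OrfRow) -> Optional[int]:
    """ORF length in nucleotides, ASSUMING 1-based inclusive coordinates."""
    if row.orf_start is None or row.orf_end is None:
        return None
    return row.orf_end - row.orf_start + 1


def codon_count(row: OrfRow) -> Optional[int]:
    """Number of codons spanned by the ORF; rejects lengths not divisible by 3."""
    length = orf_length_nt(row)
    if length is None:
        return None
    if length % 3 != 0:
        raise ValueError(f"{row.identifier}: ORF length {length} is not a multiple of 3")
    return length // 3


if __name__ == "__main__":
    row = parse_row("    124     359   pinusRadiata_001766     1163     2545")
    print(row.identifier, orf_length_nt(row), codon_count(row))
```

Under the stated assumption, a row whose ORF length is not a multiple of three would point to a transcription error in the start or end column, so the check doubles as a sanity test on the extracted table.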
Table 11. Annotated peptide sequences of the invention.
Entry  Sequence description  Annotated peptide sequence
1 SEQ ID 261的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MGDGSLGSGGRGNSGGGGGGGSRPEWLQQ YDLIGKIGEG TYGLVFLARIKHPSTNRGKYIAIKKFKQSKDGDGVSPTA IREIMLLREISHENVVKLVNVHINPVDMSLYLAFDYADH DLYEIIRHHRDKVNQAINPYTVKSLLWQLLNGLNYLHSN WIIHRDLKPSNILVMGEGEEQGVVKIADFGLARVYQAPL KPLSDNGVVVTINYRAPELLLGAKHYTSAVDMWAVGCIF AELLTLKPLFQGQEVKANPNPFQLDQLDKIFKVLGHPTQ EKWPMLVNLPHWQSDVQHIQRHKYDDNALGNVVRLSSKN ATFDLLSKMLEYDPQKRITAAQALEHEYFRMEPLPGRNALVPSSPGDKVNYPTRPVDTTTDIEGTTSLQPSQSASSGNAVPGNMPGPHVVTNRPMPRPMHMVGMQRVPASGMAGYNLNPSGMGGGMNPSGIPMQRGVANQAQQSRRKDPGMGMGGYPPQQKQRRF
2 Amino acid sequence of SEQ ID 262. The conserved eukaryotic protein kinase domain is underlined, and the protein kinase ATP-binding region and the serine/threonine protein kinase active-site signature are in bold.  MEK YQQLAKIGEGTYGIVYKAKDKKSGELLALKKIRLEA EDEGIPSTAIREISLLKQLQHPNIVRLYDVVHTEKKLTL VFEFLDQDLKKYLDACGDNGLEPYTVKSFLYQLLQGIAF CHEHRVLHRDLKPQNLLINMEGELKLADFGLARAFGIPV RNYTHEVVTLWYRAPDVLMGSRKYSTQVDIWSVGCIFAE MVNGRPLFPGSSEQDQLLRIFKTLGTPSLKTWPGMAELP DFKDNFPKYVVQSFKKICPKKLDKTGLDLLSRMLQYDPA KRISAEQAMGHPYFKDLKLRKPKAAGPGP
3 Amino acid sequence of SEQ ID 263. The conserved eukaryotic protein kinase domain is underlined, and the protein kinase ATP-binding region and the serine/threonine protein kinase active-site signature are in bold.  MDQ YEKIEKIGFGTYGVVYKAIDRSTNKTIALKKIRLEQ EDEGVPSTAIREISLLKEMQHGNIVKLQDVVHSERRLYL VFEYLDLDLKKHMDSCPEFSKDTHTIKMFLYQILRGISY CHSHRVLHRDLKPQNLLLDRRTNSLKLADFGLARAFGIP VRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSVGCIFA EMVNRRPLFPGDSEIDELFKIFRIMGTPNEDSWPGVTSL PDFKSTFPKWASQDLKTVTPTVDPAGIDLLSKMLCMDPR RRITAKVALEHEYFKDVGVIP
4 SEQ ID 264的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MVMKSKLDK YEKLEKLGEGTYGVVYKAQDKTTKEIYALK KIRLESEDEGIPSTAIREIALLKELQHPNVVRIHDVIHT NKKLILVFEFVDYDLKKFLHNFDKGIDPKIVKSLLYQLV RGVAHCHQQKVLBRDLKPQNLLVSQEGILKLGDFGLARA FGIPVKNYTNEVVTLWYRAPDILLGSKNYSTSVDIWSIG CIFVEMLNQKPLFPGSSEQDQLKKIFKIMGTPDATKWPG IAELPDWKPENFEKYPGEPLNKVCPKMDPDGLDLLDKML KCNPSERIAAKNAMSHPYFKDIPDNLKKLYN
5 Amino acid sequence of SEQ ID 265. The conserved eukaryotic protein kinase domain is underlined, and the protein kinase ATP-binding region and the serine/threonine protein kinase active-site signature are in bold.  MDQ YEKVEKIGEGTYGVVYKAIDRLTNETLALKKIRLEQ EDEGVPSTAIREISLLKEMQHGNIVRLQDVVHSENRLYL VFEYLDLDLKKHMDSSPDFAKDPRLVKIFLYQILRGIAY CHSHRVLHRDLKPQNLLIDRRTNALKLADFGLARAFGIP VRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSVGCIFA EMVNQRPLFPGDSEIDELFKIFRILGTPNEDTWPGVTAL PDFKSAFPKWPAKNLQDMVPGLNSAGIDLLSKMLCLDPS KRITARSALEHEYFKDIGFVP
6 SEQID 266的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MEK YEKLEKVGEGTYGKVYKAKDKATGQLVALRKTRLEM DEEGVPPTALREVSLLQLLSQSLYVVRLLSVEHVDGGSK RKAAAAAAAEGGGGEAHGGGAVGGGKPMLYLVFEYLDTD LKKFIDSHRKGPNPRPVPAATVQNFLYQLLKGVAHCHSH GVLHRDLKPQNLLYDKEKGILKIADLGLGRAFTVPLKSY THEVFAFLAILLWRSEGESAADFDSXFRVSPVQVVTLWY RAPEVLLGSAHYSIGVDMMSVGCIFAEMVRRQALFPGDS EFQQLLHIFRLLGTPTEKQWPGVTTLRDWHVYPQWEPQN LARAVPSLGPDGVDLLSKMLKYDPAERISAKAALDHPFFDSLDKSQF
7 SEQ ID 267的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MERPATAAVSAMEA FEKLEKVGEGTYGKVYRAREKATGK IVALKKTRLHEDEEGVPPTTLREISILRMLSRDPEIVRL MDVKQGQNKEGKTVLYLVFEYMETDLKKYIRGFRSSGES IPVNIVKSLMYQLCKGVAFCHGHGVLERDLKPHNLLMDK KTLTLKIADLGLARAFTVPIKKYTHEILTLWYRAPEVLL GATHYSTAVDMWSVGCIFAELVTKQALFPGDSELQQLLH IFRLLGTPNEKMWPGVSSLMNWHEYPQWKPQSLSTAVPN LDKDGLDLLSQMLHYEPSRRISAKAAMEHPYFDDVNKTCL
8 SEQ ID 268的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MGCVLGREVSSGIVTESKGRDSSEVETSKRDDSVAAKVEGEGKAEEVRTEETQKKEKVEDDQQSREQRRRSKPSTKLGNLPKHIRGEQVAAGWPSWLSDICGEALNGWIPRRANT FE KIDKIGDGTYSNVYKAKDLLTGKIVALKKVRFDNLEPES VRFMAREILILRHLDHPNVVKLEGLVTSRMSCSLYLVFE YMEHDLAGLAASPAIKFTEPQVKCYMHQLLSGLEHCHNR RVLHRDIKGSNLLIDNGGVLKIGDFGLASFYDPDHKHRM TSRVVTLWYRPPELLLGANDYGVGIDLWSAGCILAELLA GKPIMPGRTEVEQLHKIYKLCGSPSEEYWKKYKLPNATL FKPREPYRRCIRETFKDFPPSSLPLIETLLAIDPAERGT ATDALQSEFFRTEPYACEPSSLPQYPPSKEMDAKKRDDEARRLRAASKGQADGSKKERTRDRRVRAVPAPEANAELQHNIDRRRLISHANAKSKSEKFPPPHQDGALGFPLGASHRFDPAVVPPDVPFTSTSFTSSKEHDQTWSGPLVDPPGAPRRKKHSAGGQRESSKLSMGTNKGRRADSHLKAYESKSIA
9 SEQ ID 269的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MYSKSSAVDDSRESPKDRVSSSRRLSEVKTSRLDSSRRENGFRARDKVGDVSVMLIDKKVNGSARFCDDQIEKKSDRLQKQRRERAEAAAAADHPGAGRVPKAVEGEQVAAGWPVWLSAVAGEAIKGWLPRRADT FEKLDKIGQGTYSSVYKARDV TNNKIVALKRVRFDNLDTESVKFMAREIHILRMLDHPNV IKLEGLITSRMSCSLYLVFEYMEHDLTGLASRPDVKFSE PQIKCYMKQLLSGLDHCHKHGVLHRDIKGSNLLIDNNGI LKIADFGLASVFDPHQTAPLTSRVVTLWYRPPELLLGAS RYGVEVDLWSTGCILGELYTGKPILPGKTEVEQLHKIFK LCGSPSDDYWRRLHLPHAAVFKPPQPYRRCVAEIFKELP PVALGLLETLISVDPSQRGTAAFALRSEFFTASPLPCDPSSLPKYPPSKEIDMKLREEEARRRGAAGGKNELEKRGTKDSRTNSAYYPNAGQLQVKQCHSNANGRSEIFGPYQEKTVSGFLVAPPKQARVSKETRKDYAEQPDRASFSGPLVPGPGFSKAGKELGHSITVSRNTNLSTLSSLVTSRTGDNKQKSGPLVSESANQASRYSGPIREMEPARKQDRRSHVRTNIDYRSREDGNSSTKEPALYGRGSAGNKIYVSGPLLVSSNNVDQMLKEHDRRIQEHARRARFDKARVGNNHPQAAVDSKLVSVHDAG
10 SEQ ID270的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点为粗体。  MGCIPTIISDGRRRSAAPDKRRPRPRRSSSEGEAPPHATAAGSEGGESARGAPGKERPEPAPRFVVRSPQGWPPWLVAAVGHAIGEFVPRCADS FRDLAKIGEGTYSNVYKARDLVT GKTVALKKVRFDNLEAESIKFMAREILVLTRLNHPNVIK LEGPVTSRMSSGLYLAFEYMEHDLSGIAARQNGKFTEPQ VKCFMRQLLSGLEHCHNHDVLHRDIKCSNLLIDNEGNLK IADFGLATFYDPERKQVMTNRVVTLWYRAPELLLGATSY GIGIDLWSAGCILAELLYGKPIMPGRTEVEQLHKIFKLC GSPSEAYWNKFKLPMANIFKPPQPYARCIAETFKDFPPS ALPLLETLLSIDPDERGTATTALNSEFFAAEPHACEPSSLPKYPPSKEMDLKLIKEKTRRDSSKRPSAIHGSRRDGIHDRAGRVIPAPEATAENQATLHRPRAMKKANPMSRSEKFPPAHMDGVVGSSANAWLSGPASNAAPDSRRHRSLNQNPSSSVGKASTGSSTTQETLKVAPELLQVGSSSLHPCHRMLVYGSNLTIRSK
11 SEQ ID 271的氨基酸序列。保守蛋白激酶家族域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MGCICAKQADRGPASPGSGILTGAGTGTGTRSSKIPSGLFEFEKSGVKEHGGRSGELRKLEEKGSLSKRLRLELGFSHRYVEAEQAAAGWPSWLTAVAGDAIQGLVPLKADS FEKLE KIGQGTYSSVFRARELANGRMVALKKVRFDNFQPESIQF MAREISILRRLDHPNIMKLEGIITSRMSNSIYLVFEYME HDLYGLISSPQVKFSDAQVKCYMKOLLSGIEHCHQHGVI HRDVKSSNILVNNEGILRIGDFGLANILNPKDROOLTSH VVTLWYRPPELLNGSTSYGVTVDLWSVGCVFAELMFRKP ILRGRTEVEQLHKIFKLCGSPPDGYWKMCKVPQATMFRP RHAYECTLRERCKGIATSAMKLMETFLSIEPHKRGTASS ALISEYFRTVPYACDPSSLPKYPPNKEIDAKHREEARRKKARSRVREAEVGKRPTRIHRASQEQGFSSNIAPKEKRSYA
12 SEQ ID 272的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MAVAAPGHLNVNESPSWGSRSVDC FEKLEQIGEGTYGQV YMAKEKKTGEIVALKKIRMDNEREGFPITAIREIKILKK LHHENVIKLKEIVTSPGPEKDEQGRPEGNKYKGGIYMVF EYMDHDLTGLADRPGMRFSVPQIKCYMRQLLTGLHYCHI NQVLHRDIKGSNLLIDNEGNLKLADFGLARSFSNDHNAN LTNRVITLWYRPPELLLGATKYGPAVDMWSVGCIFAELL HGKPIFPGKDEPEQLNKIFELCGAPDEINWPGVSKIPWY NNFKPTRPMKRRLREVFRHFDRHALELLERMLTLDPSQR ISAKDALDAEYFWADPLPCDPKSLPKYESSHEFQTKKKRQQQRQHEETAKRQKLQHPPQHPRLPPVQQSGQAHAQMRPGPNQLMHGSQPPVATGPPGHHYGKPRGPSGGAGRYPSSGNPGGGYNHPSRGGQGGSGGYNSGPYPPQGRAPPYGSSGMPGAGPRGGGGNNYGVGPSNYPQGGGGPYGGSGAGRGSNMMGGNRNQQYGWQQ
13 SEQ ID 273的氨基酸序列。保守丝氨酸/苏氨酸蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MGCICTKGILPAHYRIKDGGLKLSKSSKRSVGSLRRDELAVSANGGGNDAADRLISSPHEVENEVEDRKNVDFNEKLSKSLQRRATMDVASGGHTQAQLKVGKVGGFPLGERGAQVVAGWPSWLTAVAGEAINGWVPRRADS FEKLEKIGOGTYSS VYRARDLETNTIVALKKVRFANMDPESVRFMAREIIIMR KLDHPNVMKLEGLITSRVSGSLYLVFEYMDHDLAGLAAT PSIKLTESQIKCYMQQLLRGLEYCHSHGVLHRDIKGSNL LVDNNGNLKIGDFGLATFFRTNQKQPLTSRVVTLWYRPP ELLLGSSDYGASVDLWSSGCILAELFAGKPIMPGRTEVE QLHKIFKLCGSPSEEYWKKSKLPHATIFKPQQPYKRCLL ETFKDFPSSALGLLDVLLAVEPECRGTASSALQNEFFTSNPLPSDPSSLPKYPSSKEFDARLRDEEARKHKATAGKARGLESIRKGSKESKVVPTSNANADLKASIQKRQEQSNPRSTGEKPGGTTQNNFILSGQSAKPSLNGSTQIGNANEVEALIVPDRELDSPRGGAELRRQRSFMQRRASQLSRFSNSVAVGGDSHLDCSREKGANTQWRDEGFVARCSHPDGGELAGKHDNSHHLLHRPISLFKKGGEHSRRDSIASYSPKKGRIHYSGPLLPSGDNLDEMLKEHERQIQNAVRKARLDKVKTKREYADHGQTESLLCWAKGR
14 SEQ ID 274的氨基酸序列。保守蛋白激酶家族域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MDPDPSPDPDPPKSWSIHTRREIIAR YEILERVGSGAYS DVYRGRRLSDGLAVALKEVHDYQSAFREIEALQILRGSP HVVLLHEYFWREDEDAVLVLEFLRSDLAAVIADASRRPR DGGGGGAAALRAGEVKRWMLQVLEGVDACHRNSIVHRDL KPGNLLISEEGVLKIADFGQARILLDDGNVAPDYEPESF EERSSEQADILQQPETMEADTTCPEGQEQGAITREAYLR EVDEFKAKNPRHEIDKETSIFDGDTSCLATCTTSDIGED FFKGSYVYGAEEAGEDAOGCLTSCVGTRWFRAPELLYGS TDYGLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIF NVLGNLSEEVWPGCTKLPDYRTISFCKIENPIGLESCLP NCSSDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVPQSKNSHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP
    15 SEQ ID 275的氨基酸序列。保守丝氨酸/苏氨酸蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。  MDPDPSPSPDPPKSWSIHTRREIIAR YEILERVGSGAYS DVYRGRRLSDGLAVALKEVHDYQSAFREIEALQILRGSP HVVLLHEYFWREDEDAVLVLEPLRSDLAAVIADASRRPR GGGVAPLRAGEGKRWMLQVLEGVDACHRNSTVHRDLKPG NLLISEEGVLKIADFGQARILLDDGNVAPDYEPESFEER SSEQADILQQPETMEADTTCPEGQEQGAITREAYLREVD EFKAKNPRHEIDKETSIYDGDTSCLATCTTSDIGEDPFK GSYVYGAEEAGEDAQGSLTSCVGTRNFRAPELLYGSTDY GLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIFNVL GNLSEEVWPGCTKLPDYRTISFCKIENPIGLESCLPNCS SDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVPQSKNSHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP
    16  SEQ ID 276的氨基酸序列。保守周期素和周期素C末端域为下划线,且周期素信号为粗体。  MSNQHRRSSFSSSTTSSLAKRHASSSSSSLENAGKAFAAAAVPSHLAKKRAPLGNLTNLKAGDGNSRSSSAPSTLVANATKLAKTRKGSSTSSSIMGLSGSALPRYASTKPSGVLPSVNPSIPRIEIAVDPMSCSMVVSPSRSDMQSVSLDESMSTCESFKSPDVEYIDNEDVSAVDSIDRRTFSNLYISDAAAKTAVNICERDVLMEMETDEKIVNVDDNYSDPQLCATIAC D IYQHLRASEAKKRPSTDFMDRVQKDITASMRAILIDWLV EVAEEYRLVPDTIYLTVNYIDRYLSGNVMNRQRLQLLGV ACMMIAAKYEEICAPQVEEFCYITDNTYFKEEVLQMESS VLNYLKFEMTA PTVKCFLRRFVRAAQGVNEVPSLQLECM ANYIAELSLLEYDMLCYAPSLVAASAIFLAKFVITPSKR PWDPTLQHYTLYQPSDLGNCVKDLHRLCFNNHGSTLPAIREKYSQHKYKYVAKKYCPPSIPPEFFHNLVY
    17  SEQ ID 277的氨基酸序列。保守周期素和周期素C末端域为下划线。   MNKENAVGTKSEAPTIRITRSRSKALGTSTGMLPSSRPSFKQEQKKTVRANAKRSASDENKGTMVGNASKQHKKRTVLNDVTNIFCENSYSNCLNAAKAQTSRQGRKWSMKKDRDVHQSGAVQIMQEDVQAQFVEESSKIKVAESMEITIPDKWAKRENSEHSISMKDTVAESSRKPQEFICGEKSAALVQPSIVDIDSKLEDPQACTPYAL DIYNYKRSTELERRPSTIYMET LQKDVTPNMRGILVDWLVEVSEEYKLVPDTLYLTVNLID RSLSQKFIEKQRLQLLGVTCMLIASKYEEICPPRVEEFC FITDNTYTSLEVLKMESRVLNLLHFQLSVPTVKTFLRRF VQAAQVSSEVPSVELEYLANYLAELTLVEYSFLKFLPSL MAASAVLLARWTLNQSDNPWNLTLEHYTKYKASELKAAV LALEDLQLNTSGSTINAIREKYRQQKVNYSLLIHSKANHEIL
    18 SEQ ID 278的氨基酸序列。保守周期素N和C末端家族域为下划线,且周期素信号为粗体。  MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAPPYPCAVNKRVLSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDDDRMADDFPVPMFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYID DLYMFYQKAEASSCV PPNYMDRQQDINERMRGILIDWLIEVHYKFELMDETLYL TVNLIIDRFLAVQPVVKKKLQLVGVTAMLACKYEEVSVP VVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSVPTPY VFMRRFLKAAQSDKKLELLSFFIIELSLVEYDMLKFPPS LLAASAIYTALSTITRTKQWSTTCEWHTSYSEEQLLECA RLMVTFHQRAGSGKLTGVHRKYSTSKFGHAARTEPANFLLDFRL
    19 SEQ ID 279的氨基酸序列。保守周期素和周期素C末端域为下划线。  MASRPIVPVQARGEAAIGGGAGKAAIGGGAGKQQKKNGAAEGRNRKALGDIGNLVTVRGIEGKVQPHRPITRSFCAQLLANAQAAAAAENNKKQAVVNVNGAPSILDVPGAGKRAEPAAAAAAAVAKAAQKKVVKPKQKAEVIDLTSDSEERSRPRRSN NIMSLRRRKERNHREGICPLSLRSSLLEARLVDWLI EIHNKFDLMPETLYLTINIIDRPLSVKAVPRRELQLLGM GALFTASKYEEIWAPEVNDLVCIADRAYSHEQVLAMEKT ILGKLEWTLTVPTHYVFLVRFIKASLGDRKLENMVYFLA ELGVMNYATLTYCPSMVAASAVYAARCTLGLTPLWNDTL KLHTGFSESQLMDCARLLVGYHAKAKENKLQVVYKKYSSSQREGVALIPPAKALLCEGGGLSSSSSLASSS
    20 SEQ ID 280的氨基酸序列。保守周期素和周期素C末端域为下划线,且周期素信号为粗体。  MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKRGHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDVEDCQPSSENQPVPMFLEIPESRLDDDMEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIE DIYANYRRTE NCSCVSANYNAQQADINEKMRSILIDWIIEVHDKFDLMH ETLFLTVNIIDRFLARQSVVRKKLQLVGLVAMLLACKYE EVSVPVVGDLILISDKAYTRKEVLEMESLMLNSLQFNMS VPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYENV KFPPSLLAAAAIFTAQCTLYGFKOWTKTCENHSNYTEDO LLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEPANFLLGEMKNP
    21 SEQ ID 281的氨基酸序列。保守周期素和周期素C末端域为下划线,且周期素信号为粗体。  MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKRGHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDVEDCQPSSENQPVPMFLEIPESRLDDDMEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIE DIYANYRRTE NCSCVSANYMAQQADINEKMRSILIDNLIKVHDKFDLMH ETLFLTVNLIDRFLARQSVVRKKLQLVGLVAMLLACKYE EVSVPVVGDLILISDKAYTRKEVLEMEKLMLNSLQFNMS VPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYEMV KFPPSLLAAAAIFTAQCTLYGFKQWTKTCEWHSNYTEDQ LLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEAANFLLGEMKNP
    22 SEQ ID 282的氨基酸序列。保守周期素N和C末端家族域为下划线。  MAMVQRQGHDPSSPQEQEDGPSSFLSDDALYCEEGRFEEDDGGGGGQVDGIPLFPSQPADRQQDSPWADEDGEEKEEEEAELQSLFSKERGARPELAKDDGGAVAARREAV EWMLMV RGVYGFSALTAVLAVDYLDRFLAGFRLQRDNRPWMTQLV AVACLALAAKVEETDVPLLVELQEVGDARYVPEAKTVQR MELLVLSTLGWEMHPVTPLSFVHHVARRLGASPHHGEFT HWAFLRRCERLLVAAVSDARSLKHLPSVLAAAAMLRVIE EVEPFRSSEYKAQLLSALHMSQEMVEDCCRFILGIAETA GDAVTSSLDSFLKRKRRCGHLSPRSPSGVIDASFSCDDE SNDSWATDPPSDPDDNDDLNPLPKKSRSSSPSSSPSSVPDKVLDLPFMNRIFEGIVNGSPI
    23  SEQ ID 283的氨基酸序列。保守周期素和周期素C末端域为下划线。  MEASYQPHHHGHLRQHDPSSSQQEEQVPFDALYCSEEHWGEEDEEEGLASDGLLSEERDHRLLSPRALLDQDLLWED E ELASLFSKEEPGGMRLNLENDPSLADARREAVEWIMRVH AHYAFSALTALLAVNYWDRFTCSFALQEDKPWMTQLSAV ACLSLAAKVEETQVPLLIDFQVEDSSPVFEAKNIQRMEL LVLSSLEWKMNPVTPLSFLDYMTRRLGLTGHLCWEFLRR CENVLLSVISDCRFTCYLPSVIAASTMLHVINGLKPRLD VEDQTQLLGILAMGMDKIDACYKLIDDDHALRSQRYSHN KRKFGSVPGSPRGVMELCFSSDGSNDSWSVAASVSSSPEPHSKKSRAGEEAEDRLLRGLEGEEDDPASADIFSFPH
    24 SEQ ID 284的氨基酸序列。保守周期素和周期素C末端域为下划线。  MALQEEDTRRHYPTAPPFSPDGLYCEDETFGEDLADNACEYAGGGARDGLCEIKDPTLPPSLLGQDLFWED GELASLV SRETGTHPCWDELISDGSVALARKDAVGNILRVHGHYGF RPLTAMLAVNYLDRFFLSRSYQRDRPWISQLVAVACLSV AAKVEETQVPILLDLQVANAKFVFESRTIQRMELLLMST LDWRMNSVTPISFFDHILRRFGLTTNLHRQFFWMCERLL LSVVADVRLASFLPSVVATAAMLYVNKEIEPCICSEFLD QLLSLLKINEDRVNECYELILELSIDHPEILNYKHKRKR GSVPSSPSGVIDTSFSCDSSNDSNGVASSVSSSLEPRFKRSRFQDQQMGLPSVNVSSMGVLNSSY
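    25 Amino acid sequence of SEQ ID 285. The conserved cyclin-dependent kinase regulatory subunit domain is underlined, and cyclin-dependent kinase regulatory subunit signature 1 is in bold.   MGQIOYSEKYEDDTYGYRHVVLPPDVAKLLPKNRLLSEN EWRAIGVQQSRGWVHYAIHRPEPHIMLFRRPLNYQQQQENQAQQNMLAK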
    26 SEQ ID 286的氨基酸序列。保守染色体域为下划线,且MOZ/SAS样蛋白域为粗体/斜体。  MGSIDPPKAEQNGTAAAAVADPGQKPGAGDAMPPPPPVKHSNGTAAEPDVATKRRRMSVLPLEVGTRVMCRWRDG KYH PVKVIERRKLNPGDPNDYEYYVHYTEFNRRLDEWVKLEQ LDLNSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDCSKLYFCEFCLNFMKRKEQLQRHMKKCDLKHPPGDEIYRSGTLSMEEVDGKKNKVYGQNICYLAKLFLDHKTLYYDVDLFLFYVLCGCDDRGCHMYGYESKEKHSKSSYNLACILTLPPYQRKGYGKELIAFSYELSKKEGKVGTPERPLSDLGLLSYKGYWTRVLLDILKKMKANISIKELSDMTAIKADDILNTLQSLDLIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQG
    27 SEQ ID 292的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。  MDTGGNSLPSGPDGVK RKVCYFYDPEVGNYYLLQHMQVL KPVPARDRDLCRFHmDDYVAFLRSITPETQQDQLRQLKR FNVGEDCPVFDGLHSFCQTYAGGSVGGAVKLNHGLCDIA INWAGGLHHAKKCEASGFCYVNDIVLGILELLKOHERVL YVDIDIHHGDGVEEAFYTTDRVMTVSFHKFGDYFPGTGD IRDIGYGKGKYYSLNVPLDDGIDDESYHSLFKPIIGKVM EVFKPGAVVLQCGADSLSGDRLGCFNLSIKGHAECVRYM RSFNVPVLLLGGGGYTIRNVARCWCYETGVALGLEVDDKMPQHEYYEYFGPDYTLHVAPSNMENKNSRQLLEEIRSKLLENLSKLQHAPSVPFQERPPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLPSRVKRELIVEPEVKDQDSQKASIDHGRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNVNKPSEQIFPK
    28 SEQ ID 293的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。  MDTGGNSLPSGPDGVK RKVCYFYDPEVGNYYYGQGHPMK PHRIRMTHALIAHYGLLQHMQVLKPVPARDRDLCRFHAD DYVAFLRSITPETQQDQLRQLKRFNVGEDCPVFDGLHSF CQTYAGGSVGGAVKLNHGLCDIAINWAGGLHHAKKCEAS GFCYVNDIVLGILELLKQHERVLYVDIDIHHGDGVEEAF YTTDRVMTVSFHKFGDYFPGTGDIRDIGYGKGKYYSINV PLDDGIDDESYHSLFKPIIGKVMEVFKPGAVVLQCGADS LSGDRLGCFNLSIKGHAECVRYMRSFNVPVLLLGGGGYT IRNVARCWCYETGVALGLEVDDKMPQHEYYEYFGPDYTLHVAPSNMENKNSRQLLEDIRSKLLENLSKLQHAPSVPFQERPPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLPSRVKRELIVEPEVKDQDSQKASIDHGRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNVNKPSEOIFPK
    29 SEQ ID 294的氨基酸序列。保守组蛋白去乙酰基转移酶域为下划线。  MRPK DRISYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVL SYELHTKMEIYRPHKAYPAELAQFHSPDYVEFLHRITPD TQHLFPNDLAKYNLGEDCPVFENLFEFCQIYAGGTIDAA RRLNNQLCDIAINWAGGLHHAKKCEASGFCYINDLVLGI LELLKYHARVLYIDIDVHHGDGVEEAFYFTDRVMTVSFH KFGDMFPPGTGDVKEIGGKEGKFYAINVPLKDGIDDTSF TRLFKAIISKVVETYQPGAIVLQCGADSLAGDRLGCFNL SIDGHSECVRFVKKFNLPLLVTGGGGYTKENVARCWVVE TGVLLDTELPNEIPENEYFKYFAPDYSLKIPRGNIVLENLNSKSYLSAIKVQVLENLRNIQHAPSVQMQEVPPDFYIPDFDEDEQNPDERMDQHTQDKQIQRDDEYYDGDNDNDHNMDD
    30 SEQ ID 295的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线,且锌指RanBP2型特征为粗体。   MTVAEDFHVNNRSKMVSQATPESRLTGGEDDNSLHNQVDELLCQELPERQVILEFEGTRPKPYFSDHNGGENSALGVRATEDDLNSDVEAEEKQKEMTLEDMYKNDGTLYDDDEDDSDWEPVKRQVELMRWFCTNCTMVNVEDVFLCDICGEHRDSGILRHGFYASPFMQDVGAPSVEAEVQESREDHARSSPPSSSTVVGFDEKMLLHSEVEMKSHPHPERADRLQAIAASIA TAGIFPGRCRSLPVREITKEELQMVHSSEHVDAVEMTSH MFSSYFTPDTYANEHSARAARIAAGLCADLASTIISGRS KNGFALVRPPGHHAGIKHAMGFCLHNNAAVAALAAQGAG AKKVLIVDWDVHHGNGTQEIFDGNKSVLYISLHRHEGGN FYPGTGAAHEVGTMGAEGYCVNIPWSRRGVGDNDYVFAF HHIVLPIASAFAPDFTIISAGFDAARGDPLGCCDVTPAG YAQMTHMLSALSGGKLLVILEGGYMLRSISSSAVAVIKV LLGDSPISEIADAVPSKAGLRTVLEVLKIQRSYWPSLESIFWELQSQWGMFLVDNRRKQIRKRRRVLVPIWWKWGRKSVLYHLLNGHLHVKTKR
    31 SEQ ID 296的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MAAAPSSPPTNRVDVFMHDGMLSHDTGRGVFDTGSDPGF LDVLEKHPENPDRVRNMVSILKRGPISPFISWHTATPAL ISQLLSFHSPEYINELVEADKNGGKVLCAGTFLNPGSWD AALLAAGNILSAMKYVLDGKGKIAYALVRPPGHHAQPSQ ADGYCFLNNAGLAVRLALDSGCKRVVVVDIDVHYGNGTA EGFYQSSDVLTISLHMNHGSWGPSHPQSGSVDELGEDEG YGYNMNIPLPNGTGDRGYEYAVTELVVPAVESFKPEMVVLVVGQDSSAFDPNGRQCLTMDGYRAIGRTIRGLADRHSGGRILIVQEGGYHVTYSAYCLHATVEGILDLPDPLLADPIAYYPEDEAFPVKVVDSIKRYLVDKVPFLKEH
    32 SEQ ID 297的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。  MVESSGGASLPSVGQDARK RRVSYFYEPTIGDYYYGQGH PMKPHRIRMAHNLIVHYYLHRRMEISRPFPAATTDIRRF HSEDYVTFISSVTPETVSDPAFSRDLKRFNVGEDCPVFD GIFGFCQASAGGSMGAAVKLNRGDSDIALNWAGGLHHAK KSEASGFCYVNDIVLGILELLKVHKRVLYVDIDVHHGDG VEEAFYTTDRVMTVSFHKFGDFFPGSGHIKDTGAGPGKN YALNVPLNDGIDDESFRGMFRPIIQKVMEVYQPDAVVLQ CGADSLSGDRLGCFNLSVKGHADCLRFLRSENVPLMVLG GGGYTMRNVARCWCYETAVAVGVEPENDLPYNEYYEYFGPDYTLHVEPCSMENLNAPKDLERIRNMLLEQLSRIPHAPSVPFQMTPPITQEPKEAEEDMDERPKPRIWNGEDYESDAEEDKSQHRSSNADALHDENVEMRDSVGENSGDKTREDRSPS
    33 SEQ ID 299的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶家族域为下划线。  MAAIISCHHYHSCCSSLIASKWVGARIPTSCFGRSSTQSNNAASVRQFVTRCSSSPSSRGQWQPHQNGEKGRSFSLRECAISIALAVGLVTGVPSLDMSTGNAYAASPALPDLSVLISGPPIKDPEALLRYALPINNKAIREVQKPLEDITDSLKVAGLRALDSVERNVRQASRVLKQGKNLIVSGLAESKKDHGVELLDKLEAGMDELQQIVEDGNRDAVAGKQRELLNYVGGVEEDMVDGFPYEVPEEYKNMPLLKG RAAVDMKVKVKDNP NLEECVFRIVLDGYNAPVTAGNFVDLVERHFYDGMEIQR ADGFVVQTGDPEGPAESFIDPSTEKPRTIPLEIMVDGEK APVYGATLEELGLYKAQTKLPFNAFGTMAMARDEFEDNS ASSQIFWLLKESELTPSNANILDGRYAVFGYVTENQDFLADLKVGDVIESVQVVSGLDNIANPSYKIAG
    34 SEQID300的氨基酸序列。保守FKBP型肽基脯氨酰基异构酶域为下划线。FKBP型肽基-脯氨酰基顺-反异构酶信号1为粗体,且FKBP型肽基-脯氨酰基顺-反异构酶信号2为粗体/斜体。  MAGEDFDIPPADEMNEDFDLPDDDDDAPVMKAGDEKEIGKQGLKKLVKE GDAWETPDNGDEVEVHYYTGTLLDGTOED SSRDRGTPFKFTLGQGQVIKGWDQGIKTMKKGERAIFTI PPELAYGEAGSPPTIPPNATLQFDVELLSWTSVKDICKDGGIFKKILVE GEKWENPKDLDEVLVRYEFQLEDGTTIAR SDGVEFTVKEGHFCPAVAKAVKTMKKGEKVLLTVKPQYG FGEKGKPASGDEGAVPPNATLQITIELVSWKTVSEVTDDKKVIKKILKEGEGYERPNEGAVVEVKLIGKLQDGTVFVKKGHDDCEELFKFKIDEEQVVDGLDKAVMNMKKGEVALLTVAPEYAFGSSESKQDLAVVPPSSTVYYEVELVSFVKDKESWDMNTEEKIEAAGKKKEEGNVIFKAGKYAKASKRYEKAVKYIEYDTSFSEDEKKQAKALKVACNLNDAACKLKLKDYNQAEKLCTKVLELDSRNVKALYRRAQAYIELSDLDLAEFDIKKALEIDPHNRDVKLEYKVLKEKVKEFNKKDAKFYGNMFAKMSKLEPVEKTAAKEPEPMSIDSKA
    35 SEQ ID 301的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶家族域为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。  MSTVYVLEPPTKG KVVLNTTHGPLDVELWPKEAPKAVRN FVQLCLEGYYDNTIFHRIIKDPLVQGGDPTGSGTGGESI YGDAFSDEFHSRLRFKHRGLVACANAGSPHSNGSQFFIT LDRCDWLDRKNTIFGKITGDSIYNLSGLAEVETDKSDRP LDPPPKIISVEVLWNPFEDIVPRAPVRSLVPTVPDVQNKEPKKKAVKKLNLLSFGEEAEEEEKALVVVKQKIKSSHDVLDDPRLLKEHIPSKQVDSYDSKTARDVQSVREALSSKKQELQKESGAEESNSFREIADDEDDDDDDASFDARMRRQILQKRKELGDLPPKPKPKSRDGISARKERETSISRDKDDDDDDDQPRVEKLSLKKKGIGSEARGERMANADADLQLLNDAERGRQLQKQKKHRLRGREDEVLTKLETFKASVFGKPLASSAKVGDGDGDLSDWRSVKLKFAPEPGKDRMTRNEDPNDYVVVDPLLEKGKEKFNRMQAKEKRRGREWAGKSLT
    36 Amino acid sequence of SEQ ID 302. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase family domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MASAISMHSSGLLLLQGTNGKDVTEMGKAPASSRVANMQQRKYGATCCVARGLTSRSHYASSLAFKQFSKTPSIKYDRMVEIKAMATDLGLQAKVTN KCFFDVEIGGEPAGRIVIGL FGDDVPKTVENFRALCTGEKGFGYKGCSFHRIIKDEMIQ GGDFTRGNGTGGKSIYGSTFEDENFALKHVGPGVLSMAN AGPSTNGSQFFICTVKTPWLDNRHVVFGQVVDGMDVVQK LESQETSRSDVPRQPCRIVNCGELPLDG
    37 SEQ ID 303的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线。  MAASFTALSNVGSLSSPRNGSEIRRFRPSCNVAASVRPPPLKAGLSASSSSSFSGSLRLIPLSSSPQRKSRPCSVRASAEAAAAQSKVTN KVYLDISIGNPVGKLVGRIVIGLYGDD VPCTAENFRALCTGEKGFGYKGSTVHRVIKDFMIQGGDF DKGNGTGGKSIYGRTFKDENFKLSHVGPGVVSMANAGPN TNGSQFFICTVKTPWLDQRHVVFGQVLEGMDIVRLIESQETDRGDRPRKRVVVSDCGELPVV
    38 Amino acid sequence of SEQ ID 304. The conserved FKBP-type peptidyl-prolyl cis-trans isomerase signature is underlined, and FKBP-type peptidyl-prolyl cis-trans isomerase signature 2 is in bold.  MAEAIDLTGDGGVMKTIVRRAK PDAVSPSETLPLVDVRY EGVLAETGEVFDSTHEDNTLFSFEIGKGSVISAWDTALR TMKVGEVAKITCKPEYAIGSTGSPPDIPPDATLIFEVEL VACKPCKGFSVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGKGKAK
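    39 Amino acid sequence of SEQ ID 305. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase family domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MGNP KVFFDMSIGGQPAGRIVMELYADVVPRTAENFRAL CTGEKGAGRSGKPLHYKGSSFHRVIPGEMCQGGDFTAGN GTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGS QFFVCTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSS GRTSKPVVVADCGQLS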
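    40 Amino acid sequence of SEQ ID 306. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MPNP KVFFDMTIGGAAAGRVVMELYADTTPRTAENFRAL CTGEKGVGRSKKPLHYKGSKFHRVIPSFMCQGGDFTAGN GTGGESIYGVKFADENFIKKHTGPGILSMANAGPGTNGS QFFICTTKTEWLDGKHVVFGKVVEGMEVVKAIEKVGSSS GRTSKPVVVADCGQLP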
    41 SEQ ID 307的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且FKBP型肽基-脯氨酰基顺-反异构酶信号2为粗体。  MAEAIDLTGDGGVMKTIVRRAKPDAVS PSETLPLVDVRY EGVLAETGEVFDSTHEDNTLFSFEIGKGSVISAWDTALR TMKVGEVAKITCKPEYAYGSTGSPPDIPPDATLIFEVEL VACKPCKGFSVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGKGKAK
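    42 Amino acid sequence of SEQ ID 308. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MATARSFFLCALLLLATLYLAQAKKSEDLKEVTH KVYFD VEIAGKPAGRIVMGLYGKAVPKTAENFRALCTGEKGTGK SGKPLHYKGSSFHRIIPSFMLOGGDFTLGDGRGGESIYG EKFADENFKLKHTGPGLLSMANAGPDTNGSQFFITTVTT SWLDGRHVVFGKVLSGMDVVYKVEAEGRQSGTPKSKVVI ADSGELPL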
    43 SEQ ID 309的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶家族域为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。  MMRREISVLLQPRFVLAFLALAVLLLVFAFPFSRQRGDQVEEEPEITH RVYLDVDIDGQHLGRIVIGLYGEVVPRTVE NFRALCTGEKGKSANGKKLHYKGTPFHRIISGFMIQGGD VIYGDGKGYESIYGGTFADENFRIKHSHAGIISMVNSGP DSNGSQFFITTVKASWLDGEHVVFGKVIQGNDTVYAIEG GAGTYNGKRRKKVIIADSGEIPKSKWDEER
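    44 Amino acid sequence of SEQ ID 310. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase family domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MWATAEGGPPE VTLETSMGSFTVELYFKHAPRTSRNFIE LSRRGYYDNVKFHRIIKDFTVQGGDPTGTGRGGESIYGK KFSDEIKPELKETGAGILSMANAGPNTNGSQFFITLAPC PSLDGKHTIFGRVCRGMEIIKRLGSVQTDNNDRPIHDVK ILRTSVKD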
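    45 Amino acid sequence of SEQ ID 311. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase family domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MSNP KVFFDILIGKMKAGRVVMELFADVTPKTAENFRAL CTGEKGIGRSGKPLHYKGSTEHRLIPNFMCQGGDFTRGN GTGGESIYGMKFADENFKIKHTGLGVLSMANAGPDTNGS QFFICTEKTPWLDGKHVVFGKVIDGYNVVKEMESVGSDS GSTRETVAIEDCGQLSEN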
    46 SEQ ID 312的氨基酸序列。保守FKBP型肽基脯氨酰基异构酶域为下划线。FKBP型肽基-脯氨酰基顺-反异构酶信号1为粗体,且FKBP型肽基-脯氨酰基顺-反异构酶信号2为粗体/斜体。TPR重复为斜体。  MDDDFEFPASSNVENDDDDGMDMDDMGGDVPEEEDPVASPAVLKVGEEREIGKAGFKKKLVKE GFGWETPSSGDEVEV HYTGTLLDGTKFDSSRDRGTPFKFKLGRGQVIKGWDEGI KTMKRKKGENAIFIPPELAGESGSPPTIPPNATLQFDVE LLSWSSVKDICKDGGILKKVLVE GEKWDNPKDLDEVFVK YEASLEDGTLISKSDGVEFTVGDGYFCAALAKAVKTMKK GEKVLLTVMPQYAFGETGRPASGDEAAVPPDASLQIMLE LVSNKTVSDVTKDKKVLKKTLKE GEGYERPNDGAAVQVR LCGKLQDGTVFVKKDDEEPFEFKIDEEQVIDGLDRAVKN MKKGEVALVTIQPEYAIGPTESQQDLAVVPANSTVYYEV ELLSFVKEKESSWEMNQEKIEAAARKKEEGNAAFKAGKYVRASKRYEKAVRFIEYDSSFSDEEKQQAKTLKNTCNLNDAACKLKLKDFKEAEKLCTKVLEGDGKNVKALYRRAQAYIQLVDLDLAEQDIKKALEIDPNNRDVKLEYKILKEKVREYNKRDAQFYGNMFAKMNKLEHSRTAGMGAKHEAAPMTIDSKA
    47 SEQ ID 313的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶家族域为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。TPR重复为粗体/斜体。  NAKP RCFMDISIGGELEGRIVGELYTDVAPKTAENFRAL CTGEKGIGPHTGAPLHYKGVRFHRVIKGFMVQGGDISAG DGTGGESIYGLKFEDENFDLKHERKGMLSMANSGPNTNG SQFFITTTRTSHLDGKHVVFGRVVKGMGVVRSVEHVTTA AGDCPTVDVVIADCGEIPAGADDGIRNFFKDGDTYPDWPADLDESPAELSWWMDAVDSIKAFGNGSYKKQDYKMALRKYRKALRYLDICWEKEGIDEVESSSLRKTKSQIFTNSSACKLKLCDLKGALLDAEFAVRDGENNAKAYFRQGQAHMELNDIDAAAESFSKALELEPNDVGIKKELNAAKKKIFERREQEKRAYRKMFL
    48 SEQ ID 314的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型基-脯氨酰基顺-反异构酶信号为粗体。  MTKRKNP LVFLDVSIDGDPVERIVIELFADTVPRTAENF RSLCTGEKGVGKTTGKPLHYKGSYFHRIIKGFMAQGGDF SNGNGTGGESIYGGKFADENFKLAHDGPGLLSMANGGPN TNGSQFFIIFKRQPHLDGKHVVFGKVMRGMEVVKKIEQV GSANGKPLQPVKIVDCGETSETGTQDAVVEEKSKSATLKAKKKRSARDSSSESRGKRRQRKSRKERTRKKRRYSSSDSYSSESSDSDSESYSSDTESESKSHSESSVSDSSSSDGRRRKRKSTKREKLRRQRGKDSRGEQKSARYDKKSRHKSADSSSDSESESSSRSRSRDDKKKSSRRESARSVSKLKDAEANSPENLESPRDREIKKVEDNSSHEEGEFSPKNDVQHNGHGTDAKFGKYDDQRPRSDGSKKSSGSMRDSPKRLANSVPQGSPSSSPAHKASEPSSSIRARNPSRSPAPDGNSKRIRKGRGFTERFSYARRYRTPSPSDVTYRPYHYGRRNFHDRRNDRYSNYRSYSERSPHRRYRSPPRGRSPPKYQRKRSRSRSVSRSPGGNKGRYRGRDQSRSRSRSRSRSPRRGSSPANKQLPLSERLKSRLGTRVDRHSPRRRRSSSRSHDSSRSRSPDEVPDKHEGKAAPVSPARSRSSSPSGRGLVSYGDASPDSGIN
    49 SEQ ID 315的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线。CCHC型锌指为粗体,且RNA结合区RNP-1(RNA识别基序)为粗体/斜体。  MS VLLVTSLGDIVVDLHADRCPLTCKNFLKLCRIKYYNG CVFHTVQKDFTAQTGDPTGTGTGGDSVYKFLYGDQARFF HDEIHLDLKHSKTGTVAMASGGENLNASQFYFTLRDDLD YLDGKHTVFGEVAEGLETLTRINEAYVDEKGRPYKNIRI RHTYILDDPFDDPPQLAELIPDASPEGKPKDEVVDDVRLEDDWVPLDEQLGPAQLEEAIRAKEAHSRAVVLESIGDIPDAEIKPPDNVLFVCKLNPVTEDEDLHTIFSRFGTVVSADVLRDEKTGDSLCYAFIEFENKDSCCQAYEKMDNALIDDRRIKVDFSQSVAKLWSQFKRKDSQAAKGKGCTKCCAPDHMARECPGSSTRQPLSKYILKEDNAQRGGDDSRYEMVFDEDAPESPSHGKKRRGRDDRDDRHKMSRQSVEETKFNDREGGHSVDKHRQSERSKHREDEMSRDSKASEAGRRRIDRDFPEEERDGEKYTESHRDRDGKRGDYRDYRKGRADVQTHGDRRGDENYRRKSAAYDDGHEGAGAARRKDSNDDHHAYRRGYGDSRKGTRDEDDDGRGRRDDPSYRRSSGHKDSSNGGREEQKYRSGETDGKSHPERSHRGDRRR
    50 Amino acid sequence of SEQ ID 316. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is underlined.  MRPFNGGSSIACLVLVLAAGALAESQGPHLGSA RVVFQT NYGDIEFGFFPGVAPRTVDHIFKLVRLGCYNTNHFFRVD KGFVAQVADVANGRTAPMNDEQRTEAEKTIVGEFSNVKH VRGILSMGRYDDPDSAQSSFSILLGDAPHLDGKYAIFGR VTKGDETLKKLEQLPTRREGMFVMPTERITILSSYYYDTGAESCEEENSTLRRRLAASAVEVERQRMKCFP
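    51 Amino acid sequence of SEQ ID 317. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.  MPNP KVFFDMQVGGAPAGRIVMELYADVVPKTAENFRAL CTGEKGTGRSGKPLHPKGSSFHRVIPGEMDQGGDFTRGN GTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGS QFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGS GRTSKPVVLADSGQLA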
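    52 Amino acid sequence of SEQ ID 318. The conserved FKBP-type peptidyl-prolyl cis-trans isomerase signature is underlined, and FKBP-type peptidyl-prolyl cis-trans isomerase signature 2 is in bold.  MRFTSITSAIALFAAAASALDKPLDIKVDKAV ECSRKTK AGDKIQVHYRGTLEADGSEFDASYKRGQPLSFHVGKGQV IKGWDQGLLDMCDGEKRTLTIQPDWGYGSRGMGPIPANS VLIFETELVEIAGVAREEL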
    53 SEQ ID 319的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。  MGNP KVFFDMSIGGQPAGRIVMELYADVVPRTAENFRAL CTGEKGAGRSGKPLHYKGSSFBRVIFGFMCQGGDFTAGN GTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGS QFFVCTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSS GRTSKPVVVADCGQLS
    54 SEQ ID 320的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。  MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTH KVFFDVEIGGKPAGRI VMGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQ FERIIPKPMIQGGDFTLGDGRGGESIYGNKFSDENFKLK HTDAGRLSMTNAGPDTNGSQFFITTVTTSWLDGRHVVFG KVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL
    55 SEQ ID 321的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶域为下划线。   MAVTLHTNLGDIKCEIFCDEVPKAAEHNARGILSMANSG PNTNGSQFFIAYAKQPHLNGLYTIFGRVIHGFEVLDIMEKTQTGPGDRPLAEIRLNRVTIHANPLAG
    56 SEQ ID 322的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。  MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTH KVFFDVEIGGKPAGRI VIGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQ FHRIIFKTNIQGGDFTLGDGRGGESIYGNKFSDENFKLK HTDAGRLSMANAGPDTNGSQFFITTVTTSWLDGRHVVFG KVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL
    57 SEQ ID 323的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。  MGNP KVFFDMSIGGQPAGRIVMELYADVVPRTAENFRAL CTGEKGAGRSGKPLHYKGSSFHRVIPGFMCQGGDFTAGN GTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGS QFFVCTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSS GRTSKPVVVADCGQLS
    58 SEQ ID 324的氨基酸序列。视网膜母细胞瘤相关蛋白的保守A盒为下划线,且视网膜母细胞瘤相关蛋白的保守B盒为粗体。  MSPVAANAMEEA AEPEVPAPVTPSKDDADTDAAVSRPLG FCKSKLGLAEGNCVQSSTLLRKTAHVLRSSGTVIGTGTA EEAERYWFAFVLYTVRRVGERKAEDEQNGSDETEVPLSR ILKASVLNLIDFFFEIPQFVIKAGAIVSGIYGANWDSRL EAREMQTNYVHLCILCKFYKRICGEFFILNDAKDDMKSA DSSTSDPVIMYQPFGWLLFLALRIHALSRFKDLVSSTNALVSVLAILIIHLPTRFRKFSISDSSQLVKRSEKGVDLVGSLAYRYDTSEDEIKRTLEKANNVIAEILGITPPPASECKAENLENVDTDGLIYFGNLMEETSLSSILSTLEKIYEDATRNDSEFDERVFINDDDSLLVSGSLSGAAINLTGAKRKYDSFASPAKTITRPLSPSRSPASHINGIIGGTNLRITATPVATAMTTAKWLRTFVSPLPSKPSTDLQGFLASCDRDVTSDVIRRANIILEAIFPNSPIGERTVTGGLQNANLMDNMWAEQRRLEALKLYYRVLEAMCRAEAQILHSNNLTSLLTNERFHRCMLACSAELVLATHKTVTMLFPAVLERTGITAFDLSKVIESFVRHEETLPRELRRHLNTLEERLLENMVWERGSSMYNSLVVARPALAPEINRLGLLPEPMPSLDAIALLINFSSSGLPQSPVQKHEASPGQNGDIRSPKRISTEYRSVLVERNFTSPVKDRLLALSNIKSKLPPPPLQSAFASPTRPHPGGGGETCAE TAIHIFFSKITKLAAVRINAMLERLQLSQQIKE GVYCLFQQILSQRTNLFFNRHIDQVILCCFYGVAKINQI NLTFREIIYNYRKQPOCKPQVFRNVFVDWSTRRNGKAGN EHVDIISFYNEIFIPSVKPLLVELGPTGATTRTNRTSEVGNKNDAQCPGSPKISSFPTLPDMSPKKVSASHNVYVSPLRSSKMDASISHSSKSYYACVGESTHAYQSPSKDLVAINSRLNGNRKVRGTLNFDDVDAGLVSDSMVANSLYLQNGSSMSSSTAKSSEK
    59 SEQ ID 325的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MRPILMKGHERPLTFLKYNREGDLLFSCAKDHTPTVWFADNGE RLGTYRGHNGAVCCCDVSRDSMRLITGSADTTAKL WSVQNGTQLFTNFDSPARSVDFSIGDKLAVIITTDPFMELPSAIHVKRIARDPADQASESVLVLRGHQGRIARAVWGPLNKTIISAGEDAVIRIWDSETGKLLR ESDKETGHKKAVT SLMKSVDGSHFVTGSQDKSAKLWDIRTLTLIKTYVTERPVNAVTMSPLLDHVVLGGGQDASAVTMTDHRAGKFEAKFFDKILQEEIGGVKGHFGPINALAFNPDGKSFSSGGEDGYVRLHHFDPDYFNIKI
    60 SEQ ID 326的氨基酸序列。保守G蛋白β域为下划线,且WD-40重复域为粗体。  MDKKR TVVPLVCHGHSRPVVDLFYSPITPDGFFLISASK DSSPMLRNGETGDWIGTFEGHKGAVWSCCLDTNALRAAS GSADFSAKLWDALSGDELHSFEHKHIVRSCAFSEDTHLL LTGGVEKILRIFDLNRPDAPPREVDNSPGSIRTVAWLHB DQTILSSCTDIGGVRLWDVRSGKIVQTLETKSPVTSSEV SQDGRYITTADGSTVRFWDANHFGLVKSYNMPCNIESAS LEPKLGNKFIAGGEDMWVHIFDFHTGEEIGCNKGHHGPV HCVRFSPGGMSYASGSEDGTIRIWQTGPANNVEGDANPSNGPVTGKAKVGADEVTRKVEDLQIGKEGKDWREG
  61 SEQ ID 327的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MAEGLI LKGTMRAHTDMVTAIAIPIDNSDMVVTSSRDKS IILWHLTKEEKVYGV PRRRLTGHSHFVQDVVLSSDGQFA LSGSWDGELRLWDIATGV SARRFVGHTKDVLSVAFSIDN RQIVSASRDRTIKLWNTLGECKY TIQEGEAHTDWVSCVR FSPNTLQPTIVSASWDRTIKVWNLTNCK LRNTLAGHNGY VNTVAVSPDGSLCASGGKDGVILLWDLAEGKRLYNLEAGAIIHSLCFSPNRYWLCAATENSIKIWDLESKSIVEDLRVDLKNEADKTDGTTTAASNKKVIYCTSLNWSADGSTLFSGYNDGVIRVWGTGRY
  62 SEQ ID 328的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MAEGLHLKGTMKAHTDMVTAIAVPIDNADMIVTSSRDKSIILWHLTKEDKV YGVPRRRLTGHSHFVVQDWLSSDGQFA LSGSWDGELRLWDLA TGVSARRFVGHTKDVLSVAFSIDN RQIVSASRDRTIKLWNT LGECKYTIQEGEAHNDWVSCVR FSPNTLQPTIVSASWDRTVKVWNLT NCKLRNTLQGHSGY VNTVAVSPDGSLCASGGKDGVILLWDLAEGKKLYSLEAGAIIHSLCFSPNRYWLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDGTTGAMSSNKKVIYCTSLNWSADGSTLFSGYNDGVIRVWGIGRY
63 Amino acid sequence of SEQ ID 329. The conserved G-protein beta WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signature is in bold.  MAEGLH LKGTMKAHTDNVTAIAVPIDNADMIVTSSRDKS IILWHLTKEDKVY GVPRRRLTGHSHFVQDVVLSSDGQFA LSGSWDGELRLWDLAT GVSARRFVGHTKDVLSVAFSIDN RQTVSASRDRTIKIWNTLGEC KYTIQEGEAHNDWVSCVR FSPNTLQPTEVSASWDRTVKWHLTN CKLRNTLQGDHSGY VNTVAVSPDGSLCASGGKDGVILLWDLAE GKKLYSLEAG AIIHSLCFSPNRYWLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDG TTGAMSSNKKVIYCTSLNWSADGSTLFSGYNDGVIRVWGIGRY
64 SEQ ID 330的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MSGVPAPPFATTTPENGTMSSNSPAFHRDSDDDDDQGEVFLDDSDIIHEVAVDDEDLPDADDEADEAEEADD SLHIFT GHNGEVYSLACSPTDATLVATGAGDDKGFLWRIGHGD WA VELQGHKDDSISSLAISLDGQLASGSLDGVIQIWDVPSGN LKGTLDGPGGGIEWIRWHPKGHIILAGSEDSTVWMWNADKMAU YLNESGHGNSVTCGDFTPDGKTICTGSDDATLRI WNPKSGENIH VVKGHPYHAEGLTSMAISSDSGLAITGAK DGSVRIVNISSGR VVSSLDAHADSVEFVGLALSSPWAAT GSLDQKLIIWDLQHS SPRATCDHEDGVTCLSWVGASRFL ASGCVDGKVRVWDSLSGD CVRTFHGHSDAIQSLSVSANEEFLVSVSIDGTARVFEIAEFH
65 SEQ ID 331的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGTSQHQLSSCLQLLPRRRGNKNLIFRRTMASGGAAAVAPPPGYKPYR HLKTLTGHVAAVSCVKFSNDGTLLASASLD KTLIIWSSAALS LLHRLVGHSEGVSDLAWSSDSHYICSA SDDRTLRIWSSRSPFD CLKTLRGHTDFVFCVNFNPQSSL IVSGSFDETIRIWEVKTGR CINVIRAHSMPVTSVHFNRD GSLIVSGSHDGSCKIWDTKNGAC LKTLIDDTVPAVSFAK FSPNGKFILVATLNDTLKLWNYATGK FLKIYTGHKNSVY CLTSTFSVTNGKYIVSGSEDRCICIWDLQGKN LIQKLEG HSDTVISVTCHPSENKIASAGLDSDRTVRIWLQDA
66 SEQ ID 332的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MP SQKIETGHQDIVHDVAMDYYGKRVATASSDTTIKIIG VSNSSGSQHLASLSGHKGPVWCVAWAHPKFGSILASCSYDGQVILW KEGNQNDWAQAHVFNDHKSSVNSIAWAPHELG LCLACGSSDGNISVFTARPDGGWD TTRIEQAHPVGVTSV SWAPSMAPGALVGSGLLDPVQKLASGGCDNTVKVWKLYNGTWKMD CFPALQMHSDWVRDVAWAPNLGLPKSTIASASQ DGTVVIWTVAKEGEQWQGKVLKDFKTPVWRVSWSLTGNLLAVADGNNNVTLWNEAVDGEWQQVTTVEP
67 SEQ ID 333的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且TRP-Asp(WD)重复信号为粗体。  MKIAG LKSVENAHDESVWAAAWVPATESRPALLLTGSLD ETVKLWRPDELA LERTNAGHFLGVVSVAAHPSGVIAASA SIDSFVRVFDVDTNA TIATLEAPPSEVWQMQFDFKGTTL AVAGGGSASIKLWDTATWELNATLSIPRPEQPKPSEKGNKKFVLSVAWSPDGRRLACGSMDGTISIFDVARAK FLHHL EGHFMPVRSLVFSPVEPRLLFSASDDAHVHMYDSEGKS L VGSMSGHASWVLSVDVSPDGAALATGSSDRTVRLWDLSMRA AVQTMSNHSDQVWGVAFRPMGAAGVRAGGRLASVSDDKSISLYDYS
    68 SEQ ID 334的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。  MEIDLGNLAFDVDFHPSEQLVASGLITGDLLLYRYGDGSSPEKLLEVRAHGESCRAVRFINDGKAILTGSPDCSILAT DVET GSVVARVENAHEAAVNRLVNLTESTIATGDDNGCI KVWDTRQ RSCCNTFSAHEDFISDMTFASDSMKLVVTSGD GTLSVCNLR SNKVQTRSEFSEDELLSVVIMKNGRKVVCG TQSGTLLLYSWGFFKDCSDRFVDLSPSSVDALLKLDEDRIIAGTENGLISLIGILP NRIIQPIAEHSDHPIERLAFSH DKKFLGSISHDQTLKLWDLNDILGSEDSPSSQAAIDDSDSDEMDVDANPPDSSKGNKKKHSGKGNDVGNANNFFADLGD
    69 Amino acid sequence of SEQ ID 335. The conserved G-protein beta WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signature is in bold.  MSQQPSVILATASYDHTIRFWEAKSGR CYRTIQYPDSQV NRLEITPHKRYLAVAGNPSIRLFDVNSNTPQ PVMSFDSH TNNVMAVGFQYDGNWMYSGSEDGTVRIWDLRARG CQREY ESRGAVNTVVLHPNQTELISGDQNGNIRVWDLTANSCS C ELVPEVDTAVRSLTVMWDGSLVVAANNNGTCYVWRLLRGSQTMTNFE PLHKLOAHNGYILKCLLSPEFCEPHRYLATA SSDHTVKIWNVEGFT LEKTLIGHQRWVWDCVFSVDGAYL ITASSDTTARLWSMSTGQDIRVYQGHHDATTCCALHDGAEGSDG
    70 Amino acid sequence of SEQ ID 336. The conserved G-protein beta WD-40 repeat domains are underlined.  MEDAMDMEVEVEVEAEEHSPSSSNPSGSSFRRFGLKNSIQTNFGSDYVFEITPKFDWSLMGVSLSSNAVKLYSPTT GQ YCGECRGHSDTVNGISFSGPSSPHVLHSCSSDGTIRAWDTRSF KEVSCISAGPSQEIFSFSFGGSSDSLLSAGCKSQI LFWDWRNKKQVACLEDSHVDDVTQVCFVPHHQNKLISASVDGLICIFDTAGDINDDEHMESVINVGTSIGKVGIFGQTFEKLWCLTHIETLSVWDWKEGTNEANFEDARKLASDSWSLDHIDYFVDCHSAEEGEGLWVIGGTNAGTLGYFPVKYKGGAAI GSPEAVLGGGHSDVVRSVLPMSGMAGTTSKTRGIF GWTGGEDGRLCCWLSDDSSATSRSWMSSNLVLKSSRSHHKKNRHQPY
    71 SEQ ID 337的氨基酸序列。保守G蛋白β域为下划线,且WD-40重复域为粗体。  MSQHQEYPMEYAADDYDVGEVEDDMYFHERVMGDSDTDEDEEYDHLDNKITDTSAADARRGKDIQGIPWERLSVTREKYRRTRIEQYKNYENVPQSGESSEKDCKPTRKGGNYYEFWRNTRSVKSTILHFQLRNLVWSTTKHDVYLMSHFSIIHMSSLTCKKTEVLDVYGHVAPREKHPGSLLEGFTQTQVSTLAVRDKLLIAGGFQGELICKNLDRPGVSYCCRTTYDDNAITNAVEIYDYPSGAVHFMASNNDCGVRDFDMEKFELSRHFTFPWPVNHTSLSPDGKLLVIVGDNPEGIVVDSQR GKTIRP LQGHLDFSFABAWHPDGHIFATQNQDKTCRIWDIRNLSK SVAVLKGNLGAIRSIRITSDGRRMAMAEPADEVBVYDVKSGYEKEQEIDFFGEISGVSFSPDTESLFVGVWDRTYGSLLQYNRCRNYSYLDSM
    72 SEQ ID 338的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGASSDPNPDVSDEHQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLDHPAAAAAVPNRVEIVQLDDSTGEIRADPNLSFDHPYPATKAAFVPDKDCQRADLLATSSDFLRIWRIADDSSRVDLRSFL NGNKNSEFCRPLTSFDWNEAEPKR IGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAWGGV SVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIMDSAKVVVLDIRYPTMP VVELQRHQ ASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMAQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSLKLQ
    73 SEQ ID 339的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MRGGGGGGDATGWDEDAYRES VLKEREVQTRTVFRAAFA PSPSPSPSPDAVVVASSDGSVASYSISACLSDHRLQSLRFADAKSQNVLEAE PACFLQGHDGPAYDVKFYGEGEDSLL LSCGDDGRIRGWMWRDITSSEAHDHSQGNSAKPVLDLVNPQSRGPWGALSPIPENNALAVDVKRGSIYAAAGDSCAYCMDVECGK IKTVFKGHSDYLHCIAARNSSSQIITGSEDGT ARIWDCRSGKCVQVIDP DKDHKKGFFASVSCLALDASES WLVCGRGRDLSVWSISASDCIAKISTNAPAQDVLFDDNQILLVGAEPLISRLDNNGAVLSQIHCAPQSVFSVSLHQSGVTAVGGYGGLVDVISQFGSHLCTFRCKCI
    74 SEQ ID 340的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MEAPIIDPLQGDFPE VIEEYLEHGIMKCIAFNRRGTLLA AGCTDGSCIIWDFETRGVA KELRDKECTAAITSVCWSKY GHRILVSASDKSLILWDVLSGEKIAHTTLQHTVLQACLHPGSSTPSICLACPFSSAPMIVDLNTGSTTALPVLTADVSNGATPLSRNKTSDTSVTYSPCNACFNKHGDLVYAGTSKGEILIIDHKNVRV CAIVLVSGGAVIKNVVFSRNGQYMLTN SNDRLIRIYKNLLPPKDGLKMLDELNESFNESDDVEKLKAIGSKCLEILHEFQDSITRVQWKAPCFSGDGEWVIGGAASRGEHKIYIWDRAGH LVKILEGPKEALMDLAWHPVHPII ISVSLTGLVYIWAKDYTENWSAFAPDFKELEENEEYVEREDEFDLVPETEKVKGLDVHEDDEVDVLTVERDSVFSDSDMSQEELCFLPAVPCLDIPSQQDKCVGSCSKLPDGNHSGSPLSVEAGQNGNASNHNSSPLEPMENSTADDTDGVRLKRKRKPSEKGLELQAEKVKKPVKPLKSSGRLSKTNKPVIDPDSSNGVYGDDGSD
    75 SEQ ID 341的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MRGVSWPEDGNNPSTSSSSQRNQQQAHAPRAVSGHAASHPSASNIFKLLVQREVSPRSKHSSKKLWREASKCQPYPFQQSCEAVRDVRQGLISWVESASLRHL SAKYCPLVPPPRST IAAAFSPDGKILASTHGDHTVKLIDSQT GSCLKVLRGHR RTPWVVRFHPLYPEILASGSLDHEVRLWDANT AECIGSR NFYRPIASIAFHARGELLAVASGHKLYIWHYNRRGETSSPTIVLRTQRSLRAVHFHPHAAPFLLTAEVNDLDSADSAMTLATSPGYLHYPPPTVYFADAHSHERSRLADELPLMPLPLLMWPSFTRDDGRVPLQRIDGDVGLNGQQRVDSSSSVRLWTYSTPSGQYELLLSPVESGNSPSMPEETGNNAFSSAVEAEVSQSAMDTVEDMEVQPEERNTQFFSFSDPRFWELPLLHGWLVGQTQAGPRSVRQSSPGDIETQSAFGEVASVSPITSGVMPVSMDPSRFGGRSGSRYRSPGSRGVHVTGPNNDGPRDENDPQSVVSKLRSELAASLAAAASTELPCTVKLRIWPHDVKDPCAQLDLESCRLTIPHAVLCSEMGAHFSPCGRFLAACVACVLPHLESDPGLHGQVNQDVTGVATSPTRHPISAHQIMYELRIYSLEEATFGIVLASRPVRAAHCLTSIQFSPTSEHLLLAYGRRHSSLLKSIVIDGENTVPIYTILEVYRVSD MELVRVLPSAEDEVNVACFHPSVGGGLIYGTKEGKLRILHYDSSHGLNLRSSGFLDENVPEVOTYALEC
    76 SEQ ID 342的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。  MDSAVAIAALSLVVGAAIALLFFGNYFRKRRSEVVAMAEADLQPHPKNPSRPPPQPAAKKVHAKSHAHGADKDKNKRHHPL DLNTLKGHGDSVTGLCFASDGRSLATACADGVVRVF KLDDASNKSFKFLRINLPAGGHPTAVAFGDGVSSVIVASQHLSGCSLYMYGEEKPTNLDSNKQQTKLPMPEIKWEHHKVHEQKAILTLSGAAANYDSGDGSTIIASCSEGTDIIIWHAKTGK ILGNVDTNQLKNTMSAISPNGRFIAAAAFTADVK VWEIVYSKDGSVKGVT KVMQLKGHKSAVTWLCFTPNSEQ IVTASKDGSIRIWNINVRYHLDEDTKTLKVFPIPLQDSSGTTLHYERLSLSPDGKILAATHGSMLQWLCIETGKV LDT AEKAHDGDITCMSNAPQSIPTGDKKVNVLATASGDKKVKLWAAPPLPS
    77 SEQ ID 343的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MEVEPKKASKTFPVKPKLKPKPRTPSGKTPESKYWSS FK TTHPLDNLSFSVPSLAFSPSPPHLLAAAHSATVSLFSPH RTTISSFSDVVSSLSFRSDGQLLAASDLSGLIQVFDVRS RTPLRRLRSHARPVRFVRYPVLDKLHLVSGGDDALVKYW DVAG ESVVSELRGHKDYVRCGDCSPADANCFVTGSYDHV  VKLWDVRVRD GNRAATEVNHGSPVQDVIFLPSGSLVATA GGNSVKIWDLIGG GRMVYSMESHNKTVTSICVGTMGAQQ SGEEGVQLRILSVGLDGYMKVFDYSRMKVTHSMRFPAPLLSIGFSPDSNVRAIGTSNGILYVGKRKAKENAEGGANGILGLGSVEEPRRRVLKPSFYRYFHRGQSEKPSEGDYLVMRPKKVKLAEHDKLLKKFQHKNALISVLGGNDPEKVVAVMEELVARRALLKCVLNLDADELGLILTFLHKNSTVPRYSSLLLGLAKKVIDLRLEDIRASDALKGHIRNLKRSVDEEIRIQEGLQEIQGMVSPLLRIAGRR
    78 SEQ ID 344的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MQGGSSGVGYGLKYQARCISDVKADTDHTSFLTGTLSLKEENEVHLLRLSSGGTELICEGLFSHPSEIWDLSSCPFDQRIFSTVFSTGESYGAAVNQIPELYGQLNSPQ LEKIASLD AHSRKISCVLWWPSGRHDKLVSIDEENIFLWGLDCSKKS AQVQSQESAGMLHNLSGGAWDPHDVNTVAATCESSIQFW DLRTMKKANSLESVHARDLDYDMRKKHLLVTSEDESGVRVWDLRMP KAPIQEFPGHTHWTWAVRCNPDYEGLILSAGT DSAVNLWWSSTASSDELISERLIDSPTRKL DPLLHSYND YEDSVYGLAWSSREPWIFASLSYDGRVVVESVKPFLSRK
    79 SEQ ID 345的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MAEEEGSAELEQQLEEEFAVWKKNTPILYDLLISHALEWPSLTVHWAPLLPQPSSSAAAAAGDPSLAAHRLVLGTHTSDGAPNFLILADALLPSSESDHCGDDAVLPKVEISQKIRVDGEVNRARFMPQNHNIVGAKTNGCEVYVFDCSKQAAKQHDGGEDPDLRLTGHDGEGYGLSWSPLKENYLLSASHDKKICLWDISAAAQDKV LGAMHVFEAHEGAVGDASWHSKNDNL FGSAGDDCQLMIWDLRT NKAQQCVKAHEKEVNSVSFNSY NDWILATASSDTTVGLFDMRKL TTPLHVFSSHEGEVLQV EWDPNHEAVLASSSEDRRVMVWDLNRIGDEQQEGDASDGP AELLFSHGGHKAKISDFSWNKNEPWVISSVAEDNSVQVWQMAESICGDDDDMQAMEGYI
    80 SEQ ID 346的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGNYGEEDEDQYFDALEETASVSDRGSNSSDCCSSGSGLDENVLDSLGFEFWTKFPESVRARRNRFLMLTGLGIEANSVDKEDAFPPSCNEIEVYTCKVTRDDGAVQRSLDSYNCISLLQSSTSIRSNQEVESLRGDSLLSSFRGRSKESDDLTELCGMGCPESKRNAVSEFGSVSQGSIEELRRIVASSPLVHPLLHRKLEYERELIETKQKMGAGWLRKFGSATCISGRQGDTWSDPDDLEITAGMKMRRVRAHSSKKKYKELSSLYAAQEFLAHEGSISTMKFSMDGQY LASAGEDTVVRVWKVTEEDRSERVNVTVDPSCLYFALNESTQLASLNTNKEHTGKAKTFQRSSDSSCVILPLKVFQITEKPWHEFKGHNGEVLDLSWSSKGY LLSSSTDKTVRLWRVGCDRCQRVYSHNDYVTCISFNPVNENF FISGSIDGKVRIWNVFGGQVVAYIDCREIVSAVCYRSDGKGAIVGTMTGNCLFYSIKDNHLQMDAQVYLHGKKKSPGKRITGFQFPPNDPGKLMITSADSVIRVLSGLDVVCKLKGPRNSGGPMIATFTSDGKHVISASEDSNVYIWNYAGQDKTSSRVKKIWSCESFWSSNASVALPWCGIRTVPEALAPPSRSEERRASCAENGENHHMLEEYFQKMPPYSPDCPSLSRGFFLELLPKGSATWPEEKLSDTSPPTVSSQAISKLEYKFLKSACHSVLSSAHMWGLVIVTAGWDGRIRTYHNYGLPVRS
    81 SEQ ID 347的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MDIDFKEYR LRCELRGHEDDVRGVCVCGDGSIGTSSRDR TVRLWAPSAGERRKYE VARVLLGHKSFVGPLAWVPPSEE LPEGGIVSGGMDTLVMAWDLRNG EAQTLKGHQLQVTGIV LDGGDIVSASVDCTLIRWKNGQ LTEHWEAHKAPIQAVIR LPSGELVTGSSDTTLKLWRGRT CTQTFVGHTDTVRGLAV MPDLGILSASHDGSIRLWAVSGE CLMEMVDHTSIVYSVD SHASGLIVSGSEDRFAKIWKDGV CFQSIEHPGCVWDVKF LEDGDIVTACSDGTIRIWTNQEDRMANSTELELFDLELSSYKRSRKRVGGLKLEELPGLEALQVPGTSDGQTKVIREGDNGVAYAWNSTELKWDKIGEVVDGPEDSMNRPALDGVQYDYVFDVDIGDGEPTRKLPYNRSDNPYDTADKWLLKENLPLSYRQQIVEFILANSGQRDFNLDPSFRDPYTGSSAYVPGAPSQLAAKQARPTFKHIPKKGMLVFDAAQFDGILKKINEFNNTLLSNQEKKNLSLTDIEISRLGAVVKILKDTSHYHSSKFADADFDLMLKLLESWPYEMMFPVIDIFRMVILHPDGADGLIRHQEDKKDVLMESIKRATGNPSVPANFLTSIRAVTNLFKNSAYYSWLQKHRSEMLDAFSSCSSSSNKNLQLSYATLLLNYAVLLIEKKDEEGQSQVLSAALELAENESLEVDARYRALVAIGSLMLDGLVKRIALDFDVEHIAKAARTSKEAKIAEVGADIELLIKQS
    82 Amino acid sequence of SEQ ID 348. The conserved G-protein beta domain is underlined, and the WD-40 repeat domain is in bold.  MEFTEAYKQSGPCCFSPNARFIAVAVDYRIVIRDTLSLKVVQLFSCLDKISYIEWALDSEYILCGLYKRPMIQAWSLIQPEWTCKIDEGPAGIAYARWSPDSRHILTTSDFQLRLTVWSLVNTACVHVQWPKHASKGVSFTRDGKFAAICTRHDCKDYINLLSCHNWEIMGVFAVDTLDLADIQWSPDDSAIVIWDSPLEYKVLVYSPDGR CLEKIQAYESGLGVKSVSWSPCG QFLAVGSYDQMLRVLSHLTWKTFAEFTHLSNVRAPCCAA IFKEVDEPLQIDMSELSLSDDYMQGNSGDAPEGHYRVRY DVTEVPITLPCQKPPADRPNPKQGIGLMSWSNDSQYICT RNDSMPTILWIWDMRHLKLAAILVQKDPIRAAVMDPTGT RLVLCTGSSHLYMWTPSGAYCVSVPLSQFNITDLKNNSDGSCLLLKDKESFCCAAAPLPPDESSDYSSDD
    83 SEQ ID 349的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MATIAALDDDMVRSMS IGAVFSDFVGKLNSLDFHRKDDI LVTAGEDDSVRLYDIANARLLKTTFHKKHGTDRVCFTHHPNSLICSSTKNLDTGESLRYISMYDNR SLRYFKGHKQRV VSLCMSPINDSFMSGSLDHSVRMWDLRVNACQGILRLRGRPTVAYDQQGLVEAVAMEGGAIKLFDSRSYDKGPF DAFL VGGDTSEVCDIKFSNDGKSVLLSTTNNNIYVLDAYAGDKQC GFNLEPSPSTPIEASFSPDGQYVVSGSGDGTLHAWNISRRNEVACWNSHIGVASCLKWAPRRAMFVAASTVLTFWIPNSEPELASAKGEAGVPPEQV
    84 Amino acid sequence of SEQ ID 350. The conserved G-protein beta WD-40 repeat domains are underlined, and the beta G-protein (transducin) is in bold.  MSVAELKERHRAATETVNSLRERLKQKRVQLLDTDVAGYARTQGKTPVTFGATDLV CCRTLQGHTGKVYSLDWTPERN RIVSVSQDGRFIVWNALTSQ KTHAIRLPCAWVMTCAFAP NGQSVACGGLDSVCSIFNLNSPVDRDGNLP VSRMLSGHK GYVSSCQYVPDGDAHLITGSGDQTCVLWDITTGLRTSVFGGEFQSGHTADVLSVSINGSSPRIFVSGSCDSTARMWDTRVASR AVHTYHGHEGDVNAVKFFPDGNRFGTGSDDGTCR LFDIRTGHELQVY YQQRGIDEIPHVTSIAFSISGRLLIA GYSNGDCFVWDTLLAQVVLN LGSLQNSHEGRISCLGVSADGSALCTGSWDTNLKIWAFGGIRRVT
    85 SEQ ID 351的氨基酸序列。保守G蛋白β域为下划线,且WD-40重复域为粗体。  MKKRPRGASLDQAVVDIRRREVGGLSGLSFARRLAASEGLVLR LDIYNKLKGHRGCVNTVGFNLDGDIVISGSDDRHV KLWDWQTGKVKLSFDSGHLSNVFOAKIMPYTDDRSIVTC AADGQARHAQILEGGQVQTMLLAKBRGRAHKLAIDPGSP HTVYTCGEDGLVQRLDLRSNTARELFTCREVYGTHVKVV HLNAIAIDPRNPNLEVIGGSDEYARVYDIRNYKWNGSHN FGRSANYFCPSHLLGEAHVGITGLAFSGQSELLVSYNDE SIYLPTQEMGLGPDPLSASTKSVDSNSSEVTSPTAVNVD DNVTPQVYKGHRNCETVKGVGFFGPKCEYVVSGSDCGRI FIWKKKGGQLIRVMAADKHVVNCIEPNPHIPALASSGIE NDIKIWTPKAIERATLPMNVEQLKPKARGWMNRISSPRQLLLQLYSLERWPEHGGETSSGLAAGQEELTELFFALSANGNGSPDGGGDPSGPLL
    86 SEQ ID 352的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。  MSKR GYKLQEFVAHSSNVNCLSIGKKACRLFLTGGDDCK VNLWAIGK PNSLMSLCGHTNAVESVAFDSAEVLVLAGAS SGVIKLWDVEE AKLVRGLTGHRSNCTAMEFHPFGEFFASGSTDTNLKIWDIRKKGCIHTYKGHTRGISTIRFSPDGRWVVSGGNDNVVKVWDLTA GKLLHDFKFHENHIRSIDFHPL EFLLATGSADRTVKFWDLET FELIGSSRPEAAGVRAIAF HPDGRTLFCGLEDSLKVYSWEPVICHDGVDMGNSTLADLCIHDGKLLGCSYYQSSVGVWVADASLIEPYGTNVKPQQKDSGDDEIEHQESRPSAKVGTTIRSTSIMRCASPDYETKDIKNIYVDTASGNPVSSQRVGTTNFAKVTQPLDFNDTPNLTLRRQGLVTETPDGLSGHVPSKSITQPKVVSRDSPDGKDSSRRESITFSRTKPGMLLRPAHSRRPSSTKYDVDRLSACAEIGVLSSAKSGSESLVDSFLNIKVAPEDGARNGCEDNHSSVKNVSVESEKVLPLQTPKTEKCDQTVGFKEEINSVKFVNGVAVVPGRTRTLVEKFEKREKLNSTEDQTINTPENRTLDKTPPPSLAENEEKSDRLNIVERKATRMSSHMVTAEDRTPVTLVGSPEDQSTVMAPQRELPADESSKTPPLPVEDLEIHHGSNVSEDKATILSSQTVSEEDSKRSTLIRNFRRRDRFKSTEGRSPVMATQRKLPTDESGKTSSLPMEDLEIKGGLNVSEDKATSFSSRAPPREDRAHSALVRNVRKRDKFKSTNDTITVMVHQRGLSTDEASTVSVERVERRQLSNNVENPLNNLPPHSVPPTTTRGEPQYVGSESDSVNHEDVTELLLGNHEVFLSTLRSRLTKLQVV
    87 SEQ ID 353的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MSTFLTGTALSNPNPNK SYEVVQPPNDSVSSLSFNPKAN FLVATSWDNQVR0WEIVRSGTSLGT TPKASISHDQPVLC STWKDEGTTVFSGGCDKQVKMWPLSGG QPMTVAMHDAPI KEISWIPEMNLLVTGSWDKTLRYWDTRQAN PVHIQQLPE RCYALTVRHPLMVVGTADRNLIIYNLQSPQTEFKRISSPLKYQTRCLAAFPDQQGFLVGSIEGRVGVHHLDDSQQSKNFTFK CHREGSEIYSVNSLNFHPVHHTFATAGSDGAFNFW DKDSKQRLKAMSRCSQPIPCSTFNNDGSIFAYSACYDWSKGAENHNPATAKTYIFLHLPQESEVKGKPRLGTTGRK
    88 Amino acid sequence of SEQ ID 354. The conserved G-protein beta WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signature is in bold.  MEVEAQQRDVNNVMCQLVDPEGTTLGPPMYLPQDVGPQQLQQMVNKLLSNEDKLPYTFYISDQELVVPLESYLQKNKVSVEKVLSIVYQPQAIFRIRPVNR CSATIAGHSEAVLSVA FSPDGKQLASGSGDTTVRLWDLSTQTP MFTCKGHKNWVL SIAWSPDGKHLVSGSKAGEIQCWDPLTGQP SGNPLVGHK KWITGISWEPVHLSSPCRRFVSSSKDGDARIWDVTLRR C VICLSGHTLAVTCVKWGGDGVIYTGSQDCTIKVWETSQGK LIRELKGHGHWVNSLALSTEYVLRTGAFDHTGKQYSSAEEMKQVALERYKKMKGNAPERLVSGSDDFTMFLWEPSVSKH PKTRMTGHQQLVNHVYFSPDGQWVASASFDKSVKLWNGITGKFVAAFRGHVGPVYQISWSADSRLLLSGSKDSTLKIWDIRTKK LKRDLPGHADEVFAVDWSPDGEKVVSGGKDKVLKLWMG
    89 Amino acid sequence of SEQ ID 355. The conserved G-protein beta WD-40 repeat domains are underlined.  MDAGSAHSSSNMKTQSRSPLQEQFLQRRNSRENLDRFIPNRSAMDFDYAHYMLTEGRKGKENPAVSSPSREAYRKQLAETLNMNRTRILAFKNKPPTPVELIPHELTSAQPAKPTKTRRYIPQTSERTLDAPDLLDDYYLNLLDWGSSNVLSIALGNTVYLWNASDGSTS ELVTIDDETGPVTSVSWAPDGRHIA VGLNNSDVQLWDSADNRL LRTLRGGHRSRVGSLAWNNHI LTTGGMDGLIVNNDVRVRSH IVDTYRGHTQEVCGLKWSA SGQQLASGGNDNILHIWDRSTASSNSPTQ WLHRLEEHTA AVKALAWCPFQGNLLASGGGGGDRTIKFWNTHTGACLNSVDTGSQVCALLWNKNERELLSSHGFTQNQLTLWKYPSMVKIAELTGHTSRVLFMAQSPDGCTVASAAGDETLRFWNVFGVPEVAKPAPKANPEPFAHLNRIR
    90 Amino acid sequence of SEQ ID 356. The conserved G-protein beta WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signature is in bold.  MEEAIPFK NLPSREYQGHKKKVHSVAWNCTGTKLASGSV DQTARVWHIEPHGHG KVKDIELKGHTDSVDQLCWDPKHA DLIATASGDKTVRLWDARSGKCSQQAELSGENINITYKPDGTHVAVGNRDDELTILDVRKFKPIHKRKFNYEVNEIAWNMSGEMFFLTTGNGTVEVLAYPS LRPVDTIMAHTAGCYC IAIDPVGRYFAVGSADSLVSLWDISE MLCVRTFTKLEWP VRTISFNHTGDYVASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKYNLLAYAGDDKNKYQADEGVFRIFGFESA
    91 SEQ ID 357的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGKDEEEMRGEIEERLINEEYKVWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKLVLGTHTSENEPNYLMLAQVQLPLEDAENDARHYDDDRADVGGFGCANGKVQIIQQINHDGEVNRARYMPQNSFIIATKTVSAEVYVFDYSKHPSKPPLDGA CSPDLRLRGHSTEGYGLSWSKFKQGHLL SGSDDAQICLWDINATPKNKS LDAMQIEKVHEGVVEDVA WHLRHEYLFGSVGDDQYLLIWDLRTPSV TKPVQSVVAHQ SEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKI STALHT  FDAHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGP PELLFIHGGHTSKISDFSWNTCEDWVVASVAEDNILQIWQMAENIYHDEDDVPGEESNKGS
    92 SEQ ID 358的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MMRGFSCTEDGDAPSTSSTSPPPPPPPPHRQQMQAPRASSSSSGQPTSRRSTGNVFKLLARREVSPRSKHSLKKFWGEASECQLCPFQQSYEAVRDVRRSLISWVEAFSLQHLS AKY CPLMPPPRSTIAAAFSPDGKIIASTHGDHTVKLIDSCT G SCLKVLRGHRRTPWVVRFHPLYPEILASGSLDHEVHLWDANT AECIGSRNFYRPIASIAFHAQGDLLAVASGHKLYIW HYNRSGETSSPTIVLRTPRSLRAVHFHPHAAPFLLTAEVNDLDLTDSAMTLATSPGYLHYPPPTIYLADAHSNERSRLEDELPLMPSPLLMWPSFTRDDGRATLPHIGGDVGLSGQQRVDSLSSGQYEFHPSPIEPSSSTSMHEEMGTDPFSSVRESEVTQSAMNIVDNTEVQPEERSTYSFSFSDPRFWELPSVYGWLVGQTQAAPRTAPSPGALETASALGEVASVSPVRSEFMPGGMDQPRLGGRSGSGCRSSGSRMMRTAGLNDHPHDENYPQSVVSKLRSELEASLAAAASTELPCTVKLRVWPYDMKDPCALFRSESCRLTIPHAVLCSEMGAHFSPCGRFFAACVACVLPQLEADPVLHGQVDPDVTGVATSPTRHPVSAYQIMYELRIYSLEEATFGMVLASRSIRAAHCLTSIQFSPTSEHLLLAYGRRHNSLLKSIVIDGENTVPIYSILEVYRVSD M ELVRVLPSAEDEVNVACFHPSVGGGLVYGTKEGKLRILQIDSSGGLNPKSTGFLDENMAEVPTYALEC
    93 SEQ ID 359的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGEGDLPRT EAGVLRGHEGAVLAARFNGDGNYCLSCGKD RTIRLMNPHRGI HIKTYKSHGREVRDVHCTSDNSKLISC GGDRQIFYWDVSTGR VIRRFRGHDSEVNAVKFNDYASVV VSAGYDRSVRAWDCRSHSTE PIQIINTFQDSVMSVCLTK TEIIGGSVDGTVRTFDIRIGR EISDDLGQPVNCISMSND GNCILASCLDSTLRLVDRSAGE LLQEYKGHTCKSYKLDC CLTNTDAHVAGGSEDGYVFFWDLVDAS VISKFRAHSSVV TSVSYHPKEDCMITASVDGTIKVWKT
    94 SEQ ID 360的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MACIKGVGRSASVAMAPDGGYLATGTMAGTVDLSFSSSASLEIFGLDFQSDDRDLPLIAESPSSERFNRLSWGKNGSGSDEFSLGLIAGGLVDGTIGLWNPLSLIRSEAGD KAIVGH LSRHKGPVRGLEFNVIAPNLLASGADDGEICIWDLAAPREPSHF PPLRGSGSAAQGEISFLSWNSKVQHILASTSYNG TTVVWDLKKQKPVISFSDSVRRRCSVLQWNPDLATQLVVASDEDSSPTLRLWDMRNI MSPVKEFAGHTRGVIAMSWCP NDSSYLVTCAKDNRTICWDTVT GEIVCELPAGSNWNFDV HWYPKIPGVTSASSFDGKIGIYNVEGCSRYGVRENEFGAATLRAPKWFKRPVGASFGFGGKVVSFHTRSTGGPSVNSSEVFVHDIITEQTLVSRSSEFEAAIQSGDRPSLRRLCEKKSQHCESTDDQETWGFLKVLLEDDGTARSKLLAELGFDIPTETNDGSQEDLSQQVNALGLEDVTADKVVQEDNNESMVFPTDNGEDFFNNLPSPRADTPVSTSADGFPTVNAAVEPSQDEVDGLEESSDPSFDDSVQRALVVGDYKAAVALCMSANKLADALVIAHVGGASLWESTRDKYLKMSRLPYLKVVFAMVNNDLQSLVDTRPLKFWKETLAILCSFAQGEEWAMLCNSLASKLMMAGNMLAATLCFICAGNIDKTVEIWSRSLATEHDGMSYMDLLQDLMEKTIVLALASGQKQFSASVCKLVEKYAEILASQGLLTTAMDYLKLLGTDDLSPELAVLRDRIAFSVEAEKGANISAFNGSQDPRGAVYGVDQSNYGMVDTSQHYYPEAAQPQVPHTVPGSPYGENYQQPFGSSFGKGYNTPMQYQAPSQASMFVPSEPPQNAQPSFVPTPVTSQPTTRSQFIPAPPLALRNPEQYQQPTLGSHLYPGSVNPTFQPLPHAPGPVAPVPPQVSSVPGQNMPQAVAPTQMRGFMPVTNPGVVQNPGPISMQPATPIESAAAQPVVSPAAPPPTVQTADTSNVPAPQKPVIATL
    95 SEQ ID 361的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MKERGKGAGRSVDERYTQWKSLVPVLYDWLANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANVEVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRIRELPQNSKIVATHTDSPDVLIWDVETQPNRHAVLGASTSRP DLILTGEKDNAEFALAMSPTEPFVLSGGKDRYVVLW SIQDHISTLAADPGSAKSPGSAGTNNKQSSKAAGGNDKTGDSPSIE PRGVYLGHGDTVEDVTFCPSSAQEFCSVGDDS CLILWDARTGSSP AIKVEKAHHADLHCVDWNPHDVNLIL TGSADNTVRMFDRRNLTSGGVGS PVHTFEGHNAAVLCVQ WSPDKSSVFGSSAEDGILNIWDHEKIGRKIETVGSKVPNSPPG LFFRHAGHRDKVVDFHWNSSDPWTIVSVSDDGESTGGGGTLQIWRMIDLIYRPEEEVLAELDKFKSHILSCTS
    96 SEQ ID 362的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。  MAKIAPGCEPVAGTLTPSKKREYRVTNRLQEGKRPLYAVVFNFIDSRYFNVFATVGGNRVTVYQCLEGGVIAVL QSYI DEDKDESFYTVSWACNIDRTPFVVAGGINGIIRVIDAGNEK IHRSFVGHGDSINEIRTQPLNPSLIVSASKDESVRLW NVHTGICIL IFAGAGGHRNEVLSVDFHPSDKYRIASCGMDNTVKIWSMKEFWTYVEKSFTWTDLPSKFPTKYVQF PVF IAPVHSNYVDCNRWLGDFVLSKSVDNEIVLWEPKMKEQSPGEGSVD ILQKYPVPECDIWFIKFSCDFHYHSIAIGNRE GKIYVWELQSS PPVLIAKLSHPQSKSPIRQTAMSFDGST ILSCCEDGTIWRWDAITASTS
    97 SEQ ID 363的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MNTAMHFGAGWRSIAEMGYTMSRLEIEPESCEDEKSLDGVGNSQGPNELPRCLDHELAHLTNLKSRPHEHLIRDFPGRRALPVSTVKMIAGRECNYSRRGRFSSADCCHMLSRYVPVNGPSPLDQMNSRAYVSQFSADGSLFVAGFQGSHIRIYNVDKGWKCQKNILTKSLEWTITDTSLSPDQRYLVYASMSPIVHIVDIGSAAMDSLANITEIHEGLD FSADSGPYSFGIFS VKFSTDGREVVAGSSDDSIYVYDLVANK LSLRIPAHESD VNTVCFADESGHIIYSGSDDTYCKVWDRRCLSARNK PAG VLMGHLEGITFIDSRGDGRYFISNGKDQTIKLWDIRKMGSDICRRGFRNFEWDYRWMDYPPRARDSKHPFDL SVATYK GHSVLRTLIRCYFSPVHSTGQKYIYTGSHDSCVYIYDVVTGA QVAALKHHKSPVRDCSWHPEYEMIVSSSWDGDIVKWEFFGNGETEIPAMKKRIRRRHLY
    98 SEQ ID 364的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MEPQPQAPKKRGRKPKPKEDKKEEQLHQPPPPPPPQQQAAPAPAPAATRSSTSGSAGGRDRRPQQQHAVDEKYARWKSLVPVLYDWLANHNLLWPSLSCRWGPQLEQATYKNRQRLYISEQTDGSVPNTLVIANCEVVKPRVAAAEHVSQFNEEARSPFIRKYKTIIHPGEVNRVRELPQNPNIVATHTDSPDVLIWDVESQPNRHAVYGATASRPNLILTGHQENAEFALAMCPAEPFVLSGGKDKTVVLWSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPS VGPRGIYCGHEDTVEDVAF CPSTAQEFCSVGDDSCLILWDARVGT NPVAKVEKAHNGD LHCVDWNPHDNNLILTGSADNSVNMFDRRNLTSNGV GSP VYKFEGHKAAVLCVQWSPDKPSVFGSSAEDGLLNIWDYERVDKKVDRAPNAPAGLFFQHAGHRDKIVDFHWNAADPWTMVSVSDDCDTAGGGGTLQIWRMSDLIYRPEEEVLAELENFKAHVLECSKA
    99 SEQ ID 365的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。Utp21特异性WD40相关推定域为斜体。  MGIFEPYRAVGYITTGVPFSVQRLGTETFVTVSVGKAFQVYNCAKLSLVLVGPQLPKKIRALASYREYTFAAYGSDIGIFKRAHQLATWSGHTAKVCLLLLFGEHILSVDVDGNAYIWAFKGMNYNLSPVGHILLDSNFTPSCIMHPDTYLNKVILGSQEGPLQLWNISTKTKLYEFKGWNSSVSSCVSSPALDVVAVGCADGKIHVHNIRYD EELVTFSHSMRGSVTALSFST DGQPLLASGSSSGVVSIWNLDKRRLQSVIRDAHDGSIISLHFFANEPVLMSSSADNSIKMWIFDTSDGDPRLLRFRSGHSAPPLCIRFYANGRHILSAGQDRAFRLFSVVQDQQSRELSQRHVSKRAKKLKLKEEEIKLKPVIAFDVAEIRERDWCNVVTSHMDTPQAYVWRLQNFVIGEHILRPCPNKPTPVKACMISACGNFAILGTAGGWIERFNLQSGISRGSYIDQLEGTNSAHDGEVVGVACDATNTLMISAGYAGDIKVWDFKGRELKSRWEIGSSLVKISYHRLNGLLATVADDFIIRLFDAVALRMVRKFEGHTDRITDLCFSEDGKWLLSSSMDGSLRIWDIILARQVDAVFVDVSITALSLSPNMDILTTHVDQNDGVFLWVNQSMFSGDSDINLYASGKEVVTVKLPSVSSVEGSQVEESNEPTIRHSESKDVPSFRPSLEQIPDLVTLSLLPKSQWQSLINLDIIKVRNKPVEPPKKPEKAPFFLPSIPSLSGEILFKPSEMSDKGDMKADEDKSKITPEVPSSRFLQLLHSCSEAKNFSPFTTYIKGLSPSTLDLELRMLQIIDDDAVDADADDPQDVDKRQELLSIELLMDYFIHEISCRSNFEFVQALVRLFLKIHGETIRRQSVLQNKAKVLLETQCSVWQRVDKLFQGARCMVAFLSNSQF
    100 SEQ ID 366的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MEETKVTCGSWIRRPENVNLAVIGRSPRRRGSAALEIFAFDPKSTSLSSSPLVAHVIEEIEGDPLAIAVHPNGEDIVCFASSGSCLSFELSGQESNLKLLTK ELPPLRGIGPQKCMA FSVDGSRPATGGVDGRLRILEWPSLRI ILDEPKAHKSIR DLDFSLDSEFLATTSTDGSARIWKAEDGLPCTTLTRRSDEKIELCRFSKDGTKPFLFCTVQRGDKAVTGVWDISTWNKIG HKRLLRKPAVVMSISLDGKYLAQGSKDGDMCVVEVKKMEVSHWSKRLHLGTSLTSLEFCPIERVVITTSDEWGVLVTKLNVPADWKAWQVYLLLLGLFLASLVAFYIFYENSDSFWGFPLGKDQPARPKIGSVLGDPKSADDQNMWGEFGPLDM
    101 SEQ ID 367的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MADPVEHQHQQHQQHQLQQQRRRGWRLQGGQYLGEISALCFLHLPPPPLSLSSSPVLSLSSGLDSESRDRPACSFRFPSAGSGSQVSLFDLASGAMVRTFYVFRGIRVHGIVLGCADFPGGSSSSSSTLDYVIAVYGERRVKLFRLSVRLGRGAGEGSGTVLSADLELVSAAPRLSHWVMDVRFLKENGTSEDELQRCLTVAIGCSDNSIRLWDVDKCSFVLAVSSPERCLLYSMRLWGDNLEDLQVASGTIYNEILIWKVVPNHDAPSSNELTEEGLTNSCAGNSVHECLRYE AYHICRLVGHEGSIFRIA WSSDGSKLVSVSDDRSARIWEVHCKVQYSEDA GEVGLLF GHSARVWDCYISDNLIVTAGEDCSCRVWGLDGQQHDVIKEHIGRGIWRCLYDPWSSLLVTGGFDSAIKVHKLDASLAEASAKQSNIKDLSDGTELFTTHLPNSSGHSGHMDSKSEYVRCLSFSCEDVMYIATNHGYLYHAKLCNDGDLRWTELAQVSNEVQIICMELLPSNPYDPRIDADDWVAVGDGKGWTTVVRVVKNSDSPKVSTSFSWAAEMDRQLLGIHWCKSLGHRFIFTADPRGALKLWRFFEVSQSSSLYPENSPRISLIAEFKSDLGARIMCLDVAFESELLICGDLRGNLVLFPLLKDLLLDTFVVSAAKISPVNHFKGAHGISAVSSISVAHMSFNHIELRSTGADGCICYMEYDKGLQSLNFVGMKQVDELSMIESVSTENESTGYRTSGSYASGFASTDFIIWNLVTEAKVLQVSCGGWRRPHSYYLSDVPEMKNCFAYVKDDIIYIRRHWIKDSKDKILPQNLRLQFHGREVHSLCFVTGDFQLRKNKQSSWIVTGCEDGTVRLTRYTQCTDNWSSSKLLGEHVGGSAVRSICCVSNIHTTSSGTSVSDVKGIENLPKDIKGTLMEDECNPSLLISVGAKRVLTSWLLRRRKQDGKEDDVTDLQEAENSSLPSSAGSSTFSFQWLSTDMPVKYSVPSKKSGSIKKLIGVSDTNVRCKSL
    102 SEQ ID 368的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MPYK LSATLSNHSSDVRAVASPSDDLILSASRDSTAISW FRQSPSSFT PASVIRAGSRFVNAIAYLPPTPRAPQGYAV VGGQDTVVNVFALGPGDKEE PEYTLVGHTDNVCALSVNS  DDTIISGSWDKTAKVWKDFA LVYDLKGHQQSVWAVLAMN EKEFLTASADRTIKYWVQHK TMQTYEGHRDAVRGLALIP DIGFASCSNDSEIRVWTMGGD VVYTLSGHTSFVYSLSVL PNGDLVSAGEDRSVRVWRDGE CSQVIVHPAISVWAVSTM PNGDIISGSSDGVVRVFSESEKRWATASELKALEDQIASQSLPSQQVGDVKKTDLPGPEALSVPGKKAGEVKMIRSGDVVEAHQWDSLASSWQKIGEVVDAIGSGRKQLHDGKEYDYVFDVDIQEGAPPLKLPYNVSENPYTAAQRFLEQNDLPTGYLDQVVKFIEQNTAGVKLGNDGYVDPFTGASRYQPATQSTSNTASSSYMDPFTGGSRHIAESAPSNVPQGSHATGIIPFSKPIFFKLANVSAMQAKMPQFDEVLRNEISTATLAMRPDEVIMVNETFTYLSKVVTSTSSARTSLGWIHIETIMQILDRWPVPQRFPVIDLGRLVTAYCMNAFSGPGDLEKFFSCLFRTSEWTSITSGSKALTKAQETNVLLLFRTIANSLDGAPLNDMEWIKQIFRELAQTPQLVLNKSHRLALASVLFNFSCIGLKGPVPADVRTLHLTIILQVLRSPNDDPEVAYRTCVALGNMLYSDKTRGTPRDAQSPSPTELKSAVAAIKGGFSDPRINDVHREIMSLI
    103 SEQ ID 369的氨基酸序列。保守G蛋白β域为下划线,且WD-40重复域为粗体。   MPPQKIESGHKDTVHDLAMDYYGKRLATASSDHTINVVG VSSSGSQHLATLIGHQGPVNQISWAHPKFGSLLASCSYD GRVTIWREGNPNEWTQAQVFIEHKSSVNSVAWUPHKLGL CLACGSSDGNISVFTARQDGGWDTSRIDQAHFVGVTSVS WAPSTAPGALVGSGMMEPVQKLCSGGCDNTVKVWKLYNR VWKLDCFPVLOMHTDWVRDVAWAPNLGLPKSTIASASQD GRVIIWTLAKEGDQWQGKVLYDFRTPVWRVSWSLTGNILAVADGNNVSLWNNEAVDGEWIQVSTVEP
    104 Amino acid sequence of SEQ ID 370. The conserved G-protein beta WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signature is in bold.  MSAPMLEIEARDVVKIVLQFCKENSLHQTFQTLQSECQVSLNTVDSIETFVADINSGRWDAILPQVAQLKLPRNTLEDLYEQIVLEMIELRELDTARAILRQTQAMGVMKQEQPERYLRLEHLLVRTYFDPNEAYQDSTKEKRRAQIAQALAAEVTVVPPSRLMALVGQALKWQQHQGLLPPGTQFDLFRGTAAMKQDVDDMY PTTLSHTIKFGTKSHAECARFSPDGQFLVSC SVDGFIEVWDYMSGKLKKDLQ YQADETFMMHDDPVLCVD FSRDSEMLASGSQDGKIKVWRIR TGQCLRRLERAHSQGV TSVLFSRDGSQLLSTSFDGSARIHGLK SGKQLKEFRGHS SYVNDAIFSNDGSRVITASSDCTVKVWDVK TSDCLQTFK PPPPLRGGDASVNSVHLFPKNADHIVVCNKTSSIYIMTLQGQVVKSLSSGKREGGDFVAACVSPKGEWIYCVGEDRNL YCFSCQ SGKLEHLMKVHEKDVIGVTHHPHRNLVATYSEDSTMKLWKP
    105 SEQ ID 371的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MDLLQSYAEDNDGDLGRHSSPEPSPPRLLPSKSAAPAVDDTTLALTVAQTNQTLARPIDPSQHAVAFNPTYDQLWAPICGPAHPYAKDGIAQGMNHKLGFVEDAAIGSFLFDDEQYNTFQRYGYAADPCASTGNEYVGDLDALKQNDGISVYNIRQQEQKKYAEEYAKKKGEERGEGGREKAEVVSDKSTFHGKEERDYQGRSWIAPPKDAKATNDHCYIPKR LVHTWSGHTKG VSAIRFFPKHGHLILSAGMDTKVKIWDVFNSGK CMRTYM  GHSKAVRDISFCNDGTKFLTAGYDKNIKYWDTETGK VIS TFSTGKIPYVVKLHPDDEKQNILLAGMSDKKIVQWDMNTGQ ITQEYDQHLGAVNTITFVDDNRRFVTSSDDKSLRVWEFGIPVV IKYISEPHMHSMPSISLHPNTNWLAAQSLDNQI LIYSTRERFQLN KKKRFAGHIVAGYACQVNFSPDGRFVN SGDGEGRCWFWDWKSCK VFRTLKCHEGVCIGCEWHPLEQSKVATCGWDGLIKYWD
    106 SEQ ID 372的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MESNGNLEQTLQDGRIYRQLNSLIVAHLRDHNFPQAASAVALATMTPLNVEAPRNRLLELVAKGLAVEKGELLRGVSHAGTNDLGGSIPASYGLVPAPWTAIDFSSLRDTKGMSKSFTKHETRHLSDHKNVARCARFSTDGRFFATGSADTSIKLF EVSKIKQMMLPDSTDGA IRAVIRTFYDHTHPVNDLDFHP QNTVLISAAKDHTVKFFDYSKAT AKRAFRVIQDTHNVRS VAFHPSGDFLLAGTDHPIPHLYDVN TFQCYLSANVPEFA VNAAINQVRYSSSGGMYVTASKDGTIRFWDGA SANCVRS IAGAHGAAEVTSANFTKDQRYVLSCGKDSTVKLWEVGTGRLVKQYLGATHMQLRCQAVFNNTEEFVLSIDEPSNEIVVWDAM TAEKVARWPSNHNGPPRWIEHSPTEAAFVSCGTDRSIRFWKETH
    107 SEQ ID 373的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MSNFQGEDGEYVADDFEAEDGDEELHGRESADPESDVDEIDTPSNRFTDTTADQARRGRDIQGIPWERLSITREKYRRTRLEQYKNYENVPQSGEKSGKDCTVTEKGNSFYEFRRNSRSVKSTILHFQLRNLVWATSKHDVYLMSNYSVVHWSSLTGKKSEVLNLAGHVAPNEKHPGSLLEGFTQTQVSTLAVKDRFLVAGGFQGELICKFLDRPGISFCSRTTYDDNAITNAVEIYVSPSGGIHFIASNNDCGVRDFDME NFELSKHFRFPW PVNHTSLSPDGKLLVIVGDDPEGILVDAK TGKTIMPLRG HLDFSFASEWHPDGVTFATGNQDKTCRIWDIRN LSKSIA VLKGNLGAIRSIRYTSDGRYMAIAEPADFVHVYDTKTGYKKEQEIDFFGEISGMSFSPDTESLFIGVWDRTYGSLLEYGRRRNFSYLDCLV
    108 SEQ ID 374的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且剪接因子基序为粗体。  MGVEEDLEDLNALAESTDAAVDGQAALASAVDSVTLQPAPPILPPVIPPPAVPVVAPVPTIPPVLRPLAPLPIRPPVLRPPAPKRDEAGSSDSDSDHDGTAAGSTAEYEITEESRLVRERHEKAMQDLMMKRRGAALAVPTNDKAVRARLRRLGEPMTLFGERRMERRDRLRMIMAKLDAHGQLEKLMKAHEDEEAAASAAPEDVEEEMLQYPFYTEGSKALFNARIDIAKFSITRAALRLERARRRRDDPDEDVDAEIDWALKKAESLSLHCSEIGDDRPLSGCSFSHDSKLLATCSMSGVAKLWDTCRMPQVNRVLTLKGHTERATDVAFSPVQNH IATASADRTAKLW NTEGTILKTFEGHLDRLGRIAFHPSGKY LGTTSFDKTWR LWDIESGEELLLQEGHSRSIYGIDFHRDGSLVASCGLDALARVWDLRTGRSILALEGHVKPVLGVSFSPNGYH LATGG EDNTCRIWDLRKKKSLYTIPAHANLISEVKFEPQEGYFLVTASYDTTAKVWSARDFKPVKTLSVHEAKITSVDITADASHIVTVSHDRTIKLWTSNDDVKDQAMDVD
  109 SEQ ID 375的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且保守Dip2/Utp12域为粗体。  MVKAYLRYEPAAAFGVIASVESNIAYDASGKHLLAPALEKVGVWHVRQGVCTKALAPSASSAAGPSLAVTAIASSPSSL IASGYADGSIRIWDFEKGSCETTLNGHKGAVSVLRYGKLGSL LASGSKDNDIILWDVVGETGLYRLRGHRDQVTDLVFLDSDKK LVSSSKDKYLRVWDLETQHCMQIVGGHHSEIWSLDTDPEERYLVTGSADPELRFYTVKNDSSDERSEADASGGVGNGDLASHNKWDVLKQFGEIQRQSKDRVATVRFNKNGNLLACQAAGKLVEVFRVLDEAEAKRKAKRRLHRKREKKGADVNENGDSSRGIGEGHDTMVTVADVFKLLQTIRASKKICSISFCPVAPKSSLATLALSLNNNLLEFHSIEADKTSKMLTIELQGHRSDVRSVTLSSDNTLLMSTSHNSVKIWNPSTGSCLRTIDSGYGLCGLIVPQNKHALIGTKDGAIEIFDVGSGTCIEVVEAHGGSIRSIVAIPNQNGFVTGSADHDIKFWEYGMKQKPGDNSKHLTVSNVRTLKMNDDVLVVAVSPDAQKIAVALLDCTVKVFFMDSLKLMHSLYGHRLPVLCLDISSDGDLIVTGSADKNLMIWGLDFGDRHKSIFAHGDSIMAVQFVGNTHYMFSVGKDRLVKYWDADKFELLLTLEGHHADIWCLAISNRGDFLVTGSHDRSIRRWDRTEEPFFIEEEKEKRLEEMFSSDLDNAFGNKYVPKEEIPEEGAVALAGKKTOETLSATDSIIEALDLAEVELKRIAEHEEEKNNGKTAEWHPNYVMLGLSPSDFILRALSNVQINDLEQTLLALPFSDALKLLSYLKDWTTYPDKVELVSRIATVLLQTHYNQLVSTPAARPLLTTLKDILHKKVKECKDTIGFNLAAMDHLKQLMALRSDALFQDAKVKLLEIRSQISKRLEERTDPRKAKRRKKKQKKSTMMHAWP
  110 SEQ ID 376的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MGGVQAEREDKDKVSLELTEEILQ SMEVGMTFRDYSGRI SSMDFHRASSYLVTASDDESIRLYDVASATCLKTINSKKYGVDLVSFTSHPMTVIYSSKNGWDESLRLLSLH DNKYLR YFKGHHDRVVSLSLCPRNECFISGSLDRTVLLWDQRAEKCQGLLRVQGRPATAYDDPGLVFAIAFGGCVRMFDARKYEK GPFEIFSVGGDVSDANVVKFSNDGRIMLLTTTDGHIHV LDSF RGTLLYTFNVKPTSSKSTLEASFSPEGMFVISGSG DGSVYAWSVRGGKEVASWLSTDTEPPVIKWAPGNLMFATGSSELSIWIPDLSKLGAYVGRK
  111 SEQ ID 377的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MAAFGAAPAGNHN PNKSSEVIQPPSDSVSSLCFSPRANH LVATSWDNQVRCWELTKNGASV TSVPKASMSHDQPVLCS AWKDDGTTVFSGGCDKQAKMWSLMS GGQPVTVAMHDAPI KEIAWIPEMNVLVTGSWDKTLKYWDTRQSNPVHTQQLPERCYAMTVRYPLMVVGTADRNLIVFN LQNPQAEFKRFSSP LKYQTRCVAAFPDQQGFLVGSIEGRVGVHHLDDS QISKN FTFKCHRDNNDIYSVNSLNFHPVHHTFATAGSDGTFNFW DKDSKQRLKAMSRCSQPIPCSTFNNDGTIYAYSVCYDWSKGAENHNPATAKTYIFLHLPQESEVKAKPRVGTTNRK
  112 SEQ ID 378的氨基酸序列。保守G蛋白βWD-40重复域为下划线。  MNCSISGEVPEEPVVSTKSGHVFERRLIERYVSDYGKCPVSGEPLTMDDVLPVRMGKIVKPRPLQAASIPGLLSIFQNEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKKERDEARSLLALAERQIPMTASSDIAVNAPAMSNGRKASLDEEPGYAGKKMRPGISASIIAEITDCNLALSQQRKKRQIPSTLAPVEDLE RYTQLSSYPLHKTGKPGITSLDICH SKDIIATGGIDTSAVLPDRS SGQIMSTLSGHSKKVTSVN FDAQGDMVLTGSADKTVRIWQSSEDG SYNCRHILKDHTA EVQAITVHATNNYFATASLDNTWCFYEFS TGLCLTQVEG ASGSEGYTSAAFHPDGLILGTGTSNADVKIWDVK TQANV TTFSGHTGAITAISFSENGYFLATAAQDGVKLWDLR KLK NFRTFSAYDKDTGTNSVEFDHSGCYLGLAGSDIRVYQVASVKS EWNCVKTFPDLSGTGKVTCVKFGPDSKYIAVGSMDHNLRIFGLPSEDGAMES
  113 SEQ ID 379的氨基酸序列。保守G蛋白β域为下划线,且WD-40重复域为粗体。  MAAPGVETLKKEIKELKEKIAQHRLDTDGEQPLPAAAKSKSVFEVSAA LKQRRILKGHFGKIYALHWSADSRHLVSAS QDGKLIIWNGFTTNKVHAIPLRSSWVMTCAYSPSGILVA CGGLDNLCSVYKVPHGGNKESSSAQKTYGSLAQHEGYLS CCRFIKDNEIVTSSGDSTCILWDVETKTPKAIENDHTGD VMSLAVEDDKGVFVSGSCDATAKLWDHRVHKQCVMTFQG HESDINSVQFFPDGDAFGTGSDDSSCRLFDIRAYQQINK YSSDKILCGITSVAFSKTGKSLFAGYDDYNTYVWDTLSG NQVSVLTGHENRVSCLGVSEDGKALATGSWDTLLKIWA
  Entry   Sequence description   Annotated peptide sequence
  114 SEQ ID 380的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MGGVEDESEPASKRMKLSSRVGRGLANGSSRTEPAAGSSLDLMARPLPIEGDEEVIGSKGVIKRVEFVRLIAKALYSLGYEKSGARLEEESGIPLQSSVVNLFMQQISDGLWDESVVTLHKIGLSDENLVKSASFLILEQKFLELLDQEKAMDALKTLRTEITPLCIKNSRVRELSSCIISPSSCGLLNQNKRNSTRARSRSELLEEIQKLLPPAVIIPERRLEHLVEQALVLQTDACMLHNSIDMEMSLYTDHQCGKEHI PCRTLQILQSHN DEVWLVQFSHNGKYLASASNDRSAIIWEVDENG SVSLKH KLTGHQKPISSVCWSPDDRQLLTCGVGETVRRWDVS SGE CLRVYEKAGHGLISCAWFPDGKNICYGVSDRSICMCDL E GKEIECWKGQRTLSISDLEITSDGKQIISICRETAILLL DR EAKYKRMIEENQTITSFSLSKDNRYLLVNLLNQEIHL WDIKG DFRLVAKYKGLKRSRFVIRSCFGGLKQAFVASGS EDSQVYIWHKG SGELIEPLPGHSGAVNCVSNNPANHHMLASASDDRTIRIWGLNELNTRHKGARPNGVHYCNGNGTS
  115 SEQ ID 381的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MTQLAETYACNPSTERGRGILIAGNPKPGSNSVLYTNGRSVVILNLD NPLDISVYAEHAYPATVARFSPNGEWVASAD SSGAVRIWGAYNDHVLKKEFKVLSGRIDDLQWSPDGLRIVASGDGKGKSLVRAFMND SGTNVGEFDGHSRRVLSCAFK PTRPFRIVTCGEDFLVNFYEGP PFKFKLSRRDHSNFVNC LRFSPDGNRFISVSSDKKGIIYDGK TGEKIGELSSDGGH TGSIYAVSWSPDSKQVITVSADKSAKIWDISEDGSGNLRKTLTSSGSGGVDDMLVGCLWQNNHLVTVSLGGTISIYTAGD LDKAPVSFSGHMKNVSSLSVLKGDPKVILSSSYDGLI IKWIQGIGFSGRVQRKESTQIKCLAAVDEEIVTSGYDNKVCRVSGSGDAEFIDIGCQPKDLSLALQCPEFALVSTDTGVVLLR GAKIVSTINLGFAVTASTVAPDGTEAIIGAQDGK LRIYSISGD TLTEEAVLEKHRGAISVIHYSPDLSMFASG DLNREAVVWDRASR EVRLKNILYHTARINCLAWSPDSST VATGSLDTCVIIYEVDK PASNRLTIKGAHLGGVYGLAFTDDFSVVSSGEDACIRVWKINRQ
  116 SEQ ID 382的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且SOF1蛋白域为粗体。   MKVKVISRSTDEFTRERSQDLQRVFRNFDPNLRTQEKAVEYVRALNAAKLDKVFA RPFVGAMDGHVDSVSCMAKNPNY LKGIFSGSMDGDIRLWDIAS RRTVCQFPGHQGPVRGLAA STDGQILVSCGIDSTVRLWNVPVATLGESDGTHENLAKPLAVYVWKNAFWAVDHQWDGELFATAGAQVDIWNQNR SQP ISSFEWGTDTVISVRFNPGEPNVLATSGSDRSITLYDLRMSSPTRKVIMRTKTNAISWNPMEPMNFTAANEDCNCYSYDARKL EEAKCVHKDHVSAVMDIDYSPTGREFVTGSYDRT VRIFQYNGGH SREVYHTKRMQRVFCVKFSCDASYVISGSD DTNLRLWKAKASEQLGVVLPRERRKHEYHEAVKSRYKHLPEVKRIVRHRHLPKPIYKAGILRRTVNEADRRKEERRKAHSAPGSSSAEPLRKRRIIKEIE
  117 SEQ ID 383的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MVRSIKNPKKAKRKNKGSKNGDGSSSSSSIPSMPTKVWQPGVDKLFEGEELQCDPSAYNSLHAFHIGWPCLSFDIVRDTLGLVRTEFPHQVYFVAGTQAEKPTWNSIGIFKVSNITGKRRELVPSKPTDDADEESDSSDSDEDSDDEVGGSGTPILQLRKVGHEGCVNRIRAMNQNPHICASWGDSGHVQIWDFSSHLNALAESEADVSQGASSVFNQAPLVKFGGHKDEGYALDWSPLVPGRLVSGDCKNSIHLWEPTSGSTW NVDSTPFIG HAASVEDLQWSPTEENVFASCSVDGTIAIWDTRLG KTPA ASFKAHDADVNVISWNRLATCMLASGCDDGTFSIHDLRLLKEG DSVVAHFEYHKHPVTSIEWSPHEASTLAVSSADCQ LTIWDLSLEKDEEEEAEFKAKTKEQVNAPEDLPPQLLFVHQGQKDLKELHWHAQIPGMIVSTAADGFNILMPSNIQSTLPSDGA
  118 SEQ ID 384的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MER YKVIKELGDGTYGSVWKALEQTHEIVAIKKMKRKY YIWEECINLREVKSLRKLNHPNIIKLKEVIRENNELFFI FEYMECNLYQIMKERSTFFSETAIIKFCYQILQGLSYMH RNGYFHRDLKPENLLVTSDLIKIADFGLAREVLTSPPYT DYVSTRWYRAPEVLLQSPTYTTAIDMWAVGAILAELFTL HPLFPGESELDEIYKICGVLGTPDYETWPDGMQLAAFRN FIFPQFLPVNLSVLIPHASPEAIDLITRLCSWDPQKRPT  AEQALHHPFFRIGMSIPLSLGGHFQDNTCAAEVDTKFHSKKACKAWNGEKESSLECFLGLSLGLKPSLGHLGAMGSQGVGAVKQEVGSSPGCQSNPKQSLFQVLNSRAILPLFSSSPNLNVVPVKSSLPSAYTVNSQVMWPTIAGPPAAAVTVSTLQPSILGDFKIFGKSMGLASQYAGKEASPFS
  119 SEQ ID 385的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体加框。   MGEMGRGINNSSNNNNSNRPAWLQH YDLVGKIGEGTYGL VFLARSKLPNNRGLRIAIKKFKQSKDGDGVSPTAIREIM LLREFSHENVVKLVNVHINHVDMSLYLAFDYAEHDLYEI IRHHREKLNNHNINQYTVKSLLWQLINGLNYLHSNWIVH RDLKPSNILVMGEGEEHGVVKIADFGLARIYQAPLKPLS DNGVVVTIWYRAPELLLGAKHYTSAVDMWAVGCIFAELI TLKPLFQGVEVKASPNPFQLDQLDKIFKVLGHPTIEKWP TLMNLPHWSKNLQQIQQHKYDNAGLHIGPIPAKSPAYDL LSKMLEYDPRKRITAAQALEHEYFRIDPQPGRNALVPSQPGEKAINYPPRLVDANTDFDGTIAPQPSQVSSGNAPSGSIASAAVPAVRPLPQQMQLMGMQRMQNPGMAAFNLGAQASNSGLNHNNIALQRGSSQQQAHQQVRRKEPNSGFPNTGYPPPPKSRRL
  120 SEQ ID 386的氨基酸序列。保守蛋白激酶家族域为下划线。蛋白激酶ATP结合区为粗体,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体/斜体。   MDK YEKLEKVGEGTYGKVIKARDKMTQOLVAUKKTRLEM DEEGVPPSSLREISLLQMLSQSIYVVRLLCVEHVTKKGK PLLYLVFEYLDTDLKKFIDYRRSVNAGPLPQNVIQSFMY QLLKGVAHCHSHGVLHRDLKPQNLLVDKSKGLLKVGDLG LGRAFTVPLKCYTHEVVTLWYRAPEVLLGSTHYSTPVDI WSVGCIFAEMVRRQPLFPGDCEIQQLLHIFTLLGTPTEE MWPGVKRLRDWHEYPQWKPENLARAVPNLSPTGLDLISK MLQCDPAKRISAKAAMNHPYFDDLDKSQF
  121 SEQ ID 387的氨基酸序列。保守蛋白激酶家族域为下划线。蛋白激酶ATP结合区为粗体,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体/斜体。   MDG YEKMDKVGEGTYGKVYMARDKKTGQLVALKKTRLEN DGEGIPPTALREISLLQMLSQDIYIVRLLDVKHTENKLG KPLLYLVFEYMESDLKKYIDSYRRSHTKMPPSMIKSFMY QLCRGVAYCHSRGVMHRDLKPHNLLVDKEKGVLKIADLG LSRAFTVPVKKYTHEIVTLWYRAPEVLLGATHYSLPVDI WSVGCIFAEMSRMQALFTGDSEVQQLMNIFRFLGTPNEE VWPGVTKLKDWHIYPEWKPQDISHAVPDLEPSGLDLLSQ MLVYEPSKRISAKKALEHPYFDDLDKSQF
  122 SEQ ID 388的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MDA YEKLEKVGEGTYGKVYKAKDKNTGQLVALKKTRLES DDEGIPPTALREISLLQMLSQDIHIVRLLDVEHTENKNG KPLLYLVFEYMDSDLKKYIDGYRRSHTKVPPNIIKSFMY QLCQGVAYCHSRGVMHRDLKPHNILVDKQRGVVKIADLG LGRAFTIPIKKYTHEIVTLWYRAPEVLLGATHYSTPVDI WSVGCIFAEMVRLQALFIGDSEVQQLFKIFSFLGTPNEE IWPGVTKFRDWHIYPQWKPQDISSAVPDLEPSGVDLLSK MLVYEPSKRISAKKALEHPYFDDLDKSQF
  123 SEQ ID 389的氨基酸序列。保守蛋白激酶家族域为下划线。蛋白激酶ATP结合区为粗体,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体/斜体。   MDS YEKLEKVGEGTYGKVYKAKDKKTGKLVALKKTRLEN DGEGIPPTALREISLLQMLSQDMNIVPLLDVEHTENKNG KPLLYLVFEYMDSDLKKYVDGYRRSHTKMPPKIIKSFMY QLCQGVAYCHSRGVMHRDLKPHNLLVDKQRGVLKIADLG LGRAFTVPIKKYTHEIVTLWYRAPEVLLGATHYSTPVDI WSVGCIFAEMSRMHALFCGDSEVQQLMSIFKFLGTPNEG VWPGVTKLKDWHIYPEWRPQDLSRAVPDLEPSGVDLLTK MLVYEPSKRISAKKALQHPYFDDLDKSQF
  124 SEQ ID 390的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MEK YEKLEKVGEGTYGKVYKGRDKRTGRLVALKKTPFHQ EEGIPPTAIREISLLKSLSQCIYIVKLLDVKASFNGKGK HVLFMVFEYADSDLKKHIDAHRQCNTKLSPRSIQSYMFQ LCKGIAYCHSHGVLHRDLKPQNILVDQKIGLLKIADLGL GRACTVPIKSYTFEVVTLWYRAPEVLLGAKRYSMALDIW SLGCIFAELCNLQALFAGDSQIQQLINIFRLLGTPNEQL WPGVTQLSDWHEFPQWRPQDLSKVVFNLDPNGVDLLSKM LQYDPAKRISAKEALDHPYFDSLDKSQF
  125 SEQ ID 391的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MGCVCGKPSARAADYVESPAEKGASSNSRSSSMASRRLVAPAVMDQGIDAENGHEGDYRTKLRGKQSNGADPVSLLSDDAEKQRHSRHHQHQQHHPIRPHHLRPQGEFVPNANSNPRFGNPPRHIEGEQVAAGWPAWLTAVAGEAIKGWIPRRADSFEKLDKIGQGTYSNVYKARDLDTGKIVALKKVRFDNLEP ESVRFMAREIQVLRRLDHPNVVKLEGLVTSRMSCSLYLV FEYMDHDLAGLAACPGIKFTEPOVKCYMQQLLRGLDHCH SRGVLHRDIKGSNLLIDNGGILKIADFGLATFFHPDQRQ PLTSRVVTLWYRPPELLLGATEYGVAVDLWSTGCILAEL LAGKPIMPGRTEVEQLHKIFKLCGSPSEDYWKKSKLPHA TIFKPQQPYKRCVAETFKDFPPSALALMEVLLAIEPADR GTATSALKSDFFTTKPLACDPSSLPKYPPSKEFDAKIRDEEARRQRAAGGRGRDAARRPSRESRAIPAPEANAELAISIQKRRLSSQGPSKSKSEKFNPQQEDGAVGFPIEPPRPMHIGIDAGATSRMYSQQFGPSHSGPLSNQISSSIWGKNQKEDEIQMAPGRPSRSSKATISDFRKPGACAPQPGADLSHLSSLVATARSNAGIDTHKDRSGMWQHNRIDAIDGVHNNGKHEFLEVPEHPNRQDWTRFQQPESFKGLDNYHLQDLPATHHRKDERVASKEATMNWQGYGGQGGDKIHYSGPLLPPSGNIDEILKEHERHIQHAVRRARQDKGRPQRSNLSQNERKAFEHRSFVSGVNGNAGYSDLVNELPISVGSNRLKVSKTRGTEEIVELRELEREPLSSVMEKYEREHEM
  126 SEQ ID 392的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MGCVCAKQSDILGEPESPKVKGSNLASSRWSVSSETKQLPQHSDSGILHHQHYYHPRDESDEAKLKESNYGGSKRRTRQGRDPADLDMGIFVRTPSSQSEAELVAAGWPAWMAAFAGEAIHGWIPRRAES FEKLYKIGQGTYSNVYRARDLDNGKI VALKKVRFDSLDAESVRFMAREILVLRKLDHPNIVKLEG LVTSEVSSSLYLVFEYMEHDLAGLAACPGIKFTEPQVKC YMQQLLQGLDHCHRHGVLHRDIKGSNLLIDNGGILKIAD FGLATFFYPDQKQLLTSRVVTLWYRPPELLLGATDYGVA VDIWSAGCILAELLAGKPILPGRTEVEQLHKIFKLCGSP SEDYWKESKLPHATIFKPQHPYKSCIAEAFKDFSPSALALLETLLAIEPGHRGEASGALKSEFFTTEPLSCDPSSLPKYPPSKEFDAKLRAQETRRQRDVGVRGHGSEAARRTSRLSRAGPTPNEGAELTALTQKQHSTSHATSNIGSEKPSTKKEDYTAGLHIDPPRPVNHSYETTGVSRAYDAIRGVAYSGPLSQTHVSGSTSGKKPKRDHVKGLSGQSSLQPSKPFIVSDSRSERIYEKSHVTDLSNHSRLAVGRNRDTTDPHKSLSTLMQQIQDGTLDGIDIGTHEYARAPVSSTKQKSAQLQRPSALKYVDNVQLQNTRVGSRQSDERPANKESDMVSHRQGQRIHCSGPLLHPSANIEDLLQKHEQQIQQAVRRAHHGKREALSNKSSLPGKKPVDHRAWVSSGKGNKESPYFKGKGNKELSDLKGGPTAKVTNFRQKVM
  127 SEQ ID 393的氨基酸序列。保守蛋白激酶家族域为下划线。蛋白激酶ATP结合区为粗体,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体/斜体。   MAVANPGQLNLQEAPSWGSRSVNC FEKLEQIGEGTYGQV YMAKEIETGEIVALKKIRMDNEREGFPITAIREIKLLKK LQHENVIKLKEIVTSPGPEKDEQGKSDGNKYNGSIYMVF EYMDHDLTGLAERPGMRFSVPQIKCYMKQLLIGLHYCHI NQVLHRDIKGSNLLIDNNGILKLADFGLARSFCSDQNGN LTNRVITLWYRPPELLLGSTKYGPAVDMWSVGCIFAELL YGKPILPGKNEPEQLTKIFELCGSPDESNWPGVSKLPWY SNFKPQRQMKRRVRESFKNFDRHALDLVEKMLTLDPSQR ISAKDALDAEYFWTDPVPCAPSSLPRYEPSHDFQTKRKRQQQRQHDEMTKRQKISQHPPQQHVRLPPIQNAGQGHLPLRPGPNPTMHNPPPQFPVGPSHYTGGPRGAGGQNRHPQNIRPLHAAQGGGYNANRGYGGPPQQQGGGYPPHGMGNQGPRGGQFGGRGAGYSQGGPYGGPVGGRGPNVGGGNRGPQFWSEQ
  128 SEQ ID 394的氨基酸序列。保守真核蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MQNMEDNVQSSWSLHGNKEICAR YEILERVGSGTYSDVY RGRRKADGLIVALKEVHDYQSSWREIEALQRLCGCPNVV RLYEWFWRENEDAVLVLEFLPSDLYSVIKSGKNKGENGI PEAEVKAWMIQILQGLADCHANWVIHRDLKPSNLLISAD GILKLADFGQARILEEPEAIYEVEYELPQEDIVADAPGE RLMEEDDSVKGVRNEGEEDSSTAVETNFGDMAETANLDL SWKNEGDMVMQGFTSGVGTRWYRAPELLYGATIYGKEID LWSLGCILGELLILEPLFSGTSDIDQLSRLVKVLGTPTE ENWPGCSNLPDYRKLCFPGDGSPVGLKNHVPSCSDSVFS ILERLVCYDPAARLNAKEVLENKYFVEDPYPVLTHELRVPSPLREENNFSEDWAKWKDMEADSDLENIDEFNVVHSSDGFCIKFS
  129 SEQ ID 395的氨基酸序列。保守真核蛋白激酶域为下划线,且蛋白激酶ATP结合区和丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MDLNQYPEDLNPELPEGTDNVDNPDNNKGSPVPSPHPPLKPLDPSER YRKGITLGQGTYGIVYKAFDTVTNKTVAVKK IHLGKAKEGVNVTALREIKLLKELSHPNIIQLIDAYPHK QNLMIVFEFMETDLEAVIKDRNLVFSPADIKSYLQMTLK GLAVCHKKWVLHRDMKPNNLLIAADGQLKLGDFGLARLF GSPDRKFTHQVFAVWYRAPELLFGAKQYGPAVDIWATGC IFAELLLRKPFLQGVSDLDQIGKIFAAFGTPRQSQWPDV ASLPDFVEFQFVFAPSLRSLFPMASEDALDLLSKMFTLD PKNRITAQQALEHRYFSSVPAPTRPDLLPKPSKVDSSRPPKHASPDGPVVLSPSKARRVMLFPNNLAGILPKQVSQSTTGGTPIEFDMPTQKLREVCPRSRITESGKKHLKRKTMDMSAALDECAREQEGQEGKTILDPDHQRSAKKEKHM
  130  SEQ ID 396的氨基酸序列。保守周期素N和C末端家族域为下划线。   MAGGQENCVRITRARAACVSKASAPVIQSQVDEKKSRKRAPKRAAVDDLAANASGSQPKRRVVLGDVTNLHAAATDCLSTAEDQVDAPNPSIKGRARNKKKEARTSTKVVKDEIHPESNPLADHSSNLSECQKPPAAKLAEQRSLRGVPSKAKQGGSSNSQSCSKHTDIDKDHTDPQMCTTYVE DIYEYLRNAEL KNRPSANFMETAQNDITPNMRAILVDWLVEVSEEYKLVP DTLYLTVSYIDRYLSANPTSRHKLQLLGVSCMLIASKYE EVCPPHVEEFCYITDNTYTRDEMLSMERKILIFLNFEMT KPTTKSFLRRFVRASQAGNKAPSLHMEFLANYLAELTLM ECSFLQYLPSLIAASTVFLSRLTLDFLTNPWNPTLAHYT GYKASQLKDCVMAIYNVQMNRKGSTLVAIREKYQQHKFK CVASLPPPPFIAERFFEDTPN
  131 SEQ ID 397的氨基酸序列。保守周期素和周期素C末端域为下划线,且周期素信号为粗体。   MTGTQASNVRITRARAAKSTLNNALPPLPPAQGKPRGKRAATESNISGFSVAAEPLKRRAVLSDVSNICKEAAAVDCLKKPKAVKVVSQNANAKGRGRGIPRNNKKITQEAEIKKETSPAICNVDDASAGNAIGDDKQNNNVNPLKEVQDNPKELNPIAEQISVHPHCKQSVEKPNEKEIVVSDNKAAIASLKQQSTLQSLRIPKQPKYSLKQGNPVPLANLHEDVGRSSCSDFIDIDSEYKDPQMCTAYVT DIYANMRVVELKRRPLPNFME TTQRDINANMRSVLIDWLVEVSEEYKLVPDTLYLTVSYI DRFLSANVVNRQRLQLLGVSCMLVASKYEEICAPPVEEF CYITDNTYKKEEVLEMEISVLNRLQYDLTTPTTKTFLRR FIRAAQASCKVSSLHLEFMGNYLAELTLVEYDFLKYLPS LIAAAAVFVARMTLDPMVHPWNSTLQHYTGYKVSDMRDC ICAIHDLQLNRKGCTLAAIREKYNQPKFKCVANLFPPPIISPQFLIDNEV
  132 SEQ ID 398的氨基酸序列。保守周期素和周期素C末端域为下划线,且周期素信号为粗体。   NAAPNQNALLINNNNRRPLVDIGNLVGALNAQCNISKNGARKRAFGDIGNLVEDLDAKCTISKYWVRKRPRTNFGVNANKGASSSTQGQGIVVRGEQKAWDRIVWGNKQSCAIKMNAQHVTATQRGTAISISDIIDSSVQDGGIKAPSQLKARKQTVRTVTATLTARSEDSLRDVLEVPPGIDDGDRDNPLAVVEYVE DIYHFYRKIEVRSCVPPDYMTRQLEIKDSMRGVIID WLIEVHRTFLLMPETLYLTVNIIDRYLSIQSVTRNELQL MGITAMFIASKYEEISPPKINDLVYITKDAYTSKQIVNM EHTILNRLKFKLTVPTPYVFLVRFLKAAGPDKVMKNLAF FLVDLCLLHYKMIKYSPSMLAAAAVYTAQCTLKKHPYWN KTLILHIGYSEAHLRECAHLMADLHLKAEGSNLKSVYKK YSYPIFGSVAFLSPAKIPAGTVAAPAIDKCAHQIYLRNLR
  133 SEQ ID 399的氨基酸序列。保守周期素N和C末端家族域为下划线。   MPPNKQTQGLVQNKKMASKAAQPKAMVPPQRVPPAANNRRALGDIGNIVADVGGKCNVTKDGVNGKPLAQVSRPITRSFGAQLLAQAAANKGISAANNQTQVPVVIPKADVRGNKQRRTSKSKDIPPTTVVTNESDDCVIIEQAQRIKPTCNHNVGAVGNKEKPQLLTAKPKSLTASLTSRSAVALRGFRFDDEMTEAEEDPLPNIDVGDRDWQLAVVEYVE DIYKFYRRTEQM SCVPDYMPRQQEINPKMRAVLINWLIEVHYRTGLMPETL YLTTNLIDRYLATQLVSRSNYQLVGATAMLLASKYEEIW APEMNDFLDILENKFERKHVLVMEKAMLNKLKFHLTVPT PYVFLVRFLKAAASDEEMENLVFFLMELSLMQYVMIKFP PSMLAAAAVYTAQITLKKTTVWNDVLKRHTGYSEIDLKE CTRLMVAFHQSSEESKLNVVFKKYSMPEYDSVALIKPAKLPA
  134 SEQ ID 400的氨基酸序列。保守周期素和周期素C末端域为下划线。   MAPSFDCVANAYIESCEDQEKLRQNAQILAQSGENDVD E PVSMLVQRETHYMLPEDYLQRLRNRTLDVNVRREAVGWI LKVHSFYNFGAPTAYLAVNYLDRFLSRHRMPQGVKAWMI QLMAVACLSLAAKMEETQVPLPSDLQREDARFIFDARTI QRMELLILSTLQWGMRS ITPFSFIDYFAYRAVQGHGHGH DATPKAVMSRAIELILSTTEEIDFMEYRPSAIAAAALLC AAEEVVPLQAVHYKRALSSSITDVDKDKMFGCYNLIQET IIEGGCYWTPMSLQSTEKTPVGVLDAAACLSNTPTSSYSVKPYASVTAAKRRKLNEICSALLVSQAHPC
  135 SEQ ID 401的氨基酸序列。保守周期素和周期素C末端域为下划线。   MAANFWTSSHCKELLDAEKVGIVHPLDKDQGLTQEDVKIIKINMS NCIRTLAQYVKLRQRVVATAITYCRRVYTRKSF TEYDPQLVAPTCLYLASKAEESTVQAKLVIFYMKKYSKH RYEIKDMLEMEMKLLEALDYYLVIYHPY RPLIQFLQDAG LNDLKVTAWALVNDTYRTDLILTYPPYMIALACIYFACI MEEKDAQAWFEELRVDMNEIKNISMEIVDYYDNYRVIPDEKMNSALNKLPHRF
  136 Amino acid sequence of SEQ ID 402. The conserved cyclin domain is underlined.   MAPALSSSYECLSHLLCAEDASNVVGCWDEDESKIFCEEEEGFGIQHFPDFPVPDD DEIRVLVRKESQYMPGKSYVQS YQNLGLDFTARQNAIGWILKVHGSYNFGPLTAYLSINYL DRFLSRNPLPKAKVWMLQLLSVACLSLAAKMEETQVPLL LDLQAEEPDFLFEPRTIQRMELLVLSTLEWRMLSVTRFSFVDYFLQGGGGRKPPPRAMVARANELIFNTHTVLDFLEHRPSAIAAAAVICAAEEVLPLEAAQYKETILSCSLVDKEWVFGSYNLIQEVLIEKFSTPKKAKSASSSIPQSPVGVLDAFCLSNNSNNTSLEASLSVNLYASVAAKRRKLNDYCNTWRMFQHSTC
  137 Amino acid sequence of SEQ ID 403. The conserved cyclin domain is underlined.   MAPNCIDCAPSDLFCAEDAFGVVEWGDAETGSLYGDEDQLHYNLDICDQHDEHLWDDG ELVAFAEKETLYVPNPVEKN SAEAKARQDAVDWILKVHAHYGFGPVTAVLSINYLDRFL SANQLQQDKPWMTQLAAVACLSLAAKMDETEVPLLLDFQ VEEAKYIFESRTIQRMELLVLSTLEWRMSPVTPLSYIDHASRMLGLENHHCWIFTMRCKEILLNTLRDAKFLGLLPSVVAAAIMLHVIKETELVNPCEYENRLLSAMKVNKDMCERCIGLLIAPESSSLGSFSLGLKRKSSTINIPVPGSPDGVLDATFSCSSSSCGSGQSTPGSYDSNNSSILCISPAVIKKRKLNYEFCSDLHCLED
  138 Amino acid sequence of SEQ ID 404. The conserved cyclin-dependent kinase regulatory subunit domain is underlined, and cyclin-dependent kinase regulatory subunit signature 1 is in bold.   MPQIQYSSKYTDDTYSYRHVVLPPETAKLLPKNRLLNEN EWRAIGVQQSRGWVHYAIHRPEPHIMLFRRPLNYQQNQQQQAGAQSQPMGLKAQ
  139 Amino acid sequence of SEQ ID 405. The conserved cyclin-dependent kinase regulatory subunit domain is underlined, and cyclin-dependent kinase regulatory subunit signature 1 is in bold.   MDQIEYSEKYYDDTYEYRHVELPPDVARLLPKNRLLTEN EWRGIGVQQSRGWVHYAIHCSEPHIMLFRRPLNYEQNHQHPEPHIMLFRRPLNCQPNHQPQAHHPT
  140 Amino acid sequence of SEQ ID 406. The conserved cyclin-dependent kinase regulatory subunit domain is underlined, and cyclin-dependent kinase regulatory subunit signature 1 is in bold.   MDQIEYSEKYYDDTYEYRHVELPPDVARLLPKNRLLTEN EWRGIGVQQSRGWVHYAIHCSEPHIMLFRRPLNYEQNHQHPEPHIMLFRRPLNCQPNHQPQAHHPT
  141 Amino acid sequence of SEQ ID 407. The conserved cyclin-dependent kinase regulatory subunit domain is underlined, and cyclin-dependent kinase regulatory subunit signature 1 is in bold.   MPQIQYSEKYTDDTYEYRHVVLPPDVARLLPKNRLLNEN EWRGIGVQQSRGWVHYAIHRPEPHIMLFRRHLNYQQNQQQQAQQQPAQAMGLQA
  142 SEQ ID 408的氨基酸序列。保守GCN5相关性N-乙酰基转移酶家族域为下划线,且基本SAM家族域为粗体。   MALVETEPVTLIHPEEPKKFKKKPTPGRGGVISHGLTEEEARVKAIAEIVGAMVEGCRKGEDVDLNALKAAACRRYGLSRAPKLVEMIAALPDGERAAVLPKLKAKPVRTASGLAVVAVMSKPHRCPHIATTGNICVYCPGGPDSDFEYSTQSYTGYEPTSMRAIRARYNPYVQTRSRIDQLKRLGHTVDKVEFILMGGTFMSLPADYRDYFIRNLHDALSGHTSSNVEEAVCYSEHSATKCIGLTOETRPDYCLGPHLRQMLSIGCTRLEIGVQSTYEDYARDTNRGHTVAAVADCFCLAKDAGFKVVAHMMPDLPNVGVEPDMESFREFFENPAFRADGLKIYPTLVIRGTGLYELWKTGRYRNYPPEQLVDIIARVLALVPPWTRVYRVQRDIPMPLVTSGVEKGNLRELALARMDDLLGLKCRDVRTREAGIQDIHHKIRPEVVELVRRDYCANEGWETFLS YED TRQDILVGLLRLRKCGHNTTCPELKGRCSIVRELHVYGT AVPVHGRDADKLQHQGYGTLLMEQAERIAWKEHRSIKIAVISGVGTRHYYRKLGYELEGPYMMKYLN
  143 SEQ ID 409的氨基酸序列。保守染色体域为下划线,且MOZ/SAS样蛋白域为粗体。   MLGFRDLYTSICEHLQRASGRLPIIAAATSLISTPEIAAVEKENKAPNSVDKMGMGSADESGRFSTSNGQFMNMNNGVVLEEWKGGVPVVPSAPTTVPVITNVKLETPSSPDHDMARKRKLGFLPLEVGTRVLCKWRDG KFHPVKIIERRKLPNGA TNDYEYYVHYTEFNRRLDEWVKLEQLELDSVETDADEKV DDKAGSLKMTRHQKRKIDETHVEGNEELDAASLREHEEFTKVKNITKIELGRYEIETWYFSPFPSEYNNCEKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYAQNLCYLAKLFLDHKTLYYDVDLFLFYILCECDERGCHMVGYFSKEKHSEKSYNLACILTLPPYQRKGYGKFLISFSYELSKKEGKVGTPERPLSDLGLLSYRGYWTRVLLDILKKHKSNISIKELSDMTAIKADDVLSTLQGLDLIQYRKGQHAICADPKVLDRHLKAVGRGGLEVDVCKLIWTPYKEQ
  144 SEQ ID 410的氨基酸序列。保守MOZ/SAS样蛋白域为下划线。   MGSLDESTCSEEIRDEGKDSIRTKFKVESTVNNAQNGGNDNSKKKRAAGLPLEVGIRLLCKWRDSKLHPVKIIERRKLPNGFPQDYEYYVHYTEFNRRLDEWVKLEQFELDSVETDADEKIEDKGGSLKMTRHQKRKIDEIHVEEGQGHEDFDPASLREHEEFTKVKNIAKVELGRYEIETWYFSPFPPEYSHCEKLFFCEFCLNFMKRKEQLQRHMRKCD LKHPPGDEIYRNG TLSMFEVDGKKNKIYGQNLCYLAKLFLDHKTLYYDVDLF LFYVLCECDDRGCHVVGYFSKEKHSDEAYNLACILTLPP YQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLLSY RGYWTRILLDILKKQRGNISIKELSDMTAIKVEDVISTL QVLDLIQYRKGQHVICADPKVLDRHLKAAGIAGLEVDVSKLIWTPYKEQCG
  145 SEQ ID 411的氨基酸序列。保守溴家族域为下划线。   MASAPMVGCDDSRDKHRWVESKVYMRKGHGKGSKGNAGFNAQNSTAQVRRENDNMGNSIADNGKSEAASEGLSSLSRKQITVNQDHPPNETSSMPAVGGLQNIDTHVTFKLEGCSKQEIWELRKKLTNELEQVRGTFKKLEARELQLRGYSVSAGVNTSYSASQFSGNDMRNNGGKEVTSEVASGGAITPKQAQRESNPPRQLSISLMENNQAASDMGEKGKRTPKANQYYRNSEFVLGKDKFPPAESKKSKSTGNKKISQSKVFSKETMQVGKEFMPQKSVN EVFRQCSLLLTKLMKHKYGWVFNLPVDAQ ALGLHDYHTIIKRPMDLGTVKSKLEKNLYNSPASFAEDV KLTFSNAMTYNPKGHEVHTMAEQLLQLFEERWKTIYEEHLDGKMRFGSGQGLGASSSTKKLPFQDSKKNIKKSEPAGGPSPPKPKSTNHHASRTPSAKKPKAKDPHKRDMTYEEKQKLSTNLQNLPQERLELIVQIIKKRNPSLCQHDEEIEVDIDSFDTETIWELDRFVTNYKKSLSKNKKKALLADQAKRASEHGSARNKHPMIGRELPMNNKKGEQGEKVVEIDHMPPVNPPVVEVEKDGVYAKRSSSSSSSSSDSGSSSSDSDSGSSSGSESDAYAATSPPAGSNTSARG
  146 SEQ ID 412的氨基酸序列。保守GCN5相关性N-乙酰基转移酶家族域为下划线,且溴域为粗体。  MEGHSGALGFGQGFSRSSQSPNLSPSPSHSASASVTSSGQKRKRNEVEHAGVASNSTGMFAVPPSHIYSHLHPMSMSMPMPMHNSHPSSLSESRDGALTSNDDDDNLTGGNQSQLDSMSAGNTDGREDFDDEDDDDDDEEDDDEVEGDEEDQDHDPDADDDSDDGHDSMRTFTAARLDNGAPNSRNLKPKADAAGVAIAPTVKTEPILDTVKEEKVSGNNNNNSVSANNAQVAPSGSAVLLSAVKEEANKPTSTDHIQTSGAYCAREESLKREEDADRLKFVCPGNDGIDQHMIWLIGLKNIFARQLPNMPKEYIVRLVMDRSHKS VMIIKQNQVVGGITYRPYLSQKFGE IAFCAITADEQVKGYGTRLMNHLKQHARDVDGLTHFLTY ADNNAVGYFIKQDFTKEIKLEKERWHGYIKDYDGGILMECKIDPKLPYTDLPAMIRWQRQTIDEKIRELSNCHIVYSGIDIQKKEAGIPRKPIKVEDIPGLKEAGWTGDQWGHSRFRLLNSPSEGLPNRQVIHAFMRSLHKAKVEHADAWPFKEPVDPRDVPDYYDIIKDPMDVKRMFTNARTYNTHETIYYKCANR
  147 SEQ ID 413的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。  MEESGNSLTSGPDGSK RRVSYFYDSDIGNYYYSQGHPMK PHRIRMAHSLIVHYALDEKMEVCRPNLLQSRELRVFHAD DYISFLQSVTPETQHEQLRQLKRFNVGEDCPVFDGLYNF CQTYAGGSVGAAIKLNNKEADIAINWSGGLHHAKKCEAS GFCYVNDIVLAILELLKVHQRVLYIDIDIHHGDGVEEAF YSTDRVNSVSFHKFGDYFPGTGHLKDVGYGKGKYYSLNV PLNDGIDDESYKNLFRPIIQKVMEIYQPEAVVLQCGADS LSGDRLGCFNLSVKGHADCVRFLRSFNVPLVLVGGGGYT IRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLHVAPSNMENQNSAKELAKIRNTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQNRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKWPLGEAG
  148 SEQ ID 414的氨基酸序列。保守组蛋白去乙酰基转移酶域为下划线。  MEESGNSLTSGPDGSK RRVSYFYDSDIGNYYYSQGHPMK PHRIRMAHSLIVHYALDEKMEVCRPNLLQSRELRVFHAD DYISFLQSVTPETQHEQLRQLKRFNVGEDCPVFDGLYNF CQTYAGGSVGAAIKLNNKEADIAINWSGGLHHAKKCEAS GFCYVNDIVLAILELLKVHQRVLYIDIDIHHGDGVEEAF YSTDRVMSVSFHKFGDYFPGTGHLKDVGYGKGKYYSLNV PLNDGIDDESYKNLFRPITQKVMEIYQPEAVVLQCGADS LSGDRLGCFNLSVKGHADCVRFLRSFNVPLVLVGGGGYT IRNVAACWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLHVAPSNMENQNSAKELAKIRNTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQNRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKWPLGEAG
  149 SEQ ID 416的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。  MMETGGNSLPSGPDGVK RKVAYFYDPEVGNYYYGQGHPM KPHRIRMTHALLVQYGLHKEMQILKPYPARDRDLCRFHA DDYVAFLRGITPETIQDQVKALKRFNVGDDCPVFDGLYQ YCQTYAGGSVGGAVKLNHKLCDIAINWAGGLHHAKKCEA SGFCYVNDIVLAILELLKYHKRVLYVDIDIHHGDGVEEA FYTTDRVMTVSFHKFGDYFPGTGDIRDIGCGKGKYYAVN VPLDDGIDDFSFQSLFKPIIQQVMLVYNPEAIVLQCGAD SLSGDRLGCFNLSVKGHAECVRYMRSFNVPLLMVGGGGY TVRNVARCNCYETGVAVGVEIDDKMPQHEYYEYFGPDYTVHVAPSNMENKNTKQYLDKIRSKILENINSLPCAPSAQFQVQPPDTDFPELEEEDYDERTRSHKWDGASCDSDSENGDLKHRNHDVEESAFPRHNLANISYNTKIKLEGVGTGGLDMAAGTDTKKNDESFEAMDYESGEELRQDHFASTINASQPCDPALLTGVQNQLQ8TDTVKPIEQSGNAPGIPPPSVATVSTGTRPSSISRTSSLNSMSSVKQGSILGPNPPQGLNASGLQFPVPTSNSPIRQGGSYSITVQAPDKQGLQNHMKGPQNMPGNS
  150 SEQ ID 417的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MPPK DRVAYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVL SYELHKKMEIYRPHKAYPVELAQFHSADYVEFLHRITPD TQHLFTKELVKYNMGEDCPVFENLFEFCQIYAGGTIDAA HRLNNQICDIAINWSGGLHHAKKCEASGFCYINDLVLGI LELLKHHARVLYVDIDVHHGDGVEEAFYFTDRVMTVSFH KYGDMFFPGTGDVKEVGEREGKYYAINVPLKDGIDDASF TRLFKTIITKVVDIYQPGAIVLQCGADSLAGDRLGCFNL SIDGHAQCVRIVKKFNLPLLVTGGGGYTKENVARCWSVE TGVLLDTELPNEIPDNDYIKYFAPDYSLKINTAGNMENLNSKTYLSAIKVQVMENLRAIQHAPSVQMHEVPPDFYIPDIDEDELNPDERMDQHTQDRQIQRDDEYYDGDNDIDHDMEEAS
  151 SEQ ID 418的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MDSSKSEEANILHVFWHEGMLNHD LGTGVFDTLEDPGFL EVLEKHPENADRVRNMLSILRKGPIAPYTEWHTGRAAYL SELYSFHRPDYVDMLAKTSTAGGKTLCHGTRLNPGSWEA ALLAAGTTLEAMRYILDGHGKLSYALVRPPGHHAQPTQA DGYCFLNNAGLAVELAVASGCKRVAVVDIDVHYGNGTAE GFYERDDVLTISLHMNHGSWGPSHPQTGFHDEVGRGKGL GFNLNVPLPNGTGDKGYEHAMHELVVPAISKFMPEMIVL VIGQDSSAFDPNGRECLTMEGYRKIGQIMRQQADQFSGG RLVVVQEGGYHITYAAYCLHATLEGVLCLPHPLLSDPIAYYPEHDIYSERVTFIKNYWQGIISTTDKRN
  152 SEQ ID 419的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MEESGNALVSGPDGSK RRVTYFYDADIGNYYYGQGHPMK PHRMRMAHNLIVHYGLHQRMEVCRPHLAQSKDIRAFHTD DYIHFLSSVAPDTQQEQLRQLKRFNVGEDCPVFDGLFNF CQSSAGGSIGAALKLNRKDADIAINWAGGLHHAKKCEAS GFCYVNDIVLGILELLKVHQRVLYIDIDIHHGDGVEEAF YTTDRVMTVSFHKFGDYFPGTGHIKDVGYGKGKYYALNV PLNDGIDDESYKHLFRPIIQKVMEVYQPEAVVLQCGADS LSGDRLGCFNLSVKGHADCVRFVRSFNIPLMLVGGGGYT IRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLYVAPSNMENLNTEKDLEKMRNVLLEQLSKIQHTPSVPFQERPPDTEFNDEEEEDMEKRSKCRIWDGEYVGSEPEEDGKLPRFDADTYERSVLKHENKRLVPVSNVEPLKRIKQEEDGAAV
  153 SEQ ID 421的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MDLNLVSHGEEEEGVRR RKVGIVYDERMCKHATPEDQPH PEQPDRIRVIWDKLNSAGVLHKCVMVEAKEASEEQLAGV HSRKHIEVMKSIGTARYNKKKRDKLAASYSSIYFSQGSS EAALLAAGSVVEISEKVASGELDAGVAIVRPPGHHAEAD KAMGFCLFNNIAIAAKHLVHERPELGVQKVLIVDWDVHH GNGTQHMFWTDPHVLYPSVHRFDAGPFYPGGDDGFYDKI GEGKGAGYNINVPWEQGKCGDADYLAVWDHVLVPVAKSY DPDMVLISGGFDAALGDPLGGCRLTPYGYSLNTKKLMEF AGGKIVLALEGGYNLKSLADSFLACVEALLKDGPSRSSVLTHPFGSTWRVIQAVRKELSSFWPALNEELQLPRLLKDASESFDKLSSSSSDESSASEDEKKFAEVTSIMEVSPDPSSILALTAEDIAQPLAGLKIEEAGTDSQRSSDHTLLDLTNDDTQKLKQFEGEIFVMIGDEESVPSASSSKDQNESTVVLSKSNIKAHSWRLTFSSYYVWYASYGSNMWNPRFLCYIEGGQVEGMAKRCCGSEDKLLLKGYSGKLFLIECFLGDHTQIHGVQEECPFLIQIVVIRVKRMSACIK
  154 SEQ ID 422的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且FKBP型肽基-脯氨酰基顺-反异构酶信号1和2为粗体。   MADEDLDLSDVGEVEDEPGEEIESTPPLAVGQEKEINSLALKKKLLKV GTRWETPENGDEVTVHYTGTLPDGTKFDSS RDRGEPFTFKLGQGQVIKGWDQGIVTMKKGERALFTIPP ELAYGSSGVRPTIPPNATLQFDVELLSWTNIVDVCNDGGILKRIISE GEKYERPKDPDEVTVKYEAKLEDGTLVAKSP EEGVEFYVNDGHFCPAIAKAVKTMKRGESVILTIKPTYA FGERGKDAEEGFAAIPPNATLTTSLELVSFKAVIAVTEDKKVIKKILKE ADGYDKPSDGTVVQIRYTAKLQDGTIFEK KGYEGEEPFQFVVDEEQVIAGLDKAVETMKTGEIALITI GAEYGFGNFETQRDLAVIPPNSTLIYEVEMISFTKEKESWDMDTTEKIEASKQKKEQGNSIFKVGKYQRAAKKYEKAAKYIEHDSSFSAEEKKQSKVLKVSCNINHAACRLKLKDFKEAVKLCSKVLELESQNVKALYRRAQAYIETADLDLAEFDIKKALEIEPQNREVQLEYKILKQKQIEYNKKDAKLYGNMFAKLNKLEAFEGKVLS
  155 SEQ ID 423的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶家族域为下划线。FKBP型肽基-脯氨酰基顺-反异构酶信号1和2为粗体。TPR重复为粗体/斜体。   MADEGLELSDVAEVEDEPGEEFESAPPLVVGQEKELNSSGLKKKLLKAG TRCETPENGDEVTVHYTGTLLDGTKFDSS RDRGEPFTFNIGQGQVIKGWDQGIVTMKKREHALFTIPP ELAYGASGMPPTIPPNATLQFDVELLSWTNIVDVCKDGGILKRIISD GEKYERPKDPDEVTVKYEAKLEDGMLVAKSP EEGVEFYVNDGNFCPAIVKAVKTMKKGENVTLTIKPAYA FGEQGKDAEEGFAAIPPNATITINLQLVSFKAVKEVTEDKKVYKKILKE ADGYDKPSDGTVVQIRYTAKLQDGTIFEK KGYAGEEPFQFVVDEEQVIAGLDKAVETMKTGEVALITI GPEYGFGNIETQRDLAVIPPYSTLIYEVEMVSFTKEKESWDMNTTENIEASKQKKEQGNSLFKVGKYLRAAKKYDKAAKYIEHDNSFSAEEKKQSKVLKVSCNLNHAACCLKLKDFKKAVKLCSKVLELESQNVKALYRRAQAYIETADLDLAEFDIKKALEIEPQNREVRLEYLILKQKQIEYNKKDAKLYGNMFARQNKLEAIEGKD
  156 SEQ ID 424的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。   MPNP KVFFDMQVGGAPAGRIVMELYADVVPKTAENFRAL CTGEKGTGRSGKPLHFKGSSFHKVIPGTMCQGGDFTRGN GTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGS QFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGS GRTSKPVVIADSGQLA
  157 SEQ ID 425的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。   MPNP KVFFDMQVGGAPAGRIVMELYADVVPKTAENFRAL CTGEKGNGRSGKPLHFKGSSFHRVIPGFMDQGGDFTRGN GTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGS QFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGS GRTSKPVVIADSGQLA
  158 SEQ ID 426的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。   MPNP KVFFDMQVGGAPAGRIVMELYADVVPKTAENFRAL CTGEKGTGRSGKPLHFKGSSFHRVIPGFMCQGGDFTRGN GTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGS QFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGS GRTSKPVVIADSGQLA
  159 SEQ ID 427的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号2为粗体。   MPNP KVFFDMQVGGAPAGRIVMELYADVVPKTAENFRAL CTGEKGTGRSGKPLHFKGSSFHRVIPGFMCQGGDFTRGN GTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGS QFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGS GRTSKPVVIADSGQLA
  160 SEQ ID 428的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且FKBP型肽基-脯氨酰基顺-反异构酶信号1为粗体和下划线。TPR重复为粗体/斜体。   MADDFELPESAGMMENEDFGDTVFKVGEEKEIGKQGLKKLLVKE GGSWETPETGDEVEVHYTGTLLDGTKFDSSRDRG TPFKFKLGQGQVIKGWDQGIATMKKGENAVFTIPPDLAY GESGSQPTIPPNATLKFDVELLSWASVKDICKDGGIFKKIIKE GEKWEHPKEADEVLVKYEARLEDGTVVSKSEEGVE FYVKDGYFCPAFAIAVKTMKKGEKVLLTVKPQYGFGHQG REAIGNDVARSTNATLLVDLELVSWKVVDEVTDQKKVLKKILKQ GEGYERPNDGAVVKVKYTGKLEDGTIFEEKGSDE EPFEFMAGEEQVVDGLDRAVMTMKKGEVALVSVAAEYGY QTEIKTDLAVVPPKSTLIYEVELVSFVKEKESWDMNTAEKIEAAGKKKEEGNALFKVGKYFRASKKYEKATKYIEYDTSFSEEEKKQSKPLKVTCNLNNAACKLKLKDYTQAEKLCTKVLEVESQNVKALYRRAQAYIQTADLELAELDIKKALEIDPNNRDVKLEYRALKEKQKEYNKKEAKFYGMMFARMSKLEELESRKSGSQKVETANKEEGSDAMAVDGESA
  161 SEQ ID 429的氨基酸序列。保守FKBP型肽基脯氨酰基异构酶域为下划线。   MAASLTPLGAGLAYATIYDQAKVRKLEPTKRSLIALCQHSDSQHRRFITRKYHVNVQILNRRDAIRLIGLAAGLCIDLSLMYDARGAGLPPQENAKLCDTTCEKELENAPMITTESGLQYKDIKI GNGPSPPIGFQVAANYVAMVPSGQVFDSSLD KGQPYIFRYGSGQVIKGLDRGLLSMKVGGKRRLYIPGPL AFPKGLNSAPGKRPRVAPSSPVIFDVSLEFIPGLESEEE
  162 Amino acid sequence of SEQ ID 430. The conserved FKBP-type peptidylprolyl isomerase domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.   MSAASLSADMAIRGTILGKTALGVLGPQVVSQCRQPVMFKCPPHTLRKMRFSAQDLQSKNFYSGFTPFKSVFISTSKRSWQAGSARAMSQDAAFQSKVTT KCFLDIEIGGDPAGRIV LGLFGEDVPKTAENFRALCTGEKGFGYKGSSFHRIIKDF MLQGGDFDRGDGTGGKSIYGRTFEDENFKLAHVGPGVLS MANAGPNTNGSQFFICTVKTPWLDKRHVVFGQVIEGMEI VKKLESEETNRTDRPKRPCRIVDCGELP
  163 SEQ ID 431的氨基酸序列。保守FKBP型肽基脯氨酰基异构酶域为下划线。   MGRIKPQTLLQQSKKKKVPGRISVSTIIVCNIIIIFLMFSLVGIYRQRAKRNRATSRSDGDEEMENFGRSKINSVPHQ AIVNTTKGLITLELFGKSSAHTVEKFVEWSERGYFNGLP FYRVIKHFVIQVGDPKFAGNREDWTVGGQLNVQLEFSPK HEAFMLGTSKLEDQGDGFELFITTAPIPDLNDKLNVFGRVIKGQDVVQEIEEVDTDEHFQPKSPIIINDVRLKDEL
  164 SEQ ID 432的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。   MARQSTLLLFWSLVFLGAIVFTQAKHEELEEVTH KVYFD VDIAGKPAGRVVIGLFGKAVPKTVENFRALCTGEKGVGK SGKPLHYKGSFFHRIIPSFMIQGGDFTLGDGRGGESIYG TKFADENFKLKHTGPVFITTVTTDWLDGRHVVFGKIISG MDVVYKVEAEGRQSGQPKRKVKIADSGELSMD
  165 SEQ ID 434的氨基酸序列。保守FKBP型肽基-脯氨酰基顺-反异构酶信号为下划线,且TPR重复为粗体。   MEMDEIQEQSQPQSSEKQDISQESDTGNDKTINAEKITSENAEVEEDDMLPPKVNTEVEVLHDKVTKQIIKE GSGNKP SRNSTCFLHYRAWAESTMHEFQDTWQEQQPLELVLGREK KELSGFAIGVAGMKAGERALLHVDWQLGYGEFGNFSFPN VPPRANLIYEAELIGFEYAKEGKARSDMTVEERIEAADRRRQQGNELFKEDKLAEAMQQYEMALAYMGDDFMFQLFGKYKDMANAVKNPCHLNMAQCLLKLNRYEEAIGQCNMYLAEDEKNIKALFRRGKARATLGQTDDARKDFQKVRKPSPEDKAVIRELRLLAEHDKQVYQKQKEMFKGLFGQKPEQKPKKLHWFVVFWQWLLSMIRTIFRMRSKTD
  166 SEQ ID 435的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶信号为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。   MAGAGEG TPEVTLETSMGPITVELYHKHAPKTCRNFLEL SRRGYYNNVKFHRVIKDFMVQGGDPTGTGRGGESIYGPR FEDEITRDLKHTGAGILSMANAGPNTNGSQFFISLAPTP WLDEKHTIFGRVCKGMDVVKRLGNVQTDKNDRPIHDVKI LRTTVKD
  167 SEQ ID 436的氨基酸序列。保守TPR重复域为下划线。   MMDPELMRLAQEQMSKISPDELMKMQRQIMANPDLMRMASENMKNLKPEDIRFAAEQMKNVRKEEMAEISERISRASPEEIEAMKARANLQSAYQLQVAQNLKDQGNQLHARMKYSEAAEKYLQARNNLTGIPFSEAKSLLLASSSNLMSCYLKTGQYEECVQTGSEVLAYDAMN VKALYRRGQAYKQIGKLELA VADLRKAVEVSPEDETIAQALREASTELMEKGGTQDQNGPRIEEIIEEEAVQPTAEKYPQSAPMVTSVTEDVSDDEQGSEDQNGFSRDSFQATNAPDGQMYAESLRNLTENPDMLRTMQSLMKNVDPDSLVALSGGKLSPDMVKTVSGMFGRMSPEEIQNMMKMSSTLSRQNPSTSSRFDDITRSHSNMDSSPQSVSVDNDLFEENQNRVGESSTNLSSSAAFSGMPNFSAEMQEQVRNQMNDPATRQMFTSMIQNMSPEMMASMSEQFGVKLSPEDAVKAQNAMASLSPNDLDRLMNWATRLQTAIDYARKIKNWILGRPGLIFAISMLLLAIILHRFGYIGD
  168 SEQ ID 437的氨基酸序列。保守FKBP型肽基脯氨酰基异构酶域为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。   MGVEKEILRP GNGPKPRPGQSVTVHCTGYGKNEDLSQKF WSTKDPGQKPFTFTIGQGRVIKGWDEGVLDMQLGEIFKL RCSPDYGYGSNGFPAWGIRPNSVLVFEIEVLSVN
  169   Amino acid sequence of SEQ ID 438. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase family domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.   MPNP RCYLDITIGEELEGRILVELYSDVVPKTAENFRAL CTGEKGIGPHTGVPLHYKGLPFHRVIKGFMIQGGDISAQ NGTGGESIYGLKFDDENFQLKHERRGMLSMANSGPNTNG SQFFITTTRTSHLDGKHVVFGKVIKGMGVVRGIEHTPTE SNDRPSLDVVISDCGEIPEGSDDGIANFFKDGDLYPDWPADLDEKSAEISNWMNAVDSAKCFGNENYKKGDYKMALRKYRKALRYLDICWEKEEIDEEKSNHLRKTKSQIFTNSSACKLKLGDLKGALLDTEFAMRDGEDNVKALFRQGQAYMALKDVDSAVASFKKALQLEPNDAGIRKELAVATKMINDRRDQERRAYARMFQ
  170   Amino acid sequence of SEQ ID 439. The conserved FKBP-type peptidylprolyl isomerase domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.   MGDVIDLNGDGGVLKTIIRSAKP GAMQPTEDLPNVDVHY EGTLADTGEVFDTTREDNTLFSFELGKGTVIKAWDIAVK TMKVGEVARITCKPEYAYGSAGSPPDIPENATLIFEVEL VACKPRKGSTFGSVSDEKARLEELKKQREIAAASKEEEKKRRERAKATAAARVQAKLEAKKGQGRGKGKSKGK
  171   Amino acid sequence of SEQ ID 440. The conserved cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is underlined.   MGLGLKIASASFLPIFNIMATRSLCILLVCFIPVLAHVLSLQDPELGTV RVYFQTTYGDIEFGFFPHVAPKTVEHIYK LVRLGCYNSNHFFRVDKGFVAQVADVVGGREVPLNSEQR KEGEKTIVGEFSEVKHVRGILSMGRYSDPDSASSSFSILLGNAPHLDGQYAVFGKVTKGDDTLKRLEEVPTRQEGIFVMPLERIRILSTYYYDTNERESNLTCDHEVSILKRRLVESAYEIEYQRRKCLP
  172   Amino acid sequence of SEQ ID 441. The conserved FKBP-type peptidylprolyl isomerase domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.   MASKRSLRTMNVWPTLPPLVLLLLLCFSSMSSSVVAKKSDVSELQIGVKHKP KSCDIQAHKGDRIKVHYRGSLTDGTV FDSSFERGDPIEFELGSGQVIKGNDQGLLGMCVGEKRKL RIPSKLGVGAQGSPPKIPGGATLIFDTELVAVNGKGISNDGDSDL
  173   Amino acid sequence of SEQ ID 442. The conserved FKBP-type peptidylprolyl isomerase domain is underlined, and the cyclophilin-type peptidyl-prolyl cis-trans isomerase signature is in bold.   MSGAPAERP ISYFDITIGGKPIGRIVFSLYADLVPKTAE NFRALCTGEKGIGKSGKPLCLAGSGFHRVIKGFMCQGGD FTAGNGTGGRSIYGEKFEDEAFPVKHTKPFLLSMANAGK DTNGSQFFITVSQTPHLDDKHVVFGEVIKGKSIVRAIEN YPTASGDVPTSPIIISACGVLSPDDPSLAASEETIGDSYEDYPEDDDSDVQNPEVALDIARKIRELGNKLFKEGQIELALKKYLKSIRYLDVHPVLPDDSPPELKDSYDALLAPLLLNSALAALRTQPADAQTAVKNATRALERLELSDADKAKALYRRASAHVILKQEDEAEEDLVAASQLSPEDMAISSKLKEVKDEKKKKREKEKKAFKKMFSS
  174   Amino acid sequence of SEQ ID 443. The conserved FKBP-type peptidylprolyl isomerase domain is underlined.   MASSLRSSLFSSWALDSKSVCSLFNLNPGKMGLPSISTPLNNRTCCCSHSSELLELNEGLQSSKRKTVMGLSTVIALSLVYCDEVGAVSTSKRALRSQKVPEDEYTTLPNGLKYYDLKV GSGTEAVKGSRVAVHYVAKWKGITFMTSRQGMGITGG TPYGFDVGASERGAVLKGLDLGVQGMRVGGQRILIVPPE LAYGNTGIQEIPPNATLEFDVELISIKQSPFGSSVKIVEG
  175   Amino acid sequence of SEQ ID 444. The conserved G-protein beta WD-40 repeat domain is underlined.   MGAIEDEEPPLKKLKVSSPGLRRGLEEEAPSLSVGSVSILMAKSLSLEEGETVGSKGLIRRVEFVRIITQALYSLGYQKAGALLEEESGILLQSSNVALFRKQILDGKWDESVVTLRGIDQVEVEGNTLKAASFLILQQKFFELLDKGNIPEAMKTLRLEISPMQLNTKRVHELASCIVFPSRCEELGYSKQGNPKSSQRMKVLQEIQQLLPPSIMIPEKRLERLVEQALNVQREACIFHNSLDPALSLYTDHQCGRDQIP TTTLQVLESHKN EVWFLQFSNNGKYLASASKDCSAIIWEITEGDS FSMKHR LSAHQKPVSFVAWSRDDKLLLTCGIEEVVKLWNVET GEC KLTYDKANSGFTSCGWFPDGERFISGGVDKCIYIWDLEGKELDSWKGQGMPKISDLAVTSDGKEIISICGDNAIVMYNLDTKTERLIEEESGITSLCVSKDSRFLLLNLANQEIHLWDIGARSKLLLKYKGHRQGRYVIRSCFGGSDLAFVVSGSEDSQVYIWHRGN GELLAVLPGHSGTVNCVSWNPVNPHVFASASDDYTIRIWGVNRNTFRSKNASSSNGVVHLANGGP
  176   Amino acid sequence of SEQ ID 445. The conserved G-protein beta WD-40 repeat domain is underlined, and the Trp-Asp (WD) repeat signature is in bold.   MPGTTAGAGIEPIEPQSLKKLSLKSLKRSFDLFASLHGEPQPPDQRSQRIRIACKVRAEYEVVKNLPTLPQREVGSSVSNSNVGETHSSLTTNQAQGFPTDTSGDLSKDEGKEITSIAVHLQPQTGLIDGKAGAIAGTSTAISSVGSSDRYQPSAAIMKRLPSKWPRPIWHPPWK NYRVISGHLGWVRSVAFDPG NEWFCTGSADRTIKIWEVATGK LKLTLTGHIEQIRGLAV SSRHPYLFSAGDDKQVKCWDLEYNK AIRSYHGHLSGVYC LALHPTLDILCTGGRDSVCRVWDIRTKA QIFALSGHENT VCSVFTQAIDPQVVTGSHDTTIKLWDLAAGK TMSTLTYH KKSVRAIAKHPFEHTFASASADNIKKFKLPKGEF LHNML SQQKTIVNAMAINEDNVLVSAGDNGSLWFNDWKSGHNFQQAQTIV QPGSLDSEAGIYALQYDITGSRLVSCEADKTIKMWKEDETATPESHPINFKAPKDIRRF
  177   Amino acid sequence of SEQ ID 446. The conserved G-protein beta WD-40 repeat domain is underlined.   M RPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGE RLGTYRGHNGAVWCCDVSRDSTRLITSSADQTAKL WNVETGAQLFSFNFESPARAVKLAIGDKLVVITTDPFMELPSAIHIKRIEKDLSKQTAD SVLTITGIKGRINRAVWGP LNSTIISGGEDSVVRIWDSETGKLLR ESDKETGHQKPIT SLCKSADGSHFLTGSLDKSARLWDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGGQEASHVTTTDRRAGKFEAKFFHKILEE EIGGVKGHFGPINSLAFNPDGRSFASGGEDGYVRLHHFDPDYFHIKM
  178   Amino acid sequence of SEQ ID 447. The conserved G-protein beta WD-40 repeat domain is underlined.   M RPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGE RLGTYRGHNGAVWCCDVSRDSTRLITSSADQTAKL WNVETGNQLFSFNFESPARAVDLAIGDKLVVITTDPFMELPSAIHIKRIEKDLSKQTAD SVLTITGIKGRINRAVWGP LNSTIISGGEDSVVRIWDSETGKLLR ESDKETGHQKAIT SLCKSADGSHFLTGSLDKSARLWDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGGQEASHVTTTDRRAGKFEAKFFHKILEE EIGGVKGHFGPINSLAFNPDGRSFASGGEDGYVRLHHFDPDYFHIKM
  179   Amino acid sequence of SEQ ID 448. The conserved G-protein beta WD-40 repeat domain is underlined.   MAENNVGDFIPLDRQEYPSKPAPGAVDSSFWKSF KKKEV SRQIAGVTCINFCPEPPHDFAVTSSTRVHIYDGKSCE LK KTITKFKDVAYSGVFRSDGQIIAAGGETGVIQVFNAKSQM VLRQLKGHGRPVRVVRYSPQDKLHLLSGGDDSMVKWWDITTQE ELLNLEGHKDYVRCGAASPSSVNLWATGSYDHTV RLWDLRNS KTVLQLKHGKPLEDVLFFPSGGLLATAGGNV VKVWDILGGGR PIHTMETHQKTVMAMCISKVPRSGQALG DAPSRLVTASLDGYMKVFDLDHFKVTHSARYPAPILSMGISSLCRTMAVGTSSGLLFIRQRKGQIEDKIHSDSSGLQVNPVNDEKDSAVLKPNQYRYYLRGRSEKPSEGDYVVKRMAKVYFQEYDKDLRHFNHSKALVSALKAADSKGTVAVIEELVARKRLIQTLSILNLDELELLINFLSRFILVPKYSRFLISLTDRVLDARAVDLGKSENLKKQIADLKGIVVQELRVQQSMQELQGIIEPLIRASAR
  180   Amino acid sequence of SEQ ID 449. The conserved C-x8-C-x5-C-x3-H type zinc finger is underlined and in bold, and the conserved Cys and His residues are in bold. The conserved G-protein beta WD-40 repeat domain is underlined, and the Trp-Asp (WD) repeat signature is in bold (not italic).   MDVETSGKPTGNKRTYTRLPRQVCVFWQEGRCTRESCNFLHVDEPGSVKRGGATNGFAPKRSYNGSDERDTLAAGPPGGSRRNISARWGRGRGGIFISDERQKI RNKVCNYWLAGNC QRGEECKYLHSFVMGS DVKFLTQLSGHVKAIRGIAFPSD SGKLYSGGQDKKVIVWDCQTGQGTDIPLNDEVGCIMSEGPWIFVGLPNAVKAWN ILTSTELSLVGPRGQVHALAVGNG MLFAGTHDGSILAWKFSPAS NTFEPAASLVGHTQAVVSL VSGADRLYSGSMDKTIRVWDL GTFQCLQTLRDHTSVVMS LLCWDQFLLSCSLDNTVKVWVAT SSGALEVTYTHNEEHG VLALCGMNDEQAKPVLLCSCNDNTVRLYDL PSFSERGRI FSRNEVRTFQIAPGGLFFTGDATGELKVWNWATQKS
  181   SEQ ID 450的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MSVQELRERHAAATAKVNALRERIKAKRLQLLDTDVATYASSNGRTPISFSFTDLV CCRTLQGHTGKVYSLDWTSEKN RIVSASQDGRLIVWNALTSQ KTHAIKLPCAWVMTCAFSP SGQAVACGGLDSVCSIFQLNNQLDRDGHLP VSRILSGHR SYVSSCQYVPDGDTHVITGSGDRTCTQWDVTTGQRIAIFGGEFPLGHTADVMSVSISAANPKEFVSGSCDTTTRLWDTRIASR AIRTFHGHEADVNTVKFFPDGLRFGSGSDDGTCR LFDIRTGHQLQVY RQPPRENQSPTVTAIAFSFSGRLLFA GYSNGDCFVWDTILEKVVLN LGELQNTHNGRISCLGLSA DGSALCTGSWDKNLKIWAFGGHRKIV
  182   SEQ ID 451的氨基酸序列。保守G蛋白βWD-40重复域为下划线   MKVKIISRSTDEFTRERSNDLQRVFRNFDPNLHTQARAQEYVRALNAAKLDKIF AKPFLAAMSGHIDGISAMAKSPRH LKSIFSGSVDGDIRLWDIA ARRTVQQFPGHRGAVRGLTV STEGGRLISCGDDCTVRLWDIPVAGIGESSYGSEN VQKP LATYVGKNSFRAVDYQWDSNVFATGGAQVDIWDHD RSEP TNSFAWGSDTVISVRFNPAEKDIFATTASDRSIVLYDLR MASPLNKLIMQTRNNAIAWNPREPMNFTAANEDCNCYSYDMRR MNISTCVHQDHVSAVMDIDYSPSCREFVTGSYDRT VRIFPY NAGHSREIYHTKRMQRVFCVKFSGDATYVVSGS DDANIRLWKAKASEQLGVLLPRERKRHEYLDAVKERFKHLPEIKRIERHRHLPKPIYKAALLRHTVNAAAKRKEERKRAHSAPGSVVTNPLRKKRIVAQLE
  183   SEQ ID 452的氨基酸序列。保守G蛋白βWD-40重复域为下划线   MDHYYQDDFDYLVDDEMVDFADDVEDDVRTRRRSDIDSDSENDFDLNNKSPDTTALQAKRGKDIQGIPWNRLNFTREKYRETRLQQYKNYENLPRPRRSRNLDKECTNFERGSSFYDFRHNTRSVKATIVHFQLRNLVWATSKHNVYLMQNYSIMHWSSLKQKGEEVLNVAGPIVPSVKHPGSSPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDKPGVSFCTKISHDENGITNAVEIYNDASGATRLMTANNDLAVRVFDTEKFTVLERFSFPWSVNHTSVSPDGKLVAVLGDNADCLLADCKT GKTV GTLRGHLDYSFAAAWHPDGYILATGNQDTTCRLWDVRKLSSSLAVLKGRMGAIRSIRFSSDGRFMAMAEPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDFEAFFVGVADRTYGSLLEFNRRRMNYYLDSIL
  184   Amino acid sequence of SEQ ID 453. The G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MAE ALVLRGTMEGHTDAVTAIATPIDNSDMTVSSSRDKS ILLWNLTKEPEK YGVPRRRLTGHSHFVDDVVISSDGQFA LSGSWDSELRLWDLN TGLTTRRFVGHTKDVLSVAFSIDN RQIVSASRDRTIKLWNT LGECKYTIQPDAEGHSNWISCV RFSPSATNPTIVSCSWDRTVKVWNLT NCKLRNTLVGHGG YVNTAAVSPDGSLCASGGKDGVTMLWDLA EGKRLYSLDA GDIIYALCFSPNRYWLCAATQQCVKIWDLESKSIVADLRPDF IPNKKAQIPYCTSLSWSADGSTLFSGYTDGKIRVWGIGHV
  185   SEQ ID 454的氨基酸序列。保守G蛋白β WD-40重复域为下划线   MAAIKSTSRSASVAFAPDAPLLAAGTMAGAIDLSFSSLANLEIFKLDFQSDDP ELPVVGECPSNERLNRLSWGSAGGS FGIIAGGLVDGTINIWNPATLINSEDN GDALIARLEQHT GPVRGLEFNTISTNLLASGAEDGELCIWDLANPTAPTH F PPLKGVGSGAQGEISFLAWNRKVQHILASTSYSGTTVVW DLR RQKPIISFPDATRRRCSVLQWNPDASTQLIVASDDD NSPTLRAWDLRN TISPYKEFVGHSRGVIAMSWCPSDSLF LLTCAKDNRTLCWDTG SGEIVCELPAGANWNFDVQWSPK IPGILSTSSFDGKIGIHNIEACSRNVSGEVEFGGAIVRGGPSALLKAPKWLERPAGVSFGFGGKLASFRPSTVAQAADHRHSEVFIHNLVTEDNLVIRSTEFEAAIADGEKVSLRALCDRKAEESQSDEEKETWNFLRVMFEDEGTARTKLLEHLGFKVQSEENGDLQETHSSKIDDIGSEIGKTLTLDDKTEEDVLPQLKGGQDAAIPQDNGEDFFDNLHSPKEEVSLSHVGNDFVGEKDKDMVVNGAEIEHETEDLTEYSDWNEAIQHSLVVGDYKGAVLQCLSANRMADALIIAHLGGNSLWEKTRDEYLKKAKSSYLKVVSAMVNNDLTGLVNSRPLKSWKETLAMLCTYSQREEWTVLCDMLASRLIAAGNVMAATLCYICAGNIEKTVEIWSRSLKYDYDGRSFVDHLQDVMEKTVVLALATGQKRVSPSLSKLVENYAELLASQGLLTTAMEYLKLLGTEESSHELSILRDRLYLSGTDNEVEASSFPFETRQDLTESQYNMHQTGFGAPETQKNYQENVHQVLPSGSYTDNYQPTANTHYIAGYQPAPQQQPSFQNYFTPASYQPAPSPNVFYPSQVSQAEQSNFAPPVNQPPMKTFVPSTPPILRNVDQYQTPSLNPQLYQGVSSATVETHPYQTGAPASVSVGTTPGQPSVVPNFMVPGPVTAPTVTPRGFMPVTTPTQHPLGSANPPVQPQSPQSSQVQSV
  186   Amino acid sequence of SEQ ID 455. The conserved G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MAGAADSQLQTLSERDSTPNF KNLHTREYAAHKKKVHSV AWNCTGTKLASGSVDQTARVWNIEPHGH SKTKDLELKGH ADSVDQLCWDPKHSELLATASGDRTVRLMDAR SGKCSQQ VELSGENINITFKPDGTHIAVGNRDDELTIIDVR KFKPL HKRKFSYEVNEIAWNTTGELFFLTTGNGTVEVLSYP SLQ VLHTLVAHTAGCYCIAIDPIGRYFAVGSADALVSLNDLSEMLCVRTFTKLEWPVRTISFNHDGQYIASASEDLFTDIA DVQ TGRTVHQISCRAAMNSVEWNPKYNLLAFAGDDKNKYMQDEGVFRVFGFETP
  187   SEQ ID 456的氨基酸序列。G蛋白βWD-40重复域为下划线   MAATSPVGAG SGRELANPPTDGISNLRFSNHSDHLLVSS WDRKVRLYDAS ANSLKGOFVHGGPVLDCCFHDDASGFSG SADNTVRRYDF STRKEDILGRHEAPVRCVEYSYAAGQVI TGSWDKTLKCWDPRGASGQ EDTLVGTYSQLERVYSMSLV GHRLVVATAGRHINVYDLRNMSQPEQRRESSLKYQTRCVRCYPNGTGFALSSVEGRVAMEFFDLSEAGQAKKY AFKCH RKSEAGRDTVYPVNAIAFHPIYGTFATGGCDGYVNVWDGNNKKRLYQYSKYPTSIAALSFSRDGRLLAVASSYTFEEGEKPHEPDAVFVRSVNEAEVKPKPKVYAAPP
  188   Amino acid sequence of SEQ ID 457. The conserved G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MASDDEEGFKNEEAPGVVDEAEVQEGLRACFPLSFGKQEKKQAPLESIHSATKRPEDPRPRRQLGPPRPPPSILAEQEDSDRFVGPPRPPQFVRDDNDDGEAEIMIGPPRPPAQYSDDHDNEETIGPPKPSYLEKGEETDQMVGPSKRGSDDETSGDSDDGDDAVDFRV PLSNEIVLRGHTKVVSALAIDOTGSR VLTGSYDYSVRMYDFQGMTS QLKSFRQLEPAEGHQVRSL SWSPTSDRFLCVTGSAQAKIFDRDGLTLGEFVK GDMYLR DLKNTKGHISGLTCGEWHPKEKQTILTCSEDGSLRIWDVND FNTQKQVIKPKLAKPGRVPVTACANGRDGKCIAGGVG DGSIQVWNLKPGWG SRPDLYVAKGHDDDITGLQFSADGN ILLTRSTDETLKVWDLRKAITPLQVFRDLPNNYAQTNVAFSPDERLIFTGTSVERDGNSGGLLCFYDRQTLELVLRIGVSPVHSVVRCTWHPRHNQVFATVGDKKEGGAHILYDPALSERGALVCVARAPRKKSLDDFEAKPVIHNPHALPLFRDEPSRKRQREKARMDPMKSQRPDLPVTGPGFGGRVGSTKGSLLTQYLLKEGGLIKETWMEEDPREAILKYADVAAKDPKFIAPAYAQTQPETVFAETDSEEEQK
  189   Amino acid sequence of SEQ ID 458. The G-protein β WD-40 repeat domains are underlined.   MKERGQSHAGQPSVDERYTQWKSLVPVLYDWLANHNLVWPSLSCRWGPQMHQATYKNSQRLYLSEQTDGTVPNTLVIATCEVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRIRELPQNSKIVATHTDGPDVLIWDVDTQPNRQATLGAADSRPDLVLTGHKDNAEFALAMSPSAPFVLSGGKDKCVLLWSIQDHISAATEPSSAKASKTPSSAHGEKVPKIPS IGP RGVYKGHKDTVEDVQFCPSNAQEFCSVGDDSALILWDARNGN EPVIKVEKAHNADLHCVDWNPHDENLILTGSADNSV RMFDRRNLTSSGV GSPVHKFEGHSAPVLCVQWCPDKASV FGSMAEDSYLNVWDYEKVGKNVGKKTPPGLFFQHAGHRDKVVDFHWNSFDPWTIVSVSDDGESTGGGGTLQIWRMSDLIYRREDEVLAELERFRAHILSCQNK
  190   Amino acid sequence of SEQ ID 459. The conserved G-protein β WD-40 repeat domains are underlined, the lissencephaly type-1-like homology (LisH) motif is in bold, and the CTLH (C-terminal to LisH) motif is in italics.   MSSLSRELVFLILQFLDEEKFKESVHKLEQESGFFFNMKYFDEKAQAGEWDEVERYLSGFTKVDDNRYSMKIFFEIRKQKYLEALDRQDRAKAVDILVKDLKVFSTFNEELYKEITQLLTLDNFRENEQLSKYGDTKSARTIMMSELKKLIEANPLFREKLIYPNLKASRLRTLINQSLNWQHQLCKNPRPNPDIKTLFTDHACGPPNGARTPTQPTASLGVLPKATTFTPIGPHGPFPSSSTATSGLASWMSNPNMVTSPQAPVAVGPSVPVPPNQATLLKRPRTPPGSSSVVDYQTADSEQLIKRLRPVSQSIDEATYPGPTLRVPWSTDDLP KTLARALNEPYPVTSI DFHPSQQTPLLVGTKNGEITLNEVGSREKLATRSFKIWDNANCSNHLEAAFVKDSSVSINRVLWSPDGTLIGIAFTKHLVHTYTFQGLD LRQHLEIDAHVGGVNDLAFSHPNKQLCV VTCGDDKMIKVWDAVT GRKLYNFEGHDAPVYSVCPHHKE NIQFIFSTAVDGKIKAWLYDHLGSRVDYDAPGHSCTTMMYSADGTRLFSCGTSKEGESFLVEWNESEGAIKRTYSGLRKKGSGVVQFDTTQNHFLAVGDEHLIKFWDMDSTNMLTSCDAEGGLLNLPRLRFNKEGSLLAVTTVNGIKILANADGQKLLKTMENRTFDLPSRAHIDAASATSSPATGRMERIERTSSANTVSGINGVDPAQSSEKLRLSDDLSEKTKIWKLTEITDSIQCRCITLPENAAEPASKVSRLLYTNSGVGLLALGSNAVHKLWKWNRSEQNPSGKATASVHPQRWQPTSGLL MTND ITDINPEEAVPCIALSKNDSYVMSASGGKVSLFNMMT FK VMTTFMPPPPASTFLAFHPQDNNIIAIGMEDSTIHIYNVRV DEVKTKLKGHQRRITGLAFSSTQNILVSSGADAQLCV WNTETWEKRKSKTIQMPVGKTVSGDTRVQFHSDQLHILVVHETQLAIYDAYKLERQYQWVPQDALSAPILYATYSCNRQLIYATFSDG
  191   SEQ ID 460的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MAKDEEEFRGEMEERLVNEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDREEPPGKDYSVQKMILGTHTSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQINHDGEVNRARYMPQNPFIIATKTVSAEVYVFDYSKHPSKPPQDGGCH PDLRLRGHNTEGYGLSWSPFKHGHLL SGSDDAQICLWDIWVPAKNKVLE AQQIFKVHEGVVEDVA WHLRHEYLFGSVGDDRHLLIWDLRTSATNK PLHSVVAHQ GEVNCLAFNPFNEWVLATGSADRTVKLFDLRKISS ALHT FSCHKEEVFQIGWSPKNETILASCSADRRLMVWDLSRIDEFQTPEDALDGPPE LLFIHGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMAENIYHDEEDDMPPEEVV
  192   SEQ ID 461的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MSPGV KQTGSQKFESGHQDVVHDVTMDYYGKRIATCSAD RTIKLFGLNASD TPSLLASLTGHEGPVWQVAWAHPKFGS MLASCSYDGRVIIWREGQQEN EWSQVQVFKEHEASVNSI SWAPNELGLCLACGSSDGSITVFTCREDG SWDKTKIDQA HQVGVTAVSWAPASAPGSLVGQPSDPIQKLVSGGCDNTA KVWKFYNGSW KLDCFPPLQMHTDWVRDVAWAPNLGLPKS TIASCSQDGKVVIWTQGKEG DKHEGRIINDFKIPVWRVNWSLTGNILAVADGNNSVTLWKEAVDGDWNQVTTVQ
  193   SEQ ID 462的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MSSGVK QTGSQKFESGHQDVVHDVTMDYYGKRIATCSAD RTIKLFGMNTSDT PTLLASLTGHEGPVWQVAWAHPKFGS MLASCSYDRRVIIWREGQQENE WSQVQVFKEHEASVNSI SWAPHELGLCLACGSSDGSITVFTGREDGS WDKTKIDQA HQVGVTAVSWAPASAPGSLVGQPSDPVQKLVSGGCDNTA KVWKFYNGSWK LDCFPPLQMHTDWVRDVAWAPNLGLPKS TIASCSQDGRVVIWTQGKEGD KWEGKILNDFKTPVWRIS WSLTGNILAVADGNNNVTLWKEAVDGEWNQVTTVQ
  194   SEQ ID 463的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MKKRSRPSNGHLSTAAKNKSRKTAPITKDPFFDSAHNRNKSKGKGKSRGKGEEIFSSDEDDDAIGRDAPAEEEEEIAEEERETADEKRLRVAKAYLDKIRAITKANEEDNEEEAGEDEETEAERRGKRDSLVAEILQQEQLEESGRVQRQLASRVVTPSKLVECRVVKRHKQSVTAVALTEDDLRGFSASKDGTIIHWDVETGASEKYEWPSQAVSVSSSNEVSKT QKGKGSKK QGSKHVLSMAVSSDGRYLATGGLDRYIHLWDTRT QKHIQ AFRGHRGAVSCLAFRQGTQQLISGSFDRTIKLWSAEDRAYMDTLYGHQSEILAVDCLRKERVLSVGRDHTLRLWKVPE ETQLVFRGHAASLECCCFINNEDFLSGSDDGSIELWSMLRKKPVFMAKNAHGHAIVENLSEDTSTREEPDEEVTTRQLPNGNSIGNGMRNQMGITPSVESWVGAVTVCRGTDLAASGAGNGVVRLWAIENSSKSLRALHDIPLTGFVNSLTFARSGRFLIAGVGQEPRLGRWGRIQAARNGVTLCPIELS
  195   SEQ ID 464的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MAATFGTINTATSPHN PNKSFEIVQPPNDSISSLSFSPK ANYLVATSWDNQVRCWEVLQTG ASMPKAAMSHDQPVLCS TWKDDGTAVFSAGCDKQAKMWPLLT GGQPVTVAMHDAPI KDIAWIPEMNLLATGSWDKTLKYWDTR QSNPVHTQQLPE RCFALSVRHPLMVVGTADRNLIIFNLQN PQTEFKRISSP LKYQTRCVAAFPDKQGFLVGSIEGRVGVHHVEEA QQSKN FTFKCHRDSNDIYAVNSLNFHPVHQTFATAGSDGAFNFW DKDSKQRLKAMARSNQPIPCSTFNSDGSLYAYAVSYDWSKGAENHNPATAKHHILLHVPQESEIKGKPRVTTSGRK
  196   SEQ ID 465的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MVVMDKGTHQTNEDESESEFIDEDDVIDEISIDEEDLPDADVEGEDVQEDNKRSEPDENSSSLDDAIHTFEGHEDTLFAVACSPVDATWVASGGGDDKAFMWRIGH ATPEEELKGHT DSVVALSFSNDGLLLASGGLDGVVRIWDAST GNLIHVLD GPGGGIEWVRWHPKGHLVLAGSEDYSTWMWNADL GKCLS VYTGHCESVTCGDFTPDGKAICTGSADGSLRVWNPQTQES KLTVKGYPYHTEGLTCLSISSDSTLVVSGSTDGSVHVV NIKN GKVVASLVGHSGSIECVRFSPSLTWVATGGMDKKL MIWELQ SSSLRCTCQHEEGVMRLSWSLSSQHIITSSLDG IVRLWDSRS GVCERVFEGHNDSIQDMVVTVDQRFILTGSDDTTAKVFEIGAF
  197   SEQ ID 466的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MPVFRTAFNGYAVKFSPFVETRLAVATAQNFGIIGNG RQ HVLELTPNGIVEVCAFDSSDGLYDCTWSEANENLVVSAS GDGSVKIWDIALPPV ANPIRSLEEHAREVYSVDWNLVRK DCFLSASWDDTIRLWTIDR PQSMRLFKEHTYCIYAAVWN PRHADVFASASGDCTVRIWDVRE PNATIIIPAHEHEILS CDWNKYNDCMLVTGSVDKLIKVWDIRTY RTPMTVLEGHT YAIRRVKFSPHQESLIASCSYDMTTCMWDYRAPE DALLA RYDHHTEFAVGIDISVLVEGLLASTGWDETVYVWQHGMDPRAC
  198   SEQ ID 467的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MDSRNRRSRLNLPPGMSPSSLHLETTAGSPGLSRVNSSPSTPSPSRTTTYSDRFIPSRTGSRLNGFALIDKQPQPLPSPTRSAAEGRDDASSSSASAYSTLLRNELFGEDVVGPATPATPEKSTGLYGGSRDSIKSPMSPSRNLFRFKNDHGGNSPGSPYSASTVGSEGLFSSNVGTPPKPARKITRSPYKVLDAPALQDDFYLNLVDWSSNNVLAVGLGTCVYLWSACTSKVTKLCDLGVNDSVCSVGWTPQGTHLAVGTNIGEVQIWDTSRCKKVRTMGGHCTRAGALAWSSYILSSGSRDRNILHRDIRVQ DDFIRKLVGHKSEVCGLKWSYDDRELASGGNDNQLLV WNQQSAQPLLRFNEHTAAVKAIAWSPHQHGILASGGSTADRCLRFWNTATDTRLNCVDTGSQVCNLVWCKNWNELVSTHGYSQNQIMVWRYPS MSKLATLTGHTLRVLYLAISPDGQ TIVTGAGDETLRFWSIFPSPKSQSAVHDSGLWSLGRTHIR
  199   SEQ ID 468的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MEKK KVVVPIVCHGHSRPIVDLFYSPVTPDGLFLISASK DSSTMLRNGE TGDWIGTFEGHKGAVWSCCLDNRALRAAS GSADFSAKIWDAL TGDELHCFVHKHIVRACAFSESTSLL LTGGHEKILRIFDLNR PDAPPKEVDNSPGSIRTVAWLHS DQTILSSNSDAGGVRLWDLR TEKIVRVLETKSPVTSAEV SQDGRYITTADGNSVKFWDAN HFGMVKSYTMPCMVESAS LEPTMGNMFVAGGEDMWVRLFDFH TGEEIACNKGHHGPV HCVRRFAPGGESYSSQSEQGTIRIWQTLNMNSENESYGVNGLSGKVRVGVDDVVQKVEGFQITADGHINDKPEKPNP
  200   Amino acid sequence of SEQ ID 469. The conserved G-protein β WD-40 repeat domains are underlined.   MERYSQGTQKKSEIYTYEAPWQIYGMNWSVRKDKKFRLGIGSFLEEYNNRVEIIELDEESGEFKSDPRLAFDHPYPTTKIMFVPDKECQRPKLLATTGDYLRIWQVCEDRVE PKSLL NNNKNSEFCAPLTSFDWNDADPKRIGTSSIDTTCTIWDIE KEVVDTQLIAHDKEVYDIAWGEVGVFASVSADGSVRVF DLRDKEHSTIIYESSQPETPLLRLGWNKQDPRFIATILMDSCKVVILDIRF PTLPVAELQRHQASVNTIAWAPHSPCH ICTAGDDSQALIWELSSVSQPLVEGGGLDPILAYTAAAEINQLQWSSMQPDWVAIAFSNEVQILRV
  201   SEQ ID 470的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MQSENNLDESL HLREVQELQGHTDTVWAVAWNPVTGIDG APSMLASCSGDKTVRIWENTHTLNSTSP SWACKAVLEET HTRTVRSCAWSPNGKLLATASFDATTAIWENVGG EFECI ASLEGHENEVKSVSWSASGMLLATCGRDKSVWIWDVQPGNE FECVSVLQGHTQDVKMVQWHPNRDILVSASYDNSIKV WAEDGDGD DWACMQTLGNSVSGHTSTVWAVSFNSSGDRM VSCSDDLTLMVWDTSINPAERSGNAG PWKHLCTISGYHDRTIFSVHWSRSGLIASGASDDCIRLFS
  202   Amino acid sequence of SEQ ID 471. The conserved G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MK RAYKLQEPVAHASNVNCLKIGKKSSRVLVTGGEDHKV NMWAIGK PNAILSLSGHSSAVESVTFDSAEALVVAGAAS GTIKLWDLE EAKIVRTLTGHRSNCISVDFHPFGEFFASG SLDTNLKIWDIR RKGCIHTYKGHTRGVNSIRFSPDGRWV VSGGEDNIVKLWDLT AGKLNHDFKCHEGQIQCMDFHPQE FLLATGRADRTVKFWDLE TFELIGSAGPETTGVRAMIFN PDGRTLLTGLHESLKVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVSRTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQRAGIAFSSKNLPASSGPPSYVSTPKKNSTSRVQPTTNFQTLSRPDIVPVIVPRSNSLRPETTSDVKKEMNNFGRVVPSTVSTKSTDVIKSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHVSSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTFPWSATDDGVTCQPDRQVTAPELSKRVVEPGRARALVASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRGSHGTSESDLTVSDDNSAIEELMQQHNAFTSILQARLTKLQVIRRFWQRNDLKGAIDATGKMGDHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDVIRATISATPTIGVDLQAEQRLERCNLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV
  203   Amino acid sequence of SEQ ID 472. The conserved G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MSTLEIEARDVIKIVLQFCKENSLHQTFQTLQNECQVSLNTVDSLETFVADINSGRWDVILPQVAQLKLPRKKLEDLYEQIVLEMIELRELDTARAILRQTQAMGFMKQEQPERYLRLEHLLVRTYFDPREAYHESSKEKRRSQIAQALASEVTVVPPSRLMALIGQSLKWQQHQGLLPPGTQFDLFRGTAAVKADEEEMY PTTLAHTIKFGKQSHPECARFSPDGQYLVSCSV DGFIEVWDYISGKLKKDL QYQADDSFMMHDDAVLCVDFS RDSEMLASGSQDGKIKVWRIR TGQCLRRLERAHSQGVTS LSFSRDGSQLLSTSFDSTARIHGLK SGKALKEFRGHTSY VNDAIFTSDGGVITASSDCTVKVWWDVK TTDCIQTFKPP PPLKGGDVSVNSVHLFPKNSEHIVVCNKASSIYIMTL QG QVVKSFSSGKREGGDFVAACISPKGEWIYCVGEDRNIYC FSQQ SGKLEHLMKAHDKDIIGVTPHPHRNLLVTYSEDSTMKIWKP
  204  SEQ ID 473的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MDIELEDQPFDLDFHPSAPIVAVALITGRLQLFRYVDISS EPERLWTVTAHTESCRAARFINAGSSVLTASPDCSILA TNVE TGQPVARLDNAHGAAINCLTNLTESTIASGDENGI IKVWDTR QNSCCNKFKAHEDYISDMEFVPDTMQLLGTSG DGTLSVCNLR KNKVHARSEFSEDELLSVALMKNGKKVVC GSQEGVLLLYSWGY FKDCSDRFVGHPHSVDALLKLDEDT VLTGSSDGIIRVVSIL PNKMIGVIGEHSSYPIERLAFSH DRNVLGSASHDQILKLWDIHYLHEDDEPETNKQEAVNDENVDMDLDVDTEKRPRGSKRKKRAEKGQTSSQKQSSDFFADI
  205  SEQ ID 474的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MDRIQQIPHTCVARKINLPLGMSKESLALNLPANLAPTMSPPSITYSDRFIPSRKASNFEEFALPDKTSPSPNSAGGQSSSTNGEGRDDACAAYSALLRTELFPATPDKTEGCRRPVIGSPSGNVFRFKSQQCKSQSPFSLCPVGEDGDLSETGAVARKTTRKIPRSPFKVLDAPALQDDFYLNLVDWSSHNILAVGLSACVYLWSASSSKVTKLCDLGLDDNVCSVAWTQRGTYLAVGTNNGGVQIWDAAHCKQVRTMEGHCTRVGTLAWNSHILSSGGRDRNILQRDIRAQ DDFVSKFSGHKSEVCGLKW SYDNRELASGGNDNQLFVWNQQSQQPVLKYNEHTAAVKAIAWSPHQHGLLASGGGTADRCIRFWNTATNTSLNCVDTGSQVCNLVWSKNVNELVSTHGYSQNQIIVWRYPT MSKLAT LTGHTLRVLYLAISPDGQTIVTGAGDETLRFWNVFPSSKTQQNTIRDMGVWSSGRTHIR
  206  SEQ ID 475的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MAGGQGEGEEKVDKLSMELTEDVMK SMEIGAVFKDYNGK INSLDFHRTNNYLVTASDDEAIRLFDTASATWQKTSYSKKYGVDLICFTNHQTSVLYSSKNGWDESLRHLSLM DNKYL RYFKGHHDRVVSLCMSPKGECFMSGSLDRTVLLWDLRIDKCQGLIRVRGRPAVAYDEQGLVFAISNEGGLIKMFDARLYDK GPFDTFVVEGDKSEASGIKFSNDGKLILLSTMDSNI HVLDAY QGTTVHSFSVEAVPNGGEAVPNGGTLEASFSPD GKFVISGSGNGNIHAWSVNSGKEVACWTTEGVIPAVVKWAPRRLMFASGSSVLSLWVPDLSKLASLTGSNSNSAY
 207  SEQ ID 476的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MHRVGSTGNTSNSSRPRR EKRLTYVLNDANDSRHCSGIN CLVISKLSLLGGNDYLPSGSRDGTLKRWELADD SAVCSA TFESHVDWVNDAVLTGETLVSCSSDTTLKTWRPFS DGVC TRTLRQHSDYVTCLAAASKNSNIVASGGLGREVFIWDIEAAMAPVSRTSEAMDDDTSNGVLSSGNSVLSTTVRSTNATNSASLHTSQL QGYTPIAAKGHKESVYALAMNDVGTLLVS GGTEKVVRVWDPR SGAKQMKLRGHTDNVRALILDSTGRF CLSGSSDSIIRLWDLG QQRCVHSYAVHTDSVWALASTPN FSHVYSGGRDLSLYLTDLTTRESLLLCMEKHPLLRLTLQPDSIWVATTDSSLHRWPAEGQNPPKMFQRGGSFLAGNLSFTRARACLEGSAPVPVNTQPSFVIPGSPGIVQHEILNNRRHVLTKDAEGTVKLWEITRGAVLDDYGKVSFEEKKEELFEMVSIPAWFTMDTRLGSMSVHLDTPQCFTAEMYAVDLNVPDAPEEQKINLAQETLRGLLAHWLSRRRQRLATQASANGDFPAGQENALRNHISSRIDVHDDAETHIAGILPAFDFSTTSPPSIITEGSQGGPWRKKITDLDGTEDEKDFPWWCLECVLHGRLSPRESLKCSFYLHPYEGTTVQVLTQGKLSAPRILRIQKVINYVLEKMVLDRPLDSSNSETTFTPGLSGNQSHAAVVGDGSLRSGARVWQQKAKPLVEILCNNQVLSPDMSLATVRTYIWKKPDDLYLYYRLVQNR
 208  SEQ ID 477的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MM KGKTIQMQAAHQNHDGETSVACVLWDWHAKHLITAGA DNTILIHSYPSSS SSKPITLRHHKNAVTALAINSNVRSL ASGSVDHSVKLYSYP GGEFQSNVTRFTLPIRSLAFNKSG ELLAAAGDDEGIKLISTI DNSIARVLKGHNGPVTSISFD PKNEFLASSDSDGTVIYWELS TGKPVHTLKKIAPNTTSN PTSLNQISWRPDGEMLAVPGRKSEVSMYDRDTAEKLFSLKGGHSDTICSLAWSPNGKYIATAGTDRQVMVWDADRRQDIDKQRFDNPICSVAWKPSDNALAVIDVLGRFGVWESPIASHMKSPADGAERYDNMEDEEPLMARYEEELEDSVSGSLNEIINDDDDDDEMGKIPRKILQKKPSVKVEKGKEESNAKAFKSGQDSFKLKSAMQEAFQPGATQRQSGKRNFLAYNMLGSVITFDNDGFSHIEVDFHDIGKGCRVPSMTDYFGFTMASLSESGSVFGSPQKGEKNPSTLMYRPFSSWANNSEWSMRFPMGEEVKAVALGSGWVAAVTSLNFLRVFSEGGLQKFVLSMDGPVVTAAGYENLLVVVSHASNPLLSGDQVLSFTVYDISQKTCPLSGRLPLSPGSHLTWLGFSEEGLLSSYDSEGNLRVFTNDYNGCWVPIPSAARERKSETESIWMVGLNSTQVFCVVCKLPDTYPQVAPKPVLSVLNLSLPLACSDLGADDLENEYLRGSLLLSQMQKKAEDAVACGRESNMEEDSIFKMEAALDRCLLRLIANCCKGDKLVRATELARLLSLEKSLQGAIKLVSAMKLPMLAERFNTILEEKILQENMETISCRRLTSEAQDMDTPISISVKQVSYGANLGDSPFLPNRQVEPKHSTPVFSKPDTKIEVDTSEAIAKGCDAQNGNIKSGDAEVQPASHNDSIQKPSNPFAKASNTSANQAVQRNASLLSSIKQMKTATENEGKRKERARSGSLPQKPAKQSKIS
  209   Amino acid sequence of SEQ ID 478. The conserved G-protein β WD-40 repeat domains are underlined.   MKQKRKGHQVDDPKYSVQTPQEDDTPNESGPASEEVESSDEEGGNSSNIEDDIIYSSSEEDPVVSSDYEEDEDAESDAEGVTAEQELEGDIDNALQNYMGTLTVLSNFHGENLKNAEGEDTSGDDDDEEEMPKRAEESDSPEDENDERPKRAEESDFSEDEDEERPKRAEESDSSEDEVPSRNTVGDVPLRWYKDEQHIGYDIKGKKIKKQPKKDQLDSFLASTDDSSDWRKVYDEYNDEEVELTKDEIKFISRLRKGTIPHADVNPYEPYVDWFDWKDKGHPLSNAPEPKRRFIPSKWEAKKVVKLVRAIRKGWITFQKAEEKPRFYLMWGDDLKPSEKMANGLSYIPAPKPKLPGHEESYNPPPEYIPTQEEINSYQLMYEEDRPKFIPKRFDSLRNVPAYDRFLSEIFERCLDLYLCPRTRKKRINIDPESLIPKLPKPKDLQPF PSICFLEYKGHTGAVSCISP ESSGQWLASGSKDGTVRIWEVETARCLKVWDIGRPIQHIAWNPVSQLSILAVAVDEEVLVLNTGLGSEDSQEKVAELLHVKSKPVSADDLGDNTSLTRWIKHEKFDGIKLTHLKPVHLISWHHKGDYFATVAPDGNTRAVLVHQLSKQQTQNPFKKMQGRVVHVLFHPSRAIFFVATKTHVRVYDLVKQQLVKRLVTGLHEVSSMAVHHKGDNLLVGSKEGKVCWFDMDL STQP YKTLKNHSKDIHSVAFHDSYPLPASCSDDCKAYVFYGLVYSDLLQNPLIVPLKVLQGHQSVNGMGVLDCQFHPKQPWLFTAGADSVVKLYCN
  210  SEQ ID 479的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MMSLKRGFEESLVPAKRQKTELSTVTYGDGPRRTSSL ES PIMLLTGHHAAIYTMKFNPTGTVIASGSHEREIFLWNVHGD CKNFMVLKGHKNAVLDLHWTTDGCQIISASPDKTLRA WDVET GKQIKKMAEHSSFVNSCCPSRRGPPLVVSGSDDG TAKLWDLRH RGAIQTFPDKYQITAVGFSDAADKIYSGGI DNEIKVWDLRR GEVTMRLQGHTDTITGMQLSSDGSYLLT NSMDCSLRIWDMRPYAPQNRCV KILTGHQHNFEKNLLKC SWSSDGSKVTAGSADRMVYIWDTTTRRILYKLPGHTGSVNETGFHPTQPIIGSCSSDKQIYLGEIEPNVGYQAVI
  211  SEQ ID 480的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MEFSDTYKHTGPCCFSPDARYLAIAVDYRLVIRDVVTLKVVQLYSCMDKISNIEWALDSEYIICGLYKRAMVQAWS LS QPEWTCKIDEGPAGIAHARWSPDSRHIITTSDFQLRLTV WSLVNTACIHIQWPKHASKGVSFTQDGKFAAIATRRDCKDYVNLLSCHTWEVMGTFTVDTIDLADLEWSPNDSAIVVWDSPLEYKVLIYSP DGRCLFKYQAYDSWLGVKTVAWSPCS QFLAVGSYDQTLRTLNHLTWKPFAEFVHVSTVRGPASAVVFKEVEEPWNLDVSGLHLNDDNAHDIQDGKPAEGHSRVRYKVVEFPVNVSSQKHPVDKPNPKQGIGLLAWSRDSQYLFTRNDNMPTALWIWDIC RLELAALLIQKEPIRAAAWDPVY PRVALCTGSSHLYMWTPSGACCVNIPLPQFVVSDLKWNPDGTSMLLKDRESFCCTFVPMLPENDDETNEE
  212  SEQ ID 481的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MAKLIETHSCVPSTERGRGILIAGDAKTNSIIYCNGRSVIMRNLD NPLEASVYGEHSYPATVARFSPNGEWVASGDTS GTVRIWGRGS DHTLKYEYKALAGRIDDLEWSADGQRIVV CGDSKGKSMVRAFMWDSGTNVGEFDGHSRRVLSCSFKPT RPFRVATCGEDFLVNFYEGP PFRFKTSHRDHSNYVNCVR FAPDGSKFITVGSDRRGVIFDGK MGEKIGELSKEGGHTG SIYAASWSPDSKQVLTVSADKSAKIWEISETGNGTVKKTLTFGSQGGADDMLVGCLWLNDYLITVSLGGIVSLLSAVDPDKPPKTISGHMKSINAIALSLQSGQSEVCSSSYDGVIV RWILGVGYAGRVERKDSTQIKCLATIEGELVTCGFDNKVRRVPLLSEQHKESEPIDIGAQPKDLDVAVGCPELTFVSTDAGIIIIR ASKIVSTTNVGYAVTAAAISPDGTEAVVGGQ DGKLRVYSIKGD TLLEESVLERHRGPINAIRFSPDGSMF ASGDLNREAVWDRITR EVKLKNMVYHTAARINCIAWSPD SSKVATGSLDTCILIYEVGK PASSRITIKGAHLGGVYGLAFSDQSTVISAGEDACVRVWSLP
  213   Amino acid sequence of SEQ ID 482. The conserved G-protein β WD-40 repeat domains are underlined, and the Trp-Asp (WD) repeat signatures are in bold.   MPQPSVVLLATAGYDHVRFWEAT SGRCYRTLQYPDSQVN HLEITPDKQYLAAAGNPHIRLFEVNSN NPQPVISYDSHT NNVTAVGFQCDGKWMYSGSEDGTVKIWDLR APGFQREYE SRAAVNTVVLHPNQTELISGDQNGNIRVWDLN ANSCSCE LVPEDTAVRSLTVMWDGSLVVAANNHGTCYVWRLMRGTQTMT NFEPLHKLQAHNSYILKCLLSPEFCEHHRYLATTSS DOTVKIWNVD GFTLERTLTGHQRWVWDCVFSVDGAFLVT ASSDSTARLWDLSTGEAIRTYQGHHKATVCCALHDGTDGASC
  214  SEQ ID 483的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。外被体WD相关性区为粗体/斜体。   MLTKFETKSNRVKGLSFHPKRPWILASLHSGVIQLWDYRM GLLIDKFDEHDGPVRGVHFHKTQPLFVSGGDDYKIKVW NYKM RQCLFTFVGHLDYIRTVHFHNEYPWIVSASDDQTI RLWNWQS RVCISVLTGHNHYVMSASFHPKEDLVVSASLD QTVRVWDISGLRKKTVSPADDLSRLAQMNTDLFGGGD VV VKYVLEGHDRGVNWAAFHTSLPLIVSGADDRQVKLNRMNDTK AWEVDTLRGHTNNVSCVIFHARQDIIVSNSEDKSIR VWDMSKRTSVQTFRREHDRFWILAAHPEMNLLAAGHDSGMIVFKLERERPAYVVYGGSLLYVKQRYLRTYEEATQKDNPLIPIRKPGSIGPNQGPRSLSYSPTENAILICSDADGGAYELYAVPKDSHGRSDTVQEAKKGLGGSAVFVARNRFAVLDKNHNQVTIKNLKNEVTKKFDLPVTADLFPYAGTGNLLCRSEDSVFLFDMQQRTVGKITQFPNVRYVVWSNDMENVALLSKHTIIIASKKLSSTCSLHETIRVKSGAWDDWGIFMYSTLNHIKYCLPNGDSGIIKTLDVPVYITKVSGKSLYCLDRDGKNRVIQIDITECLFKLALSKKKYDYVINMIRNSQLCGQAIIAYLQQKGFPEVALHFVRDERTRFNLAVESGNIEIAVASAKEIDNKDHWYRLGVEALRQGNAGIVEYAYQRTKNFERLSFLYLITGNLDKLSKMLRLAEMKNDVMGQFHNALYLGDIQERIKILEESGHLHLAYATASLHGLADIADRLAADLGGNIPVLPPGKKSSLLMPPAPILHGGDWPLLRVTKGIFEGGLENSTSAAYEEEDEEAAADWGEDIDIENIEGENGEATVLDDQEVKGGEDDEGGWDMEDLELPPDVAAANVGTNQKTLFVAPTLGMPVSQIWMQKSSLAGEHAAAGNFETALRLLTRQLGIKNFSPLKPLFLELYMGSHTFLPSFASVPAFSLALQRGWSSSASPNIRGPPALVYRLSVLEEKLTVAYRATTEGRFSEALRLFL
  215  SEQ ID 484的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MDLLQNYQDDSEDSNPELRNHPPLEDATATSAPAGVENETSSSPDSSPLRLALPAKSCAPDVDETLMALGVPGSEKKNNHNKPIDPTQHSVTFNPSYDQLWAPLYGPAHPYAKDGIAQGMRNHKLGFVEDSAIEPFMFDEQYNTFHRYGYAADPSASLGSHIVGDLESLKKNDGASVYNLPKREHKRQKLEKKMIQKDENEEEEKEVGEEVDNPSTEEWLKKNRKSPWAGKKEGLQTELTEEQKKYAQEHAEKKGDREKGEKVEIVDKTTFHGKEERDYQGRSWIDPPKDAKATNDHCYIP KRWVHTWSGHT KGVSAIRFFPKYGHLLLSAGMDTKVKIWDVFNS GKCMRT YMGHSKAVRDISFSNDGSRFLSAGYDRNIKLWDTET GKV ISTFSTGKIPYVVKLHPDEDKQNVLLAGMSDKKIVQWDMNS GEITQEYDQHLGAVNTITFVDNNRRFVTSSDDKSLRV WEFGIP VVIKYISEPHMHSMPSISLHPNTNWLAAQSLDN QILIYSTRERFQ LNKKKRFAGHIAAGYACQVNFSPDGRF VMSGDGEGRCWFWDWKT CKVFRTLKCHDNVCIGCEWHPLEQSKVATCGWDGMIKYWD
  216  SEQ ID 485的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MARKGLGTDPAIGSLMSSKKRKEYKVTNRFQEGKRPLYAIAFNFIDARYHNIFATAGGTRVTIYQCLEGGAISVLQAYVDQD KDESFYTLSWACDVNGSPLLVAGGHNGIIRVLDVANEKVHKSFVGHGDSVNEIRTQALKPSLILSASKDESVRL WNVQ TGICILIFAGAGGHRNEVLSVDFHPSDVYRIASCG MDNTVKIWSMKEFWTYVEKSFTWTDLPSKFPTKY VQFPV FIAAVHSNYVDCTRWLGNFILSKSVDNEVVLWEPYSKEQSTSDG VVDILQKYPVPECDIWFIKFSCDFHYNSMAVGNR EGKVYVWELQSS PPNLIARLSHAHCKNPIRQTAISHDGSTILCCCDDGSMWRWDVVQ
  217   SEQ ID 486的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MESGAGGSVGARVPSAKPEMLQQPPYSNGDDDNDMERGTAPVPSSNPNTVSKWELDKDFLCPICMQTMRDAFLTACGHSFCYMCIMTHLNNKSNCPCCSLYLTNNQLFPNFLLNKLLKKTSACQMASTASPVENLCLSLQQGAEVSVKELDFLLTLLAEKKRKMEQEEAETNMEILLDFLQRLRQQKQAELNEVQADLHYIKDDILALEKRRLELSRARERYSRKLHMLLDDPMDTTLGHAAIDDGNNVRTAFVRGGQGDAISGKFQQKKAEIKAQASSQGMQKRANFCHSDSQVLPTLSGLTIARKRRVLAQFDDLQECYLQKRRRWATQLRKQCDGGLRKERDGNSISREGYHAGLEEFQSILTTFTRYS RLRVISELRHGDLFHSAN IVSSIEFDRDDELFATAGVSRRIKVFDFATVVNEPAD VH CPVVEMSTRSKLSCLSWNKCIKSQIASSDYEGIVTVWDVN TRQSVMMYEEHEKRAWSVDFSRTEPTRLISGSDDGKVK VWCTRQETSVLNIDMKANICCVKYNPGSSYYVAVGSADH HIHYYDLRN PSVPLYEFNGHRKTVSYVKFISTNELASAS TDSTLRLWDVR DNCLVRTFKGHTNEKNFVGLTVNSEYIA CGSETNGVFVYHKAISKPAAWHQFGSPDLDDSDDDTSHFISAVCWKSESPTMLAANSQGTIKVLVLAP
  218   SEQ ID 487的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MANYVDSKKNFKCVPALQQFYTGGPFRLSSDGSFLVCACNDEVKVVDLAT GSVKNTLEGDSELIVALALTPDNKYLFS ASRSTQIKFWDLSS ATCKRTWKAHNGPVADMACDASGGL LATAGADRSILVWDVDG GYCTHSFRGHQGVVTTVIFHPD PHCLLLFSGSDDATVRIWDLVA KKCISVLEKHFSTVTSL AISENGWNLLSAGRDKVVNIWDLRDYHCRATIPTYEPLEAVCVLPTGSRLVSVMNQSRALPENRKKSGAAPVYFLTVGERGIVRIWYSEGALCLYEQKSSDAIISSDKDELKGGFVSAVLLPLTQGVMCVTADQRFLFYNLDESDEGKCDLKVSKRLIGYNEEIVDLKFLGDEEKFLAVATNLEQVRMYDLSS MT CVYELSGHTDIVLCLDTVVFSGHSLLASGSKDHTVRIWDTES KSCICVAAGHMGAVGAVAFSKKAKNFFVSGSSDRTI KVWSFASVLDFGGISKSIK LSSQAAVAAHDKDINSVAVA PNDSLICTGSQDRTARIWRLPD LVPVLVLRGHKRGVWCV EFSPVDQCVMTASGDKTIKIWALSD GSCLKTFEGHTASV LRASFLTRGTQFVSSGADGLLKLWTIKS NECIATFDQHE DKIWAMAVGKKTEMLATGGSDSLVNLWHDCTTTDEEEALLKEEEAALKDQELLNALADTDYVKAIQLAFELRRPYKLLNVFTELYSKGHAQDQIQKVIRELGNEELRLLLEYVREWNTKPKFAHVAQFVLFQLFNVLPPKEIIEVQGISELLEGLIPYAQRHYSRIDRLMRSTFLLDYTLSSMSVLSPTETDLSSSNLLARTADPLHAQIDQFHPTHFPEPNLTPIQSLLDSGNTDSVEVTARRAKKKRVSGNDSEKTTVAEVKIGDMENAFDEPDVADQGSSRKHKPASSKKRKSIAVGNASIKRIASGNAVTIALQV
  219   SEQ ID 488的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MESSCSSMNSNRHSTEKRCLRPLQKQGASMNKHSSDRFIPARGSIDLDVARFMVTQKQKDNNDIHALSPSPSPSKKAYQKEMADTLLKNAGAADNNCRILSFNGKSSTVSQGSQENVLANLSISRRARRYI PQSADRTLDAPDLLDDYYLNLLDWS STNVLSTALGNTVYLWDASNS SISELLIADEEEGPVTSV SWAPDGSQIAVGLNNSVVQLWDSQ SNKKLRALKGHHDRV GALSWNGPILTTGGLDGIITNHDVRT RDHIVQTYKGHTQ EVCGLKWSPSGQQLASGGNDNLLYIWDKSMASHNP SSQY FHQLDEHCAAVKALAWCPFQTNLLASGGGTSDGSIKFWNTQ TGACLNTVDTHSQVCSLLWNRHERELLSSHGLNQNQL TIWKYP SMVKITELTGHTARVLHMAQSPDGYTVASAAADETLKFWQVFGAPDASKKTKTKDTKGAFNMFHMHIR
  220   SEQ ID 489的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MLDEIVADEEEEFNIWKKNTPLLYDVVITHALEWPSLTVQWLPDRHQSPTKDYSLQKMIVGTHTSGDEPNYLMIAEVQMPLQYSEDGNVGGFESTEAKVHIIQQINHEGEVNRAQYMPQNSFIIATKTVSSDVYVFDYTKHSSNAPQERVCN PELI LKGHTNEGYSLSWSPLKEGQLLSGSNDAQICFWDINAASGRKVVE AKQIFKVHEGAVEDVSWHLKHEYLFGSVGDDCH LLIWDTRTAAPNK PQHSVVAHESEVNSLAFNPFNEWLLA TGSADKTVKLFDLRKLSC SLHTFSNHTEEVFQIEWSPMN ETILASSGGDRRLMVWDLRRIGDEQTSEDAEDGPPE LIF IHGGHTSKISDFSWNLHDDWLIASVAEDNILQIWQMAENIYHDDADIL
  221  SEQ ID 490的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MTKEDHGESRDEMGERMVNEEYKLWKKNTPFLYDLVITHALEWPSLTVQWLPPSCKQQQDIIKDDDIDHPNTQMVILGTHTSDNEPNYLILAEVQLHDGTEDEDGDGDVKRPQDKMKPGTSGGAMGKVRILQQINHQKEVNHARYMPQKPTIIATKTVNADVYVFDYSKHPSKPPQEGR CNPELRLQGHESEGYG LSWSPLKEGHLLSASDDAQICLWDITAATKAPKV VEANQ IFRYHDGPVEDVAWHAIHDHLFGSVGDDHHLLLWDIRNDSEKPLHIVEAHQAEVNCLAFNPFNEWIVATGSADRTVAL HDIRK LDKVLHTCAHHMEEVFQIGWSPQNGAILASCGSD RRLMVWDLSRIGDEQNPEDAEEAP PELLFIHGGHTSKIS DFSWNPAEEWVIASVAEDNILQVWQMSEHIYNDDNDSPTA
  222  SEQ ID 491的氨基酸序列。保守G蛋白β WD-40重复域为下划线。   MAMAMGDENAADPVEEFNIWKKNTPFLYDLVITHALEWPSLTVQWLPDRHQSSTADYSLQKMIVGTHTSEDEPNYLMIAEVQIPLQNSEDNIIGGFESTEA KVQIIQKINHEGEVNK ARYMPQNSFVIATKTVSSDVYVFDYSKHPSKAPQER VCN PELILKGHSNEGYGLSWSPLKEGYLLSGSNDAQICLWDINAAFGKK VLEANQIFKVHEGAVGDVSWHLKHEYLFGSVG DDCHLLIWDMRTAA PNKPQQSVIAHQSEVNSLAFNPFNE WLLATGSMDKTVKLFDLRK LSCSLHTFSNHTDQVFQIEWS PMNETILASSGADRRLMVWDLARIGETPEDEEDG PPEL LFVHGGHTSKISDFSWNLNDDRVIASVAEDNILQIWQMAENIYHDDEDML
  223  SEQ ID 492的氨基酸序列。保守G蛋白β WD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MGLFEPFRALGYITDGVPFAVQRRGIETFVTLSVGKAWQIYNCAKLIPVLVGPQMDKKIRALACWRDFTFAATGHDIAVFRRAHQVATWSGHKAKVTLLLSFGQHVLSVDLEGCLFIWAVAEVNQN KPPIGQIQLGEKFSPSCIMHPDTYLNKVLI GSEEGTLQLWNVNT RKKLYEFKGWGSSIRCCVSSPALDV VGIGCSDGKIHVHNLRYD EEIVTFMHSTRGAVTALSFRT DGQPLLAAGGSSGVISIWNLEKK KLQSVIKDAHDSSVCS LHFFANEPVLMSSATDNSIKMWIFDTTDGE ARLLKYRSG HSAPPMCIRYYGKGRHILSAGQDRAFRIFSVIQDQQSRELSQGHVGKRAKKLKVKDEEIKLPPVIAFDAAEIRERDWCNVVTCHLDDPCAYTWRLQNFVIGEHILKPCLEDPTPVKSCSISACGNFAVLGTEGGWLERFNLQSGISRGTYI DIGEK RQCAHNGAVVGLACDATNTLLISGGYNGDIKVWDFK GRE LKFRWEIEVPLIKIVYHPGNGILATAADDMILRLFDVTAMRLVRIFVGHMDRVTDLCFSGDGKWLLSSSMDGTIRVNDIISSRQLNAMHMDSAVTALSLSPGMDMLATTHVGHNGIYLWANRMIYSKATDIEPFISGKQVVKVSMPTVSSKRESEEGDEKRTIVAESNVNKSDVSGSLIGDSYSAQLTPELVTLALLPKAQWQSLVNLDIIKMRNKPIEPPKKPEKAPFFLPSLPTLSGERIFIPSSMNGDGDQDETRNDKTVFEARGKKLGGESLSFMQLLQSCAKIKDFTTFTNYLKGLSPSAVDMELRLLQIVDNENISETEHSVELQGIGMLLDYFVNEVSCNNNFEFVQALIRLFLKIHGETIRCQVSLQEKARKLLEIQSSTWERLDTSFQNARCMITFLSSSQF
  224  SEQ ID 493的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MIAAVCWVPKGVAKVLPDSAEPPTQEEIQELLKCNVVAESDDNEDSDEESEEMDTETDKNTDAVAKALAAANALGSQSSDFQRQHKVDDIANGLKELDMDHYDDEDEGIDIFGSGSLGNCYYPANDMDPYLVEQDDDDEDEIEDMTIKPSDLIILSARNEDDVSHLEVWIYEEETEEGGSNMYVHHDIILPAFPLSLAWLDCNLKGGEKGNFVAVGTMQPEIELWDLDVLDEVEPAVVLGGAVKDEASGKTTKLKKKKKNK QAVNFKEGSHTD AVLGLAWNMEYRNVLASASADKSVKIWDIVA EKCEHTMQ PHTDKVQAVAWNPNQATVLLSGSFDRSVIMMDMRA PTHS GIRWPVPADVESLAWDPHTDHSFMVSAEDGTVRGFDIRAAASTADFD GKPMFILHAHDKAVCAISYNPAAPSLLTTGS TDKMVKLWDITNNQ PSCIASTNPNVGAVFSAAFSKNSPFLLATGGSKGILHVWDTLDNSEVARRFGKFRPQN
  225  SEQ ID 494的氨基酸序列。保守真核蛋白激酶域为下划线。   MIMDENEFCDIFSLRKRLCLLSSQEGEEEEELEAMSQLDAGEFTVTGNEEVVAIAEDDVNTGILSQDLFSSQDYCTPSQPQDSTDLDSKDKAPCPLSPVKSTIQRKRCRPELLSNPPDSIQFSFQRLERVRSEESIQSSSQQLARVRSEVSSSDDFKTPKITASGQKNYVSQSALALRARVMSPPCIKNPYLDENEELNEKIQRSTRRSPACVTPIQSGACLSRYRAD FHELEE IGRGNFSRVYKALNRLDGCCYAVKCSQSELRLDTERKVA LMEVQSLAALGPHKNIVGYHTAWFENDHLYIQMELCDHN LTTANDRGILRTDTDFLEAVYQIAQALEFIHGRGVAHLD VKPENIYVRDGTYKLGDFGRATLINGTLHVERGDARYMS REILNDNYEHLDKVDMFSLGATFFELLMRKQYPGSGKRI DRDTEIKIPILPGFSIYFQKLLQDLVSNDPGKRPSAKDVLKNPIFNKVRGAKEV
  226  SEQ ID 495的氨基酸序列。保守G蛋白β WD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MLAPALEMEPVEPQSLKKLSFKSLKRALDLFSPVHGQIAPPDPESKKMRISYKLNFEYGGGSGSEDQVPKRKESGAAQNQGQQAAGASNALALPGPEGSKIPPMEKSQNALTVGPSLRPQGLNDVGLHGKGTAIISASGSSDRNLSTSAIMERLPSRWPRPVWHPP WKNYRVISGHLGWVRSIAFDPSNQWFCTG SADRTIKIWDLAS GRLKLTLTGHIEQIRGLAVSSKHTYM FSAGDDKQVKCWDLEQ NKVIRSYHGHLSGVYCLALHPTI DILLTGGRDSVCRVWDIRS KMQIFALSGHDNTVCSVFAR PTDPQVVTGSHDTTIKFWDLRH GKTMTTLTNHKKSVRAM AQHPKENCFASASADNIKKFQLPR GEFLHNMLSQQKTII NTMAVNEEGVMATGGDNGSLWFWDWKSGHNFQQAHT IVQ PGSLESEAGIYALSYDLTGSRLVSCEADKTIKMWKEDELATPETHPLNFKPPKDIRRF
  227  SEQ ID 496的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MEEAAKEQSAGSGKPRLLRYGLRSAAKPKEDKKEEQLHQPPPPPPPQQQAAPAPAPAATRSSTSGSAGGRDRRPQQQHAVDEKYARWKSLVPVLYDWLANHNLLWPSLSCRWGPQLEQATYKNRQRLYISEQTDGSVPNTLVIANCEVVKPRVAAAEHVSQFNEEARSPFIRKYKTIIHPGEVNRIRELPQNPNIVATHTDSPDVLIWDVESQPNRHAVYGATASRPNLILTGHQENAEFALAMCPAEPFVLSGGKDKTVVLWSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPS VGPRGIYC GHEDTVEDVAFCPSTAQEFCSVGDDSCLILWDARIGT NP VAKVEKAHNGDLHCVDWNPHDNNLILTGSADNSVNMFDRRNLTSNGV GSPVYKFEGHKAAVLCVQWSPDKPSVFGSSA EDGLLNIWDYERVDKKVDRAPNAPAGLFFQHAGHRDKIVDFHWNTADPWTMVSVSDDCDTAGGGGTLQIWRMSDLIYRPEEEVLAELENFKAHVLECSKA
  228  SEQ ID 497的氨基酸序列。保守G蛋白β WD-40重复域为下划线。   MAKDEEEFRGEMEERLVNEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDREEPPGKDYSVQKMILGTHTSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQINHDGEVNRARYMPQNPFIIATKTVSAEVYVFDYSKHPSKPPQDGGCHPDLRLRGHNTEGYGLSWSPFKHGH LL SGSDDAQICLWDINVPAKNKVLEAQQIFKVHEGVVEDVAWHLRHEYLFGSVGDDRHLLIWDLRTSATNKPLHSVVAHQGEVNCLAFNPFNEWV LATGSADRTVKLFDLRKISSALHTFSCHKEEVFQIGWSPKNETI LASCSADRRLMVWDLSRIDEFQTPEDALDGPPELLFIHGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMAENIYHDEEDDMPPEEVV
  229  SEQ ID 498的氨基酸序列。保守周期素依赖性激酶抑制剂域为下划线。   MGKYMRKGKGVGEVAVMEVSQGSLGVRTRARTLAAASSQKDHRRLGASKSVTTKHQSSAPPASPCVESSMHTCYLELRSRKLEKFSRCYHSAHGATSHGESKRSLSLSEPSRLAVSEEARVASDKSSHRVLQQQSSVAHSRNNSATFSHNAKPAKAAQRKERRDDDHTSARPSEAPHEDEDGMEVEASFGENVMDLDSRERRTRETTPSSYTRDVETMETPGSTTRPPSNAGRRRFQTSGGHGTR NQFHVPTTNEIEEFFAGAEQQEQRRFTDRYNYDPVSDSPLPGRFEWVRLRP
  230  SEQ ID 499的氨基酸序列。保守丝氨酸/苏氨酸蛋白激酶域为下划线,且丝氨酸/苏氨酸蛋白激酶活性位点信号为粗体。   MQNMEENVQSSWSLHGNKEICAR YEILKRVSSGTYLDVY RGRRKEDGLIVALKEVHDYQSSWREIEALQRLCGCPNVV RLYEVILEFLTSDLYSVIKSAKNKGENGIPEAEVKAWMI QILQGLANCHANWVIHRDLKPENMLISAYGILKLADFGS MSFLKRAIYEVEYELPQEDILADAPGERLMDEDDSVKGV WNEGEEDSSTAVETNFDDMAETANLDLSWKNEGDMVMQG FTSGVGTRWYRAPDFLYGATIYGKEIDLWSLGCILGELL ILEPLFSGTSNIDQLSRLVKVLGLQQKKNWPGCSNLPDY RKLCFPGDGSPYGLKNHVFNCSDNMFSILERLVCYDPAA RLNAKEIVENKYFVEDPYPVLTHELRVPSPLREENNFSEDWAKWKDMEVDSDLENIDEFNVVHSSDGFCIKFS
  231   SEQ ID 502的氨基酸序列。保守组蛋白去乙酰基转移酶家族域为下划线。   MADVPESLQQEKDEQGTDKNCCDGKFQKEIDIDDMEEEYNESSIDDEEENLSDNVATNNMGTIPQGQACMAVTVEGIEHANSVGCGRNGREGSEEVTAAEDMGHVSIENIREQGRNRKSSEQLLALYEQEGLLEDDEDDDDVDWEPFEGVTVQMKWYCTNCTMANSDDSVHCDSCGEHRNSDILRQGFLASPYLPAESPSSSDVPDERLEESKCVMTTLTPSISPMIGVCCSSLQSE RRTVVGFDERMLLHSEIQMETYPHPKRPDRLRAIAA SLRAAGLFPGKCFSIPAREATCEELQTIHSLEHVNAVES TSCGMLSHLSPDTYANEHSSLAARLAAGLCADLAKAIMT GQAQNGFALVRPPGHHAGVKDSMGFCLHNNAAIAVSASR VVGAKKVLIVDWDVHHGNGTQEIFEADQSVLYISLHRHG EGFYPGSGAVTEVGSSKGEGYSVNIPWKCGGVGDNDYIF AFQHAVLPIAEQFEPDLTIISAGFDAAKGDPLGRCEVTP DGPAHMAQMLSCLSKGKMLVILEGGYNLRSISASATAVI KVLLGDNPKALPIDIQPSKGGLQTLLEVFEIQSKYWSSLKGHDQKLRSQWEAQYGSKKRKVIRKRHMHIVGGPVWWKWGRKRVVYYHWFARVSSRKHL
  232   SEQ ID 503的氨基酸序列。保守亲环素型肽基-脯氨酰基顺-反异构酶家族域为下划线,且亲环素型肽基-脯氨酰基顺-反异构酶信号为粗体。   MASGAGAAGVVEWHQKPPNPKNP VVFFDVTIGTIPAGRI KMELFADIVPRTAENFRQFCTGEYRKAGIPIGYKGCHFH RVIKDFMIQAGDFVKGDGSGCISIYGSKFEDENFIAKHT GPGLLSMANSGPNTNGCQFFLTCAKCDWLDNKHVVFGRV LGEGLLVLRKIENVQTGQHNRPKLPCVIAECGEM
  233   SEQ ID 505的氨基酸序列。保守G蛋白β WD-40重复域为下划线。   MDHYYQDDFDYLVDDEMVDFADDVEDDVRTRRRSDIDSDSENDFDSNNKSPDTTALQAKRGKDIQGIPWNRLNFTREKYRETRLQQYKNYENLPRPRRSRNLDKECTNFERGSSFYDFRHNTRSVKATIVHFQLRNLVWATSKHNVYLMQNYSIMHWSSLKQKGEEVLNVAGPIIPSVKHPGSSPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDKPGVSFCTKISHDENGITNAVEIYNDASGATRLMTANNDLAVRVFDTEKFTVLERFSFPWSVNHTSVSPDGKLVAVLGDNADCLLADCKT GKTV GTLRGHLDYSFAAAWHPDGYILATGNQDTTCRLWDVRKLSSSLAVLKGRMGAIRSIRFSSDGRFMAMAEPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDTEAFFVGVADRTYGSLLEFNRRRMNYYLDSIL
  234   SEQ ID 506的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MDCSGDEEEEQFFESLEEMLSPSDSGSEAADNETGCRNADARSKYEIWKRAPSSIQERRQRFLVRMGLANPSELGNQVNSTSAESTCSTETANIPNGIERLRENSGAVLRTAGSSGRKTHCKNVINIGLREGSVRSSSSSNGTPDVGEDNGEFGGTIFSRSGGTWECMCKIKNLDSGKEFVVDELGQDGLWNKLREVGTDRQLTMDEFERSLGLSPLVQELMRRESGVAQADCNGVHHHDAEISSSKRRSWLKALKSAAYSMRRPKSDQSNYDSERSGRRSGSFDVPWGKPQWTKVRHYRKRYKEFTALYMGQEIEAHEGSIWTMKFSLDGRYLASAGQDCVIHVREVIESMRTFGADTPDLYASSAYFSMNGLQELVPLSIEDHANKMKRGKIIGSKKSSNSDCIVLPNKVFQLS EEPVCSFHGHLLD VFDLSWSPSQYLLSSSMDKTVRLWKLGH ESCLKVFSHND IVTCIQFNPVDERYFISGSLDGKARIWSIPDRQVVDWSDLREMVTAVCYTPDGQGGLVGSIKGSCRFYNTSGNKLQLENQLNVRSKKKKSSGKKITGFQFAPGGDSQKVLITSADSRVRVYNGSELVCKYKGFRNTCSQISASFAPNGQHFVCASEDSRVYIWNHESPRGSGARHEKSSWSHEHFLSQGVSVATPWSGMKLQPPVWNSPEFMLGQRHNLLSLQGGKDVGCQNGLLSREAGEGQESETPLHYISQVSHSCGSQNMVDRDGQDDLSRYSACISDSRLSSFMAFPESPGNPDDLNSKVFFSDSSSKGSATWPEEKLPPTRKQSRSNSTSSHYDTLKTHLGNTIQGQSGASAAVAWGLVIVTAGHGGEIRSFQNYGLPVRL
  235   SEQ ID 507的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MPSIPAIGEFTVCEINRELLTTKDESDTQAKDAYAKILGLVFPPISFQIEEGFGSASRQQFDQDLDREDTIVTPSTSEGTNALQEGGLLLKGVSVLKNILASSFGPIFSPNDTKVLKKVELLQGISWHRHKHILAFISGSNQVTVHDFQDPEWR ES SLLVSESQRGIEALEWRPNGGTTLSVACRGGICIWSASYPGSVAPVRSGVASFLGTSTRGSSVRWTLVDFLQIPGGKAVTALSWSPTGRLLASASREDSSFTIWDVAQGVGTPLRRGLGGISLLKWSPTGDYLFSAKPNGTFYLWETNTWTLEQWSSSGGCVISATWGPDGRMLFMAFSESTTLGSLHFAGRPPSLDAHLLPMELPEIGSITGGFGNIEKMAWDGCGERLAVSYTGGDLMYVGLIAIYDTRRTPFISASLVGFIRGPGEQVKPLAFAFHDKFKQGPLLSVCWSSGLCCTYPLIFRAH
  236   SEQ ID 508的氨基酸序列。保守G蛋白βWD-40重复域为下划线。   MEEENAKHTEETRQVQVRFTTLLQPALRVPTTSIAIPAHLTRYGLSDIVNTLLGNDKPQPFDFLVESELVRTSLEKLLLIKGISAEKILNIEYILAVVPPKQEEPSLHDDWVSVVDGSYPNFIFSGSFDSIGRIWKGEGLCTHVLEGHRDAITSAAFIMPSDSSDSFIN LATASKDRTLRLWQFKPNEHMTNGKMVRPYKLLKGHTSSVQTVSACPRRNLICSGSNDCSIKIWQTAGEMDIESNAGSVKKRELEDSTEQIISQIEASRTLEGHSQCVSSVVNLEKDT IYSASWDHSVRSWDVETGVNSLTVGCRKALHCLSIGGEGSALIAAGGADSVLRIWDPRMPGTFTPILQLSSHKSWITACKWHPKSRHH LISASHDGTLKLWDVRSKVPLTTLEAHKDKVLCADWWKEDCVISGGADSTLQIFSNLNLT
  237 238   SEQ ID 509的氨基酸序列。保守RING型锌指为下划线。SEQ ID510的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且剪接因子基序为粗体。   MNRLRSKRNHILELRLGQSEPEAEATLASNRSRGTNAPIVVEDDDDVVVSSPRSFALARSSVSQRSSRIPIVNEEDLELRLGLAVTGRTSAEHNPRRRHGRVPPNKPIVLCDDAGEADQSSSKKRRTGQQLSSDVQSDESKEVKLTCAICISTMEEETSTI CGHIFCKKCITNAIHRWKRCPTCRKKLAINNIHRIYISSSTGMEEPPPPAVLPSSEDTSIVSSHSFVNAPPTVPVGLDASIPQISTPGINQPGLTIPVPPEAAPLTASLVAASAGMPPAVVPSFVRPAIVAHPSVMPPPSMPLAALPMPVASAVPVAAPHFPPSTPNDNSITPSMPVPTPIVASSSVPPSVTIPGIAPLPFIAPIPVPSSRPVAPSPFMPPARPLGASVSVAMDVDNTDEQDQDADNKGESPSSSPDHPEDPSAAEYEITEESRKVRERQEQAIQELLLRRAAYALAVPTNDSSVRARLRRLNEPITLFGKREMERRDRLRALMAKLDAEGQLEKLMKVQEEEEAAANVDAEEVQEMEGPQVYPFYTEGSQELLKARTEITKFSLPRAVSRLQRARRKREDPDEDEDEELKCVLQQSAQINMDCSEIGDDRPLSGCAFSSDGTLLATSAWSGVTKLWSVPNINKVATLKGHTERVTDVAFSPTNCHLATACADRTAMLWNSE GVLMKTYEGHLDRLARLAFHPSGLYLGTASFDKTWRL WDVNT GIELLLQEGHSRSVYGIAFQCDGSLAATCGLDGL ARIWDLRT GRSILALEGHVKPVLGIDFSPNGYHLATGSE DHTCRIWDLRK RQSVYIIPAHSHLVSQVKFEPQEGYFLV TASYQSTAKVWSARD FKSIKVLAGHEAKVTSVDITAQGQYIATVSHDRTIKLWSSKNSTNDMNIG
  239   SEQ ID 511的氨基酸序列。保守G蛋白βWD-40重复域为下划线,且Trp-Asp(WD)重复信号为粗体。   MKR AYKLQEFVAHASNVNCLKIGKKSSRVLVTGGEDHKV NMWAIGK PNAILSLSGHSSAVESVTFDSAEALVVAGAAS GTIKLWDLEE AKIVRTLTGHRSNCISVDFHPFGEFFASG SLDTNLKIWDIRR KGCIHTYKGHTRGVNSIRFSPDGRWV VSGGEDNIVKLWDLTA GKLMHDFKCHEGOIQCMDFHPQE FLLATGSADRTVKFWDLET FELIGSAGPETTGVRAMIFN PDGRTLLTGLHESLKVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVSRTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQRAGIAFSSKNLPASSGPPSYVSTPKKNSTSRVQPTTNFQTLSRPDIVPVIVPRSNSLRPETTSDAKKEMNNFGRVVPSTVSTKSTDVIKSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHVSSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTFPWSATDDGVTCQPDRQVTAPELSKRVVEPGRARALVASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRGSHGTSESDLTVSDDNSAIEELMQQHNAFTSILQARLTKLQVIRRFWQRNDLKGAIDATGKMGDHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDVIRATISATPTIGVDLQAEQRLERCNLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV
  240   Amino acid sequence of SEQ ID 512. The conserved cyclin N- and C-terminal family domains are underlined.   MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAFPPYCAVNKRVLSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDDDKMADDFPVPMFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYID DLYMFYOKAEASSCV PPNYHDROODINERMRGILIDWLIEVHYKFELMDETLYL TVNLIDRFLAVOPVVKKKLOLVGVTAMLLACKYEEVSVP VVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSV PTPY VFMRRFLKAAOSDKKLELLSFFIIELSLVEYDMLKFPPS LLAASAIYTALSTITRTKOWSTTCEWHTSYSEEOLLECA RIMVTFHORAGSGKLTGVHRKYSTSKFGHAARTEPANFLLDFRL
  241   Amino acid sequence of SEQ ID 513. The conserved cyclin-dependent kinase inhibitor domain is underlined.   MQAPREGKSAAAIVGMGKYMKKSKAIPRDVSLLEASPRSPSATGVRTRAKTLASRRLRRASQRRPPPPAAAAAAAAPSLDASPCPFSYLQLRSRRLRRPRLAPSPEARIDEGPAGSGSRGSRDASCSARTASSSGGVEGEGACVGRGDRGNGGECVRDAAVDASYGENDLEIEDRDRSTRESTPCSLIRDSNANTPPGSTTRQQSSCTAHRTQMS ILRSIPTSDEMEEFFAYAEOROORSFIEKYNFNFDIVKDRPLPGRFEWVOVIP
  242   Amino acid sequence of SEQ ID 514. The conserved GCN5-related N-acetyltransferase family domain is underlined, and the bromodomain is in bold.   MDGHSSHLAAQNRSRGSQTPSPSHSAASASATSSIHLKRKLSAANASAASAAAAAAAAAAAADDHAPPFPPSSISADTRDGALTSNDDLESISARGGGAGDDSDDDSDDEEEDDGDNDGGSSLRTFTAARLENVGPAAARNRKIKAESNATVKVEKEDSAKDGGNGAGVGALGPAATSGAGSGSGTVPKEDAVKIFTENIQASGAYSAREENLKREEEAGRLKFECLSNDGVDDHMVWLIGLKNIFARQLPNMPKEYIVRLVMDRNHKS VMVI RRNLVVGGITYRPYASOKFGEIAFCAIKADEOVKGYGTR LMNHLKOHATDVDGLTHFLTYADNNAVGYFIKQGFTKEIYLDKDRWHGYIKDYDGGILMECKIDPKLPYTDLSTMVRRQRQAIDEKIRELSNCHIVYQGIDFQERDAGVPQNTIKMEDIPGLRBAGWTPDQWGYSRFRGLSDQKRLTFFIRQLLKVLNDHSDAWPFKEFVDAREVPDYYDIIKDPMDLKTMTLRVESEQYYVTLEMFIADVKRMFANARTYNSPDTIYFKIAPRLEAHFQSKVQSNLQSGAGKIQQ
  243   Amino acid sequence of SEQ ID 515. The conserved TPR repeat domain is underlined.   MFNGMMDPELFKLAQEQMNRMSPAELAKIQQQMMSNPELMRMASESMKNMRPEDLRQAAEQLKHVRPEEMAEIGEKMANASPEEIAAVRARADAQMTYEINAAKILKKEGNELHSQGRFKDASQKYLRAKNNLKGIPSSEGKNLLLACSLNLMSCYLKTRQYEECIKEGSEALACEEEN LEAFYRRGQAYRELGQ LKDAVSDLRKAHEISPDDETIAQVLRDTEESLTKEGGSAPRGVVIEEITEEDETLASVNHESPSEYSEKRHQESEDAHKGPINGDIMGQMTNSESLKALEGDPDAIRSFQNFISNADPTTLAAMGAGNAGEVSPDLIKTASSMIGKMSAEELQKMIQLASSFPGENPYVTRNSDSNSNSFGNGSIPNVSPDMLKTASDMMSKMSPDDLQRMFEMASSSRGKDFSLDANHASSSSGANLAANLNHILGESEPSSSYHIPSSSRNISSSPLSNFPSSPGDMOEQIRNQMKDPAMRQMFTSMMKNMSPEMMANMGKQFGLELSPEDAAKAQEAMSSLSPEMLDKMMRWADRAQRGVETAKKTKNWLLGRPGMILAICMLLLAVILHRLGFIGS
  244   Amino acid sequence of SEQ ID 516. The conserved G-protein beta WD-40 repeat domain is underlined.   MIAAISWVPRGASKAVPEVAEPPSKEEIEEILKSGVVERSGDSDGEEDDENMDAVASEKADEVSTALSAADALGRISKVTKAGSGFEDIADGLRELDMDNYDEEDEDVKLFSTGLGDLYYPSNDMDPYLKDKDDDDDTEEIEDLSIKPMOSLIVCARTDDEVNLLEVYLLEPSLSDESNMYVHHEVVISEFPLCTAWLDCPIKGGDKGNF IAVGSMEPAIEIWDLDIIDAVEPCLVLGGOEELKKKKKKGKEASIKYKEGSHTDSVLGLAWNKEFRNI LASASADROVKIWDVAAGKCNITMEHHTDKVQAVAWNHHAPQVLLSGSFDHSVVMKDGRIPSHSGYRWSVTADVESLAWDPHSEHFFVVSLEDGTVRGFDVRAAISNSASQSLPSFTLHAHEKAVSTISYNPAAPNL LATGSTDKMVKLWDLSNNQPSCIASRNPKAGAVFSVSFSEDSPLLLAIGGSKGRLEVWDTSSDAAVSRRFGKHGKPKTAEPGS
  245   Amino acid sequence of SEQ ID 517. The conserved Zn-finger, RING domain is underlined, and the SPX, N-terminal domain is in bold.   MKFCKKYQEYMQCQEGKKLPGLGFKKLKKILKRCRRRDSLHSQKALQAVQNPRTCPAHCSVCDGSFFPSLLEEMSAVLGCFNKQAQKLLELHLASGFQKYLMWFKGKLRGNHVALIQEGKDLVTIALINAIAIRKILKKYDKIHLSTQGQAFKSQVQRMHMEILQSPWLCELIAFHINVRETKANSGKGHALFEGCSLVVDDGKPSLSCELFDSIKLDIDLT CSICLDTVFDSV SLTCGHIYCYMCACSAASVTIVDGLKAAEPKEKCPLCREARVFEGAVHLDELNILLSRSCPEYWAERLQTERVERVRQAKEHWESQCRAFMGVE
  246   Amino acid sequence of SEQ ID 518. The conserved G-protein beta WD-40 repeat domain is underlined.   MVSTQSTRENPSIFFPPPLKPWLLPVVLSLSLSRQLGMAAAAAASLPFKKNYRSSQALQQFYAGGPFAVSSDGSFIACNCGDSIKIVDSSNASLRPSIDCGSDTITALSLSPDGKLLFSAGHSRQIRVWDLSTSTCLRSWKGHDGPVMSMACPVSGGLLATGGAQRKVMVWDVDGGFCTHFFKGHDGVVSTVLFHPDSNRSL LFSGSDDGTIRVWDLLAKKCASTLRGHDSTVTSLAFSEDGLTLLAAGRDKVVSLWDLHNYACKKTIPMYEVLESVCVIHSGTVLASQLGLDDQLKVTKESAQNIHFITVGERGILRIWKSEGSVCLFKQEHSDVTVISDEDDSRSGFTAAVMLPLDQGLLCVTADQQFLFYYPEKHPEGIFSLTLCRRLVGYNEEIVDMKFLGEEENFLAVATNLEQVRVYELASMSCSYVLAGHTETVLCLDTCISSSGRTL IVTGSKDNSVRLW DSESRHCIGVGVGHMGAVGAVAFSRKRQDF FVSGSSDRT LKVWSLDGISEDGVDSTNLKAKAVVAAHDKDINSVAVAPNDSLVCSGSQDRTACVWRLPDLVSVVVLKGHKRGIWSVEFSPVDQCVLTASGDKTVKIWAISDGSCLKTFEGHVSSVLRASFLTRGTQFVSCGADGLVKLWTVRTNECIATYDQHSDKVWALAVGKKTEMLATGGSDAVVNLWYDSTASDKEDAFRKEEEGVLKGQELENAVSDADYTKAIELALELRRPHKLFELFSELCRTREVGDRVERILSALSGEEVCLLLEYIREWNAKPKLCHVAQSVLSQVFRILSPTEIVEIKGIGELLEGLIPYSQRHFSRIDRLVRSTYLLDYTLTGMSVIEPEADRSAVNDGSPDKSGLEKLEDGLLGENVGEEKIQNKEELESSAYKKRKLPRSKDRSKKKSKNVVYADAAAISFRA
  247   Amino acid sequence of SEQ ID 519. The conserved G-protein beta WD-40 repeat domain is underlined.   MDSAPRRKSGGINLPSGMSETSLRLDGFSGSSSSFRAISNLTSPSKSSSISDRFIPCRSSSRLHTFGLVERGSPVKEGGNEAYSRLLKAELFGSDFGSLSPAGQGSPMSPSKNMLRFKTESSGPNSPFSPSILRQDSGFSSEASTPPKPPRKVPKTPHKVLDAPSLQDDFYLNLVDWSSQNTLAVGLGTCVYLWSASNSKVTKLCDLGPNDGVCAVQWTREGSYISIGTSLGQVQIWDGTQCKRVRTMGGHQTRTGVLAWNSRILASGSRDRVILQHDLRV PNEFIGKLVGHKSEVCGLKWSHDDRELASGG NDNQLLVWNQH SQQPVLKLTEHTAAVKAIAWSPHQNGLL ASGGGTADRCIRFWNTTNGHQTSSVDTGSQVCNLAWSKNVNELVSTHGYSQNQIMVWKYP SMAKVATLTGHSLRVLYL AMSPDGQTIVTGAGDETLRFWNVFPSAKAPAPVKDTGLWSLGRTHIR
  248   Amino acid sequence of SEQ ID 520. The conserved G-protein beta WD-40 repeat domain is underlined.   MEDEAEIYDGVRAQFPLTFGKQSKPQTSLESVHSATRRGGPAPAPAPASSSSLPSTTSPSAAGGAGKSSGLPSLSSSSTAWLEGLRAGNPRAGREAGIGSRGGDGEDGGRAMIGPPRPPPGFSANDDGGGEDDDDDGDGVMVGPPPPPPGNLGDGDDDEEEEEAMIGPPRPPVVDSDEEEEEEEEENRYRLPLSNEIVLKGHNKIVSALAVDPTGSRVLSGSYDYTVRMFDFQGMNSRLSSFRDFEPVEGHQVRNLSWSPTADRFLCVTGSAQAKIYDRDGLTLGEFVKGDMYIRDLKNTKGHITGLTWGEWHPKTKET ILTSSEDGSLRIWDVNDFKSQKQVIKPKLARPGRVPVTTCTWDREGKC IAGGIGDGSIQIWNLKPGWGSRPDIHVEQAHADDITGLKFSSDGKI LLTRSFDDSLKVWDLRLMKNPLKVFEDLPNHYAQTNIACSPDEQLFLTGTSVERESTIGGLLCFFDRSKLELVSRIGISPTCSVVQCAWHPRLNQIFATSGDKSQGGTHVLYDPTLSERGALVCVARAPRKKSVDDFELKPVIHNPHALPLFRDQPSRKRQREKILKDPLKSHKPELPMNGPGHGGRVGASKGSLLTQYLLKQGGMIKETWMDEDPREAILKHADAAEKNPKFTRAYAETQPDPVFAKSDSEDEDK
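Table 12. Eucalyptus in silico data.
SEQ ID   ConsID eucSpp   Family   1   2   3   4   5   6   7   8   9   10   11   12
1   3910   Cyclin-dependent protein kinase   0.25   0.11   0.20   0.73
2   19213   Cyclin-dependent protein kinase   0.59   0.64
3   36800   Cyclin-dependent protein kinase   0.11   0.36
4   40260   Cyclin-dependent protein kinase   0.85
5   41965   Cyclin-dependent protein kinase   0.35   0.86
6   2906   Cyclin-dependent protein kinase   0.93   0.81
7   1518   Cyclin-dependent protein kinase   0.08   0.28   0.08   0.06   0.11
8   8078   Cyclin-dependent protein kinase   0.17   3.20
9   9826   Cyclin-dependent protein kinase   0.36   0.23   0.15   0.04   0.24   0.43
10   10364   Cyclin-dependent protein kinase   0.11   1.52   0.13
11   11523   Cyclin-dependent protein kinase   0.15   0.06   0.15   2.40
12   24358   Cyclin-dependent protein kinase   0.76   0.07   0.04   0.24
13   39125   Cyclin-dependent protein kinase   0.23
14   5362   Cyclin-dependent protein kinase   0.68   0.06   0.08   1.17
15   44857   Cyclin-dependent protein kinase   0.68   0.06   0.08   1.17
16   1743   Cyclin A   0.19   2.10   0.06   0.15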
17   12405   Cyclin A   0.06   0.59   2.84
18   3739   Cyclin B   0.42   1.99   0.08   2.33
19   22338   Cyclin B   0.86
20   28605   Cyclin B   0.39   0.04   0.47
21   41006   Cyclin B   0.71
22   6643   Cyclin D   0.85   0.83   0.06   1.06   0.08   0.26
23   45338   Cyclin D   2.03
24   46486   Cyclin D   0.30
25   12070   Cyclin-dependent kinase regulatory subunit   0.24   0.82   0.06   0.26   0.92
26   6617   Histone acetyltransferase   0.08   0.06   0.04   0.55   0.51   0.26
27   7827   Histone acetyltransferase   2.27   0.11   0.04
28   8036   Histone acetyltransferase   1.16
30   1596   Histone deacetylase   0.17   0.16   0.08   2.98   0.88   0.26   0.98   0.71
31   5870   Histone deacetylase   0.19   0.17   0.12   5.43
32   6901   Histone deacetylase   1.21   0.08   2.01   1.16   0.08
33   6902   Histone deacetylase   0.08   0.11   1.21   0.47
34   7440   Histone deacetylase   0.48   1.23   0.15   0.22   0.48   0.20   2.02
35   8994   Histone deacetylase   0.09   0.15
36   24580   Histone deacetylase   0.42   1.22
37   37831   Histone deacetylase   0.08   0.22   0.40   1.19   0.12
38   34958   MAT1 CDK-activating kinase assembly factor   0.15   0.23
39   22967   Peptidyl-prolyl cis-trans isomerase   0.72   0.69
40   8599   Peptidyl-prolyl cis-trans isomerase   0.46   0.08   0.50   0.17   0.51   0.28   3.01
41   9919   Peptidyl-prolyl cis-trans isomerase   0.51   0.35   0.06   0.15   0.43   4.24
42   15820   Peptidyl-prolyl cis-trans isomerase   0.04   6.78
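43   8327   Peptidyl-prolyl cis-trans isomerase   0.06   0.04   6.86
44   4604   Peptidyl-prolyl cis-trans isomerase   0.68
45   966   Peptidyl-prolyl cis-trans isomerase   0.59   1.02   0.54   0.69   0.50   0.93   0.59   0.95   18.65
46   1037   Peptidyl-prolyl cis-trans isomerase   0.59
47   4603   Peptidyl-prolyl cis-trans isomerase   0.17   0.17   1.24   0.04   0.34
48   5465   Peptidyl-prolyl cis-trans isomerase   1.21   0.08   0.66   0.11   0.29   0.16   6.99
49   6571   Peptidyl-prolyl cis-trans isomerase   0.51   0.08   0.41   0.08   1.14
50   6786   Peptidyl-prolyl cis-trans isomerase   0.42   0.33   0.06   0.41   0.04
51   7057   Peptidyl-prolyl cis-trans isomerase   0.42   0.11   0.04
52   8670   Peptidyl-prolyl cis-trans isomerase   1.56   0.39   0.20   0.12
53   9137   Peptidyl-prolyl cis-trans isomerase   0.04   0.59
54   10285   Peptidyl-prolyl cis-trans isomerase   0.60   1.16   0.04   0.04   0.45
55   10600   Peptidyl-prolyl cis-trans isomerase   0.16   0.17   0.06   0.46
56   11551   Peptidyl-prolyl cis-trans isomerase   0.08   0.06   0.04   0.08   1.89
57   20743   Peptidyl-prolyl cis-trans isomerase   0.76
58   23739   Peptidyl-prolyl cis-trans isomerase   0.59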
60   31985   Peptidyl-prolyl cis-trans isomerase   1.99
61   32025   Peptidyl-prolyl cis-trans isomerase   0.99
62   32173   Peptidyl-prolyl cis-trans isomerase   1.99
64   9143   Retinoblastoma-related protein   0.90   0.15
65   349   WD40 repeat protein   0.24   0.34   0.08   0.17   0.22   0.33   0.08   0.25   2.24
66   575   WD40 repeat protein   0.25   0.94   0.31   0.34   0.11   0.16   0.47   1.87
67   804   WD40 repeat protein   0.15   0.34   0.39   0.33   0.39   1.82
68   805   WD40 repeat protein   0.97   0.51   4.66   0.23   0.17   0.77   0.33   1.07   0.24   4.43
69   806   WD40 repeat protein   0.83   0.04
70   2248   WD40 repeat protein   0.08   0.08   1.92   0.06   0.08   0.91
71   3203   WD40 repeat protein   0.34   0.18   0.15   0.17   0.11   0.30   0.04   0.72
72   3209   WD40 repeat protein   0.08   0.15   0.17   0.12   0.61
73   4429   WD40 repeat protein   0.08   1.16   0.08   0.13
74   4607   WD40 repeat protein   0.76   0.54   0.06   0.07
75   4682   WD40 repeat protein   0.08   0.28   0.23   1.13   0.08   0.12
76   5786   WD40 repeat protein   0.08   0.06   0.46   0.08   0.13
77   5887   WD40 repeat protein   1.61   1.23   0.08   0.06   0.15   0.28   1.41
78   5981   WD40 repeat protein   0.08   0.37
79   6766   WD40 repeat protein   0.24   0.08   1.31   0.51   0.06   0.74   0.51   0.28
80   6769   WD40 repeat protein   0.93   0.17   0.12   2.28
81   6907   WD40 repeat protein   0.25   0.17   0.06   0.45   0.32   0.47   1.67
82   7518   WD40 repeat protein   0.91   0.28   0.15   0.55   0.59
83   7717   WD40 repeat protein   0.47   0.38
84   7718   WD40 repeat protein   0.24   1.88   0.08   0.22   0.04   0.92
85   7741   WD40 repeat protein   1.42   0.11   0.47
86   7884   WD40 repeat protein   1.33   0.15   0.24
87   8258   WD40 repeat protein   0.72   0.19   0.23   0.87   0.15   0.08   0.08
88   8465   WD40 repeat protein   0.47   0.08   1.75
89   8616   WD40 repeat protein   0.57   0.08   0.69   0.16   0.13
90   8690   WD40 repeat protein   0.26   0.08   0.35   1.39   0.34   0.32   2.13   0.80
91   8708   WD40 repeat protein   0.57   0.04
92   8850   WD40 repeat protein   0.09   0.06   0.27   2.03
93   9072   WD40 repeat protein   1.21   0.17   0.48
94   9465   WD40 repeat protein   0.24   0.72   0.33   0.15
95   9472   WD40 repeat protein   0.36   1.99   0.11   0.61   6.90
96   9550   WD40 repeat protein   0.90   0.11   1.78
97   10284   WD40 repeat protein   0.24   0.08   1.82   1.22   0.16   0.47   0.28
98   10595   WD40 repeat protein   0.16   0.17   0.11   6.52   0.85
99   10657   WD40 repeat protein   0.06   0.12
100   12636   WD40 repeat protein   0.06   0.65
101   12748   WD40 repeat protein   1.50   0.08   0.06   1.67   0.04   0.38
102   12879   WD40 repeat protein   0.08   0.33   0.06   0.04   0.08   2.00
103   15515   WD40 repeat protein   0.35   0.30
104   15724   WD40 repeat protein   0.25   0.33   0.15   0.47   0.04   0.39
105   16167   WD40 repeat protein   0.24   0.52
106   16633   WD40 repeat protein   1.96   0.12   0.42
107   17485   WD40 repeat protein   0.65
108   18007   WD40 repeat protein   0.12
109   20775   WD40 repeat protein   0.17   0.08
110   23132   WD40 repeat protein   2.42
111   23569   WD40 repeat protein   0.91   0.91
112   23611   WD40 repeat protein   4.15
113   24934   WD40 repeat protein   0.34   0.04
114   25546   WD40 repeat protein   0.09
115   30134   WD40 repeat protein   0.07
116   31787   WD40 repeat protein   0.19   1.19
117   34435   WD40 repeat protein   0.35   0.08
118   34452   WD40 repeat protein   1.44   0.20   0.25
119   35789   WD40 repeat protein   0.20
120   35804   WD40 repeat protein   0.19   0.27   0.08
121   43057   WD40 repeat protein   0.30   0.57
122   46741   WD40 repeat protein   0.46
123   47161   WD40 repeat protein   1.78
235   6366   WD40 repeat protein   0.08   0.68   0.23   0.93   0.11   0.36   0.83   0.24   0.94
236   17378   WD40 repeat protein   0.65   0.12   0.08
252   45414   Cyclin B   3.13
253   44328   Cyclin-dependent kinase inhibitor   0.38
254   15615   Histone acetyltransferase   0.22   0.04
255   17239   Peptidyl-prolyl cis-trans isomerase   0.08   0.50   0.08
256   18643   WD40 repeat protein   0.04   0.90
257   19127   WD40 repeat protein   0.04   0.89
258   22624   WD40 repeat protein   1.16
259   32424   WD40 repeat protein   0.50
260   37472   WD40 repeat protein   0.08   0.17
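In Table 12, the numbers 1-12 denote the following tissues: 1 is bud reproductive; 2 is bud vegetative; 3 is cambium; 4 is fruit; 5 is leaf; 6 is phloem; 7 is reproductive; 8 is root; 9 is sap vegetative; 10 is stem; 11 is whole; and 12 is xylem.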
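The Table 12 legend can be read as a simple lookup from column number to tissue. The following is a minimal sketch in Python, not part of the patent: the names `EUC_TISSUES` and `label_row` are hypothetical, and the function assumes a row whose twelve cells are already aligned to columns 1-12 (with `None` for empty cells), which the flattened rows in this text do not guarantee.

```python
# Column-number -> tissue legend for Table 12 (Eucalyptus), taken from the legend text.
EUC_TISSUES = {
    1: "bud reproductive",
    2: "bud vegetative",
    3: "cambium",
    4: "fruit",
    5: "leaf",
    6: "phloem",
    7: "reproductive",
    8: "root",
    9: "sap vegetative",
    10: "stem",
    11: "whole",
    12: "xylem",
}


def label_row(values):
    """Pair a fully aligned 12-cell row with tissue names.

    `values` must hold exactly 12 entries, one per column; empty cells are None.
    Returns a dict of tissue name -> value for the non-empty cells only.
    """
    if len(values) != len(EUC_TISSUES):
        raise ValueError("expected one cell per tissue column (12 cells)")
    return {
        EUC_TISSUES[col]: value
        for col, value in enumerate(values, start=1)
        if value is not None
    }


# Hypothetical example row: values only in columns 1 and 12.
example = [0.25] + [None] * 10 + [0.73]
print(label_row(example))
```

Because the column positions of the sparse values are not recoverable from the flattened rows, such a lookup is only meaningful when applied to the aligned source table.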
Table 13. Pine in silico data.
SEQ ID   ConsID pinusRadiata   Family   1   2   3   4   5   6   7   8   9   10   11   12
124   1766   Cyclin-dependent protein kinase   1.02   0.05   1.58   0.15   0.22   0.22   0.18   2.16   4.91
125   2927   Cyclin-dependent protein kinase   0.16   0.19   0.11   0.14   0.04   0.36   0.38   0.17
126   7642   Cyclin-dependent protein kinase   0.22   0.21   0.05   0.07
127   13714   Cyclin-dependent protein kinase   0.11   0.11
128   16332   Cyclin-dependent protein kinase   0.54   0.26   0.14   0.04   0.91
129   21677   Cyclin-dependent protein kinase   0.05   0.14   0.17
130   27562   Cyclin-dependent protein kinase   0.41
131   1504   Cyclin-dependent protein kinase   0.16   0.36   0.35   0.21   0.54   0.09   0.65
132   15211   Cyclin-dependent protein kinase   0.13   0.15   0.19   0.19
133   20421   Cyclin-dependent protein kinase   0.04   0.05   0.95
134   3187   Cyclin-dependent protein kinase   0.34   0.15   0.04   0.18   0.38
135   15661   Cyclin-dependent protein kinase   0.04   0.13
136   13874   Cyclin A   0.31   0.27   0.15   0.05
137   14615   Cyclin A   0.16   0.15
138   4578   Cyclin B   0.47   0.14   0.13   0.22   0.74   0.38
139   23387   Cyclin B   0.29   0.26   0.17
140   6970   Cyclin D   0.14   0.27   0.04
141   10322   Cyclin D   0.16   0.19   0.06   0.14   1.12   1.36
142   22721   Cyclin D   0.27   0.36
143   23407   Cyclin D   0.15   0.26   0.31
144   1945   Cyclin-dependent kinase regulatory subunit   0.28   0.55   0.41   0.16   1.62   5.02   0.22   0.72   0.39   3.06
145   8233   Cyclin-dependent kinase regulatory subunit   0.21
146   8234   Cyclin-dependent kinase regulatory subunit   0.16   0.11
147   22054   Cyclin-dependent kinase regulatory subunit   0.05   0.22   0.18
148   12137   Histone acetyltransferase   0.06   1.51   0.19
149   12582   Histone acetyltransferase   0.64   0.15   1.09   0.33   0.63
150   15285   Histone acetyltransferase   0.21   0.12   0.70   0.14
151   17229   Histone acetyltransferase   0.94   0.16
152   20724   Histone acetyltransferase   0.04   0.19   0.19
153   4555   Histone deacetylase   0.16   0.14   0.97   0.14   0.89   0.89
154   4556   Histone deacetylase   0.14
155   5729   Histone deacetylase   0.31   0.28   0.22   0.58   0.22   2.00   0.48   0.07   0.04   2.73   1.46
156   7395   Histone deacetylase   0.14   0.14   0.19   0.93   0.04   0.14   1.33
157   9503   Histone deacetylase   0.11   0.14
158   11283   Histone deacetylase   0.19   0.15   0.96   1.35
159   12322   Histone deacetylase   0.16   0.06   0.11   0.04   0.05   0.29
161   23236   Histone deacetylase   0.13   0.11
162   171   Peptidyl-prolyl cis-trans isomerase   0.07   0.46
163   172   Peptidyl-prolyl cis-trans isomerase   0.19   0.11   0.18   0.11   0.46
164   1480   Peptidyl-prolyl cis-trans isomerase   2.51   4.20   0.88   2.97   1.58   3.53   7.36   1.33   2.74   0.72   6.62   10.14
168   1692   Peptidyl-prolyl cis-trans isomerase   0.16   0.22   0.65   0.61   0.26   0.29   0.18   1.28   0.34
169   5313   Peptidyl-prolyl cis-trans isomerase   0.14   0.07   0.37   0.17
170   6362   Peptidyl-prolyl cis-trans isomerase   0.14   0.33   0.05   0.06   0.60   0.04   2.92   0.68
171   6493   Peptidyl-prolyl cis-trans isomerase   0.42   0.11   0.21   0.11   0.04   0.25   0.32
172   6983   Peptidyl-prolyl cis-trans isomerase   0.61   0.13   0.04
174   7665   Peptidyl-prolyl cis-trans isomerase   0.11   0.39   0.05   0.62   0.25
175   12196   Peptidyl-prolyl cis-trans isomerase   0.19   0.15   0.14   0.16
176   13382   Peptidyl-prolyl cis-trans isomerase   0.25   0.06   0.07   0.04   0.87   0.15
177   16461   Peptidyl-prolyl cis-trans isomerase   0.19   0.15   0.15   0.04   0.04   0.74
178   17611   Peptidyl-prolyl cis-trans isomerase   0.24   0.11   0.27   0.41   0.99
179   19776   Peptidyl-prolyl cis-trans isomerase   0.13   0.07   0.16   0.05   0.61
180   20659   Peptidyl-prolyl cis-trans isomerase   0.15   0.19
181   22559   Peptidyl-prolyl cis-trans isomerase   0.11   0.14   0.20
182   24188   Peptidyl-prolyl cis-trans isomerase   0.23
183   27973   Peptidyl-prolyl cis-trans isomerase   1.01
184   1353   WD40 repeat protein   0.44   0.05   0.73   0.11   1.07   0.70   1.32
185   1978   WD40 repeat protein   0.14   0.05   0.44   0.11   0.21   0.27   0.36   1.46   0.82
186   2810   WD40 repeat protein   0.42   0.79   0.11   0.39   0.27   0.36   1.69   1.03
187   2811   WD40 repeat protein   0.14   0.09   0.14
188   2812   WD40 repeat protein   0.15   0.18   0.04   0.16
189   3514   WD40 repeat protein   0.63   0.06   0.14   0.18   0.48   0.56
190   4104   WD40 repeat protein   0.14   0.25   0.27   0.37   0.36   0.19   0.18   0.39   0.53
191   5595   WD40 repeat protein   0.14   0.25   0.15   0.14   0.07   0.23
192   5754   WD40 repeat protein   0.31   0.14   0.06   0.07   0.16   0.10   0.16
193   6463   WD40 repeat protein   0.16   0.56   0.22   0.43   0.81   0.53   0.21   0.08   1.00   0.70
194   6665   WD40 repeat protein   0.31   0.28   0.45   0.44   0.96   0.07   3.37   2.68
195   6750   WD40 repeat protein   0.14   0.59   0.05   0.37   0.42   0.04   0.18   0.52
196   7030   WD40 repeat protein   0.31   0.40   0.54   0.45   0.37   0.07   1.58   3.41
197   7854   WD40 repeat protein   0.11   0.14   0.05
198   7917   WD40 repeat protein   0.22   0.39   0.13   0.15   0.18   0.56
199   7989   WD40 repeat protein   0.11   0.04   0.11
200   8506   WD40 repeat protein   0.47   0.33   0.11   0.86   0.19   1.28   0.04   1.23   3.12
201   8692   WD40 repeat protein   0.21   0.06   0.11   0.15   0.10   0.87
202   8693   WD40 repeat protein   0.11   0.80   0.25   0.14   0.18   0.53   0.31
203   9170   WD40 repeat protein   0.16   0.11   0.05   0.05
204   9408   WD40 repeat protein   0.33   0.05   0.41   0.15   0.14   0.41   0.33
205   9522   WD40 repeat protein   0.11   0.18
206   9734   WD40 repeat protein   0.11   0.05   0.11   0.15   0.07   0.25   0.11
207   9815   WD40 repeat protein   0.11   0.18   0.14
208   10670   WD40 repeat protein   0.40   0.16   0.11   0.16   0.34   0.31
209   11297   WD40 repeat protein   0.53   0.15   0.16   0.05
210   13098   WD40 repeat protein   0.19   0.11   0.54   0.31   0.14   0.26   1.85   0.14
211   13172   WD40 repeat protein   0.04
212   13589   WD40 repeat protein   0.11   0.06   0.21   0.05   0.37
213   13608   WD40 repeat protein   0.11   0.04   0.59   0.33
214   14299   WD40 repeat protein   0.16   0.05   1.09   0.38
215   14498   WD40 repeat protein   0.21   0.44   0.30
216   14548   WD40 repeat protein   0.16   0.11   0.11   0.82
217   14610   WD40 repeat protein   0.16   0.27
218   16090   WD40 repeat protein   0.43   0.04   0.37   0.85
219   16722   WD40 repeat protein   0.10
220   16785   WD40 repeat protein   0.05   0.13   0.38   0.50
221   17094   WD40 repeat protein   0.29   0.15   0.24   0.81
222   17527   WD40 repeat protein   0.04   0.10
223   17591   WD40 repeat protein   0.14   0.10
224   17769   WD40 repeat protein   0.39
225   18047   WD40 repeat protein   0.05   0.22   0.98   0.15   2.68   0.07   0.19   0.80
226   18414   WD40 repeat protein   0.16   0.15   0.34   0.23   0.19
227   18986   WD40 repeat protein   0.41   0.15
228   19479   WD40 repeat protein   0.05   0.28   0.32
229   20144   WD40 repeat protein   0.43   0.29   0.05
230   22480   WD40 repeat protein   0.15   0.27
231   23079   WD40 repeat protein   0.13   0.04
232   26739   WD40 repeat protein   0.15   0.18
233   26951   WD40 repeat protein   0.21   0.20
234   26529   WEE1-like protein   0.04   0.18
237   888   WD40 repeat protein   0.11   0.18
238   14166   Cyclin-dependent kinase inhibitor   0.16   0.05   0.05
239   3189   Cyclin-dependent protein kinase   0.06
240   9356   Histone acetyltransferase   0.11   0.22   0.46
241   65   Histone deacetylase   0.16   0.22   0.27   0.22   0.24   0.34
242   14197   Histone deacetylase   0.16   0.33   0.05
243   9081   Peptidyl-prolyl cis-trans isomerase   0.11   0.05   0.29   0.26   0.69
244   13417   Peptidyl-prolyl cis-trans isomerase   0.06   0.59
245   5755   WD40 repeat protein   0.16
246   6670   WD40 repeat protein   0.14   0.05
247   7027   WD40 repeat protein   0.14   0.15   1.30   0.15
248   7276   WD40 repeat protein   0.14   0.11   0.05
249   7390   WD40 repeat protein   0.31   0.14   0.11   0.44   1.29   0.38
250   12648   WD40 repeat protein   0.05   0.06   0.05   0.94
251   13171   WD40 repeat protein   0.19   0.63   0.19   0.34
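In Table 13, the numbers 1-12 denote the following tissues: 1 is bud reproductive; 2 is bud vegetative; 3 is callus; 4 is cambium; 5 is meristem vegetative; 6 is phloem; 7 is reproductive female; 8 is reproductive male; 9 is root; 10 is vascular; 11 is whole; and 12 is xylem.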
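The consensus IDs listed in Tables 12 and 13 appear to recur, zero-padded to six digits, inside the oligonucleotide IDs of Table 14 (for example, ConsID 3910 and `Euc_003910_O_4`, or ConsID 1766 and `Pra_001766_O_1`). The following is a minimal Python sketch of that cross-reference, not something the patent itself describes: the function name `parse_oligo_id`, the reading of the `Euc`/`Pra` prefixes as the Eucalyptus and Pinus radiata data sets, and the reading of the trailing number as an oligo index are all assumptions.

```python
import re

# Assumed ID layout: <prefix>_<six-digit consensus ID>_O_<oligo index>
OLIGO_ID = re.compile(r"^(Euc|Pra)_(\d{6})_O_(\d+)$")


def parse_oligo_id(oligo_id):
    """Split an oligo ID into (prefix, consensus ID, oligo index).

    Returns None when the ID does not follow the assumed layout,
    so irregular entries can be reviewed by hand instead of guessed at.
    """
    match = OLIGO_ID.match(oligo_id)
    if match is None:
        return None
    prefix, cons_id, index = match.groups()
    return prefix, int(cons_id), int(index)


print(parse_oligo_id("Euc_003910_O_4"))     # ('Euc', 3910, 4)
print(parse_oligo_id("Pra_005313_ORF_O1"))  # None: does not match the assumed layout
```

Returning `None` rather than raising keeps irregular IDs visible without forcing an interpretation on them.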
Table 14. Oligonucleotide table.
Oligonucleotide SEQ ID  Oligonucleotide ID  Microarray oligonucleotide sequence
521  Euc_003910_O_4  GATTTTAAGTAACTCAATTAGCAGTTCCAACATTAAACCATTATTATTACCCCTTTTATC
522  Euc_019213_O_1  CTCAAAAAGTACTTGGATGCGTGCGGTGACAACGGACTCGAACCGTACACTGTCAAATCT
523  Euc_036800_O_4  TTGTCAAGTTGCAGGACGTAGTGCACAGTGAGAGGCGTCTATATCTAGTTTTTGAGTACT
524  Euc_040260_O_1  GAAGAAATTATATAACTAGATACAAGGTTAGCTAGGTATATAATAGCGGTACAAGTCTTT
525  Euc_041965_O_1  GGACAAATCAAGTAGAACTTCTCTCGGCAGCATCAGTTTTTCTAATCCATGCCTTGTTGC
526  Euc_002906_O_1  CTCAGTTCTGATAATGCCTCGGATATATGGCCGAGTGTTCGCTGGACGGCCTCTTATGTT
527  Euc_001518_O_3  GGAGATTCTGAACTGCAACAGCTCCTACACATTTTCAGACTGTTGGGTACTCCAAATGAA
528  Euc_008078_O_2  GACTGGTAAAATCGTTGCACTAAAAAAGGTCCGGTTTGACAACTTGGAACCTGAAAGCGT
529  Euc_009826_O_4  AAACACCAATCTATCAACACTGTCGAGTTTAGTCACTAGTAGAACCGGAGATAACAAACA
530  Euc_010364_O_1  CTATGATCCTGAGCGCAAGCAAGTTATGACCAATAGAGTCGTTACACTATGGTACCGAGC
531  Euc_011523_O_1  TGTTGTGAAGGTAGTTATAGCCATCGATTAGACAGTGATTAAAGTAGTACCCGTGCCAAT
532  Euc_024358_O_2  CCACATACAAGAGTTGTTACGCTACACATCCTATACCATCAAAGGAACGTTGGAATGCCA
533  Euc_039125_O_3  TATGATCGACACAAGCATTTTGTGTTGGAGCCTCAGCTAATTGTATGTCATCGAGTACTT
534  Euc_005362_O_3  AAAATTTTTGCTACGGATAATGTTGTGAGGCGAGGCAGTCGAAATTACGGAGGTTGACTT
535  Euc_044857_O_1  ATGCAGGGATCAAATTTGTGAGTACTACGTAAAATTTTGCTACGGAGGCGAGGCAGTCGA
536  Euc_001743_O_1  GAAGAATACAGGCTCGTACCTGATACACTGTACCTGACTGTTAACTACATAGATCGGTAT
537  Euc_012405_O_1  TCCACCCTAAATGCGATACGTGAAAAGTATAGACAACAGAAGGTAAACTATTCATTACTG
538  Euc_003739_O_2  AGGCTTCTAGTTGCGTTCCCCCAAACTACATGGATCGGCAGCAGGATATTAATGAGCGGA
539  Euc_022338_O_2  GAGAAAAATGACAGATTGATATCGATGATGATGACTGTCGTGTCATCAGTAGTGTGCTTT
540  Euc_028605_O_5  TTTCCAATTGTAGTTCGTCTTTTATTGTAACAATAAATTGATAGATACTGATTCGAAATA
541  Euc_041006_O_1  ACATTTATGCTAACTATAGGAGAACGGAGAATTGTAGCTGCGTCTCTGCTAACTACATGG
542  Euc_006643_O_1  TTCTGGCTTAAAGGCTATTCTTTGTGCACAATGACCTGAGGGAGGTCTCGACAGACCACT
543  Euc_045338_O_1  TTCATCCGGGTCCTGGTTATCATACTCTTATATATGTTGGGGAATAACGGTTCATATGTT
544  Euc_046486_O_3  GGGTGTGCTTAATAGTTCTTATTAGTCTTAGCTTATTATCTTTGATTGGACATGCTATAA
545  Euc_012070_O_2  CTTGCTAAGTAGACATGTTATATTTCTAATGCTTTGAGAACAATATTACAGTATAATTAG
546  Euc_006617_O_2  AATCATCGACTAGACCGATGGTCAAAGTGGTAATCATGTAATTAAACGCGTTTGTCATTG
547  Euc_007827_O_2  ATGGAAAAATCTATGGATATGAAGGATTGAAGATATCCGTCTGGGTAAGCTGTGTATCAT
548  Euc_008036_O_3  TTATGATTTGAGAAAACCCTTGCAGGCTGCGATTTGCGGATCATGACAGCATAGTTTTGC
549  Euc_001596_O_2  GTTTTGTTGTGAGGGCTTGGTAGGTTTTCATTATATTGTAATGTCGACGACAGAGATTTT
550  Euc_005870_O_3  CCAATTAATGTTACTGCTCAAGCTGACGTACCTGCGAAAAAAGCACCAGTGACTGCTAAT
551  Euc_006901_O_3  TGATGTCAAAACGTAGCTCTTTTTTGTGTGAGCTATCCTGCTAAATTAAACCTCAGCAAA
552  Euc_006902_O_1  ACATGAGTATTATGAATACTTCGGTCCTGACTATACACTTCATGTTGCTCCGAGTAACAT
553  Euc_007440_O_2  GAATTGGCGATCACAATCTACTGTAGTCAATACTCAAGTGGGAGGTGTAAATAGATTCCA
554  Euc_008994_O_1  GATCATGTGTAATCAGTATATCAGGTTAGAAACAGTACTCTTGAGCTTAGCGGGCACTGT
555  Euc_024580_O_2  TCCTGTGAAGGTGGTCGACTCAATCAAAAGGTACCTTGTAGATAAGGTACCTTTTCTCAA
556  Euc_037831_O_5  GCATTTTATACGACGGATAGAGTCATGACCGTATCTTTCCATAAGTTTGGGGACTTCTTC
557  Euc_034958_O_3  CCTCGTTTCTTTGCGGTTCGGACGCATCATGGATGTATCTCCAAAGAGTAATCTGTCGAT
558  Euc_022967_O_2  AATTCAGATCTATTAGTGAAAGTTGGCATGAGTCTCAATCTTAGGGGAATACAGTACGGA
559  Euc_008599_O_3  TGATATGAGTATCATAACTCGGATGGTGACAACTTTGTACTACGGTCGGCACCGGTAGAT
560  Euc_009919_O_1  CATATACAATCTTAGTGGATTAGCTGAGGTCGAAACTGACAAGAGTGATCGCCCGTTGGA
561 Euc_015820_O_2  CATGGCTAACGCTGGCCCTAGCACTAATGGGAGCCAATTTTTCATATGCACTGTAAAGAC
562  Euc_008327_O_2  AACAAAGTCTACCTTGACATTAGCATCGGTAACCCTGTCGGGAAACTAGTCGGAAGAATT
563  Euc_004604_O_2  TGTGCTTGGATATACTGTATAAGCATTCTATATTATGCTTGTTGGCTTCGTTTTGAGGGA
564  Euc_000966_O_1  TTAACGTCGACCGCTTCTCTGCCCCTTGAATTTTCCCGAGAAAACCAGGAACCTGCCAAA
565  Euc_001037_O_1  TGTTGAATACGATGTATTATAATGTTGGTGTCTTGGTGAAATACAGAATTATGCTTGCGT
566  Euc_004603_O_2  ATCGCTGTGGCTGATCTCGTCGCTCCGGCTTTTCATAAAAATCATGGCTGAGGCAATCGA
567  Euc_005465_O_2  CTCGCAACCCTATATCTCGCTCAGGCGAAGAAGTCTGAGGATTTGAAAGAGGTGACTCAC
568  Euc_006571_O_1  TGTTTTTGGGTACACGCAGTTAGGATAACTAGCATGAAAGCCCGATCCCGCATATACAGG
569  Euc_006786_O_2  GAGGACTAGCCGGAACTTCATCGAACTCTCTCGGAGGGGTTACTACGATAACGTCAAGTT
570  Euc_007057_O_1  GATGGCTAGCACTGTGTAGAAAGGTGAATTTAAAGTACTTGTCTACACTGCTTATTAAAT
571  Euc_008670_O_2  TGAGACTGTCTTGGCGTGTATTTTGGAATAAACTATTATCACGTTTTGTTAAATATAATA
572  Euc_009137_O_3  TTACAAAATGGCTCTCAGAAAGTATCGAAAGGCCCTGCGCTATCTGGATATCTGCTGGGA
573  Euc_010285_O_2  AATTTTATGTTTGCTACTGCTTAGTGCTTAATGGACTTGCGTAGGTATTCAAATTACAGA
574  Euc_010600_O_1  TGGAACCGTGGTATCGGCTGACGTTATCCGTGATTTTAAGACTGGAGATAGTTTATGCTA
575  Euc_011551_O_2  CTTTGATGTATCCTCAGTGTACTGCTTTTAGCTATGTATAGATCGAGTCAACTCATTGAA
576  Euc_020743_O_3  TTTTTATTATTTACCTTCGCCTTTACGCTGCATACGTTAATAGGTTATTATTTCCTTCAA
577  Euc_023739_O_1  ATTTGTCCATGACAATCGTAGTCGAAGACACGATACGCTCTTAGATGGTACGGAAATCTG
578  Euc_031985_O_2  TGAATAGAGATAACTTTTCTGAGTGTGAATTGGATATTACGTTGCAAATAGCCGAATGAA
579  Euc_032025_O_2  GCTTTAGGTTAGGGATCCCTGTAAGCTGATGATAGATATTGGAGATGGTACTTGTAAGAT
580  Euc_032173_O_1  TGTTGTGTTTGGAAAGGTGCTGTCTGGGATGGATGTTGTCCACAAGATTGAGGCTGAAGG
581  Euc_009143_O_1  GGAAAGCGGGGAATGAGCATGTGGATATTATCTCTTTCTACAATGAAATATTCATTCCTT
582  Euc_000349_O_1  CATCAGGACGTTGACTCTAATTAAGACATATGTGACAGAGCGCCCTGTTAATGCGGTTAC
583  Euc_000575_O_2  CTTTAGGTTTGATCTGTCTGTTTTGTCTATCCTGCGAGTTTCGAGCATGTGCGTGTGTGA
584  Euc_000804_O_1  CAGCCCCAATAGATACTGGCTCTGTGCCGCTACTGAGAACAGTATTAAAATCTGGGACCT
585  Euc_000805_O_2  AAGAATGAAGCTGATATGAGTGATGGAACTACGGGGGCCATGAGCTCAAATAAGAAGGTC
586  Euc_000806_O_1  TGACTACAATTAGCACCTCACCATTATCGAACTGTATAATTGTGCTTGCCTGCTATTATT
587  Euc_002248_O_4  TTGAAGCGGAAATATATATTTATGCTACTACATAAGTAATGTACTACTTGACAAGATGAG
588  Euc_003203_O_1  TACTCGATGTGGTATAGAATTTATCCAATGTACTCCTAAATGTAGATACATCGTGTATTG
589  Euc_003209_O_2  GCTTCGTCTGATACCACTATCAAGATAATAGGCGTGAGCAATAGCTCTGGATCACAGCAC
590  Euc_004429_O_4  GGTCGGCTTGCTAGTGTATCTGATGACAAGAGCATATCACTCTATGATTACTCATGAAGG
591  Euc_004607_O_3  GAAAGGAGAAAAGCATGGAGATCGATCTCGGAAACCTCGCATTCGACGTCGATTTTCATC
592  Euc_004682_O_1  GATTCAGTACCCGGATTCGCAAGTCAACCGGTTGGAGATAACTCCACATAAGCGGTACCT
593  Euc_005786_O_1  TTCCATGTATCAAGCCGCATCAATGTTTGTCGCTGCAATTAACATGTGTGCAGTCGATCC
594  Euc_005887_O_2  TTCAGCGCATTGTGTAAATGTAGATAGGTGATATATTTCTCGTTGCAATGTAGGGTAAGA
595  Euc_005981_O_2  TCCAATAATCACATTTACCATCAACAGGCATCAGCAACATACTGTTGTAGTGTAATTAAT
596  Euc_006766_O_1  GGGCATTCTGACTACCTGCACTGTATAGCTGCACGGAACTCTTCTAGTCAGATTATAACA
597  Euc_006769_O_1  AATCGTCTGGTAGATTGTCAAAAACTAATAAACCTGTGATTGATCCGGATTCTAGTAATG
598  Euc_006907_O_2  AGTTGAGGATTCTCCACTATGACAGCTCTCATGGCTTGAATCTAAAGTCATCTGGTTTTC
599  Euc_007518_O_1  GAACAATCATTCTGTAGAACACTAGAGTCTATATGCTTGACTGTATCGGTTAATTAATTC
600  Euc_007717_O_1  AGATAGCGATAGAGTTATACTGCATGTACTGAGGTAAATGTTTTGATTACTCCACCCAAT
601  Euc_007718_O_1  AAGAATTGTTAGGAGGTGTATACTTTCTGTAACTGTATTCAATGAGCATACACCTGACGG
602  Euc_007741_O_2  CAACTCATATAATGACTGGATTCTGGCAACCGCGTCTTCAGACACAACAGTTGGACTATT
603  Euc_007884_O_1  AGTGTAAAAGGATGCCCCTAATAGATTATATGCCAAGTGTAGTATATATAATAGTGCTTT
604  Euc_008258_O_2  AAGAATCTACAGTTGTCTTATGCTACTCTATTACTCAATTATGCTGTGCTATTGATTGAG
605  Euc_008465_O_4  TCTGAATACATACTTTGTGGTCTCTATAAAAGACCAATGATACAGGCATGGTCATTAATT
606  Euc_008616_O_5  TAAATCTTCTCATGTGCCTGGCGTAAATTTTGCAGTTATTACTAGACCAAGATAGTTTCA
607  Euc_008690_O_4  ACATGGATTCGATCAATCGCCACATGACAACTAAAACAAGCGGTTCACGTGATTGTAATT
608  Euc_008708_O_4  AGATGAGTATGCTCGGGTGTATGATATTCGCAATTACAAGTGGAATGGATCGCATAATTT
609  Euc_008850_O_5  TCTTTGATTCTGTTGTATGGTGTATCTTATTGTATCTTCTATCTGCCCCCCATGTAATTC
610  Euc_009072_O_1  TTCGTTGTGTAGTACTGGGAGTTACTACTTGTATGTATGTAAATCATGTGGCGTCTGTCC
611  Euc_009465_O_1  GGAGATGTGTAATATGTCTGAGCGGTCACACTCTAGCTGTTACATGCGTAAAGTGGGGAG
612  Euc_009472_O_3  CCACCGTTGCGTAACTCGAATAGCCGGATTTTCGTTTTCGTTTTTATTTCCCCGTTAATT
613  Euc_009550_O_1  TGAGATGCTCTGTGTGAGGACTTTTACGAAACTTGAATGGCCCGTAAGGACAATAAGCTT
614  Euc_010284_O_3  TGGGTTGTTGCGACGGGTTCTACAGATAAGACTGTTAAGTTATTTGATCTACGCAAGATC
615  Euc_010595_O_1  GCAGAGGTGCCTACATATGCTTTAGAATGCTAGTAGCTTGGAAGTGCAACACGCTCGTGA
616  Euc_010657_O_1  AGTAAAGTTTAACGACTATGCATCTGTCGTAGTATCAGCCGGCTATGATCGTTCAGTGCG
617  Euc_012636_O_2  CGTTAGGATAGTCTTTAAAGGAGTTGGTGATTATTGATTTCCACCCAATATATGTAGCGT
618  Euc_012748_O_2  GAGCAAGCTACTTACAAAAATCGACAGCGTCTTTACCTATCTGAACAGACAGATGGCAGT
619  Euc_012879_O_2  TCCTTCCGACAAGTACCGTATTGCAAGTTGTGGTATGGACAATACGGTTAAAATCTGGTC
620  Euc_015515_O_1  TTTCACTCGATGACGGTTGGCCGGATAAATAATCGCTTATATAGTCCTAATAAGTTCCAT
621  Euc_015724_O_3  ATATGTAGGTGGTAGAGGTGTGGATATTGCATAGACCGAACCTCCGCAGGTCCGCATTCT
622  Euc_016167_O_1  CCATTGAACTACTTATGGATTACTTTATACATGAAATATCATGCCGGAGTAATTTTGAGT
623  Euc_016633_O_3  AGCATTAGAGACCTGGATTTTAGTCTAGATTCAGAGTTTTTGGCTACGACATCTACTGAT
624  Euc_017485_O_3  AAAGGTTTATCCCTCATTGGATTTGATATATAAACTGAGAGTGTTTTGCCCCCCATTAAA
625  Euc_018007_O_1  GTACAGCGTGTATTTCTTGTTACGATACTTGAGGGGTTAGAGGCACCTACGAATTAGGAA
626  Euc_020775_O_3  ATATCCTTATGAATGAAGTTTGGATGATAAGTGGCGCCAGACTTTCTACTCACCCTTTTT
627  Euc_023132_O_3  TGATCACATCGTTGTTTGCAATAAGACGTCATCAATTTATATCATGACTCTACAGGGACA
628  Euc_023569_O_2  TTTTCCCAGTGTACTGCGAGAGTGATGCTACATAAGTTTACTCTTGTGTCTAACTTTTCC
629  Euc_023611_O_1  AGATTCTACAGATGGCGCTATACGAGCTGTTATACGGACATTTTATGACCATACACATCC
630  Euc_024934_O_3  TGCTACGGGAAACCAGGACAAAACTTGTAGGATTTGGGACATACGAAACTTATCTAAGTC
631  Euc_025546_O_1  CAAGTCATATAGTTACAGTGTCGCATGACAGAACAATTAAGCTCTGGACTAGTAACGACG
632  Euc_030134_O_2  TGCCACATCGTAACCATCATAGCACTTATCATCTAATTATGGTGAAAGGGAGTTATATAT
633  Euc_031787_O_5  GTTTATACTTATAAACAACAGAGAGACAACTGTACAGGTGTTGTAAACACTCCCAGTGTG
634  Euc_034435_O_1  CTGTGTTTTAGCCCGAGGGCCAATCACTTAGTTGCTACTTCGTGGGATAATCAGGTACGG
635  Euc_034452_O_3  GCAAAGTAGAGTTTAAGTTTCGTTGTGCTTGGACCGGAAAACTCACATGCTTAGAGTTTA
636  Euc_035789_O_5  AAGATTTGGGCATAACTTGTATGAACTTTTTCTGTTGTCGACACTGTAATTACACGAGCT
637  Euc_035804_O_4  AAACAGATGCATGTATGCTTCATAACTCTATAGATATGGAAATGTCACTGTACACTGATC
638  Euc_043057_O_2  TTATTGGTGCACAGGACGGAAAATTGCGCATATATTCTATTTCAGGTGATACATTAACAG
639  Euc_046741_O_1  AGGCACAGACACTTGCCTAAACCAATATACAAGGCAGGTATTCTAAGGCGCACCGTGAAT
640  Euc_047161_O_4  CATGCGAAGGTTTCTGGGAATTTTCAGTAGAAAATTCGGTCGTGGCGGCCATCCTCGATA
641  Pra_001766_O_1  TTAAGCTGATAGCTTTAGTTCCTACGTGGAATGTATAAATGCACCATTGTCCATAAGGCA
642  Pra_002927_O_2  GGATGCTCTGGTTACATGACTACTCCTTAGGGAATCAGTCAGACATTTTAAATAACTTCC
643  Pra_007642_O_2  TCATTAAGCGGTACTGGCAGAGGACATGTCTATTTATACAAGCAAATGGTCCTATTGGCT
644  Pra_013714_O_1  ATGTTGGTCAGACCTCAAATATTGTACTCCCCACACTAGGGAGCATTTACGGTGAATATA
645  Pra_016332_O_1  TCCTCTCGACCCTTAGAGTCCTCTGCGAATCTTGTTGTTAGTTACTGTGTACGCTGTAAC
646  Pra_021677_O_3  AAGCATGTTTTGAATTTATGGTGGTGGCATGTGGATATTTGAACTTGGTTGAGAAAAATT
647  Pra_027562_O_2  CATTCCTATTGAAGGGTCAACCTTTAATTTTGGCTAGCAGGACTGTATAGGATTATATGC
648  Pra_001504_O_2  TTATTGTATTTTAGATTCTTGATGGCCATCTAAACTTCTGGCTGCTTGGTGCAACATTGA
649  Pra_015211_O_2  ATAGCTAATGATTCCATGCTATCCATGGTATCTACTTCACGATAATAAAGGTCTTAGTCC
650  Pra_020421_O_2  CACCTAATAGGCCTGAGTATTGCTCACCACTATGCTGATATGGGGAGCAATAACGTTAGT
651  Pra_003187_O_2  TTTCTTTTCACTTTGTACTAATGATCATTGTGACCACAAAATCTTTATACACAATACAGA
652  Pra_015661_O_1  CTTGTCACTATCCTCATATTGATATCACCTCGTGTATGTTGTGGGGTGGCAAAATTACTT
653  Pra_013874_O_1  TATTTTAACTCAGCGACTTACCAGCCTAGTAAGCAATGGGGAGCTTGCATGTATTAGTTT
654  Pra_014615_O_1  ATTCGTCCTGGTCCTTTAGGACATGTACTTATGTCCATGCAAGTGCTTCTTGCCTAAGCT
655  Pra_004578_O_2  TTCTAGGCGATATATATCGCCGTAACTTTGGATGTGTTAAGAATATAGGGGATCATTAGC
656  Pra_023387_O_3  AGTTGCAGAGTGTGTAGCAACTGATGAGCATAGTTGTTATGTTTCTCAACTCAGTTGCAC
657  Pra_006970_O_1  AAGAAACTCATACACTGGACAGGCCAACCTTCCAAATATGTGTTTAGAAAACCTTTGTCT
658  Pra_010322_O_1  AAGGGGTGCTATCCATATCTAGAATCTACCATGCTCAATGAGGTATCTTCATTAGTATAC
659  Pra_022721_O_1  ATCTAATGCTAGTTTATTGATTTCTATGATCCAAGACCTCGTCATAGATCAAGTGCCTAG
660  Pra_023407_O_1  TTGTTATTAAATACCATTCAATATGCTTATGATTCATGAATGCTTAAGAGATTCTGCTGC
661  Pra_001945_O_2  GCTTCTAAACTGTAGAAGCCTGTTATCTTTAGACTCGTGGTTATGTGAACTACTTTTACA
662  Pra_008233_O_1  GGCTGTGGGGATTCGAGCCTGATGGTTATGCACTGTGGCCAGCAAGATGTTGAAGTTTTA
663  Pra_008234_O_4  GCCTGATGGTTATGCACTGTAAGTGATCTGATTTGATTAACTATTTTATCAATTAATTTT
664  Pra_022054_O_2  ATGGTCATTATCCGAGATAGTGCGCTTTGTCATGGGAAAATGACTATTGAATGTGAGTTT
665  Pra_012137_O_2  TTTTCTGGTGCATCCTTAACACAGCTTGGTTACATGGTGAATTACAGTATTTGAAGGAGT
666  Pra_012582_O_2  AGATTTAATGCCACTTAGGTGATCGGTGACCCACTTGTACATATAGATGTTGGCGATGTT
667  Pra_015285_O_2  AAGAAATTCATCAATTCTTTGAAATTATTGTTCCCTTTTGATGCGGCCCCTTTCTGGAGG
668  Pra_017229_O_1  TAAAGTATATTTTAGCCGCTGTTGTTGTAAATTTATGTTTTTCATTGCTATCAACATTTA
669  Pra_020724_O_2  GGTTTTCCTATAAGATGTATGAATTCGCACTGTGGTGCAATTTTATGAATTAAACTCAAA
670  Pra_004555_O_1  TTTACTATTCCGTCTGGGCTTAGAGATGTACGTTAATTGGTCATTTAAGACGACTCAGTT
671  Pra_004556_O_5  TCAAATCTAGTCAATATCCGTGTTGAGCTAAACAAGCGCTGAAAGTTTGCTCGAATCAGC
672  Pra_005729_O_2  AGAAAGTTGTGTACTAATTTGTATTGTAACGTCCATTTATCCAACGAGTCCTCCATTCAT
673  Pra_007395_O_3  CAGTACTGTATTCGAAGATCCTGAAAATTTACTAAAACAAATGGAATATCAACAACCTAG
674  Pra_009503_O_1  TTGCTCTATATAATTTGTGCTCGTGTGTGTACTTGAAGATCCATCCTCACATAGTCCAAT
675  Pra_011283_O_1  GTGTGTATAGTTTTATAACACTCTATGGTATCACTACCACTATGGGCCTGTTTAGTCCAA
676  Pra_012322_O_3  GAAGCAGAATCAGCTTTGACCAGTATTTAGTGTCTTGTATACAATTCTTGTTTCAGTGAA
677  Pra_023236_O_3  AAATCAAGATTAAAATCCGAAACCAAGGCTAACCAGCAAACTGTGAGGTGTACATTGTTG
678  Pra_000171_O_2  TTCCAAGCAGAAGGGCACATGTTGTGACATCAAGTAGTAGATTGTTCTGCAGATTCTGGT
679  Pra_000172_O_1  GTTAATGTAATACATTTAGTTTTTAGATAACTGTTAATGTGTAGTAAAGCACTAGGAAGA
680  Pra_001480_O_3  GAGGCTTCAAAGGTTTTTGTGTCTTTTCTAGTTATTATAAACGCTTCATAGGTTCCTAGG
681  Pra_001692_O_2  GAAGATTGTAAGTTGGGTGAACTTTTTTACCACGCTAGGTTGATCTATTTTAAGACTCTT
682  Pra_005313_ORF_O1  AAAATAGCTGCGCGTACCACAAAGGTGACAAACGCCGGATTTCTCTTATCAGACTTGTCA
683  Pra_006362_O_1  TTTAATTATCATAGTTTTATTCCGGCTATCTTGATCATTCACGGAAGTCCCGAGAGTCAA
684  Pra_006493_O_3  GTGGAGTGAACGTGGTTACTTCAATGGATTACCCTTCTATCGTGTCATTAAACACTTTGT
685  Pra_006983_O_1  GCTAACTCTTCTAGTTGAGATCTCCATCAATTAATGGATACAAACATTGAGTTTCACTTT
686  Pra_007665_O_1  GGATCACTACTGGATTCCGTTACATTAGTTATTGCAAGTTGGTTATTATGTACGTTTATA
687  Pra_012196_O_1  ATGAACAAATGCAATTACCCTGTTTTATTCTATCCCGCTTTAATTAATATTGGTCATGTT
688  Pra_013382_O_1  TTTGCTTGTGGATTGTACTGTGGTACATGGTATAAATCTATAGGCTATGTCGATTATTTT
689  Pra_016461_O_1  ATATAAGATATAAGATATTGCCAGCAAACTATTTGACAGGTTATTTAATAAAGTGTGCTA
690  Pra_017611_O_1  TTTTAAATGTGGACAGAGGCACTATAAGAATGCGAAATATCGTCGGAGCACGACTAATTG
691  Pra_019776_O_1  ATAGACTAGTTCTACAAAGCCCTAGGATGATGGACTTCATTTCTTTTGCATTAAGATGAA
692  Pra_020659_O_1  GATTTCTTATGGGGTTGGAACATTCCTCGCTGCCTTCTGGTAATATTAGGTTATGCGTTT
693  Pra_022559_O_3  AATTGAGGTTGACTGTGTACTTCTCCAGTGGACAGGAGAAAGCGATAAAATTCAAACGTT
694  Pra_024188_O_5  AAGGAAGGGCAAATAGAGCTCGCGCTCAAGAAATACCTTAAATCGATACGGTATTTGGAT
695  Pra_027973_O_2  TAATTTAAGAGCTATGAAACAACTACCTTTTGGAATGGTTTTGTTTTTAGCATCCCAATT
696  Pra_001353_O_1  TTGTAAATTATGCTGGTTCCATATGGGGGTTAATCAGTATCCTGGTTATTTGTGACACCA
697  Pra_001978_O_3  GTTGTGAACTATCAATAGACGGGGATGGTCCTTTTTAGCTGCTCCTTAAGCAGCTCAAAT
698  Pra_002810_O_2  TCAATTCCGGTCATATGTAGACGACTATAATGTTGTTTGTGTCCTATAACTATAGTGTTG
699  Pra_002811_O_1  CATTTTACACCCTATAACAAAATATAGTGTCATAAGTTTACACCAGGTAACAACTCTATA
700  Pra_002812_O_3  ATGGAGAGTTTTATTCATTACATGAAAGAGTATGTCACCTTTCGTGCTCCATCTATTGAT
701  Pra_003514_O_1  TTTCACGTCCTGTATACTCACTCAAGCAACTTTAGGATGAAGAGCTAAAGTATATCAAAG
702  Pra_004104_O_2  AATGCACTCTTTATAAAGTGGGATGAGGTATGTGTTTCCTTCCTATTGGCTAACCTGAAT
703  Pra_005595_O_1  ATTGGGCAATCGTTATTGATTTTACCTATCGCTATCTCACTGTCCGCCAATTTAGTGTAA
704  Pra_005754_O_1  TTTCAGCGGATATAAAGTCTTCCAACTTGTAAACCGGTGCTGTGAAGATTAAAAGTCCTT
705  Pra_006463_O_1  GCTTTAGAGGCAATGGTAGATTATGAAGTCAACACCAGGGAGTTTGACCGTTTGGGACAT
706  Pra_006665_O_1  CATTCAATTTGACATTGGAGTTTCAAGGCATTCCAAGGATAGCATGTACACAAGTTGAAT
707  Pra_006750_O_1  CATAAAATTACTATGGAAGTTGGATCATTATCTATGCCATAGTGGAGTAGAACTAGATTT
708  Pra_007030_O_1  CTCTTGATTCTAGAATCTAAACTACTACCTTGCGGACATGACTGAGCATCTCTCTAACAG
709  Pra_007854_O_1  CAGGGTTGTGCTAGTTTAACATTTTAACTTAATGTAATCATGTAAGCTTTAGAGAGGTGG
710  Pra_007917_O_1  GTAAATGTTTACATTGAGGTCATGCATGAGTGTTAATTACGCTTTCACTACTGTTCACTT
711  Pra_007989_ORF_O2  AATTAAAGCTTGGTTGTATGATCATTTGGGATCGAGAGTAGATTATGATGCTCCTGGGCA
712  Pra_008506_O_1  TTATCTAGCTAGAAGTTGTGAAATTAAGAGGGATGTGAGGATTGGGTTATAACTAGTGTA
713  Pra_008692_ORF_O2  AATGAATCAGGCATTAAAGCGGGAATCATTTATGACTTGGCAACCTGAAAATTCTATTAA
714  Pra_008693_O_2  TTCTTGACGTTTTAATATGGTATGGTATTAAATTTGGAAGGCCTATTCGATTGTTTGCAA
715  Pra_009170_O_1  TTCTTATAACCTGTACGATTGCCGATATATCACCAATTTTGCTGATTTTAATCTGAGTTT
716  Pra_009408_O_1  CAATTTCATATTCGGGTTCAATGTAGTGCCTCTCATTTTAGGGTGATAGCATGAGTTTTT
717  Pra_009522_O_1  TCCACAAGTTAACATAGGTAACTATCGACTGAAGTGAACTGGGGGGCAGAAGCTAACTAT
718  Pra_009734_O_2  TTTAGATAGCCATTTACATTTTACTTATTATTGGACTTGTAAAGATTTTTGTACCCTTGT
719  Pra_009815_O_4  TTGCTGAAATATTTCAAGCTGAAAGTTATGATTCTGGCCAAGAAGTCTACTGAAAATTTG
720  Pra_010670_O_2  AAACATAAGTTTGGCCCAGATTCGGTTTATCATAAAATCTGGCTGCATATAAGGTGTCAG
721  Pra_011297_O_1  ATGTTCTAGAATTTGTCTAAGCTAGCTACTGGTGTTTAACTGATATGGAAAACTTTTGCC
722  Pra_013098_O_2  TTTGGGGAGTACTTTAGTCAATAAAAGTGAAGTGAATCATGATATAAAGGGTTTAAGTAA
723  Pra_013172_O_2  AGAAGTTACTAATTTGTAGATAAATTCTAACGAAGGTGATGATAGCATACACGTAATGAA
724  Pra_013589_O_2  GAATTTTGATGGTAGCGTATGGTTGAAGGAAAACTTGGATATATCATGTAAACATTTTTC
725  Pra_013608_O_1  TTAATGAACCGCTTTTTCCTTGAGAGGCTATGAATGCCTGTAGAACTAATCCTTTAAGTA
726  Pra_014299_O_2  TTTCTCTAACACTATATTTTCTGGTATGACCGCTCTACATTGTATATTAACCCTTGCAAA
727  Pra_014498_O_1  TATATTCACTGTGCTGGGATTATCCTCTCCCCTTTTTGACCCACTGTTGTGTGTATTTGA
728  Pra_014548_O_1  GAGCATACAGCGTTATCTTTGAGACGAGTCATCAATGATAATATCCTCGTAAAAGGTTAC
729  Pra_014610_O_2  TTTATTCAATTACGACGGATTCAGTTGGCCTTTTGTAACATTCAAGTATCCATCTATCAC
730  Pra_016090_O_2  ATGTTCAGGGGTATTAAAAATTCAGAGGATAAATTTCCTCACTCTCAAGTGTTAGATGGT
731  Pra_016722_O_2  CAAAGTCTAGACGTTAATGTTTTGGAACTCTTTTTTCGAATTTGTGCCTATTGAATCACT
732  Pra_016785_O_3  TATAAATATATTGTACTGGGGATCCAAGACATGGCAATATATGTCGAGATTTTCATTTTC
733  Pra_017094_O_3  CTTTTGCATGAGTTCAAATGTCTTTGTGACATATTGTCTTGAACCACCGAGGATATATCA
734  Pra_017527_O_2  GTTTGTATGTCCAATAGATTATAACCTATTTACTGTGACACTATTCTTCACACCCATGTC
735  Pra_017591_ORF_O2  AGATCTAGTTGTTTCAGCATCGTTGGACCAAACTGTTCGTGTATGGGATATAAGTGGCCT
736  Pra_017769_O_2  TGCCGTATCAAAAGATTGGTACTTCCTTATGGACACACAAGATCGTAAGCATGGCTGAAT
737  Pra_018047_O_2  TTGATGGCCACATGAGTTGTTTATACAAGTCGTTGTTTTATGAGAGAACCTTCTTCAGAT
738  Pra_018414_O_1  ATTTCTATAGTGCCATATGCTTGTCGGTTGTCATTGACCTCTAATAGAATAGCCAGAGTA
739  Pra_018986_O_1  TTCACGGCAGTTGAACTAGTCATAGTGGAATATTATTTAAATGGTGTATTCTAGTCACAT
740  Pra_019479_ORF_O1  TGCAGGCGCTCTATAGTTCTGTTCTCTAGCATGAAGTGTGTATTTTATCTATTGTGGACC
741  Pra_020144_O_1  TGTCTTTAATCTTCAGGGTTCGTTACTAACAATTGAGCTCAAATCTCTATTCTGACCAGC
742  Pra_022480_O_1  CATTTATAGAGTTGTGCAAAATCACCCATAATGCTATGAATTGACAGGTGACTGTAATCT
743  Pra_023079_O_2  GGAGAAAATTTCCTATCCCTTTGTGGGTGTGTGAAAAACGAAATATAGAGGAACAATGTG
744  Pra_026739_O_2  ACCAATCATTTATTTGCAGTGTAGTTGATATGAAGGGAGAAATATGACAGTTGGTTTCAA
745  Pra_026951_O_2  AAGTTAATGTTCTCATAGGTTATTCATTGGAGTTGTCTCGTATGTACGCTGTGCCGTAGT
746  Pra_026529_O_2  CTCATAAATTGAGGCTTGCCTACGTTAATTGTTATATATGGAGAGCCATGCTAATTGTTA
747  Euc_006366_O_2  GCAGATCATGTAATTGTATCTCAAATTATAGTATCCGTATTCTGTACAAATGCTCCGGAA
748  Euc_017378_O_1  TCTTTACGCAGATGGTGACTGAAGCTGGTTCCGAGATCGGCATATGTAGCTGGTAGAGGT
749  Pra_000888_O_1  TTCACATTGAGGGTTGCCGTCGGTATTCGCCGATGATATCCTGTTTTACGCGCAACAGTT
750  Pra_014166_O_1  TCATTATTTAGGGTGCAGGCTGTATAAAATGTTGTAAATTGTAGTATCAATGTGTACAAT
751  Pra_003189_O_1  GCATTCACCACGACAGTAAAGTAATCATTATGATTACTAATGTATTGCTTTCATGGGGTG
752  Pra_009356_O_4  AAAGGGTATATTTTGTCTCATGTTGGGGTGATAATTCTCCCTGAAAGTCTCCAAAATATA
753  Pra_000065_ORF_O_2  AAATTTCCGGTTGCCATAGTCTAGTGGGGTGAGGGTTCATTCTAGGGGATTTATTGTGTT
754  Pra_014197_ORF_O1  GCAGTGATAAAGGTACTTCTTGGTGATAATCCTAAAGCCTTACCCATGGATATCCAGCCT
755  Pra_009081_O_2  TTCTTTAACAAGGTAAAAATCCCCCCCTTGGCATGTAGCTCAATTAGTTGTAATGGAACT
756  Pra_013417_O_1  AGTTGTAAACAGTGTAATAAGGAGCAGAAGTTGTGATAGCTTTTAGGAACGATAGACTTT
757  Pra_005755_O_1  TGAACCAATTCTTGTATATTAGATATGTAACATGTATGAATGTCCATAGAGCAGAGCTTT
758  Pra_006670_O_2  AGCCAGGCACGCTTAACTAAATTTCGTTTAGTTCACCATGACTATTCGTTGAACTTAATG
759  Pra_007027_O_1  CAAAACCCCTTGTAGGGTGGACTTCTGTTGTATCCAATTTTTATGGCATAATTAGCTAGT
760  Pra_007276_O_1  AATTTGGTGATTATTCCTTACCATATCGTACTGTACAGATACGGTAAGGTCGAAATATAT
761  Pra_007390_ORF_O1  CATGCCGTGATCGGTCGATTGCATTAAGTGCTGCAAGGATCAAATAGTGGCACTGTCATG
762  Pra_012648_ORF_O1  CAAACATAAATAAGGTTGCTACTTTAAAGGGACATACGGAACGAGTTACTGATGTGGCAT
763  Pra_013171_O_2  ATTTATGGATGAGGTACTCCTTATGAATATCTTCAAACTAAGAAATAACTATATATGCAA
764  Euc_045414_O_2  CTTGGTTTTTGTTGAGCTTTCTATTTCAAGCAATTTGTGATTGGGGGGTTCTGCATTCTT
765  Euc_044328_O_2  ATGTCTAAAGAGCCGTGATCTATGAGTAGATTAGAAACCGCCTTTTTAGTTGCAAACGCC
766  Euc_015615_O_2  TTGCAACAAGGTATACTTAGTCAGTCCTTGTTATGTATGTCTTTTGTCAACCCTTCAGGG
767  Euc_017239_O_3  GGCGGAATCCCTTTGTTCTTTCGAGCTTTACGTGACAAGTCGGCCAGAAAGCAGTAGCAT
768  Euc_018643_O_3  TTGATGTACGAGCCGCTATATCTAATTCTGCCTCCCAGTCACTGCCAAGTTTTACTCTTC
769  Euc_019127_O_5  GTCTTGCATGTCAGCTATTATACAGTCCTGTTTATAGTCCTGTGATGTAATAAAAAGCTG
770  Euc_022624_O_3  AAGTAGGAGATCGTGTAGAGAGAATACTTTCTGCTCTCAGCGGCGAAGAGGTTTGTCTGC
771  Euc_032424_O_1  AATTGTGAGTAGAATAGGAGAAACTTTTGTACAAGATTAATACGTGTGGCATAATAAGAT
772  Euc_037472_O_1  TGATGTGCAGTTTACATTATTATGGTTCGAGTATTATTTAGCTGCCCTATCTTAAGTCAT
Table 15. Peptide table.
Protein SEQ ID    Target    Patent peptide sequence    Patent ORF start    Patent ORF stop
261 CDK type A  MGDGSLGSGGRGNSGGGGGGGSRPEWLQQYDLIGKIGEGTYGLVFLARIKHPSTNRGKYIAIKKFKQSKDGDGVSPTAIREIMLLREISHENVVKLVNVHINPVDMSLYLAFDYADHDLYEIIRHHRDKVNQAINPYTVKSLLWQLLNGLNYLHSNWIIHRDLKPSNILVMGEGEEQGVVKIADFGLARVYQAPLKPLSDNGVVVTIWYRAPELLLGAKHYTSAVDMWAVGCIFAELLTLKPLFQGQEVKANPNPFQLDQLDKIFKVLGHPTQEKWPMLVNLPHWQSDVQHIQRHKYDDNALGNVVRLSSKNATFDLLSKMLEYDPQKRITAAQALEHEYFRMEPLPGRNALVPSSPGDKVNYPTRPVDTTTDIEGTTSLQPSQSASSGNAVPGNMPGPHVVTNRPMPRPMHMVGMQRVPASGMAGYNLNPSGMGGGMNPSGIPMQRGVANQAQQSRRKDPGMGMGGYPPQQKQRRF 387 1820
262 CDK type A  MEKYQQLAKIGEGTYGIVYKAKDKKSGELLALKKIRLEAEDEGIPSTAIREISLLKQLQHPNIVRLYDVVHTEKKLTLVFEFLDQDLKKYLDACGDNGLEPYTVKSFLYQLLQGIAFCHEHRVLHRDLKPQNLLINMEGELKLADFGLARAFGIPVRNYTHEVVTLWYRAPDVLMGSRKYSTQVDIWSVGCIFAEMVNGRPLFPGSSEQDQLLRIFKTLGTPSLKTWPGMAELPDFKDNFPKYVVQSFKKICPKKLDKTGLDLLSRMLQYDPAKRISAEQAMGHPYFKDLKLRKPKAAGPGP 99 1007
263 CDK type A  MDQYEKIEKIGEGTYGVVYKAIDRSTNKTIALKKIRLEQEDEGVPSTAIREISLLKEMQHGNIVKLQDVVHSERRLYLVFEYLDLDLKKHMDSCPEFSKDTHTIKMFLYQILRGISYCHSHRVLHRDLKPQNLLLDRRTNSLKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSVGCIFAEMVNRRPLFPGDSEIDELFKIFRIMGTPNEDSWPGVTSLPDFKSTFPKWASQDLKTVTPTVDPAGIDLLSKMLCMDPRRRITAKVALEHEYFKDVGVIP 120 1004
264 CDK type A  MVMKSKLDKYEKLEKLGEGTYGVVYKAQDKTTKEIYALKKIRLESEDEGIPSTAIREIALLKELQHPNVVRIHDVIHTNKKLILVFEFVDYDLKKFLFNFDKGIDPKIVKSLLYQLVRGVAHCHQQKVLHRDLKPQNLLVSQEGILKLGDFGLARAFGIPVKNYTNEVVTLWYRAPDILLGSKNYSTSVDIWSIGCIFVEMLNQPLFPGSSEQDQLKKIFKIMGTPDATKWPGIAELPDWKPENFEKYPGEPLNKVCPKMDPDGLDLLDKMLKCNPSERIAAKNAMSHPYFKDIPDNLKKLYN 23 937
265 CDK type A  MDQYEKVEKIGEGTYGVVYKAIDRLTNETIALKKIRLEQEDEGVPSTAIREISLLKEMQHGNIVRLQDVVHSENRLYLVFEYLDLDLKKHMDSSPDFAKDPRLVKIFLYQILRGIAYCHSHRVLHRDLKPQNLLIDRRTNALKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSVGCIFAEMVNQRPLFPGDSEIDELFKIFRILGTPNEDTWPGVTALPDFKSAFPKWPAKNLQDMVPGLNSAGIDLLSKMLCLDPSKRITARSALEHEYFKDIGFVP 149 1033
266 CDK type B-1  MEKYEKLEKVGEGTYGKVYKAKDKATGQLVALKKTRLEMDEEGVPPTALREVSLLQLLSQSLYVVRLLSVEHVDGGSKRKPMLYLVFEYLDTDLKKFIDSHRKGPNPRPVPAATVQNFLYQLLKGVAHCHSHGVLHRDLKPQNLLVDKEKGILKIADLGLGRAFTVPLKSYTHEVVTLWYRAPEVLLGSAHYSIGVDMWSVGCIFAEMVRRQALFPGDSEFQQLLHIFRLLGTPIEKQWPGVTTLRDWHVYPQWEPQNLARAVPSLGPDGVDLLSKMLKYDPAERISAKAALDHPFFDSLDKSQF 199 1116
267 CDK type B-2  MERPATAAVSAMEAFEKLEKVGEGTYGKVYRAREKATGKIVALKKTRLHEDEEGVPPTTLREISILRMLSRDPHIVRLMDVKQGQNKEGKTVLYLVFEYMETDLKKYIRGFRSSGESIPVNIVKSLMYQLCKGVAFCHGHGVLHRDLKPHNLLMDKKTLTLKIADLGLARAFTVPIKKYTHEILTLWYRAPEVLLGATHYSTAVDMWSVGCIFAELVTKQALFPGDSELQQLLHIFRLLGTPNEKMWPGVSSLMNWHEYPQWKPQSLSTAVPNLDKDGLDLLSQMLHYEPSRRISAKAAMEHPYFDDVNKTCL 41 982
268 CDK type C  MGCVLGREVSSGIVTESKGRDSSEVETSKRDDSVAAKVEGEGKAEEVRTEETQKKEKVEDDQQSREQRRRSKPSTKLGNLPKHIRGEQVAAGWPSWLSDICGEALNGWIPRRANTFEKIDKIGQGTYSNVYKAKDLLTGKIVALKKVRFDNLEPESVRFMAREILILRHLDHPNVVKLEGLVTSRMSCSLYLVFEYMEHDLAGLAASPAIKFTEPQVKCYMHQLLSGLEHCHNRRVLHRDIKGSNLLIDNGGVLKIGDFGLASFYDPDHKHRMTSRVVTLWYRPPELLLGANDYGVGIDLWSAGCILAELLAGKPIMPGRTEVEQLHKIYKLCGSPSEEYWKKYKLPNATLFKPREPYRRCIRETFKDFPPSSLPLIETLLAIDPAERGTATDALQSEFFRTEPYACEPSSLPQYPPSKEMDAKKRDDEARRLRAASKGQADGSKKERTRDRRVRAVPAPEANAELQHNIDRRRLISHANAKSKSEKFPPPHQDGALGFPLGASHRFDPAVVPPDVPFTSTSFTSSKEHDQTWSGPLVDPPGAPRRKKHSAGGQRESSKLSMGTNKGRRADSHLKAYESKSIA 291 2042
269 CDK type C  MYSKSSAVDDSRESPKDRVSSSRRLSEVKTSRLDSSRRENGFRARDKVGDVSVMLIDKKVNGSARFCDDQIEKKSDRLQKQRRERAEAAAAADHPGAGRVPKAVEGEQVAAGWPVWLSAVAGEAIKGWLPRRADTFEKLDKIGQGTYSSVYKARDVTNNKIVALKRVRFDNLDTESVKFMAREIHILRMLDHPNVIKLEGLITSRMSCSLYLVFEYMEHDLTGLASRPDVKFSEPQIKCYMKQLLSGLDHCHKHGVLHRDIKGSNLLIDNNGILKIADFGLASVFDPHQTAPLTSRVVTLWYRPPELLLGASRYGVEVDLWSTGCILGELYTGKPILPGRTEVEQLHKIFKLCGSPSDDYWRRLHLPHAAVFKPPQPYRRCVAEIFKELPPVALGLLETLISVDPSQRGTAAFALRSEFFTASPLPCDPSSLPKYPPSKEIDMKLREEEARRRGAAGGKNELEKRGTKDSRTNSAYYPNAGQLQVKQCHSNANGRSEIFGPYQEKTVSGELVAPPKQARVSKETRKDYAEQPDRASFSGPLVPGPGFSKAGKELGHSITVSRNTNLSTLSSLVTSRTGDNKQKSGPLVSESANQASRYSGPIREMEPARKQDRRSHVRTNIDYRSREDGNSSTKEPALYGRGSAGNKIYVSGPLLVSSNNVDQMLKEHDRRIQEHARRARFDKARVGNNHPQAAVDSKLVSVHDAG 107 2236
270 CDK type C  MGCIPTIISDGRRRSAAPDKRRPRPRRSSSEGEAPPHATAAGSEGGESARGAPGKERPEPAPRFVVRSPQGWPPWLVAAVGHAIGEFVPRCADSFRKLAKIGEGTYSNVYKARDLVTGKTVALKKVRFDNLEAESIKFMAREILVLTRLNHPNVIKLEGPVTSENSSGLYLAFEYMEHDLSGIAARQNGKPTEPQVKCFMRQLLSGLEHCHNHDVLHRDIKCSNLLIDNEGNLKIADFGLATFYDPERKQVMTNRVVTLWYRAPELLLGATSYGIGIDLWSAGCILAELLYGKPIMPGRTEVEQLHKIFKLCGSPSEAYWNKFKLPNANIFKPPQPYARCIAETFKDFPPSALPLLETLLSIDPDERGTATTALNSEFFAAEPHACEPSSLPKYPPSKEMDLKLIKEKTRRDSSKRPSAIHGSRRDGIHDRAGRVIPAPEATAENQATLHRPRAMKKANPMSRSEKFPPAHMDGVVGSSANAWLSGPASNAAPDSRRHRSLQNPSSSVGKASTGSSTTQETLKVAPELLQVGSSSLHPCHRMLVYGSNLTIRSK 82 1749
271 CDK type C  MGCICAKQADRGPASPGSGILTGAGTGTGTRSSKIPSGLFEFEKSGVKEHGGRSGELRKLEEKGSLSKRLRLELGFSHRYVEAEQAAAGWPSWLTAVAGDAIQGLVPLKADSFEKLEKIGQGTYSSVFRARELANGRMVALKKVRFDMFQPESIQFMAREISILRRLDHPNIMKLEGIITSRMSNSIYLVFEYMEHDLYGLISSPQVKFSDAQVKCYMKQLLSGIEHCHQHGVIHRDVKSSNILVNNEGILRIGDFGLANILNPKDRQQLTSHVVTLWYRPPELLMGSTSYGVTVDLWSVGCVFAELMFRKPILRGRTEVEQLHKIFKLCGSPPDGYWKMCKVPQATMFRPRHAYECTLRERCKGIATSAMKLMETFLSIEPHKRGTASSALISEYFRTVPYACDPSSLPKYPPNKEIDAKHREEARRKKARSRVREAEVGKRPTRIHRASQEQGFSSNLAPKEKRSYA 151 1560
272 CDK type C  MAVAAPGHLNVNESPSWGSRSVDCFEKLEQIGEGTYGQVYMAKEKKTGEIVALKKIRMDNEREGFPITAIREIKILKKLHHENVIKLKEIVTSPGPEKDEQGRPEGNKYKGGIYMVFEYMDHDLTGLADRPGMRFSVPQIKCYMRQLLTGLHYCHINQVLHRDIKGSNLLIDNEGNLKLADFGLARSPSNDHHANLTNRVITLWYRPPELLLGATKYGPAVDMWSVGCIFAELLHGKPIFPGKDEPEQLNKIFELCGAPDEINWPGVSKIPWYNNFRPTRPMKRRLREVFRHFDRHALELLERMLTLDPSQRISAKDALDAEYFWADPLPCDPKSLPKYESSHEFQTKKKRQQQRQHEETAKRQKLQHPPQHPRLPPVQQSGQAHAQMRPGPNQLMHGSQPPVATGPPGHHYGKPRGPSGGAGRYPSSGNPGGGYNHPSRGGQGGSGGYNSGPYPPQGRAPPYGSSGMPGAGPRGGGGNNYGVGPSNYPQGGGGPYGGSGAGRGSNMMGGNRNQQYGNQQ 82 1644
273 CDK type C  MGCICTKGILPAHYRIKDGGLKLSKSSKRSVGSLRRDELAVSANGGGNDAADRLISSPHEVENEVEDRKNVDFNEKLSKSLQRRATMDVASGGHTQAQLKVGKVGGFPLGERGAQVVAGWPSWLTAVAGEAINGWVPRRADSFEKLEKIGQGTYSSVYRARDLETNTIVALKKVRFANMDPESVRFMAREIIIMRKLDHPNVMKLEGLITSRVSGSLYLVFEYMDHDLAGLAATPSIKLTESQIKCYMQQLLRGLEYCHSHGVLHRDIKGSNLLVDNNGNLKIGDFGLATFFRTNQKQPLTSRVVTLWYRPPELLLGSSDYGASVDLWSSGCILAELFAGKPIMPGRTEVEQLHKIFKLCGSPSEEYWKKSKLPHATIFKPQQPYKRCLLETFKDFPSSALGLLDVLLAVEPECRGTASSALQNEFFTSNPLPSDPSSLPKYPSSKEFDARLRDEEARKHKATAGKARGLESIRKGSKESKVVPTSNANADLKASIQKRQEQSNPRSTGEKPGGTTQNNFILSGQSAKPSLNGSTQIGNANEVEALIVPDRELDSPRGGAELRRQRSPMQRRASQLSRFSNSVAVGGDSHLDCSREKGANTQWRDEGFVARCSHPDGGELAGKHDWSHHLLHRPISLFKKGGEHSRRDSIASYSPKKGRIHYSGPLLPSGDNLDEMLKEHERQIQNAVRKARLDKVKTKREYADHGQTESLLCWANGR 626 2782
274 CDK type D  MDPDPSPDPDPPKSWSIHTRREIIARYEILERVGSGAYSDVYRGRRLSDGLAVALKEVHDYQSAFREIEALQILRGSPHVVLLHEYFWREDEDAVLVLEFLRSDLAAVIADASRRPRDGGGGGAAALRAGEVKRWMLQVLEGVDACHRNSIVHRDLKPGNLLISEEGVLKIADFGQARILLDDGNVAPDYEPESFEERSSEQADILQQPETMEADYTCPEGQEQGAITRFAYLREVDEFKAKNPRHEIDKETSIFDGDTSCLATCTTSDIGEDPFKGSYVYGAEEAGEDAQGCLTSCVGTRWFRAPELLYGSTDYGLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIFNVLGNLSEEVWPGCTKLPDYRTISFCKIENPIGLESCLPNCSSDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVPQSKNSHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP 13 1467
275 CDK type D  MDPDPSPSPDPPKSWSIHTRREIIARYEILERVGSGAYSDVYRGRRLSDGLAVALKEVHDYQSAFREIEALQILRGSPHVVLLHEYFWREDEDAVLVLEFLRSDLAAVIADASRRPRGGGVAPLRAGEGKRWMLQVLEGVDACHRNSIVHRDLKPGNLLISEEGVLKIADFGQARILLDDGNVAPDYEPESFEERSSEQADILQQPETMEADTTCPEGQEQGAITREAYLREVDEFKAKNPRHEIDKETSIYDGDTSCLATCTTSDIGEDPFKGSYVYGAEEAGEDAQGSLTSCVGTRWFRAPELLYGSTDYGLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIFNVLGNLSEEVWPGCTKLPDYRTISFCKIENPIGLESCLPNCSSDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVPQSKNSHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP 113 1558
276 Cyclin A  MSNQHRRSSFSSSTTSSLAKRHASSSSSSLENAGKAFAAAAVPSHLAKKRAPLGNLTNLKAGDGNSRSSSAPSTLVANATKLAKTRKGSSTSSSIMGLSGSALPRYASTKPSGVLPSVNPSIPRIEIAVDPMSCSMVVSPSRSDMQSVSLDESMSTCESFKSPDVEYIDNEDVSAVDSIDRKTFSNLYISDAAAKTAVNICERDVLMEMETDEKIVNVDDNYSDPQLCATIACDIYQRLRASEAKKRPSTDFMDRVQKDITASMRAILIDWLVEVAEEYRLVPDTLYLTVNYIDRYLSGNVMNRQRLQLLGVACMMIAAKYEEICAPQVEEFCYITDNTYFKEEVLQMESSVLNYLKFEMTAPTVKCFLRRFVRAAQGVNEVPSLQLECMANYIAELSLLEYDMLCYAPSLVAASAIFLAKFVITPSKRPWDPTLQHYTLYQPSDLGNCVKDLHRLCFNNHGSTLPAIREKYSQHKYKYVAKKYCPPSIPPEFFHNLVY 187 1686
277 Cyclin A  MNKENAVGTKSEAPTIRITRSRSKALGTSTGMLPSSRPSFKQEQKRTVRANAKRSASDENKGTMVGNASKQHKKRTVLNDVTNIFCENSYSNCLNAAKAQTSRQGRKWSMKKDRDVHQSGAVQIMQEDVQAQFVEESSKIKVAESMEITIPDKWAKRENSEHSISMKDTVAESSRKPQEFICGEKSAALVQPSIVDIDSKLEDPQACTPYALDIYNYKRSTELERRPSTIYMETLQKDVTPNMRGILVDWLVEVSEEYKLVPDTLYLTVNLIDRSLSQKFIEKQRLQLLGVTCMLIASKYEEICPPRVEEFCFITDNTYTSLEVLKMESRVLNLLHFQLSVPTVKTFLRRFVQAAQVSSEVPSVELEYLANYLAELTLVEYSFLKFLPSLMAASAVLLARWTLNQSDNPWNLTLEHYTKYKASELKAAVLALEDLQLNTSGSTLNAIREKYRQQKVNYSLLIHSKANHEIL 238 1653
278 Cyclin B  MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAPPYPCAVNKRVLSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDDDKMADDFPVPMFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYIDDLYMFYQKAEASSCVPPNYMDRQQDINERMRGILIDWLIEVHYKFELMDETLYLTVMLIDRFLAVQPVVKKKLQLVGVTAMLLACKYEEVSVPVVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSVPTPYVFMRRFLKAAQSDKKLELLSFFIIELSLVEYDMLKFPPSLLAASAIYTALSTITRTKQWSTTCEWHTSYSEEQLLECARLMVTFHQRAGSGKLTGVHRKYSTSKPGHAARTEPANFLLDFRL 235 1539
279 Cyclin B  MASRPIVPVQARGEAAIGGGAGKAAIGGGAGKQQKKNGAAEGRNRKALGDIGNLVTVRGIEGKVQPHRPITRSFCAQLLANAQAAAAAENNKKQAVVNVNGAPSILDVPGAGKRAEPAAAAAAAVAKAAQKKVVKPKQKAEVIDLTSDSERAIEAKKKQQHHEPTKKEGEKSSRRNMPTLTSVLTARSKAACGMTKKPKEKVVDIDAGDAHNELAAFEYIEDIYTYYKEAENESLPRNYMSSQPEINEKMRAILVDMLIEIHNKFDLMPETLYLTINIIDRFLSVKAVPRRELQLLGMGALFTASKYKEIWAPEVNDLVCIADRAYSHEQVLAMEKTILGKLEWTLTVPTHYVFLVRFIKASLGDRKLENMVYFLAELGVMNYATLTYCPSMVAASAVYAARCTLGLTPLWNDTLKLHTGPSESQLMDCARLLVGYHAKAKENKLQVVYKKYSSSQREGVALIPPAKALLCEGGGLSSSSSLASSS 158 1618
280 Cyclin B  MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKRGHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDVEDCQPSSENQPVPMFLEIPESRLDDDMEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIEDIYANYRRTENCSCVSANYMAQQADTNEKMRSILIDWLIEVHDKFDLMHETLFLTVNLIDRFLARQSVVRKKLQLVGLVAMLLACKYEEVSVPVVGDLILISDKAYTRKEVLEMESLMLNSLQFNMSVPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYEMVKFPPSLLAAAAIFTAQCTLYGFKQWTKTCEWHSNYTEDQLLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEPANFLLGEMKNP 205 1530
281 Cyclin B  MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKRGHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDVEDCQPSSENQPVPMFLEIPESRLDDDNEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIEDIYANYRRTENCSCVSANYMAQQADINEKMRSILIDWLIEVHDKFDLMHETLFLTVNLIDRFLARQSVVRKKLQLVGLVAMLLACKYEEVSVPVVGDLILISDKAYTRKEVLEMEKLMLNSLQFNMSVPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYEMVKFPPSLLAAAAIFTAQCTLYGFKQWTKTCEWHSNYTEDQLLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEAANFLLGEMKNP 174 1499
282 Cyclin D  MAMVQRQGHDPSSPQEQEDGPSSFLSDDALYCEEGRFEEDDGGGGGQVDGIPLFPSQPADRQQDSPWADEDGEEKEEEEAELQSLFSKERGARPELAKDDGGAVAARREAVEWMLMVRGVYGFSALTAVLAVDYLDRFLAGFRLQRDNRPWMTQLVAVACLALAAKVEETDVPLLVELQEVGDARYVFEAKTVQRMELLVLSTLGWEMHPVTPLSFVHHVARRLGASPHHGEFTHWAFLRRCERLLVAAVSDARSLKHLPSVLAAAAMLRVIEEVEPFRSSEYKAQLLSALHMSQEMVEDCCRFILGIAETAGDAVTSSLDSFLKRKRRCGHLSPRSPSGVIDASFSCDDESNDSWATDPPSDPDDNDDLNPLPKKSRSSSPSSSPSSVPDKVLDLPFMNRIFEGIVNGSPI 94 1332
283 Cyclin D  MEASYQPHHHGHLRQHDPSSSQQEEQVPFDALYCSEEHWGEEDEEEGLASDGLLSEERDHRLLSPRALLDQDLLWEDEELASLPSKEEPGGMRLNLENDPSLADARREAVEWIMRVHAHYAFSALTALLAVNYWDRFTCSFALQEDKPWMTQLSAVACLSLAAKVEETQVPLLIDFQVEDSSPVFEAKNIQRMELLVLSSLEWKMNPVTPLSFLDYMTRRLGLTGHLCWEFLRRCENVLLSVISDCRFTCYLPSVIAASTMLHVINGLKPRLDVEDQTQLLGILAMGMDKIDACYKLIDDDHALRSQRYSHNKRKFGSVPGSPRGVMELCFSSDGSNDSWSVAASVSSSPEPHSKKSRAGEEAEDRLLRGLEGEEDDPASADIFSFPH 176 1342
284 Cyclin D  MALQEEDTRRHYPTAPPFSPDGLYCEDETFGEDLADNACEYAGGGARDGLCEIKDPTLPPSLLGQDLFWEDGELASLVSRETGTHPCWDELISDGSVALARKDAVGWILRVHGHYGFRPLTAMLAVNYLDRFFLSRSYQRDRPWISQLVAVACLSVAAKVEETQVPTLLDLQVANAKFVFESRTIQRMELLLMSTLDWRMNSVTPISFFDHILRRFGLTTNLHRQFFWMCERLLLSVVADVRLASFLPSVVATAAMLYVNKEIEPCICSEFLDQLLSLLKINEDRVNECYELILELSIDHPEILNYKHKRKRGSVPSSPSGVIDTSFSCDSSNDSWGVASSVSSSLEPRFKRSRFQDQQMGLPSVNVSSMGVLNSSY 150 1283
285 Cyclin-dependent kinase regulatory subunit MGQIQYSEKYFDDTYEYRHVVLPPDVAKLLPKNRLLSENEWRAIGVQQSRGWVHYAIHRPEPHIMLFRRPLNYQQQQENQAQQNMLAK 101 367
286 Histone acetyltransferase  MGSIDPPKAEQNGTAAAAVADPGQKPGAGDAMPPPPPVKHSNGTAAEPDVATKRRRNSVLPLEVGTRVMCRWRDGKYHPVKVIERRKLNPGDFNDYEYYVHYTEFNRRLDEWVKLEQLDLNSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDCSKLYFCEFCLNFMKRKEQLQRHMKKCDLKHPPGDEIYRSGTLSNFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLLSYKGYWTRVLLDILKKHKANISIKELSDNTAIKADDILNTLQSLDLIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQG 9 1352
287 Histone acetyltransferase  MAQKHSTAPDPAAEPKKRRRVGFSGIDAGVDPNGCFKVYLVSREEEVGAPDSFCLDPVDLSHFFEEEDGKIYGYEGLKISVWVSCVSFHSYAEIAFESKSDGGKGITDLNTALKNMFGETLVDNKDDFLQTFSKETQFIRSTVSAGEILKHKHSDDHVNDSVSNLKVGSDVEAVRMLMGDMTAGHLYSRLVPLVLLLVDGSSPIDVTDSSWELXLLIQKTSDQQGNFHDRLLGFAAVYRFYHYPDSSRLRLGQILVLPLYQRKGYGRYLLEVLNNVAIADDVYDFTIEEPVDNLQHLRTCIDVQRLLSFDKVQQAVNSTVSQLKQGKLSKKTYIPRLLPPPSVVEDARKRFKINKKQFLQCWEILVYLGLDPADKSIQDYFSVISNRVRADILGKDSETAGKKVIEVPSDFDPEMSPVMHRAKAGGEANGIQVEDNQNKQEEQLQQLIDERLKDIKLIAEKVTQK 89 1486
288 Histone acetyltransferase  MAQKHSTAPDPAAEPKKRRRVGFSGIDAGVDPNGCFKVYLVSREEEVGAPDSFCLDPVDLSNFFEEEDGKIYGYEGLKISVWVSCVSFHSYAEIAFESKSDGGKGITDLNTALKNMFGETLVDNKDDFLQTFSKETQFIRSTVSAGEILKHKHSDGHVNDSVSNLKVGSDVEAVRMIMGDMTAGHLYSRLVPLVLLLVDGSNPIDVTDSSWELYLLIQKTSDQQGNFHDRLLGFAAVYRFYHYPDSLRLRLGQILVLPLYQRKGYGHYLLEVLNNVAIADDVYDFTIEEPVDNLQHLRTCIDVQRLLSFDKVQQAVHSTVSQLKQGKLSKKTYIPRLLPPPSVVEDARKRFKINKKQFLQCWEILVYLGLDPADKSIQDYFSVISNRVRADILGKDSETAGKKVIEVPSDFDPENSFVLHRAKAGGETNGIQVEDNQNKQEEQLQQLIDERLKDIKLIAQKVSRK 80 1477
289 Histone deacetylase MALPMEFWGVEVKAGQPLKVNPGNAKILHLSQASLGECKSSKGNESVPLHVKFGDQKLVLGTLSTENFPQLAFDLVFEKEFELSHNWKSGSVYFCGYKSVVHDDDDEFSDLESDSEEEDLPMIGVENGKVAAQASAKTATASANASKVESSGKQKARIPQPMKVDEDDSDEDDDDEDEDESDEEGVDGEADSDEEEDESDEEETPKKAEIGKKRAADSATKTPVPAKKSKLPTPQKTDGKKGGHTATPHPAKQAGKNPANSANKSQSPKSAGQVSCKSCSKTFNSDGALQSHSKAKHGGK 160 1062
290 Histone deacetylase MEFWGVEVKAGQPLKVNPGNAKILHLSQASLGECKSSKGNESVPLHVKFGDQKLVLGTLSTENFPQLAFDLVFEKEFELSHNWKSGSVYFCGYKSVVHDDDDEFSDLESDSEEEDLPMIGVENGKVAAQASAKTATASANASKVESSGKQKASIPQPMKVDEDDSDEDDDEDDDDEDESDEGVDGEADSDEEEDESDEEETPKKAEIGKKRAADSATKTPVPAKKSKLPTPQKTDGKKGGHTATPHPAKQAGKNPANSANKSQSPKSAGQVSCKSCSKTFNSDGALQSHSKAKHGGK 172 1077
291 Histone deacetylase MEFWGVEVKSGEPLNVEPGAETVVHLSQACLGETKEKTKESVLLYVHIGVQKLVLGTLSADKFPQIPFDLVFEKSFKLSHNNKNGSVFFSGYKTLLPCGSDADSPYSDSDTDEGLPINVTAQADVPAKKAPVTANANAAKPNLASAKQKVKIVESNEDGKNEGDDDEDADVSSDDDAEDDSGDEDMVDGGDESSDEDDDDSEEGESSEEEEPKAQPSKKRPADSVLKTPASDKKSKLETPQKTDGKKASEHVATPYPSKQAGKAIASKGQAKQQTPNSNEFSCKPCNRSFKSDQALQSHNKAKHGGS 66 989
292 Histone deacetylase MDTGGNSLPSGPDGVKRKVCYFYDPEVGNYYLLQHMQVLKPVPARDRDLCRFHADDYVATLRSITPETQQDQLRQLKRFNVGEDCPVFDGLHSFCQTYAGGSVGGAVKLNHGLCDIAINWAGGLHHAKKCEASGFCYVNDIVLGILELLKQHERVLYVDIDIHHGDEVEEAFYTTDRVMTVSFHKFGDYFPGTGDIRDIGYGKGKYYSLNVPLDDGIDDESYHSLFKPIIGKVMEVFKPGAVVLQCGADSLSGDRLGCFNLSIKGHAECVRYMRSFNVPVLLLGGGGYTIRNVARCWCYETGVALGLEVDDKMPQHEYYEYFGPDYTLHVAPSNMKNKNSRQLLEEIRSKLLENLSKLQHAPSVPFQERPPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLPSRVKRELIVEPEVKDQDSQKASIDHGRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNVNKPSEQIFPK 111 1541
293 Histone deacetylase MDTGGNSLPSGPDGVKRKVCYFYDPEVGNYYYGQGHPMKPHRIRMTHALLAHYGLLQHMQVLKPVPARDRDLCRFHADDYVAFLRSITPETQQDQLRQLKRFNVGEDCPVFDGLHSFCQTYAGGSVGGAVKLNHGLCDIAINWAGGLHHAKKCEASGFCYVNDIVLGILELLKQHERVLYVDIDIHHGDGVEEAFYTTDRVMTVSEHKFGDYFPGTGDIRDIGYGKGKYYSLNVPLDDGIDDESYHSLFKPIIGKVMEVFKPGAVVLQCGADSLSGDRLGCFNLSIKGHAECVRYMRSFNVPVLLLGGGGYTIRNVARCWCYETGVALGLEVDDKMPQHEYYEYFGPDYTLHVAPSNMENKNSRQLLEDIRSKLLENLSKLQHAPSVPFQERPPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLPSRVKRELIVEPEVKDQDSQKASIDHGRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNVNKPSEQIFPK 116 1615
294 Histone deacetylase MRPKDRISYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVLSYELHTKMEIYRPHKAYPAELAQFHSPDYVEFLHRITPDTQHLFPNDLAKYNLGEDCPVFENLFEFCQIYAGGTIDAARRLNNQLCDIAINWAGGLHHAKKCEASGFCYINDLVLGILELLKYHARVLYIDIDVHHGDGVEEAFYFTDRVMTVSFHKFGDMFFPGTGDVKEIGGKEGKFYAINVPLKDGIDDTSFTRLFKAIISKVVETYQPGAIVLQCGADSLAGDRLGCFNLSIDGHSECVRFVKKFNLPLLVTGGGGYTKENVARCWVVETGVLLDTELPNEIPENEYFKYFAPDYSLKIPRGNIVLENLNSKSYLSAIKVQVLENLRNIQHAPSVQMQEVPPDFYIPDFDEDEQNPDERMDQHTQDKQIQRDDEYYDGDNDNDHNMDDS 155 1453
295 Histone deacetylase MTVAEDFHVNNRSKMVSQATPESRLTGGEDDNSLHNQVDELLCQELPERQVILEFEGTRPKPYFSDHNGGENSALGVRATEDDLNSDVEAEEKQKEMTLEDMYKNDGTLYDDDEDDSDWEPVKRQVELMRWFCTNCTMVNVEDVFLCDICGEHRDSGILRHGFYASPFMQDVGAPSVEAEVQESREDHARSSPPSSSTVVGFDEKMLLHSEVEMKSHPHPERADRLQAIAASLATAGIFPGRCRSLPVREITKEELQMVHSSEHVDAVEMTSHMFSSYFTPDTYANEHSARAARIAAGLCADLASTIISGRSKMGFALVRPPGHHAGIKHAMGFCLHNNAAVAALAAQGAGAKKVLIVDWDVHHGNGTQEIFDGNKSVLYISLHRHEGGNFYPGTGAAHEVGTMGAEGYCVNIPWSRRGVGDNDYVFAFHHIVLPIASAFAPDFTIISAGFDAARGDPLGCCDVTPAGYAQMTHMLSALSGGKLLVILEGGYNLRSISSSAVAVIKVLLGDSPISEIADAVPSKAGLRTVLEVLKIQRSYWPSLESIFWELQSQWGMFLVDNRRKQIRKRRRVLVPIWWKWGRKSVLYHLLNGHLHVKTKR 228 2033
296 Histone deacetylase MAAAPSSPPTNRVDVFWHDGMLSHDTGRGVFDTGSDPGFLDVLEKHPENPDRVRNMVSILKRGPISPFISWHTATPALISQLLSFHSPEYINELVEADKNGGKVLCAGTFLNPGSWDAALLAAGNTLSAMKYVLDGKGKIAYALVRPPGHHAQPSQADGYCFLNNAGLAVRLALDSGCKRVVVVDIDVHYGNGTAEGFYQSSDVLTISLHMNHGSWGPSHPQSGSVDELGEDEGYGYNMNIPLPNGTGDRGYEYAVTELVVPAVESFKPEMVVKVVGQDSSAFDPNGRQCLTMDGYRAIGRTIRGLADRHSGGRILIVQEGGYHVTYSAYCLHATVEGILDLPDPLLADPIAYYPEDEAFPVKVVDSIKRYLVDKVPFLKEH 110 1258
297 Histone deacetylase MVESSGGASLPSVGQDARKRRVSYFYEPTIGDYYYGQGHPMKPHRIRMAHNLIVHYYLHRRMEISRPFPAATTDIRRFHSEDYVTFISSVTPETVSDPAFSRQLKRFNVGEDCPVFDGIFGFCQASAGGSMGAAVKLNRGDSDIALNWAGGLHHAKKSEASGFCYVNDIVLGILELLKVHKRVLYVDIDVHHGDGVEEAFYTTDRVMTVSFHKFGDFFPGSGHIKDTGAGPGKNYALNVPLNDGIDDESFRGMFRPIIQKVMEVYQPDAVVLQCGADSLSGDRLGCFNLSVKGHADCLRFLRSFNVPLMVLGGGGYTMRNVARCWCYETAVAVGVEPENDLPYNEYYEYFGPDYTLHVEPCSMENLNAPKDLERIRNMLLEQLSRIPHAPSVPFQMTPPITQEPEEAEEDMDERPKPRIWNGEDYESDAEEDKSQHRSSNADALHDENVEMRDSVGENSGDKTREDRSPS 50 1462
298 MAT1 CDK-activating kinase assembly factor MVVPSSNPHNREMAIRRRMASTFNKREDDFPSLREYNDYLEEVEEMTFNLIEGVDVPTIEAKIAKYQEENAEQIMINRAKKAEEFAAALAASKGLPPQTDPDGALNSQAGLSVGTQGQYAPAIAGGQPRPTGMAPQPVPLGTGLDIHGYDDEEMIKLRAERGGRAGGWSIELSKKRALEEAFGSLNL 176 739
299 Peptidyl-prolyl isomerase MAAIISCHHYHSCCSSLIASKWVGARIPTSCFGRSSTQSNNAASVRQFVTRCSSSPSSRGQWQPHQNGEKGRSFSLRECAISIALAVGLVTGVPSLDMSTGNAYAASPALPDLSVLISGPPIKDPEALLRYALPINNKAIREVQKPLEDITDSLKVAGLRALDSVERNVRQASRVLKQGKNLIVSGLAESKKDHGVELLDKLEAGMDELQQIVEDGNRDAVAGKQRELLNYVGGVEEDMVDGFPYEVPEEYKNMPLLKGRAAVDMKVKVKDNPNLEECVFRIVLDGYNAPVTAGNFVDLVERHFYDGMEIQRADGFVVQTGDPEGPAESFIDPSTEKPRTIPLEIMVDGEKAPVYGATLEELGLYKAQTKLPFNAFGTMAMAKDEFEDNSASSQIFWLLKESELTPSNANILDGRYAVFGYVTENQDFLADLKVGDVIESVQVVSGLDNLANPSYKIAG 150 1529
300 Peptidyl-prolyl isomerase MAGEDFDIPPADEMNEDFDLPDDDDDAPVMKAGDEKEIGKQGLKKKLVKEGDAWETPDNGDEVEVHYTGTLLDGTQFDSSRDRGTPFKFTLGQGQVIKGWDQGIKTMKKGENAIFTIPPELAYGEAGSPPTIPPNATLQFDVELLSWTSVKDICKDGGIFKKILVEGEKNENPKDLDEVLVKYEFQLEDGTTIARSDGVEFTVKEGHFCPAVAKAVKTMKKGEKVLLTVKPQYGFGEKGKPASGDEGAVPPNATLQITLELVSWKTVSEVTDDKKVIKKILKEGEGYERPNEGAVVEVKLIGKLQDGTVFVKKGHDDCEELFKFKIDEEQVVDGLDKAVMNMKKGEVALLTVAPEYAFGSSESKQDLAVVPPSSTVYYEVELVSFVKDKESWDMNTEEKIEAAGKKKEEGNVIFKAGKYAKASKRYEKAVKYIEYDTSFSEDEKKQAKALKVACNLNDAACKLKLKDYNQAEKLCTKVLELDSRNVKALYRRAQAYIELSDLDLAEFDIKKALEIDPHNRDVKLEYKVLKEKVKEFNKKDAKFYGNMFAKMSKLEPVEKTAAKEPEPMSIDSKA 247 1971
301 Peptidyl-prolyl isomerase MSTVYVLEPPTKGKVVLNTTHGPLDVELWPKEAPKAVRNFVQLCLEGYYDNTIFHRIIKDFLVQGGDPTGSGTGGESIYGDAFSDEFHSRLRFKHRGLVACANAGSPHSNGSQFFITLDRCDWLDRKNTIFGKITGDSIYNLSGLAEVETDKSDRPLDPPPKIISVEVLWNPFEDIVPRAPVRSLVPTVPDVQNKEPKKKAVKKLNLLSFGEEAEEEEKALVVVKQKIKSSHDVLDDPRLLKEHIPSKQVDSYDSKTARDVQSVREALSSKKQELQKESGAEFSNSFREIADDEDDDDDDASFDARMRRQILQKRKELGDLPPKPKPKSRDGISARKERETSISRDKDDDDDDDQPRVEKLSLKKKGIGSEARGERMANADADLQLLNDAERGRQLQKQKKHRLRGREDEVLTKLETFKASVFGKPLASSAKVGDGDGDLSDWRSVKLKFAPEPGKDRMTRNEDPNDYVVVDPLLEKGKEKFNRMQAKEKRRGREWAGKSLT 136 1644
302 Peptidyl-prolyl isomerase MASAISMHSSGLLLLQGTNGKDVTEMGKAPASSRVANMQQRKYGATCCVARGLTSRSHYASSLAFKQFSKTPSIKYDRMVEIKAMATDLGLQAKVTNKCFFDVEIGGEPAGRIVIGLFGDDVPKTVENFRALCTGEKGFGYKGCSFHRTIKDFMIQGGDFTRGNGTGGKSIYGSTFEDENFALKHVGPGVLSMANAGPSTNGSQFFICTVKTPWLDNRHVVFGQVVDGMDVVQKLESQETSRSDVPRQPCRIVNCGELPLDG 48 836
303 Peptidyl-prolyl isomerase MAASFTALSNVGSLSSPRNGSEIRRFRPSCNVAASVRPPPLKAGLSASSSSSFSGSLRLIPLSSSPQRKSRPCSVRASAEAAAAQSKVTNKVYLDISIGNPVGKLVGRIVIGLYGDDVPQTAENFRALCTGEKGFGYKGSTVHRVIKDFMIQGGDFDKGNGTGGKSIYGRTFKDENFKLSHVGPGVVSMANAGPNTNGSQFFICTVKTPWLDQRHVVFGQVLEGMDIVRLIESQETDRGDRPRKRVVVSDCGELPVV 49 822
304 Peptidyl-prolyl isomerase MAEAIDLTGDGGVMKTIVRRAKPDAVSPSETLPLVDVRYEGVLAETGEVFDSTHEDNTLFSFFIGKGSVISAWDTALRTMKVGEVAKITCKPEYAYGSTGSPPDIPPDATLIFEVELVACKPCKGFSVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGRGKAK 185 751
305 Peptidyl-prolyl isomerase MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFHRVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFVCTAKTEWIDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS 103 621
306 Peptidyl-prolyl isomerase MPNPKVFFDMTIGGAAAGRVVMELYADTTPRTAENFRALCTGEKGVGRSKKPLHYKGSKFHRVIPSFMCQGGDFTAGNGTGGESIYGVKFADENFIKKHTGPGILSMANAGPGTNGSQFFICTTKTEWLDGKHVVFGKVVEGMEVVKAIEKVGSSSGRTSKPVVVADCGQLP 41 559
307 Peptidyl-prolyl isomerase MAEAIDLTGDGGVMKTIVRRAKPDAYSPSETLPLVDVRYEGVLAETGEVFDSTHEDNTLFSFEIGKGSVISAWDTALRTMKVGEVAKITCKPEYAYGSTGSPPDIPPDATLIFEVELVACKPCKGFSVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGKGKAK 127 693
308 Peptidyl-prolyl isomerase MATARSFFLCALLLLATLYLAQAKKSEDLKEVTHKVYFDVEIAGKPAGRIVMGLYGKAVPKTAENFRALCTGEKGTGKSGKPLHYKGSSFHRIIPSFMLQGGDFTLGDGRGGESIYGEKFADENFKLKHTGPGLLSMANAGPDTNGSQFFITTVTTSWLDGRHVVFGKVLSGMDVVYKVEAEGRQSGTPKSKVVIADSGELPL 28 639
309 Peptidyl-prolyl isomerase MMRREISVLLQPRFVLAFLALAVLLLVFAFPFSRQRGDQVEEEPEITHRVYLDVDIDGQHLGRIVIGLYGEVVPRTVENFRALCTGEKGKSANGKKLHYKGTPFHRIISGFMIQGGDVIYGDGKGYESIYGGTFADENFRIKHSHAGIISMVNSGPDSNGSQFFITTVKASWLDGEHVVFGRVIQGMDTVYAIEGGAGTYNGKPRKKVIIADSGEIPKSKWDEER 135 812
310 Peptidyl-prolyl isomerase MWATAEGGPPEVTLETSMGSFTVELYFKHAPRTSRNFIELSRRGYYDNVKFHRIIKDFIVQGGDPTGTGRGGESIYGKKFEDEIKPELKHTGAGILSMANAGPNTNGSQFFTTLAPCPSLDGKHTIPGRVCRGMEIIKRLGSVQTDNNDRPIHDVKILRTSVKD 119 613
311 Peptidyl-prolyl isomerase MSNPKVFFDILIGKMKAGRVVMELFADVTPKTAENFPALCTGEKGIGRSGKPLHYKGSTFHRIIPNFMCQGGDFTRGNGTGGESIYGMKFADENFKIKHTGLGVLSMANAGPDTNGSQFFICTEKTPWLDGKHVVFGKVIDGYNVVKEMESVGSDSGSTRETVAIEDCGQLSEN 38 562
312 Peptidyl-prolyl isomerase MDDDFEFPASSNVENDDDDGMDMDDMGGDVPEEEDPVASPAVLKVGEEREIGKAGFKKKLVKEGEGWETPSSGDEVEVHYTGTLLDGTKFDSSRDRGTPFKFKLGRGQVIKGWDEGIKTMRKGENAIFTIPPELAYGESGSPPTIPPNATLQFDVELLSWSSVKDICKDGGILKKVLVEGEKWDNPKDLDEVFVKYEASLEDGTLISKSDGVEFTVGDGYFCAALAKAVKTMKKGEKVLLTVMPQYAFGETGRPASGDEAAVPPDASLQIMLELVSWKTVSDVTKDKKVLKKTLKEGEGYERPNDGAAVQVRLCGKLQDGTVFVKKDDEEPFEFKIDEEQVIDGLDRAVKNMKKGEVALVTIQPEYAFGPTESQQDLAVVPANSTVYYEVELLSFVKEKESWEMNNQEKIEAAARRKEEGNAAFKAGKYVRASKRYEKAVRFIEYDSSFSDEEKQQAKTLKNTCNLNDAACKLKLKDFKEAEKLCTKVLEGDGKNVKALYRRAQAYIQLVDLDLAEQDIKKALEIDPNNRDVKLEYKILKEKVREYNKRDAQFYGNMFAKMNKLEHSRTAGMGAKHEAAPMTIDSKA 109 1872
313 Peptidyl-prolyl isomerase MAKPRCFMDISIGGELEGRIVGELYTDVAPKTAENFRALCTGEKGIGPHTGAPLHYKGVRFHRVIKGFMVQGGDISAGDGTGGESIYGLKFEDENFDLKHERKGMLSMANSGPNTNGSQFFITTTRTSHLDGKHVVFGRVVKGMGVVRSVEHVTTAAGDCPTVDVVIADCGEIPAGADDGIRNFFKDGDTYPDWPADLDESPAELSWWMDAYDSIKAFGNGSYKKQDYKMALRKYRKALRYLDICWEKEGIDEVESSSLRKTKSQIFTNSSACKLKLCDLKGALLDAEFAVRDGENNAKAYFRQGQAHMEINDIDAAAESFSKALELEPNDVGIKKELNAAKKKIFERREQEKRAYRKMFL 74 1159
314 Peptidyl-prolyl isomerase MTKRKNPLVFLDVSIDGDPVERIVIELFADTVPRTAENFRSLCTGEKGVGKTTGKPLHYKGSYPHRIIKGFMAQGGDFSNGNGTGGESIYGGKFADENFKLAHDGPGLLSMANGGPNTNGSQFFIIFKRQPHLDGKHVVFGKVMRGMEVVKKIEQVGSANGKPLQPVKIVDCGETSETGTQDAVVEEKSKSATLKAKKKRSARDSSSESRGKRRQRKSRKERTRKRRRYSSSDSYSSESSDSDSESYSSDTESESKSHSESSVSDSSSSDGRRRKRKSTKREKLRRQRGKDSRGEQKSARYDKKSRHKSADSSSDSESESSSRSRSRDDKKKSSRRESARSVSKLKDAEANSPENLESPRDREIKKVEDNSSHEEGEFSPKNDVQHNGHGTDAKFGKYDDQRPRSDGSKKSSGSMRDSPKRLANSVPQGSFSSSPAHKASEPSSSIRARNPSRSPAPDGNSKRIRKGRGFTERFSYARRYRTPSPEDVTYRPYHYGRRNFHDRRNDRYSNYRSYSERSPHRRYRSPPRGRSPPRYQRRRSRSRSVSRSPGGNKGRYRGRDQSRSRSRSRSRSPRRGSSPANKQLPLSERLKSRLGTRVDEHSPRRRRSSSRSHDSSRSRSPDEVPDKHEGKAAPVSPARSRSSSPSGRGLVSYGDASPDSGIN 54 2045
315 Peptidyl-prolyl isomerase MSVLLVTSLGDIVVDLHADRCPLTCKNFLKLCRIKYYNGCYFHTVQKDFTAQTGDPTGTGTGGDSVYKFLYGDQARFFMDEIHLDLKHSKTGTVAMASGGENLNASQFYFTLRDDLDYLDGKHTVFGEVAEGLETLTRINEAYVDEKGRPYKNIRIRHTYILDDPFDDPPQLAELIPDASPEGKPKDEVVDDVRLEDDWVPLDEQLGPAQLEEAIRAKEAHSRAVVLESIGDIPDAEIKPPDNVLFVCKLNPVTEDEDLHTIFSRFGTVVSADVIRDFKTGDSLCYAFIEFENKDSCEQAYFKMDNALIDDRRIKVDFSQSVAKLNSQFKRKDSQAAKGKGCFKCGAPDHMARECPGSSTRQPLSKYIIKEDNAQRGGDDSRYEMVFDEDAPESPSHGKKRRGRDDRDDRHKMSRQSVEETKFNDREGGHSVDKHRQSERSKHREDEMSRDSKASEAGRRRIDRDFPEEERDGEKYTESHRDRDGKRGDYRDYRKGRADVQTHGDRRGDENYRRKSAAYDDGHEGAGAARRKDSNDDHHAYRRGYGDSRKGTRDEDDDGRGRRDDPSYRRSSGHKDSSNGGREEQKYRSGETDGKSHPERSHRGDRRR 53 1879
蛋白质SEQID 目标 专利肽序列 专利ORF开始 专利ORF终止
316 肽基脯氨酰基异构酶 MRPFNGGSSIACLVLVIAAGALAESQGPHLGSARVVFQTNYGDIEFGFFPGVAPRTVDHIFKLVRLGCYNTNHFFRVDKGFVAQVADVANGRTAPMNDEQRTEAEKTIVGEFSNVKHVRGILSMGRYDDPDSAQSSFSILLGDAPHLDGKYAIFGRVTKGDETLKKLEQLPTRREGMFVMPTERITILSSYYYDTGAESCEEENSTLRRRLAASAVEVERQRMKCFP 7 690
317 肽基脯氨酰基异构酶 MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSFHRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA 83 601
318 肽基脯氨酰基异构酶 MRFTSITSAIALFAAAASALDKPLDIKVDKAVECSRKTKAGDKIQVHYRGTLEADGSEFDASYKRGQPLSFHVGKGQVIKGWDQGLLDMCPGEKRTLTIQPDWGYGSRGMGPIPANSVLIFETELVEIAGVAREEL 125 535
319 肽基脯氨酰基异构酶 MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFHRVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFVCTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS 55 573
320 肽基脯氨酰基异构酶 MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTHKVFFDVEIGGKPAGRIVMGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQFHRIIPKFMIQGGDFTLGDGRGGESIYGNKFSDENFKLKHTDAGRLSMTNAGPDTNGSQFFITTVTTSWLDGRHVVFGRVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL 147 842
321 肽基脯氨酰基异构酶 MAVTLHTNLGDIKCEIFCDEVPKAAEHNARGILSMANSGPNTNGSQFFIAYAKQPHLNGLYTIFGRVIHGFEVLDIMEKTQTGPGDRPLAEIRLNRVTIHANPLAG 167 487
322 肽基脯氨酰基异构酶 MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTHKVFFDVEIGGKPAGRIVMGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQFHRIIPKFMIQGGDFTLGDGRGGESIYGNKFSDENFKLKHTDAGRLSMANAGPDTNGSQFFITTVTTSWLDGRHVVFGKVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL 195 890
323 肽基脯氨酰基异构酶 MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFHRVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFVCTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS 68 586
324 视网膜母细胞瘤相关蛋白 MSPVAANAMEEAAEPEVPAPVTPSKDDADTDAAVSRFLGFCKSKLGLAEGNCVQSSTLLRKTAHVLRSSGTVIGTGTAEEAERYWFAFVLYTVRRVGERKAEDEQNGSDETEVPLSRILKASVLNLIDFFKEIPQFVIKAGAIVSGIYGANWDSRLEAREMQTNYVHLCILCKFYKRICGEFFILNDAKDDMKSADSSTSDPVIMYQPFGWLLFLALRIHALSRFKDLVSSTNALVSVLAILIIHLPTRFRKFSISDSSQLVKRSEKGVDLVGSLAYRYDTSEDEIKRTLEKANNVIAEILGITPPPASECKAENLENVDTDGLIYFGNLMEETSLSSILSTLEKIYEDATRNDSEFDERVFINDDDSLLVSGSLSGAAINLTGAKRKYDSFASPAKTITRPLSPSRSPASHINGIIGGTNLRITATPVATAMTTAKWLRTFVSPLPSKPSTDLQGFLASCDRDVTSDVIRRANIILEAIFPNSPIGERTVTGGLQNANLMDNMWAEQRRLEALKLYYRVLEAMCRAEAQILHSNNLTSLLTNERFHRCMLACSAELVLATHKTVTMLFPAVLERTGITAFDLSKVIESFVRHEETLPRELRRHLNTLEERLLENMVWERGSSMYNSLVVARPALAPEINRLGLLPEPMPSLDAIALLINFSSSGLPQSPVQKHEASPGQNGDIRSPKRISTEYRSVLVERNFTSPVKDRLLALSNIKSKLPPPPLQSAFASPTRPHPGGGGETCAETAIHIFFSKITKLAAVRINAMLERLQLSQQIKEGVYCLFQQILSQRTNLFFNRHIDQVILCCFYGVAKINQINLTFREILYNYRKQPQCKPQVFRNVFVDWSTRRNGKAGNEHVDIISFYNEIFIPSVKPLLVELGPTGATTRTNRTSEVGNKNDAQCPGSPKISSFPTLPDMSPKKVSASHNVYVSPLRSSKMDASISHSSKSYYACVGESTHAYQSPSKDLVAINSRLNGNRKVRGTLNFDDVDAGLVSDSMVANSLYLQNGSSMSSSTAKSSEKPES 182 3265
325 WD40重复蛋白 MRPILMKGHERPLTFLKYNREGDLLFSCAKDHTPTVWFADNGERLGTYRGHNGAVWCCDVSRDSMRLITGSADTTAKLWSVQNGTQLFTFNFDSPARSVDFSIGDKLAVITTDPFMELPSAIHVKRIARDPADQASESVLVLRGHQGRIARAVWGPLNKTIISAGEDAVIRIWDSETGKLLRESDKETGHKKAVTSLMKSVDGSHFVTGSQDKSAKLWDIRTLTLIKTYVTERPVNAVTMSPLLDHVVLGGGQDASAVTMTDHRAGKFEAKFFDKILQEEIGGVKGHFGPINALAFNPDGKSFSSGGEDGYVRLHHFDPDYFNIKI 165 1145
326 WD40重复蛋白 MDKKRTVVPLVCHGHSRPVVDLFYSPITPDGFFLISASKDSSPMLRNGETGDWIGTFEGHKGAVWSCCLDTNALRAASGSADFSAKLWDALSGDELHSFEHKHIVRSCAFSEDTHLLLTGGVEKILRLFDLNRPDAPPREVDNSPGSIRTVAWLHSDQTILSSCTDIGGVRIWDVRSGKIVQTLETKSPVTSSEVSQDGRYITTADGSTVKFWDANHFGLVKSYNMPCNIESASLEPKLGNKFIAGGEDMNVHIFDFHTGEEIGCNKGHHGPVHCVRFSPGGESYASGSEDGTIRIWQTGPANNVEGDANPSNGPVTGKAKVGADEVTRKVEDLQIGKEGKDWREG 529 1569
327 WD40重复蛋白 MAEGLILKGTMRAHTDMVTAIAIPIDNSDMVVTSSRDRSIILWHLTKEEKVYGVPRRRLTGHSHFVQDVVLSSDGQFALSGSWDGELRLNDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSASRDRTIKLWNTLGECKYTIQEGEAHTDWVSCVRFSPNTLQPTIVSASWDRTIKVWNLTNCKLRNTLAGHNGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKRLYNLEAGAIIHSLCFSPNRYWLCAATENSIKINDLESKSIVEDLRVDLKNEADKTDGTTTAASNKKVIYCTSLNWSADGSTLFSGYNDGVIRVWGTGRY 156 1136
328 WD40重复蛋白 MAEGLHLKGTMKAHTDMVTAIAVPIDNADMIVTSSRDKSIILWHLTKEDKVYGVPRRRLTGHSHFVQDVVLSSDGQFALSGSWDGELRLWDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSASRDRTIKLWNTLGECKYTIQEGEAHNDWVSCVRFSPNTLQPTIVSASWDRTVKVWNLTNCKLRNTLQGHSGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKKLYSLEAGAIIHSLCFSPNRYWLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDGTTGAMSSNKKVIYCTSLNWSADGSTLFSGYNDGVIRVWGIGRY 90 1073
329 WD40重复蛋白 MAEGLHLKGTMKAHTDMVTAIAVPIDNADMIVTSSRDKSIILWHLTKEDKVYGVPRRRLTGHSHFVQDVVLSSDGQFALSGSWDGELRLWDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSASRDRTIKLWNTLGECKYTIQEGEAHNDWVSCVRFSPNTLQPTIVSASWDRTVKVWNLTNCKLRNTLQGHSGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKKLYSLEAGAIIHSLCFSPNRYWLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDGTTGAMSSNKKVIYCTSLNWSADGSTLFSGYNDGVIRVNGIGRY 66 1049
330 WD40重复蛋白 MSGVPAPPFATTTPENGTMSSNSPAFHRDSDDDDDQGEVFLDDSDIIHEVAVDDEDLPDADDEADEAEEADDSLHIFTGHNGEVYSLACSPTDATLVATGAGDDKGFLWRIGHGDWAVELQGHKDSISSLAFSLDGQLLASGSLDGVIQIWDVPSGNLKGTLDGPGGGIEWIRWHPKGHIILAGSEDSTVWMWNADKMAYLNMFSGHGNSVTCGDFTPDGKTICTGSDDATLRIWNPKSGENIHVVKGHPYHAEGLTSMAISSDSGLAITGAKDGSVRIVNISSGRVVSSLDAHADSVEFVGLALSSPWAATGSLDQKLIIWDLQHSSPRATCDHEDGVTCLSWVGASRFLASGCVDGKVRVWDSLSGDCVRTFHGHSDAIQSLSVSANEEFLVSVSIDGTARVFEIAEFH 277 1512
331 WD40重复蛋白 NGTSQHQLSSCLQLLPRRRGNKNLIFRRTMASGGAAAVAPPPGYKPYRHLKTLTGHVAAVSCVKFSNDGTLLASASLDKTLIIWSSAALSLLHRLVGHSEGVSDLAWSSDSHYICSASDDRTLRIWSSRSPFDCLKTLRGHTDFVFCVNFNPQSSLIVSGSFDETIRIWEVKTGRCLNVIRAHSMPVTSVHFNRDGSLIVSGSHDGSCKIWDTKNGACLKTLIDDTVPAVSFAKFSPNGKFILVATLNDTLKLWNYATGKFLKIYTGHKNSVYCLTSTFSVTNGKYIVSGSEDRCICIWDLQGKNLIQKLEGHSDTVISVTCHPSENKIASAGLDSDRTVRIWLQDA 33 1076
332 WD40重复蛋白 MPSQKIETGHQDIVHDVAMDYYGKRVATASSDTTIKIIGVSNSSGSQHLASLSGHKGPVWQVAWAHPKFGSILASCSYDGQVILWKEGNQNDWAQAHVFNDHKSSVNSIAWAPHELGLCLACGSSDGNISVFTARPDGGWDTTRIEQAHPVGVTSVSWAPSMAPGALVGSGLLDPVQKLASGGCDNTVKVWKLYNGTWKMDCFPALQMHSDWVRDVAWAPNLGLPKSTIASASQDGTVVIWTVAKEGEQWQGKVLKDFKTPVWRVSWSLTGNLLAVADGNNNVTLWNEAVDGEWQQVTTVEP 65 973
333 WD40重复蛋白 MKIAGLKSVENAHDESVWAAAWVPATESRPALLLTGSLDETVKLWRPDEIALERTNAGHFLGVVSVAAHPSGVIAASASIDSFVRVFDVDTNATIATLEAPPSEVWQMQFDPKGTTLAVAGGGSASIKLWDTATWELNATLSIPRPEQPKPSEKGNKKFVLSVAWSPDGRRLACGSMDGTISIFDVARAKFLHHLEGHFMPVRSLVFSPVEPRLLFSASDDAHVHMYDSEGKSLVGSMSGHASWVLSVDVSPDGAALATGSSDRTVRLWDLSMRAAVQTMSNHSDQVWGVAFRPMAGAGVRAGGRLASVSDDKSISLYDYS 82 1047
334 WD40重复蛋白 MEIDLGNLAFDVDFHPSEQLVASGLITGDLLLYRYGDGSSPEKLLEVRAHGESCRAVRFINDGKAILTGSPDCSILATDVETGSVVARVENAHEAAVNRLVNLTESTIATGDDNGCIKVWDTRQRSCCNTFSAHEDFISDMTFASDSMKLVVTSGDGTLSVCNLRSNKVQTRSEFSEDELLSVVIMKNGRKVVCGTQSGTLLLYSWGFFKDCSDRFVDLSPSSVDALLKLDEDRIIAGTENGLISLIGILPNRIIQPIAEHSDHPIERLAFSHDKKFLGSISHDQTLKLWDLNDILGSEDSPSSQAAIDDSDSDEMDVDANPPDSSKGNKKKHSGKGNDVGNANNFFADLGD 43 1101
335 WD40重复蛋白 MSQQPSVILATASYDHTIRFWEAKSGRCYRTIQYPDSQVNRLEITPHKRYLAVAGNPSIRLFDVNSNTPQPVMSFDSHTNNVMAVGFQYDGNWMYSGSEDGTVRINDLRARGCQREYESRGAVNTVVLHPNQTELISGDQNGNIRVWDLTANSCSCELVPEVDTAVRSLTVMNDGSLVVAANNNGTCYVWRLLRGSQTMTNFEPLHKLQAHNGYILKCLLSPEFCEPHRYLATASSDHTVKIWNVEGFTLEKTLIGHQRWVWDCVFSVDGAYLITASSDTTARLWSMSTGQDIRVYQGHHKATTCCALHDGAEGSPG 142 1095
336 WD40重复蛋白 MEDAMDMEVEVEVEAEEHSPSSSNPSGSSFRRFGLKNSIQTNFGSDYVFEITPKFDWSLMGVSLSSNAVKLYSPTTGQYCGECRGHSDTVNGISFSGPSSPHVLHSCSSDGTIRAWDTRSFKEVSCISAGPSQEIFSFSFGGSSDSLLSAGCKSQILFWDWRNKKQVACLEDSHVDDVTQVCFVPHHQNKLISASVDGLICIFDTAGDINDDEHMESVINVGTSIGKVGIFGQTFEKLWCLTHIETLSVWDWKEGTNEANFEDARKLASDSWSLDHIDYFVDCHSAEEGEGLWVIGGTNAGTLGYFPVKYKGGAAIGSPEAVLGGGHSDVVRSVLPMSGMAGTTSKTRGIFGWTGGEDGRLCCWLSDDSSATSRSWMSSNLVLKSSRSHHKKNRHQPY 61 1257
337 WD40重复蛋白 MSQHQEYPMEYAADDYDVGEVEDDMYFHERVMGDSDTDEDEEYDHLDNKITDTSAADARRGKDIQGIPWERLSVTREKYRRTRIEQYKNYENVPQSGESSEKDCKPTRKGGNYYEFWRNTRSVKSTILHFQLRNLVWSTTKHDVYLMSHFSIIHWSSLTCKKTEVLDVYGHVAPREKHPGSLLECFTQTQVSTLAVRDKLLIAGGFQGELICKNLDRPGVSYCCRTTYDDNAITNAVEIYDYPSGAVHFMASNNDCGVRDFDMEKFELSRHFTFPWPVNHTSLSPDGKLLVIVGDNPEGIVVDSQRGKTIRPLQGHLDFSFASAWHPDGHIFATGNQDKTCRIWDIRNLSKSVAVLKGNLGAIRSIRFTSDGRFMAMAEPADFVHVYDVKSGYEKEQEIDFFGEISGVSFSPDTESLFVGVWDRTYGSLLQYNRCRNYSYLDSM 193 1527
338 WD40重复蛋白 MGASSDPNPDVSDEHQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLDHPAAAAAVPNRVEIVQLDDSTGEIRADPNLSFDHPYPATKAAFVPDKDCQRADLLATSSDFLRIWRIADDSSRVDLRSFLNGNKNSEFCRPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEVYDIAWGGVSVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGNNKQDPRYMATIIMDSAKVVVLDIRYPTMPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMAQPVEGGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSLKLQ 109 1155
339 WD40重复蛋白 MRGGGGGGDATGWDEDAYRESVLKEREVQTRTVFRAAFAPSPSPSPSPDAVVVASSDGSVASYSISACLSDHRLQSLRFADAKSQNVLEAEPACFLQGHDGPAYDVKFYGEGEDSLLLSCGDDGRIRGWMWRDITSSEAHDHSQGNSAKPVLDLVNPQSRGPWGALSPIPENNALAVDVKRGSIYAAAGDSCAYCWDVECGKIKTVFKGHSDYLHCIAARNSSSQIITGSEDGTARIWDCRSGKCVQVIDPDKDHKKGFFASVSCLALDASESWLVCGRGRDLSVWSISASDCIAKISTNAPAQDVLFDDNQILLVGAEPLISRLDMNGAVLSQIHCAPQSVFSVSLHQSGVTAVGGYGGLVDVISQFGSHLCTFRCKCI 71 1213
340 WD40重复蛋白 MEAPIIDPLQGDFPEVIEEYLEHGIMKCIAFNRRGTLLAAGCTDGSCIIWDFETRGVAKELRDKECTAAITSVCWSKYGHRILVSASDKSLILWDVLSGEKIAHTTLQHTVLQACLHPGSSTPSICLACPFSSAPMIVDLNTGSTTALPVLTADVSNGATPLSRNKTSDTSVTYSPCNACFNKHGDLVYAGTSKGEILIIDHKNVRVCAIVLVSGGAVIKNVVFSRNGQYMLTNSNDRLIRIYKNLLPPKDGLKMLDELNESFNESDDVEKLKAIGSKCLELLHEFQDSITRVQWKAPCFSGDGEWVIGGAASRGEHKIYIWDRAGHLVKILEGPKEALMDLAWHPVHPIIISVSLTGLVYIWAKDYTENWSAFAPDFKELEENEEYVEREDEFDLVPETEKVKGLDVHEDDEVDVLTVERDSVFSDSDMSQEELCFLPAVPCLDIPEQQDKCVGSCSKLPDGNHSGSPLSVEAGQNGNASNHNSSPLEPMENSTAQDTDGVRLKRKRKPSEKGLELQAEKVKKPVKPLKSSGRLSKTNKPVIDPDSSNGVYGDDGSD 109 1785
341 WD40重复蛋白 MRGVSWPEDGNNPSTSSSSQRNQQQAHAPRAVSGHAASHPSASNIFKLLVQREVSPRSKHSSKKLWREASKCQPYPFQQSCEAVRDVRQGLISWVESASLRHLSAKYCPLVPPPRSTIAAAFSPDGKILASTHGDHTVKLIDSQTGSCLKVLRGHRRTPWVVRFHPLYPEILASGSLDHEVRLWDANTAECIGSRNFYRPIASIAFHARGELLAVASGHKLYIWHYNRRGETSSPTIVLRTQRSLRAVHFHPHAAPFLLTAEVNDLDSADSAMTLATSPGYLHYPPPTVYEADAHSHERSRLADELPLMPLPLLMWPSFTRDDGRVPLQRIDGDVGLNGQQRVDSSSSVRLWTYSTPSGQYELLLSPVESGNSPSMPEETGNNAFSSAVEAEVSQSAMDTVEDMEVQPEERNTQFFSFSDPRFWELPLLHGWLVGQTQAGPRSVRQSSPGDIETQSAFGEVASVSPITSGVMPVSMDPSRFGGRSGSRYRSPGSRGVHVTGPNNDGPRDENDPQSVVSKLRSELAASLAAAASTELPCTVKLRIWPHDVKDPCAQLDLESCRLTIPHAVLCSEMGAHFSPCGRPLAACVACVLPHLESDPGLHGQVNQDVTGVATSPTRHPISAHQIMYELRIYSLEEATFGIVLASRPVRAAHCLTSIQFSPTSEHLLLAYGRRHSSLLKSIVIDGENTVPIYTILEVYRVSDMELVRVLPSAEDEVNVACFHPSVGGGLIYGTKEGKLRILHYDSSHGLNLKSSGFLDENVPEVQTYALEC 364 2685
342 WD40重复蛋白 MDSAVAIAALSLVVGAAIALLFFGNYFRKRRSEVVAMAEADLQPHPKNPSRPPPQPAAKKVHAKSHAHGADKDKNKRHHPLDLNTLKGHGDSVTGLCFASDGRSLATACADGVVRVPKLDDASNKSFKFLRINLPAGGHPTAVAFGDGVSSVIVASQHLSGCSLYMYGEEKPTNLDSNKQQTKLPMPEIKWEHHKVHEQKAILTLSGAAANYDSGDGSTIIASCSEGTDIIIWHAKTGKILGNVDTNQLKNTMSAISPNGRFIAAAAFTADVKVWEIVYSKDGSVKGVTKVMQLKGHKSAVTWLCFTPNSEQIVTASKDGSIRIWNINVRYHLDEDTKTLKVFPIPLQDSSGTTLHYERLSLSPDGKILAATHGSMLQWLCIETGKVLDTAEKAHDGDITCMSWAPQSIPTGDKKVNVLATASGDKKVKLWAAPPLPS 96 1412
343 WD40重复蛋白 MEVEPKKASKTFPVKPKLKPKPRTPSGKTPESKYWSSFKTTHPLDNLSFSVPSLAFSPSPPHLLAAAHSATVSLFSPHRTTISSFSDVVSSLSFRSDGQLLAASDLSGLIQVFDVRSRTPLRRLRSHARPVRFVRYPVLDKLHLVSGGDDALVKYWDVAGESVVSELRGHKDYVRCGDCSPADANCFVTGSYDHVVKLWDVRVRDGNRAATEVNHGSPVQDVIFLPSGSLVATAGGNSVKIWDLIGGGRMVYSMESHNKTVTSICVGTMGAQQSGEEGVQLRILSVGLDGYMKVFDYSRMKVTHSMRFPAPLLSIGFSPDSNVRAIGTSNGILYVGKRKAKENAEGGANGILGLGSVEEPRRRVLKPSFYRYFHRGQSEKPSEGDYLVMRPKKVKLAEHDKLLKKFQHKNALISVLGGNDPEKVVAVMEELVARRALLKCVLNLDADELGLILTFLHKNSTVPRYSSLLLGLAKKVIDLRLEDIRASDALKGHIRNLKRSVDEEIRIQEGLQEIQGMVSPLLRIAGRR 116 1702
344 WD40重复蛋白 MQGGSSGVGYGLKYQARCISDVKADTDHTSFLTGTLSLKEENEVHLLRLSSGGTELICEGLFSHPSEIWDLSSCPFDQRIFSTVFSTGESYGAAVNQIPELYGQLNSPQLEKIASLDAHSRKISCVLWWPSGRHDKLVSIDEENIFLWGLDCSKKSAQVQSQESAGMLHNLSGGAWDPHDVNTVAATCESSIQFWDLRTMKKANSLESVHARDLDYDMRKKHLLVTSEDESGVRVWDLRMPKAPIQEFPGHTHWTWAVRCNPDYEGLILSAGTDSAVNLWWSSTASSDELISERLIDSPTRKLDPLLHSYNDYEDSVYGLAWSSREPWIFASLSYDGRVVVESVKPFLSRK 46 1101
345 WD40重复蛋白 MAEEEGSAELEQQLEEEFAVWKKNTPILYDLLISHALENPSLTVHWAPLLPQPSSSAAAAAGDPSLAAHRLVLGTHTSDGAPNFLILADALLPSSESDHCGDDAVLPKVEISQKIRVDGEVNRARFMPQNHNIVGAKTNGCEVYVFDCSKQAAKQHDGGFDPDLRLTGHDGEGYGLSWSPLKENYLLSASHDKKICLWDISAAAQDKVLGAMHVFEAHEGAVGDASWHSKNDNLFGSAGDDCQLMIWDLRTNKAQQCVKAHEKEVNSVSFNSYNDWILATASSDTTVGLFDMRKLTTPLHVFSSHEGEVLQVEWDPNHEAVLASSSEDRRVMVWDLNRIGDEQQEGDASDGPAELLFSHGGHKAKISDFSWNKNEPWVISSVAEDNSVQVWQMAESICGDDDDMQAMEGYI 23 1258
346 WD40重复蛋白 MGNYGEEDEDQYFDALEETASVSDRGSNSSDCCSSGSGLDENVLDSLGFEFWTKFPESVRARRNRFLMLTGLGIEANSVDKEDAFPPSCNEIEVYTCKVTRDDGAVQRSLDSYNCISLLQSSTSIRSNQEVESLRGDSLLSSFRGRSKESDDLTELCGMGCPESKRNAVSEFGSVSQGSIEELRRIVASSPLVHPLLHRKLEYERELIETKQKMGAGWLRKFGSATCISGRQGDTWSDPDDLEITAGMKMRRVRAHSSKKKYKELSSLYAAQEFLAHEGSISTMKFSMDGQYLASAGEDTVVRVWKVTEEDRSERVNVTVDPSCLYFALNESTQLASLNTNKEHIGKAKTFQRSSDSSCVILPLKVFQITEKPWHEFKGHNGEVLDLSWSSKGYLLSSSTDKTVRLWRVGCDRCQRVYSHNDYVTCISFNPVNENFFISGSIDGKVRIWNVFGGQVVAYIDCREIVSAVCYRSDGKGAIVGTMTGNCLFYSIKDNHLQMDAQVYLHGKKKSPGKRITGFQFPPNDPGKLMITSADSVIRVLSGLDVVCKLKGPRNSGGPMIATFTSDGKHVISASEDSNVYIWNYAGQDKTSSRVKKIWSCESFWSSNASVALPWCGIRTVPEALAPPSRSEERRASCAENGENHHMLEEYFQKMPPYSPDCFSLSRGFFLELLPKGSATWPEEKLSDTSPPTVSSQAISKLEYKFLKSACHSVLSSAHMWGLVIVTAGWDGRIRTYHNYGLPVRS 404 2644
347 WD40重复蛋白   MDIDFKEYRLRCELRGHEDDVRGVCVCGDGSIGTSSRDRTVRLWAPSAGERRKYEVARVLLGHKSFVGPLAWVPPSEELPEGGIVSGGMDTLVMAWDLRNGEAQTLKGHQLQVTGIVLDGGDIVSASVDCTLIRWKNGQLTEHWEAHKAPIQAVIRLPSGELVTGSSDTTLKLWRGKTCTQTFVGHTDTVRGLAVMPDLGILSASHDGSIRLWAVSGECLMEMVDHTSIVYSVDSHASGLIVSGSEDRFAKIWKDGVCFQSIEHPGCVWDVKFLEDGDIVTACSDGTIRIWTNQEDRMANSTELELFDLELSSYKRSRKRVGGLKLEELPGLEALQVPGTSDGQTKVIREGDNGVAYAWNSTELKWDKIGEVVDGPEDSMNRPALDGVQYDYVFDVDIGDGEPTRKLPYNRSDNPYDTADKWLLKENLPLSYRQQIVEFILANSGQRDFNLDPSFRDPYTGSSAYVPGAPSQLAAKQARPTFKHIPKKGMLVFDAAQFDGILKKINEFNNTLLSNQEKKNLSLTDIEISRLGAVVKILKDTSHYHSSKFADADFDLMLKLLESWPYEMMFPVIDIFRMVILHPDGADGLLRHQEDKKDVLMESIKRATGNPSVPANFLTSIRAVTNLFKNSAYYSWLQKHRSEMLDAFSSCSSSSNKNLQLSYATLLLNYAVLLIEKKDEEGQSQVLSAALELAENESLEVDARYRALVAIGSLMLDGLVKRIALDFDVEHIAKAARTSKEAKIAEVGADIELLIKQS 107 2383
348 WD40重复蛋白   MEFTEAYKQSGPCCFSPNARFIAVAVDYRLVIRDTLSLKVVQLFSCLDKISYIEWALDSEYILCGLYKRPMIQAWSLIQPEWTCKIDEGPAGIAYARWSPDSRHILTTSDFQLRLTVWSLVNTACVHVQWPKHASKGVSFTRDGKFAAICTRHDCKDYINLLSCHNWEIMGVFAVDTLDLADIQWSPDDSAIVIWDSPLEYKVLVYSPDGRCLFKYQAYESGLGVKSVSWSPCGQFLAVGSYDQMLRVLSHLTWKTFAEFTHLSNVRAPCCAAIFKEVDEPLQIDMSELSLSDDYMQGNSGDAPEGHYRVRYDVTEVPITLPCQKPPADRPNPKQGIGLMSNSNDSQYICTRNDSMPTILWIWDMRHLELAAILVQKDPIRAAVWDPTGTRLVLCTGSSHLYMWTPSGAYCVSVPLSQFNITDLKWNSDGSCLLLKDKESFCCAAAPLPPDESSDYSSDD 243 1625
349 WD40重复蛋白   MATIAALDDDMVRSMSIGAVFSDFVGKLNSLDFHRKDDILVTAGEDDSVRLYDIANARLLKTTFHKKHGTDRVCFTHHPNSLICSSTKNLDTGESLRYISMYDNRSLRYFKGHKQRVVSLCMSPINDSFMSGSLDHSVRMWDLRVNACQGILRLRGRPTVAYDQQGLVFAVAMEGGAIKLFDSRSYDKGPFDAFLVGGDTSEVCDIKFSNDGKSVLLSTTNNNIYVLDAYAGDKQCGFNLEPSPSTPIEASFSPDGQYVVSGSGDGTLHAWNISRRNEVACWNSHIGVASCLKWAPRRAMFVAASTVLTFWIPNSEPELASAKGEAGVPPEQV 126 1127
350 WD40重复蛋白   MSVAELKERHRAATETVNSLRERLKQKRVQLLDTDVAGYARTQGKTPVTFGATDLVCCRTLQGHTGKVYSLDWTPERNRIVSVSQDGRFIVWNALTSQKTHAIRLPCAWVMTCAFAPNGQSVACGGLDSVCSIFNLNSPVDRDGNLPVSRMLSGHKGYVSSCQYVPDGDAHLITGSGDQTCVLWDITTGLRTSVFGGEFQSGHTADVLSVSINGSSPRIFVSGSCDSTARMWDTRVASRAVHTYHGHEGDVNAVKFFPDGNRFGTGSDDGTCRLFDIRTGHELQVYYQQRGIDEIPHVTSIAFSISGRLLIAGYSNGDCFVWDTLLAQVVLNLGSLQNSHEGRISCLGVSADGSALCTGSWDTNLKIWAFGGIRRVT 257 1390
351 WD40重复蛋白 MKKRPRGASLDQAVVDIRRREVGGDSGLSFARRLAASEGLVLRLDIYNKLKGMRGCVNTVGFNLDGDIVISGSDDRHVKLWDWQTGKVELSFDSGHLSNVFQAKIMPYTDDRSIVTCAADGQARHAQILEGGQVQTMLLAKNRGRAHKLAIDPGSPHIVYTCGEDGLVQRLDLRSNTARELFTCREVYGTHVEVVHLNAIAIDPRNPNLFVIGGSDEYARVYDIRNYKWNGSHNFGRSANYFCPSHLIGEAHVGITGLAFSGQSELLVSYNDESIYLFTQENGLGPDPLSASTKSVDSNSSEVPSPTAVNVDDNVTPQVYKGHRNCETVKGVGTTGPKCEYVVSGSDCGRIFIWKKKGGQLIRVMAADKHVVNCIEPHPHIPALASSGIENDIKIWTPKAIERATLPMNVEQLKPKARGMDNRISSPRQLLLQLYSLERWPEHGGETSSGLAAGQEELTELFFALSANGNGSPDGGGDPSGPLL 178 1632
352 WD40重复蛋白 MSKRGIKLQEFVAHSSNVNCLSIGKKACRLFLTGGDDCKVNLWAIGKPNSLMSLCGHTNAVESVAFDSAEVLVLAGASSGVIKLWDVEEAKNVRGLTGHRSNCTAMEFHPFGEFFASGSTDTNLKIWDIRKKGCIHTYKGHTRGISTIRFSPDGRWVVSGGNDNVVKVWDLTAGKLLHDFKFHENHIRSIDFHPLEFLLATGSADRTVKFWDLETFELIGSSRPEAAGVRAIAFNPDGRTLPCGLEDSLKVYSWEPVICHDGVDMGWSTLADLCIHDGKLLGCSYYQSSVGVWVADASLIEPYGTNVKPQQKDSGDDEIEHQESRPSAKVGTTIRSTSIMRCASPDYETKDIKNIYVDTASGNPVSSQRVGTTNFAKVTQPLDFNDTPNLTLRRQGLVTETPDGLSGHVPSKSITQPKVVSRDSPDGKDSSRRESITFSRTKPGMLLRPAHSRRPSSTKYDVDRLSACAEIGVLSSAKSGSESLVDSFLNIKVAPEDGARNGCEDNHSSVKNVSVESEKVLPLQTPKTEKCDQTVGFKEEINSVKFVNGVAVVPGRTRTLVEKFEKREKLNSTEDQTINNTPEPTLDKTPPPSLAENEEKSDRLNIVERKATRMSSHMVTAEDRTPVTLVGSPEDQSTVMAPQRELPADESSKTPPLPVEDLEIHHGSNVSEDKATILSSQTVSEEDSKRSTLIRNFRRRDHFKSTEGRSPVMATQRKLPTDESGKTSSLPMEDLEIKGGLNVSEDKATSFSSRAPPREDRAHSALVRNVAKRDKFKSTNDTITVMVHQRGLSTDEASTVSVERVERRQLSNNVENPLNNLPPHSVPPTTTRGEPQYVGSESDSVNHEDVTELLLGHHEVFLSTLRSRLTKLQVV 290 2917
353 WD40重复蛋白 MSTFLTGTALSNPNPNKSYEVVQPPNDSVSSLSFNPKANFLVATSWDNQVRCWEIVRSGTSLGTTPKASISHDQPVLCSTWKDDGTTVFSGGCDKQVKMWPLSGGQPMTVAMHDAPIKEISWIPEMNLLVTGSWDKTLRYWDTRQANPVHIQQLPERCYALTVRHPLMVVGTADRNLIIYNLQSPQTEFKRISSPLKYQTRCLAAFPDQQGFLVGSIEGRVGVHHLDDSQQSKNFTFKCHREGSEIYSVNSLNFHPVHHTFATAGSDGAFNFWDXDSKQRLKAMSRCSQPIPCSTFNNDGSIFAYSACYDWSKGAENHNPATAKTYIFLHLPQESEVKGKPRLGTTGRK 148 1197
354 WD40重复蛋白 NEVEAQQRDVNNVMCQLVDPEGTTLGPPMYLPQDVGPQQLQQMVNKLLSNEDKLPYTFYISDQELVVPLESYLQKNKVSVEKVLSIVYQPQAIFRIRPVNRCSATIAGHSEAVLSVAFSPDGKQLASGSGDTTVRLWDLSTQTPMFTCKGHKNWVLSIAWSPDGKHLVSGSKAGEIQCWDPLTGQPSGNPLVGHKKNITGISWEPVHLSSPCRRFVSSSKDGDARIWDVTLRRCVICLSGHTLAVTCVKWGGDGVIYTGSQDCTIKVWETSQGKLIRELKGHGHWVNSLALSTEYVLRTGAFDNTGKQYSSAEEMKQVALERYKKNKGNAPERLVSGSDDFTMFLWEPSVSKHPKTRMTGHQQLVNHVYFSPDGQWVASASFDKSVKLWNGITGKFVAAFRGHVGPVYQISWSADSRLLLSGSKDSTLKIWDIRTKKLKRDLPGHADEVFAVDWSPDGEKVVSGGKDKVLKLWMG 140 1567
355 WD40重复蛋白   MDAGSAHSSSNMKTQSRSPLQEQFLQRRNSRENLDRFIPNRSAMDFDYAHYMLTEGRKGENPAVSSPSREAYRKQLAETLNNDNRTRILAFKNKPPTPVELIPHELTSAQPAKPTKTRRYIPQTSERTLDAPDLLDDYYLNLLDNGSSNVLSLALGNTVYLMNASDGSTSELVTIDDETGPVTSVSNAPDGRHIAVGLNNSDVQLWDSADNRLLRTLRGGHRSRVGSLNWNNHILTTGGMDGLIVHNDVRVRSHIVDTYRGHTQEVCGLKWSASGQQLASGGNDNILHIWDRSTASSNSPTQWLHRLEEHTAAVKALAWCPFQGNLLASGGGGGDRTIKFWHTHTGAGLNSVDTGSQVCALLWNKNERELLSSHGFTQNQLTLWKYPSMVKIAELTGHTSRVLFMAQSPDGCTVASAAGDETLRFWNVFGVPEVAKPAPKANPEPFAHLNRIR 376 1737
356 WD40重复蛋白   MEEAIPFKNLPSREYQGHKKKVHSVAWNCTGTKLASGSVDDTARVWHIEPHGHGKVKDIELKGHTDSVDQLCWDPKHADLLATASGDKTVRLWDARSGKCSQQAELSGENINITYKPDGTHVAVGNRDDELTILDVRKFKPIHKRKFNYEVNEIAWNMSGEMFFLTTGNGTVEVLAYPSLRPVDTLMAHTAGCYCIAIDPVGRYFAVGSADSLVSLWDISEMLCVRTFTKLEWPVRTISFNHTGDYVASASEDLFIDISNVQTGRTVHQIPCRAAMWSVEWNPKYNLLAYAGDDKNKYQADEGVFRIFGFESA 69 1010
357 WD40重复蛋白   MGKDEEEMRGEIEERLINEEYKVWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKLVLGTHTSENEPNYLMLAQVDLPLEDAENDARHYDDDRADVGGFGCANGKVQIIQQINHDGEVNRARYMPQNSFIIATKTVSAEVYVFDYSKHPSKPPLDGACSPDLRLRGHSTEGYGLSWSKFKQGHLLSGSDDAQICLWDINATFKNKSLDAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLIWDLRTPSVTKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISTALHTFDAHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNTCEDWVVASVAEDHILQIWQMAENIYHDEDDVPGSESHKGS 149 1423
358 WD40重复蛋白   MMRGFSCTEDGDAPSTSSTSPPPPPPPPHRQQMQAPRASSSSSGQPTSRRSTGNVFKLLARREVSPRSKHSLKKFWGEASECQLCPFQQSYEAVRDVRRSLISWVEAFSLQRLSAKYCPLMPPPRSTIAAAFSPDGKILASTHGDHTVKLIDSQTGSCLKVLRGHRRTPWVVRFHPLYPEILASGSLDHEVHLNDANTAECIGSRNFYRPIASIAFHAQGDLLAVASGHKLYIWHYNRSGETSSPTIVLRTPRSLRRAVHFHPHAAPFLTAEVNDLDLTDSAMILATSPGYLHYPPPTIYLADAHSNERSRLEDELPLMPSPLLMWPSFTRDDGRATLPHIGGDVGLSGQQRVDSLSSGQYEFHPSPIEPSSSTSMREKNGTDPPSSVRESEVTQSAMNIVDNTEVQPEERSTYSFSFSDPRFWELPSVYGHLVGQTQAAPRTAPSPGALETASALGEVASVSPVRSEFMPGGMDQPRLGGRSGSGCRSSGSRMMRTAGLNDHPHDENYPQSVVSKLRSELEASLAAAASTELPCTVKLRVWPYDMKDPCALFRSESCRLTIPHAVLCSEMGAHFSPCGRFFAACVACVLPQLEADPVLHGQVDPDVTGVATSPTRHPVSAYQIMYELAIYSLEEATFGMVLASRSTRAAHCLTSIQFSPTSEHLLLAYGRRHNSLLKSIVIDGENTVPIYSILEVYRVSDMELVRVLPSAEDFVNVACFNPSVGGGLVYGTKEGKLRILQIDSSGGLNPKSTGFLDENMAEVPTYALEC 365 2677
359 WD40重复蛋白   MGEGDLPRTKAGVLRGHEGAVLAARFNGDGNYCLSCGKDRTIRLWNPHRGIHIKTYKSHGREVRDVHCTSDNSKLISCGGDRQIFYWDVSTGRVIRRFRGHDSEVNAVKTNDYASVVVSAGYDRSVRAWDCRSHSTEPTQIINTFQDSVHSVCLTKIEIIGGSVDGTVRTFDIRIGREISDDLGQPVNCISNSNDGNCILASCLDSTLRLVDRSAGELLQEYKGHTCKSYKLDCCLTNTDAHVAGGSEDGYVFFWDLVDASVISKFRAHSSVVTSVSYHPKEDCMTTASVDGTIKVNKT 24 923
360 WD40重复蛋白   MACIKGVGRSASVAKAPDGGYLATGTMAGTVDLSFSSSASLEIFGLDFQSDDRDLPLIAESPSSERFNRLSWGKNGSGSDEFSLGLIAGGLVDGTIGLWNPLSLIRSEAGDKAIVGHLSRHKGPVRGLEFNVIAPNLLASGADDGEICIWDLAAPREPSHFPPLRGSGSAAQGEISFLSWNSKVQHILASTSYNGTTVVWDLKKQKPVISTSDSVRRRCSVLQWNPDLATQLVVASDEDSSPTLRLWDNRNIMSPVKEFAGHTRSVIAMSWCPNDSSYLVTCAKDNRTICWDTVTGETVCELPAGSNWNFDVHWYPKIPGVISASSFDGKIGIYNVEGCSRYGVRENEFGAATLRAPKWFERPVGASFGFGGKVVSFHTRSTGGPSVNSSEVFVHDIITEQTLVSRSSEFEAAIQSGDRPSLRALCEKKSQHCESTDDQETWGFLKVLLEDDGTARSKLLAHLGFDIPTETNDGSQEDLSQQVNALGLEDVTADKVVQEDNNESMVPPTDNGEDFFNNLPSPRADTPVSTSADGPPTVNAAVEPSQDEVDGLEESSDPSFDDSVQRALVVGDYKAAVALCMSANKLADALVIAHVGGASLWESTRDKYLKMSRLPYLKVVFAMVNNDLQSLVDTRPLKFWKETLAILCSFAQGEEWAMLCNSLASKLMAAGNMLAATLCFICAGNIDKTVEIWSRSLATEHDGMSYMDLLQDLMEKTIVLALASGQKQFSASVCKLVEKYAEILASQGLLTTAMDYLKLLGTDDLSPELAVLRDRIAFSVEAEKGANISAFNGSQDPRGAVYGVDQSNYGNVDTSQHYYPEAAQPQVPHTVPGSPYGENYQQPFGSSFGKGYNTPMQYQAPSQASMFVPSEPPQNAQPSFVPTPVTSQPTTRSQFIPAPPLALRNPEQYQQPTLGSHLYPGSVWPTFQPLPHAPGPVAPVPPQVSSVPGQNMPQAVAPTQMRGFMPVTNPGVVQNPGPISMQPATPIESAAAQPVVSPAAPPPTVQTADTSNVPAPQKPVIATLTRLYNETSEAALGGSRANPAKKREIEDNSRKIGALFAKNSGDISKNAADKLVQLCQALDNGDYSTALQIQVLLTTSEWDECNFWLATLKRMIKTRQNVRLS 221 3598
361 WD40重复蛋白   HKERGKGAGRSVDERYTQWKSLVPVLYDWLANRNLVWPSLSCRWGPQLEQATYRNRQRLYLSEQTDGSVPNTLVIAWVEVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRIRELPQNSKIVATHTDSPDVLIWDVETQPNRHAVLGASTSRPDLILTGHKDNAEFALAMSPTEPFVLSGGKDRYVVLWSIQDHISTLAADPGSAKSPGSAGTNNKQSSKAAGGNDKTGDSPSIEPRGYYLGNGDTVEDVTFCPSSAQEFCSVGDDSCLILWDARTGSSPAIKVEKAHHADLHCVDNNPHDVNLILTGSADNTVRMFDRRNLTSGGVGSPVHTFEGHNAAVLCVQWSPDKSSVFGSSAEDGILNIWDHEKIGRKIETVGSKVPNSPPGLFFRHAGHRDKVVDFHWNSSDPWTIVSVSDDGESTGGGGTLQIWRMIDLIYRPEEEVLAELDKFKSHILSCTS 44 1447
362 WD40重复蛋白   MAKIAPGCEPVAGTLTPSKKREYRVTNRLQEGKRPLYAVVFNFIDSRYFNVFATVGGNRVTVYQCLEGGVIAVLQSYIDEDKDESFYTVSWACNIDRTPFVVAGGINGIIRVIDAGNEKIHRSFVGHGDSINEIRTQPLNPSLIVSASKDESVRLWNVHTGICILIFAGAGGHRNEVLSVDFHPSDKYRIASCGMDNTVKIHSMKEFWTYVEKSFTWTDLPSKFPTKYVQFPVFIAPVHSNYVDCNRWLGDFVLSKSVDNEIVLWEPKMKEQSPGEGSVDILQKYPVPECDIWFIKFSCDFMYHSIAIGNREGKIYVWELQSSPPVLIAKLSHPQSKSPIRQTAMSFDGSTILSCCEDGTIWRWDAITASTS 196 1314
363 WD40重复蛋白   HNTAFHFGAGWRSIAEMGTTMSRLEIEFKSCEDEKSLDGVGNSQGPWELPRCLDNELAHLTNLKSRPHEHLIRDFPGRRALPVSTVKMLAGRECNYSRRGRFSSADCCHMLSRYVPVNGPSPLDQMNSRAYVSQFSADGSLFVAGFQGSHIRIYNVDKGWKCQKNILTKSLRWTITDTSLSPDQRYLVYASMSPIVHIVDIGSAAMDSLANITEIHKGLDFSADSGPYSFGIFSVKFSTDGREVVAGSSDDSIYVYDLVANKLSLRIPAHLSDVNTVCFADESGHIIYSGSDDTYCKVWDRRCLSARNKPAGVLMGHLEGITFIDSRGDGRYFISMGKDQTIKLWDIRKNGSDICRRGFRNFEWDYRWMDYPPRARDSKHPFDLSVATYKGHSVLRTLIRCYFSPVHSTGQKYIYTGSHDSCVYIYDVVTGAQVAALKHHKSPVRDCSWHPEYPMIVSSSWDGDIVKWEFFGNGETEIPAMKKRIRRRHLY 193 1668
  364   WD40重复蛋白   MEPQPQAPKKRGRKPKPKEDKKEEQLHQPPPPPPPQQQAAPAPAPAATRSSTSGSAGGRDRRPQQQHAVDEKYARWKSLVPVLYDWLANHNLLWPSLSCRWGPQLEQATYKNRQRLYISEQTDGSVPNTLVIANCEVVKPRVAAAEHVSQFNEEARSPFIRKYKTIIHPGEVNRVRELPQNPNIVATHTDSPDVLIWDVESQPNRHAVYGATASRPNLILTGHQEHAEFALAMCPAEPFVLSGGKDKTVVLWSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPSVGPRGIYCGHEDTVEDVAFCPSTAQEFCSVGDDSCLILWDARVGTNPVAKVEKAHMGDLHCVDWNPEDNNLILTGSADNSVNMFDRRNLTSNGVGSPVYKFEGHKAAVLCVQWSPDKPSVFGSSAEDGLLNIWDYERVDKKVDRAPWAPAGLFFQHAGHRDKIVDFHWNAADPWTMVSVSDDCDTAGGGGTLQIWRMSDLIYRPEEEVLAELENFKAHVLECSKA 78 1634
  365   WD40重复蛋白   MGIFSPYRAVGYITTGVPFSVQRLGTETFVTVSVGKAFQVYNCAKLSLVLVGPQLPKKIRALASYREYTFAAYGSDIGIFKRAHQLATWSGHTAKVCLLLLFGEHILSVDVDGNAYIWAFKGMNYNLSPVGHILLDSNFTPSCIMHPDTYLNKVILGSQEGPLQLWNISTKTKLYEFKGNWSSVSSCVSSPALDVVAVGCADGKIHVHNIRYDEELVTFSHSNRGSVTALSPSTDGQPLLASGSSSGVVSINNLDKRRLQSVIRDAHDGSIISLHFFANEPVLNSSSADNSIKMWIFDTSDGDPRLLRFRSGHSAPPLCIRFYANGRHILSAGQDRAFRLFSVVQQQQSRELSQRHYSKRAKKLKLKEEEIKLKPVIAFDVAEIRERDWCNVVTSHMDTPQAYVWRLQNFVIGEHILRPCPNKPTPVKACMISACGNFAILGTAGGWIERFNLQSGISRGSYIDQLDGTNSAHDGEVVGVACDATNTLMISAGYAGDIKVWDFKGRELKSRWEIGSSLVKISYHRLNGLLATVADDFIIRLFDAVALRMVRKFEGHTDRITDLCFSEDGWILLSSSMDGSLRIWDIILARQVDAVFVDVSITALSLSPNNDILATTHVDQNGVFLWVNQSMFSGDSDINLYASGKEVVTVKLPSVSSVEGSQVEESNEPTIRHSESKDVPSFRPSLEQIPDLVTLSLLPKSQWQSLINLDIIKVRNKPVEPPKKPEKAPFFLPSIPSLSGEILFKPSENSDKGDMKADEDKSKITPEVPSSRFLQLLHSCSEAKNFSPFTTYIKGLSPSTLDLELRMLQIIDDDAVDADADDPQDVDKRQELLSIELLMDYFIHEISCRENFEFVQALVRLPLKIHGETIRRQSVLQHKAKVLLETQCSVWQRVDKLFQGARCMVAFLSNSQF 85 2826
366 WD40重复蛋白 HEETAVTCGSWIRKPENVNLAVLGKSFRRKGSAALEIFAFDPKSTSLSSSPLVAHVIEEIEGDPLAIAVHPNGEDIVCFASSGSCLSFELSGQESNLKLLTKELPPLRGIGPQKCMAFSVDGSRFATGGVDGRLRILSWPSLRIILDEPKAHKSIRDLDFSLDSEFLATTSTDGSARIWKAEDGLPCTTLTRRSDEKIELCRFSKDGTKPFLFCTVQRGDKAVTGVWDISTWNKIGHKRLLRKPAVVMSISLDGKYLAQGSKDGDMCVVEVKKMEVSHWSKRLHLGTSLTSLEFCPIERVVITTSDEWGVLVTKLNVPADWKAWQVYLLLLGLFLASLVAFYIFYENSDSFWGFPLGKDQPARPKIGSVLGDPKSAQQQNMWGEFGPLDM 74 1246
367 WD40重复蛋白 MADPVEHQHQQHQQHQLQQQRRRGWRIQGGQYLGEISALCFLHLPPPPLSLSSSPVLSLSSGLDSESRDRPACSFRFPSAGSGSQVSLFDLASGAMVRTFYVFRGIRVHGIVLGCADFPGGSSSSSSTLDYVIAVYGERRVKLFRLSVRLGRGAGEGSGTVLSADLELVSAAPRLSHWVMDVRFLKENGTSEDELQRCLTVAIGCSDNSIRLWDVDKCSFVLAVSSPERCLLYSMRLWGDNLEDLQVASGTIYNEILIWKVVPNHDAPSSNELTEEGLTNSCAGNSVHECLRYEAYHICRLVGMEGSIFRIAWSSDGSKLVSVSDDRSARIWEVHCKVQYSEDAGEVGLLFGHSARVWDCYISDNLIVTAGEDCSCRVWGLDGQQHDVIKEHIGRGIWRCLYDPWSSLLVTGGFDSAIKVHKLDASLAEASAKQSNIKDLSDGTELFTTHLPNSSGHSGHMDSKSEYVRCLSFSCEDVMYIATNHGYLYHAKLCNDGDLRWTELAQVSNEVQIICMELLPSNPYDPRIDADDWVAVGDGKGWTTVVRVVKNSDSPKVSTSFSWAAEMDRQLLGIHNCKSLGHRFIFTADPRGALKLWRFFEVSQSSSLYPENSPRISLIAEFKSDLGARIMCLDVAFESELLICGDLRGNLVLFPLLKDLLLDTFVVSAAKISPVNHFKGAHGISAVSSISVAHMSFNHIELRSTGADGCICYMEYDKGLQSLNFVGMKQVKELSMILSVSTENESTGYRTSGSYASGFASTDFIIWNLVTEAKVLQVSCGGWRRPHSYYLGDVPENKNCFAYVKDDIIYIRRHWIKDSKDKILPQNLRLQFHGHEVHSLCFVTGDFQLRKNKQSSWIVTGCEDGTVRLTRYTQCTDNWSSSKLLGEHVGGSAVRSICCVSNIHTTSSGTSVSDVKGIENLPKDIKGTLMEDECNPSLLISVGAKRVLTSWLLRRRKQDGKEDDVTDLQEAENSSLPSSAGSSTISFQWLSTDMPVKYSVPSKKSGSIKKLIGVSDTNVRCKSLLPDSEALQSKVSAVDKNEDDWRYLAVTAFLVRHSGSRLIVCFIIVACSDATLAIRALVLPYRLWFDVALMVPLSSPVLSLQHVIIGRCQLPDENVQIGNVYVVISGATDGSIAFWDLTESVEAFMRRLSNIHLEKFMDCQKRPRTGRGSQGGRWWRSLSKIACKEQPINDPVTAKAIKELNRKLTGGVACGSSSSMLDASPELDSNAANSSFEIIEVNPFHVLNGVHQSGVNCLHVCETKHGQSSDGRFLYQLVSGGDDQALHLLKFEVLVQPPVQVPDVPNSDIRNSILVEEFLLDEQNQETKCTIEFISQEKIASAHNSAVKGVWTDGTWVPSTGLDQRVRCWISKDRGTPTELAHFIISVPEPEALDARSICWDQYQIAVAGRGMQMIEFHVPSSEIR 100 4377
368 WD40重复蛋白 MPYKLSATLSNHSSDVRAVASPSDDLILSASADSTAISWFRQSPSSFTPASVIRAGSRFVNAIAYLPPTPRAPQGYAVVGGQDTVVNVFALGPGDKEEPEYTLVGHTDNVCALSVNSDDTIISGSWDKTAKVWKDFALVYDLKGHQQSVWAVLAMNEKEFLTASADRTIKYWVQHKTMQTYEGHRDAVRGLALIPDIGFASCSNDSEIRVWTMGGDVVYTLSGHTSFVYSLSVLPNGDLVSAGEDRSVRVWRDGECSQVIVHPAISVWAVSTMPNGDIISGSSDGVVRVFSESEKRWATASELKALEDQIASQSLPSQQVGDVKKTDLPGPEALSVPGKKAGEVKMIRSGDVVEAHQWDSLASSNQKIGEVVDAIGSGRKQLHDGKEYDYVPDVDIQEGAPPLKLPYNVSENPYTAAQRFLEQMDLPTGYLCQVVKFIEQNTAGVKLGNDSYVDPFTGASRYQPATQSTSNTASSSYMDPFTGGSRHIAESAPSNVPQGSHATGIIPFSKPIFFKLANVSAMQAKMFQPDEVLRNEISTATLAMRPQEVIMVNETFTYLSKVVTSTSSARTSLGWIHIETIMQILDRWPVPQRFPVIDLGRLVTAYCMNAFSGPGDLEKFFSCLFRTSENTSITSGSKALTKAQETHVLLLFRTIANSLDGAPLNDMEWIKQIFRELAQTPQLVLNKSHRLALASVLFNFSCIGLKGPVPADVRTLHLTIILQVLRSPNDDPEVAYRTCVALGNMLYSDKTRGTPRDAQSPSPTELKSAVAAIKGGFSDPRINDVHREIMSLI 58 2439
369 WD40重复蛋白 MPPQKIESGRKDTVHDLANDYYGKRLATASSDHTINVVGVSSSGSQHLATLIGHQGVVWQISWAHPKFGSLLASCSYDGRVIIWREGNPNEWTQAQVFELHKSSVNSVAWAPHELGLCLACGSSDGNISVFTARQDGGWDTSRIDQAHPVGVTSVSWAPSTAPGALVGSGNMEPVQKLCSGGCDNTVKVWKLYNRVWKLDCFPVLQNHTDWVHDVAWAPNLGLPKSTIASASQDGRVIIWTLAKEGDQWQGKVLYDFRTPVWRVSWSLTGNILAVADGNNNVSLWREAVDGEWWIQVSTVEP 159 1064
370 WD40重复蛋白 MSAPMLEIEARDVVKIVLQFCKENSLHQTFQTLQSECQVSLNTVDSIETFVADIMSGRWDAILPQVAQLKLPRNTLLDLYEQIVLEMIELRELDTARAILRQTQAMGVMKQEQPERYLRLEHLLVRTYFDPNEAYQDSTKEKRRAQIAQALAAEVTVVPPSRLMALVGQALKWQQHQGLLPPGTQFDLFRGTAAMKQDVDDMYPTTLSHTIKFGTKSHAECARFSPDGQFLVSCSVDGFIEVWDYMSGKLKDLQYQADETFMNHDDQPVLCVDFSRDSEMLASGSQDGKIKVWRIRTGQCLRRLERAHSQGVTSVLFSRDGSQLLSTSFDGSARIHGLKSGKQLKEFRGHSSYVNDAIFSNDGSRVITASSDCTVKVWDVKTSDCLQTFKPPPPLRGGDASVNSVHLFPKNADHIVVCNKTSSIYIMTLQGQVVKSLSSGKREGGDFVAACVSPKGEWIYCVGEDRNLYCFSCQSGKLEHLMKVHEKDYIGVTHHPHRNLVATYSEDSTMKLWKP 118 1665
371 WD40重复蛋白 MDLLQSYAEDNDGDLGRHSSPEPSPPRLLPSKSAAPKVDDTTLALTVAQTNDTLARPIDPSQHAVAFNPTYDQLWAPICGPAHPYAKDGIAQGMRNBKLGFVEDAAIGSFLPDEQYNTFQRYGYAADPCASTGNEYVGDLDALKQNDGISVYNIRQQEQKKYAEEYAKKKGEERGEGGREKAEVVSDKSTFHGKEERDYQGRSWIAPPKDALATNDHCYIPKRLVHTWSGHTKGVSAIRFFPKHGHLILSAGMDTKVKIWDVFNSGKCMRTYMGHSKAVRDISFCNDGTKFLTAGYDKNIKYWDTETGKVISTFSTGKIPYVVKLHPDDEKQNILLAGMSDKKIVQWDMNTGQITQEYDQHLGAVNTITFVDDNRRFVTSSDDKSLRVWEFGIPVVIKYISEPHMHSMPSISLHPNTNWLAAQSLDNQILIYSTRERFQLNKKKRFAGHIVAGYACQVNFSPDGRFVMSGDGEGRCWFWDWKSCKVFRTLKCHEGVCIGCEWNFLEQSKVATCGWDGLIKYWD 57 1628
372 WD40重复蛋白   MESNGNLEQTLQDGRIYRQLN$LIVAHLRDHVFPQAASAVALATMTPLNVEAPRNRLLELVAKGLAVEKGELLRGVSHAGTNDLGGSIPASYGLVPAPWTAIDFSSLRDTKGMSKSFTTKHETRELSDKNVARCARFSTDGRFFATGSADTSIKLFEVSKIKQNMLPDSTDGAIRAVIRTFYDHTHPVNDLDFHPQNTVLISAARDHTVKFFDYSKATAKRAFRVIQDTHWVRSVAFHPSGDFLLAGTDHPIPHLYDVNTFQCYLSANVPEFAVNAAINQVRYSSSGGMYVTASKDGTIRFWDGASANCVRSIAGAHGAAEVTSANFTKDQRYVLSCGKDSTVKLWEVGTGRLVKQYLGATHMQLRCQAVFNNTEEFVLSIDEPSNEIVVWDAMTAKVARRWPSNHNGPPRWIEHSPTEAAFVSCGTDRSIRFHKETH 250 1566
373 WD40重复蛋白   MSNFQGEDGEYVADDFEAEDGDEELHGRESADPESDVDEIDTPSNRFTDTTADQARRGRDIQGIPWERLSITREKYRRTRLEQYKNYENVPQSGEKSGKDCTVTEKGNSFYEFRRNSRSVKSTILHFQLRNLVWATSKHDVYLMSNYSVVHWSSLTGKKSEVLNLAGHVAPNEKHPGSLLEGFTQTQVSTLAVKDRFLVAGGFQGELICKFLDRPGISFCSRTTYDDHAITNAVEIYVSPSGGIHFIASNNDDGVRDFDMENFELSKHFRFPWPVNHTSLSPDGKLLVIVGDDPEGILVDAKTGDTIMPLRGHLDFSFASEWHPDGVTFATGNQDKTCRIWDIRNLSKSIAVLKGNLGAIRSIRYTSDGRYMAIAEPADFVHVYDTKTGYKKEQEIDFFGEISGMSFSPDTESLFIGVWDRTYGSLLEYGRRRNFSYLDCLV 106 1434
  374   WD40重复蛋白   MGVSEDLEDLNALAESTDAAVDGQAALASAVDSVTLQPAPPILPPVIPPPAVPVVAPVPTIPPVLRPLAPLPIRPPVLRPPAPKRDEAGSSDSDSDHDGTAAGSTAEYEITEESRLVRERHEKAMQDLNMKRRGAALAVPTNDKAVRARLRRLGEPMTLFGEREMERRDRLRMLMAKLDAEGQLEKLMKAHEDEEAAASAAPEDVEEEMLQYPFYTEGSKALFNARIDIAKFSITRAALRLERARRRRDDPDEDVDAEIDWALKKAESLSLHCSEIGDDRPLSGCSFSHDGKLLATCSMSGVAKLWDTCRMPQVNRVLTLKGHTERATDVAFSPVQHHLATASADRTAKLMNTEGTILRTFEGHLDRLGRIAFHPSGKYLGTTSFDKTWRLWDIESGEELLLQEGHSRSIYGIDFHRDGSLVASCGLDALARVWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYTIPAHANLISEVKFEPQEGYFLVTASYDTTAKVWSARDFKPVKTLSVHEAKITSVDITADASHIVTVSHDRTIKLWTSNDDVKEQAMDVD 190 1917
375 WD40重复蛋白 MVKAYLRYEPAAAFGVIASVESNIAYDASGKHLLAPALEKVGVWHVRQGVCTKALAPSASSAAGPSLAVTAIASSPSSLIASGYADGSIRIWDFEKGSCETTLNGHKGAVSVLRYGKLGSLLASGSKDNDIILWDVVGETGLYRLRGHRDQVTDLVFLDSDKKLVSSSKDKYLRVWDLETQHCMQIVGGHHSEIWSLDTDPEERYLVTGSADPELRFYTVKNDSSDERSEADASGGVGNGDLASHNKWDVLKQFGEIQRQSKDRVATVRFNKNGNLLACQAAGKLVEVFRVLDEAEAKRKAKRRLHRRKREKKGADVNEGDSSRGIGEGHDTMVTVADVFKLQTIRASKKICSISFCPVAPKSSLATLALLSLNNNLLEFHSIEADKTSKMLTIELQGHRSDVRSVTLSSDNTLLMSTSHNSVKIWNPSTGSCLRTIDSGYGLCGLIVPQNKHALIGTKDGAIEIFDVGSGTCIEVVEAHGGSIRSIVAIPNQNGFVTGSADHDIKFWEYGMKQKPGDNSKHLTVSNVRTLKMNDDVLVVAVSPDAQKIAVALLDCTVKVFFMDSLKLMHSLYGHRLPVLCLDISSDGDLIVTGSADKWLMIWGLDFGDRHKSIFAHGDSIMAVQFVGNTHYMFSVGKDRLVKYWDADKFELLLTLEGHHADIWCLAISNRGDFLVTGSHDRSIRRWDRTEEPFFIEEEKEKRLEEMFESDLDWAFGWKYVPKEEIPEEGAVALAGKRTQETLSATDSIIEALDIAEVELKRIAEHEEEKNNGKTAEFHPNYVMLGLSPSDFILRALSNVQTNDLEQTLLALPFSDALKLLSYLKDWTTYPDKVELVSRIATVLLQTHYNQLVSTPAARPLLTTLKDILEKKVKECKDTIGFNLAAMDHLKQLMALRSDALFQDAKVKLLEIRSQLSKRLEERTDPREAKKRKKKQKKSTNMHAWP 102 2942
376 WD40重复蛋白 MGGVQAEREDKDKVSLELTEEILQSMEVGMTFRDYSGRISSMDFHRASSYLVTASDDESIRLYDVASATCLKTINSKKYGVDLVSFTSHPMTVIYSSKNGWDESLHLLSLHDNKYLRYFKGHRDRVVSLSLCPRNECFISGSLDRTVLLWDQRACKCQGLLRVQGRPATAYDDPGLVFAIAFGGCVRMFDAREYEKGPFEIFSVGGDVSDANVVKFSNDGRLMLLTTTDGHIHVLDSFRGTLLYTFNVKPTSSKSTLEASFSPEGMFVISGSGDGSVYAWSVRGGKEVASWLSTDTEPPVIKWAPGNLMFATGSSELSFWIPDLSKLGAYVGRK 75 1079
377 WD40重复蛋白 MARFGAAPAGNHNPNKSSEVIQPPSDSVSSLCFSPRANHLVATSWDNQVRCWELTDNGASVTSVPKASMSHDQPVLCSAWKDDGTTVFSGGCDKQAKMWSLMSGGQPVTVAMHDAPIKEIAWIPEMNVLVTGSWDKTLKYWDTRQSNPVRTQQLPERCYAMTVRYPLMVVGTADRNLIVFNLQNPQAEFKRFSSPLKYQTRCVAAFPDQQGFLVGSIEGRVGVHHLDDSQISKNFTFKCHRQNNDIYSVNSLNFHPVHHTFATAGSDGTFNFWDKDSKQRLKAMSRCSQPIPCSTFNNDGTIYAYSVCYDWSKGAENHNPATAKTYIFLHLPQESEVKAKPRVGTTNRK 99 1148
378 WD40重复蛋白 MSCSISGEVPEEPVVSTKSGHVFERRLIERYVSQYGKCPVSGEPLTMDDVLPVKMGKIVKPRPLQAASIPGLLSIFQNEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKKERDEARSLLALAERQIPMTASSDIAVNAPAMSNGRKASLDEEPGYAGKKMRPGISASIIAEITDCNLALSQQRKKRQIPSTLAPVEDLERYTQLSSYPLHKTGKPGITSLDICHSKDIIATGGIDTSAVLFDRSSGQIMSTLSGHSKKVTSVNFDAQGDMVLTGSADKTVRIWQGSEDGSYNCRHILKDHTAEVQAITVHATNNYFATASLDNTWCPYEFSTGLCLTQVEGASGSEGYTSAAFHPDGLILGTGTSNADVKIWDVKTQANVTTFSGHTGRITAISFSENGYFLATAAQDGVKLWDLRKLKNFRTFSAYDKDTGTNSVEFDHSGCYLGLAGSDIRVYQVASVKSEWNCVKTFPDLSGTGKVTCVKFGPDSKYIAVGSHDHNLRIFGLPSEDGAMES 232 1806
379 WD40重复蛋白 MAAPGVETLKKEIKELKEKIAQHRLDTDGEQPLPAAAKSKSVPEVSAALKQRRILKGHFGKIYALHWSADSRHLVSASQDGKLIIWNGFTTNKVKAIFLRSSNVMTCAYSPSGNLVACGGLDNLCSVYKVPHGGNKESSSAQKTYGKLAQHEGYLSCCRFIKDNEIVTSSGDSTCILWDVETKTPKAIFNDRTGDVMSLAVFDDKGVFVSGSCDATAKLWDHRVHKQCVMTFQGHESDINSVQFFPDGDAFGTGSDDSSCRLFDIRAYQQINKYSSDKILCGITSVAFSKTGKSLFAGYDDYNTYVWDTLSGNQVEVLTGHENRVSCLGVSEDGKALATGSWDTLLKIWA 72 1124
380 WD40重复蛋白 MGGVEDESEPASKRMKLSSRVLRGLANGSSRTEPAAGSSLDLHARPLPIEGDEEVIGSKGVIKRVEFVRLIAKALYSLGYEKSGARLEESSGIPLQSSVVNLFMQQISDGLWDESVVTLHKIGLSDENLVKSASFLILEQKFLELLDQEKAMDALKTLRTEITPLCIKNSRVRELSSCIISPSSCGLLNQNKRNSTRARSRSELLEELQKLLPPAVIIPERRLEHLVEQALVLQTDACNLHNSIDMEMSLYTDHQCGKEHIPCRTLQILQSHNDEVWLVQFSHNGRYLASASNDRSAIIWEVDENGSVSLKHKLTGHQKPISSVCNSPDDRQLLTCGVGETVRRNDVSSGECLRVYEKAGRGLISCAWFPDGKWICYGVSDRSICMCDLEGKEIECWKGQRTLSISDLEITSDGKQIISICRETAILLLDREAKYERMIEENQTITSFSLSKDNRYLLVNLLNQEIHLWDIKGDFRLVAKYKGLKHSRFVIRSCFGGLKQAFVASGSEDSQVYINHKGSGELIEPLPGHSGAVNCVSWNPANHHMLASASDDRTIRIWGLWELNTRHKGARPNGVHYCNGNGTS 315 2069
381 WD40重复蛋白 HTQLAETYACMPSTERGRGILIAGNPKPGSNSVLYTNGRSVVILNLDNPLDISVYAEHAYPATVARFSPNGEWVASADSSGAVRIWGAYNDHVLKKEFKVLSGRIDDLQWSPDGLRIVASGDGKGKSLVRAFMNDSGTNVGEFDGHSRAVLSCAFKPTRPFRIVTCGEDFLVNFYEGPPFKFKLSRRDHSNFVNCLRFSPDGNRFISVSSDKKGIIYDGKTGEKIGELSSQGGHTGSIYAVSWSPDSKQVITVSADRSAKIWDISEDGSGNLRKTLTSSGSGGVDDMLVGCLWQNNHLVTVSLGGTISIYTAGDLDKAPVSFSGHMKNVSSLSVLKGDPKVILSSSYDGLIIKWIQGIGFSGRVQRKESTQIKCLAAVDEEIVTSGYDNKVCRVSGSGDAEFIDIGCQPKDLSLALQCPEFALVSTDTGVVLLRGAKIVSTINLGFAVTASTVAPDGTEAIIGAQDGKLRIYSISGDTLTEEAVLEKHRGAISVIHYSPDLSMFASGDLNRKAVVWDRASREVRLKNILYHTARINCLAWSPDSSTVATGSLDTCVIIYEYDRPASNRLTIKGAHLGGVYGLAFTDDFSVVSSGEDACIRVWKINRQ 145 1968
382 WD40重复蛋白 MKVKVISRSTDEFTRERSQDLQRVFRNFDPNLRTQEKAVEYVRALNAAKLDKVFARPFVGANDGHVDSVSCMAKNPNYLKGIFSGSMDGDIRLWDIASRRTVCQFPGHQGPVRGLAASTDGQILVSCGIDSTVRLWNVPVATLGESDGTHENLAKPLAVYVWKHAFWAVDHQWDGELFATAGAQVDIWNQNRSQPISSFEWGTDTVISVRFNPGEPNVLATSCSDRSITLYDLRMSSPTRKVIMRTKTWAISWNPHEPMNFTAANEDCNCYSYDARKLEEAKCVHKDHVSAVMDIDYSPTGREFVTGSYDRTVRIFQYNGGHSREVYHTKRMQRVFCVKFSCDASYVISGGDDTNLRLWKAKASEDLGVVLPRERRKHEYHEAVKSRYKLPEVKRRIVRHRHLPKPIYKAGILRRTVNEADRRKEERRKARSAPGSSSAEPLRKRRIIKEIE 130 1488
383 WD40重复蛋白 MVRSIKNPKKAKRKNKGSKNGDGSSSSSSIPSMFTKVWQPGVDRLEEGEELQCDPSAYNSLRAFMIGWPCLSFDIVRDTLGLVRTEFPHQVYFVAGTQAEKPTWNSIGIFRVSNITGKRRELVPSKPTDDADEESDSSDSDEDSDDEVGGSGTPILQLGKVGHEGCVNRIRAMNQNPHJCASWGDSGRVQIWDFSSHLNALAESKADVSQGASSVFNQAPLVKFGGHKDEGYALDWSPLVPGRLVSGDCKNSIHLNEPTSGSTWNVDSTPFIGHAASVEDLQWSPTEENVFASCSVDGTIAIWDTRLGKTPAASFKAHDADVWVISWNRLATCMLASGCDDGTFSIHDLRLLKEGDSVVAHFEYHKHPYTSIEWSPHEASTLAVSSADCQLTINDLSLEKDEEEEAEFKAKTKEQVNAPEDLPPQLLFVHQGQKDLKELHWHAQIPGMIVSTAADGFNILMPSNIQSTLPSDGA 269 1693
384 CDKA型 MWRYKVIKELGDGTYGSVNKALNQQTHEIVAIKKMRRKYYIWEECINLREVKSLRKLHHPNIIRLKEVIRENNELFFIFEYNECNLYQINKERSTPFSETAIIKFCYQILQGLSYNHRNGYFHRDLKPENLLVTSDLIKIADFGLAREVLTSPPYTDYVSTRWYRAPEVLLQSFTYTTAIDMNAVGAILAELFTLHPLFPGESELDEIYKICGVLGTPDYETWPDGMGLAAFRNFIFPQFLPVNLSVLIPHASPEAIDLITRLCSWDPQKRPTAEQALRHPFFRIGMSIPLSLGGHPQDNTCAAEVDTNFHSKKACKGRGMGEKESSLECFLGLSLGLKPSLGHLGAMGSQGVGAVKQEVGSSPGCQSWPKQSLFQVANSRAILPLFSSSPNLNVVPVKSSLPSAYTVNSQVMWPTIAGPPAAAVTVSTLQPSILGDFKIFGKSMGLASQYAGKEASPFS 1163 2545
385 CDKA型 MGEMGRGTNNSSWNNNSNRPAWLQHYDLVGKIGEGTYGLVFLARSKLPNNRGLKIAIKKFKQSKDGDGVSPTAIREIMLLREFSHENVVKLVNVHINHVDMSLYLAFDYAEHDLYEIIRHHREKLNHHNLNQYTVKSLLWQLLNGLNYLHSNWIVHRDLKPSNILVMGEGEEHGVVKIADFGLARIYQAPLKPLSDNGVVVTIWYRAPELLLGAKHYTSAVDMWAVGCIFARLITLKPLFQGVEVKASPNPFQLDQLDKIFKVLGHPTIEKWPTLMNLPHWSKNLQQIQQHKYDNAGLRIGPIPAKSPAYDLLSKMLEYDPRKRITAAQALEHEYFRIDPQPGRNALVPSQPGEKAINYPPRLVDANTDFDGTIAPQPSQVSSGHAPSGSIASAAVPAVRPLPQQMQLMGMQPMQNPGMAAFNLGAQASMSGLNHNNIALQRGSSQQQAHQQVRRKEPNSGFPNTGYPPPPKSRRL 152 1582
386 CDK B-1型 MDKYEKLEKVGEGTYGKVYKARDKNTGQLVALKKTRLEMDEEGVPPSSLREISLLQMLSQSIVVVRLLCVEHVTKKGKPLLYLVFEVLDTDLKKFIDYRRSVNAGPLPQNVIQSFMYQLLKGVAHCHSHGVLHRDLKPQNLLVDKSKGLLKVGDLGLGRAFTVPLKCYTHEVVTLHYRAPEVLLGSTHYSTPVDIWSVGCIPAENVRRQPLFPGDCEIQQLLHIFTLLGTPTEEMNPGVKRLRDWHEYPQWKPEWLARAVPNLSPTGLDLISKMLQCDPAKRISAKAAMNHPYFDDLDKSQF 389 1297
387 CDK B-1型 MDGYEKMDKVGEGTYGKVYMARDKKTGQLVALKKTRLENDGEGIPPTALREISLLQMLSQQIYIVRLLDVKHTEMKLGKPLLYLVFEYMESDLKKYIDSYRRSHTRMPPSMIKSFMYQLCRGVAYCHSRGVNHRDLKPHKLLVDKEKGVLKIADLGLSRAFTVPVKKYTHEIVTLWYRAPEVLLGATHYSLPVDIWSVGCIFAEMSRMQALFTGDSEVQQLMNIFRFLGTPNEEVWPGVTKLKDWHIYPEWKPQDISHAVPDLKPSGLDLLSQMLVYRPSKRISAKKALEHPYFDDLDKSQF 38 946
388 CDK B-1型 MDAYEKLEKVGEGTYGKVYKAKDKNTGQLVALKKTRLESDDEGIPPTALREISLLQMLSQDIHIVRLLDVEHTENKNGKPLLYLVFEYMDSDLKKYIDGYRRSHTKVPPNIIKSFNYQLCQGVAYCHSRGVMHRDLKPHNLLVDXQRGVVKIADLGLGRAFTIPIKKYTHEIVTLWYRAPEVLLGATHYSTPVDIWSVGCIFAEMVRLQALFIGDSEVQQLFKIFSFLGTFNELJWPGVTKFRDQHIYPQNKPQDISSAVPDLEPSGVDLLSKMLVYEPSKRISAKKALEHPYFDDLDKSQF 180 1088
389 CDK B-1型 WDSYEKLEKVGEGTYGKVYKAKDKKTGKLVALKKTRLENDGEGIPPTALREISLLQMLSQDMNIVHLLDVEHTEHKNGKPLLYLVFEYMDSDLKKYVDGYRRSHTKMPPKIIKSFMYQLCQGVAYCHSRGVMHRDLKPHNLLVDKQRGVLKIADLGLGRAFTVPIKKYTHEIVTLWYRAFEVLLGATHYSTPVDIWSVGCIFAEMSRMPALFCGDSEVQQLMSIFKFLGTPNEGVNPGVTKLKDWHIYPEWRPQQLSRAVPDLEPSGVDLLTKMLVYEPSKRISAKKALQHPYFDDLQRSQF 40 948
390 CDK B-1型 MEKYEKLEKVGEGTYGEVYKGHDKHTGRLVALKKTPFHQEEGIPPTAIREISLLKSLSQCIYIVKLLDVKASFNGKGKHVLFMVFEYADSDLKKHIDAHRQCNTKLSPRSIQSYMFQLCKGIAYCHSHGVLHRDLKPQNILVDQKIGLLKIADLGLGRACTVPIKSYTFEVVTLWYRAPEVLLGAKRYSMALDIWSLGCIFAELCNLQALFAGDSQIQQLINIFRLLGTPNEQLWPGVTQLSDWHEFPQWRPQDLSKVVFNLDPNGVDLLSKMLQYDPAKRISAKEALDHPYFDSLDKSQF 229 1134
391 CDK C型 MGCVCGKPSARAADYVESPAEKGASSNSRSSSMASRRLVAPRVNDQGIDAENGHEGDYRTKLRGKQSNGADPVSLLSDDAEKQRHSRHHQHQQHHPIRPHHLRPQGEFVPNANSNPRFGNPPRHIEGEQVAAGHPANLTAVAGEAIKGNIPRRADSFEKLDKIGQGTYSNVYKARDLDTGKIVALKKVRFDNLEPESVRFMAREIQVLRRLDHPNVYKLEGLVTSRMSCSLYLVFEYHDHDLAGLAACPGIKFTEPQVKCYMQQLLRGLDHCHSRGVLHRDIKGSNLLIDNGGILKIADFGLATFFHPDQRQPLTSRVVTLWYRPPELLLGATEYGVAVDLWSTGCILAELLAGKPIMPGRTEVEQLHKIFELCGSPSEDYWKKSKLPHATIFKPQQPYKRCVAETFKDFPPSALALMEVLLAIEPADRGTATSALKSDFPTTKPLACDPSSLPKYPPSKEFDAKIRDEEARRQRAAGGRGRDAARRPSRESPAIPAPEANAELAISIQKRRLSSQGPSKSKSEKFNPQQEDGAVGFPIEPPRFNHIGIDAGATSRMYSQQFGPSHSGFLSNQISSSIWGKNQKEDEIQMAPGRPSRSSKATISDFRKPGACAPQPGADLSHLSSLVATARSNAGIDTHKDRSGHNQHNRIDAIDCGVHNNGKHEFLEVPEHPNRQDWTRFQQPESFGLDNYHLQDLPATHHRKDERVASKEATMNWQGYGGQGGDKIHYSGPLLPPSGNIDEILKEHERHIQHAVRRARQDKGRPQRSNLSQNERKAFEHRSFVSGVNGNAGYSDLVNELPISVGSNRLKVSKTRGTEEIVELAELEREPLSSVMEKYERKHKM 105 2642
392 CDK C型 MGCVCAKQSDILGEPESPKVKGSNLASSRWSVSSETKQLPQHSDSGILHHQHYYHPRDESDEAKLKSSNYGGSKRRTRQGRDPADLDMGIFVRTPSSQSEAELVARGNPAWMAAFAGEAIHGWIPRRAESFEKLYKIGQGTYSNVYKARDLDNGKIVALKKVRFDSLDAESVRFMAREILVLRKLDHPNIVKLEGLVTSEVSSSLYLVFEYNEHDLAGLAACPGIKFTEPQVKCYMQQLLQGLDHCHRHGVLHRDIKGSNLLIDNGGILKIADFGLATFFYPDQKQLLTSRVVTLWYRPPELLLGATDYGVAVDIWSAGCILAELLAGKPILPGRTKVEQLHKYFKLCGSPSEDYNKESKLPHATIFKPQHPYKSCIAEAFKDFSPSALALLETLLAIEPGHRGEASGALKSEFFTTEPLSCDPSSLFKYPPSKEFDAKLRAQETRRQRDVGVRGHGSEAARRTSRLSRAGPTPNEGAELTALTQKQHSTSHATSNIGSEKPSTKKEDFTAGLHIDPPRPVNHSYETTGVSRAYDAIRGVAYSGPLSQTHVSGSTSGKKPKRDHVKGLSGQSSLQPSKPFIVSDSRSERIYEKSHVIDLSNHBRLAVGRNRDTTDPHKSLSTLNQQIQDGTLDGIDIGTHEYARAPVSSTKQKSAQLQRPSALKYVDNVQLQNTRVGSRQSDERPANKESDMVSHRQGQRIHCSGPLLNPSANIEDLLCKDEQQIQQAVRRAHHGKREALSNKSSLPGKKPVDHRAWVSSGKGNKESPYFKGKGNEELRDLKGGPTAKVTNFRQKVM 187 2580
393 CDK C型 MAVANPGQLNLQEAPSWGSRSVNCFEKLEQIGEGTYGQVYMAREIETGEIVALKKIRMDHEREGFPITAIREIKLLKKLQHENVIKLKEIVTSPGPEKDEQGKSDGNKYNGSIYMVFEYMDHDLTGLAERPGMRFSVPQIKCYMKQLLIGLHYCHINQVLHRDIKGSNLLIDHNGILKLADFGLARSFCSDQNGNLTNRVITLWYRFPPELLGSTKYGPAVDMNSVGCIFAELLYGKPILPGKNEPEQLTKIFELCGSPDESNWPGVSKLPWYSNFKPQRQMKRRVRESFKNFQRHALDLVEKHLTLDPSQRISAKDALDBEYFWTQPVPCAPSSLPRYEPSHDFQTKRKRQQQRQHDEMTKRQKISQHPPQQHVRLPPIQNAGQGHLPLRPGPNPTMHNPPPQFPVGPSHYTGGPRGAGGQNRHPQNIRPLNAAQGGGYNANRGYGGPPQQQGGGYPPHGMGNQGPRGGQFGGRGAGYSQGGPYGGPVGGRGPNVGGGNRGPQFNSEQ 220 1749
394 CDK D型 MQNMEDNVQSSWSLHGNKEICARYEILERVGSGTYSDVYRGRRKADGLIVALKEVHDYQSSWREIEALQRLCGCPEVVRLYENFWRENEDAVLVLEFLPSDLYSVIKSGKNKGKNGIPEAEVKAWNIQILQGLADCHANWVINRDLKPSNLLISADGILKLADFGQARILEEPEAIYEVEYELPQEDIVADAPGERIHEEDDSVKGVRNEGEEDSSTAVETNFGDMAETANLDLSWKNEGDMVMQGFTSGVGTRWYRAPELLYGATIYGKEIDLNSLGCILGELLILEPLFSGTSDIDQLSRLVKVLGTPTEENWPGCSNLPDYRKLCFPGDGSPVGLKNHVPSCSDSVFSILERLVCYDFAARLNAKEVLENKYFVEDPYPVLTHELRVPSPLREENNFSEDNAKWKDMEADSDLENIDEFNVVHSSDGPCIKFS 438 1748
395 CDK D型 MDLKQYPEDLNPELPEGTDNVDNPDNWKGSPVPSPHPPLKPLDPSERYRKGITLGQGTYGIVYKAFDTVTNKTVAVKKIHLGKAKEGVNVTALREIKLLKELSHPNIIQLIQAYFKKQNLHIVFEFMETDLEAVIKDRNLVFSFADIKSYLQMTLKGLAVCHKKWVVLHRMKPNNLLIAADGQLKLGDFGLARLFGSPDRKFTHQYFAVWYRAPELLFGAKQYGPAVDINATGCIFAELLLRKPFLQGVSDLDQIGKIFAAFGTPRQSQWPDVASLPDFVEFQFVPAPSLRSLFPMASEDALDLLSKMFTLDPKNRITAQQALEHRYFSSVPAPTRPDLLPKPSKVDSSRPPKHASPDGPVVLSPSKARRVMLFPNNLAGILPKQVSQSTTGGTPIEFDMPTQKLREVCPRSRITESGKKHLKKKTMDMSAALDECAREQEGQEGRTILDPDRQRSAKKEDHM 240 1631
396 周期素A MAGGQENCVRITRARAACVSRASAPVIQSQVDEKKSRKRAPKRAAVDDLAANRSGSQPKRRAVLGDVTNLHAAATDCLSTAEDQVDAPNPSIKGRARNKKKEARTSTKVVKDEIHPESNPLADHSSNLSECQKPPAAKLAEQRSLRGVPSKAKQGGSSNSQSCSKHTDTDKDHTDPQMCTTYVEQIYEYLRNAELKNRPSANFMETAQNDITPNMRAILVDWLVEVSEEYKLVPDTLYLTVSYIDRYLSANPTSRHKLQLLGVSCMLIASKYEEVCPPHVEEFCYITDNTYTRDEMLSMERKILIFLNFEMTKPTTKSFLRRFVRASQAGKKAPSLHMEFLANYLAELTLMECSFLQYLPSLIAASTVFLSRLTLDFLTNPNNPTLAHYTGYKASQLKDCVMAIYNVQNNRKGSTLVAIREKYQQHKFKCVASLPPPPFIAERFFEDTPN 252 1604
397 周期素A MTGTQASNVRITRARAAKSTLNNALPPLPPAQGKPRGKRAATESNISGFSVAAEPLKRRAVLSDVSNICKEAAAVDCLKKPKAVKVVSQNANAKGRGRGIPRNNKKITQEAEIKKETSPAICNVDDASAGNAIGDDKQNNNVNPLKEVQDWPKELNPIAEQISVHPHCKQSVEKFWEKEIVVSDNKAAIASLKQQSTLQSLRLPKQPKYSLKQGNPVPLANLHEDVGRSSCSDEIDIDSEYKDPQMCTAYVTDIYANMRVVELKRRPLPNFMETTQRDINANMRSVLIDWLVEVSEEYKLVPDTLYLTVSYIDRFLSANVVNRQRLQLLGVSCMLVASEYEEICAPPVEEFCYITDNTYKKEEVLEMEISVLNRLQYDLTTFTTKTFLRRPIRAAQASCKVSSLHLEFNGNYLAELTLVEYDFLKYLPSLIAAAAVFVARMTLDPMVHPWNSTLQHYTGYKVSDMRDCICAIHDLQLNRKGCTLAAIKEKYNQPKFKCVANLFPPPIISPQFLIDNEV 261 1817
398 周期素B HAAPNQNALLINNNNRRPLVDIGNLVGALNAQCNISRNGAFKRAFGDIGNLVEDLDAKCTISKYWVRKRPRTNFGVNANKGASSSTQGQGIVVRGEQKAWDRIVNGNKQSCAIKMRAQHVTATQRGTAISISDIIDSSVQDGGIKAPSQLKARKQTVRTVTATLTARSEDSLRDVLEVPPGIDDGDRDNPLAVVEYVEDIYHFYRKIEVRSCVPPDYMTRQLEIKDSMRGVIIDWLIEVHRTFLLMPETLYLTVNIIDRYLSIQSVTRNELQLMGITAMFIASKYEEISPPKINDLVYITKDAYTSKQIVNMEHTILNRLKFKLTVPTPYVFLVRFLKAAGPDKVMKNLAFFLVDLCLLHYKMIKYSPSMLAAAAVYTAQCTLKKHPYNNKTLILHIGYSEAHLRECAHLMADLHLKAEGSNLKSVYKKYSYPIFGSVAFLSPAKIPAGTVAAPAIDKCAHQIYLRNLR 167 1576
399 周期素B MFPNKQTQGLVQNKKMASKAAQPKAMVPPQRVPPAANNRRALGQIGNIVADVGGKCNVTKDGVNGKPLAQVSRPITRSFGAQLLAQAAANKGISAANNQTQVPVVIPKADVRGNKQRRTSKSKDIPPTTVVTNESDDCVIIEQAQRIKFTCNHNVGAVGNKEKPQLLTAKPKSLTASLTSRSAVALRGFRFDDEMTEAEEDPLPNIDVGDRDNQLAVVEYVEDIYKFYRRTEQMSCVPDYMPRQQEINPKMRAVLINMLIEVHYRFGLMPETLYTTNLIDRYLATQLVSRSNHYQLVGATAMLLASKYEEIWAPEMNDFLDILENKFERKHVLVMEKAMLNKLKFHLTVPTPYVPLVRFLKAAASDEEMENLVFFLMELSLMQYVMIKFPPSNLAAAAVYTAQITLKKTTVWNDVLRRHTGYSEIDLKECTRLMVAFHQSSELSKLNVVFKKYSMPEYDSVALIKPAKLPA 183 1598
400 周期素D MAPSFDCVANAYIESCEDQEKLRQNAQILAQSGENDVDEPVSMLVQNETHYMLPEDYLQRLRNRTLDVNQRREAVGWILKVHSFYNFGAPTAYLAVNYLDRFLSRHRMPQGVKAWMIQLMAVACLSLAAKMEETQVPLPSDLQREDARFIFDARTIQRNELLILSTLQWGMRSITPFSFIDYFAYRAVQGHGHGHDATPKAVMSRRIELILSTTEEIDFMEYRPSAIAAAALLCAAEEVVPLQAVHYKRALSSSITDVDKDDMFGCYNLIQETIIEGGCYWTPMSLQSTEKTPVGVLDAAACLSNTPTSSYSVKPYASVTAAKRRKLNEICSALLVSQAHPC 98 1126
401 周期素D MAANFNTSSHCKELLDAEKVGIVHPLDKDQGLTQEDVKIIKINMSNCIRTLAQYVKLRQRVVATAITYCRRVYTRKSFTEYDPQLVAPTCLYLASKAEESTVQAKLVIFYMKKYSKHRYEIKDMLEMENKLLEALDYYLVIYHPYRPLIQFLQDAGLWDLKVTANALVNDTYRTDLILTYPPYMIALACIYFACIMEEKDAQAVFEELRVDMNEIKNISMEIVDYYDMYRVIPDEKMNSALNKLPHRF 148 894
402 周期素D MAPALSSSYECLSHLLCAEDASNVVGCWDEDESKIFCEEEEGFGIQHFPDFPVPDDDEIRVLVRKKSQYMPGKSYVQSYQNLQLDFTARQNAIGWILKVHGSYNFGPLTAYLSINYLDRFLSRNPLPKAKVWMLQLLSVACLSLAAKMEETQVPLLLDLQAEEPDFLFEPRTIQRMELLVLSTLEWRMLSSVTPFSFVDYFLQGGGGRKPPPRAMARANELIFNTHTVLDFLEHRPSAIAAAAVICAAEEVLPLEAAQYKETILSCSLVDKENVFGSYNLIQEVLIEKFSTPKKAKSASSSIPQSPVGVLDAPCLSNNSNNTSLEASLSVNLYASVAAKRRKLNDYCNTWRNFQHSTC 287 1363
403 周期素D MAPNCIDCAPSDLFCAEDAFGVVENGDAETGSLYGDEDQLHYNLDICDQRQEHLNDDGELVAFAEKETLYVPNPVEKNSAEAKARQDAVDWILKVHAHYGFGPVTAVLSINYLDRPSANQLQQDKPWMMTQLAAVACLSLAAKMDETEVPLLLDFQVEEAKYIFESRTIQRMELLVLSTLEWRMSPVTPLSYIDHASRMIGLENHHCWIFTMRCKEILLNTLRDAKFLGLLPSVVAAAIMLHVIKETELVNPCEYENRLLSAMKVNKDMCERCIGLLIAPESSSLGSFSLGLKRKSSTINIPVPGSPDGVLDATFSCSSSSCGSGQSTPGSYDSNNSSILCISPAVIKKRKLNYEFCSDLHCLED 251 1348
404 周期素依赖性激酶调控亚单位 MPQIQYSEKYTDDTYEYRHVVLPPETAKLLPKNRLLNENENRAIGVQQSRGWVHYAIHRPEPHIMLFRRPLNYQQNQQQQAGAQSQPNGLKAQ 229 510
405 周期素依赖性激酶调控亚单位 MDQIEYSERYYDDTYEYRHVELPPDVARLLPKNRLLTENEWRGIGVQQSRGWVHYAIHCSEPHIMLFRRPLNYEQHQHPEPHIMLFRRRPLNCQPNHQPQAHHPT 92 409
406 周期素依赖性激酶调控亚单位 NDQIEYSEKYYDDTYEYRHVELPPDVARLLPKNRLLTENEWRGIGVQQSRGWVHYAIHCSEPHIMLFRRPLNYEQNHQHPEPHIMLFRRPLNCQPNHQPQAHHPT 64 381
407 周期素依赖性激酶调控亚单位 MPQIQYSKKYYDDTYEYRHVVLPPDVARLLPKNRLLNKNEWRGIGVQQSRGWVHYAIHRPEPHIMLFRRHLNYQQNQQQQAQQQPAQAMGLQA 69 349
408 组蛋白乙酰基转移酶 MALVETEPVTLLHPEEPKKFKKKPTPGRGGVISHGLTEEEARVKAIAEIVGAMVEGCRKGEDVDLNALKAAACRRYGLSRAPKLVEMLAALPDGERAAVLPKLKAKPVRTASGIAVVAVMSKPHRCPHIATTGNICVYCPGGPDSDFEYSTQSYTGYEPTSMRAIRARYNPYVQTRSRIDQLKRLGHTVDKVEFILMGGTFMSLPADYRDYFIRNLHDALSGHTSSNVEEAVCYSEHSATKCIGLTIETRPDYCLGPHLRQMLSYGCTRLEIGVQSTYEDVARDTNRGRTVAAVADCFCLAKDAGFKVVAHMMPDLPNVGVERDMESFREFFENPAFRADGLKIYPTLVIRGTGLYELWKTGRYRHYPPEQLVDIIARVLALVPPWTRVYRVQRDIPMPLVTSGVEKGNLRELALARNDDLGLKCRDVRTREAGIQDIHHKIRPEVVELVRRDYCANEGWETFLSYEDTRQDILVGLLRLRKCGNNTTCPELEGRCSIVRELHVYGTAVPVHGRDADKLQRQGYGTLLMEQAERIANKEHRSIKIAVISGVGTRHYYRKLGYELEGPYMMKYLN 125 1849
409 组蛋白乙酰基转移酶 MLGFRDLYTSICEHLQRASGRLPIIAAATSLISTPEIAAYEKENKAPNSVDKMGMGSADESGRFSTSNGQFMNHNNGVVKEENKGGVPVVPSAPTTVPVITNVKLETPSSPDHDMARKRKLGFLPLEVGTRVLCKWRDGKFHPVKIIERRKLPNGATNDYEYYVHYTEFNRRLDEWVKLSQLELDSVETDADEKVDDKAGSLKMTRHQKRKIDETHVEGNEELDAASLREHEEFTKVKNITKIELGRYEIETWYFSPFPSEYNNCEKLYFCEFCLNFMKRKKQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYAQNLCYLAKLPLDHKTLYYDVKLFLFYILCECDERGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLISFSYELSKKEGKVGTPERPLSDLGLLSYRGYWTRVLLDILKKHKSNISIKELSDMTAIKADDVLSTLQGLDLIQYRKGQHAICADPKVLDRHLKAVGRGGLEVDVCKLIWTPYEEQ 70 1602
410 组蛋白乙酰基转移酶 MGSLDESTCSEEIRDEGKDSIRTKFKVESTVNNAQNGGNDNSKKKRAAGLPLEVGIRLLCKWRDSKLHPVKIIERRKLPNGFPQDVEYYVHYTEFNRRLDEWVKLEDFELDSVETDADEKIEDKGGSLKMTRHQKRKIDEIHVEEGQGHEDFDPASLREHEEFTKVKNIAKVELGRYEIETNYFSPFPPEYSHCEKLFFCEFCLNFMKRKEQLQRHHRKCDLKHPPGDEIYRNGTLSMFEVDGKKNKIYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHVVGYFSKEKHSDEAYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGIPERPLSDLGLLSYRGYWTRILLDILKKQRGNISIKELSDMTAIKVEDVISTLQVLDLIQYRKGQHVICADPKVLDRHLKAAGIAGLEVDVSKLIWTPYKEQCG 140   1465
411 组蛋白乙酰基转移酶 MASAPMVGCDDSRDKHRNVESKVYMRKGRGKGSKGNAGFNAQNSTAQVRRENDNMGNSIADNGKSEAASEGLSSLSRKQITVNQDHPPHETSSMPAVGGLQNIDTHVTFKLEGCSKQEIWELRKKLTNELEQVRGTFKKLEARELQLRGYSVSAGVNTSYSASQFSGNDMRNNGGREVTSEVASGGAITPKQAQRESNPPRQLSISLMENNQAASDMGEKGKRTPKANQYYRHSEFVLGKDKFPPAESKKSKSTGNKKISQSKVFSLETMQVGKEFMPQKSVNEYFKQCSLLLTKLMKHKYGWVFNLPVDAQALGLHDYHTIIKRPMDLGTVKSKLEKNLYNSPASFAEDVKLTFSNAMTYNPKGHEVMTMAEQLLQLFEERWRTIYEEHLDGKMHFGSGQGLGASSSTKKLPPQDSKKNIKKSEPAGGPSPPKPKSTNHHASRTPSAKKPKAKDPHKRDMTYEEKQKLSTNLQNLPQERLELIVQIIKKRNPSLCQHDEEIEVDIDSFDTETLWELDRFVTNYKKSLSKNKKKKALLDQAKRASEHGSARNKHRMIGRELPMNNKKGEQGEKVVEIDHMPPVNPPVVEVEKDGVYAKRSSSSSSSSSDSGSSSSDSDSGSSSGSLSDAYAATSPPAGSNTSARG 628   2565
412 组蛋白乙酰基转移酶 MEGHSGALGPGQGFSRSSQSPNLSPSPSHSASASVTSSGQKRKRNEVEHAGVASNSTGMFAVPPSHIYSHLHPMSMSMPMPMHNSHPSSLSESRDGALTSNDDDDNLTGGNQSQLDSMSAGNTDGREDFDDEDDDDDDEEDDDEVEGDEEDQDHDPDADDDSDDGRDSMRTFTAARLLDNGAPNSRNLKPKADAGVAIAPIVKTKPILDTVKEEKVSGNNNNNSVSANNAQVAPSGSAVLSAVKEEANKPTSTDHIQTTSGAYCARERSLKREEDADRLKFVCFGNDGIDQHMINLIGLKNIFARQLPNMPKEYIVRLVMDRSHKSVMIIKQNQVVGGITYRPYLSQKFGEIAFCAITADEQVKGYGTRLMNHLKQHARDVDGLTHFLTYADNNAVGYFIKQDFTKEIKLEKERWHGYIKDYDGGILMECKIDPKLPYTDLPANIRWQRQTIDEKIRELSNCHIVYSGIDIQKKEAGIPRKPIKVEDIPGLKEAGWTTDQWGHSSRFRLLNSPSEGPNRQVLHAFMRSLHKAMVEHADAWPPKEPVDPRDVPDYYDIIKDPMDVKRMFTNARTYNTHETIYYKCANR 55 1818
413 组蛋白去乙酰基转移酶 NEESGNSLTSGPDGSKRRVSYFYDSDIGNYYYSQGHPMKPHRIRMAHSLIVHYALDEKMEVCRPNLLQSRELRVFHADDYISFLQSVTPETQHEQLRQLKRFNVGEDCPVPDGLYNFCQTYAGGSVGAAIKLNNKEADIAINWSGGLHHAKKCEASGFCYVHDIVLAILELLKVHQRVLYIDIDIHHGDGVEEAFYSTDRVMSVSFHKFGDIFPGTGHLKDVGYGKGKYYSLNVPLNDGIDDESYENLFRPIIQKVKEIYQPEAVVLQCGADSLSGDRLGCFNLSVKGHADCVRFLRSFNVPLVLVGGGGYTIRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLHVAPSNMENQNSAKELAKIRNTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQNRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKNPLGEAG 259 1710
414 组蛋白去乙酰基转移酶 MEESGNSLTSGPDGSKRRVSYFYDBDIGNYYYSQGHPMKPHRIRHAHSLIVHYALDEKMEVCRPNLLQSRELRVFHADDYISFLQSVTPETQHEQLRQLKRFNVGEDCPVFDGLYNFCQTYAGGSVGAAIKLNNKEADIAINWSGGLHHAKKCEASGFCYVNDIVLAILELLKVHQRVLYIDIDIHHGDGVESAFYSTDRVMSVSFHKFGDYFPGTGHLKDVGYGKGKYYSLNVPLNDGIDDESYKNLFRPIIQKVMEIYQPEAVVLQCGADSLSGDRLGCFNLSVKGNADCVRFLRSFNVPLVLVGGGGYTIRNVARCWCYETAVAVGVEPQDKLPYNEYYLYFGPDYTLBVAPSNMENQNSAKELAKIRNTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQNRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKWPLGEAG 356 1807
415 组蛋白去乙酰基转移酶 HEFWGVEVKPGLALTCDPGDERYLHMSQAAIGDKEGAKENERVSLYVHVDGKKFVLGTLSRGKCDQIGLDLVFEKEFKLSHTSQTGSVFVSGYTTVDHEALDGFPDDEDLESSEDEEEELAQITTLTAKENGGKTGAKPVKPESKSSVTDKAAAKGKPSVKPPVKKQEDDSDSDEDEDEDEDEDKDDDDEDDEDMRDASASDDGDEEDDSDEESDDDEEEDEETPKPAAGKKRPMPASDNKSPATDKKAKITTPAGGQKPGADKGKKTEHIATPYPKHGAKGPASGVKGKETPLGSKQTPGSKVKNSSTPESGKKSGQFKCQSCSRDFATEGALSSHNAAKHGGK 261 1298
416 组蛋白去乙酰基转移酶 MMETGGHSLPSGPQGVKRKVAYFYDPEVGNYYYGQGHPNKPHRIRMTHALLVQYGLHKENQILKPYPARDRDLCRFHADDYVAFLRGITPETIQDQVKALKRFNVGDDCPVFDGLYQYCQTYAGGSVGGAVKLNHKLCDIAINWAGGLHHAKKCEASGFCYVNDIVLAILELLKYHKRVLYVDIDIHHGDGVEEAFYTTDRVMTVSFHKFGDYFPGTGDIRDIGCGKGKYYAVNVPLDDGIDDESFQSLFKPIIQQVMLVYNPEAIVLQCGADSLSGDRLGCFNLSVKGHAECVRYHRSFNVPLLHVGGGGYTVRNVARCWCYETGVAVGVEIDDKMPQHEYYEYFGPDYTVHVAPSNMENKNTKQYLDKIRSKILENINSLPCAPSAQFQVQPPDTDFPELEEEDYDERTRSHKWDGASCDSDSENGDLKHRNHDVEESAFPRHNLANISYNTKIKLEGVGTGGLDMAAGTDTKKNDKSFEAMDYESGEELRQDHPASTINASQPCDPALLIGVQNQLQSTDTVKPIEQSGNAPGIPPPSVATVSTGTRPSSISRTSSLNSMSSVKQGSILGPNPPQGLNASGLQFPVPTSNSPIRQGGSYSITVQAPDKQGLQNHMKGPQNMPGNS 365 2251
417 组蛋白去乙酰基转移酶 MPPKDRVAYFVDGDVGSVYFGPNHPMKPHRLCMTHLVLSYELHKKMETIYRPHKAYPVELAQFHSADYVEFLHRITPDTQHLFPKELVKYNMGEDCPVFENLFEFCQIYAGGTIDAAHRLNNQICDIAINWSGGLHHAKKCEASGFCYINDLVLGILELLKHHARVLYVDIDVHHGDGVEEAFYFTDRVMTVSFHKYGDMFFPGTGDVKEVGEREGKYYAINVPLKDGIDDASFTRLFKTIITKVVDIYQPGAIVLQCGADSLAGDRLGCFNLSIDGHAQCVRIVKKFNLPLLYTGGGGYTKENVARCWSVETGVLLDTELPNEIPDNDYIKYFAFKYSLKINTAGNMENLNSKTYLSAIKVQVMENLRAIQNAFSVQNHEVPPDFYIPDIDEDELNPDERMDQHTQDRQIQRDDEYYDGDNDIDHDMEEAS 156 1454
418 组蛋白去乙酰基转移酶 MDSSKSEEANILHVFWHHEGMLNHDLGTGFDTLEDPGFLEVLEKHPENADRVNKMLSILRKGPIAPYTEWHTGRAAYISELYSFHRPDYVDMLAKTSTAGGKTLCHGTRLNPGSWEAALLAAGTTLEAMRYILDGHGKLSYELVRPPGHHAQPTQADGYCFLNHAGLAVELAVASGCKRVAVVDIDVHYGNGTAEGFYERDDVLTISLHMNHGSWGPSHPQTGFHDEVGRGKGLGFNLNVPLPRGTGDKGYENAMHELVVPAISKFMPEMIVLVIGQDSSAFDPNGRECLTMEGYRKIGQIMRQQADQFSGGRLVVVQEGGYHITYAAYCLHATLEGVLCLPHPLLSDPIAYYPEHDIYSERVTFIKNYNQGIISTTDKRN 203 1348
419 组蛋白去乙酰基转移酶 MEESGNALVSGPDGSKRRVTYFYDADIGNYYYGQGHPMKPHRMRMRHNLIVNYGLHQRMEVCRPHLAQSKDIRAFHTDDYIHFLSSVAPDTQQEQLRQLKRPNVGEDCPVFDGLFNFCQSSAGGSIGAALKLNRKDADIAINNAGGLHHAKKCEASGFCYVNDIVLGILELKVHQRVLYIDIDIHHGDGVELAFYTTDRVMTVSFHKFGDYFPGGTGHIKDVGYGKGLYYALNVPLNDGIDDESYRHLFRPIIQKVMEVYQPEAVVLQCGADSLSGDRLGCFNLSVKGHADCVRFVRSFNIPLMLVGGGGYTIRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLYVAPSNMENLNTEKDLEKMRNVLLEQLSKIQHTPSVPFQERPPDTEFNDEEEEDMEKRSKCRIWDGEYVGSEPEEDGKLPRFDADTYERSVLKHENKRLVPVSNVEPLKRIKQEEDGAAV 229 1644
420 组蛋白去乙酰基转移酶 MPPKDRVAYFYDGVGSVYFGPNHPMKPHRLCMTHHLVLSYELHKKKMEIYRPHKAYPVELAQFHSADYVEFLHRITPDTQHLFTKELVKYNMGEDCPVFENLFEFCQIYAGGTIDAARRLNNQICDIAINWSGGLHHAKKCEASGFCYINDLVLGILELLKHHARVLYVDIDVHHGDGVEEAFYFTDRVMTVSFHKYGDMFPPGTGDVKEVGEREGKYYAINVPLKDGIDDASFTRLFKTIITKVVDIYQPGAIVLQCGADSLAGDRLGCFNLSIDGHAQCVRIVKKFNLPLLVTGGGGYTKENVARCWSVETGVLLDTELPNEIPDNDYIKYFAPDYSLKINTAGNMENLNSKTYLSAIKVQVMENLRAIQHAPSVQMHRVPPDFYIPDIDEDELNPDERMDQHTQDRQIQRDDEYYDGDNQIDHDNKEAS 156 1454
421 组蛋白去乙酰基转移酶 MDLNLVSHGEEEEGVRRRKVGIVYDERMCKHATPEDQPHPEQPDRIRVIWDKLNSAGVLNKCVNVEBKEASEZQLAGVHSRKHIEVMKSIGTARYNKKKRDKLAASYSSIYFSQGSSKAALLAAGSVVEISEKVASGILDAGVAIVRPPGHHAEADKAMGFCLFNNIAIAAKHLVHERPELGVQEVLIVDWDVHHGNGTQHMFWTDPRVLYFSVHRFDAGTFYPGGDDGFYDKIGEGKGAGYNINVPNEQGKCGDADYLAVWDHVLYPVAKSYDPDMVLISGGFDAALGDFLGGCRLTPYGYSLMTKKLMEFAGGKIVLALEGGYNLKSLADSFLACVEALLKDGPSRSSVLTHPFGSTNRVIQAVRKELSSFNPALNEELQLPRLLKDASESPDKLSSSSSDESSASEDEKKIAEVTSIMEVSPDPSSILALTAEDLAQPLAGLKIEEAGTDSQRSSDHTLLDLTKDDTQKLKQFEGEIFVMIGDEESVPSASSSKDQNESTVVLSKSNIKAHSWRLTFSSIYVWYASYGSNMWNPRFLCYIEGGQVEGMAKRCCGSEDKTPPQRIQWRVVPHRMFFGRSYTNTWGSGGVSFLDPNCSDTSEAHVCLYKITLAQFNDLLLQENNLNCGTEHPLVDLSSIDAIRNGNSILELIKDSWYGTLIYLGMEGGLPIVTFTCSVCDVEKFKHGQLPLCPPSSRYENILIRGLVQGKKLSEDDATAYIRAASTSPLL 27 2222
422 肽基脯氨酰基异构酶 MADEDLDLSDVGEVEDEPGEEIESTPPLAVGQEKEINSLALKKKLLKVGTRWETPENGDKVTVHYTGTLPDGTKFDSSRDRGEPFTFKLGQGCVIDGWDQGIVTMKKGERALFTIPPELAYGSSGVRPTIPPHATLQFDVELLSWTNIVDVCNDGGILKRIISEGEKYERPKDPDEVTVKYEAKLEDGTLVAKSPEEGVEFYVNDGHFCPAIAKAVKTMKRGESVILTIKPTYAFGERGKDAEBGFAAIPPNATLTTSLELVSFKAVIAVTEDKKVIKKILKEADGYDKPSDGTVVQIRYTAKLQDGTIFEKKGYEGEEPPQFVVDEEQVIAGLDKAVETMKTGEIALITIGAEYGFGNFETQRDLAVIPPNSTLIYEVEMISFTKEKISWDMDTTEKIEASKQKKEQGNSLFKVGKYQRAAKKYEKAAKYIEHDSSFSAEEKKQSKVLKVSCNLNHAACRLKLKDFKEAVKLCSKVLELESQNVEALYRRAQAYIETADLDLAEFDIKKALEIEPQNREVQLEYKILKQKQIEYNKNDAKLYGNMPAKLNKLEAFEGKVLS 71 1759
423 肽基脯氨酰基异构酶 MADEGLELSDYABVBDEPGEEFESAPPLVVGQEKELNSSGLKKKLLKNGTRCETPENGDEVTVHYTGTLLDGTKFDSSRDRGEPFTYNIGQGQVIKGWDQGIVTMKKREHALFTIPPELAYGASGMPPTIPPNATLQFDVELLSWTNIVDVCKDGGILKRIISDGEKYERPKDPDEVTVKIEAKLEDGMLVAKSPEEGVEFYVNDGNFCPAIVKAVKTMKKGKNVTLTIKPAYAFGEQGKDAEEGFAAIPPNATITINLQLVSFRAVKEVTEDKKVIKKILKEADGYDEPSDGTVVQIRYTAKLQDGTIFEKKGYAGEEPFQFVVDEEQVIAGLDKAVETMKTGEVALITIGPEYGFGNIETQRDLAVIPPYSTLIYEVEMVSFTKEKECWDMNTTENIEASKQKKEQGNSLFKVGKYLRAAKKYDKAAKYIEHDNSFSAEEKKQSKVLKVSCNLNHAACCLKLKDFKKAVKLCSKVLELESQNVKALYRRAQAYIETADLDLAEFDIKKALEIEPQNREVRLEYLILKQKQIEYNKKDAKLYGNMFARQWKLEAIEGKD 358 2040
424 肽基脯氨酰基异构酶 MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSFERVIPGEHCQGGDFTRGNGTGGESIYGEKFADENFVKEHTGPGILSMANAGPWTNGSQFFICTAQTSWLDGKHVVPGQVVEGLEVVRDIEKVGSGSGHTSKPVVIADSGQLA 238 756
425 肽基脯氨酰基异构酶 MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGNGRDHKPLHFKGSSFHRVIPGFMCQGGDFTRGNGTGGESYYGKKFADENFVKKHTGPGILSMANAGPNTNGSQFFICTAQTSWLDGKHVVFGQVVKGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA 285 803
426 肽基脯氨酰基异构酶 MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSFHRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENPVKKHTGPGILSMAMAGPNTNGSQFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA 190 708
427 肽基脯氨酰基异构酶 MPNPKVFFDMQVGGAPAGRIVNELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSFHRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFICTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA 156 674
428 肽基脯氨酰基异构酶 MADDFELPESAGMMENEDPGDTVFKVGEEKEIGKQGLKKLLVKEGGSWETPETGDEVEVHYTGTLLDGTKPDSSRDRGTPFKFKLGQGQVIKGNDQGIATMKKGENAVFTIPPDLAYGESGSQPTIPPNATLKFDVELLSWASVKDICKDGGIFKKIIKEGEKNEHPKEADEVLVKYEARLEDGTVVSKSEEGVEFYVKDGYFCPAFAIAVKTMKKGEKVLLTVKPQYGFGHQGREAIGNDVARSTNATLLVDLELVSWKVVQEVTDDKKVLKKILKQGEGYERPNDGAVVKVKYTGKLEDGTIFEEKGSDEEPFEFMAGEEQVVDGLDRAVMTMKKGEVALVSVAAEYGYQTEIKTDLAVVPPKSTLIYEVELVSFVKEKESWDMRTAEKIEAAGKKKEEGNALFKVGKYFRASKKYEKATKYIEYDTSFSEEEKKQSKPLKVTCNLNNAACKLKLKDYTQAEKLCTKVLEVESQNVKALYRRAQAYIQTADLELAELDIKKALEIDPNMRDVKLEYRALKEKQKEYNKKEAKFYGNMFARMSKLEELESRKSGSQKVETANKEEGSDAMAVDGESA 176 191
429 肽基脯氨酰基异构酶 MAASLTPLGAGLAYATIYDQRKVRKLIPTKRSLIALCQHSDSQHRRFITRKYHVNVQILNRRDAIRLIGLAAGLCIDLSLNYDARGAGLPPQENAKLCDTTCEKELENAPMITTESGLQYKDIKIGNGPSPPIGPQVAANYVAMVPSGQVFDSSLDKGQPYIFRVGSGQVTKGLDEGLLSNXVGGKRRLYIPGPLAFPKGLNSAPGRPRVAPSSPVIFDVSLEFIPGLESEEE 64 765
430 肽基脯氨酰基异构酶 MSAASLSADMAIRGTILGKTALHVLGPQVVSQCRQPVMFKCPPHTLRKMRFSAQDLQSKNFYSGFTPFKSVFISTSKRSWQAGSARAMSQDAAFQSKVTTKCFLDIEIGGDPAGRIVLGLFGEDVPKTAENFRALCTGEKGFGYKGSSFNRIIKDFMLQGGDFDRGDGTGGKSIYGRTFEDENFKLAHVGPGVLSHANAGPNTNGSQFFICTVKTPWLDKRHVVFGQVIEGNEIVKKLESEETNRTDRPKRPCRIVDCGELP 93 881
431 肽基脯氨酰基异构酶 MGRIKPQTLLQQSKKKKVPGRISVSTIIVCNLIIIFLMFSLVGIYRQRAKRNRATSRSDGDEEMENFGRSKINSVPHQAIVNTTKGLITLELFGKSSAHTVEKFVEWSERGYFNGLPFYRVIKHFVIQVGDPKFAGNREDWTVGGQLNVQLEFSPKHEAFMLGTSKLEDQGDGFELFITTAPIPDLNDRLNVFGHVIKGQDVVQEIEEVDTDEHFQPKSPIIINDVRLKDEL 372 1070
432 肽基脯氨酰基异构酶 MARQSTLLLFWSLVFLGAIVFTQAKHEELEEVTHKVYFDVDIAGKPAGRVVIGLPGKAVPKTVENFRALCTGEKGVGKSGKPLHYKGSFFHRIIPSFMIQGGDFTLGDGRGGESIYGTKFADENFKLKHTGPVFITTVTTDWLDGRHVVFGKIISGMDVVYKVEAEGRQSGQPKRKQKIADSGELSMD 28 594
433 肽基脯氨酰基异构酶 MARQSTLLLFWSLVFLGAIVFTQAKHEELEEVTHKVYFDVDIAGKPAGRVVIGLFGKAVPKTVENFRALCTGEKGVGKSGKPLHYKGSFFHRIIPSFMIQGGDFTLGDGRGGESIYGTKFADENFKLKHTGPGFLSHANAGPDTNGSQFFITTVTTDWLDGRHVVFGKIISGQGMDVVYKVEAEGRQSGQPKRKVKIADSGELSMD 34 648
434 肽基脯氨酰基异构酶 MEMDEIQEQSQPQSSEKQDISQESDTGNDKTINAEKITSENAEVEEDDMLPPKVNTEVEVLHDKVTKQIIKEGSGNKPSRNSTCFLHYRAWAESTNHKFQDTWQEQQPLELVLGREKKELSGEAIGVAGMKAGERALLHVDWQLGYGEEGNFSFFNYPPRANLIYEAELIGFEEAKEGKARSDMTVEERIEAADRRRQQGNELFEEDKLAEAMQQYEMALAYHGDDFMFQLFGKYKDMANAVKNPCHLNMAQCLLKLNRYEEAIGQCNMVLAEDEKNIKALFRRGKARATLGQTDDAREDFQKVRKFSPEDKAVIRELRLLAEHDKQVYQKQKEMFKGLFGQKPEQKPKKLHWFVVFWQWLLSMIRTIFRMRSKTD 481 1611
435 肽基脯氨酰基异构酶 MAGAGEGTPEVTLETSMGPITVELYHKHAPKTCRNFLELSRRGYYNNVKFHRVIKDFMVQGGDPTGTGRGGESIYGPRFEDEITRDLKHTGAGILSMANAGPNTNGSQFFISLAPTPWLDEKHTIFGRVCKGMDVVKRLGNVQTDKNDRPIHDVKILRTTVKD 93 584
436 肽基脯氨酰基异构酶 NMDPELMRLAQEQMSKISPDELMKMQRQIMANPDLMRMASENMKNLKPEDIRFAAEQMKNVRKEEMAEISERISRASPEEIEAMKARANLQSAYQLQVAQNLKDQGNQLHARNKYSEAAEKYLQARNNLTGIPFSEAKSLLLASSSNLMSCYLKTGQYEECVQTGSEVLAYDAMNVKALYRRGQAYKQIGKLELAVADLRKAVEVSPEDETIAQALREASTELNEKGGTQDQNGPRIEEIIEEEAVQPTAEKYPQSAPMVTSVTEDVSDDEQGSEDQNGFSRDSFQATNAPDGQMYAISLRNLTENFDMLRTNQSLMTNVDPDSLVALSGGKLSPDMVKTVSGMFGRMSPEEIQNNHKMSSTLSRQNPSTSSRFDDITRGHSNMDSSPQSVSVDNDLFEENQNRVGESSTMLSSSAAFSGMPNFSAEMQEQVRNQMNDPATRQMFTSMIQNMSPENMASMSIQFGVKLSPEDAVKAQNAMASLSPNDLDRLMNIATRLQTAIDYARKIKNWILGRPGLIFAISMLLLAIILHRFGYIGD 250 1869
437 肽基脯氨酰基异构酶 MGVEKEILRPGNGPKPRPGQSVTVHCTGYGKNEDLSQKFWSTKDPGQKPFTFTIGQGRVIKGWDEGVLDMQLGEIFKLRCSPDYGYGSMGFPAWGIRPNSVLVFEIEVLSVN 84 422
438 肽基脯氨酰基异构酶 MPHPRCYLDITIGEELEGRILVELYSDVVPKTAENFRALCTGEKGIGPHTGVPLHYRGLPFHRVIKGFMIQGGDISAQNGTGGESIYGLKFDDENFQLKHERRGMLSMANSGPNTNGSQFFITTTRTSHLDGKHVVFGKVIKGMGVVRGIEHTPTESNDRPSLDVVISDCGEIPEGSDDGIANFFKDGDLYPDWPADLDEKSAEISWWMNAVDSAKCPGNENYKKGDYKMALRKYRKALRYLDICWEKEEIDEEKSNHLRKTKSQIFTNSSACKLKIGDLKGALLDTEFANRDGEDNVKALFRQGQAYMALKDVDSAVASFKKALQLCPNDAGIRKELAVATKMINDRRDQERRAYARMFQ 128 1213
439 肽基脯氨酰基异构酶 MGDVIDLNGDGGVLKTIIRSAKPGAMQPTEDLPNVDVHYEGTLADTGEVFDTTREDNTLFSFELGKGTVIKAWDIAVKTMKVGEVARITCKPEYAYGSAGSPPDIPENATLIFEVELVACKPRKGSTFGSVSDEKARLEELKKQREIAAASKEEEKKRREEAKATAAARVQAKLEAKKGQGRGRGKSKGK 265 837
440 肽基脯氨酰基异构酶 MGLGLKLASASFLPIFNIMATRSLCILLVCFIPVLAHVLSLQDPELGTVRVYFQTTYGDIEFGFFPHVAPKTVEHIYKLVRLGCYNSNHEERVDKGFVAQVADVVGGREVPLNSEQRKEGEKTIVGEFSEVKHVRGILSMGRYSDPDSASSSFSILLGNAPHLDGQYAVFGKVTKGDDTLKRLEEVPTRQEGIFVHPLERIRILSTYYYDTNERESNLTCDHEVSILKRRLVESAYEISYQRRKCLP 38 781
441 肽基脯氨酰基异构酶 MASXRSLRTMNVWPTLPPLVLLLLLCFSSMSSSVVAKKSDVSFLQIGVKHKPKSCDIQAHKGDRIKVHYRGSLTDGTVFDSSFERGDPIEFELGSGQVIKGNDQGLLGMCVGEKRKLRIPSKLGYGAQGSPPKIPGGATLIFDTELVAVNGKGISNDGDSDL 38 526
442 肽基脯氨酰基异构酶 MSGAPAERPISYFDITIGGKPIGRIVFSLYADLVPKTAENFRALCTGEKGIGKSGKPLCYAGSGFHRVIKGFMCQGGDFTAGNGTGGESIYGEKFEDEAFPVKHTKPELLSMANAGKDTNGSQFFITVSQTPHLDDKHVVFGEVIKGKSIVRAIENYPTASGDVPTSPIIISACGVLSPDDPSLAASEETIGDSYEDYPEDDDSDVQNPEVALDIARKIRELGNLKFKEGQIELALKKYLKSIRYLDVHPVLPDDSPPELKDSYDALLAPLLLNSALAALKTQPADAQTAVKNATRALERLELSDADKAKALYRRASAHVILKQEDEAEEDLVAASQLSPEDMAISSKLKEVKDEKKKKREKEKKAPKKMFSS 37 1158
443 肽基脯氨酰基异构酶 HASSLRSSLFSSWALDSKSVCSLFNLNPGKMGLPSISTPLNWRTCCCSHSSELLELNEGLQSSRRKTVMGLSTVIALSLVYCDEVGAVSTSKRALRSQKVPEDEYTTLPNGLKYYDLKVGSGTEAVKGSRVAVHYVAKWKGITFMTSRQGMGITQGTPYGFDVGASERGAVLKGLDLGVQGMRVGGQRILIVPPELAYGNTGIQEIPPNATLEFDVELISIKQSPFGSSVKIVEG 61 768
444 WD40重复蛋白 MGAIEDEEPPLKRLKVSSPGLRRGLEEEAPSLSVGSVSILMAKSLSLEEGETVGSKGLIRRVEFVRIITQALYSLGYQKAGALLEEESGILLQSSNVALFRKQILDGKWDESVVTLRGIDQVEVEGNTLKAASFLILQQKFFELLDKGNIPEAMKTLRLEISPMQLNTKRVHELASCIVFPSRCEELGYSKQGNPKSSQRMKVLQEIQQLLPPSIMIPEKRLERLVEQALNVQREACIFHNSLDPALSLYTDHDCGADQIPTTTLQVLESHKNEVWFLQFSNNGKYLASASKDCSAIIWEITEGDSFSMKHRLSAHQKPVSFVANSPDDKLLLTCGIEKVVKLWNVETGECKLTYDKANSGFTSCGWFPDGERFISGGVDKCIYIWDLEGKELDSNKGQGMPKISDLAVTSQGKEIISICGDNAIVMYNLDTKTERLIEEESGITSLCVSKDSRFLLLNLANQEIHLNDIGARSKLLLKYKGHRQGRYVIRSCFGGSDLAFVVSGSEDSQVYIWRRGNGELLAVLPGHSGTVNCVSWNPVNPHVFASASDDYYIRIWGVNENTFRSKNASSSNGVVHLANGGP 421 2172
445 WD40重复蛋白 MPGTTAGAGIEPIEPQSLKKLSLKSLKRSFDLFASLHGEPQPPDQRSQRIRIACKVRAEYEVVKNLPTLPQREVGSSVSNSNVGETHSSLTTNQAQGFPTDTSGDLSKDEGKKITSIAVHLQPQTGLIDGKAGAIAGTSTAISSVGSSDRYQPSAAIMKRLPSKWPRPIWHPPWKNYRVISGHLGWVRSVAFDPGNEWFCTGSADRTIKIWEVATGKLKLTLTGHIEQIRGLAVSSRHPYLESAGDDKQVKCNDLEYNKAIRSYHGHLSGVYCLALHPTLDILCTGGRQSVCRVWDIRTKAQIFALSGHENTVCSVFTQAIDPQVVTGSHDTTIKLWDLAAGKTMSTLTYHKKSVRAIAKRPFEHTFASASADNIKKFKLPKGEFLHNMLSQQKTIVNAMAINEDNVLVSAGDNGSLWFWDWKSGHNFQQAQTIVQPGSLDSEAGIYALQYDTTGSRLVSCEADKTIKMWKEDETATPESNPINFKAPKDIRRF 163 1647
446 WD40重复蛋白 MRPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGERLGTYRGHNGAVWCCDVSRDSTRLITSSADQTAKLNNVETGAQLFSFNFESPARAVDLAIGDKLVVITTDPFMELPSAIHIKRIEKDLSKQTADSVLTITGIKGRINRAVWGPLNSTIISGGEDSVVRIWDSETGKLLRESDKETGHQKPITSLCKSADGSHFLTGSLDKSARLNDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGGQEASHVTTTDRRAGKFEAKFFHKILEEEIGGVKGHFGPINSLAPNPDGRSPASGGEDGYVRLRHFDPDYFHIKM 192 1172
447 WD40重复蛋白 MRPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGEALGTYRGHNGAVWCCDVSRDSTRLITSSADQTAKLWNYETGNQLFSFNFESPARAVDLAIGDKLVVITTDPTMELPSAIHIKRIEKDLSKQTADSVLTITGIKGRINRAVNGPLNSTIISGGEDSVVRIWDSETGKLLRESDKFTGHQKAITSLCKSADGSHFLTGSLDKSARLWDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGGQEASHVTTTDRRAGKFEAKFFHKILEEEIGGVKGHFGPINSLAFNPDGRSFASGGEDGYVRLHHFDPDYFHIKN 131 1111
448 WD40重复蛋白 MAENNVGDFIVLDKQEYPSKPAPGAVDSSFWKSFKKKEVSRQIAGVTCINFCPEPPHDFAVTSSTRVHIYDGKSCELKKTITKFKDVAYSGVFRSDGQIIAAGGETGVIQVFNAKSQMVLRQLKGHGRPVRVVRYSPQDKLHLLSGGDDSNVKWWDITTQEELLNLEGHKDYVRCGAASPSSVNLWATGSYDHTVRLWDLRRSKTVLQLKHGKPLEDVLFFPSGGLLATAGGNVVKVWDILGGGRPIHTMETHQKTVMAMCISKVPRSGQALGDAPSRLVTASLDGYMKVFDLDHFKVTHSARYPAPILSMGTSSLCRTMAVGTSSGLLFIRQRKGQIEDKIHSDSSGLQVNPVNDEKDSAVLKPNQYRYYLRGRSEKPSEGDYVVKRMAKVYFQEYDKDLRHFNHSKALVSALKAADSKGTVAVIEELVARKRLIQTLSILNLDELELLINFLSRFILVPKYSRFLISLTDRVLDARAVDLGKSENLKKQIADLKGIVVQELRVQQSMQELQGIIEPLIRASAR 149 1726
449 WD40重复蛋白 KDVETSGKPTGNKRTYTRLPRQVCVFWQEGRCTRESCNFLHVDEPGSVKRGGATNGFAPKRSYNGSDERDTLAAGPPGGSRRNISARWGRGRGQIFISDERQKIRNKVCNYWLAGNCQRGEECKYLHSFVMGSDVKFLTQLSGHVKAIRGIAFPSDSGKLYSGGQDKKVIVWDCQTGQGTDIPLNDEVGCLMSEGPWIFVGLPNAVKAWHILTSTELSLVGPRGQVHALAVGNGMLFAGTHDGSILAWKFSPASNTFEPAASLVGHTQAVVSLVSGADRLYSGSMDKTIRVWDLGTFQCLQTLRDHTSVVMSLLCWDQFLLSCSLDNTVKVWVATSSGALEVTYTNNEEHGVLALCGNNDEQAKPVLLCSCNDNTVRLYDLPSFSERGRIFSRNEVRTFQIAPGGLFFTGDATGELKVWNWATQKS 948 2228
450 WD40重复蛋白 MSVQELRERHAAATAKVNALRERIKAKRLQLLDTDVATYASSNGRTPISFSFTDLVCCRTLQGHTGKVYSLDWTSEKNRIVSASQDGRLIVWNALTSQKTHAIKLPCAWVMTCAFSPSGQAVACGGLDSVCSTFQLNNQLDRDGRLPVSRILSGHRSYVSSCQYVPDGDTHVITGSGDRTCIQWDVTTGQRIAIFGGEFPLGHTADVMSVSISAANPKEFVSGSCDTTTRLWDTRLASRAIRTFKGHEADVNTVKFFPDGLRFGSGSDDGTCRLFDIRTGHQLQVYRQPPRENQSPTVTAIAFSPSGRLLFAGYSNGDCFVWDTILEKVVLNLGELQNTHNGRISCLSLSADGSALCTGSWDKNLKIWAFGGHRKIV 332 1465
451 WD40重复蛋白 MKVKIISRSTDEFTRERSNDLQRVPRNFDPNLHTQARAQEYVRALNAAKLDKIEAKPFLAAMSGHIDGISAMAKSPRHLKSIFSGSVDGDIRLWDIAARRTVQQFPGHRGAVRGLTVSTEGGRLISCGDDCTVRLWDIPVAGIGESSYGSENVQKPLATYVGKNSFRAVDYQWDSNVFATGGAQVDIWDHDRSEPTNSFAWGSDTVISVRFNPAEKDIFATTASDRSIVLYDLRMASPLNKLIMQTRNNAIAWNPREPMNFTAANEDCNCYSYDMRRMNISTCVHQDHVSAVMDIDYSPSGREFVTGSYDRTVRIFPYNAGHSREIYHTKRMQRVFCVKPSGDATYVVSGSDDANIRLNKAKASEQLGVLLPRERKRHEYLDAVKERFKHLPEIKRIERHRRLPKPIYKAALLRHTVNAAAKRKEERKRAHSAPGSVVTNPLRKKRIVAQLE 232 1590
452 WD40重复蛋白 MDHYYQDDFDYLVDDEMVDFADDVEDDVRTRERSDIDSDSENDFDLNNKSPDTTALQAKRGKDIQGIPWNRLNFTREKYRETRLQQYKNYENLPRPRRSRNLDKECTNFERGSSFYDFRHNTRSVKATIVHFQLRNLVWATSKHNVYLMQNYSIMHWSSLKQKGEEVLNVAGPIVPSVKHPGSSPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDKPGVSFCTKISHDENGITNAVEIYNDASGATRLMTANNDLAVRVFDTEKFTVLERFSFPWSVNHTSVSPDGKLVAVLGDNADCLLADCKTGKTVGTLRGHLDYSFAAAWHPDGYILATGNQDTTCRLWDVRKLSSSLAVLKGRMGAIRSIRFSSDGREMAMAEPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDTEAFFVGVADRTYGSLLEFNRRRMNYYLDSIL 207 1550
453 WD40重复蛋白 MAEALVLRGTMEGHTDAVTAIATPIDNSDNIVSSSRDKSILLWNLTKEPEKYGVPRRRLTGHSHFVQDVVISSDGQFALSGSWDSELRLWDLNTGLTTRRFVGRTKDVLSVAFSIDNRQIVSASRDRTIKLWNTLGECKYTIQPDAEGHSNWISCVRFSPSATNPTIVSCSWDRTVKVWNLTNCKLRNTLVGHGGYVNTAAVSPDGSLCASGGKDGVTMLWDLAEGKRLYSLDAGDIIYALCFSPNRYWLCAATQQCVKIWDLESKSIVADLRPQFIYNKKAQIPYCTSLSWSADGSTLFSGYTDGKIRVWGIGHV 221 1171
454 WD40重复蛋白 HAAIKSTSRSASVAFAPDAPLLAAGTMAGAIDLSFSSLANLEIFKLDFQSDDPELPVVGECPSNERLNRLSWGSAGGSFGIIAGGLVDGTINIWNPATLINSEDNGDALIARLEQHTGPVRGLEFNTISTNLLASGAEDGELCIWDLANPTAPTHPPPLKGVGSGAQGKISFLAMNRKVQHILASTSYSGTTVVWDLRRQKPIISFPDATRRRCSVLQWNPDASTQLIVASDDDNSPTLRAWDLRNTISPYKEFVGHSRGVYAMSWCPSDSLFLLTCAKDNRTLCWDTGSGIIVCELPAGANWNFDVQMSPKIPGILSTSSFDGKIGIHNIEACSRNVSGEVEFGGAIVRGGPSALLKAPKWLERPAGVSFGFGGKLASFRPSTVAQAADHRHSEVFIHNLVTEDNLVIRSTEFEAAIADGEKVSLRALCDRKAEESQSDEEKETWNFLRVMFEDEGTARTKLLEHLGFKVQSEENGDLQETHSSKIDDIGSEIGKTLTLDDKTEEDVLPQLKGGQDAAIPQDNGEDFFDNLHSPKEEVSLSHVGNDFVGEKDKDMVVNGAEIEHETEDLTEYSDNNEAIQHSLVVGDYKGAVLQCLSANRMADALIIAHLGGNSLWEKTRDEYLKKAKSSYLKVVSAMVNNDLTGLVNSRPLKSWKETLAMLCTYSQREEWTVLCDMLASRLIAAGNVMAATLCYICAGNIEKTVEIWSRSLKYDYDGRSFVDHLQDVMEKTVVLALATGQKRVSPSLSKLVENYAELLASQGLLTTAMEYLKLLGTEESSHELSILRDRLYLSGTDNKVEASSFPFETRQDLTKSQYNMHQTGFGAPETQKNYQENVHQVLPSGSYTDNYQPTANTHYIAGYQPAPQQQPSFQNYFTPASYQPAPSPNVFYPSQVSQAEQSNFAPPVNQPPMKTFVPSTPPILRNVDQYQTPSLNPQLYQGVSSATVETHPYQTGAPASVSVGTTPGQPSVVPNFMVPGPVTAPTVTPRGFMPVTTPTQHPLGSANPPVQPQSPQSSQVQSVTAATTPPPTTQNVDTSNVAAEIRPVIGTLRRLYDETSEALGGARANPAKRREIEDNSRKIGSLFAKLNSGDISSNAASKLVHLCQALESRDYATAFQIQVGLTTSDWDECSFWLAALKRMIKVKQNMR   221   3679
455 WD40重复蛋白 MAGAADSQLQTLSERDSTPNFKNLHTREYAAHKKKYHSVAWNCTGTKLASGSVDQTARVWNIEPHGRSKTKDLELKGHADSVDQLCWDPKHSELLATASGDRTVRLWDARSGKCSQQVELSGENINITFKPDGTHIAVGNRDDELTIIDVRKFKPLHKRKFSYEVNEIAWNTTGELFFLTTGNGTVEVLSYPSLQVLHTLVAHTAGCYCIAIDPIGRYFAVGSADALVSLWDLSRMLCVRTFTKLEWPVRTISFNHDGQYIASASEDLFIDIADVQTGRTYHQISCRAAMNSVEWNPKYNLLAFAGDDKNKYMQDEGVFRVFGFETP 269 1252
456 WD40重复蛋白 NAATSPVGAGSGRELANPPTDGISNIRFSNHSDHLLVSSWDRKVRLYDASANSLKGQFVHGGPVLDCCFHDDASGFSGSADNTVRRYDFSTRKEDILGRHEAPVRCVEYSYAAGQVITGSWDKTLKCWDPRGASGQEKTLVGTYSQLERVYSMSLVGRRLVVATAGRHINVYDLRNMSQPEQRRESSLKYQTRCVRCYPNGTGFALSSVEGRVAMEFFDLSEAGQAKKYAPKCHRKSEAGRDTVYPVNAIAFHPIYGTFATGGCDGYVNVWDGNNKKRLYQYSKYPTSIAALSFSRDGRLLAVASSYTFEEGEKPHEPDAVFVRSVNEAEVKPKPKVYAAPP 214 1242
457 WD40重复蛋白 MASDDEEGFXNEEAPGVVDEAEVQEGLRACFPLSFGRQKKKQAPLESIHSATKRPEDPRPRRQLGPPRPPPSILAEQEDSDRFVGPPRPPQFVRDDNDDGEAEIMIGPPRPPAQYSDDNDNEETIGPPKPSYLEKGEETDQMVGPSKRGSDQETSGDSDDGDDAVDFRVPLSNEIVLRGHTKVVSALAIDQTGSRVLTGSYDYSVRMYDFQGMTSQLKSFRQLEPAEGHQVRSLSWSPTSDRFLCVTGSAQAKIFDRDGLTLGEFVKGDMYLRDLKNTKGHTSGLTCGENHPKEKQTILTCSEDGSLRIWDVNDFNTQKQVIKPKLAKPGRVPVTACAWGRDGKCIAGGVGDGSIQVWNLKPGNGSRPDLYVAKGHDDDITGLQFSADGNILLTRSTDETLKVWDLRKAITPLQVFRDLPNNYAQTNVAFSPDERLIFTGTSVERDGNSGGLLCFYDRQTLELVLRIGVSPVHSVVRCTWHPRHNQVFATVGDKKEGGAHILYDPALSERGALVCVARAPRKKSLDDFEAKPVIHNPRALPLFRDEPSRKRQREKARMDPMKSQRPDLPVTGPGFGGRVGSTKGSLLTQYLLKEGGLIKETWMEEDPREAILKYADVAAKDPKFIAPAYAQTQPETVFAETDSEEEQK 119 2065
458 WD40重复蛋白 MKERGQSHAGQPSVDERYTQWKSLVPVLYDWLANHHLVWPSLSCRWGPQMHQATYKNSQRLYLSEQTDGTVPNTLVIATCSVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRURELPQNSKIVATHTDGPDVLIHDVDTQPNRQATLGAADSRPDLVLTGHKDNAEFALAMSPSAPFVLSGGKDKCVLLWSIQDHISAATEPSSAKASKTPSSAHGEKVPKIPSIGPRGVYKGHKDTVEDVQFCPSNAQEFCSVGDDSALILWDARNGNEPVIKVEKARNADLHCVDWNPHDENLILTGSADNSVRMFDRRNLTSSGVGSPVHKFEGHSAPVLCVQWCPDKASVFGSAAEDSYLNVWDYEKVGKNVGKKTPPGLFFQHAGHRDKVVDFHWNSFDPWTIVSVSDDGESTGGGGTLQIWRMSDLIYRPEDEVLAELERFRAHTLSCQNK 186 1550
  459 WD40重复蛋白 MSSLSRELVFLILQFLDEEKFKESVHKLEQESGFFFNMKYFDEKAQAGEWDEVERYLSGFTKVDDNRYSMKIFFEIRKQKYLEALDRQDRAKAYDILVKDLKVFSTFNEELYKEITQLLTLDNFRENEQLSKYGDTKSARTIMMSELKKLIEANPLPREKLIYPNLKASRLRTLINQSLNWQHQLCKHPRPNPQIKTLFTDHACGPPNGARTPTQPTASLGVLPKATTFTPIGPHGPFPSSSTATSGLASWMSNPNMVTSPQAPVAVGPSVPVPPNQATLLKRPRTPPGSSSVVDYQTADSEQLIKRLRPVSQSIDEATYPGPTLRVPWSTDDLPKTLARALNEPYPVTSIDFHPSQQTFLLVGTKNGEITLWEVGSREKLATRSFKIWDNANCSNHLEAAFVKDSSVSINRVLWSPDGTLIGIAFTKHLVHTYTFQGLDLRQHLEIDANVGGVNDLAFSHPNKQLCVVTCGDDKMIKVWDAVTGRKLYNFEGHDAPVYSVCPHHKENIQFIFSTAVDGKIKAWLYDHLGSRVDYDAPGHSCTTMMYSADGTRLFSCGTSKEGESFLVEWNESEGAIKRTYSGLRKKGSGVVQFDTTQNHFLAVGDEHLIKFWDMDSTNMLTSCDAEGGLLNLPRLRFNKEGSLLAVTTVNGIKILANADGQKLLKTHENRTFDLPSRAHIDAASATSSPATGRMERIERTSSAKTVSGINGVDPAQSSEKLRLSDDLSEKTKIWKLTEITDSIQCRCITLPENAAEPASKVSRLLYTNSGVGLLALGSNAVHKLWKWNRSEQNPSGKATASVHPQRWQPTSGLLMTNDITDINPEEAVPCIALSKNDSYVMSASGGKVSLFNMMTFKVMTTFMPPPPASTFLAFHPQDNNIIAIGMEDSTIHIYNVRVDEVKTKLKGHQKRITGLAFSSTQNILVSSGADAQLCVWNTETWEKRKSKTIQMPVGKTVSGDTRVQFHSDQLHILVVHETQLAIYDAYKLERQYQWVPQDALSAPILYATYSCNRQLIYATFSDGNIGVYDAEILRPRCRIAPTTYLSSGTSSSTSLPLVVAAHPHEPNQFAIGLSDGAVQVLEPSESEGKWGVSPPPENGVVPAVVAGPSTSNQGSEQAPR 244 3671
460 WD40重复蛋白 MAKDEEEFRGEMEERLVNEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDREEPPGKDYSVQKMILGTHTSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQINHDGEVNRARYMPQNPFIIATKTVSAEVYVFDYSKHPSKPPQDGGCHPDLRLRGHNTEGYGLSWSPFKHGHLLSGSDQAQICLWDINVPAKNKVLEAQQITKVHEGVVEDVAWHLRHEYLFGSVGDDRHLLIWDLRTSATNKPLHSVVAHQGEVNCLAFNPFNEWVLATGSADRTVKLEDLRKISSALHTFSCHKEEVFQIGWSPKNETILASCSADRRLNVWQLSRIDEFQTPEDALDGPPELLFIHGGRTSKISDFSWNPCEDWVLASVAEDNILQIWQQNAENIYHDEEDHPPEEVV 163 1431
461 WD40重复蛋白 MSPGVKQTGSQKFKSGHQQVVHDVTMDYYGKRIATCSADRTIKLFGLNASDTPSLLASLTGHEGPVWQVAWAHPXFGSMLASCSYDGRVIIWREGQQENENSQVQVFKEHEASVNSISWAPNELGLCLACGSSDGSITVFTCREDGSWDKTKIDQAHQVGVTAVSWAPASAPGSLVGQPSDPIQKLVSGGCDNTAKVNKFYNGSWKLDCFPPLQMHTDWVRDVAWAPNLGLPKSTIASCSQDGKVVIWTQGKEGDKWEGRILNDFKIPVWRVNWSLTGNILAVADGNNSVTLWKEAVDGDMNQVTTVQ 155 1081
462 WD40重复蛋白 MSSGVKQTGSQKFESGHQDVVHDVTMDYYGKRIATCSAQRTIKLFGMNTSDTPTLLASLTGHEGPVWQVAWAHPKFGSMLASCSYDRRVIIWREGQQENENSQVQVFKEHEASVNSISWAPHELGLCLACGSSDGSITVFTGREDGSWDKTKIDQAHQVGVTAVSWAPASAPGSLVGQPSDPVQKLVSGGCDNTAKVWKFYNGSWKLDCFPPLQMHTDWVRDVAMAPNLGLPKSTIASCSQDGRVVINTQGKEGDKWEGKILNDFKTPVWRISWSLTGNILAVADGNNNVTLNKEAVDGEWNQVTTVQ 537 1463
463 WD40重复蛋白 MKKRSRPSNGHLSTAAKNKSRKTAPITKDPFFDSAHNRNKSKGKGKSRGKGEEIFSSDEDDDAIGRDAPAEEEEEIAEEEHETADEKRLRVAKAYLDKIRAITKANEEDNEEEAGEDEETEAERRGKRDSLVAEILQQEQLEESGRVQPQLASRVVTPSKLVECRVVKRHKQSVTAVALTEDDLRGFSASKDGTIIHWDVETGASEKYEWPSQAVSVSSSMEVSKTQKGKGSKKQGSKHVLSMAVSSDGRYLATGGLDRYIHLWDTRTQKHIQAFRGHRGAVSCLAFRQGTQQLISGSFDRTIKLWSAEDRAYMDTLYGHQSIILAVDCLRKERVLSVGRDHTLRLHKVPEETQLVFRGHAASLECCCFINNEDFLSGSDDGSIELWSMLRKKPVFNAKNAHGHAIVENLSEDTSTREEPDEEVTTRQLPNGNSIGNGMTNQMGITPSVESWVGAVTVCRGTDLAASGASNGVVRLWAIENSSKSLRALHDIPLTGFVNSLTFARSGRFLIAGVGQEPRLGRWGRQIAARNGVTLCPIELS 284 1909
464 WD40重复蛋白 MAATFGTINTATSPHNPNKSFEIVQPPNDSISSLSFSPKANYLVATSWDNQVRCWEVLQTGASMPKAAMSHDQPVLCSTWKDDGTAVFSAGCDKQAKMWPLLTGGQPVTVAMHDAPIKDIAWIPEKRLATGSWDKTLKYNWDTRQSNPVHTQQLPERCFALSVRHPLMVVGTADRNLIIPNLQNPQTEFKRISSPLKYQTRCVAAFPDKQGFLVGSIEGRVGVHHVEEAQQSKNFTFKCHRDSNDIYAVNSLNFHPVHQTFATAGSDGAFNFWDKDSKQRLKAMARSWQPIPCSTFNSDGSLYAYAVSYDWSKGAENHNPATAKHHILLHVPQESEIKGKPRVTTSGRK 610 1659
465 WD40重复蛋白 MVVMDKGTRQTNEDESESEFIDEDDVIDEISIDEEDLPDADVEGEDVQEDNKRSEPDENSSSLDDAIHTFEGHEDTLFAVACSPVDATWVASGGGDDKAFMWRIGHATPFFELKGHTDSVVALSFSNDGLLLASGGLDGVVRIWDASTGNLIHVLDGPGGGIEWVRMHPKGRLVLAGSEDYSTWNWNADLGKCLSVYTGHCESVTCGDFTPDGKAICTGSADGSLRVWNPQTQESKLTVKGYPYHTEGLTCLSISSDSTLVVSGSTDGSVHVVNIKNGKVVSLVGHSGSIECVRFFSPSLTWVATGGMDKKLMIWELQSSSLRCTCQHEEGFNRLSNSLSSQHIITSSLDGIVRLWDSRSGVCERVFEGHNDSIQDNVVTVDQRFILTGSDDTTAKVFEIGAF 241 1452
466 WD40重复蛋白 NPVFRTAFNGYAVKFSPFVETRLAVATAQNFGIIGNGRQHVLELTPNGIVEVCAFDSSDGLYDCTMSEANELNVVSASGDGSVKIWDIALPPVANPIRSLEEHAREVYSVDWNLVRKDCFLSASWDDTIRLNTIDRPQSMRLFKEHTYCIYAAVWNPRHAMVFASASGDCTVRIWDVREPNATIIIPAHEHEILSCDWNKYNDCMLVTGSVDKLIKVWDIRTYRTPMTVLEGHTYAIRRVKFSPHQESLIASCSYDNTTCNWDYRAPEDALLARYDHHTEFAVGIDISVLVEGLLASTGNDETVYVWQHGMDPRAC 223 1173
467 WD40重复蛋白 MDSRKRRSRLNLPPGNSPSSLHLETTAGSPGLSRVNSSPSTPSPSRTTTYSDRFIPSRTGSRLNGFALIDKQPQPLPSPTRSAAEGRDDASSSSASAYSTLLRHELFGEDVVGPATPATPEKSTGLYGGSRDSIKSPMSPSRMLFRFKNDHGGNSPGSPYSASTVGSEGLFSSNVGTPPKPARKITRSPYKVLDAPALQDDFYLNLVDWSSNNVLAVGLGTCVYLNSACTSKVTKLCDLGVNDSVCSVGWTPQGTHLAVGTNIGEVQIWDTSRCKKVRTMGGHCTRAGALAWSSYILSSGSRDRNILHRDIRVQDDFIRKLVGHKSEVCGLKWSYDDRELASGGNDNQLLVWNQQSAQPLLRFNEHTAAVKAIAWSPHQHGILASGGGTADRCLRFWNTATDTRLNCVDTGSQVCNLVWCKNVNELVSTHGYSQNQIMVWRYPSMSKLATLTGHTLRVLYLAISPDGQTIVTGAGDETLRFWSIFPSPKSQSAVHDSGLWSLGRTHIR 251 1777
468 WD40重复蛋白 NEKKKVVVPIVCHGHSRPIVDLFYSPVTPDGLFLISASKDSSTMLRNGETGDWIGTFEGHKGAVNSCCLDNRALRAASGSADFSAKIWDALTGDELHCFVHKHIVRACAFSESTSLLLTGGHEKILRIFDLNRPDAPPKEVDNSPGSIRTVAWLRSDQTILSSNSDAGGVRLWDLRTEKIVRVLETKSPVTSAEVSQDGRYITTADGNSVKFWDANHFGMVRSYTMPCMVESASLEPTMGNMFVAGGEDMWVRLFDFHTGEEIACNKGHHGPVHCVRFAPGGESYSSGSEDGTIRIWQTLNMNSEENESYGVNGLSGKVRVGVDDVVQKVEGFQITADGHLNDKPEKPNP 367 1419
  469 WD40重复蛋白 MERYSQGTQKKSEIYTYEAPWQIYGMNWSVRKDKKFRLGIGSFLEEYNNRVEIIELDEESGEFKSDPRLAFDHPYPTTKIMFVPDKECQRPDLLATTGDYLRIWQVCEDRVEPKSLLNNNKNSEFCAPLTSFDWNDADPKRIGTSSIDTTCTIWDIEKEVVDTQLLAHDKEVYDIAWGEVGVFASVSADGSYRVFDLRDREHSTIIYESSQPETPLLRLGWNKQQPRFIATILMDSCKVVILDIRFPTLPVAELQRHQASVNTIAWAPHSPCHICTAGDDSQALIWELSSVSQPLVEGGGLDPILAYTAAAEINQLQWSSMQPDWVAIAFSNEVQILRV 284 1303
470 WD40重复蛋白 HQSENNLDESLHLREVQELQGHTDTVWAVAWNPVTGIDGAPSMLASCSGDKTVRIWENTHTLNSTSPSWACKAVLEETHTRTVRSCAWSPNGXLLATASFDATTAIWENVGGEFECLASLEGHHENEVKSVSWSASGMLATCGRDKSVWIWDVQPGNEFECVSVLQGHTQDVKMVQWHPNRDILVSASYDNSIKVMAEDGDGDDWACMQTLGNSVSGHTSTVWAVSFNSSGDRNVSCSDDLTLMVWDTSINPARRSGNAGPWKHLCTISGYHDRTIFSVHWSRSGLIASGASDDCIRLFSESTDDSVTPVDGTSYKLILKKEKAHSHDVNSVQWHPSEPDLLASASDDGRIKIWEVTRINGLANSH 684 1784
471 WD40重复蛋白 NKRAYKLQEFVAHASNVNCLKIGKKSSKVLVTGGEDRKVMMRAIGKPNAILSLSGHSSAVESVTFDSAEALVVAGAASGTIKLWDLEEAKIVRTLTGHRSNCISVDFHPFGEFFASGSLDTNLKIHDIRSKGCIHTYKGHTRGVNSERFSPDGRWVVSGGEDWIVKLWDLTAGKLMHDFKCHEGQIQCHDFHPQEFLLATGSADRTVKFWDLETFELTGSAGPETTGVRAMIFNPDGRTLLTGLHESLKVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVSRTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQRAGIAFSSKNLPASSGPPSYVSTPKKNSTSRVQPTTNFQTLSRPDIVPVIVPRSNSLRPETTSDVKKEMNNFGRVVPSTVSTKSTDVIRSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHVSSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTPPWSATDDGVTCQPDRQVTAPELSKRVVEPGRARALYASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRGSHGTSESDLTVSDDWSAIEELMQQHNAFTSILQARLTKLQVIRRFWQRNDLKGAIDATGKMGDHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDYIRATISATPTIGVDLQAEQRLERCNLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV 336 2738
472 WD40重复蛋白 MSTLEIEARDVIKIVLQFCKENSLHQTFQTLQNECQVSLNTVDSLETFVADINSGRWDVILPQVAQLKLPRKKLEDLYEQIVLEMIELRKLDTARAILRQTQAMGFMKQEQPERYLRLEHLLVRTYFDPREAYHESSKEKRRSQIAQALASEVTVVPPSRLMALIGQSLKWQQHQGLLPPGTQFDLFRGTAAVKADEEEMYPTTLAHTIKFGKQSHPECARPSPDGQYLVSCSVDGFIEVWDYISGKLKKDLQYQADDSFMMHDDAVLCVDFSRDSEMLASGSQDGKIKVWRIRTGQCLRRLERAHSQGVTSLSFSRDGSQLLSTSFDSTARIHGLKSGKALKEFRGHTSYVNDAIFTSDGGRVITASSDCTVKVWDVXTTDCIQTFKPPPPLKGGDVSVNSVHLFPKNSEHIVVCNKASSIYIMTLQGQVVKSFSSGKREGGDFVAACISPKGEWIYCVGEDRNIYCFSQQSGKLEHLMKAHDKDIIGVTPHPHRNLLVTYSEDSTMKIWKP 81 1622
473 WD40重复蛋白 MDIELEDQPEDLDFHPSAPIVAVALITGRLQLFRYVDISSEPERLWTVTANTESCRAARFINAGSSVLTASPDCSILATNVETGQPVARLDNAHGAAINCLTNLTESTIASGDENGIIKVWDTRQNSCCNKFKAHEDYISDMEFVPDTMQLLGTSGDGTLSVCNLRKNKVHARSEFSEDELLSVALMKNGKKVVCGSQEGVLLLYSWGYFKDCSDRFVGHPHSVDALLKLDEDTVLTGSSDGIIRVVSILPNKMIGVIGEHSSYPIERLAFSHDRNVLGSASHDQILKLWQIHYLHSDDEPETHKQEAVWDENVDMQLDVDTEKRPRGSKRKKRAEKGQTSSQKQSSDFFADI 399 1460
474 WD40重复蛋白 MDRIQQIPHTCVARKINLPLGHSKESLALNIPANLAPTHSPPSITYSDRFIPSRKASNFEEFALPDRTBPSPNSAGGQSSSTNGEGRDDACAAYSALLRTELFPATPKKTCGCRRPVIGSPSGNVFRFKSQQCKSQSPFSLCPVGEDGDLSETGAVARATTRKIPRSPFKVLDAPALQDDFYLNLVDWSSHNILAVGLSACVYLWSASSSKVTKLCDLGLDDNVCSVAWTQRGTYLAVGTNNGGVQRNDAAHCKQVRTMEGHCTRVGTLAWNSAILSSGGRDRNILQRDIRAQDDFVSKFSGHKSEVCGLXWSYDNRELASGGNDNQLFVNNQQSQQPVLKYNEHTAAVKAIAWSPHQGGLLASGGGTADRCIAFWNTATNTSLNCVDTGSQVCNLVNSKNVNELVSTHGYSQNQIIVWRYPTMSKLATLTGHTLRVLYLAISPDGQTIVTGAGDETLRFWNVFPSSKTQQNTIRDMGVWSSGRTHIR 207 1673
475 WD40重复蛋白 NAGGQGEGEEKVDKLSMELTEDVHKSMEIGAVFKDYNGKINSLDFHRTNNYLVTASDDEAIRLFOTASATWQKTSYSKKYGVDLICFTNHQTSVLYSSRNGNKESLRHLSIMDNXYLRYFKGHHDRVVSLCMSPKGECFMSGSLDRTVLLWDLRIDKCQGLIRVRGRPAVAYDEQGLVFAISNEGGLTKMFDARLYOKGPFDTFVVEGDKSEASGIKFSNDGKLILLSTMDSNIHVLDAYQGTPVHSFSVEAVPNGGEAVPNGGTLEASFSPDGKFVISGSGNGNIHAWSVNSGKEVACNTTEGVIPAVVKWAPRRLMFASGSSVLSLNVPDLSKLASLTGSNSNSAY 263 1309
476 WD40重复蛋白 NHRVGSTGNTSNSSRPRREKRLTYVLNDANDSRHCSGJNCLVISKLSLLGGNDYLFSGSRDGTLKRNELADDSAVCSATFESBVDAVNDAVLTGETLVSCSSDTTLKTWRPFSDGVCTRTLRQHSDYVTCLAAASKNSNIVASGGLGBEYFINDIKAANAPVSRTSEAMDDDTSNGVLSSGNSVLSTTVRSTNATNSASLHTSQLQGYTPIAAXGHRESVYALAMNDVGTLLVSGGTEKVVRVWDPRSGAXQNKLRGHTDNVRALILDSTGRFCLSGSSDSIIRLWDLGQQRCVHSYAVHTDSVWALASTPNFSHVYSGGRDLSLYLTDLTTRESLLLCMERHPLLRLTLQDDSIWVATTDSSLHKWPAEGQNPPKNFQRGGSFLAGNLSFTRARACLEGSAPVPVNTQPSFVIPGSPGIVQHEILNNRRHVLTKDAEGTVKLWEITRGAVLDDYGKVSFEEXXEELFEMVSIPAWFTMDTRLGSNSVHLDTPQCPTAEMYAVDLNVPDAPEEQKINLAQETLRGLLAHWLSRRRQRLATQASANGDFPAGQENALRNHTSSRIDVHDDAETHIAGTLPAFDFSTTSPPSIITEGSQGGPWRKKITDLDGTEDERDFPWNCLECVLHGRLSPRESLKCSFYLHPYEGTTVQVLTQGKLSAPRILRIQKVINYVLEXHVLDRPLDSSNSETTFTPGLSGNQSHAAVVGDGSLRSGARVWQQKAKPLVEILCNNQVLSPDMSLATVRTYIWKKPDDLYLYYTLVQNR 232 2529
477 WD40重复蛋白 MMRGKTIQMQAAHQNHDGETSVACVLWDWHAKHLITAGADNTILIHSYPSSSSSKPITLRHHKNAVTAIAINSNVRSLASGSVDRSVKLYSYPGGEFQSNVTRFTLPITSLAFNKSGELLAAAGDDEGIKLISTIDNSIAKVLKGHHGPVTSISFDPKNEFLASSDSDCTVIYWELSTGKPVHTLKKIAPNTTSNPTSLNQISWRPOGEMLAVPGRDSEVSMYDRDTAKKLFSLKGGHSDTICSLAWSPNGKYIATAGTDRQVNVWDADRRQDIDKQRFDNPICSVANXPSDNALAVIDVLGRFGVWLSPIASHMKSPADGAERYDNMEDEEPLNARYEEBLEDSVSGSLNEIINDDDDDDEHGXIPRKILQKKPSVXVEKGKEESNARAFKSGQDSFKLKSAMQEAFQPGRTQRQSGKRNFLAYNHLGSVITFDNDGFSHIEVDFHDIGKGCRVPSMTDYFGFTMASLSESGSVFGSPQKGEKNPSTLMYRPFSSWANNSENSMRFPMGEEVKAVALGSGNVAAVTSLNFLRVFSEGGLQKFVLSMDGPVVTAAGYENLLVVVSHASNPLLSGDQVLSFTVYDISQKTCPLSGRLFLSPGSHLTWLGFSEEGLLSSYDSEGNLRVFTNDYNGCWVPIFSAARKRKSETESIWMVGLNSTQVFCVVCKLPDTYPQVAPKPVLSVLNLSLPLACSDLGADDLENEYLRGSLLLSQMQKKAEDAVACGRESNMEEDSIFKMEAALDRCLLRLIANCCKCDKLVRATELARLLSLEKSLQGAIKLVSAMKLPMLAERFNTILEEKILQBNMETISCRRLTSEAQDMDTPISISVKQVSYGANLGDSPFLPNRQVEPKBSTPVPSXPDTKIEVDTSEAIAKGCDAQNGNIRSGDAEVQPASHNDSIQKPSNPFAKASNTSANQAVQRNASLLSSIKQMKTATENBGXRKERARSGSLPQKPAKQSKIS 56 2950
478 WD40重复蛋白 MRQXRKGHQVDDPKYSVQTPQEDDTPNESGPASEEVESSDEEGGNSSNIEDDIIYSSSEEDPVVSSDYEEDEDAESDAEGVTAEQELEGDIDNALQNYMGTLTVLSNFHGENLKNAEGEDTSGDDDDEEEMPKRAEESDSPEDENDERPKRAEESDFSEDEDEERPKRAEESDSSEDEVPSRNTVGDVPLRWYXDEQHIGYDIKGXKIKKQPXXDQLDSFLASTDDSSDWRKVYDEYNDEEVELTKDEIKFISRLRKGTIPHADVNPYEPYVDWFDWKDKGHPLSNAPEPKRRFIPSXWEAKKVVKLVRAIRKGWITFQKAEEKPRFYLMWGDDLKPSEKMAMGLSYIPAPKPKLPGHEESYNPPPEYIPTQEEINSYQLMYEEDRPHFIPKRFDSLPNVPAYDRFLSEIFERCLDLYLCPHTRKKRINIDPESLIPKLPKPKDLQPFPSICFLEYKGHTGAVSCISPESSGQWLASGSHDGTVRIWEVETARCLKVWDIGRPIQHIAWNPVSQLSILAVAVDEEVLVLNTGLGSEDSQEKVAELLBVKSKPVSADDLGDNTSLTKWIKWEKFDGIKLTHLRPVHLISWHHKGDYFATVAPDGNTRAVLVHQLSKQQTQNPFKKMQGRVVHVLFHPSRAIFFVATKTHVRVYDLVRQQLVKRLVTGLHKVSSMAVHHKGDNLLVGSKEGKVCWFDMDLSTQPYKTLKNHSKDIHSVAFHDSYPLFASCSDDCKAYVFYGLVYSDLLQNPLIVPLKVLQGHQSVNGMGVLDCQFPHXQPMLFTAGADSVVDLYCN 193 2577
479 WD40重复蛋白 MMSLKRGFEESLVPADRQKTELSTVTYGDGPRRTSSLESPIMLLTGHHAAIYTHKFNPTGTVIASGSHEREIFLWNVHGDCKNFMVLKGHKNAVLDLHNTTDGCQIISASPDXTLRAWDYETGKQIKKMAEHSSFVNSCCPSRRGPPLVVSGSDDGTAKLWDLRHRGAIQTFPDKYQYTAVGFSDAADKIISGGIDMEIHVNDLRRGEVTHRLQGHTDTITGMQLSSDGSYLLTNSMDCSLRIWDMRPYAPQNRCVKILTGHQHNFEKNLLKCSWSSDGSKVTAGSADRMVYIWDTTTRRILYKLPGHTGSVNETGFHPTQPIIGSCSSDKQIYLGEIEPNVGYQAVI 187 1233
480 WD40重复蛋白 MEFSDTYKHTGPCCFSPDARYLAIAVDYRLVIRDVVTLKVVQLYSCMDKISNIENALDSEYILCGLYKRAMVQAWSLSQPEWTCKIDEGPAGIAHARWSPDSRHIITTSDFQLRLTVWSLVNTACIHIQWPKHASKGVSFTQDGHFAAIATRRDCKDYVNLLSCHTWEVMGTFTVDTIDLADLEWSPNDSAIVVWDSPLEYKVLIYSPKGRCLFKYQAYDSWLGVKTVAWSPCSQFLAVGSYDQTLRTLNHLTWKFFAEFVHVSTVRGPASAVVFREVEEPWNLDVSGLHINDDNAHDIQDGKPAEGHSRVRYKVVEFPVNVSSQKHPVDKPNPKQGIGLLANSRDSQYLFTRNDNMPTALWIWDICRLELAALLIQKEPIRAAAWDPVYPRVALCTGSSHLYMWTPSGACCVNIPLPQFVVSDLKWNPDGTSHLLKDRESFCCTTVPMLPEFNDDETNEE 51 1436
481 WD40重复蛋白 MAKLIETHSCVPSTERGRGILTAGDAKTWSIIYCNGRSVIMRNLDNPLEASVRGEHSYPATVARFSPNGEWVASGDTSGTVRIWGRGSDHTLKYEYKALAGRIDDLEWSADGQRIVVCGDSKGKSMVRAFMWDSGTNVGEFDGHSRRVLSCSFKPTRPFRVATCGEDFLVNFYEGPPFRFKTSHRDHSNYVNCVFJAPKGSKFITVGSDRKGVIFDGKMGEKIGELSKEGGHTGSIYAASWSPDSKQVLTVSADKSAKIWEISETGNGTVKKTLTFGSQGGADDMLVGCLNLNDYLITVSLGGTVSLLSAVDPDKPPKTISGKMKSLNAIALSLQSGQSEVCSSSYDGVIVRWILGVGYAGRVERKDSTQIKCLATIEGELVTCGFDNKVRRVPLLSEQHKESEPIDIGAQPKDLDVAVGCPELTFVSTDAGIIIIRASKIVSTTHVGYAVTAAAISPDGTEAVVGGQDGKLRVYSIKGDTLLKESVLERHRGPINAIRFSPDGSMFASGDLNREAVVWDRITREVKLRNMVYHTARINCIAWSPDSSKVATGSLDTCILIYEVGKPASSRITIKGAHLGGVYGLAFSDQSTVISAGEDACVRVWSLP 525 2351
482 WD40重复蛋白 MPQPSVILATAGYDHTVRFWEATSGRCYRTLQYPDSQVNHLEJTPDKQYLAAAGNPRIRLFEVNSNNPQPVISYDSHTNNVTAVGFQCDGKWMYSGSEDGTVKIWDLRAPGFQREYRAAWMTVVLHPMQTELISGDQNGNGNIRVWDLNANSCSCELVPEDTAVRSLTVVMDGSLVVAANNHGTCYVNRLMRGTQTMTWFEPLHKLQAHNSYILRCLLSPEFCEHHRYLATTSSDQVKIWNVDGFTLENTLTGHQRWVVWDCVFSVDGAFLVTASSDSTARLWDLSTGEAIRTYQGHHKATVCCALHDGTDGASC 152 1099
483 WD40重复蛋白 MLTKFFTKSNRVKGLSFHPKRPWILASLHSGVIQLWDYRMGTLIDKFDEHDGPVRGVHFHXTQPLFVSGGDDYKIKVWMYKMRQCLFTFVGHLDYIRTVHFHNEYPWIVSASDDQTIRLWNWQSRVCISVLTGHNHYVMMSASFHPKDLVVSASLDQTVRVWDISGLRXXTVSPADDLSRLAQMNTDLFGGGDVVVKYVLEGHDRGVNWAAFHTSLPLIVSGADDRQVKLMRMNDTKAWEVDTLRGATNNVSCYIPHARQDIIVSNSIDKSIRVWDNSKRTSVQTFRREHDRFWILAAHPEMNLLAAGHDSGMIVFKLERRRPAYVVYGGSLLYVKDRYLRTYEFATQKDNPLIPIRKPGSIGPNQGPRSLSYSPTEMAILICSDADGGAYELYAVPKDSHGRSDTVQEAKKGLGGSAVFVARNRFAVLDKNHNQVTIKNLKNEVTKKFDLPVTADALFYAGFGNLLCRSEDSVFLFDMQQRTVLGEIQTPNVRYVVNSNDMENVALLSKHTIIIASKKLSSTCSLHETIRVKSGAWQQNGIFMYSTLNHIKYCLPNGDSGIIKTLDVPVYITKVSGKSLYCLDRDGKNRVIQIDITECLFKLALSKKKYDYVINMIRNSQLCGQATTAYLQQRGFPEVALHFVRDERTRFNLAVESGNIEIAVASAKEIDEKDHWYRLGVEALRQGNAGIVEYAYQRTRNFERLSFLYLITGNLDKLSDMLRIAEMKNDYMGQFHNALYLGDIQERIKILEESGHLNLAYATASLMGLADLADRLAADLGGNIPVLPPGKKSSLLMPPAPILHGGDWPLLRVTRGIFEGGLKKNSTSAAYEEEDEEAADWGEDIDIENIEGENGKATVLDDQEVKGGEDDEGGWDMEDLELPPDVAAANVGTNQKTLFVAPTLGMPVSQIWMQKSSLAGEHAAAGNFETALHLLTRQLGIRNFSPLKPLFLELYMGSHTFLPSFASVPAFSLALQRGNSESASPNIRGPPALVYRLSVLEEKLTVAYPATTEGRFSEALRLFLNILHTIPVIVVDSRKEIDEVKELIGIAKEYVLGLRMEVKRKEIRDDAVRQQELAAYFTHCNLQKALKLALLNAMGISYRCKNYNTAAANFARRLLETDPSSNHATKARQVLQVCERNLQDATCLNXDFRNPFVVCGATFTPIYRGQKEVSCPYCMARFVPDIAGKLCSICDLAIVGSDASGLFCFATQTR 470 4114
484 WD40重复蛋白 MDLLQNYQDDSEDSNPELRNHPPLEDATATSAPAGVENETSSSPDSSPLRLALPAKSCAPDVDETLNALGVPGSEKKNNHNKPIDPTQHSVTFNPSYDQLWAPLYGPAHPYAKDGIAQGMRNHKLGFVEDSAIEPFMFDEQYNTFHRYGYAADFSASLGSHIVGDLESLKKNDGASVYNLPKREHKRQKLEKKMIQKDENEEKEKEVGEEVDNPSTEEWLKKNRKSPWAGKKEGLQTELTEEQKKYAQERAEKKGDDREKGKVEIVDKTTFHGKEERDYQGRSWIDPPKDAKATNDHCYIPKRWVHTWSGHTKGVSAIRFFPKYGHILLSAGMDTKVKINDVFNSGKCMRTYMGHSKAVRDISFSNDGSRFLSAGYDRNIKLWDTETGKVISTFSTGKIPYVVKLHPDEDKQNVLLAGMSDKKIVQWDMNSGEITQEYDQHLGAVNTITFVDNNRRFVTSSDDKSLRVWEFGSIVVIKYISEPHMHSMPSISLHPNTNWLAAQSLDNQILIYSTRERFQLNKKKRFAGHIAAGYACQVNFSPDGRFVMSGDGEGRCWFWDWKTCKVFRTLKCHDNVCIGCEWHPLEQSKVATCGWDGMIKYWD 196 2007
485 WD40重复蛋白 MARXGLGTDPAIGSLMSSKKRKEYRVTHRFQEEGKRLYAIAFNFIDARYHNIFATAGGTRVTIYQCLEGGAISVLQAYVDDDKDESFYTLSMACDVNGSPLLVAGGHNGIIRVLDVNEKVHKSFVGHGDSVNEIRTQALKPSLILGSASKDESVRLWNVQTGICILIFAGAGGHREVLSVDFHPSDVYRIASCGMDNTVXISWSMKLFWTYVEKSFTWTDLPSKFPTKYVQFPVFIAAVHSNYVDCTRWLGNFILSKSVDNEVVLWEPYSKEQSTSDGVVDILQKYPVPECDIWFIKFSCDFHYNSMAVGNREGKVYVWELQSSPPNLIARLSHAHCKNPIRQTAISHDGSTILCCCDDGSNMRNDVVQ 214 1323
486 WD40重复蛋白 MESGAGGSVGARVPSAKPKMLQQPPYSNGDDDNDMERGTAPVPSSNPNTVSKMELDKDFLCPICNQTMKQAFLTACGHSFCYMCIMTHLNNKSNCPCCSLYLTNNQLFPNFLLNKLLKKTSACQMASTASPVENLCLSLQQGAEVSVKELDFLLTLLAEXKRKMEQERAETHMEILLDFLQRLRQQKQAELNEVQADLHYIKDDILALBKRRLELSRARERYSRXLHNLLDDPMDTTLGHAAIDDGMNVRTAFVRGGQGDAISGKFQQKKAXIDAQASSQGMQXRANFCHSDSQVLPTLSGLTIARKRRVLAQFDDLQECYLOKRRRWATQLRKQCDGGLRXERDGNSISREGYWAGLEEFQSILTTFTRYSRLRVISELRHGDLFHSANIVSSIEFDRDDELFATAGVSRRIXVFDFATVVNEPAVVHCPVVEHSTRSKLSCLSWNKCIRSQIASSDYEGIVTVWDVNTRQSVHMYEEHEKRAWSVDFSRTEPTRLISGSDDGKVKVWCTRQETSVLNIDHKANICCVKYNPGSSYYVAVGSADHHIHYYDLRNPSVPLYEFNGHRKTVSYVKFISTNELASASTDSTLRLWDVRDNCLVRTFKGHTNEFNFVGLTVNSEYIACGSETNGVPVYHXAISKPAAWHQFGSPDLDDSDDDTSHFISAVCWKSESFTMLAANSQGTIKVLVLAP 68 2146
487 WD40重复蛋白 MANYVDSXXNFKCVPALQQFYTGGPFRLSSDGSFLVCACNDEVKVVDLATGSVKNTLEGDSELIVALALTPDNKYLFSASRSTQIKFWDLSSATCKRTWKAHNGPVADMACDASGGLLATAGADRSILVWDVDGGYCTHSPRGHQGVVTTVIFHPDPHCLLIFSGSDDATVRIWDLVAKKCISVLIKHFSTVTSLAISENGWNLLSAGRDKVVNINDLRDYHCRATIPTYEPLEAVCVLPTGSRLVSVMNQSRALPENRKKSGAAPVYFLTVGERGTVRIWYSEGALCLYEQKSSDAIISSDKDELKGGFVSAVLLPLTQGVMCVTADQRFLFYNLDESDKGKCDLKVSKRLIGYNEEIVDLKFLGDEEKFLAVATNLEQVRMYDLSSMTCVYRLSGHTDIVLCLDTVVFSGHSLLASGSKDHTVRIWDTESKSCICVAAGHMGAVGAVAFSHKAKNFFVSGSSDRTIKVWSFASVLDFGGISKSIKLSSQAAVAAHDKDINSVAVAPNDSLICTGSQDRTARIWRLPDLVPVLVLRGHKRGVWCVEFSPVDQCVMTASGDKTIKIWALSDGSCLRTFEGHTASVLRASFLTRGTQFVSSGADGLLKLWTIKSNECIATFDQHEDKIWAMAVGKKTEMLATGGSDSLVNLWHDCTTTDEEEALLKEEEAALKDQELLNALADTDYVRATQLAFELRRPYKLLNVFTELYSKGHAQDQIQKVIRELGNEELRLLLEYVREWNTKPKFAHVAQFVLFQLFNVLPPKEIIEVQGISELLEGLIPYAQRHYSRIDRLMRSTFLLDYTLSSMSVLSPTETDLSSSNLLARTADPLNAQIDQFHPTHFPEPNLTPIQSLLDSGNTDSVEVTARRAKKKRVSGNDSEKTTVAEVKIGDMENAFDEPDVADQGSSRKHKPASSKKRKSIAVGNASIKRIASGNAVTIALQV 874 3705
488 WD40重复蛋白 MESSCSSMNSNRHSTEKRCLRPLQKQGASMNKHSSDRFIPARGSIDLDVARFMVTQKQXDNNDIHALSPSPSPSKKAYQXEMADTLLKNAGAADNNCRILSFMGKSSTVSQGSQENVLANLSISRRARRYIPQSADRTLDAPDLLDDYYLNLLDWSSTNVLSTALGNTVYLWDASNSSISELLIADEEEGPVTSVSWAPDGSQIAVGLNNSVVQLNDSQSNKKLRALKGHHDRVGALSWNGPILTTGGLDGIIINNDVRTRDHIVQTYKGHTQEVCGLKNSPSGQQLASGGNDNLLYIMDKSNASHNPSSQYFHQLDEHCAAVKALAWCPFQTNLLASGGGTSDGSIKFWNQTGACLNTVDTHSQVCSLLWNRMHERBLLSSHGLNQNQLTLWKYPBMVKITELTGHTARVLHMAQSPDGYTVASAAADETLKFWQVFGAPDASKKTKTKDTKGAFNNFHMRIR 360 1754
489 WD40重复蛋白 MLDEIVADEEEEFNIWKKNTPLLYDVVITHALEWPSLTVQWLPDRHQSPTKDYSIQKMIVGTHTSGDEPNYLMIAEVQMPLQYSEDGNVGGFESTEAKVHIIQQINHEGEVNRAQYMPQNSFIIATKVSSDVYVFKYTKHSSNAPQERVCHNPELILKGHTNEGYSLSWSPLKEGQLLSGSNDAQICFWDINAASGRKVVEAKQIFKVHEGAVEDVSWHLKHEYLFGSVGDDCHLLINDTRTTAAPNKPQHSVVAHSEVNSLAFNPFNEWLLATGSADKTVKLFDLRKLSCSLHTFSNHTEEVFQIEWSPMNETILASSGGDRRLMVWDLRRIGDEQTSRDAEDGPPELIFIHGGHTSKISDFSWMLHDDHLIASVAEDNILQIWQMQENIYHDDADIL 185 1384
490 WD40重复蛋白 MTKEDHGESRDEMGERMVNEEYKLWKKNTPFLYDLVITHALEWPSLTVQWLPPSCKQQQDIIKDDDIDHPNTQMVILGTHTSDHEPNYLILAEVQLHDGTEDEDGDGDVKRPQCKMRPGTSGGAMGKVRILQQINHQKEVNRARYMPQKPTIIATKTVNADVYVEDYSKHPSKPPQEGRCNPELRLQGHESLGYGLSNSPLKEGHLLSASDDAQICLWDITAATKAPKVVLANQIFRYHDGPVEDVAWHAIHDHLFGSVGDDHHLLLWDIRNDSEKPLHIVEANQAEVNCLAFHPFNEWIVATGSADRTVALHDIRKLDXVLBTCAHHMEEVFQIGWSPQNGAILASCGSDRRLMVWDLSRIGDEQNPEDAEEAPPELLFIHGGHTSKISDFSWNPAEEWVIASVAEDNILQVWQMSEHIYNDDNDSPTA 241 1533
491 WD40重复蛋白 MAMAHGDENAADPVEEFNIWKKNTPFLYDLVITHALEWPSLTVQWLPDRHQSSTADYSLQHMIVGTHTSEDEPNYLMIAEVQIPLQNSEDNIIGGFESTEAXVQIIQKINHEGEVNKARYMPQNSEVIATKTVSSDVYVFDYSKHPSKAPQERVCNPELILDGHSNEGYGLSWSPLXEGYLLSGSWDAQICLWDINAAFGKKVLEANQIFKVHEGAVGDVSWHLKHEYLFGSVGDDCHLLIWDMRTAAPNKPQQSVIAHQSEVNSLAFNPFNEWLLATGSMDKTVKLFDLRKLSCSLHTFSNHTDQVFQIEWSPMNETILASSGADRRIMVWDIAAIGETPEDEEDGPPELLFVHGGHTSKISDFSNNLNDDRVIASVAEDNILQIWQHAKNIYHDDEDML 230 1435
  492 WD40重复蛋白 MGLFEPFRALGYITDGYPFAVQRRGIETFVTLSVGXAWQIYNCAKLIPVLVGPQMDKKIRALACHRDFTPAATGHDIAVFRRAHQVATWSGHKAKVTLLLSFGQHVLSVDLCGCLFIWAVAEVNQNKPPIGQIQLGEKFSPSCIMHPDTYLNKVLIGSEEGTLQLWNVNTRKKLYEFKGWGSSIRCCVSSPALDVVGIGCSDGKIHVHNLRYDEEIVTFMHSTRGAVTALSFRTDGQPLLAAGGSSGVISIWNLEKKKLQSVIKDAHDSSVCSLHFFANEPVLMSSATCNSIKMWIFDTTDGEARLLKYRSGHSAPPMCIRYYGKGRHILSAGQDRAFRIFSVIQDQQSRELSQGHVGXRAKKLKVRDEEIKLPPVIAFDAAEIRERDWCNVVTCHLDDPCAYTWRLQNFVIGEHILXPCLEDPTPVRSCSISACGMFAVLGTEGGWLERFNLQSGISRGTYIDIGEKRQCAHNGAVVGLACDATNTLLISGGYNGDIKVWDFRGRELRFRWEIEVPLIXIVYHPGNGILATAADDMILRLFDVTAMRLVRIFVGHMDRVTDLCFSGDGRWLLSSSHDGTIRVWDIISSRQLNAMRMDSAVTALSLSPGMDNLATTHVGHNGIYLWANRMIYSKATDIEPFISGKQVVKVSMPTVSSKRESEEGDKKRTIVAESNVNXSDVSGSLIGDSYSAQLTPELVTLALLPKAQWQSLVNLDIIXMRNKPIEPPKKFEKAPFFLPSLPTLSGERIFIPSSMNGDGDQDETRNDKTVFEARGXKLGGESLSFMQLLQSCRRIKDFTTFTNYLXGLSPSAVDMELRLLDIVDNENISETEHSVELQGIGMLLDYFVNEVSCNNNFEFVQALIRLFLKIHGETIRCQVSLQEXARXLLEIQSSTWERLDTSFQNARCNITFLSSSQF 101 2857
493 WD40重复蛋白 MIAAVCWVPKGVADVLPDSAEPPTQEEIQRLLKCNVVAESDDNEDSDEESEEMDTETDKNTDAVAXALAAANALGSQSSDFQRQHKVDDIANGLKELDMDHYDDEDEGIDIFGSGSLQHCYYPANDMDPYLVEQDDDOEDEIEDMTIKPSDLIILSARNEDDVSHLEVWIYSSETEEGGSNMYVHHDIILPAFPLSLAWLDCNLKGGEKGNFVAVGTMQPEIELWDLDVLDEVEPAVVLGGAVRDEASGKTTKLKXXKKNKQAVNFKEGSHTDAVLGLAWNMEYRNVLASASADKSVRIWDIVAEXCEHTMQPHTDKVQAVAMNPNQATVLLSGSFDRSVIMMDMRAPTHSGIRWPVPADVESLAWDPHTDHSFMVSAEDGTVRGFDIRAAASTADFDGKPMFILHAHDXAVCAISYNPAAPSLLTTGSTDXMVKLWDITNNQPSCIASTWPNVGAVFSAAFSKNSPFLLRTGGSKGILHVWDTLDNSEVARRFGKFRPQN 43 1548
494 WEE1样蛋白 HIMDENBFCDIFSLRKRLCLLSSQBGEEEEELEAMSQLDAGEFTVTGNEBVVAIAEDDVNTGILSQDLFSSQDYCTPSQPQDSTDLDSKDKAPCPLSPVKSTIQRKRCRPELLSNPPDSIQFSFQRLERVRSEESIQSSSQQLARVRSEVSSSDDFKTPKITASGQKNYVSQSALALRAHVMSPPCIKNPYLDENEELNEKIQRSTRRSPACVTPIQSGACLSHYRADFHLLEEIGRGNFSRVYKALNRLDGCCYAVKCSQSELRLDTERKVALMEVQSLAALGPHKNIVGYHTAWFENDHLYIQMELCDHNLTTANDRGILRTDTDFLEAYYQIAQALEFIHGRGVAHLDVRPSNIYVRDGTYKLGDFGRATLINGTLHVEEGDARYNSREILNDNYEHLDKVDMFSLGATFFELLMRKQYPGSGKRIDRDTEIRIPILPGFSIYFQKLLQDLVSNDPGKRPSAXDVLKNPIFNKVRGAKEV 206 1657
495 WD40重复蛋白 MLAPALEMEPVEPQSLKKLSFKSLFRALDLFSPVHGQIAFFDPESKKMRISYKLNFEYGGGSGSEDQVPKRKESGAAQNQGQQAAGASNALALPGPEGSKIPPMEKSQKALTVGPSLRPQGLNDVGLHGKGTAIISASGSSDRNLSTSAIMERLPSRWPRPVWHPPWKNYRVISGHLGWVRSIAFCPSNQWFCTGSADRTIKIWDLASGRLKLTLTGHIEQIRGLAVSSKHTYKFSAGDDKQVKCNDLEQNKVIRSYAGHLSGVYCLALHPTIDILLTGGRDSVCRVWDIRSRMQIFALSGHDNTVCSVFARPTDPQVVTGSHDTTIKFNDLRHGKTMTTLTNHKKSVRAMAQHPKENCFASASADNIKKFDLPRGEFLHNMLSQQKTIINTMAVNEEGVMATGGDNGSLWFWDHKSGHNFQQAHTIVQPGSLESEAGIYALSYDLTGSRLVSCEADKTIKMWKEDELATPETHPLNFKPPKDIRRF 117 1580
496 WD40重复蛋白 MEEAAHEQSAGSGKPKLLRYGLRSAAKPKEDKKEEQLHQPPPPPPFQQQAAPAPAPAATRSSTSGSAGGRDRRPQQQHAVDEKYARWKSLVPVLYDWLANHNLLWPSLSCKWGPQLEQATYRNRQRLYISEQTDGSVFNTLVIANCEVVKPRVAAAEHVSQFNERARSPFIRKYKTIIHPGEVNRIRELPQNPNIVATHTDSPDVLIWDVESQPNRHAVYGATASRPMLILTGHQENAEFALAMCPAEPFVLSGGKDKTVVLMSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPSVGPRGIYCGHEDTVEDVAFCPSTAQEFCSVGDDSCLILWDARIGTNPVAKVEKAHNGDLHCVDWNPHDNNLILTGSADNSVNMFDRRNLTSNGVGSPVYXFEGHXAAVLCVQWSPDKPSVFGSSAEDGLLNIWDYERVDKKVDRAPNAPAGLFFQHAGHRDKIVDFHWNTADPWTMVSVSDDCOTAGGGGTLQIWRMSDLIYRPEEEVLAELENFXAHVLBCSKA1 111 1700
497 WD40重复蛋白 MAKDEEEPRGENEERLVNEEYKIWXKNTPFLYDLVITHALEWPSLTVOWLPDREEPPGKDYSVQDMILGTHPSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQINHDGBVNRARYMPQNPFIIATETVSAEVYVFKYSKHPSKPPQDGGCHFDLRLRGHNTEGYGLSWSPFKHGHLLSGSDDAQICLWDINVPAKNKVLEAQQIEKVHEGVVEDVAHHLRHEYLFGSVGDDRHLLIHDLRTSATNKPLHSVVARQGEVNCLAFNPPNENVLATGSADRTVKLFDLRRISSALHTFSCHKEEVFQIGWSPKNETILASCSADRRLMVNDLSRIDEFQTPEDALDGPPELLFIHGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMARNIYHDEEDDMPPEEVV 144 1412
498 周期素依赖性激酶抑制剂 MGKYMRRGKGVGEVAVHEVSQGSLGVRTRARTLAAASSQKDHRRLGASKSVTTKHQSSAPPASPCVESSMHTCYLELRSRKLEKFSRCYHSAHGATSHGESKRSLSLSEPSRLAVSEEARVASDKSSHRVLQQQSSVAHSRNNSATFSRNAKPAKAAORKERRDDDHTSARPSEAPHEDEDGNEVEASFGENVMDLDSRERRTRETTPSSYTRDVETHETPGSTTRPPSNAGRRRFQTEGGHGTRNQFHVPTTNEIEEFFAGAEQQEQRRFTDRYNYDPVSDSPLPGRFEWVRLRP 793 1683
499 CDK D型 MQHMEENVQSSWSLHGNKEICARYEILKRVSSGTYLDVYRGRRKEDGLIVALKEVKDYQSSWREIEALQRLCGCPNVVRLYEVILEFLTSDLYSVIKSAKNKGENGIPEAEVKAWMIQILQGLANCHANWVIHRDLKPSNMLISAYGILKLADFGSMSFLDRAIYEVEYELPQEDILADAPGERLMDEDDSVXGVNNEGEEDSSTAYETNFDDMAETANLDLSWKNEGDMVMQGFTSGVGTRWYRAPDFLYGATIYGKEIDLWSLGCILGELLILEPLFSGTSNIDQLSRLVKVLGLQQKNWPGCSNLPDYRKLCFPGDGSPVGLKNHVPNCSDNMFSILERLVCYDPAARLNAKEIVENKYFVEDPYPVLTHELRVPSPLREENNFSEDWAKWKDMEVDSDLENIDEFNVVHSSDGFCIKFS 415 2196
500 组蛋白乙酰基转移酶 MAPVKRRIEPEKTKANEGKRRRKVAFAYDTGIEANDCISLHLVSTPEEMRDAEGVEDQSLSFNPKYMQHFVGEHGXIYGYKGLKIDVWLNALSFHAYVDIQYESKVEEGKSEKEATDLTDIMKRIFGRGLVEDRNAFIQSFSSNSQSIESMTHNEGERIATREILTDKGLSAQGDSERLGVSNEIFRLELSDPQIRENHARLEPLVLLFVEGSQPISQDDPKWEMYIRVQAESLSGGSAVCRLLGFCTVYRFYHYPDTTRLRISQKLVFPPYQGKGHGLLLLEAVNKTAVSRDSYDVTVEEPSESLQELRDCMDTIRLLSFEPVMRAVKSAVQKLKEANPSDKGAADHCLEGNVNNETVTTSSTKPKNKSGWFPPPGLVEEVRKHLKISKKQFKRCWEILLYLNLDRSDSQCEDKYHISLMEQIMSELFDKSSEXSAKGKRVIDIDNEYDNSKTFIMVRTRNPGNGEGFLPEALEGGMEVSQEDQLXSLFEERLEEIAQIAEKVPSLCKALCMP 109 1653
501 组蛋白去乙酰基转移酶 MPEDRKKILEALAARRRAEXESGEKKKRQKSSLNPAKPVSKPVSKPVGGIGSAGKSTSAPISSTKAKSKHKEEVKAKRVTKMDRYETDEDDESEEEEDLDSESDDDELSDEDSEDDIKSKSVKKLPPQSKGKAPVKGISSSNGKGRDEKGKGIMKDKGRAKAKVEESSSDAEGDSDDDGGDLSDDPLQEVDPSNILPSKTRRRASQPTNYQFANMSGDDDDDDDSD 343 1023
502 组蛋白去乙酰基转移酶 MADVPESLQQERDEQGTDKNCCDGKFQKEIDIDDNEEEYNESSIDDEEENLSDNVATNNMGFIPQGQACMAVTVEGIEHANSVGCGRNGREGSEEVTAAEDMGHVSIENIREQGRMRKSSEQLLALYEQEGLLEDDEDDDDVDWEPFGGVTVQMKMICTMCTMANSDDSVHCDSCGEHRNSDILRQGFLASPYLPAESPSSSDVPDERLEESKCVMTTLTPSISPMIGVCCSSLQSERRTVVGFDERMLLHSEIQMETYPHPERPDRLPAIAASLRAAGLFPGKCFSIPAREATCEELQTIHSLEHVNAVESTSCGMLSHLSPDTYANFHSSLAARLAAGLCADLAKAIMTGOAONGFALVRPPGHHAGVXDSNGFCLHNKAAIAVSASRVVGAKKVLIVDNDVHHGNGTQZIFEADQSVLYISLHRHGEGFYPGSGAVTEVGSSKGEGYSVNFPWKCGGVGDNDYIFAFQHAVLPIAEQFEPDLTIISAGFDAAKGDPLGRCEVTPDGFABMAQHLSCLSKGKMLVILEGGYNLRSISASATAVIKVLLGDNPKALPIDIQPSKGGLQTLLEVFEIQSKYWSSLKGHDQKLRSQNEAQYGSKKRKVIRKRHMHIVGGPVWWKWGRKRVVYYHWFARVSSRKHL 417 2351
503 肽基脯氨酰基异构酶 MASGAGAAGVVEWHQKPPNPKNPVVFFDVTIGTIPAGRIKMELFADIVPRTAENFRQFCTGEYRKAGIPIGYKGCHFHRVIXDEMIQAGDFVKGDGSGCISIYGSKFEDENFIAKHTGPGLLSNANSGPNTNGCQFFLTCAKCDWLKNKHVVFGRVLGEGLLVLRKIENYQTGQHNRPKLPCVIAECGEM 69 641
504 肽基脯氨酰基异构酶 MAKLVSSVCAFSCQQRHPRSRPRELSNRDHYNHYHNHSHYHNVCYFPPMMMMQQQLQKQKRMTTKTITSLFKCNSSNHTLLKGLXEFMGFKFRLQAAMLSCEMSILGRVFAIFFIVHQAAAPFPFNHFDNWLVPPASAVLYSPNTKVPRTGEVALRRSIPANPAMKSIQDFLEDIYYLLRFPQRKPYGTMEGDVKSALQIAINEKDSILGSVPLDMKERGLQLYNFLIDGQGGLQVLIEYIKEKDPDKVSVNLSSSLDTIAQLELLQAPGLPYLLPEXYQQYPRLNGRATleftMERGDNSMFSVSSGGGLQKTATIQVVLDGYSAPLTAGNFTKLVIDGAYNGLXLXTTEQAVISDNERAEAGFNLPIEILPAGGFEPLYRTTLSYQDGELPVLPLSVYGAIAMAHNTISEDYSSPSQFFFYLYDRRNAGLGGLSFDEGQFSVFQYTTVGKEILPQLKTGDIIKSAKLVDGFDHLVLPSSST 172 1623
505 WD40重复蛋白 MLHYYQDDFDYLVDDEMVDFADDVEDDVRTRRRSDIDSDSENDFDSNNKSPDTTALQARRGKDIQGIPWNRLNFTREKYRETRLQQYXNYENLPRPRRSRNLDRECTNFERGSSPYDFRHNTRSVXATTVHFQLRNLVMATSRHNVYLMQWYSIMHWSSLKQKGEEVLNVAGPIIPSVKRPGSSPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDRPGVSPCTKISHDENGITNAVEIYNDASGATRLMTANNDLAVRVFDTEKFTVLERPSFPWSVNHTSVSPDGKLVAVLGDNADCLLADCKTGKTVGTLRGHLDYSFAAAWHPDGYILAPGNQDTTCRLWDVRKLSSSLAVLDGRMGAIRSIRFSSDGRFMAMAFPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDTEAFFVGVADRTYGSLLEFNRRRMNYYLDSIL 231 1768
506 WD40重复蛋白 MDCSGDEEEEQFFLSLEEMLSPSDSGSEAADNETGCRNADARSKYEIWKRAPSSIQERRQRFLVRMGLANPSELGNQVNSTSAESTCSTETANIPNGIERLRENSGAVLRTAGSSGRKTHCXNVINIGLREGSVRSSSSSNGTPDVGEDNGEFGGTIFSRSGGTWECHCKIKNLDSGREFVVDELGQDGLWNKLREVGTDRQLTMDEFERSLGLSPLVQELMWRESGVAQADCNGVHHHDAEISSSKRRSWLKALKSAAYSMRRPKEDQSNYDSERSGRRSGSFDVPWGKPQWTKVRMYRKRYKEFTALYMGQEIEAHEGSIWTMKFSLDGRYLASAGQDCVIHVREVIESMRTFGADTPDLYASSAYFSMNGLQELVPLSIEDHANKHKRGKIIGSKKSSNSDCIVLPNKVPQLSEEPVCSFHGHLLDVFDLSWSPSDYLLSSSMDKTVRLMRLGHEBCLKVFSHNDIVTCIQENPVDERYFISGSLDGKARIWSIPDRQVVDWSDLREMYTAVCYTPDGQGGLVGSIKGSCRFYNTSGNKLQLENQLNVRSKKKKSSGKKITGFQFAPGGDSQKVLITSADSRVRVYNGSELVCXYKGFPNTCSQISASFAPNGQHFVCASEDSRVYIWNHESPRGSGARHEKSSWSHEHFLSQGVSVAIPWSGMKLQPPVWNSFEFNLGQRHNLLSLQGGKDVGCQNGLLSRBAGEGQESETPLHYISQVSHSCGSQNMVDRDGQDDLSRYSACISDSRLSSFMAFPESPGNPDDLNSKVFFSDSSSKGSATNPEEKLPPTRKQSRSNSTSSHYDTLKTHIGWIIQGQSGASAAVAWGLVIVTAGNGGEIRSTQNYGLPVRL 376 2943
507 WD40重复蛋白 MPSIPAIGEFTVCEINRELLTTKDESDTQAKDAYAKILGLVFPPISFQIEEGFGSASRQQFDQDLDREDTIVTPSTSEGTNALQEGGLLLKGVSVLKNILASSFGPIFSPWDTKVLKKVKLLQGISWHRHKHILAFISGSNQVTVHDFQDFENRESSLLVSESQRGIEALEWRPNGGTTLSVACRGGICIWSASYPGSVAPVRSGVASFLGTSTRGSSVRNTLVDFLQIPGGXAVTALSWSPTGRLLASASREDSSFTIWDVAQGVGTPLRRGLGGISLLKWSPTGDYLFSAKPNGTFYLWETNTWTLEQWSSSGGCVISATNGPDGRMLFMAFSESTTLGSLHFAGRFFSLDAHLLPMELPEIGSITGGFGNIEKMAWDGCGERLAVSYTGGDLMYVGLIAIYDTRRTPFISASLVGPIRGPGEQVKPLAFAPHDKTKQGPLLSVCWSSGLCCTYPLIFRAH 107 1498
508 WD40重复蛋白 MEEENAKSTEETRQVQVRFITKLQPALRVPTTSLAIPAHLTRYGLSDIVNTLLGNDKPQPTDELVESELVRTSLEKLLLIKGISAEKILNIEYILAVVPPKQEEPSLHDDWVSVVOGSYPNFIFSGSFDSIGRIWKGEGLCTHVLEGHRDAITSAAFIMPSDSSDSFINLATASKDRTLRLWQFKPNEHMTNGKMVRPYKLLKGHTSSVQTVSACPRRNLICSGSWDCSIKIWQTAGENDIESNAGSVKKRKLEDSTEQTISQIEASRTLEGHSQCVSSVVWLEKDTIYSASWDHSVRSWDVETGVNSLTVGCRKALHCLSIGGEGSALIAAGGADSVLRTWDPRMPGTFTPILQLSSHRSWITACKWHPKSRHHLISASHDGTLKLWDVRSKVPLTTLEAHKDKVLCADNNKEDCYISGGADSTLQIFSNLNLT 118 1425
509 WD40重复蛋白 NNRLHSKRNHILELRLGQSEPEKFATLASNRSRGTNAPIVVEDDDDVVVSSPRSFALARSSVSQRSSRIPIVNEEDLILRLGLAVTGRTSAEHNPRRRHGRVPPNKPIVLCDDAGEADQSSSKKRRTGQQLSSDVQSDESKEVKLTCAICISTMEEETSTICGHIFCKKCITNAIHRWKRCPTCRKXLAINNIHRIYISSSTG 186 797
510 WD40重复蛋白 MEEPPPPAVLPSSEDTSIVSSHSFVWAPPTVPVGLDASIPQISTPGINQPGLTIPVPPSAAPLTASLVAASAGMPPAVVPSFVRPAIVAHPSVMPPPSMPLAALPMPVASAVPVAAPHFPPSTPNDNSITPSMPVPTPIVASSSVPPSVTIPGIAPLPFIAPIPVPSSRPVAPSPEMPPARPLGASVSVAMDVDNTDEQDQDADNKGESPSSSPDHPEDPSAAEYEITEESRKVRERQEQAIQELLLRRRAYALAVPTNDSSVRARLRRLWEPITLFGEREMERRDRLRALMAFLDAEGQLEKLMKVQEEEEAAANVDAEEVQEMEGPQVYPFYTEGSQELLKARTEITKFSLPRAVSRLQRARRKREDPDEDEDEELKCVLQQSAQINMDCSEIGDDRPLSGCAFSSDGTLLATSAWSGVTKLWSVPNINKVATLKGHTERVTDVAFSPTNCHLATACADRTAMLWNSEGVLMKTYEGHLDRLARLAFHPSGLYLGTASFDKTWRLNDVNTGIELLLQEGHSRSVYGIAFQCDGSLAATCGLDGLARIWDLRTGRSILALEGHVKPVLGIDFSPNGYHLATGSEDHTCRIWDLRRRQSVYIIPAHSHLVSQVKFEFQEGYFLVTASYDSTAKVWSARDEKSIKVLAGHEAKVTSVDITADGQYIATVSHDRTIKLWSSKNSTNDMNIG 387 2456
511 WD40重复蛋白 MKRAYKLQEFVAHASNVNCLKIGKKSSRVLYTGGEDHKVNMWAIGKPHAILSLSGHSSAVESVTFDSAEALVVAGAASGTIKLWDLEEAKIVRTLTGHRSNCISVDFHPFGEFFASGSLDTNLKIWDIRRKGCIHTYKGHTRGVNSIRFSFDGRWVVSGGEDNIVKLWDLTAGKLMHDFKCHEGQIQCMDFHPQEFLLATGSADRTVKFWDLETFELIGSAGPETTGVRAMIENPDGRTLLTGLHESLKVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVSRTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQRAGIAFSSKNLPASSGPPSYVSTEKKNSTSRVQQTTKFQTLSRPDIVDVIVPRSNSLRPETTSDAKKLMNNFGRVVPSTVSTKSTDVIKSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHYSSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTEPWSATDDGVTCQPDRQVTAPELSKRVVEPGRARALVASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRGSHGTSESDLTVSDDNSAIEELMQQHNAFTSILQARLTKLQVIRREWQRNDLKGAIDATGKMGDHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDVIRATISATPTIGVDLQAEQRLERCHLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV 359 2761
512 周期素B1 MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAPPYPCAVNKRVLSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDDDKMADDPPVPNFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYIDDLYMFYQKAEASSCVPPNYMDRQQDINERMRGILIDWLIEVHYKFELMDETLYLTVNLIDRFLAVQPVVKKKLQLVGVTAMLLACKYEEVSVPVVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSVPTPYVFMRRFLKAAQSDKKLELLSFFIIELSLVEYDMLKFPPSLLAASAIYTALSTITRTKQWSTTCEWHTSYSEEQLLECARLMVTFHQRAGSGKLTGVHRKYSTSKFGHAARTEPANFLLDFRL 238 1648
513 周期素依赖性激酶抑制剂 MQAPREGKSAAAIVGMGKYMKKSKAIPRDVSLLEASPRSPSATGVRTRAKTLASRRLRRASQRRPPPPAAAAAAAAPSLDASPCPFSYLQLRSRRLRRRRLAPSPEARIDEGPAGSGSRGSRDASCSARTASSSGGVEGEGACVGRGDRGNGGECVRDAAVDASYGENDLEIEDRDRSTRESTPCSLIRDSNANTPPGSTTRQQSSCTAHRTQMSILRSIPTSDEMEEFFAYAEQRQQRSFIEKYNFDIVKDRPLPGRFEWVQVIP 59 859
514 组蛋白乙酰基转移酶 MDGHSSHLAAQNRSRGSQTPSPSHSAASASATSSIHLKRKLSAANASAASAAAAAAAAAAAADDHAPPFPPSSISADTRDCALTSNDDLESISARGGGAGDDSDDDSDDEEEDDGDNDGGSSLKTPTAARLBNVGPAAARMRKIKAESWATVKVEKEDSAKDGGNGAGVGALGPAATSGAGSGSGTVPKEDAVKIFTKNLQASGAYSAREENLKREEEAGRLKFECLSNDGVDDHMVWLIGLKNIFARQLFNMPKEYIVRLVMDRNHKSVMVIRRNLVVGGITYRPYASQKFGEIAFCAIKADEQVKGYGTRLMNHLKQHARDVDGLTHFLTYADNNAVGYFIKQGFTKEIYLDKDRWHGYIKDYDGGILMECKIDPKLPYTDLSTMVKRQRQAIDEKIRELSNCHIVYQGIDFQKRDAGVPQNTIKMEDIPGLREAGWTPDQMGYSRFRGLSDQKRLTFFIRQLLKVLNDHSDAWPFKEPVDAREVPDYYDIIKDPMDLKTMTRRVESEQYYVTLEMFIADVKRMFANARTYNSPDTIYFKIATRLEAHFQSKVQSNLQSGAGKIQQ 44 1829
515 肽基脯氨酰基异构酶 MFWGMMDPELFKLAQEQMNRNSPAELAKIQQQMMSNPELMRMASESMKNHRPIDLRQAAEQLKRVRPEEMAEIGEKMANASPEEIAAVRARADAQMTYEINAAKILKKEGNELHSQGRFKDASQKYLRAKNNLKGIPSSEGKNLLLACSLNLMSCYLKTRQYEECIKEGSEALACEEKNLKAFYRRGQAYRELGQLKDAVSDLRKAHEISPDDETIAQVLRDTEESLTKEGGSAPGGVVIEEITEEDETLASVNHESPSEYSEKRHQESEDARKGPINGDIMGQMTNSESLKALKGDPDAIRSFQNFISNADPTTLAAMGAGNAGEVSPDLIKTASSMIGKMSAEELQKMIQLASSFPGENPYVTRNSDSNSNSFGNGSIPNVSPDMLKTASDMMSKMSPDDLQRMFEMASSSRGKDPSLDANHASSSSGANLAANLNHILGESEPESSYHIPSSSRNISSSPLSNFPSSPGDMQEQIRNQMKDPAMRQMFTSNMKNMSPEMMANMGKQFGLELSPEDAAKAQEAMSSLSPEMLDKHMRWADRAQRGVETAKKTKNWLLGRPGMILAICMLLLAVILHRLGFIGS 109 1866
516 WD40重复蛋白 MIAAISWVPRCASKAVPEVAEPPSKEEIEEILKSGVVERSGDSDGEEDDENMDAVASEKADEVSTALSAADALGRISKVTKAGSGFEDIADGLRELDMDNYDEEDEDVKLFSTGLGDLYYPSNDMDPYLKDKDDDDDTEEIEDLSIKPMDSLIVCARTDDEVNLLEVYLLEPSLSDESNMYVHHEVVISEFPLCTAWLDCPIKGGDKGNFIAVGSMEPAIEIWDLDIIDAVEPCLVLGGQEELKKKKKKGKKASIRIKEGSHTDSVLGLAWNKEFRNILASASADRQVKIWDVAAGKCNITMEHHTDKVQAVAWNHHAPQVLLSGSFDHSVVMRDGRIPSHSGYRWSVTADVESLAWDPHSEHFFVVSLEDGTVRGFDVRAAISNSASQSLPSFTLHAHEFAVSTISYNPAAPNLLATGSTDKNVKLWDLSNNQPSCIASRNPKAGAVFSVSFSEDSPLLLAIGGSKGRLEVWDTSSDAAVSRRFGKHGKPKTAEPGS 212 1815
517 WD40重复蛋白 MKFCRRYQEYMQGQRGKKLPGLGEKKLKKILKRCRRRDSLHSQKALQAVQNPRTCPAHCSVCDGSFEPSLLEEMSAVLGCFNKQAQKLLELHLASGFQKYLMWFXGKLRGNHVALIQEGKDLVTYALINAIAIRKILKKYDKIHLSTQGQAFKSQVQRMHMEILQSPWLCELIAFHINVRETKANSGKGHALFLGCSLVVDDGKPSLSCELFDSIKLDIDLTCSICLDTVFDSVSLTCGHIYCYMCACSAASVTIVDGLKAAEPKEKCPLCREARVFEGAVHLDELHILLSRSCPEYWAERLQTERVERVRQAKEHWESQCRAFMGVE 207 1193
518 WD40重复蛋白 MVSTQSTRENPSIFFPPPLKPWLLPVVLSLSLSRQLGMAAAAAASLPFKKNYRSSQALQQFYAGGPFAVSSDGSFIACNCGDSIKIVDSSNASLRPSIDCGSDTITALSLSPDGKLLFSAGHSRQIRVWDLSTSTCLRSWKGHDGPVMSMACPVSGGLLATGGADRKVMVWDVDGGFCTHFFKGHDGVVSTVLFHPDSNRSLLFSGSDDGTIRVWDLLAKKCASTLRGHDSTVTSLAFSEDGLTLLAAGRDKVVSLWDLHNYACXXTIPMYEVLESVCVIHSGTVLASQLGLDDQLKVTKESAQNIHFITVGERGILRIWKSEGSVCLFKQEHSDVTVISDEDDSRSGETAAVMLPLDQGLLCVTADQQFLFYYPEKHPEGIFSLTLCRRLVGYNEEIVDMKFLGEEENFLAVATNLEQVRVYELASMSCSYVLAGRTETVLCLDTCISSSGRTLIVTGSKDNSVRLNDSESRRCIGVGVGHHGAVGAVAFSRKRQDFFVSGSSDRTLKVWSLDGISEDGVDSTNLKAKAVVAAHDKDINSVAVAPNDSLVCSGSQDRTACVWRLPDLVSVVVLKGHKRGIWSVEFSPVDQCVLTASGDKTVKIWAISDGSCLKTFEGHVSSVLRASFLTRGTQFVSCGADGLVKLWTVRTNECIATYDQHSDKVWALAVGKKTEMLATGGSDAVVNLWYDSTASDKEDAFRKEEEGVLKGQELEWAAVSDADYTAIELALELRRPHKLFELFSELCRTREVGDRVERILSALSGEEVCLLLEYIREWWAKPKLCHVAQSVLSQVFRILSPTEIVEIKGIGELLEGLIPYSQRHFSRIDRLVRSTYLLDYTLTGMSVIEPEADRSAVNDGSPDKSGLEKLEDGLLGENVGEEKIQNKEELESSAYKKRKLPRSKDRSKKKSKNVVYADAAAISFRA 6 2786
519 WD40重复蛋白 MDSAPRRKSGGINLPSGMSETSLRLDGFSGSSSSFRAISNLTSPSKSSSISDRFIPCRSSSRLHTFGLVERGSPVKEGGNEAYSRLLRAELFGSDFGSLSPAGQGSPMSPSKNMLRFKTESSGPNSPFSPSILRQDSGFSSEASTPPKPPRRVPKTPHKVLDAPSLQDDFYLNLVDMSSQNTLAVGLGTCVYLWSASNSKVTKLCDLGPNDGVCAVQWTREGSYISIGTSLGQVQIWDGTQCKRVRTMGGHQTRTGVLAWNSRILASGSRDRVILQHDLRVFNEFIGKLVGHKSEVCGLKWSHDDRELASGGNDNQLLVWNQHSQQPVLKLTEHTAAVKAIAWSPHQNGLLASGGGTADRCIRFWNTTNGHQTSSVDTGSQVCNLAWSKNVNELVSTHGYSQNQIMVWKYPSMAKVATLTGHSLRVLYLAMSPDGQTIVTGAGDETLRFWNVFPSAKAPAPVKDTGLNSLGRTHIR 213 1726
520 WD40重复蛋白 MEDEAEIYDGVRAQFPLTFGKQSKPQTSLESVHSATRRGGPAPAPAPASSSSLPSTTSPSAAGGAGKSSGLPSLSSSSTAWLEGLRAGNPRAGREAGIGSRGGDGEDGGRAMIGPPRPPPGFSANDDGGGEDDDDDGDGVMVGPPPPPPGNLGDGDDDEEEESAMIGPPRPPVVDSDEEEEEEEEENRYRLPLSNEIVLKGHNKIVSALAVDPTGSRVLSGSYDYTVRMFDPQGMNSRLSSFRDFEPVEGHQVRNLSWSPTADRFLCVTGSAQAKIYDRDGLTLGRFVKGDMYIRDLKNTKGHITGLTWGEWHPKTKETILTSSEDGSLRIWDVNDFKSQKQVIKPKLARPGRVPVTTCTWDREGKCIAGGIGDGSIQIWNLKPGWGSRPDIHVEQAHADDITGLKFSSDGKILLTRSFDDSLKVWDLRLMKNPLKVFEDLPNHYAQTNIACSPDEQLFLTGTSVEERSTIGGLLCFFDRSKLELVSRIGISPTCSVVDCAWHPRLNQIFATSGDKSQGGTHVLYDPTLSERGALVCVARAPRRKSVDDFELKQVIHNPHALPLFRDQPSRKRQREKILKDPLKSHKPELPMNGPGHGGRVGASKGSLLTQYLLRQGGMIKETWMDEDPREAILKHAQAAEKNPKFTRAYAETQPDPVFAKSDSEDEDK 101 2110
表16.BLAST序列比对表
SEQ ID 目标 专利标识 BlastX最高命中 基因名称 BlastX e值 BlastX一致性 BlastX重叠
1 CDKA型 eucalyptusSpp_003910  Q9FRN5 推定丝氨酸/苏氨酸激酶 0 367 492
2 CDKA型 eucalyptusSpp_019213  O44000 CDC2样蛋白激酶TPK2 e-160 217 290
3 CDKA型 eucalyptusSpp_036800  Q40789 蛋白激酶P34CDC2 0 259 294
4 CDKA型 eucalyptusSpp_040260  Q27168 CDC2 e-156 208 304
5 CDKA型 eucalyptusSpp_041965  Q43361 CDC2PAmRNA.SPTREMBL e-159 274 294
6 CDK B-1型 eucalyptusSpp_002906  Q9FYT9 周期素依赖性激酶B1-1 e-159 269 305
7 CDK B-2型 eucalyptusSpp_001518  Q9FSH4 B2型周期素依赖性激酶 0 270 315
8 CDK C型 eucalyptusSpp_008078  Q9LDC1 CRK1蛋白 0 415 558
9 CDK C型 eucalyptusSpp_009826  Q9LNN0 F8L10.9蛋白SPTREMBL 0 392 716
10 CDK C型 eucalyptusSpp_010364  Q8GZA7 推定周期素依赖性蛋白激酶 e-172 309 499
11 CDK C型 eucalyptusSpp_011523  Q8W2N0 周期素依赖性激酶CDC2C e-165 273 405
12 CDK C型 eucalyptusSpp_024359  P93320 CDC2MSC蛋白 0 448 523
13 CDK C型 eucalyptusSpp_039125  O80540 F14J9.26蛋白 0 418 743
14 CDK D型 eucalyptusSpp_005362  O80345 CDK活化激酶1AT(CDK活化激酶CAK1AT) e-180 305 483
15 CDK D型 eucalyptusSpp_044857  O80345 CDK活化激酶1AT(CDK活化激酶CAK1AT) e-177 302 477
16 周期素A eucalyptusSpp_001743  Q39879 有丝分裂周期素A2型  0 360 508
17 周期素A eucalyptusSpp_012405  Q39878 有丝分裂周期素A2型  e-179 278 470
18 周期素B eucalyptusspp_003739  Q9LDM4 F2D10.10(F5M15.6)  e-148 288 466
19 周期素B EucalyptusSpp_022338  P93557 有丝分裂周期素  e-168 310 476
20 周期素B eucalyptusSpp_028605  Q40337 B样周期素SPTREMBL  e-158 300 439
21 周期素B EucalyptusSpp_041006  Q40337 B样周期素  e-158 300 439
22 周期素D eucalyptusspp_006643  Q9SXN7 NtCycD3-1蛋白  1E-73 177 404
23 周期素D eucalyptusSpp_045338  Q8LK74 周期素D3.1蛋白SPTREMBL  e-101 190 332
24 周期素D eucalyptusSpp_046486  Q9ZRX7 周期素D3.2蛋白  e-126 196 373
25 周期素依赖性激酶调控亚单位 eucalyptusSpp_012070  CAB69358 来自专利WO9841642的序列1  8E-64 83 88
26 组蛋白乙酰基转移酶 eucalyptusSpp_006617  O80378 181(片段)  0 371 395
27 组蛋白乙酰基转移酶 eucalyptusSpp_007827  Q9FJT8 组蛋白乙酰基转移酶HAT B  e-148 260 465
28 组蛋白乙酰基转移酶 eucalyptusSpp_008036  Q9FJT8 组蛋白乙酰基转移酶HATB.SPTREMBL  e-149 262 465
30 组蛋白去乙酰基转移酶 eucalyptusSpp_001596  Q9M4T5 推定组蛋白去乙酰基转移酶HD2  7E-76 156 305
31 组蛋白去乙酰基转移酶 eucalyptusSpp_005870  Q9M4T4 推定组蛋白去乙酰基转移酶HD2c(AT5g03740/F17C15_160)  7E-66 144 318
32 组蛋白去乙酰基转移酶 eucalyptusSpp_006901  HDAC_ARATH 组蛋白去乙酰基转移酶(HD)  0 405 499
33 组蛋白去乙酰基转移酶 eucalyptusSpp_006902  AAM13152 组蛋白去乙酰基转移酶 0 427 499
34 组蛋白去乙酰基转移酶 eucalyptusSpp_007440  Q8W508 组蛋白去乙酰基转移酶 0 369 428
35 组蛋白去乙酰基转移酶 eucalyptusSpp_008994  Q8LD93 推定组蛋白去乙酰基转移酶 0 354 536
36 组蛋白去乙酰基转移酶 eucalyptusSpp_024580  Q94EJ2 At1g08460/T27G7_7(HDA8).SPTREMBL e-165 274 373
37 组蛋白去乙酰基转移酶 eucalyptusSpp_037831  Q9FML2 组蛋白去乙酰基转移酶SPTREMBL 0 356 464
38 MAT1 CDK活化激酶组装因子 eucalyptusSpp_034958  Q8LES8 假定蛋白 4E-47 101 190
39 肽基脯氨酰基异构酶 001209EGXC004488HT  TL40_SPIOL 肽基-脯氨酰基顺-反异构酶叶绿体阻遏物 0 329 392
40 肽基脯氨酰基异构酶 010310EGXD012820HT  Q9FJL3 肽基脯氨酰基异构酶 0 453 579
41 肽基脯氨酰基异构酶 010310EGXD013036HT O82646 假定57.1KDA蛋白(EC5.2.1.8) 0 302 521
42 肽基脯氨酰基异构酶 010316EGXF999037HT BAB39983 推定肽基脯氨酰基异构酶,叶绿体 e-115 146 172
43 肽基脯氨酰基异构酶 010324EGXF002118HT AAK32894 AT5G13120/T19L5_80 e-122 179 264
44 肽基脯氨酰基异构酶 011019EGKA001923HT AAM14253 假定20.3KDA蛋白 e-108 146 188
45 肽基脯氨酰基异构酶 eucalyptusSpp_000966 Q8L5T1 肽基脯氨酰基异构酶(亲环素)(EC5.2.1.8) 1E-91 155 170
46 肽基脯氨酰基异构酶 eucalyptusSpp_001037 Q8VX73 亲环素(EC5.2.1.8) e-120 155 169
47 肽基脯氨酰基异构酶 eucalyptusSpp_004603 AAM14253 假定20.3KDA蛋白 e-108 146 188
48 肽基脯氨酰基异构酶 eucalyptusSpp_005465 Q9SP02 亲环素ROC7(EC5.2.1.8)(AT5g58710/MZN1_160)(肽基-脯氨酰基顺-反异构酶) 2E-93 172 204
49 肽基脯氨酰基异构酶 eucalyptusSpp_006571  O49605 EC 5.2.1.8(亲环素样蛋白)(肽基-脯氨酰基顺-反异构酶) 9E-98 169 224
50 肽基脯氨酰基异构酶 eucalyptusSpp_006786  Q93VG0 亲环素(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 5E-82 142 164
51 肽基脯氨酰基异构酶 eucalyptusSpp_007057  Q38901 胞质亲环素(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 3E-84 144 172
52 肽基脯氨酰基异构酶 eucalyptusSpp_008670  Q9FJL3 肽基脯氨酰基异构酶 0 423 596
53 肽基脯氨酰基异构酶 eucalyptusSpp_009137  Q9C566 亲环素-40(EC5.2.1.8)(表达蛋白) e-168 285 361
54 肽基脯氨酰基异构酶 eucalyptusSpp_010285  Q9LY75 亲环素样蛋白(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) e-160 345 658
55 肽基脯氨酰基异构酶 eucalyptusSpp_010600  Q93YQ8 假定50.1KDA蛋白(片段) 0 346 475
56 肽基脯氨酰基异构酶 eucalyptusSpp_011551  Q9ZVG4 T2P11.13蛋白 e-115 154 192
57 肽基脯氨酰基异构酶 eucalyptusSpp_020743  Q8VXA5 推定环孢霉素A结合蛋白 e-125 161 172
58 肽基脯氨酰基异构酶 eucalyptusSpp_023739  FK21_NEUCR FK506结合蛋白前驱物(FKBP-21) 3E-49 74 112
60 肽基脯氨酰基异构酶 eucalyptusSpp_031985  Q8L8W5 亲环素样蛋白(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 1E-82 155 229
61 肽基脯氨酰基异构酶 eucalyptusSpp_032025  Q9LPC7 F22M8.7蛋白(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 1E-45 99 160
62 肽基脯氨酰基异构酶 eucalyptusSpp_032173  Q8L8W5 亲环素样蛋白(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 4E-83 156 229
64 视网膜母细胞瘤相关蛋白 eucalyptusSpp_009143  Q9SLZ4 视网膜母细胞瘤相关蛋白 0 704 1008
65 WD40重复蛋白 eucalyptusSpp_000349  AAK49947 TGF-β受体互作蛋白1 0 291 326
66 WD40重复蛋白 eucalyptusSpp_000575  Q9LW17 WD-40重复蛋白样(表达蛋白) e-168 282 341
67 WD40重复蛋白 eucalyptusSpp_000804 GBLP_SOYBN 鸟嘌呤核苷酸结合蛋白β亚单位样 0 291 326
68 WD40重复蛋白 eucalyptusSpp_000805 GBLP_MEDSA 鸟嘌呤核苷酸结合蛋白β e-171 291 327
69 WD40重复蛋白 eucalyptusSpp_000806 GBLP_MEDSA 鸟嘌呤核苷酸结合蛋白β亚单位样 e-171 291 327
70 WD40重复蛋白 eucalyptusSpp_002248 AAL86002 假定43.8KDA蛋白 0 261 388
71 WD40重复蛋白 eucalyptusSpp_003203 Q9SY00 推定WD重复蛋白(AT4G02730/T5J8_2) e-144 236 317
72 WD40重复蛋白 eucalyptusSpp_003209 AAM14986 假定32.6KDA蛋白 e-160 259 302
73 WD40重复蛋白 eucalyptusSpp_004429 Q9SZQ5 假定34.3KDA蛋白 0 260 322
74 WD40重复蛋白 eucalyptusSpp_004607 AAC27402 表达蛋白 0 253 356
75 WD40重复蛋白 eucalyptusSpp_004682 AAK00964 假定35.3KDA蛋白 0 264 313
76 WD40重复蛋白 eucalyptusSpp_005766 Q944S2 At2g47790/F17A22.18(表达蛋白)SPTREMBL e-155 264 396
77 WD40重复蛋白 eucalyptusSpp_005887 Q94AB4 AT3g13340/MDC11_13 0 332 446
78 WD40重复蛋白 eucalyptusSpp_005981 Q8L4X6 WD重复蛋白GhTTG2.SPTREMBL 0 315 348
79 WD40重复蛋白 eucalyptusSpp_006766 Q8L4M1 推定WD-40重复蛋白 e-137 234 369
80 WD40重复蛋白 eucalyptusSpp_006769 Q9LJC6 视网膜母细胞瘤结合蛋白样 0 372 566
81 WD40重复蛋白 eucalyptusSpp_006907 Q94C94 假定蛋白 0 446 812
82 WD40重复蛋白 eucalyptusSpp_007518 Q93ZN5 AT4G00090/F6N15_8 0 311 436
83 WD40重复蛋白 eucalyptusSpp_007717  O82266 At2g47990蛋白(假定58.9kDa蛋白) e-180 327 528
84 WD40重复蛋白 eucalyptusSpp_007718  Q8RWD8 假定蛋白SPTREMBL e-173 278 350
85 WD40重复蛋白 eucalyptusSpp_007741  Q8LA40 推定WD-40重复蛋白,MSI2 e-158 269 409
86 WD40重复蛋白 eucalyptusSpp_007884  Q9FHY2 与未知蛋白相似 e-149 316 765
87 WD40重复蛋白 eucalyptusSpp_008256  Q9LHN3 EMB|CAB63739.1(AT3G18860/MCB22_3) 0 524 758
88 WD40重复蛋白 eucalyptusSpp_008465  Q9FLS2 WD重复蛋白样 0 366 460
89 WD40重复蛋白 eucalyptusSpp_008616  Q9LYK6 假定蛋白 e-148 252 321
90 WD40重复蛋白 eucalyptusSpp_008690  Q9SW94 G蛋白β亚单位 0 326 376
91 WD40重复蛋白 eucalyptusSpp_008708  Q8L862 假定蛋白 e-167 297 487
92 WD40重复蛋白 eucalyptusSpp_008850  O22725 F11P17.7蛋白SPTREMBL 0 402 853
93 WD40重复蛋白 EucalyptusSpp_009072  Q9SAJ0 F23A5.2(2形)(推定mRNA输出蛋白) e-176 288 350
94 WD40重复蛋白 eucalyptusSpp_009465  Q9FLX9 NOTCHLESS蛋白类似物 0 384 475
95 WD40重复蛋白 eucalyptusSpp_009472  Q9SZA4 WD重复蛋白样蛋白 0 374 457
96 WD40重复蛋白 eucalyptusSpp_009550  Q9FKT5 GB|AAF54217.1(假定蛋白) e-167 275 313
97 WD40重复蛋白 eucalyptusSpp_010284  O22466 WD-40重复蛋白MSI1 0 397 423
98 WD40重复蛋白 eucalyptusSpp_010595  Q94C94 假定蛋白 0 419 789
99 WD40重复蛋白 eucalyptusSpp_010657  Q94AH2 假定33.1KDA蛋白 0 243 298
100 WD40重复蛋白 eucalyptusSpp_012636  Q8L611 假定蛋白 0 756 1133
101 WD40重复蛋白 eucalyptusSpp_012748  AAD10151 推定WD-40重复蛋白,MSI4 0 375 469
102 WD40重复蛋白 eucalyptusSpp_012879  Q8VZY6 授精独立性胚乳蛋白 0 291 377
103 WD40重复蛋白 eucalyptusSpp_015515  Q8LPI5 推定WD重复蛋白SPTREMBL 0 360 493
104 WD40重复蛋白 eucalyptusSpp_015724  O22607 WD-40重复蛋白MSI4 0 395 522
105 WD40重复蛋白 eucalyptusSpp_016167  Q93YS7 推定WD重复膜蛋白 0 663 917
106 WD40重复蛋白 eucalyptusSpp_016633  Q9SUY6 假定43.8KDA蛋白 e-174 240 384
107 WD40重复蛋白 eucalyptusSpp_017485 Q8RXC4 推定144.7kDa蛋白 0 650 1348
108 WD40重复蛋白 eucalyptusSpp_018007 O94289 含WD重复的蛋白 e-129 302 794
109 WD40重复蛋白 eucalyptusSpp_020775 Q8W403 Sec13p e-150 242 304
110 WD40重复蛋白 eucalyptusSpp_023132 AAK52092 WD-40重复蛋白 0 458 515
111 WD40重复蛋白 eucalyptusSpp_023569 Q9XIJ3 T10O24.21.SPTREMBL 0 404 576
112 WD40重复蛋白 eucalyptusSpp_023611 Q8LAJ2 剪切刺激因子50K链(剪切刺激 e-174 301 438
113 WD40重复蛋白 eucalyptusSpp_024934 Q94AB4 AT3g13340/MDC11_13.WD重复蛋白SPTREMBL 0 343 444
114 WD40重复蛋白 eucalyptusSpp_025546 O22212 含假定61.8kDa Trp-Asp重复的蛋白 0 352 566
115 WD40重复蛋白 eucalyptusSpp_030134  Q9LVF2 基因组DNA,染色体3,P1克隆:MIL23  0 677 946
116 WD40重复蛋白 eucalyptusSpp_031787  AAL91206 WD重复蛋白样  0 264 329
117 WD40重复蛋白 eucalyptusSpp_034435  Q9SAJ0 F23A5.2(2形)(推定mRNA输出蛋白)SPTREMBL  e-178 290 349
118 WD40重复蛋白 eucalyptusSpp_034452  Q94BR4 假定蛋白(推定前mRNA剪切因子  0 381 525
119 WD40重复蛋白 eucalyptusSpp_035789  P93563 鸟嘌呤核苷酸结合蛋白β亚单位  3E-88 171 356
120 WD40重复蛋白 eucalyptusSpp_035804  Q9FNN2 WD重复蛋白样.SPTREMBL  0 356 589
121 WD40重复蛋白 eucalyptusSpp_043057  Q9LV35 WD40重复蛋白SPTREMBL  0 472 610
122 WD40重复蛋白 eucalyptusSpp_046741  Q93VK1 AT4g28450/F20O9_130  0 363 452
123 WD40重复蛋白 eucalyptusSpp_047161  Q9ZUN8 推定WD-40重复蛋白  0 350 473
124 CDK A型 pinusRadiata_001766  Q9M3W7 推定CDC2相关蛋白激酶CRK2  e-128 237 436
125 CDK A型 pinusRadiata_002927  Q9FRN5 推定丝氨酸/苏氨酸激酶  0 349 470
126 CDK B-1型 990309PRCA009171HT  Q9FYT8 周期素依赖性激酶B1-2  e-145 244 303
127 CDK B-1型 pinusRadiata_013714  Q9FYT8 周期素依赖性激酶B1-2  e-174 222 304
128 CDK B-1型 pinusRadiata_016332  Q9FYT8 周期素依赖性激酶B1-2  e-178 228 304
129 CDK B-1型 pinusRadiata_021677  Q9FYT8 周期素依赖性激酶B1-2  e-176 229 304
130 CDK B-1型 pinusRadiata_027562  Q9FYT8 周期素依赖性激酶B1-2  e-118 211 304
131 CDK C型 pinusRadiata_001504  Q9LNN0 F8L10.9蛋白  0 434 790
132 CDK C型 pinusRadiata_025211  Q9LNN0 F8L10.9蛋白  0 371 746
133 CDK C型 pinusRadiata_020421  P93320 Cdc2MsC蛋白  0 318 432
134 CDK D型 pinusRadiata_003187  O80345 CDK活化激酶1AT(CDK活化激酶CAK1AT)  e-137 226 485
135 CDK D型 pinusRadiata_015661  Q947K6 CDK活化激酶  0 266 407
136 周期素A pinusRadiata_013874  Q96226 周期素  e-108 223 474
137 周期素A pinusRadiata_014615  CAC27333 推定A样周期素(片段)  0 332 390
138 周期素B pinusRadiata_004578  O65064 可能G2/有丝分裂特异性周期素(片段)  9E-87 162 217
139 周期素B pinusRadiata_023387  O04389 B样周期素  2E-98 220 466
140 周期素D pinusRadiata_006970  P93103 周期素D样蛋白  1E-75 135 293
141 周期素D pinusRadiata_010322  CAC17049 来自专利WO0065040的序列33  e-131 171 254
142 周期素D pinusRadiata_022721  P93103 周期素D样蛋白  1E-76 137 289
143 周期素D pinusRadiata_023407  Q9SMD5 CYCD3,2蛋白  8E-90 139 278
144 周期素依赖性激酶调控亚单位 pinusRadiata_001945  Q947Y1 推定周期素依赖性激酶调控亚单位  5E-55 74 86
145 周期素依赖性激酶调控亚单位 pinusRadiata_008233  CAB69358 来自专利WO9841642的序列1  4E-49 65 86
146 周期素依赖性激酶调控亚单位 pinusRadiata_008234  CAB69358 来自专利WO9841642的序列1  4E-49 65 86
147 周期素依赖性激酶调控亚单位 pinusRadiata_022054  CAB69358 来自专利WO9841642的序列1 8E-55 70 82
148 组蛋白乙酰基转移酶 pinusRadiata_012137  Q9FK40 组蛋白乙酰基转移酶(AT5g50320/MXI22_3) 0 496 555
149 组蛋白乙酰基转移酶 pinusRadiata_012582  O80378 181(片段)SPTREMBL 0 354 402
150 组蛋白乙酰基转移酶 pinusRadiata_015285  O80378 181(片段) 0 342 401
151 组蛋白乙酰基转移酶 pinusRadiata_017229  Q9LNC4 F9P14.9蛋白 e-118 268 585
152 组蛋白乙酰基转移酶 pinusRadiata_020724  Q9AR19 组蛋白乙酰基转移酶GCN5(表达蛋白) e-177 355 639
153 组蛋白去乙酰基转移酶 pinusRadiata_004555  AAM13152 组蛋白去乙酰基转移酶 0 331 488
154 组蛋白去乙酰基转移酶 pinusRadiata_004556  AAM13152 组蛋白去乙酰基转移酶 0 331 488
155 组蛋白去乙酰基转移酶 pinusRadiata_005729  Q9M4U5 组蛋白去乙酰基转移酶2b异形 9E-62 154 348
156 组蛋白去乙酰基转移酶 pinusRadiata_007395  AAM13152 组蛋白去乙酰基转移酶 0 335 426
157 组蛋白去乙酰基转移酶 pinusRadiata_009503  Q8W508 组蛋白去乙酰基转移酶 0 365 427
158 组蛋白去乙酰基转移酶 pinusRadiata_011283  AAM19887 AT1G08460/T27G7_7 0 255 366
159 组蛋白去乙酰基转移酶 pinusRadiata_012322  Q9FML2 组蛋白去乙酰基转移酶(推定组蛋白去乙酰基转移酶) 0 327 435
161 组蛋白去乙酰基转移酶 pinusRadiata_023236  Q8RX28 推定组蛋白去乙酰基转移酶 e-144 238 390
162 肽基脯氨酰基异构酶 pinusRadiata_000171  Q9FJL3 肽基脯氨酰基异构酶 0 364 549
163 肽基脯氨酰基异构酶 pinusRadiata_000172  Q38949 FK506结合蛋白FKBP62(ROF1) 0 365 552
164 肽基脯氨酰基异构酶 pinusRadiata_001480  Q8VXA5 推定环孢霉素A结合蛋白 e-125 161 172
168 肽基脯氨酰基异构酶 pinusRadiata_001692  FKB7_WHEAT 70 kDa肽基脯氨酰基异构酶(EC5.2.1.8) 0 418 553
169 肽基脯氨酰基异构酶 pinusRadiata_005313  AAB64339 FKBP型肽基-脯氨酰基顺-反异构酶 1E-97 135 175
170 肽基脯氨酰基异构酶 pinusRadiata_006362  BAB39983 推定肽基-脯氨酰基顺-反异构酶叶绿体 3E-77 129 168
171 肽基脯氨酰基异构酶 pinusRadiata_006493  Q9C835 假定26.4 kDa蛋白(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 2E-62 128 235
172 肽基脯氨酰基异构酶 pinusRadiata_006983  AAK96784 亲环素 e-103 151 204
174 肽基脯氨酰基异构酶 pinusRadiata_007665  Q9LDC0 FKBP样蛋白(基因组DNA,染色体3,P1克隆: e-138 239 378
175 肽基脯氨酰基异构酶 pinusRadiata_012196  Q93VG0 亲环素(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) 4E-74 132 160
176 肽基脯氨酰基异构酶 pinusRadiata_013382  Q9C588 假定60.2KDA蛋白 0 288 581
177 肽基脯氨酰基异构酶 pinusRadiata_016461  O04287 免疫亲和素 9E-66 88 109
178 肽基脯氨酰基异构酶 pinusRadiata_017611  Q9C566 亲环素-40(EC5.2.1.8)(表达蛋白) e-163 276 360
179 肽基脯氨酰基异构酶 pinusRadiata_019776  AAM14253 假定20.3KDA蛋白 e-110 146 190
180 肽基脯氨酰基异构酶 pinusRadiata_020659  AAO63961 假定蛋白SPTREMBL 7E-85 159 227
181 肽基脯氨酰基异构酶 pinusRadiata_022559  AAK43974 推定肽基-脯氨酰基顺-反异构酶 2E-73 113 153
182 肽基脯氨酰基异构酶 pinusRadiata_024188  Q9P3X9 肽基-脯氨酰基顺-反异构酶(EC5.2.1.8) e-122 210 379
183 肽基脯氨酰基异构酶 PinusRadiata_027973  Q9SR70 T22K18.11蛋白(AT3g10060/T22K18_11) 3E-69 125 171
184 WD40重复蛋白 pinusRadiata_001353 Q9FNN2 WD重复蛋白样SPTREMBL 0 317 590
185 WD40重复蛋白 pinusRadiata_001978 PRL1_ARATH PP1/PP2A磷酸酯酶基因多效调控PRL1 0 341 502
186 WD40重复蛋白 pinusRadiata_002810 AAK49947 TGF-β受体互作蛋白1 0 273 326
187 WD40重复蛋白 pinusRadiata_002811 AAK49947 TGF-β受体互作蛋白1 0 273 326
188 WD40重复蛋白 pinusRadiata_002812 AAM15129 假定58.9KDA蛋白 e-127 225 521
189 WD40重复蛋白 pinusRadiata_003514 Q9FJ94 相似于肌球蛋白重链激酶SPTREMBL e-137 242 445
190 WD40重复蛋白 pinusRadiata_004104 GBB_ORYSA 鸟嘌呤核苷酸结合蛋白β亚单位 0 294 378
191 WD40重复蛋白 pinusRadiata_005595  Q9FTT9 推定DKFZP564O046 3蛋白 0 320 459
192 WD40重复蛋白 pinusRadiata_005754 Q94JT6 At1g78070/F28K19_28 SPTREMBL e-168 294 451
193 WD40重复蛋白 pinusRadiata_006463 GBLP_MEDSA 鸟嘌呤核苷酸结合蛋白β亚单位样… e-152 261 324
194 WD40重复蛋白 pinusRadiata_006665 AAM20553 假定119.9KDA蛋白 0 655 1169
195 WD40重复蛋白 pinusRadiata_006750 AAM13119 假定35.4KDA蛋白 e-156 264 312
196 WD40重复蛋白 pinusRadiata_007030 Q9LJN8 有丝分裂检测点蛋白 e-169 284 335
197 WD40重复蛋白 pinusRadiata_007854 Q8H919 含推定WD域的蛋白 0 429 644
198 WD40重复蛋白 pinusRadiata_007917 AAD10151 推定WD-40重复蛋白,MSI4 0 353 462
199 WD40重复蛋白 pinusRadiata_007989 Q9LRZ0 基因组DNA,染色体3,TAC克隆:K20I9 0 480 687
200 WD40重复蛋白 pinusRadiata_008506 MSI1_LYCES WD-40重复蛋白MSI1 0 364 420
201 WD40重复蛋白 pinusRadiata_008692 Q8W403 Sec13p e-134 218 301
202 WD40重复蛋白 pinusRadiata_008693 Q8W403 Sec13p e-137 222 301
203 WD40重复蛋白 pinusRadiata_009170 Q9MOV4 U3 snoRNP相关性样蛋白SPTREMBL e-127 244 524
204 WD40重复蛋白 pinusRadiata_009408 Q9SAJ0 F23A5.2(2形) e-171 282 350
205 WD40重复蛋白 pinusRadiata_009522 Q8RXQ4 假定43.8kDa蛋白 e-129 231 395
206 WD40重复蛋白 pinusRadiata_009734 AAO27452 Peroxisomal targeting signal type 2 receptor.SPTREMBL e-142 227 317
207 WD40重复蛋白 pinusRadiata_009815 AAM20433 细胞周期素开关蛋白 0 326 500
208 WD40重复蛋白 pinusRadiata_010670 AAN72058 表达蛋白 e-157 264 345
209 WD40重复蛋白 pinusRadiata_011297 AAM13100 WD重复蛋白ATAN11 e-157 262 337
210 WD40重复蛋白 pinusRadiata_013098 AAM13153 假定39.1KDA蛋白 e-136 229 352
211 WD40重复蛋白 pinusRadiata_013172 Q8H0T9 假定蛋白 0 437 860
212 WD40重复蛋白 pinusRadiata_013589 AAK52092 WD-40重复蛋白 0 448 512
213 WD40重复蛋白 pinusRadiata_013608 AAC27402 表达蛋白 e-141 202 358
214 WD40重复蛋白 pinusRadiata_014299 Q9XED5 细胞周期开关蛋白SPTREMBL 0 335 488
215 WD40重复蛋白 pinusRadiata_014498 Q9FH64 WD重复蛋白样 e-152 206 329
216 WD40重复蛋白 pinusRadiata_014548 Q93ZS6 假定82.2KDA蛋白 0 505 763
217 WD40重复蛋白 pinusRadiata_014610 Q9M298 假定104.7kDa蛋白 0 450 922
218 WD40重复蛋白 pinusRadiata_016090 Q9SIY9 推定WD-40重复蛋白SPTREMBL 0 442 802
219 WD40重复蛋白 pinusRadiata_016722 O22826 推定剪切因子SPTREMBL e-159 257 310
220 WD40重复蛋白 pinusRadiata_016785 AAG60193 推定WD40蛋白 0 344 464
221 WD40重复蛋白 pinusRadiata_017094 Q9LV35 WD40重复蛋白 0 406 604
222 WD40重复蛋白 pinusRadiata_017527 Q9AYE4 假定35.3kDa蛋白 e-154 254 314
223 WD40重复蛋白 pinusRadiata_017591 O80706 F8K4.21蛋白 0 905 1218
224 WD40重复蛋白 pinusRadiata_017769 Q9XIJ3 T10O24.21 0 446 607
225 WD40重复蛋白 pinusRadiata_018047 Q8VZY6 授精独立性胚乳蛋白 0 285 373
226 WD40重复蛋白 pinusRadiata_018414 Q947M8 COPI 0 455 638
227 WD40重复蛋白 pinusRadiata_018986 Q9LFE2 WD40重复蛋白 0 518 886
228 WD40重复蛋白 pinusRadiata_019479 Q9SZA4 WD重复蛋白样蛋白 e-156 276 454
229 WD40重复蛋白 pinusRadiata_020144 Q8W514 MSI型核小体/染色质组装因子C 0 288 413
230 WD40重复蛋白 pinusRadiata_022480 Q8W514 MSI型核小体/染色质组装因子C e-167 287 426
231 WD40重复蛋白 pinusRadiata_023079 Q8W514 MSI型核小体/染色质组装因子C.SPTREMBL e-169 283 397
232 WD40重复蛋白 pinusRadiata_026739 Q93YS7 推定WD重复膜蛋白SPTREMBL 0 591 918
233 WD40重复蛋白 pinusRadiata_026951 Q93VS5 AT4g18900/F13C5_70(假定蛋白) e-163 290 503
234 WEE1样蛋白 pinusRadiata_026529 Q9SRY9 F22D16.3蛋白 e-122 209 451
235 WD40重复蛋白 eucalyptusSpp_006366 Q8LF96 PRL1蛋白 0 374 492
236 WD40重复蛋白 eucalyptusSpp_017378 O22607 WD-40重复蛋白MSI4 0 371 453
237 WD40重复蛋白 pinusRadiata_000888 O22466 WD-40重复蛋白MSI1 0 364 420
238 周期素依赖性激酶抑制剂 pinusRadiata_014166 Q9FKB5 基因组DNA,染色体5,TAC克隆:K24G6(周期素依赖性 5E-42 114 304
239 CDK D型 pinusRadiata_003189 Q9M5G4 CDK活化激酶 8E-21 56 100
240 组蛋白乙酰基转移酶 pinusRadiata_009356  Q9FJT8 组蛋白乙酰基转移酶HAT B 7E-85 187 510
241 组蛋白去乙酰基转移酶 pinusRadiata_000065  Q9LPW6 F13K23.8蛋白 5E-18 71 209
242 组蛋白去乙酰基转移酶 pinusRadiata_014197  Q8GXJ1 推定组蛋白去乙酰基转移酶 e-170 308 519
243 肽基脯氨酰基异构酶 pinusRadiata_009081  Q9ZRQ9 亲环素(EC5.2.1.8)(肽基-脯氨酰基顺-反异构酶) e-106 185 190
244 肽基脯氨酰基异构酶 pinusRadiata_0313417  Q8H4T0 推定肽基-脯氨酰基顺-反异构酶蛋白 e-140 235 345
245 WD40重复蛋白 pinusRadiata_005755  Q9SKW4 F5J5.6. e-143 144 329
246 WD40重复蛋白 pinusRadiata_006670  Q9LDG7 WD-40重复蛋白样(MJK13.13蛋白) e-163 393 960
247 WD40重复蛋白 pinusRadiata_007027  Q8GWR1 假定蛋白 e-157 276 470
248 WD40重复蛋白 pinusRadiata_007276 Q9LF27 假定47.3kDa蛋白 e-138 235 428
249 WD40重复蛋白 pinusRadiata_007390 Q94AH4 推定RING锌指蛋白 3E-17 53 158
250 WD40重复蛋白 pinusRadiata_012648 O22212 含假定61.8kDa Trp-Asp重复的蛋白 0 324 561
251 WD40重复蛋白 pinusRadiata_013171 Q8H0T9 假定蛋白 0 437 860
252 周期素B eucalyptusSpp_045414 Q9LDM4 F2D10.10(F5M15.6) e-142 255 423
253 周期素依赖性激酶抑制剂 eucalyptusSpp_044328 Q9FKB5 基因组DNA,染色体5,TAC克隆:K24G6(周期素依赖性 1E-54 121 260
254 组蛋白乙酰基转移酶 eucalyptusspp_015615 Q9AR19 组蛋白乙酰基转移酶GCN5(表达蛋白) 0 390 563
255 肽基脯氨酰基异构酶 eucalyptusSpp_017239 Q8GWM6 假定蛋白 0 364 591
256 WD40重复蛋白 eucalyptusSpp_018643  Q93VS5 AT4g18900/F13C5_70(假定蛋白) 0 229 327
257 WD40重复蛋白 eucalyptusSpp_019127  Q9SRX9 F22D16.14蛋白SPTREMBL e-131 232 337
258 WD40重复蛋白 eucalyptusSpp_022624  Q9LFE2 WD40重复蛋白 0 594 868
259 WD40重复蛋白 eucalyptusSpp_032424  Q8LPL5 细胞周期开关蛋白 0 255 327
260 WD40重复蛋白 eucalyptusSpp_037472  Q9SK69 推定WD-40重复蛋白(AT2G20330/F 11A3.12) 0 461 677
序列表
SEQ ID NO:1
CAAAAGGGAAACAAGGATTTCAACTCCTGAGATTTTAAGTAACTCAATTAGCAGTTCCAA
CATTAAACCATTATTATTACCCCTTTTATCAGAGCTTTTTGACCTGGTTTAGGAAAATTCCA
AGAACTCAGCCAAATCTTCGACCTTTTTGGGTCTGCAAACCTTGCAAAGAAAGCCCAACT
CAGCTGGAGGCTGGAGATGAACATTTCCATGGCGGATTCCTGCCAAGATTCAATTTTGAT
GACGCAGAGCGCGAACCACCACCTTCCCACCGCCTAATTACTCCCTTTCCTCTCTCTC
TCTCTCTCTCTCTCTCTATAGGAATTTCGGTCTTCTTCGAAGGGGCAGTCGCACTGTATT
GGGGAAAATCTCTCCAGTGGCGAGGATGGGAGACGGGAGCCTCGGCTCCGGCGGCAG
GGGCAACAGCGGCGGCGGCGGCGGCGGCGGCAGGCCGGAGTGGCTGCAGCAGT
ACGATCTGATCGGCAAGATCGGCGAGGGCACTACGGCCTCGTTTTCCTCGCGCGAAT
CAAGCATCCATCCACCAATCGCGGCAAGTACATCGCCATCAAGAAGTTCAAGCAGTCCA
AAGACGGCGACGGCGTCTCGCCCACCGCGATCCGCGAAATCATGCTGCTTCGTGAAAT
CTCGCATGAGAATGTTGTGAAGCTCGTGAACGTTCACATTAACCCCGTGGACATGTCGTT
GTATCTGGCTTTTGATTACGCAGATCATGATCTTTATGAAATTATCAGACATCAGAGAGC
AAGGTTAACCAGGCCATCAATCCCTACACAGTTAAGTCATTGCTTTGGCAGCTGCTTAAT
GGACTGAATTATCTCCACAGTAATTGGATCATCATCGAGATCTGAAGCCATCAAATATT
CTGGTCATGGGTGAAGGAGAGGAGCAAGGCGTCGTGAAAATTGCAGACTTTGGACTTG
CCAGGGTCTACCAAGCTCCTTTGAAGCCACTATCTGATAATGGGGTTGTGGTAACTATCT
GGTATCGGGCACCAGAATTGCTGCTTGGTGCAAAGCACTATACAAGCGCTGTTGATAG
TGGGCTGTCGGATGCATTTTTGCTGAGCTTTTGACATTCAAGCCATTGTTTCAAGGTCAA
GAAGTGAAAGCCAATCCTAATCCTTTCCAACTTGATCAACTTGACAAGATTTTCAAGGTCT
TGGGTCATCCAACGCAGGAGAAATGGCCTATGCTTGTGAATCTGCCTCATTGGCAATCA
GATGTGCAACATATTCAGAGACACAAGTATGATGACAATGCGCTTGGTAATGTTGTACGT
CTCTCTAGCAAAAACGCTACATTTGACCTCCTGTCAAAGATGCTAGAGTATGATCCTCAA
AAGCGTATAACAGCAGCTCAAGCCTTGGAACATGAATATTTTCGTATGGAACCTCTTCCT
GGGCGCAATGCACTTGTACCCAGCTCACCTGGAGATAAAGTGAATTATCCTACTCCTCC
TGTAGATACAACTACAGATATTGAGGGGACAACTAGTCTTCAACCTTCACAATCGGCATC
ATCTGGAAATGCTGTTCCTGGAAATATGCCCGGTCCTCATGTTGTGACAAATAGGCCCAT
GCCTCGTCCTATGCATATGGTCGGTATGCAAAGGGTGCCAGCTTCAGGAATGGCCGGTT
ACAATCTAAATCCTAGTGGCATGGGTGGTGGAATGAATCCTAGTGGCATCCCCATGCAG
CGGGGAGTTGCGAACCAGGCCCAACAGTCTCGAAGGAAGGACCCAGGAATGGGAATGG
GTGGATACCCTCCGCAACAGAAACAAAGGCGCTTCTAAGGGATCTAATTTACTGCGAGA
AATTTCGAGACCTTTCAGAAATTAGTTTGCTCTAGTGATACTGGCTGCAGCATCATGCTT
GCTTTGGTGACACGATGCAAAGAACTAGACTGCATCTTAAGGAAGGAAAGGCAACTATC
GACTTCATTAATGGGCTCATCCTCTGGGGGCTTGAGGCTTCATCGGCATTGGTGCCGA
CTATGGCAGGATGGAATAATCAACGGCTGTTCTGCTAAATGCATTGGAAGTTGCTGCAG
ATTCACAGTGATAGATATCAATGGAGAATTGGATTCCAGCATCATAGCACTCGCAATCTT
TAGGGATCTGCCTCAATTTTTTGCTGAGGTTATAACTCGCTCTTTCAGAAATGTCATCGAT
TATTGATGGGCATGTCCATCAATCCGACACGTTTCCCTAATGTACATCAATTCTGCCAAA
AAAAAAA
SEQ ID NO:2
CTGATCTCGTCGCCGCCGGCCGCCCGTCAGAGTCGCTCCCGCCGCGGCGCGCACTCC
TCCGAACCGTTCGAATTGTTTGTCCGTTGAAAACCCGCGAGATGGAGAAGTCCAACAG
CTCGCAAAGATCGGGGAGGGCACGTACGGGATCGTGTACAAGGCCAAAGACAAGAAGA
GTGGTGAACTGCTCGCTTTGAAGAAGATTCGTTTGGAGGCGGAGGATGAGGGCATTCCC
TCCACTGCCATCCGCGAGATTTCCCTCCTCAAACAGCTCCAACACCCAAACATCGTCCG
GTTGTACGATGTCGTTCACACAGAGAAGAAGCTGACACTGGTGTTCGAGTTCTTGGACC
AGGACCTCAAAAAGTACTTGGATGCGTGCGGTGACAACGGACTCGAACCGTACACTGTC
AAATCTTTCTTGTACCAACTGCTGCAAGGCATCGCCTTCTGCCACGAGCACCGCGTGCT
CCACCGCGATCTCAAACCGCAGAATTTGCTCATCAACATGGAGGGTGAGTTGAAGCTGG
CCGACTTCGGTCTGGCGCGCGCCTTCGGTATTCCCGTCCGGAATTACACGCACGAAGT
CGTCACCCTCTGGTACCGCGCCCCCGACGTGCTGATGGGATCGCGGAAGTACTCCACG
CAAGTCGACATTTGGAGCGTCGGCTGCATCTTCGCTIGAGATGGTGAACGGTCGGCCGC
TGTTTCCCGGCTCGAGCGAGCAGGACCAACTGCTGCGCATCTTCAAAACGCTCGGGACT
CCGTCCCTCAAAACGTGGCCGGGAATGGCGGAGCTGCCGGACTTTAAGCACAATTTCC
CCAAGTATGTCGTGCAGAGCTTCAAGAAGATCTGTCCAAAGAAGCTCACAAGACCGGC
CTCGATCTGTTGTCGCGGATGTTGCAGTACGACCCCGCCAAGCGCATCTCCGCCGAGC
AGGCGATGGGCCACCCGTACTTCAAAGACCTCAAGTTGAGGAAACCGAAGGCGGCCGG
ACCGGGACCCTAAAAAAAAGCCACCCAACTCACCCAATTCTTCTTCTTTTTCCATCTTTTT
TTTTCGTTGTAGTCAAGTTGGTGGTCGGTTGTCGGGAAAATTTATTTCATTGTTTTTTTCT
TCTCGAGATTTTTCGGTTCTGTGGGTACCGTTTCCAAACAAAAAAACTTTTGCCTGCCGT
GTCTTCACTTTGCTTCCCTCTGCTATTTAGCTGTGAACTTTCTTTTTTTTCTCATTCGCTTC
TTTGGGCGCGTGTGTTCAGTGGGATAAGTGTTCTCGTTTTTCTCTGTACATACTCGCGC
TTCTACGCAGCAAATCACGAAGAGACCGTGTTTACTGAGCAAATATTTTTTTCTAATAAAA
AAAAA
SEQ ID NO:3
GCCAACGCCAACACCACCACCAGCGGCGGCAACAGCAGCAGCAGCAACAGCAACCGG
CCGCCATTACTGCCCCCTCCAGGTTGCTGTTGCATTGCGCGGCATCTTTGAACTGGAGT
TGAATGGATCAGTATGAGAAGATCGAGAAGATCGGGGAAGGAACTTATGGCGTGGTCTA
TAAGGCTATTGATCGCTCCACCAATAAGACAATTGCTCTGAAGAAAATTCGTTTGGAACA
GGAAGATGAAGGAGTTCCGAGTACTGCAATCAGAGAAATCTCTCTCTTGAAAGAAATGCA
GCATGGAAACATTGTCAAGTTGCAGGACGTAGTGCACAGTGAGAGGCGTCTATATCTAG
TTTTTGAGTACTTGGACTTGGATTTGAAAAAGCACATGGATTCATGTCCAGAATTTTCTAA
GGACACCCACACAATAAAAATGTTCCTTTATCAGATCCTGCGTGGCATTTCCTATTGCCA
TTCTCATAGAGTTTTACATCGAGATTTGAAGCCCCAGAATTTGCTGCTAGATCGTCGTAC
TAATTCATTGAAGCTAGCTGACTTTGGGCTGGCCAGGGCTTTTGGGATTCCTGTTAGGAC
ATTTACCCATGAGGTGGTGACTTTATGGTATAGAGCTCCTGAGATACTCCTTGGATCCCG
CCATTACTCAACGCCCGTTGACGTGTGGTCTGTGGGTTGTATATTTGCAGAGATGGTGA
ACCGGCGACCACTATTTCCTGGGGACTCTGAGATTGATGAATTGTTTAAGATTTTCAGAA
TAATGGGCACGCCAAATGAAGATTCATGGCCCGGAGTGACCTCTTTGCCTGATTTTAAAT
CAACCTTTCCTAAGTGGGCTTCACAGGACCTAAAAACTGTTACGCCAACTGTTGATCCAG
CTGGCATCGATCTTCTTTCTAAAATGCTGTGCATGGATCCTAGAAGAAGAATAACTGCCA
AGGTCGCTCTTGAGCACGAATACTTCAAGGACGTCGGTGTCATACCGTGAATATCATTG
CCTTACTTACTCGGTGAGACATGTCTATATGGAGTTTGTAGCATATTAATGGGTTTTGATT
GTGTGAAATGTGAATCTTTTCAATTCTTAACCGCATCTTCTTTCTCGAGTCAAATACAAGT
CCTGTTGAACTGGATTTTGGATGTTTTAGCCAGCAAAGTTGATCTCTTAAAAAAAAAA
SEQ ID NO:4
CTAACTCTCATCTAATTCAAAAATGGTTATGAAAAGTAAACTGGACAAGTACGAGAAGCTT
GAAAAGCTTGGTGAGGGTACTTACGGTGTTGTATACAAGGCCCAAGACAAGACGACCAA
GGAAATATATGCTCTTAAGAAGATCCGTTTGGAGTCTGAAGATGAAGGTATTCCAAGTAC
CGCAATTAGAGAGATTGCTCTTTTGAAAGAACTCCAACATCCTAATGTTGTCAGGATCCA
TGATGTTATCCACACCAATAAGAAGCTCATTCTGGTCTTTGAGTTTGTCGACTATGATCTT
AAAAAGTTTCTTCACAACTTTGATAAAGGTATAGACCCCAAGATTGTAAAGAGTCTTCTTT
ACCAACTGGTTAGAGGAGTGGCTCACTGCCATCAACAGAAAGTCCTTCATAGAGATCTTA
AACCTCAAAATCTGCTCGTCAGTCAGGAAGGTATTCTAAAGCTTGGTGATTTTGGTCTTG
CCAGAGCCTTTGGTATTCCAGTCAAGAACTATACAAATGAAGTCGTTACTCTATGGTACA
GAGCTCCTGATATTCTTTTGGGATCTAAGAACTACTCTACTTCTGTTGATATTTGGAGTAT
TGGGTGCATCTTTGTTGAAATGCTCAACCAAAAACCTCTCTTCCCCGGAAGCTCTGAACA
AGATCAACTTAAGAAGATCTTCAAGATTATGGGAACCCCCGATGCCACTAAATGGCCTGG
AATCGCTGAGCTACCTGACTGGAAGCCTGAAAACTTCGAGAAATATCCTGGAGAACCTTT
GAACAAAGTTTGTCCCAAGATGGACCCTGATGGTTTGGATTTACTTGATAAAATGTTGAA
ATGCAACCCAAGTGAAAGAATTGCTGCTAAGAACGCTATGAGTCATCCCTACTTCAAGGA
CATTCCTGATAACTTGAAGAAATTATATAACTAGATACAAGGTTAGCTAGGTATATAATAG
CGGTACAAGTCTTTTTTAAAGTAATTTACAAAAAGATACTTGATTATCTTAGATAAGAATTA
TATGTCTGCTAAACTTGGAATATGTTGGGTGAGAATACATATTCTATAATTATATAACAATT
TATATGTCGAAAATTATTATCTAAGCTAATAAATAACTATTAAAAAAAAAA
SEQ ID NO:5
GGGAAGGGAGGTACCAAAATAAAAAGGGAGGGCAGAGACGCGCACGCTCCCCGACCAA
AGAAGACACGCCCCCGCCCAGCGCTGGCGCTAAACACCCGCCATTTCTGCAACCAGGT
ATCGCGCCGGAAGGACTTTTGACCTGGGCTGGATGGATCAGTACGAGAAAGTGGAGAA
GATCGGTGAAGGAACATATGGTGTGGTGTATAAGGCCATTGATCGGCTCACCAATGAGA
CTATAGCTTTGAAGAAGATACGCTTGGAGCAGGAAGACGAAGGAGTACCGAGCACCGC
GATCCGAGAAATCTCTCTCCTGAAGGAAATGCAGCATGGGAATATAGTCAGGTTGCAGG
ATGTAGTGCATAGTGAGAATAGGCTGTATCTAGTTTTTGAGTATCTGGACTTGGATTTGA
AGAAGCACATGGATTCATCTCCAGACTTTGCCAAGGATCCTCGTCTGGTAAAAATATTTC
TTTATCAAATACTGCGTGGAATAGCATATTGTCACTCACATAGGGTGCTCCACAGAGATT
TAAAGCCTCAGAATCTGTTAATTGATCGGCGTACAAATGCCTTAAAGCTTGCTGACTTTG
GACTTGCAAGAGCTTTTGGTATTCCTGTTAGGACTTTTACACATGAGGTGGTGACTTTAT
GGTACAGAGCACCAGAAATACTTCTTGGTTCTCGTCACTATTCTACCCCGGTTGATGTCT
GGTCAGTTGGTTGTATATTCGCTGAGATGGTGAACCAACGTCCTTTATTTCCTGGAGATT
CTGAAATTGATGAACTGTTCAAGATCTTCAGGATCTTGGGCACGCCAAATGAGGACACAT
GGCCGGGAGTAACTGCCTTGCCAGATTTCAAGTCTGCCTTTCCTAAGTGGCCTGCGAAG
AATCTACAAGATATGGTTCCAGGTCTCAATTCAGCTGGAATCGATCTTCTATCTAAAATGC
TCTGCTTGGACCCCAGCAAAAGAATTACAGCCAGGAGTGCTCTCGAGCATGAATACTTC
AAGGACATCGGATTTGTACCCTGATTATCTTTTGCCATTTGGTCAAGATTGTTATATATTG
GCCTGTGTAGGTGATTCTGCTCTTCTTTTAAGGTGTGAGTTGTGCGTGATCCCACATATT
CACATGTTGCAGTTTCTCTGAAGCCTTGGACAAATCAAGTAGAACTTCTCTCGGCAGCAT
CAGTTTTTCTAATCCATGCCTTGTTGCTTGCTGATCCAAAAAAAAAA
SEQ ID NO:6
ATCTCCCTCCTCTCGACAAAATTCAAATTCCTCGCCGACGCCGACCACCACTTTCTCTCT
CTCTCCCGCTCAACGTTCCAAAAACACCAACCGCCCCATTTCTAGAGAGAGAAACAAGA
GAGAGAAGAGAGCGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGCGATCGG
AGATCGGAGATCGGCGGCGGCGATGGAGAAGTACGAGAAGCTGGAGAAGGTCGGGGA
GGGGACGTACGGCAAGGTGTACAAGGCCAAGGACAAGGCGACGGGGCAGCTCGTGGC
CCTCAAGAAGACGCGCCTCGAGATGGACGAGGAGGGCGTCCCCCCCACCGCCCTCCG
CGAGGTCTCCCTCCTCCAGCTCCTCTCCCAGTCCCTCTACGTCGTCCGCCTCCTCTCCG
TCGAGCACGTCGACGGCGCTCCAAGCGCAAGCCGATGCTGTACCTGGTGTTCGAGTA
CCTGGACACCGATCTCAAGAAGTTCATCGACTCGCACCGCAAGGGGCCAAACCCTAGG
CCCGTCCCCGCCGCGACCGTGCAGAACTTCCTCTACCAGCTCCTCAAGGGCGTCGCCC
ACTGCCACAGCCACGGCGTGCTCCACCGCGATCTCAAGCCCCAGAACCTGCTCGTCGA
CAAGGAGAAGGGGATCCTGAAGATCGCCGACCTCGGGCTCGGCCGCGCCTTCACCGTC
CCGCTCAAGAGTTACACGCATGAGGTCGTCACTCTCTGGTACAGAGCGCCTGAGGTGTT
GCTGGGATCCGCTCACTATTCGATCGGCGTGGACATGTGGTCTGTCGGGTGTATCTTCG
CTGAGATGGTGAGAAGGCAAGCCTTATTTCCTGGGGACTCTGAGTTTCAGCAACTGCTT
CACATATTCAGGCTATTGGGAACCCCAACTGAGAAGCAATGGCCAGGAGTTACCACTTT
GAGGGATTGGCATGTTTATCCACAATGGGAACCTCAAAACTTGGCAAGAGCAGTTCCAT
CCCTTGGACCAGATGGGGTGGACCTTCTGTCGAAAATGCTCAAATATGATCCTGCCGAA
AGGATCTCGGCTAAAGCAGCACTTGATCATCCCTTTTTTGACAGTCTCGACAAGTCTCAG
TTCTGATAATGCCTCGGATATATGGCCGAGTGTTCGCTGGACGGCCTCTTATGTTCTBAT
GTGGTCCTTGTATTTACCTTAAAAGTTACTTTCACTGCTTAAAGGGTGCTTCTGTTTGCTA
GGCACTCGACGCAGCCGTGTGTCATCATCTCGGTGATGAAGTATCATGTAACCAATATC
AATGAATCTATTTTTTGTCCTTTCAAAAAAAAAA
SEQ ID NO:7
GAGAGAGAGAGAGAGAGAGAGAGAGGGAAGAGAGAGAGAGAATGGAGAGACCGGCGAC
GGCGGCGGTGTCGGCGATGGAGGCGTTCGAGAAGCTGGAGAAGGTCGGGGAAGGGAC
GTACGGGAAGGTGTACAGGGCCCGGGAGAAGGCGACGGGCAAGATCGTCGCCCTGAA
GAAGACCCGCCTCCACGAGGACGAGGAGGGCGTCCCTCCCACCACCCTCCGCGAGAT
CTCCATCCTCCGCATGCTCTCCCGCGACCCTCACATCGTCAGGTTGATGGATGTCAAGC
AAGGTCAGAACAAAGAAGGCAAGACGGTACTTTACCTGGTGTTTGAGTACATGGAAACT
GATTTGAAGAAGTACATCCGCGGTTTCCGCAGCTCTGGAGAGAGCATCCCTGTCAACAT
TGTTAAGAGTTTGATGTACCAACTCTGCAAGGGCGTGGCTTTCTGCCATGGGCATGGGG
TCTTGCATAGGGACCTTAAGCCTCACAATCTCCTCATGGACAAAAAGACGTTGACTCTTA
AAATAGCGGATCTTGGACTTGCCAGAGCTTTCACAGTCCCAATAAAAAAATACACTCATG
AGATACTGACTCTTTGGTATAGAGCTCCTGAAGTTCTTTTGGGGGCTACTCACTATTCAA
CCGCGGTTGACATGTGGTCTGTCGGCTGTATATTTGCTGAGTTAGTCACCAAGCAAGCA
CTCTTCCCTGGAGATTCTGAACTGCAACAGCTCCTACACATTTTCAGACTGTTGGGTACT
CCAAATGAAAAGATGTGGCCAGGGGTTAGCAGTTTGATGAACTGGCATGAGTATCCACA
GTGGAACCTCAAAGCCTGTCCACCGCGGTTCCCAATCTGGACAAGGATGGGTTGGATC
TGCTTTCTCAAATGCTGCATTATGAGCCCTCGAGGAGGATTTCTGCAAAGGCAGCCATG
GAGCATCCTTACTTCGATGACGTCAACAAGACTTGCCTATGAGACTGCGCAAAGTGTGTA
GATTAGGAACCGGGATTTGGAATTTTAGCGAACTGAAAAGACATGTACTGAAACTAATT
GCTTGCCTCAAGCATGACCTTTTGTGCACCATGTGCTTCTGTCAATTAATATCTTTTCTAG
GATGATATGGTTAATGCATCCTCACGCTTATATATATATCTCAATTTGTGTCAATTGATGTT
GCAAGCATGTTATACGAAAAAAAAAA
SEQ ID NO:8
GGCAGAATTTCTTGCTTCCTCCCACCCCAGTTCCTTCTTTTTTACCGAGCAAGATTGACC
GGGGAGGACACACGTTTCACGCCTCGCTGGAGGAGCTTGCACCCTGACCGGAGCAAGG
TGGGTAGATTTGACTTGCTGATGCGGAGATTGAGCTGCTTGTGCTTGGGAGTCGCCTTG
TTCTTGCGGTGAGTGATCCGTCGGAGGATATTGGTGGCTGGTGGCTGGTGGGGTTAAG
AGGCCGTACACGAGCTGTATGGGATATGTTCAGAGATGAAAATTAGAAGCGAGACATGG
GTTGCGTGCTTGGTCGAGAGGTGTCGTCCGGCATAGTCACAGAGTCCAAAGGGCGGGA
TAGTTCGGAGGTCGAGACCAGCAAGAGGGATGATTCGGTCGCTGCGAAGGTAGAGGGA
GAGGGTAAAGCCGAGGAGGTGCGGACCGAGGAGACCCAGAAGAAGGAGAAGGTCGAA
GATGATCAGCAGTCGCGGGAGCAAAGGAGACGGTCCAAGCCAAGCACGAAGCTAGGCA
ACCTGCCAAGCATATTCGAGGAGAGCAGGTTGCTGCTGGATGGCCCTCGTGGCTCTC
GGACATATGCGGGGAGGCTCTCAATGGCTGGATTCCTCGAAGAGCGAACACGTTTGAG
AAAATTGACAAGATTGGACAAGGGACATATAGCAATGTGTACAAAGCCAAGGATTTGTTG
ACTGGTAAAATCGTTGCACTAAAAAAGGTCCGGTTTGACAACTTGGAACCTGAAGCGTG
AGGTTTATGGCACGAGAAATTCTCATTCTACGGCATCTGGATCATCCTAATGTTGTGAAG
TTGGAGGGTTTAGTTACCTCACGAATGTCCTGCAGCTTGTACCTGGTATTTGAGTACATG
GAGCATGATTTGGCGGATTAGCAGCAAGTCCAGCAATCAAATTCACTGAGCCACAGGT
CAAGTGTTACATGCATCAGCTGCTCTCTGGACTAGAGCACTGTCACAATCGCCGTGTGC
TTCACCGTGATATTAAGGGCTCAAATCTACTGATTGACAATGGAGGAGTTCTTAAAATTG
GTGATTTTGGGCTGGCTTCATTCTATGATCCAGACCACAAGCATCGAATGACAAGTCGG
GTGGTCACTCTGTGGTATCGTCCCCCTGAACTTCTACTTGGGGCCAATGATTATGGTGT
GGGTATAGACTTGTGGAGTGCTGGTTGTATACTAGCCGAGTTGTTGGCGGGGAAGCCCA
TCATGCCTGGTCGAACAGAGGTAGAGCAACTCCACAAGAATTCAAAATTATGTGGCTCAC
CTTCAGAGGAGTATTGGAAGAAATATAAGTTGCCAAATGCAACATTGTTCAAGCCTCGAG
AACCTTATAGAAGATGCATCAGGGAAACATTTAAGGACTTCCCCCCATCATCTTTGCCAC
TTATTGAAACTCTTCTGGCAATCGATCCAGCGGAACGCGGGACAGCCACTGATGCTTTG
CAAAGCGAATTCTTCAGGACGGAGCCATATGCTTGTGAACCGTCTAGCCTCCCACAGTA
TCCCCCCAGTAAGGAAATGGATGCTAAAAAACGTGATGACGAAGCTCGACGGTTGAGAG
CAGCTAGTAAAGGACAAGCAGATGGTTCTAAGAAAGAACGGACACGTGATCGACGTGTC
AGGGCTGTCCCTGCTCCAGAAGCCAATGCCGAGCTTCAGCATAACATTGATAGAAGACG
ACTCATTTCCCACGCAAATGCGAAGAGCAAAAGCGAAAAGTTCCCTCCGCCACATCAGG
ATGGAGCGCTTGGTTTCCCATTGGGAGCTTCTCATCGTTTTGATCCGGCTGTTGTCCCTC
CCGATGTCCCATTTACCTCTACCTCATTTACTTCTTCGAAAGAGCATGATCAAACATGGTC
AGGCCCACTGGTCGACCCTCCCGGTGCTCCAAGGCGAAAGAAGCACAGTGCAGGCGGA
CAACGAGAGTCTTCAAAATTATCTATGGGGACCAATAAAGGGAGAAGAGCAGATTCTCAT
TTGAAAGCATACGAAAGCAAAAGCATAGCTTAAACGATCTACCGGACGGAAGACCATTTG
TTTTACAAACACGGAAAAAGTTAGTTCGAGGGTAAACTTCACTCGGTATTTGTTATCCATG
TCAATGTTTATTTTTTTGTCCCATGGAGGATTTAAATTTTTGCCGTTACCGTTTTTTGGTTT
GTTGGGATGGTCAGTCAATGTTGGTGCTGTAAATTTTTGCTTCAGTGGCTTCTTGGTACC
AAGTATTCCATATTTTTTACACTCTGAAGAATTGAAATAAGTATTGACAAAAAAAAAA
SEQ ID NO:9
ATAGAAAGAGGATTGGAGTTCTTGAGAATTGTGAGATGGGTATTGGGTGTTGAGTGA
TTCTGTAAAGATCAATTCTTTTTAGAGTGATTCGTTGATTGTTGCAACATGGGTTGCATGT
ATTCTAAGAGTTCAGCGGTTGATGACAGCCGAGAGAGTCCCAAAGACAGAGTATCGTCT
AGTAGGCGATTATCAGAGGTGAAAACTTCAAGATTGGATTCATCGAGGAGAGAGAATGG
GTTCAGGGCAAGAGATAAGGTGGGTGATGTGAGTGTTATGTTGATCGATAAGAAAGTGA
ACGGGTCTGCTAGGTTTTGCGATGATCATGATGAGAAGAAGAGTGATCGTCTTCAGAAG
CAGAGGAGAGAGAGAGCGGAAGCAGCTGCGGCGGCTGATCACCCTGGCGCTGGTCGG
GTCCCCAAGGCGGTAGAGGGCGAGCAGGTGGCAGCCGGGTGGCCGGTGTGGCTGTCT
GCGGTGGCAGGAGAGGCCATCAAGGGATGGCTTCCGCGACGGGCGGACACTTTTGAAA
AGCTGGACAAAATTGGCCAAGGAACTTACAGTAGCGTTTACAAGGCACGTGATGTCACT
AATAACAAAATTGTGGCTCTGAAAAGAGTGCGGTTCGACAACCTCGATACTGAGAGCGT
CAAGTTCATGGCGAGGGAAATCCATATCTTGCGTATGCTTGATCATCCTAATGTTATAAA
GCTGGAAGGCTTGATAACTTCAAGGATGTCCTGTAGCCTGTACCTTGTTTTCGAGTACAT
GGAGCACGATCTTACTGGACTTGCGTCGCGGCCTGATGTAAATTTTCTGAGCCACAGA
TCAAATGTTACATGAAGCAGCTTCTAAGCGGTCTCGATCATTGTCATAAACATGGGGTCC
TACACCGGGACATAAAGGGCTCGAACCTTCTCATTGACAACAATGGCATCTTGAAGATTG
CGGATTTTGGTTTAGCCAGCGTCTTTGATCCTCATCAGACCGCTCCGCTGACAAGCCGG
GTGGTAACTTTATGGTACCGACCACCTGAACTTTTGCTTGGAGCTTCTCGCTATGGAGTT
GAGGTGGATTTGTGGAGTACCGGCTGTATACTAGGTGAACTCTATACTGGCAAGCCTAT
ATTGCCAGGGAAAACAGAGGTGGAGCAATTGCATAAAATTTTTAAGCTTTGCGGTTCACC
GTCTGATGATTACTGGAGAAGATTGCATCTTCCCCATGCAGCTGTTTTCAAGCCTCCACA
ACCTTATCGACGATGTGTTGCGGAGATATTCAAAGAACTCCCTCCAGTTGCTTTGGGCCT
CTTAGAGACCTTAATCTCTGTAGATCCTTCACAAAGAGGGACAGCAGCTTTTGCTCTCAG
GAGTGAGTTCTTTACAGCAAGTCCCCTTCCTTGTGATCCTTCAAGTCTGCCCAAATATCC
ACCAAGCAAAGAAATTGACATGAAATTGCGGGAGGAGGAAGCAAGACGGCGTGGAGCA
GCTGGAGGAAAGAAAACGAGCTTGAAAAGAGAGGAACCAAAGATTCACGAACAAATTCTGC
ATATTATCCTAATGCAGGACAATTGCAGGTCAAACAATGCCATTCCAATGCAAATGGCCG
AAGTGAAATTTTCGGCCCTTATCAAGAGAAAACTGTGTCTGGGTTCTTGGTTGCACCCCC
TAAACAGGCGCGAGTTTCCAAGGAAACAAGAAAAGATTACGCAGAGCAACCAGATAGAG
CTTCATTCTCGGGACCACTAGTTCCAGGTCCCGGATTCTCCAAGGCTGGAAAAGAACTT
GGCCATTCAATCACTGTCTCAAGAAACACCAATCTATCAACACTGTCGAGTTTAGTCACT
AGTAGAACCGGAGATAACAAACAAAAATCCGGTCCTTTAGTGTCAGAATCAGCAAACCAA
GCAAGCAGATATTCAGGACCCATAAGGGAGATGGAGCCTGCGAGGAAGCAGGACCGTA
GGAGTCATGTGCGGACGAATATTGATTATCGTTCGAGAGAAGATGGAAATTCCAGTACC
AAAGAACCTGCTCTGTACGGACGTGGGTCTGCAGGAAACAAGATCTACGTCTCAGGCCC
GTTGCTCGTCTCATCAAACAATGTGGATCAGATGCTCAAAGAGCATGACCGTCGAATCCA
AGAACATGCCAGGAGAGCACGGTTTGATAAGGCTAGAGTGGGCAATAATCATCCCCAAG
CAGCCGTCGACTCTAAGCTCGTGTCCGTGCACGATGCCGGGTAGATTCCCTGCAAAGA
GCTACTAAATGTCAGATATATCTTCAAAGCTGGGCAAGCTTACTTGCCTATGCGAAGGGA
AGCGCATATAGAGGCTTAGCAGGGTAGCTCATCCATTTGGTTTGGGATTTATCTTATATG
TATAGTCGGCATCTTTCTCTGCTCATTCTTATTTATTTTCATGGTAGCGAGGTGGGATCTC
TGGTAAGGAAGAGTCGGCCACCTTTTAGTACACTATGTAATACATAGTTTACGACAGGAT
ATAGCAGAATTAGAATAAGGTCAATCTAGAAACTGATTTGTTTCTGAATGGCGCGCCGCG
CACGCAGCAAGCGCTTGTAACCAGTGTTAAGAGAAAAAATGCATCGCATCGTCACTTGA
ACGGCCTCTTCGGCTTTAAGCGGTTTGCTTGGCGAGAGCAAGTAAGACGTTTTCAATTGT
TCTAAAAAAAAAAAA
SEQ ID NO:10
GCTGGAGTGCCGACCGAAGCACCTCCGAGTCTTCCTCCTCCCCTCCACCGCGTGCTGC
GGTGGCTGACCGCATGATACCAGATGGGTTGCATCCCCACCATCATCTCCGACGGCCG
CCGCCGCTCCGCCGCCCCCGACAAACGCCGCCCCCGCCCTCGCCGGAGCTCGAGCGA
GGGGGAGGCGCCGCCCCACGCCACCGCCGCGGGCTCTGAGGGGGGTGAGTCCGCCC
GCGGTGCTCCGGGGAAGGAGAGGCCGGAGCCGGCGCCGCGTTTCGTCGTCAGGAGCC
CGCAGGGCTGGCCGCCGTGGCTGGTCGCCGCGGTGGGGCATGCCATTGGCGAGTTCG
TCCCTCGCTGCGCCGATAGCTTCCGGAAGCTCGCCAAGATTGGGGAAGGCACGTACAG
CAACGTGTACAAGGCGAGGGATCTGGTGACGGGGAAGACGGTGGCGCTGAAGAAGGT
CAGGTTCGATAATCTGGAGGCGGAGAGCATCAAGTTCATGGCGCGGGAGATTCTGGTG
TTGACGAGGCTCAACCACCCTAACGTGATCAAGCTCGAGGGGCCGGTCACTTCGCGCA
TGTCGTCGGGCCTTTACTTGGCTTTCGAGTACATGGAACATGATCTCTCTGGAATCGCG
GCCCGCCAGAATGGCAAGTTCACTGAGCCTCAGGTCAAATGTTTTATGAGGCAGCTGTT
GTCAGGGCTTGAGCATTGCCATAATCATGATGTCCTGCACCGTGATATTAAGTGCTCCAA
TTTACTTATTGACAATGAAGGAAACCTGAAAATAGCTGATTTTGGACTGGCCACATTCTAT
GATCCTGAGCGCAAGCAAGTTATGACCAATAGAGTCGTTACACTATGGTACCGAGCACC
GGAGCTTTTGCTTGGGGCCACCAGCTATGGAATTGGCATTGACCTCTGGAGTGCAGGCT
GCATTTTGGCGGAGTTACTGTATGGAAAGCCGATTATGCCAGGACGGACAGAGGTAGAG
CAACTGCATAAGATTTTCAAATTATGTGGCTCACCATCTGAAGCATATTGGAACAAATTCA
AATTGCCAAATGCCAATATTTTCAAGCCGCCACAGCCTTATGCTCGTTGTATAGCAGAAA
CATTCAAGGACTTTCCACCATCTGCTCTGCCTCTCCTTGAGACTCTGCTTTCGATAGACC
CTGATGAACGGGGCACCGCCACAACAGCACTAAATAGTGAATTCTTTGCTGCTGAACCT
CATGCATGTGAACCATCAAGCCTGCCAAAGTATCCCCCAAGTAAAGAGATGGATCTAAA
GTTGATCAAGGAAAAGACGAGAAGAGATTCAAGCAAGAGACCCAGTGCAATCCATGGTT
CCAGAAGAGATGGAATTCATGATCGAGCTGGGAGGGTAATTCCAGCTCCAGAAGCCACT
GCAGAGAATCAGGCAACTCTCCATAGGCCGAGAGCTATGAAGAAAGCCAACCCCATGAG
CAGAAGCGAGAAATTCCCACCAGCCCATATGGATGGAGTGGTTGGATCTTCTGCTAACG
CATGGCTATCTGGTCCAGCATCAAATGCCGCACCTGATTCCCGTCGACACCGTTCACTT
AATCAGAACCCATCAAGTTCAGTTGGAAAGGCTTCAACTGGATCCTCCACAACCCAGGA
AACGTTGAAGGTGGCTCCCGAGTTGTTGCAGGTGGGAAGCTCATCCTTGCATCCATGCC
ATCGGATGCTTGTTTATGGATCCAATCTCACGATCAGAAGTAAATAGTTGGTGCCTCTGG
CTCTGGATGCTTTCTGTAAATGGGAAAAGAAGAGCAGCACACAGACTCGATAATCTTTTT
GTTGGCTAGGATCCTTTAGATTCTTTTTCCCCACACAAGTTCTGTATCTTTTTTCAGAGAG
TAACCATTTGTTGGACAAAAGAATAAAGATTCTGCATTGTCAATGTCAACCTATTCCTTCT
GTTAAAAAAAAAA
SEQ ID NO:11
GGGGTTTCCTTACTTCTTCACAGATGTAATGCACGTTCCTTCCTGCAAGAAGGTGGGGT
GTATCCTCCCCCCCGTACATGGTCGTTTCCTGATGCGGGTCAATCACCGTTTCATGTAAT
TCGATCCAATGCGGTCGGTGCATCGCAGCTGATGGGCTGCATATGCGCAAAGCAGGCG
GACCGGGGACCAGCGTCGCCTGGTTCAGGGATCTTGACTGGAGCGGGTACCGGTACC
GGTACCAGGTCGTCGAAGATCCCATCCGGTCTGTTTGAATTCGAGAAGAGCGGTGTGAA
GGAACACGGGGGTCGTAGCGGGGAATTGAGGAAGTTGGAGGAGAAGGGTTCATTGAGC
AAGAGATTGAGGTTGGAGTTGGGATTTTCACATCGGTATGTGGAAGCCGAACAAGCGGC
TGCAGGATGGCCTTCTTGGCTCACCGCCGTTGCTGGTGACGCAATTCAGGGATTGGTTC
CTCTTAAAGCAGATTCTTTCGAGAAATTGGAGAAGATAGGACAAGGCACATACAGCAGTG
TTTTCCGAGCAAGGGAGTTGGCAAATGGAAGGATGGTTGCTCTTAAGAAAGTCCGGTTC
GACAACTTTCAACCCGAGAGCATTCAATTTATGGCACGAGAGATATCGATCCTCCGCAG
GCTTGATCATCCGAATATCATGAAACTGGAGGGCATAATCACTTCTCGAATGTCGAACAG
CATTTATCTTGTGTTCGAGTACATGGAGCATGACCTTTACGGGCTAATATCTTCTCCTCA
GGTCAAGTTCAGCGATGCCCAGGTCAAATGCTATATGAAGCAGCTTCTGTCAGGAATAG
AGCATTGCCACCAGCACGGAGTGATTCATCGAGACGTAAAGTCTTCGAACATTCTGGTG
AACAATGAGGGGATTCTTAGAATAGGAGATTTCGGACTGGCTAACATTCTTAATCCAAAG
GACAGGCAACAACTCACCAGTCATGTCGTTACCTTATGGTACCGTCCTCCTGAGCTTCTC
ATGGGTTCCACAAGCTACGGCGTGACTGTGGATCTGTGGAGCGTTGGCTGTGTTTTTGC
AGAACTTATGTTTCGAAAGCCTATTCTCAGAGGGAGAACAGAGGTTGAGCAATTGCACAA
GATTTTTAAACTCTGTGGTTCCCCGCCCGATGGCTACTGGAAAATGTGTAAGGTACCCCA
GGCCACCATGTTTAGACCCCGCCATGCTTATGAATGCACTCTGCGAGAGAGATGCAAAG
GTATTGCGACCAGTGCAATGAAGCTGATGGAGACTTTTCTTTCCATAGAACCGCACAAGC
GTGGAACTGCTTCAAGTGCGCTCATATCTGAGTATTTCAGGACAGTACCATATGCATGTG
ATCCATCAAGCTTGCCCAAGTATCCTCCTAACAAAGAAATTGATGCTAAACATCGAGAAG
AGGCACGGAGGAAAAAGGCTCGTTCTAGAGTACGAGAAGCCGAAGTAGGGAAAAGGCC
AACAAGAATCCATAGAGCTTCGCAAGAGCAGGGGTTTTCCAGTAACATAGCTCCAAAAG
AGAAGAGAAGCTATGCCTGAGAGCTTGGATTCAAACCTCTTCCTTGATAGAGATGCTCAC
AAGCACCTTGCTTCTTCGCCATCAGGATGTCTCGGTTTTATAGTCGAATGCTCTTGTAAT
CCAACAATTTCAATGTTGATTGCCCAGGAGTTGCAAGGCCAGGCTCAGTTTGTTCGGAG
AAATCATGTGAGAAGTAACCTGAAGGGAGAAATTGATGTTCCATGTAGAGAGCCTCTGAA
GTCATCTTGCGATGCCTCGGCGACTTCTGAAGTGGCGGACACGTCTCAAGGACCGAGTA
TTTTATCGGGTCCAACACAGATCAGTGCGTCAAATGGCTTTGCATGGGCGAGAAGGCGA
AAGGATGATTCTGTTCTGAAAAGGTCATATAGTAGGTCTAGCGCCAGAAGTCAAGTTAGT
GCACTCGATTCATCAAGCATCGTGTTCGAACCCCATGACACGGATGCAAACGGAAGCAC
CATGCATGAACTGCGGAAACTCGGAATCGAGCTTGATCAGGATGATTCCTTTGAAGTACT
TGTGTTGAGGGCTAATACATCCCGATCTGAAGTTGAATCGATGAAGTGAGTGAATGAGAA
TGATACTTGAAGTTCGGCATCGACAAAGTGGTTACAGACATTGACAAAGTAGTTCTAGTC
ACCGTTGTTGTGAAGGTAGTTATAGCCATCGATTAGACAGTGATTAAAGTAGTACCCGTG
CCAATGAAGTGCCGTCCAAATGCTCAGCCACACTGCGGTCGACAGAATTCGATTTCTGG
TTGGCTACGAGTTGCTGATATCGCGACATGGCTGGGGCAGATGCAGTGAAAGAATTACC
CACAACATGCAGGATAGGGTGTTTGTAACACGCTTGATTTTCAACTCGAATGCTCTTCAT
ATGAATGTAACCAATTTCCTCTAGGATTTCTCTATGGACTAGGATGTAACCAGGCCTCCC
CTCAAAAGTAGATGAAACGCTTTATGCTTTATCTTTCTTTTCCTTGGAATGTACGCCTTAC
TTCCTTTAAAAAAAAAA
SEQ ID NO:12
AGGGTGACCTAAATCGTTTATACCTTGCTCAGGCTAGTGAGAGAAAGAGAGAGAGATAG
AGAGAGAGAGAGAGACAGAGAGATGGCCGTAGCAGCGCCGGGCCACCTCAACGTCAAT
GAGTCCCCGTCGTGGGGATCGCGGAGCGTCGACTGCTTCGAGAAGCTCGAGCAGATCG
GCGAAGGCACTTACGGTCAAGTTTACATGGCTAAAGAGAAAAAAACTGGTGAAATCGTG
GCTCTGAAGAAGATTCGGATGGACAATGAGAGAGAAGGGTTTCCAATAACTGCCATACG
TGAGATCAAAATTTTGAAGAAACTCCATCATGAAAATGTGATAAAGCTGAAGGAAATCGT
GACTTCTCCAGGTCCTGAAAAAGATGAGCAAGGAAGGCCAGAGGGAAACAAGTATAAGG
GTGGCATCTACATGGTTTTTGAATACATGGATCATGATCTAACTGGCCTTGCCGACCGTC
CAGGGATGAGATTTTCTGTTCCCCAAATAAAGTGCTATATGAGACAGCTTTTGACAGGGC
TTCATTATTGTCATATCAATCAAGTTCTTCATCGTGATATAAAAGGGTCTAATCTTCTTATT
GACAATGAGGGAAACCTGAAGCTTGCAGACTTTGGCCTCGCTCGATCTTTCTCAAATGAT
CACAATGCAAACCTCACCAACCGTGTTATAACATTGTGGTATAGACCTCCGGAGTTGCTG
CTTGGGGCAACAAAATATGGTCCAGCTGTTGATATGTGGTCTGTGGGTTGCATCTTTGCC
GAACTTCTCCATGGGAAGCCAATTTTTCCTGGAAAAGATGAGCCAGAGCAATTGAACAAG
ATTTTTGAGCTTTGTGGAGCTCCGGATGAAATTAATTGGCCTGGTGTCTCGAAGATTCCC
TGGTACAACAACTTCAAGCCAACCCGGCCAATGAAGAGACGCCTCAGGGAGGTATTCAG
ACATTTTGACCGGCATGCTTTGGAGTTACTAGAAAGAATGTTAACACTGGATCCTTCTCA
GAGAATTTCTGCCAAGGATGCGTTGGATGCAGAGTATTTCTGGGCTGACCCATTGCCTT
GTGATCCAAAGAGTTTGCCTAAATATGAGTCATCTCATGAGTTCCAGACGAAGAAGAAGC
GCCAGCAGCAGAGGCAACACGAGGAAACAGCCAAGCGCCAGAAACTGCAGCATCCTCC
TCAGCATCCCCGGCTGCCACCTGTTCAGCAGTCAGGGCAAGCGCATGCACAAATGAGG
CCAGGACCTAACCAACTCATGCATGGTTCCCAGCCCCCCGTGGCGACAGGCCCGCCGG
GGCACCATTATGGAAAGCCCCGTGGACCATCGGGAGGAGCTGGGAGGTATCCTTCCAG
TGGAAACCCAGGCGGTGGGTACAATCACCCAAGTCGCGGTGGTCAAGGAGGTAGTGGA
GGTTACAACAGCGGGCCATATCCTCCTCAGGGGCGAGCTCCACCGTATGGTTCGAGTG
GAATGCCTGGTGCTGGACCTCGAGGTGGCGGGGGTAATAACTATGGTGTTGGTCCATC
GAATTATCCTCAAGGTGGTGGTGGTCCTTACGGTGGGTCTGGTGCTGGTCGTGGGTCAA
ACATGATGGGCGGTAACCGGAATCAGCAGTACGGCTGGCAGCAGTGAATTTCCACTGGT
CGCTGCTATTGTGGGCAAAAGGTTTTTGAATTGCGAGTGTAGTTTTTTATGATGACACTAT
TTCTGCGAAAGAGCAAAGAGGATCCACATACAAGAGTTGTTACGCTACACATCCTATACC
ATCAAAGGAACGTTGGAATGCCAGCATTGCTGTTTCATGATTGATACGTGTATGTAAAAT
CAATGTGGAAGCATTGTAATCTTGGCTTTGTCGAGGCTTTGCTTGAATAATGGTTTCCAA
TCGGTTTCAATGACGGCGTACAGACCCACTTGTTTCGGGAAAAAAAAA
SEQ ID NO:13
TGGCATTGGATTTGATTAGCGAGTTGACGCAACAATCTGTCCCAACCCTCTGTCTCTCAC
ATCCACCGGCTCTTCGCCTTTTCAGGGTCCAGGCTTGATCCGCAATTCTCTCCCTCTCTT
TTCCTCTCTTTCTCTCTCTCTCTCTCCCTCCTTCCTTGCCCCTCCCGCCTCACATTTCGAA
GCAAAAACCAACTTGGAGCAGACGTTATGGTTTTTTAGACATTTCCCATTTAGGATTTTCC
TCGTGGCAGCTCGGAAGTTCGGACGTTTGGTTCCCATCCTTGTTCTTGTTCCAAGGCATT
GGGGGTTTCTCAGCATAGGAGGGAAGTAGGGACGGTCATTTGCCACCTCGAAGTTAATT
TGGGTTGCAAATAAGGACGGAGAACTTTGGTTGATTTTCTGGATCAGGGTTTTTCCTCAT
TCGTTTTTGCCATAGGATTAAGCATTTCTTGGTTTTCGCATTGAAGCAATTCACCAAGGGA
AAGTTGTAGAGATTGATAGGAATTAAGATCCTCTGGTTTCTTTTGGGAATGCTGATCGAT
GACAGTCTCAATTTTAGAAAAGAGGGGTTTCTCTCGGAACCGGTACTTTGATTTGACGAA
GAGGGCAAATTTGAAGTGGGATAATGGGTTGCATTTGCACCAAAGGAATTCTTCCCGCC
CATTACAGGATTAAAGACGGGGGTTTGAAGCTTAGCAAATCTTCAAAAAGGTCTGTTGGA
TCCTTAAGGAGAGATGAACTGGCGGTTTCAGCCAATGGTGGAGGCAACGATGCTGCAGA
TCGACTGATATCAAGCCCACATGAGGTTGAGAATGAAGTGGAGGATAGGAAAAACGTTG
ATTTCAATGAGAAACTGTCAAAATCCCTTCAAAGACGAGCTACAATGGACGTGGCAAGTG
GGGGACACACACAAGCACAGTTGAAAGTGGGTAAAGTAGGTGGCTTTCCCCTTGGTGAG
AGAGGAGCACAAGTGGTAGCCGGATGGCCTTCTTGGCTGACAGCAGTGGCCGGAGAAG
CTATTAACGGATGGGTGCCTCGAAGGGCCGATTCATTTGAGAAGCTGGAAAAGATTGGG
CAAGGTACTTACAGCAGTGTATATAGAGCACGTGACCTGGAAACGAATACAATTGTAGCC
TTGAAAAAGGTTCGATTTGCTAATATGGACCCAGAGAGTGTTCGATTCATGGCAAGGGA
GATAATTATCATGCGAAAGCTTGACCATCCAAATGTCATGAAGCTGGAGGGTTTGATAAC
TTCAAGGGTTTCTGGCAGTCTGTATCTTGTGTTTGAATATATGGACCATGATCTTGCTGG
CCTCGCTGCCACTCCGAGTATAAAGCTCACTGAATCTCAGATCAAATGTTACATGCAACA
ACTACTTCGTGGACTTGAATATTGCCACAGCCATGGTGTTCTACACCGTGACATAAAAGG
CTCTAACTTGTTAGTTGACAACAATGGCAATCTCAAAATTGGAGACTTTGGATTGGCAAC
TTTTTTCCGGACCAATCAAAAGCAGCCTCTAACGAGTCGCGTAGTCACTCTCTGGTACCG
ACCTCCTGAGCTGTTGCTTGGTTCTTCAGATTATGGAGCTTCTGTGGATTTGTGGAGTTC
TGGTTGCATCCTAGCTGAATTGTTTGCTGGGAAGCCCATAATGCCTGGAAGAACAGAGG
TGGAGCAATTGCACAAAATTTTCAAACTTTGTGGGTCTCCGTCAGAAGAATATTGGAAGA
AGTCAAAATTGCCACATGCAACCATTTTCAAGCCTCAGCAACCTTACAAGCGTTGTCTCC
TGGAGACATTTAAGGATTTTCCTTCATCTGCATTGGGCCTGCTAGATGTTCTTCTTGCAG
TTGAACCAGAATGCCGTGGAACGGCTTCCTCAGCCCTTCAGAATGAGTTCTTCACATCCA
ATCCTCTTCCAAGTGATCCGTCAAGTTTGCCAAAGTATCCATCAAGCAAGGAGTTTGATG
CCAGACTTCGAGATGAGGAAGCCAGAAAACATAAAGCTACTGCCGGTAAAGCTCGTGGT
CTTGAATCAATTAGAAAGGGTTCGAAAGAGTCTAAAGTTGTGCCTACATCAAATGCCAAT
GCTGATTTAAAGGCATCCATCCAGAAGCGACAAGAGCAATCGAATCCCAGAAGTACCGG
TGAGAAACCAGGAGGAACGACCCAGAACAATTTCATTCTATCTGGACAATCAGCAAAACC
CAGTCTTAATGGATCAACACAAATTGGAAATGCAAATGAGGTTGAGGCTTTGATTGTGCC
AGACCGAGAACTCGATTCTCCAAGAGGTGGGGCCGAGTTGAGAAGGCAGAGATCTTTCA
TGCAAAGAAGAGCATCGCAGTTATCCAGGTTTTCTAATTCTGTTGCAGTGGGAGGTGATT
CGCATCTCGATTGTAGTAGGGAGAAGGGTGCTAACACCCAATGGAGGGATGAGGGTTTT
GTTGCCAGGTGTAGTCATCCAGATGGTGGTGAATTAGCGGGAAAGCATGACTGGTCACA
TCATTTGTTACATAGGCCGATATCTTTGTTTAAGAAGGGGGGAGAGCACTCCCGAAGGG
ATTCCATTGCGAGTTATTCTCCCAAAAAGGGCCGAATCCACTATTCTGGACCTTTGCTTC
CCTCAGGAGATAACCTCGATGAAATGCTCAAGGAGCATGAGCGGCAAATACAAAATGCC
GTCCGGAAAGCTCGCCTTGATAAGGTCAAGACGAAGAGAGAGTATGCCGATCACGGGC
AGACGGAGTCACTTCTCTGTTGGGCAAATGGTAGGTGATTGTAAGGATGACAAGGAGTA
AGTGCATTTCTCTCTTGAAGCAAGGAGGCCCATGAGACGGCAACAATGTCGACAGGAAG
CAGAATTCCTATCCAGAAGCGGTAAAAGATATGATCGACACAAGCATTTTGTGTTGGAGC
CTCAGCTAATTGTATGTCATCGAGTACTTTTTCGAAGCACTTGATGATGCTCCTTGTGTC
GTATGAAATCCGAAGCAATCGGATTTGGTAAAATCAGGTGGCGGTCTTGTTTGGTGCTC
CAACAATGGAAAGGGCTGTATATTGTCAATGTTTACTCTGATTGTTGGAAGGATTCTCTG
TACAGTGATGGCCCATCCCTATCTATTCATTGACTGTTCCCTGAAAAAAAAAA
SEQ ID NO:14
CTTCGAGCTCCGATGGACCCGGACCCGAGCCCGGACCCGGACCCGCCGAAGAGCTGG
AGCATCCACACCCGGCGGGAGATCATCGCCCGGTACGAGATCCTGGAGCGCGTCGGCT
CCGGCGCCTACTCCGACGTCTACCGCGGCCGCCGCCTCTCCGACGGCCTCGCCGTCG
CCCTCAAGGAGGTCCACGACTACCAGTCCGCCTTCCGCGAGATCGAGGCCCTCCAGAT
CCTCCGCGGCTCCCCCCACGTCGTCCTACTCCACGAGTACTTCTGGCGGGAGGACGAG
GACGCCGTCCTCGTGCTCGAGTTCCTCCGCAGCGACCTCGCCGCCGTGATCGCCGACG
CCAGCAGGCGGCCGAGGGACGGCGGCGGCGGGGGGGCGGCGGCGCTGCGGGCCGG
CGAGGTCAAGCGGTGGATGCTGCAGGTCTTGGAAGGGGTCGATGCTTGTCACCGGAAC
TCGATCGTTCATCGCGACTTGAAGCCCGGGAATTTGCTCATATCGGAGGAGGGAGTGCT
TAAGATTGCTGATTTTGGGCAGGCAAGGATACTCCTGGATGATGGAAATGTTGCTCCAG
ACTATGAGCCTGAATCATTCGAAGAGAGATCATCGGAACAGGCTGATATCCTTCAGCAG
CCAGAAACTATGGAGGCAGATACCACATGCCCTGAAGGTCAAGAGCAGGGAGCTATCAC
TAGGGAGGCATACCTCAGAGAGGTGGACGAATTCAAGGCTAAAAACCCTAGGCATGAAA
TCGACAAGGAAACAAGCATATTTGATGGCGATACTTCTTGTCTGGCCACATGCACGACCA
GTGACATTGGAGAAGATCCTTTTAAAGGTTCCTATGTTTATGGGGCCGAAGAAGCTGGA
GAAGATGCACAAGGCTGTCTCACATCTTGTGTTGGGACACGCTGGTTCAGAGCACCTGA
ACTGCTCTATGGGTCCACAGACTATGGGCTCGAGGTTGATCTCTGGTCACTGGGATGCA
TTTTTGCTGAGCTTTTGACTCTGGAACCCCTTTTCCCTGGGATTTCCGATATCGACCAAC
TTAGTAGAATCTTCAATGTTTTGGGCAACCTGAGCGAGGAAGTCTGGCCAGGCTGTACG
AAACTTCCAGATTATAGAACAATTTCATTCTGCAAAATCGAAAACCCCATCGGTTTGGAAT
CCTGCCTGCCGAACTGCTCAAGTGATGAAGTCTCTTTAGTTCGGCGACTTCTTTGCTACG
ATCCAGCTGCAAGAGCCACTCCCATGGAACTGTTGCAAGACAAGTACTTCACTGAAGAA
CCACTTCCAGTTCCAATTAGTGCGCTTCAGGTGCCCCAGTCAAAGAATAGCCATGATGA
GGACTCTGCTGGTGGTTGGTATGACTACAATGACATGGACTCGGATTCTGATTTTGAGG
ACTTTGGCCCTTTGAAGTTCACACCTACAAGTACTGGTTTCTCTATACAGTTCCCCTAGTC
ATCGTGAGAATGCAGGGATCAAATTTGTGAGTACTTGCTAAAATTTTTGCTACGGATAAT
GTTGTGAGGCGAGGCAGTCGAAATTACGGAGGTTGACTTCTTTCGACATTCATATGCTTG
CTGTTATAGACAAGGGGGACGTGTCTGTAAAAGACCATGAGAGGGTTATATTCTCGGTG
TCCTTTTGTCTTGTAAATTTCCTTCACCAGAGACACAAATTCCCATCGAAAAAAAAAA
SEQ ID NO:15
CCAAAAAAGAAGAAAAGAGAAGAAGAGACGTTCCGTCTGACTGGTCTCGCTGCAAAAGC
TCCGCCGGGAGTCCCCTCCACGCCGCCGTCACTCTCTCCCTCTTTGAGCTCCGATGGA
CCCGGACCCGAGCCCGAGCCCGGACCCGCCGAAGAGCTGGAGCATCCACACCCGGCG
GGAGATCATCGCCCGGTACGAGATCCTGGAGCGCGTCGGCTCCGGCGCCTACTCCGAC
GTCTACCGCGGCCGCCGCCTCTCCGACGGCCTCGCCGTCGCCCTCAAGGAGGTCCAC
GACTACCAGTCCGCCTTCCGCGAGATCGAGGCCCTCCAGATCCTCCGCGGCTCCCCCC
ACGTCGTCCTACTCCACGAGTACTTCTGGCGGGAGGACGAGGACGCCGTCCTCGTGCT
CGAGTTCCTCCGCAGCGACCTCGCCGCCGTGATCGCGGACGCCAGCAGGCGGCCGAG
GGGCGGCGGGGTGGCGCCGCTGCGGGCCGGCGAGGGCAAGCGGTGGATGCTGCAGG
TCTTGGAAGGGGTCGATGCTTGTCACCGGAACTCGATCGTTCATCGCGACTTGAAGCCC
GGGAATTTGCTCATATCGGAGGAGGGAGTGCTTAAGATTGCTGATTTTGGGCAGGCAAG
GATACTCCTGGATGATGGAAATGTTGCTCCAGACTATGAGCCTGAATCATTCGAAGAGA
GATCATCGGAACAGGCTGATATCCTTCAGCAGCCAGAAACTATGGAGGCAGATACCACA
TGTCCTGAAGGTCAAGAGCAGGGAGCTATCACTAGGGAGGCATACCTCAGAGAGGTGG
ATGAATTCAAGGCTAAAAATCCTAGGCATGAAATCGACAAGGAAACAAGCATATATGATG
GCGATACTTCTTGTCTGGCCACATGCACGACCAGTGACATTGGAGAAGATCCTTTTAAAG
GTTCCTATGTTTATGGGGCCGAAGAGGCTGGAGAAGATGCACAAGGCTCTCTCACATCT
TGTGTTGGGACACGCTGGTTCAGAGCACCTGAACTGCTCTATGGGTCCACAGACTACGG
GCTCGAGGTTGATCTCTGGTCACTGGGATGCATTTTTGCTGAGCTTTTGACTCTGGAACC
CCTTTTCCCTGGGATTTCCGATATCGACCAACTTAGTAGAATCTTCAATGTTTTGGGCAA
CCTGAGTGAGGAAGTCTGGCCAGGCTGTACGAAACTTCCAGACTATAGAACAATTTCATT
CTGCAAAATCGAAAACCCCATCGGTTTGGAATCCTGCCTGCCGAACTGCTCAAGTGATG
AAGTCTCTTTAGTTCGGCGACTTCTTTGCTACGATCCAGCTGCAAGAGCCACTCCCATGG
AACTGTTGCAAGACAAGTACTTCACTGAAGAACCACTTCCAGTTCCAATTAGTGCGCTTC
AGGTGCCCCAGTCAAAGAATAGCCATGATGAGGACTCTGCTGGTGGTTGGTATGACTAC
AATGACATGGACTCGGATTCTGATTTTGAGGACTTTGGCCCTTTGAAGTTCACACCTACA
AGTACTGGTTTCTCTATACAGTTCCCCTAGTCATCGTGAGAATGCAGGGATCAAATTTGT
GAGTACTACGTAAAATTTTGCTACGGAGGCGAGGCAGTCGAAATTACGGAGGTTGACTT
CTTTCGACATTCATATGCTTGCTGTTATAGACAAGGGGGACGTGTCTGTAAAAGACCATG
AGAGGGTTATATTCTCGGTGTCCTTTTTTCTTGTAAATTTCCTTCACCAGAGACACAAATT
CCCATCGACTTGTCAAAAAAAAAA
SEQ ID NO:16
GGAATCGAGAAAAGTCCCTCTCCCTCTCTCCATCTCTCCATCTCTCCTTCCTTTTTTG
GAAGAAAGGTTGTGGGGGATACATCATCGCCGTTGGGAAACGGAGAGGGGGTTACCGG
AGCGGGCGGGGGGAGATAGCGATTCGTGTTGAACTGCTGGTCTCGCCGCCGCCGCCG
CCGCTCCTCGAGATGTCGAACCAGCACCGGCGCTCCTCCTTCTCGTCCTCCACGACGTC
GTCCCTCGCCAAGCGCCACGCCTCCTCCTCCTCCTCCTCCTTGGAGAACGCCGGGAAG
GCCTTCGCCGCCGCCGCCGTCCCGTCGCACCTCGCCAAGAAGCGGGCCCCCCTCGGC
AACCTGACCAACCTCAAGGCCGGCGATGGCAATTCCCGCAGCTCATCGGCACCATCCA
CTTTGGTGGCTAATGCGACGAAACTGGCAAAGACGAGGAAGGGATCTTCTACTTCCAGC
TCCATCATGGGCCTCTCGGGAAGTGCTTTACCGAGATATGCTAGCACGAAACCCAGTGG
AGTTCTTCCTAGCGTTAATCCTTCCATTCCAAGAATAGAGATAGCCGTTGATCCCATGTC
GTGCAGCATGGTTGTTTCGCCCAGTAGATCTGACATGCAATCGGTTTCGCTGGATGAGA
GCATGTCCACCTGCGAGTCTTTCAAGAGTCCCGACGTTGAGTATATCGACAATGAGGAT
GTTTCGGCAGTTGATTCTATCGACAGGAAGACATTTAGCAATCTTTATATCTCAGATGCT
GCAGCAAAAACAGCGGTTAACATTTGCGAGAGAGATGTACTCATGGAAATGGAAACAGA
TGAGAAGATTGTCAATGTTGATGACAACTACTCGGATCCGCAACTCTGTGCAACCATTGC
TTGTGACATTTACCAGCACTTACGTGCATCTGAGGCCAAGAAGAGACCTTCCACTGATTT
TATGGATAGAGTACAAAAGGATATAACTGCCAGCATGCGTGCCATACTAATTGATTGGCT
TGTGGAGGTGGCTGAAGAATACAGGCTCGTACCTGATACACTGTACCTGACTGTTAACT
ACATAGATCGGTATCTTTCAGGAAATGTGATGAATAGACAACGTCTGCAGCTGCTTGGTG
TTGCTTGCATGATGATAGCTGCCAAGTACGAGGAGATCTGTGCACCTCAGGTGGAAGAG
TTCTGTTATATTACCGACAATACGTACTTCAAGGAGGAGGTATTGCAAATGGAATCTTCT
GTGTTGAATTACTTGAAGTTTGAAATGACTGCTCCCACTGTCAAGTGCTTTTTAAGACGAT
TTGTTCGTGCTGCGCAAGGCGTGAATGAGGTTCCATCTTTGCAACTGGAGTGCATGGCC
AACTATATTGCAGAACTCTCTCTACTAGAGTACGATATGCTTTGTTATGCTCCATCTCTTG
TAGCTGCATCAGCAATATTTCTGGCCAAATTTGTCATCACCCCTTCGAAGAGACCATGGG
ACCCGACGTTGCAGCATTACACTCTCTACCAGCCTTCCGATCTGGGAAACTGTGTCAAG
GATCTGCATCGGCTGTGTTTTAACAACCATGGTTCGACCTTACCAGCAATCAGGGAGAA
GTACAGCCAGCATAAGTACAAATACGTGGCCAAGAAGTATTGCCCTCCCTCGATACCTC
CGGAGTTTTTCCACAATCTTGTCTATTAGCCGTTGACGACACTCTGCTCCAATGCTCCGT
TGGAACCTATCCTCCGTTTGCGCACCAGCAACTCTTACGTGCTTTCGCCCCATAATCCG
GTGATTGTCAAAGAGTGTTGATCTGTATTCACTTCGTAAGATTTTAGGATCTCTGTCCCTT
TCCCCATCTTCGTTGCTGAAGTGTAAATCCCCTCCTCTCTTTCGAGTTCCGTCATTTTATT
TTGCACCGTTGATGATGTTGTACTCTGGTGCACGCGGTTGATTTTAGCCCGGCTGCATG
GCCGTACGGAGGATTAGTAGCTTCTTTATGATGAAATGTCGTGGCAGATTTCTTTTTTTCT
TTCTGTTCTGTTTTGATTGCTCTTTTTTCCTCCATAGAATGAGCTCACTTCTGCGCAGAAG
TTGGTGAAGATACCGGTAAGTTGACTGCTAATGTGAAATTTGAGCTCCCAAAAAAAAAA
SEQ ID NO:17
GCTTTCTCTTTCCTCCTGGCGAGCTTTCTCTCTCTACATCTCTCTATGCGGACCTCCTCC
GAGGGGTCGCGGAGCTCCAGCACAACAGCGTTACTTGTTCTTTACACTTGGCTTTTCGA
ATTTCATATCGAGCATCTGCAGAATGTTTGTTCCCTCTTTGTCATGATACAAGGATAGTGG
GTGCAATGAGGAGCCATTTGATTTGATGACACGTACTTCCTTGGGAGGAGTGCAAATAT
GAACAAAGAAAATGCAGTTGGAACAAAAAGTGAAGCACCCACTATCCGAATTACCCGATC
AAGGTCTAAAGCATTGGGCACGTCAACAGGGATGCTCCCATCCTCAAGGCCCTCCTTTA
AACAGGAACAAAAGCGTACTGTCCGTGCAAATGCCAAGAGATCAGCATCAGACGAGAAC
AAAGGAACTATGGTTGGAAATGCTAGCAAGCAGCACAAAAAGAGAACAGTACTTAATGAT
GTTACTAACATCTTTTGTGAGAATTCATACTCGAATTGTCTCAATGCTGCCAAAGCTCAGA
CCAGCAGGCAGGGTAGAAAGTGGTCTATGAAGAAGGACAGAGATGTGCATCAAAGTGG
AGCTGTCCAAATAATGCAGGAGGATGTCCAAGCGCAATTTGTAGAAGAGTCGTCCAAAA
TAAAAGTGGCAGAGTCTATGGAAATCACTATCCCAGACAAATGGGCAAAACGAGAAAATT
CAGAGCATAGTATCTCGATGAAGGACACTGTAGCAGAGTCTTCAAGGAAACCACAAGAA
TTTATATGTGGTGAGAAGTCAGCAGCACTAGTTCAACCAAGTATCGTAGATATAGATTCA
AAACTCGAGGATCCTCAAGCTTGCACCCCATATGCACTTGACATATACAACTATAAACGC
AGCACAGAGCTTGAACGAAGACCTTCAACTATATATATGGAAACCTTGCAGAAAGATGTC
ACTCCAAACATGCGTGGAATTCTAGTTGACTGGCTTGTTGAGGTTTCTGAAGAATATAAA
CTAGTTCCTGATACTCTTTACCTCACTGTGAATCTCATAGACCGATCTCTCTCACAAAAGT
TTATTGAGAAGCAAAGACTCCAATTGCTTGGTGTTACTTGCATGCTAATTGCCTCGAAGT
ACGAGGAAATCTGTCCACCACGAGTGGAAGAGTTTTGCTTTATCACGGACAATACCTATA
CAAGCCTAGAGGTATTGAAAATGGAGAGTCGAGTTCTAAATCTTCTGCATTTTCAGTTGT
CCGTTCCCACCGTTAAAACATTTCTGAGGAGATTTGTTCAAGCTGCCCAAGTATCGAGCG
AGGTCCCTTCTGTTGAACTAGAATATTTGGCAAATTATTTGGCGGAGTTAACTCTCGTTG
AGTATAGTTTCTTAAAGTTTCTGCCTTCCCTTATGGCTGCATCTGCTGTACTTCTTGCCAG
ATGGACACTCAATCAATCTGACAACCCTTGGAATCTAACTCTAGAACACTACACAAAGTA
CAAGGCATCAGAGTTAAAAGCTGCGGTTCTTGCATTGGAAGATTTGCAGCTCAACACCA
GTGGTTCCACCCTAAATGCGATACGTGAAAAGTATAGACAACAGAAGGTAAACTATTCAT
TACTGATCCATAGCAAGGCCAATCATGAGATATTATAGACCACACATTTTTGCTAAACTGT
TGGAGTTCATAGAACTAGACGTTTTCCATTGACACTTGATCAACTGATCATGGCAGTTCA
AATGCGTGGCGACTTTAACTCCCTCTGAACCTGCACTTTCACTCTTCCAAAGATTTAACTA
CAGCTGATCTTCCTGAAGGGCTAACTTGAGAATAATCAAATGGAGTGACTTCCCACATTG
CGGCACTTCCAGGGTGAATTCGAGCTCCACTAATAGGATGTCTACTTACCATTTTTTGTC
GACCGAGGTTTTGTTGTCCACATTAATGTTCTCATTGTTGTAGCATTTATTTACGGTCTGG
TGCTTCCAGAACAGTCCCTTAATGATGAGGAACTCTCAGATCATGACCCCATACCAGGC
GGCAAAACATTGCGCCGCTCGGCAATAAACCTCGGAGTTTTCCTAGTCTCTAGGAACTC
GTACAGTAACCCCATTTCTTTTGCTTGCCCGAGTGTCAAGCTTTCAGATGTACTGTTTAGT
CACATTACTTCTCCGAAAGAAGATTTGGAGATCATTCTCTTTGTACTTCAATGAATTTAAC
AGAACGATACTGGAATCTCTCTGTTACTCGTTCAAAAAAAAAA
SEQ ID NO:18
GCTACTGCCCCACACACACTCTCTCTCTCTCTCTCTCTCCTTTTCCCCCAAATCAATAAGA
AGAGACGACGACTGTGTAGTAGTGAGGGAAGCGGCCGCAAACCGGGCGGTACTGAAAT
TTTGATGGAGGGTTTCGGGGGAATCTCTCTCTGCTGCTGATCTCCGGGAGGGCTGGTTC
GTCTTCGGCTTCGGGAGGGGGAGGAGCGGGAGATCGTGATTTCGAAGGAGGCAGAGAT
GGCCGGATCGGACGAGAACAACCCGCGCGTTGTCGGAGGTGCGCACGTTCAAGAGGG
CTTGCGGGTCGGAGCGGGGAAGATGGGTGCGGGGAATGTTCAGCAGAGACGAGCTCT
GAGCAACATCAACAGCAACATCATCGGGGCTCCTCCTTATCCATGCGCGGTCAACAAAA
GGGTCTTGTCCGAAAAAAATGTCAACTCTGAAAACGATCTCCTCAACGCTGCTCATCGGC
CAATTACTAGGCAGTTTGCTGCTCAGATGGCTTACAAGCAGCAACTTAGACCTGAGGAG
AACAAGAGGACGACCCAATCAGTCTCAAATCCCAGCAAATCTGAAGATTGTGCCATCTTA
GATGTGGATGACGACAAGATGGCTGATGACTTTCCGGTGCCAATGTTTGTGCAACACAC
CGAAGCAATGTTAGAAGAAATTGATCGGATGGAGGAGGTTGAGATGGAAGATGTAGCTG
AAGAACCTGTCACGGACATTGACAGCGGTGATAAAGAGAACCAGTTGGCTGTTGTTGAG
TACATTGATGACCTATACATGTTCTATCAGAAAGCCGAGGCTTCTAGTTGCGTTCCCCCA
AACTACATGGATCGGCAGCAGGATATTAATGAGCGGATGAGAGGTATACTAATTGACTG
GCTGATTGAGGTTCATTACAAGTTTGAATTGATGGATGAGACCTTGTATCTTACGGTCAA
TCTCATCGATAGATTCTTAGCTGTTCAACCTGTAGTGAAGAAAAAACTCCAGCTAGTAGG
GGTAACAGCCATGCTTCTGGCATGCAAATATGAAGAGGTCTCAGTTCCAGTAGTGGAGG
ATCTCATTCTGATTTCAGACAGGGCTTATAGCAGGAAAGAAGTTCTAGAAATGGAGCGGT
TGATGGTGAATACTTTGCACTTTAACATGTCAGTGCCTACTCCTTATGTTTTCATGAGGAG
ATTTCTTAAAGCCGCTCAATCTGACAAGAAGCTCGAGCTCTTGTCATTCTTCATCATCGA
GCTTTCCCTGGTTGAATATGACATGCTGAAGTTCCCACCCTCTTTATTAGCTGCTTCTGC
AATCTACACTGCTCTGAGTACAATTACCAGAACTAAACAGTGGAGTACAACATGTGAATG
GCACACCAGCTACTCAGAAGAACAGCTTCTGGAATGCGCCAGATTGATGGTGACTTTCC
ATCAGAGGGCTGGATCAGGGAAGCTCACTGGTGTGCACCGGAAGTATAGCACATCCAA
GTTTGGTCATGCTGCGAGAACTGAGCCTGCTAACTTTCTCTTGGACTTCCGTTTGTAGTG
GGTTGCGTGTGTCGTTGTGTACCTTGCCTAAATCAAACATAAAAAACTGACTTTTGGGCC
AGGGTGGGGGGCTTAAAAATATAGCAACATAAGTTGCCCTGCGAGTGTGTGTATAGCTG
ATTCATTTCAGCGTTTTATCTGAACATATGATGGTGATTCATTCCTTGTATCCCTTGAACA
TATATGATGCCGATTCATTTCAGCGTTTTATTTGCACCTGACAGTGGATAGCTGTTGTTTG
TTGACACTGTTCCTTAGTTGCATTTATTCATCGATTTTCATTGTTGCCAATGATGATGTTAC
TAGCAATAACATCCCTTGAACTTAAAAAAAAAA
SEQ ID NO:19
AAGCAGTGGTATCAACGCAGAGTACGCGGGGATTCTCGTTTCTTCCTGCACAGGATCTC
ACTGTGAGAGAGACCAGAGGAGAAGGCACAGGTTCTCTCGTGGGTAGGGTTTCGCCTC
GCCAATCAGATTCGGAGGAGCTTTTTGTTCTTTGATCGAAATGGCTTCAAGACCCATTGT
CCCCGTGCAAGCGAGAGGCGAGGCAGCGATAGGAGGAGGAGCGGGTAAGGCAGCGAT
AGGAGGAGGAGCGGGTAAGCAGCAGAAGAAGAACGGCGCAGCAGAAGGAAGAAACCG
CAAAGCCCTCGGCGACATTGGCAATCTCGTCACTGTTCGAGGCATCGAAGGCAAGGTTC
AGCCTCATCGCCCCATCACAAGAAGCTTCTGCGCACAACTACTGGCAAATGCACAAGCT
GCCGCGGCTGCCGAAAACAACAAGAAACAAGCTGTAGTTAACGTGAATGGAGCGCCGT
CCATTCTTGATGTACCTGGAGCAGGCAAAAGAGCCGAGCCTGCGGCGGCGGCGGCGG
CGGCGGTGGCAAAAGCTGCTCAAAAGAAAGTCGTTAAACCGAAACAAAAGGCTGAAGTT
ATCGATTTAACGTCGGATTCGGAAAGAGCGATCGAGGCCAAGAAGAAGCAACAACATCA
TGAGCCTACGAAGAAGGAAGGAGAGAAATCATCGAGAAGGAATATGCCCACTCTCACTT
CGGTCCTCACTGCTCGAAGCAAGGCAGCTTGTGGGATGACTAAAAAACCCAAAGAGAAG
GTCGTCGACATCGACGCGGGTGACGCCCACAACGAGTTGGCTGCATTCGAATATATCGA
AGACATCTACACTTACTACAAGGAGGCTGAGAATGAGAGCCTGCCCCGTAACTACATGT
CGTCCCAGCCGGAGATAAACGAGAAGATGAGGGCGATCCTGGTGGACTGGCTGATCGA
GATCCACAACAAGTTCGACCTCATGCCCGAGACCCTCTACCTCACCATCAACATCATCGA
CCGGTTCCTCTCGGTGAAGGCAGTCCCGAGGCGGGAGCTCCAGCTGCTGGGCATGGG
CGCTCTCTTCACCGCCTCCAAGTACGAAGAGATCTGGGCTCCCGAGGTGAACGATCTGG
TGTGCATCGCGGACAGAGCCTACAGCCATGAGCAAGTGCTGGCGATGGAGAAGACGAT
CCTGGGGAAGCTGGAGTGGACGCTCACCGTACCCACCCACTACGTCTTCCTCGTTCGAT
TCATCAAGGCGTCGCTCGGCGATCGCAAATTGGAGAACATGGTGTACTTCCTGGCCGAG
CTCGGGGTGATGAACTACGCGACGCTCACGTACTGCCCCTCCATGGTGGCGGCCTCGG
CGGTGTACGCGGCGCGGTGCACTCTCGGCCTAACCCCGCTATGGAACGACACCCTCAA
GCTTCACACCGGCTTCTCCGAATCCCAGCTCATGGACTGCGCGAGGCTGCTGGTGGGA
TACCACGCGAAGGCGAAAGAGAACAAGCTGCAGGTGGTGTACAAGAAGTACTCGAGTTC
TCAGAGAGAAGGCGTGGCCCTGATCCCGCCGGCCAAGGCTCTGCTTTGCGAAGGCGGT
GGTCTGTCGTCTTCTTCGTCGTTGGCGTCCTCTTCTTGACGAAAGGCTCTGCTATAACTA
CAAGAGAGAGAGGAGAGACGAGTGCACACAGCTTTTGAGTTTGTTCTCGGCCAGAGAAA
AATGACAGATTGATATCGATGATGATGACTGTCGTGTCATCAGTAGTGTGCTTTTTTTTTA
CAGTCACGGATTGGTGACTTTTGTTGTTGACTGGAATTCTTCTCCTTTCACTTGTTTGAAG
ATTCGTTTAATTCGTTGAAGGAAACATTGGCTGCTGTAAAAAAAAAAAAAAA
SEQ ID NO:20
CTCTCTCTCTCTCTCTCTCTCTCCTGCTCGTAAGATCAAAGGAGGTCGACGACGCTCTCT
CTCTCTCTCTCTAGGTCTCTGGGTGGCGGTGGGTGTTCGGGAGGGAGAGAGAAGGGGG
AGAGGTAGAGGGGGAGAGAGAAGGGAGAGATTGTGTTCTGAGGAGTTTATTGTTCTTCT
TACCAGATAAGTGAGGCAAAACGAAAAATGGGTTTGCCTGATGAGAACAATGCTGCACT
GAGCAAACCCACGAATCTCCAAGTTGGAGGGTTGGAGATCGGAGGCAGGAAATTCGGG
CAGGAGATTAGGCAGACCAGGAGGGCGTTGAGCGTGATTAACCAGAATTTAGTTGGAGA
TCGTGCTTACCCATGTCATGTCGTCAACAAGAGAGGCCACTCCAAAAGAGATGCAGTGT
GCGGGAAGGATCAGGTGGATCCAGTTCACAGACCACTTACAAGGAAGTTCGCAGCGCA
AACGGCGAGCACGCAGCAACATTGCATCGAGGAAGCAAAGAAGCCAAGAACGGCAGTT
CAAGAGAGAAATGAATTTGGAGACTGCATATTTGTGGACGTGGAGGACTGCCAACCATC
TTCAGAAAATCAACCAGTCCCTATGTTTCTGGAAATCCCAGAATCAAGATTGGATGATGA
CATGGAGGAAGTGGAAATGGAAGATATAGTGGAGGAGGAAGAGGAAGAGCCAATTATG
GACATCGATGGTCGAGATAAGAAGAACCCACTTGCAGTGGTTGACTACATTGAAGACATT
TATGCTAACTATAGGAGAACGGAGAATTGTAGCTGCGTCTCTGCTAACTACATGGCACAA
CAAGCTGACATTAATGAGAAGATGAGGTCCATCCTGATTGACTGGCTTATAGAGGTGCAC
GACAAATTCGATCTCATGCACGAGACTTTGTTCCTCACGGTCAATCTCATAGACAGATTC
TTGGCTCGGCAATCTGTCGTGAGAAAGAAGCTCCAGTTGGTGGGGTTGGTGGCCATGTT
ATTAGCATGCAAGTATGAAGAAGTCTCTGTTCCTGTTGTAGGAGACTTGATTCTAATATC
GGACAAGGCTTACACCAGAAAGGAAGTTCTTGAAATGGAAAGTTTGATGCTCAACTCGTT
GCAGTTCAACATGTCAGTGCCTACTCCGTATGTTTTCATGCGACGGTTTCTCAAGGCTGC
TGAATCTGATAAAAAGCTTGAGGTGCTATCCTTCTTCTTGATTGAGCTCTCACTGGTGGA
GTACGAGATGGTCAAGTTCCCGCCATCTCTTCTAGCCGCCGCTGCGATCTTTACAGCTC
AGTGCACCCTCTATGGTTTCAAACAGTGGACCAAGACTTGTGAATGGCACAGCAACTAC
ACAGAAGACCAGCTCCTAGAATGTGCAAGGATGATGGTAGGTTTCCATCAAAAAGCTGC
AACAGGGAAGCTCACAGGAGTACATAGAAAGTACGGTACATCAAAGTTCGGTTACACAT
CAAAATGTGAACCTGCTAACTTTCTATTAGGAGAGATGAAGAATCCATAGCAGGAGCTCA
GTGTACAGCTAACCTTTTTTTCACTGAGAAAAATTGGGTGAAAAATAAATTTTTAGCAACT
TGGGTTGCGCTTGGGAATTGGTAAAAAGGAGAAAATTCTCTAGATTTAAGCCTAACCCTC
GGTTGTTAGTCTCTTACAGCCAATTGTTTCAGATATCTCATCATTGTTTGTGATTGATTGA
ACACTAGACTAATGTTTTTCCTTTCCAATTGTAGTTCGTCTTTTATTGTAACAATAAATTGA
TAGATACTGATTCGAAATAATATTTCTCATAAAAAAAAAA
SEQ ID NO:21
GTAAGATCAAAGGAGGTCGACGACGCTCTCTCTCTCTCTCTAGGTCTCTGGGTGGCGGT
GGGTGTTCGGGAGGGAGAGAGAAGGGGGAGAGGTAGAGGGGGAGAGAGAAGGGAGA
GATTGTGTTCTGAGGAGTTTATTGTTCTTCTCACCAGATAAGTGAGGCAAAACGAAAAAT
GGGTTTGCCTGATGAGAACAATGCTGCACTGAGCAAACCCACGAATCTCCAAGTTGGAG
GGTTGGAGATCGGAGGCAGGAAATTCGGGCAGGAGATTAGGCAGACCAGGAGGGCGTT
GAGCGTGATTAACCAGAATTTAGTTGGAGATCGTGCTTACCCATGTCATGTCGTCAACAA
GAGAGGCCACTCCAAAAGAGATGCAGTGTGCGGGAAGGATCAGGTGGATCCAGTTCAC
AGACCACTTACAAGGAAGTTCGCAGCGCAAACGGCGAGCACGCAGCAACATTGCATCG
AGGAAGCAAAGAAGCCAAGAACGGCAGTTCAAGAGAGAAATGAATTTGGAGACTGCATA
TTTGTGGACGTGGAGGACTGCCAACCATCTTCAGAAAATCAACCAGTCCCTATGTTTCTG
GAAATCCCAGAATCAAGATTGGATGATGACATGGAGGAAGTGGAAATGGAAGATATAGT
GGAGGAGGAAGAGGAAGAGCCAATTATGGACATCGATGGTCGAGATAAGAAGAACCCA
CTTGCAGTGGTTGACTACATTGAAGACATTTATGCTAACTATAGGAGAACGGAGAATTGT
AGCTGCGTCTCTGCTAACTACATGGCACAACAAGCTGACATTAATGAGAAGATGAGGTC
CATCCTCATTGACTGGCTTATAGAGGTGCACGACAAATTCGATCTCATGCACGAGACTTT
GTTCCTCACGGTCAATCTCATAGACAGATTCTTGGCTCGGCAATCTGTCGTGAGAAAGAA
GCTCCAGTTGGTGGGGTTGGTGGCCATGTTATTAGCATGCAAGTATGAAGAAGTCTCTG
TTCCTGTTGTAGGAGACTTGATTCTAATATCGGACAAAGCTTACACCAGAAAAGAAGTTC
TTGAAATGGAAAAGTTGATGCTCAACTCGTTGCAGTTCAACATGTCAGTGCCTACTCCGT
ATGTTTTCATGCGACGGTTTCTCAAGGCTGCTGAATCTGATAAAAAGCTTGAGGTGCTAT
CCTTCTTCTTGATTGAGCTCTCACTGGTGGAGTACGAGATGGTCAAGTTCCCGCCATCTC
TTCTAGCCGCCGCTGCGATCTTTACAGCTCAGTGCACCCTCTATGGTTTCAAACAGTGGA
CCAAGACTTGTGAATGGCACAGCAACTACACAGAAGACCAGCTCCTAGAATGTGCAAGG
ATGATGGTAGGTTTCCATCAAAAAGCTGCAACAGGGAAGCTCACAGGAGTACATAGAAA
GTACGGTACATCAAAGTTCGGTTACACATCAAAATGTGAAGCTGCTAACTTTCTATTAGG
AGAGATGAAGAATCCATAGCAGGAGCTCAGTGTACAGCTAACCTTTTTTTCACTGAGAAA
AATTGGGTGAAAAATAAATTTTTAGCAACTTGGGTTGCGCTTGGGAATTGGTAAAAAGGA
GAAAATTCTCTAGATTTAAGCCTAACCCTCGGTTGTGAGTCTCTTACAGCCAATTGTTTCA
GATATCTCATCATTGTTTGTGATTGATTGAACACTAGACTAATGTTTTTCCTTTCCAATTGT
AGTTCGTCTTTTATTGTAACAATAAATTGATAGATACTGATTCGAAAAAAAAAA
SEQ ID NO:22
GAGACGTTGGCTTTCTGGCTTAAAGGCTATTCTTTGTGCACAATGACCTGAGGGAGGTC
TCGACAGACCACTTCTTCTCCGCCAAAGAAGAAGATGGCGATGGTACAGCGACAAGGTC
ACGACCCATCATCGCCGCAGGAGCAAGAAGACGGTCCTTCCTCCTTCTTGTCCGACGAT
GCTCTCTACTGTGAAGAAGGCAGATTCGAAGAAGACGACGGCGGCGGCGGCGGCCAAG
TTGACGGAATTCCACTCTTCCCCTCACAGCCGGCGGATCGACAGCAAGACTCGCCGTG
GGCAGACGAAGACGGCGAGGAGAAGGAGGAGGAGGAGGCGGAGCTGCAGTCGCTCTT
CTCCAAGGAGCGCGGAGCGAGGCCGGAGCTCGCGAAAGACGACGGGGGCGCCGTCG
CGGCGCGGCGGGAGGCCGTGGAGTGGATGCTGATGGTGAGGGGCGTCTACGGGTTCT
CGGCGCTCACGGCGGTGCTGGCCGTCGATTACCTCGACCGGTTCCTCGCCGGGTTCCG
CCTGCAGCGGGACAACAGGCCGTGGATGACGCAGCTGGTGGCCGTCGCGTGCCTCGC
CCTGGCCGCCAAGGTGGAGGAGACCGACGTCCCTCTCCTCGTAGAACTCCAAGAGGTC
GGGGACGCGAGGTACGTGTTCGAGGCGAAGACGGTGCAGCGGATGGAGCTCCTGGTG
CTGTCGACGCTCGGGTGGGAGATGCACCCGGTGACGCCGCTGTCGTTCGTCCACCACG
TCGCCAGGAGGCTCGGCGCGAGCCCCCACCATGGGGAGTTCACCCACTGGGCGTTCCT
CCGCCATGCGAGCGGCTCCTCGTCGCCGCCGTCTCCGATGCGAGATCGCTGAAGCAT
CTCCCGTCGGTCCTGGCCGCGGCGGCGATGCTGCGCGTCATCGAGGAGGTCGAGCCG
TTTCGTTCCTCGGAGTACAAGGCCCAGCTCTTGAGTGCCCTCCACATGAGCCAGGAAAT
GGTGGAAGACTGTTGCAGATTCATTCTGGGAATAGCAGAGACCGCGGGCGATGCCGTG
ACCTCGTCCCTCGACAGCTTCCTGAAGCGCAAGCGTCGCTGTGGTCACCTTAGCCCGA
GGAGCCCGAGCGGGGTCATCGACGCCTCGTTCAGCTGCGACGACGAGTCGAACGACTC
GTGGGCCACCGACCCGCCATCCGATCCGGACGACAACGACGATCTGAACCCTCTACCG
AAGAAGAGCAGGTCGTCGTCGCCGTCCTCCTCCCCCTCCTCGGTGCCAGACAAGGTGT
TGGACTTGCCCTTCATGAACAGGATCTTTGAGGGCATCGTCAACGGCAGTCCTATCTGA
TCGTCCCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAGAATTTGTATCGACCC
TTTTCAATTAAATCAAAGTGAAGAAAATGTGAAGTGAAAGATGAGAGCTTTGCGTTGAAG
AAACGGGAAGGGTCTGCGCTTACGTATGCATGTCTTTTTTTGGCGCTCCCTCTCGGTCT
CTTCAATGATCTTGAAGTGTCCCTTTCACTTCAGAATTTGCTTCATGTATGGGACATGGAC
AGGAGATATATATATTATGTCACCATTACAATAAAAAATTACAATTTTTCGGAAAAAAAA
SEQ ID NO:23
AGATGGCTCCGAGAGCAGGCCCATTCTTGGACCTCTCTTTCTTCTTTATCTACACCGTCT
CCATTGCCCCTCTCATGGAGTAATGATCTTGTCGTTCATCGACCCTTCTTCTTCTAAGAA
GAAGCAGAAGCAGAAGCAGAAGCAGAAGCAGGAGGAGGAGGAGCAGAAGGAGGAGAT
GGAGGCCAGTTATCAACCCCACCACCATGGTCATCTTCGTCAACACGACCCATCGAGCT
CTCAACAAGAAGAGCAAGTCCCTTTCGACGCCCTTTACTGCTCGGAGGAGCACTGGGGA
GAAGAAGACGAAGAAGAGGGATTGGCGAGCGATGGGCTCTTGTCGGAAGAGAGAGATC
ACAGATTGCTGAGCCCGCGAGCCTTGCTCGATCAGGACCTGCTCTGGGAAGACGAGGA
GCTGGCCTCCCTCTTCTCCAAAGAGGAGCCGGGCGGCATGCGCTTGAATCTGGAGAAC
GACCCGTCTTTGGCCGATGCTCGCCGCGAGGCCGTGGAGTGGATAATGAGAGTCCACG
CGCACTACGCGTTCTCCGCTCTCACGGCTCTGCTCGCGGTGAACTACTGGGATAGGTTC
ACGTGCAGCTTCGCCTTGCAGGAAGACAAGCCGTGGATGACTCAGCTCTCCGCCGTCG
CTTGCCTCTCTCTCGCCGCCAAAGTGGAGGAGACCCAAGTGCCTCTTCTCATCGATTTC
CAAGTCGAGGACAGTAGCCCCGTCTTTGAGGCGAAGAACATACAGAGAATGGAGCTCCT
GGTGCTCTCGTCGCTCGAATGGAAGATGAATCCAGTGACCCCGCTATCGTTTCTTGATTA
CATGACAAGGAGACTAGGGCTGACGGGCCATCTGTGCTGGGAGTTTCTTAGAAGGTGC
GAGAACGTCCTCCTCTCTGTAATCTCAGATTGCAGATTCACGTGTTATCTTCCTTCAGTG
ATAGCTGCTTCCACAATGCTGCACGTAATCAATGGCCTAAAGCCTCGTCTCGATGTCGAA
GACCAAACCCAGCTCCTGGGAATCCTAGCAATGGGCATGGACAAGATCGATGCTTGCTA
TAAGCTCATCGACGACGACCACGCATTGAGAAGCCAGAGATATTCCCACAACAAGCGCA
AGTTTGGATCGGTCCCCGGGAGCCCCAGAGGAGTAATGGAATTGTGCTTCAGCTCCGAT
GGCTCCAACGATTCTTGGTCCGTGGCGGCCTCGGTATCGTCCTCCCCGGAGCCTCACT
CCAAGAAGAGCAGAGCCGGCGAGGAAGCCGAAGACCGGCTTCTGCGGGGGCTCGAGG
GCGAGGAAGATGACCCCGCGAGCGCGGATATCTTCAGCTTCCCTCACTAGCGGGCACT
TCATCCGGGTCCTGGTTATCATACTCTTATATATGTTGGGGAATAACGGTTCATATGTTTC
ATGTAATGCGCAAGTTAAATTGCCAACCATCTCTCGGTTCCAGCCTAATTCCCCAGAACG
ATTGAAAACACACCAGATCGGAAGGTTGCAAGAACCATTCTCGACTGGGATTGCTGGGT
TCCTAGGCTTTTGAGATTTGGAATGGAGATGGTTGGCATTTTATCCGGATTATAACTTTAG
GGATTATGAACAAAATGAAGAGAAGAAATCGTTGGGAAGGATCCTGCTGTTCCTGTTGTT
CCTTTTTTCCCTGTACGGTCCCCCCTCCTTCTTTCCCTATTTTCTGTATCGAAGCACTTCT
GCAGCAGTTCAGCTATAGGTAGTACTTTGAAATAAAAGAGCTCATCATGAAGGGTAAGCA
TTTTTGTTTCAAAAAAAAAAA
SEQ ID NO:24
TCTCTCTCTCCCTCTCTCTCTCTCTCTCCATCCCACATTCTTGAAAGAGACACATCAGAAG
CGCAGACATGAATCTGGGCACCGGCTCGTTCTCTTTCCTGCTCCATCTATAATGAGCAAA
GCCTTCGCTCTCTTCTCCGCAGAAGGGCATGGCTCTGCAGGAGGAGGACACCCGCCGC
CACTACCCGACCGCTCCTCCGTTCTCGCCCGACGGCCTCTACTGCGAGGACGAGACCT
TCGGTGAAGATCTGGCCGACAATGCCTGCGAATACGCCGGCGGGGGAGCCCGGGATG
GCCTCTGCGAGATAAAGGACCCGACTTTGCCTCCGTCGTTGCTCGGGCAGGACTTGTTT
TGGGAAGATGGCGAGCTCGCCTCCCTGGTTTCGAGAGAGACCGGGACGCACCCCTGCT
GGGACGAGCTAATCTCTGATGGGTCAGTGGCGCTTGCTCGGAAGGATGCGGTCGGGTG
GATTTTGAGGGTCCATGGGCATTACGGGTTCCGTCCGTTGACTGCTATGTTGGCCGTGA
ACTACTTGGACAGGTTCTTTTTGAGCCGGAGTTACCAGAGGGACAGGCCTTGGATAAGC
CAGCTCGTAGCTGTGGCTTGTCTCTCCGTTGCTGCCAAAGTGGAGGAAACCCAAGTGCC
TATTCTCCTTGACCTGCAAGTGGCTAATGCGAAATTTGTGTTTGAATCGAGGACGATTCA
GAGAATGGAGCTCTTGCTGATGTCTACACTTGATTGGAGAATGAACTCGGTGACTCCGA
TTTCGTTCTTTGATCACATCCTTAGGAGGTTTGGCTTGACGACTAATTTGCACAGGCAGT
TTTTCTGGATGTGCGAGCGTTTACTTCTCTCAGTGGTTGCAGATGTGAGGCTTGCAAGTT
TTCTTCCGTCAGTTGTTGCCACGGCTGCAATGTTGTATGTTAACAAGGAGATAGAACCGT
GTATATGCAGTGAATTCCTGGACCAGTTACTGAGCCTGCTCAAGATCAATGAGGACCGA
GTAAATGAGTGCTATGAACTCATTCTTGAATTGTCAATCGACCATCCTGAGATCCTCAACT
ACAAGCACAAGCGTAAGAGGGGATCAGTACCCAGTAGCCCCAGTGGTGTGATTGATACT
TCTTTCAGCTGTGACAGTTCGAATGATTCGTGGGGTGTGGCATCGTCTGTTTCTTCTTCA
CTGGAGCCTCGGTTCAAGAGGAGCAGATTCCAGGATCAACAGATGGGCCTGCCATCTGT
GAATGTTTCATCCATGGGTGTGCTTAATAGTTCTTATTAGTCTTAGCTTATTATCTTTGATT
GGACATGCTATAAAGCAGTCTTTCCCCTGCTTAAAAAAAAAA
SEQ ID NO:25
GGTTCCTCCTCCTCCTCCTCTCTCCATTTCCGCTCTCGATTCCTCTCCTTCCGCTAAACC
CCGCAGCTTCGGAATTCCCGCTCGGGTTTTCGCTGAGAAGATGGGCCAGATCCAGTACT
CCGAGAAGTACTTCGACGACACCTACGAGTACAGGCATGTGGTTCTCCCTCCTGATGTG
GCCAAGCTTCTCCCGAAGAACCGCCTCCTTTCTGAAAATGAATGGCGTGCCATTGGAGT
GCAGCAGAGTCGTGGTTGGGTTCACTATGCAATTCATCGCCCTGAACCACACATAATGC
TATTCAGGAGGCCTCTGAATTACCAGCAGCAGCAAGAGAACCAGGCCCAGCAAAACATG
CTTGCTAAGTAGACATGTTATATTTCTAATGCTTTGAGAACAATATTACAGTATAATTAGG
GTTGGAAGCTTTAGTAAATGTTAGGATGTTTTGAAACTTGTCATTGTAATTGGCAGCAATT
CTCCTCTTTGGAGAATCTATCGTGGGACTTTGTTTAAAAAAAAAAA
SEQ ID NO:26
CCAGATCCATGGGTTCGATTGACCCGCCAAAAGCCGAACAGAACGGCACCGCGGCCGC
CGCCGTCGCCGATCCCGGCCAGAAGCCCGGCGCCGGAGACGCCATGCCGCCGCCGCC
GCCCGTCAAGCACTCCAACGGGACCGCGGCGGAGCCCGATGTTGCGACGAAGAGGAG
GAGGATGAGCGTGCTTCCCCTCGAGGTGGGCACGCGCGTGATGTGCCGCTGGCGAGA
CGGCAAGTATCACCCCGTGAAGGTCATCGAGCGGCGGAAGCTGAATCCCGGAGATCCT
AACGACTACGAGTACTATGTTCATTACACGGAATTCAACCGGAGGCTCGATGAATGGGT
GAAACTTGAACAGCTTGATCTGAATTCTGTAGAGACTGTGGTCGATGAAAAAGTGGAGG
ACAAGGTGACGGGGTTAAAAATGACACGTCACCAAAAGCGGAAGATTGATGAGACTCAT
GTCGAGGGGCATGAGGAGCTTGATGCTGCCAGCTTGCGTGAACATGAAGAATTCACCAA
AGTGAAGAATATCGCAACCATAGAGCTTGGAAGATACGAGATCGAGACATGGTACTTCT
CCCCCTTTCCACCAGAATACAATGATTGTTCGAAGCTCTACTTTTGTGAGTTTTGCCTCAA
TTTTATGAAGCGCAAGGAACAGCTTCAAAGGCACATGAAGAAGTGTGATCTTAAACATCC
CCCTGGAGATGAGATATACAGGAGCGGTACATTGTCAATGTTTGAGGTTGATGGCAAGA
AGAATAAGGTTTACGGGCAGAATCTTTGCTATTTGGCAAAACTCTTTCTAGATCACAAGA
CCCTGTATTATGATGTAGATCTTTTTCTATTTTATGTCTTGTGCGAATGTGATGATCGAGG
TTGCCACATGGTTGGATATTTCTCCAAGGAAAAGCATTCCGAGGAATCCTACAATTTGGC
TTGCATCCTCACACTCCCACCATATCAGAGGAAAGGCTATGGAAAGTTCTTAATAGCTTT
TTCATATGAACTTTCCAAAAAAGAAGGAAAAGTCGGCACGCCAGAGAGGCCTCTGTCTG
ATCTTGGGCTGTTGAGCTACAAAGGATACTGGACAAGGGTTCTGTTGGACATTTTGAAAA
AGCATAAAGCAAATATTTCTATCAAGGAGCTCAGTGACATGACAGCAATAAAGGCTGATG
ATATTTTGAACACCCTTCAGAGCCTAGACTTAATTCAGTATAGAAAAGGGCAGCACGTCA
TATGTGCAGACCCGAAAGTCTTGGGATCGTCATCTGAAAGCTGCTGGACGAGGTGGCCTA
GAGGTTGACGTCAGCAAACTGATCTGGACTCCATACAGAGAGCAAGGTTGATATTAACT
GGCAAGTTAGGATAGGCATTTCCTTTGGCTTGTATCATTACACAGTTTGTACAAAATGCA
CAGCTTGTAGTATCTCAGGCGGAATAAGCCTATGTTTCTAAAGTCGCTGACCTCACGGTC
CAATACTTTGGCCTTTGATTTGGTAGTTTTGTGGAAAATCATCGACTAGACCGATGGTCA
AAGTGGTAATCATGTAATTAAACGCGTTTGTCATTGCTCGGCAGGAA
SEQ ID NO:27
CGCACGAGGCTCTCTCTCTCTCTCTCTCTCTCTGCGCGCAATTTCCCGCCTTTCCTCCTC
CCATTTCCCGCTCGGAGAAACCCTAACGATGGCGCAGAAGCACAGCACCGCCCCCGAT
CCGGCGGCCGAACCCAAGAAGCGGCGGCGCGTCGGCTTCTCCGGCATCGATGCTGGA
GTCGACCCGAACGGCTGCTTCAAAGTGTACCTCGTGTCCAGGGAAGAAGAAGTGGGGG
CTCCGGATAGCTTCTGCCTTGATCCAGTTGACTTAAGTCACTTCTTTGAGGAGGAAGATG
GAAAAATCTATGGATATGAAGGATTGAAGATATCCGTCTGGGTAAGCTGTGTATCATTTC
ATTCATATGCAGAGATTGCCTTTGAGAGCAAATCGGATGGAGGAAAAGGAATCACAGAT
CTGAACACTGCTCTTAAGAATATGTTTGGTGAAACTCTCGTGGATAATAAAGATGACTTC
CTTCAAACTTTTTCCAAGGAGACCCAATTTATTAGGTCTACAGTTTCAGCCGGGGAGATT
TTAAAGCATAAACACTCTGACGATCATGTCAACGATTCCGTTAGTAATCTAAAAGTTGGTT
CTGATGTCGAGGCTGTGCGCATGCTGATGGGCGATATGACAGCAGGACACCTGTATAGT
CGGTTGGTCCCTCTTGTTCTGCTTCTTGTAGACGGTAGCAGTCCTATTGATGTAACGGAT
TCAAGTTGGGAGCTATATCTTCTTATTCAGAAGACAAGTGATCAGCAAGGCAATTTTCAT
GATAGGCTTCTTGGTTTTGCTGCTGTATATCGTTTCTATCACTATCCTGATAGTTCGCGAC
TGCGGCTTGGTCAGATTCTAGTATTACCACTTTACCAGCGCAAAGGCTATGGCCGCTATC
TCCTGGAGGTGCTTAACAATGTCGCAATAGCTGATGATGTTTATGACTTCACAATAGAAG
AGCCAGTGGATAATCTTCAACACTTGCGAACATGTATCGATGTGCAACGGCTTCTGAGCT
TTGACAAAGTCCAGCAGGCAGTAAATTCAACAGTATCTCAGTTGAAGCAAGGCAAACTAT
CAAAGAAAACCTACATCCCTCGGTTGTTGCCTCCTCCTAGTGTGGTTGAGGATGCCAGG
AAGCGTTTCAAAATCAACAAGAAGCAGTTTCTTCAATGTTGGGAGATTCTAGTCTATCTTG
GGCTTGATCCCGCCGATAAGAGCATACAGGGATTATTTTTCCGTCATTTCAAACCGTGTCA
GGGCAGACATTTTAGGAAAAGACTCTGAGACTGCCGGAAAGAAAGTGATTGAAGTACCG
AGTGATTTTGATCCAGAGATGTCCTTCGTCATGCATAGGGCAAAAGCAGGCGGTGAAGC
TAATGGTATCCAAGTGGAGGACAACCAGAACAAGCAAGAAGAGCAGCTGCAGCAGTTAA
TCGATGAAAGATTGAAGGACATCAAGCTGATCGCTGAGAAGGTAACTCAGAAATGATCA
CAGAAAAAAGGTCATGTAATACAAATGTTGTAGCCCTTGGAATGGAATTAACAGAGGTCC
TGTAGATTGACTGAGGTGGGCATTGCCCTTTTAGCTTATGATTTGAGAAACCCTTGCAGG
CTGCGATTAGCGGATCATGAGAGCATAGTTTTGCTTATGCTCAATCCCTAGTTTTGGGTC
CAATTTCATTAGAGGTAGATCATTTCCTGTTTTTATAACCTTCGTGTAAGTTAGGGAAGGA
GCTGCGGTCGCGGTCCTGGCATTTTCAAGCGCCTGTCTCTCGGTTCAAAAAAAAAAA
SEQ ID NO:28
CTCTTCCTCCCTCCCTCTCTCTCTGCGCGCAATTTCCCGCCTTTCCTCCTCCCATTTCCC
GCTCGGAGAAACCCTAACGATGGCGCAGAAGCACAGCACCGCCCCCGATCCGGCGGC
CGAGCCCAAGAAGCGGCGGCGCGTCGGCTTCTCCGGCATCGATGCTGGAGTCGACCC
GAACGGCTGCTTCAAAGTGTACCTCGTGTCCAGGGAAGAGGAAGTGGGGGCTCCAGAT
AGCTTCTGCCTTGATCCAGTTGACTTAAGTCACTTCTTTGAGGAGGAAGATGGAAAAATC
TATGGATACGAAGGATTGAAGATATCCGTCTGGGTAAGCTGTGTATCATTTCATTCATAT
GCAGAGATTGCCTTTGAGAGCAAATCGGATGGAGGAAAAGGAATCACAGATCTGAACAC
TGCTCTTAAGAATATGTTTGGTGAAACTCTCGTGGATAATAAAGATGACTTCCTTCAAACT
TTTTCCAAGGAGACCCAATTTATTAGGTCTACAGTTTCAGCTGGGGAGATTTTAAAGCAT
AAACACTCTGACGGTCATGTCAACGATTCTGTTAGTAATCTAAAAGTTGGTTCTGATGTC
GAGGCTGTGCGCATGCTGATGGGCGATATGACAGCAGGACACCTGTATAGTCGGTTGG
TCCCTCTTGTTCTGCTTCTTGTAGACGGTAGCAATCCTATTGATGTAACGGATTCAAGTT
GGGAGCTATATCTTCTTATTCAGAAGACAAGTGATCAGCAAGGCAATTTTCATGATAGGC
TTCTTGGTTTTGCTGCTGTATATCGTTTCTATCACTATCCTGATAGTTTGCGACTGCGGCT
TGGTCAGATCCTAGTATTACCACTTTACCAGCGCAAAGGCTATGGCCACTATCTCCTGGA
GGTGCTTAACAATGTCGCTATAGCTGATGATGTTTATGACTTCACAATAGAAGAGCCAGT
GGATAATCTTCAACACTTGCGAACATGTATCGATGTGCAACGGCTTCTGAGCTTTGACAA
AGTCCAGCAGGCAGTAAATTCAACAGTATCTCAGTTGAAGCAAGGCAAACTATCAAAGAA
AACCTACATCCCTCGGTTGTTGCCTCCTCCTAGTGTGGTTGAGGATGCCAGGAAGCGTT
TCAAAATCAACAAGAAACAGTTTCTTCAATGTTGGGAGATTCTAGTCTATCTTGGTCTTGA
TCCAGCCGATAAGAGCATACAGGATTATTTTTCCGTCATTTCAAACCGTGTCAGGGCGGA
CATTTTAGGAAAAGACTCTGAGACTGCCGGGAAGAAAGTGATTGAAGTGCCGAGTGATT
TTGATCCAGAGATGTCGTTCGTCTTGCATAGGGCAAAAGCAGGTGGCGAAACTAATGGT
ATCCAAGTGGAGGACAACCAGAACAAGCAAGAAGAGCAGCTGCAGCAGTTAATCGATGA
AAGATTGAAGGACATCAAGCTGATCGCTCAGAAGGTATCTCGGAAATGATCACAGAAAAA
AGGTCATGTAATACAAATGTTGTAGCCCTTGGAATGGAATTACCAGAGGTCCTGTAGATT
GACTGAGGTGGGCATTGCCCTTTTAGCTTATGATTTGAGAAAACCCTTGCAGGCTGCGA
TTTGCGGATCATGACAGCATAGTTTTGCTTATGCTCAATCCCTAGTTTTGGGTCCAATTTC
ATTAGAGGTAGATCATTTCCTGTTTTTATAACCTTCGTGTAAGTTAGGGAAGGTGCTGCAT
TCGCGGTCCTGGCATTTTCAAGTGCCTATCTCTTGGTTCAAAAAAAAAA
SEQ ID NO:29
CAACGAGCGAAAATCGCTATAAATTCCCTCCTAAACCCTCGCCTCACCCTTCGTCCCTGA
ACCCTCGATCTCTCTCTAGCTAGCTTTTGGCGCCCGGCAGAGCTTCCTCTCCTCCCCTC
GCCTCGACTCGCCTCCTCTCTTCGAGCTAGAAAGCTCTCGATGGCCCTTCCGATGGAGT
TCTGGGGAGTTGAAGTGAAGGCTGGACAGCCCCTTAAAGTCAACCCTGGCAATGCTAAA
ATCTTGCATCTTTCTCAGGCATCACTTGGTGAATGCAAGAGTAGCAAAGGAAATGAATCA
GTGCCTCTCCATGTGAAGTTTGGCGATCAGAAGCTTGTTTTAGGAACTCTCTCCACGGA
GAACTTCCCTCAATTAGCATTCGACTTGGTTTTTGAGAAAGAGTTTGAACTATCTCACAAC
TGGAAAAGTGGAAGTGTCTACTTTTGTGGATACAAGTCTGTCGTACATGATGATGATGAT
GAATTTTCTGATTTGGAGAGTGATTCAGAAGAGGAAGATCTTCCGATGATTGGTGTGGAA
AATGGAAAGGTTGCAGCACAAGCATCAGCTAAGACTGCTACTGCCAGTGCTAATGCTAG
CAAGGTTGAATCATCGGGAAAGCAAAAGGCCCGCATTCCACAACCAATGAAAGTTGATG
AGGATGACAGTGATGAGGATGATGACGACGAAGATGAAGATGAATCTGATGAGGAGGG
AGTTGATGGTGAGGCTGATTCTGATGAGGAGGAAGATGAAAGTGATGAGGAAGAGACTC
CAAAGAAGGCTGAAATAGGCAAGAAGAGAGCTGCGGATTCTGCAACTAAGACACCTGTC
CCTGCCAAGAAGTCAAAGTTACCTACTCCACAGAAGACAGATGGTAAGAAAGGTGGCCA
TACAGCAACTCCTCACCCTGCAAAACAGGCTGGAAAGAATCCTGCCAACAGTGCCAACA
AGTCGCAAAGCCCCAAATCAGCTGGCCAAGTCTCTTGCAAATCATGTAGCAAGACGTTC
AATTCAGACGGTGCTCTTCAGTCTCATTCAAAGGCTAAGCATGGTGGCAAGTAAATTAAA
GAACCAAGATAACCATCCGGAAATAGTCGGATGTTTTTTGTTCGTCATCTTCTGTCCCCC
CAAAAGTAGTGTTTGAGGTGTATCTCGGTGGAGGTTTTGTTGTGAGGGCTTGGTAGGTTT
TCATTCTATTGTAATGTCGGCGACGGAGATTTTGGATGGGGATTCTTTATTCTAATCAGC
AGTAATTAGTCATTGGGTTGTCAACTTTTCCCTCTTTCCTTCACTTGGTCGTGGTACATTT
TGCATGTATTGTGATGGATTTTTGACTAAAAAAAAAA
SEQ ID NO:30
CCTAGCTCGTTCCCCGCCCTGGCAACGAGCGAAAATCGCTATAAATTCCCTCCTAAACC
CTCGCCTCGCTCTTCCTCCCTGAACCCTCGATCTCTCTCTAGCTAGCTTTTGGCGCCCG
GCAGAGCTTCCTCTCCTCGCCTCGCCTCCTCTCTTCGAGCTAGAAAGCTCTCGATGGCC
CTTCCGATGGAGTTCTGGGGAGTTGAAGTGAAGGCTGGACAGCCCCTTAAAGTCAACCC
TGGCAATGCTAAAATCTTGCATCTTTCTCAGGCATCACTTGGCGAATGCAAGAGTAGCAA
AGGAAATGAATCAGTGCCTCTCCATGTGAAGTTTGGCGATCAGAAGCTTGTTTTAGGAAC
TCTCTCCACGGAGAACTTCCCTCAATTAGCATTCGACTTGGTTTTTGAGAAAGAGTTTGA
ACTATCTCACAACTGGAAAAGTGGAAGTGTCTACTTTTGTGGATACAAGTCTGTCGTACA
TGATGATGATGATGAATTTTCTGATTTGGAGAGTGATTCAGAAGAGGAAGATCTTCCGAT
GATTGGTGTGGAAAATGGAAAGGTTGCAGCACAAGCATCAGCTAAGACTGCTACTGCCA
GTGCTAATGCTAGCAAGGTTGAATCATCGGGAAAGCAAAAGGCCAGCATTCCACAACCA
ATGAAAGTTGATGAGGATGACAGTGATGAGGATGATGATGAGGACGATGACGACGAAGA
TGAATCTGATGAGGGAGTTGATGGTGAGGCTGATTCTGATGAGGAGGAAGATGAAAGTG
ATGAGGAAGAGACTCCAAAGAAGGCTGAAATAGGCAAGAAGAGAGCTGCGGATTCTGCA
ACTAAGACACCTGTCCCTGCCAAGAAGTCAAAGTTACCTACTCCACAGAAGACAGATGG
TAAGAAAGGTGGCCATACAGCAACTCCTCACCCTGCAAAACAGGCTGGAAAGAATCCTG
CCAACAGTGCCAACAAGTCGCAAAGCCCCAAATCAGCTGGCCAAGTCTCTTGCAAATCA
TGTAGCAAGACGTTCAATTCAGACGGTGCTCTTCAGTCTCATTCAAAGGCTAAGCATGGT
GGCAAGTAAATTAAAGAACCAAGATAACCATCCGGAAATAGTCGGATGTTTTTTGTTCGT
CATCTTCTGTCCCCCCAAAAGTAGTGTTTGAGGTGTATCTCGGAGGAGGTTTTGTTGTGA
GGGCTTGGTAGGTTTTCATTATATTGTAATGTCGACGACAGAGATTTTGGATGGGGATTC
TTTATTCTAATCAGCAGTAATTAGTCATTGGGTTGTCAACTTTTCCCTCTTTCCTTCACTTG
GTCGTGGTACATTTTGCATGTATTGTGATGGATTTTTGACTAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAA
SEQ ID NO:31
GAAAATTCATCGTCTTCCAGAAAATTCACCGCACAGTCCCCTCTCGAGCTTCTCCGCCGT
CAGCAATGGAGTTCTGGGGTGTGGAAGTCAAAAGTGGGGAACCCCTCAATGTCGAACC
CGGGGCTGAAACAGTTGTGCATTTGTCACAGGCCTGTCTCGGAGAAACCAAGGAAAAAA
CAAAGGAATCTGTACTTCTGTATGTCCATATTGGGGTTCAGAAACTCGTCCTTGGAACTC
TCTCTGCTGATAAGTTTCCCCAGATACCTTTTGATCTGGTGTTTGAGAAAAGTTTTAAGCT
TTCTCATAATTGGAAAAATGGAAGCGTCTTCTTCAGTGGATATAAGACGCTACTTCCATGT
GGATCCGACGCTGACAGTCCATATTCTGACTCCGACACTGATGAGGGTCTTCCAATTAAT
GTTACTGCTCAAGCTGACGTACCTGCGAAAAAGCACCAGTGACTGCTAATGCCAATGC
AGCAAAGCCCAACCTGGCTTCTGCTAAGCAAAAGGTAAAGATTGTTGAATCAAATGAGGA
TGGGAAAAATGAAGGGGACGATGATGAAGATGCTGATGTATCTTCGGATGATGATGCTG
AGGATGATTCTGGTGATGAGGACATGGTCGATGGAGGTGATGAAAGTAGCGATGAAGAT
GATGATGACAGTGAGGAAGGGGAAAGTTCTGAAGAGGAAGAGCCTAAGGCTCAACCAA
GTAAGAAGAGACCTGCTGATTCTGTACTGAAAACCCCTGCGTCAGATAAGAAATCAAAAC
TGGAAACTCCTCAAAAGACTGATGGCAAGAAGGCCAGTGAGCATGTAGCAACTCCATAC
CCTTCTAAGCAAGCTGGGAAGGCGATTGCCAGTAAAGGTCAAGCTAAGCAACAAACACC
AAACTCCAACGAGTTCTCGTGCAAACCTTGCAACCGGTCATTTAAGTCGGATCAGGCTCT
TCAATCACACAACAAAGCGAAGCATGGTGGAAGCTAAAACTTTGGGAAGTCCAAAGACA
CTGGCGATTTTGTCCGACGGAGGTGGGTGATGCCAATCGAAGAGAGCTCTGCAACCAG
GATGTTCTTTTATTATTAGTTGTGGCTTTTCATGACTCCATTCTGGAACGAACTATGGGTC
TTAACGATCTGTAACGGCTTAAAACTTTTGAAGTTGAAATACTCGTGTCCTTCAGTTAGCT
GTTCGCATTTTGAGCCCAAAAAAA
SEQ ID NO:32
GCGACGAGCGAGGGAGAGGGGAAGAGAAGGAGAGAGAGAGTCCGTGAAAAGCTTCAA
AAATCATTACTTTGGGCGGAGACATCTCACACAGGAAACAGAAGTGACGAGTGATGGAT
ACTGGAGGGAATTCTCTCCCGTCTGGGCCTGATGGGGTGAAGCGGAAGGTGTGCTACT
TCTATGACCCTGAAGTCGGCAATTACTACTTGCTCCAGCATATGCAGGTCCTCAAGCCC
GTCCCCGCGAGAGATAGGGATCTCTGCCGCTTCCATGCCGACGATTACGTCGCCTTTCT
TAGGAGTATAACCCCCGAGACGCAGCAGGATCAGCTGAGGCAGCTCAAGAGGTTCAAT
GTTGGAGAGGATTGTCCTGTGTTCGATGGCCTTCACTCTTTCTGTCAGACTTATGCCGGA
GGCTCGGTTGGCGGTGCCGTGAAGTTGAATCACGGTCTCTGCGATATCGCTATCAACTG
GGCAGGTGGGCTGCATCATGCTAAGAAGTGCGAGGCTTCTGGATTTTGCTATGTCAATG
ACATCGTGCTTGGCATATTGGAGCTCCTTAAACAGCATGAGCGCGTTTTGTATGTGGATA
TTGATATTCACCATGGGGACGGTGTTGAAGAGGCTTTCTACACAACTGATAGGGTCATGA
CCGTTTCATTTCACAAATTTGGTGATTACTTTCCCGGTACTGGCGATATACGTGACATAG
GATATGGCAAAGGGAAATATTACTCCCTCAATGTTCCATTGGATGATGGGATTGACGACG
AAAGCTACCATTCTCTATTCAAACCAATAATTGGAAAAGTAATGGAAGTGTTCAAACCCG
GTGCTGTGGTCCTTCAATGTGGCGCTGACTCTTTGTCAGGAGACCGATTAGGATGCTTC
AATCTATCAATCAAAGGACATGCAGAATGTGTCAGATACATGAGATCATTCAATGTGCCA
GTGTTGCTGCTAGGAGGCGGTGGTTACACCATTCGTAATGTTGCTCGTTGCTGGTGCTA
TGAGACTGGAGTGGCCCTTGGACTTGAAGTTGATGATAAGATGCCACAACATGAGTATTA
TGAATACTTCGGTCCTGACTATACACTTCATGTTGCTCCGAGTAACATGGAAACAAAAA
TTCGCGACAATTGCTTGAGGAGATAAGATCGAAGCTTCTCGAAAATCTCTCTAAGCTCCA
GCATGCACCCAGTGTCCCATTCCAGGAAAGACCCCCTGATACTGAGCTCCCAGAGGCTG
ATGAAGATCAAGAAGATCCGGATGAAAGATGGGATCCAGACTCTGACATGGATGTTGAC
GAAGATCGCAAGCCCCTGCCTAGCAGAGTAAAGAGAGAACTAATAGTAGAACCTGAGGT
CAAAGACCAGGATTCCCAGAAAGCATCTATAGATCATGGAAGAGGGCTTGACACAACAC
AGGAGGATAATGCATCTATAAAGGTCTCTGATATGAATTCTATGATAACAGATGAACAGA
GCGTGAAAATGGAGCAAGACAATGTGAACAAACCATCTGAGCAAATATTCCCTAAATAGA
TTGTCCATTGTTGGTAATTGACTTGTAATGCTTCTGTGGCAAATTCTTAAAGCTTATGATG
TCAAAACGTAGCTCTTTTTTGTGTGAACTATCCTGCTAAATTAAACCTCAGCAAATACTGC
TTCCTTGTTTTATAGATCCAGACATTTCGATTTTGTGTAAAGTTGACAGTTAAGCTATTGA
ACTCCCAGTACGGTTCGTTAGATTTTAAACTGATCAATGAAATCTTCATAGTTTTATGATA
AAAAAAAAA
SEQ ID NO:33
GCGAGCGAGGGAGAGGGGGAGAGGGGGAGAGAGAGAGTCCGTGAAAAGCTTCAAAAA
TCATTACTTTGGGCGGAGACATCTCACACAGAATCCGAGGAAACAGAAGTGACGAGTGA
TGGATACTGGAGGGAATTCTCTCCCGTCTGGGCCTGATGGGGTGAAGCGGAAGGTGTG
CTACTTCTATGACCCGGAAGTCGGCAATTACTACTACGGCCAAGGGCATCCGATGAAGC
CGCATAGAATCCGCATGACGCACGCCCTGCTGGCGCACTATGGCTTGCTCCAGCATATG
CAGGTCCTCAAGCCCGTCCCCGCGAGAGATAGGGATCTCTGCCGCTTCCATGCCGACG
ATTACGTCGCCTTTCTTAGGAGTATAACCCCCGAGACGCAGCAGGATCAGCTGAGGCAG
CTCAAGAGGTTCAATGTTGGAGAGGATTGTCCTGTGTTCGATGGCCTTCACTCTTTCTGT
CAGACTTATGCCGGAGGCTCGGTTGGCGGTGCCGTGAAGTTGAATCACGGTCTCTGCG
ATATCGCTATCAACTGGGCAGGTGGGCTGCATCATGCTAAGAAGTGCGAGGCTTCTGGA
TTTTGCTATGTCAATGACATCGTGCTTGGCATATTGGAGCTCCTTAAACAGCATGAGCGC
GTTTTGTATGTGGATATTGATATTCACCATGGGGACGGTGTTGAAGAGGCTTTCTACACA
ACTGATAGGGTCATGACCGTTTCATTTCACAAATTTGGTGATTACTTTCCTGGTACTGGC
GATATACGTGACATAGGATATGGCAAAGGGAAATATTACTCCCTCAATGTTCCATTGGAT
GATGGGATTGACGACGAAAGCTACCATTCTCTGTTCAAACCAATAATTGGAAAAGTAATG
GAAGTGTTCAAACCCGGTGCTGTGGTCCTTCAATGTGGCGCTGACTCTCTGTCAGGAGA
CCGATTAGGATGCTTCAATCTATCAATCAAAGGACATGCAGAATGTGTCAGATACATGAG
ATCATTCAATGTGCCAGTGTTGCTGCTAGGAGGCGGTGGTTACACCATTCGTAATGTTGC
TCGTTGCTGGTGCTATGAGACTGGAGTGGCCCTTGGACTTGAAGTTGATGATAAGATGC
CACAACATGAGTATTATGAATACTTCGGTCCTGACTATACACTTCATGTTGCTCCGAGTAA
CATGGAAAACAAAAATTCGCGACAATTGCTTGAGGATATAAGATCGAAGCTTCTCGAAAA
TCTCTCTAAGCTCCAGCATGCACCCAGTGTCCCATTCCAGGAAAGACCCCCTGATACTG
AGCTCCCAGAGGCTGTAGAAGATCAAGAAGATCCGGATGAAAGATGGGATCCAGACTCT
GACATGGATGTTGACGAAGATCGCAAGCCCCTGCCTAGCAGAGTAAAGAGAGAACTAAT
AGTAGAACCTGAGGTCAAAGACCAGGATTCCCAGAAAGCATCTATAGATCATGGAGAG
GGCTTGACACAACACAGGAGGATAATGCATCTATAAAGGTCTCTGATATGAATTCTATGA
TAACAGATGAACAGAGCGTGAAAATGGAGCAAGACAATGTGAACAAACCATCTGAGCAA
ATATTCCCTAAATAGATTGTCCATTGTTGGTAATTGACTTGTAATATGCTTCTGTGGCAAA
TTCTTAAAGCTTATGATGTCAAAACGTAGCTCTTTTTTGTGTGAACTATCATGCTAAATTAA
ACCTCAGCAAATACTGCTTCAAAAAAAAAA
SEQ ID NO:34
GGAGAAGGCAAGGCAACCCACCAGCCAAGCGGACGGAGAGAGGGTCTGCTCCTCCTC
CTTCCGTCGCAGCACCACAGCACCTTCACGCCAATGGTGCGGCCTCGCTAGCGATACA
CCGTCTCCGCCGCTTCGTTCGCAGGACGGGAAGCTCAGGATGCGACCCAAGGACAGGA
TTTCGTACTTCTACGACGGAGATGTGGGCAGCGTCTATTTCGGTCCGAATCATCCGATG
AAGCCGCACCGGCTCTGCATGACCCACCATCTTGTCCTCTCTTACGAGCTTCACACGAA
GATGGAATTTACCGGCCGCACAAAGCCTACCCCGCTGAGCTCGCCCAGTTCCACTCTC
CTGATTATGTCGAGTTTTTGCACCGGATTACACCCGACACCCAGCACCTGTTCCCAAATG
ATTTGGCAAAATATAATCTTGGAGAAGATTGTCCTGTCTTTGAAAACTTGTTCGAGTTTTG
TCAAATTTATGCTGGTGGCACGATAGATGCTGCAAGAAGATTAAATAATCAACTCTGTGA
TATAGCTATCAACTGGGCTGGAGGATTACATCATGCCAAGAAGTGTGAAGCATCTGGATT
CTGCTATATAAATGACCTAGTTTTGGGAATATTGGAGCTGCTGAAATATCATGCACGTGT
TTTATATATTGACATAGATGTGCATCATGGTGATGGCGTAGAAGAAGCCTTTTACTTCACT
GACAGGGTAATGACTGTTAGCTTTCACAAATTCGGGGATATGTTCTTTCCAGGAACTGGT
GATGTTAAGGAAATAGGAGGAAAAGAAGGAAAGTTCTATGCTATCAATGTCCCACTCAAG
GACGGAATAGACGATACTAGCTTCACTCGACTTTTCAAGGCTATCATTTCGAAGGTCGTC
GAAACATACCAACCTGGTGCTATTGTACTCCAATGCGGAGCAGATTCACTTGCTGGGGA
CCGTTTGGGGTGTTTCAATCTCTCTATTGATGGACATTCTGAGTGTGTAAGGTTCGTCAA
GAAATTCAATTTGCCATTATTGGTTACTGGTGGTGGAGGATACACAAAAGAGAATGTTGC
TCGATGTTGGGTTGTTGAAACAGGAGTTCTCTTAGATACAGAACTTCCAAATGAGATTCC
TGAAAACGAGTATTTCAAGTACTTTGCTCCTGATTATTCATTGAAGATTCCTCGTGGAAAT
ATCGTACTCGAGAATCTAAATAGCAAGTCCTACCTCAGTGCAATCAAAGTGCAAGTGTTA
GAAAACCTTCGTAATATTCAACATGCCCCAAGTGTACAAATGCAGGAGGTTCCACCCGAT
TTCTATATTCCCGATTTTGACGAAGACGAGCAGAATCCTGATGAACGAATGGATCAGCAT
ACCCAGGACAAGCAAATCCAACGAGAGTGATGAATATTATGATGGGGACAATGATAACGA
CCATAACATGGATGATTCATGATGCCAAGCTAAGCAATATGTAATTTTTTTTTTAAGGATA
CGACATTTGAATTGGCGATCACAATCTACTGTAGTCAATACTCAAGTGGGAGGTGTAAAT
AGATTCCATCTGAATTTTGTGAAGCAGATATTAGTTCCTACTTTTGTGAAAAAAAAAAAAA
AAAA
SEQ ID NO:35
GGCACCAAACATCGAGGCACAACAGCATCGTCCCCCGCGTCTCCGTCGCGACACCCTC
CTCGTCCGGTGGCGGTGGTGGCGGTGCGGAGTTTAAAAGCGTCGCGGTTTCCCTCGAC
ACGACCGAGCCTCCCCCACAACCTCACTCGCCCGCTCGCTCGCGATCGACGGCGCGCG
CTCGCCGGCTTCCGCCTTCCTCTCCGGCGATCTCGGCATGATGTATTAAAGGAATGACA
GTGGCTGAAGATTTCCATGTAAACAACAGGTCTAAAATGGTTTCCCAGGCAACTCCAGAA
AGTCGTCTCACAGGTGGAGAAGATGATAACAGTCTGCACAATCAAGTTGATGAGCTCCT
CTGTCAGGAATTACCAGAAAGACAAGTGATATTGGAGTTTGAAGGCACCAGGCCTAAGC
CTTATTTCAGTGATCATAACGGGGGAGAAAATAGTGCATTAGGTGTGAGAGCCACCGAA
GATGATTTGAATTCTGATGTGGAGGCTGAAGAAAAGCAGAAAGAAATGACTTTAGAAGAT
ATGTACAAAAATGATGGCACCTTATATGATGACGATGAGGATGACAGTGATTGGGAACC
GGTGAAAAGGCAAGTGGAACTGATGAGGTGGTTTTGCACCAACTGTACAATGGTGAACG
TTGAAGATGTCTTTCTCTGCGATATATGTGGGGAACATAGAGATTCAGGTATCTTGAGAC
ATGGCTTTTATGCATCTCCATTCATGCAAGATGTGGGTGCTCCTAGCGTTGAAGCAGAAG
TACAGGAATCACGTGAAGATCATGCTAGAAGTTCTCCGCCAAGCAGTTCAACTGTTGTG
GGTTTTGATGAGAAGATGCTGCTACACTCAGAGGTGGAGATGAAGTCACATCCTCACCC
TGAAAGGGCAGACCGTCTTCAAGCTATTGCTGCTAGCCTTGCTACTGCCGGCATATTTC
CTGGAAGATGCCGTTCGCTTCCTGTGAGAGAAATTACAAAAGAAGAGCTGCAGATGGTC
CATTCTTCAGAGCATGTTGATGCTGTTGAAATGACAAGCCATATGTTTTCTAGCTACTTTA
CTCCTGACACATACGCGAATGAACATTCAGCGCGTGCTGCTAGGATTGCAGCCGGTTTG
TGTGCTGATCTCGCTTCAACAATTATTTCTGGACGTTCGAAGAATGGTTTTGCTCTGGTT
CGTCCTCCTGGTCATCATGCTGGTATCAAACATGCCATGGGGTTCTGCCTCCACAATAAT
GCAGCAGTTGCTGCACTAGCAGCACAAGGTGCGGGGGCGAAGAAAGTGCTTATAGTTG
ACTGGGATGTTCACCATGGAAATGGCACGCAAGAGATTTTTGATGGAAATAAATCGGTCT
TATACATATCACTACATCGACATGAAGGAGGAAACTTTTATCCCGGTACAGGTGCTGCCC
ACGAGGTCGGTACCATGGGTGCTGAAGGATACTGTGTGAATATTCCATGGAGCCGTCGA
GGAGTCGGTGACAATGATTATGTTTTTGCATTTCATCATATAGTGCTTCCTATAGCTTCCG
CATTTGCTCCTGATTTCACCATCATATCAGCTGGATTTGATGCTGCGAGGGGTGATCCCC
TAGGGTGCTGTGATGTCACTCCGGCTGGCTACGCACAGATGACACACATGTTGAGTGCT
CTTTCTGGTGGGAAGCTGCTTGTCATTCTAGAGGGAGGTTACAATCTTCGCTCAATCTCC
TCTTCTGCTGTGGCAGTTATTAAGGTGTTACTGGGTGACTCACCTATATCTGAAATTGCG
GATGCGGTGCCCTCGAAAGCTGGCTTGCGTACTGTGTTGGAAGTCCTGAAGATACAAAG
GAGCTACTGGCCTAGTCTTGAATCCATTTTTTGGGAATTGCAGTCACAATGGGGAATGTT
TCTTGTTGATAACAGAAGAAAACAGATCAGAAAGAGACGACGGGTGTTGGTGCCAATAT
GGTGGAAATGGGGTCGGAAAAGTGTGTTGTATCATCTCCTAAACGGTCATCTTCATGTGA
AAACGAAGCGGTGAATGTCTAGGCTCTAGTCTGTGTATTCTTCCTTTGTGCAACTTTGTA
TACCATCAGCTGGCAGTCCTCTTTTATCCGTGGAAGACTCTCATGGTGATTTTGACAACC
ATTTTTACATTGTAAATCTAGGTTAACCATCATTAGAGTAGAGCTCCTTCTGATCTACTAC
TGTTAAGAGAGGAGTGGTCAGCGGCCTTCATTTTATGTTTAAATGTTCAAATGTTGATAAT
TGCAACTTTTTTGGGAGCTTGCTTTTGCATTAAGCTCGGATTGGATCGAGAAGCATGCCG
ATTTTGTTGGGGCAGTCAGATGTATGTGATCATGTGTAATCAGTATATCAGGTTAGAAAC
AGTACTCTTGAGCTTAGCGGGCACTGTTCTTCGCTGCTTCAGAGGTTTAAGTCCTCAAGG
TCAGGGAAAATGTTTGTGGTTTCGCTTCTTTGAGTTCATAACTGTCTTTTTGGCTCAAAAA
AAAAA
SEQ ID NO:36
ACAGGCAAAAAGCATAAGCTGGCAAGAGCTTCCTCCTGCTGAGTTACAGCATCCTTT
AGGATCAGAAGAGGGGAAACATCATCGGCTGCTTCGCTATTCAACTCTCGAATGGCTGC
TGCTCCTTCTTCTCCTCCGACGAACCGGGTCGACGTGTTCTCACGACGGGATGCTGA
GCCACGACACCGGTCGGGGCGTGTTCGACACGGGATCCGACCCGGGCTTCTTGGACGT
GCTCGAGAAGCACCCGGAGAACCCCGACAGGGTCCGGAACATGGTCTCGATCCTCAAG
CGAGGACCCATCTCTCCCTTCATCTCCTGGCACACTGCAACGCCTGCTCTCATCTCTCA
GCTCCTCTCTTTTCACTCTCCGGAATACATAAATGAACTAGTGGAAGCTGATAAAAACGG
AGGGAAAGTTCTTTGCGCTGGAACTTTCTTGAACCCAGGGTCATGGGATGCCGCGCTTC
TTGCTGCTGGTAACACACTTTCTGCTATGAAATACGTACTTGATGGTAAGGGGAAGATTG
CATATGCACTAGTGAGGCCACCTGGTCACCATGCTCAGCCTTCTCAAGCCGATGGATAT
TGCTTCCTAAACAATGCCGGTCTAGCAGTTCGATTGGCATTGGACTCTGGGTGCAAAAG
GGTTGTCGTTGTTGATATAGATGTGCACTATGGAAATGGAACAGCGGAGGGATTTTACCA
ATCCAGTGACGTTCTCACCATCTCTCTTCACATGAATCACGGGTCCTGGGGTCCATCTCA
TCCACAAAGTGGATCTGTCGATGAGCTTGGTGAGGATGAAGGATACGGGTATAATATGA
ACATTCCGTTGCCAAATGGAACCGGAGACAGGGGATACGAGTATGCTGTGACTGAGCTG
GTTGTGCCAGCTGTGGAGAGTTTCAAACCTGAAATGGTTGTTCTTGTTGTTGGCCAAGAC
TCTAGCGCGTTCGATCCAAATGGAAGGCAGTGCTTGACAATGGATGGATATCGGGCGAT
TGGTCGAACAATTCGTGGCCTCGCGGATAGGCACAGTGGAGGCCGCATTCTCATTGTCC
AAGAAGGTGGATATCATGTTACCTACTCAGCTTATTGCCTTCATGCCACGGTGGAAGGCA
TTCTTGATCTTCCAGATCCATTATTAGCTGATCCAATTGCTTATTACCCAGAAGATGAGGC
TTTTCCTGTGAAGGTGGTCGACTCAATCAAAAGGTACCTTGTAGATAAGGTACCTTTTCT
CAAGGAACATTGATACAGAGATGTTGCGATATAACTTTGCTTTATTGGCACGGTGTCCCC
AAATCTAGTTTGAATATTTTGTTTTTTTAGGACTTCATCTGAGATAGAAGTAGAAGCTAATA
TGAAGAAAGTAGTGATTTCTATATTTTATTTTTTGGGTGAGAA
SEQ ID NO:37
AGCGAGAGCGAGAGAGAGAGAGAGAGAGAGAGAGAGGCGCGCGGCGGAGATGGTCGA
GAGCAGCGGCGGCGCGTCGCTGCCGTCGGTGGGGCAGGACGCGCGGAAGCGGCGCG
TGAGCTACTTCTACGAGCCGACGATCGGCGACTACTACTACGGGCAGGGGCACCCGAT
GAAGCCCCACAGGATCCGGATGGCGCACAACCTCATCGTCCACTACTACCTCCACCGC
CGCATGGAGATCAGCCGCCCCTTCCCCGCCGCCACCACCGACATCCGCCGCTTCCACT
CCGAGGACTACGTCACCTTCATCTCCTCCGTCACCCCCGAGACCGTCTCCGACCCGGC
CTTCTCCCGCCAACTCAAGCGCTTCAACGTCGGCGAGGACTGCCCCGTCTTCGACGGC
ATCTTCGGGTTCTGCCAGGCCTCCGCCGGCGGTTCCATGGGCGCCGCCGTCAAGCTCA
ACCGCGGCGACTCCGACATCGCCCTCAATTGGGCCGGCGGGCTGCATCACGCCAAGAA
GTCCGAGGCTTCTGGGTTTTGCTATGTCAACGATATCGTGCTTGGGATTCTCGAGCTCCT
CAAGGTCCACAAGCGTGTTCTCTACGTGGATATCGATGTTCACCATGGGGATGGGGTGG
AGGAGGCATTTTATACGACGGATAGAGTCATGACCGTATCTTTCCATAAGTTTGGGGACT
TCTTCCCTGGAAGCGGGCACATTAAGGACACCGGGGCAGGGCCTGGGAAGAACTACGC
CCTCAATGTGCCACTGAATGATGGCATAGATGATGAAAGTTTCCGTGGTATGTTTCGCCC
CATTATCCAAAAGGTTATGGAAGTATATCAACCGGATGCCGTTGTTCTTCAGTGTGGGGC
AGATTCACTGTCTGGAGACCGATTGGGATGCTTCAATTTGTCTGTGAAAGGTCATGCAGA
TTGCCTCCGTTTTCTAAGATCTTTTAACGTTCCTTTAATGGTCTTGGGTGGGGGAGGTTA
TACAATGAGAAATGTCGCTCGTTGCTGGTGTTACGAGACAGCTGTTGCGGTGGGAGTAG
AACCTGAAAATGACTTGCCTTACAATGAGTATTACGAGTACTTTGGCCCAGATTATACTCT
TCATGTCGAGCCATGTAGCATGGAGAATCTTAATGCACCAAAAGATTTGGAGAGAATCAG
GAATATGTTATTAGAGCAGCTATCGAGAATCCCACATGCACCAAGTGTACCTTTTCAAAT
GACACCGCCCATTACACAAGAGCCAGAAGAGGCAGAGGAAGATATGGATGAAAGGCCA
AAGCCACGCATTTGGAATGGTGAGGATTATGAATCGGATGCTGAAGAAGACAAGAGTCA
GCATAGATCATCAAATGCTGATGCTTTGCATGATGAAAATGTTGAAATGAGGGACAGTGT
TGGTGAAAACAGTGGGGACAAAACAAGGGAAGATCGTTCTCCATCTTGACAAAGATCAG
CAAAGATCAGCTCTAACTCTTGTGTACTTTTGGGTCTTTCTGATATGCGATGCATGTAATT
CTTTCTAGGCCAGGAGATAACCTTATCAAGCCAATAGAGTCTAATGGTTGGCCACACTGG
ATGATTGCTGATAGACGAGGCGGGTTATGTTTATCAGGTGTCACAGTGCATGTTGGAATA
GGAAGACTGAATTGCTGAGTGTAAATGGGATGTTTTCTTATGTCTCCAGCTGATGTGGTT
TGTTTCATTGGCTCATGTGAGATGGATTTACTTAGTTAAAAAAAAAA
SEQ ID NO:38
CTGCGCTTCATTTGCCATTCGTTTTCTCCTCTGCCTCGTTTCTTTGCGGTTCGGACGCAT
CATGGATGTATCTCCAAAGAGTAATCTGTCGATTCCGTGGACAAGATCCACAGCTGCTTC
TTGAGGTCATGCTATATTGGAACCAGCTTAAGGTCAGTTGAGAGGACTTCAACCAATGGT
AGTTCCTAGCAGTAACCCCCACAACAGGGAAATGGCCATTAGGAGGAGGATGGCCAGC
ACATTCAATAAAAGAGAAGATGATTTTCCATCCTTGAGAGAGTACAATGACTACTTGGAA
GAAGTGGAGGAAATGACATTCAATCTGATTGAAGGAGTGGATGTGCCGACTATTGAAGC
AAAGATTGCAAAGTACCAGGAAGAAAATGCTGAGCAGATTATGATTAATCGTGCGAAAAA
GGCTGAAGAGTTTGCTGCGGCATTGGCTGCAAGCAAGGGGCTGCCTCCTCAAACTGAT
CCAGATGGGGCTCTTAACTCTCAAGCAGGATTGAGTGTTGGGACACAGGGACAATACGC
TCCTGCAATTGCTGGAGGACAGCCACGGCCAACAGGCATGGCTCCGCAGCCGGTGCCA
CTTGGGACTGGCCTCGACATCCATGGATATGATGATGAAGAAATGATAAAGCTCCGAGC
TGAAAGAGGAGGCAGAGCGGGAGGGTGGAGCATAGAGCTAAGCAAGAAAAGGGCTCTT
GAAGAAGCATTTGGAAGCCTTTGGTTATAATTCAGTTAGTTCTAGCACTGTTTGTGGGTG
GGAGCATTCTAAGATGGTTCTTAAGGGAGGAAGCGAGAATTGCAATTGTTATGATTGGTC
TGACTAAGAAGTGTTTGCAATTATCCGCGTGAAGATCATGTCTTTCTGGTTCTTGCATGG
AGAGCATTTGGAGCAGCAGCAGCAGCCTTGCTGCTCTCTGGCAATTTTAATGTGACATT
GAAGGTGTAAAACTGTTACCTCTAATACGATCATACTTGGAATAAAAAAAAAA
SEQ ID NO:39
CTTCCTCTCAGACCGTCCCAGAAGCCTTATCTTCATTGGCTTCTCTCAGCTCCTGTACCA
ATTTCGTGTCTCTCTTGTTTTCTTCCGACACACCCGTTTCCCATGTCTTTGTAACACCTCG
AAAGCTCTTATCTTCTGCGCTCTCTCCCATGGCAGCAATCATATCCTGCCACCACTACCA
CTCCTGCTGCTCCTCCCTCATCGCCTCGAAGTGGGTTGGCGCCAGAATACCCACAAGTT
GTTTCGGCCGTTCGAGTACGCAGAGCAACAATGCGGCGTCGGTCAGGCAGTTCGTCAC
CCGGTGTTCTTCAAGCCCGAGCAGTCGCGGCCAATGGCAGCCTCACCAGAACGGAGAG
AAAGGGAGATCATTTTCCTTGAGAGAATGTGCGATATCTATTGCTTTGCCGGTTGGATTG
GTGACGGGAGTGCCTTCGTTGGATATGTCTACGGGCAATGCTTATGCTGCTAGTCCTGC
TCTGCCTGATCTCTCTGTCCTAATATCTGGCCCACCGATTAAAGACCCCGAGGCTTTGCT
AAGATATGCTCTTCCGATTAATAATAAAGCCATAAGGGAAGTTCAGAAGCCGCTCGAGGA
CATCACGGATAGTCTCAAGGTTGCAGGACTCAGAGCTCTTGACTCCGTGGAGAGAAATG
TGAGGCAAGCATCTCGGGTGCTCAAGCAAGGTAAAAATCTGATTGTATCTGGCTTAGCT
GAGTCAAAGAAGGATCACGGAGTGGAGTTGCTTGACAAGCTGGAAGCTGGAATGGATG
AGCTCCAACAAATTGTGGAGGATGGAAATAGAGATGCTGTAGCGGGCAAACAGAGGGAA
CTGCTAAACTATGTTGGAGGTGTTGAAGAGGATATGGTGGATGGCTTCCCTTATGAAGTT
CCTGAGGAATACAAGAATATGCCTCTCTTGAAAGGTAGGGCTGCTGTGGATATGAAGGT
CAAGGTCAAGGACAATCCCAACCTGGAGGAGTGTGTGTTTCGAATAGTCTTAGATGGTT
ACAATGCACCTGTGACTGCTGGGAATTTCGTGGATTTGGTAGAGAGGCACTTCTACGAC
GGCATGGAAATCCAGAGAGCCGATGGATTTGTTGTTCAGACAGGTGATCCCGAAGGTCC
TGCTGAGAGTTTTATTGATCCTAGCACAGAGAAACCCCGGACGATACCTTTGGAGATCAT
GGTGGATGGGGAGAAAGCGCCAGTATATGGAGCAACTCTTGAAGAGCTTGGCCTCTACA
AGGCTCAAACAAAGCTTCCATTTAATGCATTTGGGACAATGGCCATGGCCAGAGATGAG
TTTGAGGACAATTCCGCATCAAGTCAGATATTTTGGCTGCTAAAGGAAAGTGAACTGACC
CCTAGTAACGCGAATATATTAGATGGGCGGTATGCAGTTTTCGGTTATGTGACAGAAAAT
CAGGATTTCTTAGCAGATCTTAAGGTCGGTGATGTCATAGAGTCGGTGCAAGTCGTCTC
GGGCCTGGATAATCTGGCGAATCCCAGCTACAAGATTGCTGGCTAGAGTTTTCCTTTTCT
CTCCGTTTCCCAATATCGTTCGTTTGGACACTGCACATGCCAGCTTGTATGACTCCAACG
TGTAGTTGTTTAAGGCTAATGTTCTTCGTATGATTCTTGTTATTTATGAGAGATAATGTGTT
GTCCACATTGAGAATGTATTCTTTTTTGGGATTGCATTGATATCAAATTCAGATCTATTAG
TGAAAGTTGGCATGAGTCTCAATCTTAGGGGAATACAGTACGGAGAATGAATTTCTGGAT
SEQ ID NO:40
GGTCATCTCTCTCTCTTCCTCGAGACTTCGCCCCCCGCAAAACCCTTGCGCTCCGAACT
CTCTAGAACCGCCTCCCTCTCTCCGCCCTTATAAATCTCTCCCTCTCTCCCCCCCCCCGC
CGCGCCTCCGCTCGCTTCCCGACACCCCCACGGAGACGCCATCCCCAGCTCCGCTTCG
CGATCCGTTTGCAGCCTCGCGGTCGTCGGCAGCCGTCCCTCTCTCTCGGGCGAGAGAC
TTCCGGCGAACATGGCCGGCGAGGACTTCGACATTCCGCCCGCCGACGAGATGAACGA
GGACTTCGACCTCCCCGACGACGACGACGACGCCCCCGTCATGAAGGCCGGCGATGA
GAAGGAGATCGGCAAGCAGGGCCTCAAGAAGAAGCTCGTCAAGGAAGGCGACGCCTG
GGAGACTCCCGATAATGGCGACGAAGTTGAAGTGCATTACACCGGGACGCTCCTGGAT
GGTACTCAATTCGATTCTAGCCGGGACCGCGGGACTCCTTTCAAGTTCACTCTTGGCCA
AGGGCAAGTGATAAAGGGATGGGATCAAGGTATCAAGACAATGAAGAAGGGGGAAAAT
GCAATTTTCACCATACCTCCAGAGCTAGCTTATGGCGAGGCTGGCTCACCTCCAACTATA
CCTCCTAATGCAACTCTGCAATTTGATGTTGAACTACTTTCTTGGACGAGTGTCAAGGAT
ATTTGCAAGGATGGTGGTATTTTCAAGAAAATTCTGGTGGAAGGCGAAAAATGGGAAAAT
CCTAAAGATCTGGACGAAGTATTAGTCAAGTATGAGTTTCAGCTGGAAGATGGTACCACC
ATTGCAAGGTCTGATGGAGTGGAATTCACTGTTAAAGAAGGACATTTTTGCCCAGCAGTT
GCTAAAGCTGTCAAGACAATGAAAAAGGGAGAGAAAGTTCTTTTGACTGTCAAGCCACAA
TATGGATTTGGCGAGAAAGGTAAGCCAGCTTCTGGTGATGAGGGTGCTGTTCCACCTAA
TGCCACTCTTCAAATAACGTTAGAGTTGGTATCGTGGAAGACTGTGTCTGAAGTGACCGA
TGACAAGAAAGTAATCAAAAAGATTCTCAAGGAAGGGGAAGGGTATGAGAGGCCTAACG
AAGGAGCTGTAGTTGAAGTGAAATTGATTGGGAAGTTGCAAGATGGCACGGTATTCGTG
AAGAAGGGCCACGATGATTGTGAGGAGTTATTCAAGTTCAAGATCGACGAAGAACAAGT
AGTGGACGGGCTTGACAAAGCTGTGATGAACATGAAAAAGGGAGAGGTTGCTTTGCTGA
CTGTTGCACCTGAGTATGCTTTTGGATCTTCTGAGTCTAAGCAGGATCTGGCTGTGGTAC
CTCCTAGTTCGACTGTGTATTATGAGGTGGAGTTGGTCTCTTTTGTCAAGGACAAGGAGT
CGTGGGACATGAATACTGAGGAGAAGATTGAGGCTGCAGGTAAGAAGAAAGAAGAAGG
AAATGTTATATTTAAGGCAGGGAAGTATGCGAAGGCTTCCAAAAGATATGAAAAGGCTGT
GAAGTACATAGAATACGATACCTCCTTTAGTGAAGATGAGAAGAAACAAGCGAAGGCACT
GAAAGTTGCTTGCAATCTGAATGATGCAGCATGCAAGCTGAAACTCAAAGATTACAATCA
GGCTGAGAAACTATGCACCAAGGTTTTGGAGCTGGATAGCAGGAATGTGAAGGCTCTTT
ATCGGAGGGCGCAGGCTTACATTGAGCTGTCTGATCTTGATTTGGCTGAATTTGACATCA
AGAAGGCGCTAGAGATTGATCCTCACAATAGGGATGTGAAACTGGAGTATAAAGTATTGA
AGGAGAAGGTCAAGGAGTTCAACAAAAAGGATGCCAAGTTCTATGGCAATATGTTTGCC
AAAATGAGCAAACTAGAACCTGTTGAGAAGACTGCAGCTAAGGAGCCTGAGCCTATGAG
CATAGACAGCAAGGCGTGAGAGAGAAGCACCTAATTCCGTTTCTTTTCTTTCGGTTCATT
TGGGTCCTTCTCTACATGCAAGGAATTCTCTATTCCGCAGCAATGTAACTGGGATTATTTT
ATCGGATTGAAAAACATATAATGTCGGTACAGTGTGTCGTGTGGTTTAGGTCAGTTATGT
TGCGGTTGGAGTTTTCAAAAGTTCTTGTACCATTGAAAATTCTCTTGCGCAGTTTGGAGA
GGTTATTGTCTCTCAATCGATCTGTCTGTTTACTTGTCCAGTTGATATGAGTATCATAACT
CGGATGGTGACAACTTTGTACTACGGTCGGCACCGGTAGATGATGGTGCATATGGGAGA
CGAGGTACGGTTATAAAAAAAAAA
SEQ ID NO:41
GGGACCGGCGGCCGACGGCGTCGTTAGGGGTGTTCTTCTTCCCCAGCCGAGCTCCAGA
AAGACCTGGAAATCGCGGGCGGCGGAAGATTCGCCTTCGTCGAGATCAGCGCGGGGA
GCTCCGGCTTCGAGTAGAAGATGTCGACCGTGTACGTGTTGGAGCCGCCGACGAAGGG
GAAGGTGGTGCTGAACACGACGCACGGCCCGCTCGACGTCGAGCTCTGGCCGAAGGA
AGCGCCCAAGGCCGTACGGAACTTCGTCCAGCTCTGCCTCGAGGGCTACTACGACAAC
ACCATCTTCCACCGCATCATCAAGGACTTCCTCGTCCAAGGCGGCGATCCCACCGGCTC
CGGCACAGGTGGCGAAAGTATTTATGGGGATGCGTTTTCTGATGAGTTTCATTCACGTTT
GAGGTTCAAGCACAGAGGTTTAGTGGCTTGTGCCAATGCGGGATCACCACACTCCAATG
GGAGTCAGTTTTTTATCACATTGGATCGATGTGACTGGCTTGACCGGAAAAATACCATTT
TTGGAAAGATAACTGGCGATTCCATATACAATCTTAGTGGATTAGCTGAGGTCGAAACTG
ACAAGAGTGATCGCCCGTTGGATCCACCTCCCAAAATTATCTCAGTGGAGGTACTCTGG
AACCCTTTCGAAGATATTGTTCCTAGAGCACCAGTGAGGTCTTTGGTTCCAACTGTACCT
GATGTTCAAAATAAAGAACCAAAGAAAAAGGCCGTAAAAAAGCTGAACTTACTTTCATTTG
GAGAGGAAGCTGAAGAGGAGGAGAAGGCATTGGTTGTTGTTAAACAGAAGATCAAGAGC
AGTCACGATGTGCTTGATGATCCTCGCTTGCTGAAAGAACATATACCAAGTAAACAAGTG
GACTCATATGACAGCAAAACCGCAAGGGATGTCCAATCTGTTAGAGAAGCTTTAAGCTCA
AAGAAACAGGAGCTGCAGAAAGAGTCCGGAGCTGAATTTTCAAATTCATTCAGGGAAATT
GCCGATGATGAAGATGACGACGACGACGATGCGAGCTTTGATGCAAGAATGCGTAGACA
AATACTTCAGAAAAGAAAGGAGCTGGGTGATCTCCCTCCAAAGCCAAAGCCAAAGTCGC
GTGATGGGATTTCTGCCAGAAAGGAGCGGGAGACATCTATTTCAAGGGACAAAGATGAT
GATGATGATGATGATCAACCAAGGGTGGAAAAGCTCTCCTTGAAGAAAAAAGGAATAGG
ATCTGAAGCCAGAGGTGAGCGAATGGCTAATGCTGATGCAGATTTGCAGCTGTTGAATG
ATGCTGAACGAGGTAGACAGTTGCAAAAACAGAAAAAGCACCGCCTTCGAGGACGTGAA
GATGAGGTACTTACTAAGCTCGAAACGTTCAAGGCGTCAGTATTTGGAAAACCTTTGGCG
TCAAGTGCCAAGGTTGGAGATGGTGATGGTGATTTATCTGACTGGAGATCAGTGAAGTT
GAAGTTTGCTCCTGAGCCTGGCAAGGACCGAATGACTCGTAATGAAGACCCAAATGATT
ATGTTGTTGTGGACCCTCTCTTGGAGAAGGGAAAAGAGAAATTCAACAGGATGCAGGCC
AAAGAAAAACGAAGAGGACGGGAATGGGCGGGCAAATCTCTCACTTGATCCTATTTATC
CCCATTTGCATGTCTCATGAATATAACCCACTCCGTGAGCCACCGGGCGATGGTGCCAG
GTTCATGCTTGCAGAGCCATTCGACCAACAAACTTTGGACAAACTAAACCAGCAATTCTA
GAAGTGCCATGTTAGAGTTGGTAGCTCTCTTCATTGAAGAGATCGACCTGTTGACATCCT
CTTTCTGCATCACTAGGACACAAATCAACATGTACATAAGTCTCTCCTTTGAGAGAGGCG
GATATTCTAGTAGGACCGTTCTCGAAGTGGGACTGTTCTTGCTGAGAGTTTTCTTAGTTC
TTTCCTTTTGTTTTCTCTCTCAACATTATGTTTTGCTGGTCTCATTTCGACCGAAGCGCTT
CTGGAAGAAAACAATACGGCCGGAAGACTTGGAAAACAAGGGGAGTGATCTCCGTCTCT
GCCCACTAACGCCGAGCATGACAGGAAGGAAAGGCGTTGGATTCTGACCAACCCCACC
ACGATGTTCTCGATTCCTTCTAGACGTTTGACGATGGGGGCTGACTGCAGACCTGTCCG
AGCCTGCTCATAATTTGGTTGGCTATTCTTGAAGGAGAGCCATCACATGGTCCTTGTTGG
CTGTGTCGGGTCCATTGGAGTTTTGATCGTCTTGTGTCGTATTAGTTGTCATCATCATGA
GAATTTGGGAATCTGACTCTTCTCATTCTTGAATCGCAACGATCAATTTCTATGCATAGAA
GAGAGAATTATTTTTAAAAAAAAAA
SEQ ID NO:42
CCGGATAACCGCGTCTTATCTGAAACCGCCGGTCCAAACCACCATCAATGGCGTCCGCA
ATCTCCATGCACAGCTCTGGCCTCCTCCTCCTCCAGGGTACCAACGGTAAAGATGTTAC
TGAAATGGGTAAGGCACCTGCAAGCTCTCGAGTTGCTAATATGCAGCAAAGGAAGTATG
GGGCGACATGCTGTGTGGCCAGAGGATTAACATCTAGATCTCACTATGCCTCCAGCTTG
GCATTCAAACAATTTTCTAAAACCCCCTCCATCAAATATGACAGGATGGTTGAGATCAAG
GCTATGGCCACAGACTTGGGGCTGCAGGCGAAAGTAACAAACAAATGCTTCTTTGACGT
GGAAATTGGCGGAGAGGCTGCTGGTAGAATAGTAATAGGCCTCTTGGAGATGATGTTC
CAAAAACTGTCGAGAACTTCCGTGCTTTATGCACAGGTGAAAAGGGATTTGGCTACAAAG
GATGCTCCTTCCATCGTATCATTAAGGATTTTATGATCCAGGGAGGGGATTTCACTAGAG
GAAATGGAACTGGAGGAAAGAGCATCTACGGTTCAACTTTTGAAGATGAGAACTTTGCAT
TGAAGCATGTCGGACCTGGAGTACTGAGCATGGCTAACGCTGGCCCTAGCACTAATGG
GAGCCAATTTTTCATATGCACTGTAAAGACTCCATGGTTGGACAATCGCCACGTTGTGTT
TGGACAAGTCGTCGACGGGATGGATGTTGTGCAGAAACTTGAATCTCAGGAGACAAGTC
GATCGGACGTGCCTCGGCAGCCATGCAGAATTGTGAACTGTGGGGAACTTCCCCTAGAT
GGTTGATTCACATGAGCTGGCTTGTTGCAATGCGGAGCACAAATTTTGGTAATGTGTTCC
GGTTTTGTCATGTTTCATGCGACAGCTCTTCGTTGTGTCAGTATTTTTTAACTGGATGTTC
ATGTATGCTTAGAGATTCCATTGGTGATTCTGACTTCAGTGGTAGCCCATGAAATTTGGG
TTCTGTGTTAGAATTCTTTCGTCAAGGAAAGTGAATTTTCAAAAAAAAAA
SEQ ID NO:43
GGGAAAAAGGAAAAATATCTTCGTTCTCTTCCCTCTTCATCGCCTTCCATGGCGGCTTCT
TTCACAGCTCTGTCCAATGTCGGCTCGCTTTCCAGCCCGAGGAACGGCTCGGAGATTAG
ACGGTTCCGCCCCTCCTGCAACGTCGCCGCCAGCGTCCGGCCGCCTCCGTTGAAGGCC
GGCTTGTCGGCGTCGTCCTCGAGCTCTTTCTCCGGCTCTCTGCGCCTGATTCCCCTCAG
CTCTTCTCCTCAGAGGAAATCCCGTCCGTGCTCGGTTCGAGCAAGTGCTGAGGCTGCG
GCCGCTCAGTCGAAAGTCACCAACAAAGTCTACCTTGACATTAGCATCGGTAACCCTGT
CGGGAAACTAGTCGGAAGAATTGTGATTGGATTATACGGCGACGATGTCCCGCAAACAG
CAGAGAATTTCCGCGCATTATGCACAGGAGAGAAGGGCTTTGGATACAAAGGTTCAACG
GTCCACCGTGTCATCAAAGATTTTATGATTCAGGGAGGTGACTTTGACAAAGGAAATGGA
ACAGGGGGTAAAAGCATATATGGCCGTACTTTTAAAGATGAAAACTTTAAGTTGTCTCAT
GTCGGACCTGGGGTGGTCAGCATGGCAAATGCAGGGCCCAACACCAATGGTAGCCAGT
TCTTTATCTGCACTGTGAAGACACCGTGGCTGGACCAGAGGCATGTTGTGTTTGGGCAA
GTGCTGGAAGGCATGGACATTGTCAGGCTCATTGAGTCACAAGAAACAGACCGAGGAGA
CCGTCCCAGGAAAAGAGTGGTCGTCTCTGACTGCGGTGAACTTCCCGTGGTGTGATATG
CCTTGATTGGTTGGGTTCTGCAAGTTAGTTCTGGCTAAAATCTGAGTCATGCTCTCCCTC
GAATTTTCTTCTTTGTTAATTTCTCTGTTTTGTTGAGAGAGCAAACTTTGGGAAGAGCCAG
GTCGCGATTGGTAACGCTCAACTATTGTAGTGTACCGACGGAATTTTCCTTAAATTGGAA
GCACAGAATGACTGCTCCTGAGTCTCTTCTTTAAAAAAAAAA
SEQ ID NO:44
GGCCGATCGCTGTGGCTGATCTCGTCGCTCCGGTTGTAAAGCATGTCGAGCACATCATC
TCCTCCTTCTTATTCTTCTTCTTGGATCAAAGCATGTTCAGCACATCATGACCCAACAGAA
GCCCTTTGGCTTGCATCTGTAACTCAGTTCCCCTTTCCATACAAGCAGGCTTTTCATAAA
AATCATGGCTGAGGCAATCGATTTGACGGGTGATGGAGGGGTCATGAAGACAATTGTGC
GACGAGCAAAACCGGATGCAGTCTCTCCTTCAGAGACCCTTCCTCTTGTTGATGTTCGTT
ATGAAGGAGTACTTGCTGAAACTGGTGAAGTCTTTGACTCAACACATGAAGACAATACTC
TATTCTCCTTTGAGATCGGAAAGGGCTCGGTGATCAGTGCTTGGGACACTGCATTGAGA
ACTATGAAGGTTGGGGAGGTTGCAAAAATCACATGTAAGCCAGAATATGCCTATGGCAG
CACGGGTTCTCCACCCGATATCCCACCAGATGCAACCCTTATTTTTGAGGTGGAGTTAGT
TGCATGCAAACCGTGCAAGGGCTTTTCAGTGACCAGTGTCACAGAAGACAAGGCTAGGC
TTGAGGAGCTGAAGAAGCAAAGGGAGATAGCTGCCGCAACCAAAGAGGAAGAGAAGAA
GAGGAGGGAAGAAGCTAAAGCTGCAGCTGCTGCTCGTGTTCAAGCCAAGCTGGATGCT
AAGAAAGGTCACGGGAAGGGAAAGGGAAAAGCAAAATGAACAGCTTTATAAGGAATACT
GAAGTAGATCATAACTCAGTTTTCCTTCATATGCCTTTTAAACTTGGCTAGGATCTTAACC
TTGAGTATTATGGAATATAAGTTAAGACCAGACTTCCCATACTTGATGGTGTGCTTGGATA
TACTGTATAAGCATTCTATATTATGCTTGTTGGCTTCGTTTTGAGGGAGCAAAACTGACTA
GAAAGGAGAAAGTATGTAGTACATCGACTCTGATGAATGGCATTTTGATTACGGTAGCTT
TCTCAAAAAAAAAA
SEQ ID NO:45
GGAATTCATTGACGACGACAAGTTAACGTCGACCGCTTCTCTGCCCCTTGAATTTTCCCG
AGAAAACCAGGAACCTGCCAAATATCTCTCTGAAAGATCTCCATGGGTAACCCGAAGGT
GTTCTTCGACATGTCGATCGGCGGCCAGCCGGCCGGCCGGATCGTGATGGAGCTCTAC
GCCGACGTGGTGCCGCGCACGGCGGAGAACTTCCGCGCGCTCTGCACCGGGGAGAAG
GGCGCCGGCCGCTCCGGGAAGCCCCTCCACTACAAGGGCTCGAGCTTCCACCGCGTG
ATCCCGGGGTTCATGTGCCAGGGCGGCGACTTCACCGCCGGGAACGGGACCGGCGGC
GAGTCGATCTACGGCTCCAAGTTCGCCGACGAGAACTTCGTCAAGAAGCACACCGGCC
CGGGTGTCCTGTCCATGGCGAACGCCGGCCCGGGGACCAACGGCTCCCAGTTCTTCGT
CTGCACCGCCAAGACCGAGTGGCTCGACGGCAAGCACGTCGTGTTCGGGCAGATCGTG
GACGGGATGGACGTGGTGAAGGCCATCGAGAAGGTGGGGTCCAGCTCCGGCAGGACC
TCGAAGCCCGTCGTCGTCGCGGACTGCGGCCAGCTTTCCTAGATCGGGCGGTCCCCGT
CGGTCGCCGGGATCTCCCCCCCCCCCCCCCTCCCCGTGTGCGTGTGGGATCTTATCTG
ATCTCCAGTCCGTTTCCGAGCATGGTGTTTTAGGGCTTTCCTTTTTTTTTCGGCTTTTAGC
GTGTGGGGTGTTCGGCCAGATCTGGTTATGGGTCTCTCCGCGGATCCTTGTTGTCTGTA
GGATCGTCGAGACCTTTTATGATGGTTATGTTGAACTGCCAGTCGGCCATTAACTGCTGG
AAAAATAAGATAGGGAGCTTTTCTTGAAATGCGACCCTCTTTTTACCCTGTAAAAAAAAAA
SEQ ID NO:46
TCTCGATCTCGAACTCTCTACCTGTTGATCCGTGTTGACCATGCCGAACCCGAAGGTCTT
CTTCGACATGACGATCGGCGGGGCGGCCGCCGGGCGGGTCGTGATGGAGCTGTATGC
GGACACGACCCCACGCACCGCGGAGAACTTCCGGGCGCTCTGCACCGGCGAGAAGGG
GGTCGGGAGGAGCAAGAAGCCGCTCCACTACAAGGGCTCCAAGTTCCACCGGGTCATA
CCTAGCTTCATGTGCCAGGGAGGTGACTTCACGGCCGGGAATGGGACCGGAGGGGAGT
CCATATACGGAGTGAAGTTTGCAGATGAGAACTTCATAAAGAAGCACACGGGGCCGGGG
ATCCTGTCCATGGCCAACGCAGGGCCAGGGACGAACGGGTCCCAATTCTTCATATGCAC
TACGAAGACGGAGTGGCTCGACGGGAAGCACGTGGTGTTTGGGAAGGTGGTGGAGGG
AATGGAAGTGGTCAAGGCCATCGAGAAGGTTGGGTCCTCTTCCGGTCGCACATCCAAGC
CCGTCGTGGTTGCTGATTGCGGCCAGCTTCCTTGAGCCATAATAATAGAGATCTCTCTCT
CACACACAATGGCTTCTTCATATAATAAAAAGTGTCACCTATGAAGTGTTCAATGTTGTCG
ATCTTTTGTATGTGCTCGTTTGAGGTTTGATTACTTCTTGGTCGTGCACTCCTGAAAAGAA
GAGAAGATCTCACCCGTTTTTCGTTTTCAATGGGATAATCTCGTATTACTTGGCTGAAAAT
AAGGTGAAACATTTACTTGCCTTGCAAGAGGGAAACTTTTGACAAGGTTTGGAACATAAA
TTGTTGAATACGATGTATTATAATGTTGGTGTCTTGGTGAAATACAGAATTATGCTTGCGT
GGAAAAAAAAAA
SEQ ID NO:47
GCAGCAGCGTAGCAGCTTTTATGGCCGCCGGTCGCCGATCGCTCAACTCCTTCGCTCCT
CCTTCTCCTCCTTCGTCTTCGCCGATCGCTGTGGCTGATCTCGTCGCTCCGGCTTTTCAT
AAAAATCATGGCTGAGGCAATCGATTTGACGGGTGATGGAGGGGTCATGAAGACAATTG
TGCGACGAGCAAAACCGGATGCAGTCTCTCCTTCAGAGACCCTTCCTCTTGTTGATGTTC
GTTATGAAGGAGTACTTGCTGAAACTGGTGAAGTCTTTGACTCAACACATGAAGACAATA
CTCTATTCTCCTTTGAGATCGGAAAGGGCTCGGTGATCAGTGCTTGGGACACTGCATTG
AGAACTATGAAGGTTGGGGAGGTTGCAAAAATCACATGTAAGCCAGAATATGCCTATGG
CAGCACGGGTTCTCCACCCGATATCCCACCAGATGCAACCCTTATTTTTGAGGTGGAGT
TAGTTGCATGCAAACCGTGCAAGGGCTTTTCAGTGACCAGTGTCACAGAAGACAAGGCT
AGGCTTGAGGAGCTGAAGAAGCAAAGGGAGATAGCTGCCGCAACCAAAGAGGAAGAGA
AGAAGAGGAGGGAAGAAGCTAAAGCTGCAGCTGCTGCTCGTGTTCAAGCCAAGCTGGA
TGCTAAGAAAGGTCACGGGAAGGGAAAGGGAAAAGCAAAATGAACAGCTTTATAAGGAA
TACTCAAGTAGATCATAACTCAGTTTTCCTTCATATGCCTTTTAAACTTGGCTAGGATCTT
AACCTTGAGTATTATGGAATATAAGTTAAGACCAGACTTCCCATAAAAAAAAAA
SEQ ID NO:48
CTGCAAGCGTGCTCGGAGATCGTGGAGATGGCGACGGCGAGATCGTTTTTCCTCTGTG
CTCTGCTCCTCCTCGCAACCCTATATCTCGCTCAGGCGAAGAAGTCTGAGGATTTGAAA
GAGGTGACTCACAAAGTTTATTTTGATGTGGAGATAGCTGGAAAACCGGCAGGTCGAAT
TGTCATGGGTCTATATGGAAAAGCAGTTCCAAAGACTGCTGAAAACTTCAGGGCGCTGT
GTACAGGTGAGAAAGGCACTGGAAAGAGTGGAAAACCTCTTCACTACAAGGGAAGCAGT
TTCCACCGGATCATTCCAAGCTTCATGCTACAGGGAGGTGACTTCACTCTTGGTGATGG
CAGAGGTGGCGAGTCTATTTATGGAGAGAAGTTTGCTGATGAAAACTTTAAGTTGAAGCA
CACTGGACCAGGGCTTTTGTCTATGGCGAACGCTGGTCCTGACACGAACGGTTCACAGT
TTTTTATCACAACTGTGACGACTAGTTGGCTTGATGGGGGAGACACGTCGTGTTCGGGAAA
GTGTTGTCTGGGATGGATGTGGTTTACAAAGTGAAGCTGAGGGAAGACAGAGCGGCAC
CCCCAAAAGCAAAGTCGTGATAGCAGACAGTGGTGAACTCCCACTCTGATCGAACTGAT
GCCAACATGTTTACCATGACTAGCTCTTAGGCAGCAGTTAAGAATAAATGCAAAAACAG
TGTAGGTTATGTGAAACCGAAAACTGGATTTGGAGTTATAGCCGTGCTCTTCTTTTTGGG
CATTTCGGTTGTTCTAATGTGTCTGGAGTGAATATTTTTCAATATGATGTAATTTTGCCTTT
GTAAAAAAAAAA
SEQ ID NO:49
TCGACATTGAGGATCCGGGAGCCGACTCCCTTCGCTCGATGGAGGTTCGAGGGGTCGC
GGCCTGACGACGATCGAGGGCGCCCGCCTTCCTATGCCATCTTCTCATCTTCTTCATCT
TCTTCCGCGGCTCGGGAATGATGCGTCGAGAGATCTCGGTCCTGCTCCAGCCTCGCTTC
GTCCTCGCCTTCCTCGCCCTCGCCGTCCTCCTCCTCGTCTTCGCCTTCCCCTTCTCGAG
ACAGAGAGGAGACCAAGTAGAAGAAGAACCTGAAATTACCCACAGAGTATACTTGGACG
TTGACATTGATGGACAACATTTAGGTAGAATTGTGATTGGATTATATGGCGAGGTGGTAC
CAAGAACTGTAGAAAATTTCCGGGCTTTGTGCACAGGGGAAAAGGGCAAGAGTGCCAAT
GGAAAAAAACTCCACTACAAAGGAACACCTTTTCACCGTATAATATCTGGCTTCATGATC
CAAGGGGGAGATGTAATCTACGGTGATGGAAAAGGATATGAATCCATATACGGTGGCAC
CTTTGCTGATGAGAATTTTAGGATAAAGCATTCCCACGCAGGTATCATTTCCATGGTGAA
TTCTGGCCCTGACTCTAATGGATCACAATTCTTTATTACCACAGTCAAAGCCAGCTGGTT
GGATGGAGAGCATGTTGTTTTCGGCAGGGTTATTCAAGGCATGGACACTGTCTATGCAA
TCGAAGGTGGAGCTGGAACATACAACGGAAAGCCCAGAAAGAAAGTCATAATCGCTGAC
TCTGGAGAGATTCCGAAAAGTAAGTGGGACGAGGAAAGGTGACCACTTTTTGTTTTTGG
GTACACGCAGTTAGGATAACTAGCATGAAAGCCCGATCCCGCATATACAGGACTTGGAG
GCGAGTTCTCTTCTTTGGTCGACATTCTTTGCCCCGTCTTTTGTCCAGAAACCTGGCTAC
CCAGCTAACTCCAACGCTCATATTCACCATGGTACAGTTCTTGGAAATGCCATGGGTCTG
TACCCGACTTCTACGATGTATTCTTTGTAAAGATGAATGGGTTCACTCAACTTTTTTTCTG
GCCTTTTGTCTTGTGTCTGCTGTTCCATTCATGAGCAGTCTACAGGAGTTTTGGTCTCATC
ACCCAGTCCACGACGTATGAGTCCTGATAAGAATTT
SEQ ID NO:50
GGGAGCAGCGCATTTCATCAGACATTTGGTGAGCGACACCATCTCCCCTGCAAGAAAAG
AGAACCTCTCGCGGCTCCTCTCCCGGCGTGGCCCTCGTAGCTGGCTTTGTGCGTAGAA
GATGTGGGCGACCGCGGAAGGAGGTCCTCCCGAGGTCACTCTCGAGACCTCCATGGGT
TCTTTCACTGTCGAGCTATACTTCAAGCATGCGCCGAGGACTAGCCGGAACTTCATCGA
ACTCTCTCGGAGGGGTTACTACGATAACGTCAAGTTCCACCGAATCATCAAGGACTTCAT
CGTGCAAGGTGGGGATCCCACCGGGACTGGAAGGGGTGGAGAATCGATCTACGGTAAA
AAGTTTGAGGATGAGATAAAACCAGAACTGAAGCACACGGGTGCAGGTATCCTGTCCAT
GGCAAATGCTGGACCAAACACCAACGGCAGCCAGTTCTTCATCACCTTGGCGCCATGCC
CTTCACTTGATGGAAAACACACAATATTTGGACGAGTATGCAGAGGAATGGAGATTATCA
AAAGACTTGGGAGTGTCCAAACAGACAACAATGATAGACCGATCCATGATGTGAAGATAT
TGCGGACATCAGTAAAAGATTGATTTCGAGGGAGTTAATTTTCACTGACTGGGTGTTCTG
TGATACCGTCTTTCAACTGCACATTGATCATGGAGCCATATCTGTCCAACATTTGTAATAT
TGTGCAATGGCGTTAGTATCATCTGATGCATGTTATCGCCGGTCACTTCTCTTCGACAAA
AGATACATTATGCAATGCGTTGATGCTTTGTCAGGATTATGTCAGCTTGGGACAAATTTG
GGGCAATTTCATGGCATATGAACTCAACAGGATCT
SEQ ID NO:51
GAACTTCGATTGCAGTTCACAAGAGGTTGTGGCTAGGATGTCGAATCCAAAGGTTTTCTT
CGACATATTGATTGGTAAGATGAAAGCAGGCAGGGTTGTGATGGAGCTCTTTGCAGACG
TCACGCCCAAGACCGCCGAAAATTTCCGCGCACTGTGCACCGGGGAGAAGGGGATCGG
CAGATCTGGGAAACCGCTGCATTACAAGGGATCAACCTTTCACCGTATAATCCCAAACTT
CATGTGTCAAGGCGGGGATTTCACTAGGGGCAACGGAACCGGAGGGGAGTCTATATAC
GGAATGAAATTTGCCGATGAGAACTTCAAGATTAAGCACACCGGCCTGGGAGTGCTGTC
GATGGCCAACGCGGGACCCGACACCAATGGCTCTCAGTTCTTCATATGCACCGAGAAGA
CCCCGTGGCTGGATGGGAAGCACGTCGTGTTTGGGAAGGTCATAGACGGTTATAACGT
GGTTAAGGAGATGGAGAGCGTGGGTTCTGATAGCGGAAGCACGAGGGAAACGGTTGCA
ATCGAAGACTGCGGTCAGCTATCAGAGAATTGATGGCTAGCACTGTGTAGAAAGGTGAA
TTTAAAGTACTTGTCTACACTGCTTATTAAATCATTGTGGATGAGCCCCTTAAAATTTCTTA
TGATG
SEQ ID NO:52
GAAGAAGGAGGAGGAGCCGAAGTGCCTGCATTCCGAGAGCCGAAGTGCGTTGGCTTCC
CTTGCGATCGTTCCTCCGTTTCGTGATCCGACCCGAAGACCCCGCGCGCCATGGACGA
CGACTTCGAGTTCCCGGCCTCGAGCAACGTCGAGAACGACGACGACGACGGCATGGAC
ATGGACGACATGGGCGGGGACGTCCCCGAGGAGGAGGACCCGGTGGCGAGCCCCGC
CGTCCTCAAGGTCGGCGAGGAGAGGGAGATCGGGAAGGCCGGGTTCAAGAAGAAGCT
CGTCAAGGAAGGCGAAGGGTGGGAGACCCCGAGCTCCGGCGACGAGGTCGAAGTGCA
TTACACGGGGACTCTGCTCGACGGGACCAAGTTCGACTCGAGCCGCGATCGTGGGACG
CCGTTCAAGTTCAAGCTCGGTCGAGGTCAAGTGATCAAGGGATGGGATGAAGGGATCAA
GACTATGAAGAAGGGTGAGAATGCCATATTCACCATCCCTCCGGAACTTGCCTATGGTG
AATCGGGATCCCCTCCTACCATTCCTCCTAATGCCACTCTTCAATTTGATGTGGAGTTGC
TCTCATGGAGTAGTGTGAAAGATATATGCAAGGATGGAGGGATCCTCAAGAAAGTTCTTG
TGGAAGGAGAGAAGTGGGATAATCCCAAGGACCTAGATGAGGTTTTTGTTAAATATGAA
GCAAGCCTCGAAGATGGGACGCTCATTTCGAAATCAGATGGGGTGGAGTTCACTGTGG
GAGATGGATATTTCTGCGCTGCGTTGGCCAAGGCTGTAAAGACAATGAAGAAAGGAGAG
AAGGTTTTGCTGACAGTGATGCCACAATATGCATTTGGGGAGACTGGCAGACCGGCTTC
TGGGGATGAGGCTGCTGTTCCCCCTGACGCTAGCCTCCAAATTATGCTCGAGCTAGTCT
CTTGGAAGACCGTTTCTGATGTAACCAAAGACAAGAAGGTCTTGAAGAAAACTTTGAAGG
AAGGAGAGGGGTATGAGCGCCCGAATGATGGGGCAGCTGTTCAAGTGAGACTCTGCGG
TAAACTCCAAGATGGAACGGTCTTTGTGAAGAAGGATGACGAGGAGCCTTTCGAGTTCA
AAATAGACGAGGAACAAGTGATCGATGGACTTGACAGAGCTGTCAAAAACATGAAGAAA
GGGGAAGTGGCATTGGTGACAATCCAGCCGGAGTATGCCTTTGGCCCGACCGAATCTC
AACAAGATTTGGCTGTTGTTCCTGCCAACTCCACTGTGTATTATGAGGTCGAGCTGTTAT
CATTCGTTAAGGAGAAAGAATCATGGGAGATGAACAATCAGGAGAAAATTGAAGCTGCA
GCAAGGAAGAAGGAAGAGGGGAATGCTGCATTTAAAGCTGGAAAATATGTAAGGGCTTC
GAAGAGATATGAGAAGGCAGTGAGGTTTATCGAATATGATTCATCATTTAGTGATGAGGA
GAAGCAGCAGGCAAAGACCCTAAAAAACACCTGCAACCTTAACGATGCAGCTTGCAAAT
TGAAACTCAAAGACTTCAAGGAGGCAGAGAAACTGTGCACCAAGGTATTGGAAGGTGAT
GGTAAAAATGTGAAGGCACTCTACCGGAGGGCTCAAGCTTACATACAACTTGTTGATTTG
GATCTTGCAGAGCAGGACATCAAAAAGGCATTAGAGATTGACCCTAACAACAGGGATGT
GAAACTCGAGTACAAGATACTGAAGGAAAAAGTGAGGGAATATAACAAGAGAGATGCCC
AGTTCTATGGGAATATGTTTGCCAAGATGAACAAACTGGAGCATTCTAGAACAGCGGGC
ATGGGGGCAAAGCATGAGGCAGCACCTATGACGATAGATAGCAAAGCGTAGATTGTACT
CGTGAATAGCTAATGGTGATGTATGTTTGGAGTCCGCTTGAGTAATGTGTACCCACATCA
ACTGTCCATGTATTCCTTAACAGACGCTAAAAAACTCGTCTATCGACTTGAGACTGTCTTG
GCGTGTATTTTGGAATAAACTATTATCACGTTTTGTTAAATATAATACCAAAACTCCGTGC
TGTGAATAAAAAAAAAA
SEQ ID NO:53
TTTTGGTTTGTTTCCTTCTTTCCCGCCGTGGAGGAGGAGAAGAGTCTAGAGAGAGAGAG
AGAGAGAGAGGGAGATGGCGAAGCCGAGGTGCTTCATGGACATCAGCATCGGAGGGG
AGCTCGAAGGCAGGATCGTCGGCGAGCTCTACACCGACGTCGCCCCCAAGACGGCCGA
GAATTTCAGGGCCCTCTGCACCGGCGAGAAGGGCATTGGCCCTCACACCGGCGCCCCC
CTCCACTACAAGGGGGTTCGCTTTCATCGTGTTATCAAAGGATTCATGGTGCAAGGTGG
AGATATCTCTGCTGGTGATGGTACCGGGGGAGAATCTATCTATGGCTTGAAATTTGAGG
ATGAAAATTTTGATCTGAAGCATGAAAGGAAGGGAATGTTATCAATGGCAAACTCTGGCC
CAAACACAAATGGCTCCCAGTTCTTCATCACAACAACCCGCACTTCGCATCTGGACGGA
AAGCATGTTGTGTTTGGGAGGGTAGTTAAAGGAATGGGAGTGGTCCGATCAGTCGAGCA
TGTTACAACTGCTGCTGGTGACTGTCCAACTGTTGATGTTGTAATCGCTGACTGTGGAGA
AATTCCTGCTGGGGCGGATGATGGCATTAGAAATTTTTTTAAGGATGGCGATACTTACCC
AGACTGGCCTGCAGATCTTGATGAGAGTCCTGCTGAACTTTCTTGGTGGATGGATGCTG
TAGATTCTATCAAGGCATTCGGGAATGGAAGTTATAAGAAACAAGATTACAAAATGGCTC
TCAGAAAGTATCGAAAGGCCCTGCGCTATCTGGATATCTGCTGGGAGAAAGAAGGGATT
GATGAAGTTGAGAGTTCATCTTTGAGGAAAACAAAGTCACAGATATTCACCAATAGTTCT
GCTTGTAAATTGAAACTATGTGATCTCAAGGGAGCATTGTTAGATGCAGAATTTGCTGTT
CGTGATGGAGAAAACAATGCAAAAGCTTATTTTCGACAGGGACAGGCTCATATGGAACTT
AATGATATCGATGCTGCAGCGGAAAGCTTCTCTAAGGCGTTGGAGTTGGAGCCAAATGA
TGTTGGGATCAAGAAAGAGCTTAATGCTGCCAAGAAGAAGATTTTTGAGAGGCGCGAAC
AGGAGAAGAGGGCATATCGGAAGATGTTTCTATAGCAGATTCTCTACTATTGCAGAGGAA
AGCACAAAGATTCTTGGCAAACCAACTGTATAGATTGGAGGCAGGGATCAGATCAAGATT
TTTATTTGCAGTTAAATAGAGTTGAAATTCATTAGGATAAGAGTTTTTGTTTTACACCCGG
GGGGGGGACCATCGTCGTTGCCCAAATGGCATGACCAGGACATTTTGGAACTGGACATT
TTTAGTCACTTCTCCACGTCGTTTCGGTTTAACTCTAATGGGAGCGTTGGGTGTTCAGGG
CAAAAAAAAAA
SEQ ID NO:54
TCTCGCTCTAAACTCCAAGTTCTAAACTTGAAGCATTCTTGAAGTTGCGGGACATGACTA
AGAGAAAGAACCCTCTCGTCTTCTTAGATGTGTCGATTGATGGGGATCCTGTGGAGAGA
ATTGTTATTGAGCTCTTTGCTGACACTGTTCCCAGGACAGCAGAAAATTTCCGATCACTC
TGCACAGGAGAGAAAGGAGTGGGGAAAACTACTGGCAAGCCTCTGCACTACAAAGGAT
CGTACTTTCATCGAATTATTAAAGGGTTCATGGCCCAAGGTGGTGATTTTTCAAATGGAA
ATGGCACTGGTGGAGAAAGTATATATGGAGGGAAGTTTGCTGATGAGAATTTCAAACTAG
CACATGATGGACCTGGTCTTCTCTCTATGGCAAATGGTGGGCCAAATACCAATGGGTCC
CAGTTCTTCATAATATTCAAGCGCCAACCTCACCTTGATGGGAAGCATGTTGTATTTGGG
AAGGTTATGAGGGGAATGGAAGTTGTGAAGAAAATTGAACAGGTAGGAAGTGCCAATGG
AAAACCCCTCCAACCGGTGAAAATTGTAGATTGCGGTGAGACTTCTGAAACAGGAACAC
AAGATGCTGTTGTGGAGGAAAAAAGTAAATCTGCAACACTAAAAGCAAAAAAGAAACGAT
CTGCAAGAGATTCGTCTTCTGAATCCCGAGGAAAAAGACGACAGAGAAAATCCCGCAAG
GAGAGAACGAGGAAACGAAGGAGATACTCTTCATCTGATTCCTACAGCTCTGAGAGCTC
TGATAGCGATTCTGAATCTTATTCCTCAGACACTGAGTCAGAATCCAAATCACACTCTGA
GTCATCGGTATCTGATTCAAGCTCAAGTGATGGAAGGCGCAGGAAAAGGAAGTCAACAA
AAAGAGAAAAACTTCGTCGGCAAAGAGGGAAGGATAGTCGTGGGGAGCAAAAGAGTGC
CAGATATGACAAAAAATCAAGGCACAAGAGTGCAGATAGTTCAAGTGATTCAGAAAGTGA
AAGTTCTAGCCGTAGTAGGAGTAGAGATGACAAGAAGAAATCTTCTCGACGTGAGTCTG
CTCGAAGTGTTAGCAAGTTAAAAGATGCCGAAGCAAATTCTCCTGAGAATTTAGAATCAC
CAAGGGATCGTGAAATCAAGAAAGTTGAGGACAACTCATCACATGAAGAAGGTGAATTTT
CACCAAAGAATGATGTTCAGCATAATGGACATGGTACTGATGCAAAATTTGGTAAATATG
ATGATCAACGTCCCCGTTCAGATGGTTCAAAGAAGTCCAGCGGAAGCATGAGGGACAGC
CCCAAAAGGTTGGCTAACAGTGTCCCTCAGGGAAGTCCATCAAGCAGTCCTGCACATAA
AGCCTCTGAACCTTCTTCCTCTATTCGTGCCCGCAATCCATCAAGAAGCCCTGCGCCAG
ATGGTAATTCGAAGCGCATTAGAAAGGGCCGTGGCTTCACAGAACGTTTCTCCTATGCC
CGTCGATATCGTACCCCATCTCCAGAAGATGTAACATATAGGCCCTACCATTATGGTAGA
AGAAACTTTCATGACAGGAGGAATGATAGGTATTCAAATTACAGAAGCTATTCTGAGCGC
TCACCACACAGAAGATACAGAAGCCCCCCAAGAGGCAGGAGCCCTCCAAGATACCAAC
GAAGAAGAAGTCGAAGCAGGAGTGTGTCTCGCAGCCCGGGTGGTAATAAAGGCCGATA
CCGGGGTCGGGACCAGAGCCGGAGCAGGAGCAGGAGCAGGAGTCGGAGCCCTAGAC
GTGGTTCTAGTCCAGCGAATAAGCAATTGCCACTGAGCGAGAGGTTGAAATCACGTCTT
GGGACCAGAGTCGATGAGCATTCTCCACGCAGAAGAAGGTCCTCCTCGAGAAGCCATG
ATTCCTCAAGATCTAGATCCCCTGATGAAGTACCAGACAAGCATGAGGGCAAAGCTGCC
CCTGTATCTCCTGCAAGGTCTCGATCCAGCTCCCCTTCTGGGAGGGGGTTGGTCTCCTA
TGGCGACGCCAGTCCTGATTCTGGGATCAACTAAAGGGTCTTCCACTTCACATCATCAC
GATTGGCCGAGAGATTTGATGATCTGTGGAGTACCTGCGCCCAACGTTTTGAGTCAGGT
GGACTCGCATGGGGTGTGCAACCGTTGGAATGAACCAAGAGCGGCTTGGACCAGCCTC
ACTCGGTCGGTATTATGTACTCGGTAAATGAAATTTAATGACATAGCTGCTGGATTTTTAC
GAGGTCATCTTAAACGTTCTGAGAATTTCCTGAGATCATGGCCCTCTCATTTGCATCTTAT
AAAACTTTAGTAGACTTGGATTGAAAAAAAAAA
SEQ ID NO:55
TCGATACCTCCTCGGAGCCGGAGGTTTCTGTCGGGGGGAGGCCGGGGCGAACATGTC
GGTGCTGCTCGTGACGAGTTTAGGGGACATCGTGGTGGACTTGCACGCCGATAGGTGC
CCCTTGACCTGCAAGAACTTCCTCAAGCTCTGCAGGATCAAGTACTACAATGGGTGTGT
GTTTCACACTGTGCAGAAGGATTTCACAGCACAGACAGGCGACCCCACTGGGACTGGAA
CAGGCGGCGATTCTGTTTATAAATTTCTTTATGGTGATCAGGCTCGCTTTTTCATGGATG
AGATTCATCTTGATCTTAAACACTCCAAGACAGGAACGGTTGCCATGGCAAGCGGAGGA
GAAAATCTCAATGCTTCTCAGTTCTATTTCACATTGCGGGATGATTTGGATTATCTCGATG
GAAAGCACACGGTGTTTGGTGAAGTTGCAGAAGGCCTTGAGACATTAACTAGGATTAAT
GAAGCCTATGTAGATGAGAAGGGGAGACCCTATAAGAACATCAGGATCAGGCACACATA
TATATTGGATGACCCTTTTGATGACCCTCCCCAGCTAGCTGAGTTGATTCCTGATGCTTC
TCCTGAAGGAAAGCCGAAAGATGAGGTGGTCGATGATGTGCGACTTGAAGATGATTGGG
TACCTTTGGATGAGCAACTAGGCCCGGCTCAGCTTGAGGAGGCTATTCGTGCTAAGGAA
GCACACTCCCGTGCAGTTGTGCTTGAGAGTATTGGAGATATTCCTGATGCTGAGATTAAG
CCTCCAGATAATGTGCTCTTTGTTTGTAAACTGAATCCAGTTACCGAAGATGAGGACTTG
CATACAATCTTTTCACGATTTGGAACCGTGGTATCGGCTGACGTTATCCGTGATTTTAAG
ACTGGAGATAGTTTATGCTATGCATTCATAGAGTTCGAGAACAAAGATTCATGTGAGCAG
GCATATTTTAAGATGGACAATGCCTTGATTGATGATAGGAGGATAAAGGTGGATTTCAGC
CAGAGTGTTGCAAAACTTTGGTCCCAGTTCAAACGAAAAGATAGCCAAGCAGCTAAAGG
AAAAGGTTGCTTTAAATGTGGGGCTCCTGATCACATGGCCAGGGAATGTCCTGGAAGCT
CCACTAGACAGCCACTTTCAAAGTACATTCTGAAGGAGGACAACGCACAGAGGGGTGGA
GATGACTCTAGGTATGAAATGGTCTTTGATGAAGATGCACCAGAAAGTCCAAGTCATGGA
AAGAAAAGGCGAGGACGAGATGACCGGGATGATAGACATAAGATGAGCAGGCAAAGTG
TAGAAGAAACAAAATTCAATGATAGAGAGGGAGGGCACTCAGTGGATAAGCATAGGCAG
AGTGAAAGAAGTAAGCATAGAGAGGATGAGATGAGTCGAGACAGTAAAGCTAGTGAGGC
TGGTAGGAGAAGGATTGACAGGGATTTTCCCGAAGAGGAAAGAGATGGAGAGAAATATA
CGGAGAGTCACAGAGACAGAGATGGTAAGAGAGGTGATTATCGAGATTATAGAAAGGGA
AGAGCAGATGTTCAGACTCATGGAGACAGGAGAGGTGATGAAAACTATAGGAGAAAGAG
TGCTGCTTATGATGATGGCCATGAGGGTGCGGGAGCTGCTAGGAGAAAAGATTCCAATG
ATGATCACCATGCATATAGGAGAGGATATGGAGATTCTAGGAAGGGAACTAGAGATGAA
GATGACGATGGACGAGGAAGGAGGGATGACCCTAGTTATAGGAGAAGCAGTGGACACA
AGGATAGCTCCAACGGTGGAAGGGAAGAGCAGAAGTATAGAAGTGGAGAGACAGATGG
CAAAAGTCATCCGGAGAGAAGCCATCGAGGTGATAGGAGGAGATGAAGGGCGGGCCAG
AGTACAGAGACATTTCTTCTACTGGAGAGCTCCCTTGTAGGGATGGGTGCCTAATGCTG
TCCCGTCGGGGTGCTGCATGACTTTAAATAGCTGATTCTGACCATATCCTAAAATTCTGG
TTTTGATTGGGTCATTTTGCACTCAAAAAAAAAA
SEQ ID NO:56
GCGAGCATGAGGCCTTTCAATGGCGGATCCTCCATCGCTTGCCTCGTCCTGGTGATCGC
CGCCGGCGCGCTGGCCGAGTCGCAAGGGCCCCACCTCGGATCGGCTCGCGTCGTTTT
CCAGACTAATTACGGGGACATCGAGTTCGGGTTCTTTCCGGGCGTCGCTCCGAGGACG
GTGGATCACATCTTCAAGCTCGTCCGCCTCGGGTGCTATAACACCAATCACTTCTTCCG
GGTGGATAAAGGTTTTGTTGCCCAAGTCGCTGATGTTGCCAACGGGAGAACCGCGCCCA
TGAATGATGAGCAGAGAACGGAAGCCGAGAAAACTATCGTTGGGGAGTTCAGTAATGTC
AAGCATGTTAGGGGCATTCTTTCCATGGGGAGATATGACGATCCAGACAGTGCACAATC
CTCTTTCTCAATACTTCTCGGAGATGCTCCGCATCTTGATGGCAAGTATGCTATATTTGGT
AGAGTTACCAAAGGTGACGAGACATTGAAAAAGCTTGAGCAGCTACCTACTCGCCGTGA
AGGGATGTTTGTAATGCCAACTGAACGCATCACAATTCTATCATCATATTATTATGATACT
GGGGCTGAGAGTTGTGAAGAGGAGAATTCAACTTTGAGGCGCAGGCTTGCTGCTTCAG
CTGTTGAGGTAGAGAGACAGAGGATGAAATGCTTCCCGTGATGCTTTCTAGATCATTGG
AAGGAAGTGGTCTGATTTCCTTATCATTCATCCATTGTGCTTTGATGTATCCTCAGTGTAC
TGCTTTTAGCTATGTATAGATCGAGTCAACTCATTGAAGATATGGGGAAGGAAGATTCAA
CTGTTCCATTCATTCTGTAAGTTTAACAAAAAGCATGAAGATGAAGGTTACTTTGTGAGAA
CATTCTTGTCCACTAAGCTGACATTGTTAGTGGATTTACTTGAGAGGTGAGAAATGAAAG
ATCATAGACATTTTCACTTCCAAAGTTGGGAACTCAAGCCATTTATCCATATATATATTGTT
CCTGAATGCTGTTTTTTCAGCCAAAAAAAAAA
SEQ ID NO:57
AGAAAGTAAGGTTGATCAATTTGGCGGAAAGTCAGGCTTTTGGTGCAGCTTTTTGCAGAA
TCTCAGAGGTTTCGAACTCAGGATGCCGAACCCAAAGGTTTTCTTTGATATGCAGGTCG
GCGGTGCCCCAGCCGGCCGGATCGTGATGGAGCTCTATGCGGATGTGGTGCCAAAGAC
GGCTGAGAACTTCCGCGCGTTGTGTACCGGCGAGAAAGGCACCGGCCGCTCAGGCAAG
CCTCTGCACTTCAAGGGTTCGTCGTTCCACCGTGTGATCCCAGGGTTCATGTGCCAGGG
CGGTGACTTCACAAGGGGCAACGGTACCGGTGGAGAGTCGATCTACGGCGAGAAGTTT
GCCGATGAGAACTTTGTAAAGAAGCACACGGGGCCTGGCATCCTCTCCATGGCCAACG
CCGGCCCTAACACTAACGGCTCCCAGTTCTTCATCTGTACCGCCCAGACCTCGTGGCTG
GATGGTAAGCATGTCGTATTTGGTCAAGTTGTAGAGGGCTTGGAGGTCGTGCGCGATAT
CGAGAAGGTTGGATCTGGATCTGGCAGAACTTCAAAGCCGGTTGTCATTGCCGACTCTG
GACAGCTCGCTTGAATTTTTATTATTTACCTTCGCCTTTACGCTGCATACGTTAATAGGTT
ATTATTTCCTTCAACCATTACGCTGCATAGGTTGTTAGCGTATTGTTTCCCTTTACCATTA
CGCTGCATGACTCCCTAGGGTTTGTCAGCATAGGCGTTTTAAGGGTTTTTTGCATCTTTC
TACTCAAGATAGTCGCTGCATAAGTTCCTAGGGTTTGTCAGCAAAGAGGCTTCAAAGGTT
TTTGTGTCTTTTCTAGTTATTATAAACGCTTCATAGGTTCCTAGGGTTTGTCAGCATATAG
ATTTCAAGGGTTTTTTTAGATCTTTGGAGTTGAGATAAACGCTATGGCAATAACCCAGTAA
TGTTTGTTTTTATCATATGAAATTTTTACATCTGGAGTTGCATTCGCAGTAAAAAAAAAAA
SEQ ID NO:58
CTTGCTTGCGGCTCGGGATCCGATCAACAACACACGTTTCAACAGATACCGACGATTCC
TGTCCTTAGAAACACGTTCGACATCTTCTATCTCTTGTCGAAATCACACAAATCAAAATCC
AATCATGCGTTTCACCAGCATCACCAGCGCCATTGCGCTCTTCGCCGCCGCTGCTTCGG
CGCTCGACAAGCCGTTGGATATCAAGGTTGACAAGGCGGTCGAGTGCTCGCGCAAGAC
CAAGGCCGGCGACAAGATCCAAGTGCACTACCGCGGCACCCTCGAGGCAGACGGCAG
CGAGTTCGACGCCTCGTACAAGCGCGGCCAACCGCTCAGCTTCCACGTCGGCAAGGGC
CAGGTGATCAAGGGATGGGACCAGGGCCTCCTCGACATGTGCCCCGGCGAGAAGCGC
ACTCTCACCATCCAGCCCGACTGGGGCTACGGCAGCAGAGGCATGGGACCCATTCCCG
CCAACAGCGTTCTGATCTTCGAGACCGAGCTCGTTGAGATTGCTGGTGTTGCCAGAGAG
GAGCTTTAGATGGGTAATTTGTCCATGACAATCGTAGTCGAAGACACGATACGCTCTTAG
ATGGTACGGAAATCTGATTTGAGCTTTCTCAAAAAAAAAA
SEQ ID NO:59
TGAATTTTCCCGAGAAAACCAGGAACCTGCCAAATATCTCTCTGAAAGATCTCCATGGGT
AACCCGAAGGTGTTCTTCGACATGTCGATCGGCGGCCAGCCGGCCGGCCGGATCGTGA
TGGAGCTCTACGCCGACGTGGTGCCGCGCACGGCGGAGAACTTCCGCGCGCTCTGCA
CCGGGGAGAAGGGCGCCGGCCGCTCCGGGAAGCCCCTCCACTACAAGGGCTCGAGCT
TCCACCGCGTGATCCCGGGGTTCATGTGCCAGGGCGGCGACTTCACCGCCGGGAACG
GGACCGGCGGCGAGTCGATCTACGGCTCCAAGTTCGCCGACGAGAACTTCGTCAAGAA
GCACACCGGCCCGGGTGTCCTGTCCATGGCGAACGCCGGCCCGGGGACCAACGGCTC
CCAGTTCTTCGTCTGCACCGCCAAGACCGAGTGGCTCGACGGCAAGCACGTCGTGTTC
GGGCAGATCGTGGACGGGATGGACGTGGTGAAGGCCATCGAGAAGGTGGGGTCCAGC
TCCGGCAGGACCTCGAAGCCCGTCGTCGTCGCCGACTGCGGCCAGCTTTCCTAGATCG
GGCGGCCCCCGTCGGTCGCCGGGATCTCCCCCCCCCCCCCTCCCCGTGTGCGTGTGG
GATCTTATCTGATCTCCAGTCCGTTTCCGAGCATGGTTTTTTAGGGCTTTCCTTTTTTTTT
TCGGCTTTTAGCGTGTGGGGTGTTCGGCCAGATCTGGTTATGGGTCTCTCCGCGGATCC
TTGTTGTCTGTAGGATCGTCGAGACCTTTTATGTTGGTTATGTTGAACTGCCAGTCGGCC
ATTAACTGCTGGAAAAATAAGATAGGGAGCTTTTCTTGAAAAAAAAAA
SEQ ID NO:60
ATCTCCCATGGCTGCTGACGACTGACCCCCCGTCAAAGCGCAGAGAAGAGACAGCGAC
CGCGCACCCGCAAGGCCCTCCGCCCTCTCCGGCGAGTCACGGGAGAAAGACGGGAAA
TTTAAGCGGGGGAGGAGAGCGAATTCGGGGAATGGCCGTCGCGACGAGGTCGAGATG
GGTCGCCATGTCGGTGGCGTGGATTCTGGTCTTGTTCGGAACCCTGGCTCTCATCCAGA
ACCGATTAAGCGACACCGGAGCTTCGTCTGATCCGAAACTTGTTCATCGCAAAGTTGGT
GAGGAAAAGAAGAAGCCGGACGATTTGGAAGAAGTGACTCACAAGGTTTTTTTCGATGT
CGAGATCGGAGGAAAACCAGCGGGTCGAATTGTCATGGGCCTCTTTGGCAAAACCGTC
CCCAAAACAGTCGAAAACTTCCGAGCTCTTTGCACGGGGGAGAAAGGGATTGGAAAGAG
TGGGAAGCCCCTGAATTACAAAGGGAGCCAATTCCACAGGATCATTCCTAAATTTATGAT
CCAGGGCGGTGACTTCACTCTTGGAGATGGGAGAGGGGGAGAGTCAATCTATGGGAAC
AAATTTTCTGATGAAAACTTCAAACTGAAGCATACAGATGCAGGGCGCCTTTCGATGACA
AATGCTGGGCCAGACACAAATGGGTCACAATTTTTCATTACCACTGTGACAACTAGCTGG
TTGGATGGCCGCCATGTTGTGTTTGGAAAGGTGCTGTCTGGGATGGATGTCGTCCACAA
GATTGAGGCTGAAGGGGGACAGAGCGGCCAACCTAAAAGTATCGTTGTCATTTCAGACA
GCGGAGAATTAGATCTATGAACACTTCCCTGCACACTCCCCTCCAGGTTGCACTTGACAA
GTTTGACCTCATAGCTTACTCGACACGAATGTAGGGCTCAGGCATCGTGCACTGCTTTG
ATGCAAATGTTTTTTTTTCTTTAATGGTGAAGAAGAAAATGTGAGGCAAGCTTGTTGATTT
GATATTGGGACATACTTGAAAGAGCTTGGATCGATCATAGAAGCCAGCCATGAATAGAG
ATAACTTTTCTGAGTGTGAATTGGATATTACGTTGCAAATAGCCGAATGAAATGTAGAAAA
CAGGATGTTTGTTCTGAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:61
GGGAAGAATCCCGTCCGGTGGAAACCCGAACGCCGTTCGTTCCCTCTCGACGAGTCGC
CGGCGAAGCTCAGCTCGTCGAAACGAGCAAAGAATCTCTCCTCCTCCTCCTCCTCCTCC
TCCTCCTCCCGCCGCCGCCGCCTCCGCCGCCGCAGCAGCAGCAGTCGAGATGGCCGT
CACTCTGCACACGAATCTCGGCGACATCAAGTGCGAGATCTTCTGCGACGAGGTCCCCA
AAGCTGCCGAGCACAATGCAAGGGGAATACTCTCAATGGCTAATAGCGGGCCTAACACC
AATGGAAGTCAATTTTTCATCGCATATGCAAAACAACCACATCTGAATGGATTATACACCA
TCTTTGGCAGAGTGATTCACGGGTTTGAGGTCCTAGATATCATGGAAAAGACTCAAACAG
GGCCAGGGGACAGACCTCTTGCGGAGATCAGACTCAATCGTGTCACAATTCATGCAAAAT
CCTCTTGCTGGGTAGCCTTTCTTGATGGGTTTTTTTTTTTCTTATCTTGATAGGACTCTTTT
ACAGTTTTGCTTTAGGTTAGGGATCCCTGTAAGCTGATGATAGATATTGGAGATGGTACT
TGTAAGATTCAATTACATCGAACTACTTCAAACCTTGTATTGGAGCATTCCAAATACAAAA
GATCGAAAGATGCTTGTTGATATGTGCCGGTGGCTCATCTGTTGTGCTGGTTGCATGAT
GGTCTTTGTTTGCGATGTATTCGCCGTTCATCTCAAAAAAAAAA
SEQ ID NO:62
AGTCGAAGACGAAAGAGGAAAAGCAAGCTCCTCGAAGAGAGCTCGAGCATCTCCCATG
GCTGCTGACGACTGACCCCCCGTCAAAGCGCAGAGAAGAGACAGCGACCGCCCACCCG
CAAGGCCCTCCGCCCTCTCCGGCGAGTCACGGGAGAAAGACGGGAAATTTAAGCGGGG
GAGGAGAGCGAATTCGGGGAATGGCCGTCGCGACGAGGTCGAGATGGGTCGCCATGT
CGGTGGCGTGGATTCTGGTCTTGTTCGGAACCCTAGCTCTCATCCAGAACCGATTAAGC
GATACCGGAGCTTCGTCTGATCCGAAACTTGTTCATCGCAAAGTTGGTGAGGAAAAGAA
GAAGCCGGACGATTTGGAAGAAGTGACTCACAAGGTTTTTTTCGATGTCGAGATCGGAG
GAAAACCAGCGGGTCGAATTGTCATGGGCCTCTTTGGCAAAACCGTCCCCAAAACAGTT
GAAAACTTCCGAGCTCTTTGCACGGGGGAGAAAGGGATTGGAAAGAGTGGGAAGCCCC
TGAATTACAAAGGGAGCCAATTCCACAGGATCATTCCTAAATTTATGATCCAGGGCGGGTG
ACTTCACTCTTGGAGACGGGAGAGGGGGAGAGTCAATCTATGGGAACAAATTTTCTGAT
GAAAACTTCAAACTGAAGCATACAGATGCAGGGCGCCTTTCGATGGCAAATGCTGGGCC
AGACACAAATGGGTCACAATTTTTCATTACCACTGTGACAACTAGCTGGTTGGATGGCCG
CCATGTTGTGTTTGGAAAGGTGCTGTCTGGGATGGATGTTGTCCACAAGATTGAGGCTG
AAGGGGGACAGAGCGGCCAACCTAAAAGTATCGTTGTCATTTCAGACAGCGGAGAATTA
GATCTATGAACACTTCCCTGCACACTCCCCTCCAGGTTGCACTTGACAAGTTTGACCTCA
TAGCTTACTCGACACGAATGTAGGGCTCAGGCATCGTGCACTGCTTTGATGCAAATGTTT
TTTTTTCTTTAATGGTGAAGAAGAAAATGTGAGGCAAGCTTGTTGATTTGATATTGGGACA
TACTTGAAAGAGCTTGGATCGATCATAGAAGCCAGCCAGAAT
SEQ ID NO:63
CTTCTCTGCCCCTTGAATTTTCCCGAGAAAACCAGGAACCTGCCAAATATCTCTCTGAAA
GATCTCCATGGGTAACCCGAAGGTGTTCTTCGACATGTCGATCGGCGGCCAGCCGGCC
GGCCGGATCGTGATGGAGCTCTACGCCGACGTGGTGCCGCGCACGGCGGAGAACTTC
CGCGCGCTCTGCACCGGGGAGAAGGGCGCCGGCCGCTCCGGGAAGCCCCTCCACTAC
AAGGGCTCGAGCTTCCACCGCGTGATCCCGGGGTTCATGTGCCAGGGCGGCGACTTCA
CCGCCGGGAACGGGACCGGCGGCGAGTCGATCTACGGCTCCAAGTTCGCCGACGAGA
ACTTCGTCAAGAAGCACACCGGCCCGGGTGTCCTGTCCATGGCGAACGCCGGCCCGGG
GACCAACGGCTCCCAGTTCTTCGTCTGCACCGCCAAGACCGAGTGGCTCGACGGCAAG
CACGTCGTGTTCGGGCAGATCGTGGACGGGATGGACGTGGTGAAGGCCATCGAGAAGG
TGGGGTCCAGCTCCGGCAGGACCTCGAAGCCCGTCGTCGTCGCCGACTGCGGCCAGC
TTTCCTAGATCGGGCGGCCCCCGTCGGTCGCCGGGATCTCCCCCCCCCCCCCTCCCCG
TGTGCGTGTGGGATCTTATCTGATCTCCAGTCCGTTTCCGAGCATGGTTTTTTAGGGCTT
TCCTTTTTTTTTCGGCTTTTAGCGTGTGGGGTGTTCGGCCAGATCTGGTTATGGGTCTCT
CCGCGGATCCTTGTTGTTTGTAGGATCGTCGAGACCTTTTATGTTGGTTATGTTGAACTG
CCAGTCGGCCATTAACTGCTGGAAAAATAAGATAGGGAGCTTTTCTTGAAATGCGACCCT
CTTTTTACCCTGTAAAAAAAAAA
SEQ ID NO:64
AGAAGAGATTGCTGGTTTTGGCAGAGGGCGGAGGCGCAGATAAGAAGAGGGAGGCGAT
AGAGCAGCTCCTTCGCGAAAAATCCTCTTCATCGCTTCGCTTCGGGTTGGGGCGGAGGT
CGGCTCCTGTTAGGGTTTCGGCCGGCCAGATCCGCGGGCGGATCTCTCCTTTCTCCGG
GTTTGGATGAGCCCGGTCGCTGCGAACGCCATGGAGGAGGCGGCGGAACCCGAGGTC
CCAGCTCCCGTCACTCCCAGCAAGGACGACGCCGACACCGACGCCGCCGTGTCCCGCT
TCTTGGGATTCTGCAAGAGTAAATTGGGCCTGGCCGAGGGAAATTGCGTGCAGTCTTCG
ACGCTGCTGAGGAAAACCGCACATGTGCTGCGGTCGAGTGGGACTGTTATCGGTACTG
GAACGGCGGAAGAAGCAGAACGCTATTGGTTCGCTTTTGTGTTGTACACTGTTAGAAGG
GTAGGCGAGAGGAAAGCCGAGGATGAGCAAAATGGATCGGACGAGACTGAGGTCCCTT
TGTCCCGGATATTGAAAGCTTCAGTACTAAACCTCATCGACTTTTTCAAAGAGATTCCGC
AATTTGTTATCAAGGCTGGGGCAATCGTAAGCGGCATATATGGTGCTAATTGGGACAGC
AGGCTAGAGGCCAGAGAGATGCAGACAAACTACGTGCATTTATGCATTTTGTGCAAGTTT
TATAAACGTATATGCGGTGAATTCTTCATTCTAAACGATGCAAAAGATGATATGAAGTCCG
CTGATTCAAGCACTTCTGACCCCGTGATTATGTACCAACCTTTCGGATGGTTGCTGTTTT
TGGCGCTTAGAATACATGCCTTGAGCCGATTTAAGGACCTTGTTTCTAGCACAAATGCTT
TGGTCTCTGTACTGGCTATTCTAATAATTCATCTTCCAACCCGCTTCAGAAAATTCAGCAT
TTCTGACTCATCACAACTTGTTAAGAGAAGTGAGAAGGGGGTTGACCTTGTTGGATCACT
CGCTTATCGTTACGATACCTCAGAAGATGAGATCAAGAGAACATTGGAAAAAGCCAATAA
CGTGATAGCAGAAATTTTGGGGATAACTCCGCCTCCAGCTTCAGAGTGCAAAGCAGAGA
ATCTGGAGAATGTTGACACAGATGGTTTGATCTACTTTGGAAATCTCATGGAGGAAACAT
CTTTGTCATCCATTTTAAGTACTTTAGAAAAAATTTATGAAGATGCAACTCGTAATGACAG
TGAATTCGATGAGAGGGTATTTATAAATGACGACGACAGCTTGCTTGTATCGGGTAGCTT
GTCTGGGGCCGCCATTAATTTAACTGGTGCCAAGAGGAAATATGACTCATTTGCCTCCC
CTGCAAAGACTATCACAAGACCACTCTCTCCCAGCCGCTCTCCAGCATCTCACATCAAC
GGTATAATTGGTGGCACTAATTTGAGGATCACTGCTACTCCTGTGGCCACTGCTATGACA
ACTGCCAAGTGGCTTCGGACGTTCGTGTCCCCACTCCCATCAAAACCTTCGACTGATCT
ACAGGGATTTCTAGCGTCATGCGATAGGGATGTGACCAGTGATGTGATACGTAGGGCCA
ATATAATTTTGGAGGCCATCTTTCCAAACAGTCCTATTGGCGAGCGTACTGTAACTGGAG
GCTTGCAAAATGCTAATCTCATGGACAACATGTGGGCTGAGCAACGAAGGCTGGAAGCC
CTTAAGCTTTACTATCGGGTTTTGGAAGCTATGTGTAGAGCAGAGGCACAAATTTTACAT
TCGAATAATTTGACCTCTTTGTTAACAAATGAGAGGTTTCATAGGTGTATGCTTGCATGTT
CCGCTGAGTTGGTTCTTGCAACACATAAGACTGTGACGATGTTGTTTCCTGCAGTGCTAG
AGCGAACAGGTATTACAGCTTTTGACCTTAGTAAGGTGATAGAGAGTTTTGTTAGACATG
AGGAAACTCTTCCCAGAGAATTAAGAAGGCATCTGAATACATTAGAAGAGCGACTTCTAG
AGAACATGGTGTGGGAGAGAGGTTCTTCAATGTACAATTCCTTGGTAGTGGCAAGACCA
GCTCTTGCTCCAGAGATAAATCGGCTAGGCCTATTACCAGAACCAATGCCGTCCTTGGA
TGCCATTGCCTTGCTTATTAATTTTTCTTCCAGTGGATTGCCCCAGTCACCGGTGCAAAA
GCACGAGGCTTCTCCTGGTCAGAATGGGGATATCAGGTCTCCCAAGAGAATTAGTACAG
AATACCGGAGTGTATTAGTTGAAAGAAACTTCACTTCACCAGTAAAAGATCGACTGTTAG
CCTTGAGCAATATCAAGTCAAAGCTACCGCCACCTCCACTTCAGTCTGCATTTGCCAGTC
CAACTCGACCACATCCGGGAGGTGGAGGGGAAACATGTGCAGAAACCGCAATCCATATA
TTCTTTAGCAAGATTACTAAGCTGGCAGCCGTCAGAATTAATGCCATGCTTGAAAGGCTA
CAACTCTCTCAGCAGATAAAGGAAGGTGTTTATTGCCTATTTCAGCAAATACTCAGTCAG
CGAACTAATCTCTTCTTTAATCGCCATATTGATCAAGTAATACTCTGCTGTTTCTACGGTG
TTGCGAAGATCAATCAAATAAACCTGACCTTTAGGGAGATCATTTACAACTATAGGAAGC
AACCCCAGTGCAAGCCACAAGTGTTCCGCAATGTTTTTGTTGATTGGTCAACCCGGCGG
AATGGGAAAGCGGGGAATGAGCATGTGGATATTATCTCTTTCTACAATGAAATATTCATT
CCTTCTGTGAAACCGCTGCTCGTTGAACTTGGGCCCACAGGAGCAACTACTAGAACAAA
CCGGACTTCTGAAGTTGGCAATAAAAATGATGCTCAATGCCCTGGGTCACCTAAGATATC
TTCTTTTCCAACTCTCCCGGATATGTCCCCTAAGAAAGTATCTGCATCCCACAATGTCTAC
GTATCTCCACTGAGATCATCTAAGATGGACGCTTCAATCTCTCATAGCTCGAAAAGCTAT
TATGCTTGTGTTGGAGAGAGCACCCATGCCTACCAAAGCCCTTCCAAAGACCTTGTTGC
CATCAACAGTCGCCTGAATGGTAACCGAAAGGTGAGAGGCACGCTTAATTTTGATGATG
TTGACGCTGGCCTCGTCAGTGACTCTATGGTGGCCAACAGCCTCTACCTTCAGAATGGG
AGCAGTATGTCTTCATCAACTGCTAAATCATCTGAGAAACCAGAATCATAAAGTTAAATGA
TCCCCAGTCCCATTCCCATGCATGTAAATTCCAGTTTTCTTGTTCCCCTCAACTGGCTAAA
TCAGGAAGTGGAAGCGTGCAGTGAGTTATTTGTATAATAGCTGCCCTTTCCTTATCTTCT
GTCCATGAGACTGCTGCAAGCAAGTTACTTTAGCAGTTTAGTTTAATAAAATCCAATCACC
TTCACATGGTGAAAAAAAAAA
SEQ ID NO:65
CCCACACCGCTCGGCTAGGGTTTCTCTCCAACAGCGCACGGACAGGAAGAGCAGAGAC
CCGCCGGCGGTTACACCCTCCTCCGCCGCCCCCCGCCGAATCGAGGCCGTGGCAGCC
CCACCGGAGCTCCGAGAAGCGAGGGCGTTTTCCGATTCGATCGCCGACCATGAGGCCG
ATTCTGATGAAGGGCCACGAGAGGCCGCTGACGTTCCTCAAGTACAACCGGGAGGGCG
ACCTCCTCTTCTCCTGCGCCAAGGACCACACCCCCACCGTCTGGTTCGCCGACAACGG
CGAGCGCCTCGGCACCTACCGCGGCCACAACGGCGCCGTCTGGTGCTGCGACGTCTC
CCGGGATTCGATGAGGCTGATTACCGGAAGCGCGGACACGACGGCGAAGCTGTGGAG
CGTGCAGAACGGGACGCAGCTGTTCACGTTCAACTTCGATTCGCCGGCGAGGTCGGTG
GATTTCTCCATCGGCGATAAGCTCGCGGTGATCACCACCGATCCTTTCATGGAGCTGCC
TTCGGCTATTCACGTTAAGCGCATTGCCAGAGACCCTGCCGACCAGGCTAGTGAGTCTG
TCCTCGTCCTTCGGGGCCATCAAGGAAGAATAGCTAGAGCTGTCTGGGGGCCTCTGAAC
AAGACTATCATAAGTGCTGGAGAGGATGCTGTAATTCGCATTTGGGACTCTGAGACTGG
GAAGCTTCTCAGAGAGTCAGACAAGGAAACTGGCCATAAAAAGGCAGTAACTTCACTTAT
GAAATCCGTTGATGGTTCCCATTTCGTCACTGGTTCACAAGATAAATCTGCCAAGCTATG
GGACATCAGGACGTTGACTCTAATTAAGACATATGTGACAGAGCGCCCTGTTAATGCGG
TTACAATGTCTCCGCTTCTTGATCATGTGGTCCTTGGAGGTGGCCAAGATGCTTCGGCT
GTGACCATGACTGATCATCGTGCTGGAAAGTTTGAAGCCAAGTTTTTTGACAAGATTCTT
CAAGAAGAAATTGGAGGTGTGAAGGGACACTTTGGGCCTATTAATGCTTTGGCTTTCAAT
CCTGATGGGAAAAGTTTCTCAAGTGGAGGTGAGGATGGTTATGTTAGGTTGCATCACTTT
GACCCGGATTACTTCAACATCAAGATTTAGCTTTTGAGAAAATAGTTTTGCAGTTTTGAAA
ATTTGCTTCTGTATTGGCTCTTGATCATTGTGGAGAGAGTATTGTGACAAAGAAACAACT
GCCTTCTCGGAGGTTGGGGGAAGTACACATTGTGGAAAGATATCGTTGTCTGTATGTTG
GGGCGAGTTTTATGGGAAAAGGCTCTGCCTTCAGAGTGGCGGAATTTTTCCAGAATGTT
TATTATCATGTTATTTTCCTATCTTAGATTGGAAATTTTGTTTTCAAGGTTAGGTTGTCTAA
ATTATGAGGATCTTAATGAAAGGAGCTATTTCCTAGCAAAAAAAAAA
SEQ ID NO:66
GAACGAGGAAAGCCCGCGGATTTGGACAAGTATAAGAACCGGGCTGCCATTGCAGCGA
GCTTCTCTGCGTCTCGCCCCTTCGAGTCCGGCTCCGTTGATCTTTCGGTTTTTTTTTTGC
CCAATTTTCGGTTTTTTTTTTTGGGTTTTCCATAGGTTTTCTTTGTTCTTTGGTCGGGCTTC
GATCCGCATTGAACCTTCTTAGCAGATTAGTCAGTGAAGTGCGAACACCCACGAATTTTT
TTTTTTTTTTTCCTTGGCAAAAGATCGGATCCCAGTGTTTTGGGATCAATCCCTCTGTAAA
TCGCCGCAGGAAAGGAGGAGAATTTCAAAGCATTTTCGTGTTTTCGATTTACCCTCTTGT
AAGATAGCTAGTGATCAGCTCTTTCCGTGCTCAAATCGTTGGTTGCTTTTTTGAGGTTTC
CTGGATCTTGGGTCAGATTGAATTTTAGGGCTTTCTTCTCTTTTGTTATTTTGGGGGTGCT
TACCCTCTTTTGGCGGTTCCTCGTAATTTATTTCTTGTGGTTCGAAATGGACAAGAAGAG
GACGGTGGTGCCGCTCGTGTGCCACGGGCATTCGCGGCCGGTGGTCGATTTGTTTTAC
AGCCCGATCACGCCGGATGGCTTCTTCCTCATCAGTGCGAGCAAAGATTCTAGCCCAAT
GCTGCGAAATGGAGAGACTGGAGATTGGATTGGAACCTTTGAGGGACATAAAGGTGCAG
TCTGGAGCTGCTGCCTGGACACTAATGCTCTACGTGCTGCATCCGGCTCTGCAGATTTC
TCCGCGAAACTGTGGGATGCATTGTCCGGGGATGATTGCACTCTTTTGAACACAAGCA
CATTGTCCGGTCATGCGCCTTCTCAGAGGACACTCATCTCTTGCTGACCGGCGGTGTCG
AGAAAATTCTTCGAATTTTTGACTTGAACCGACCAGATGCTCCTCCCAGAGAAGTTGACA
ATTCACCAGGTTCAATAAGAACTGTTGCATGGCTCCATAGTGATCAGACCATATTAAGTT
CCTGTACTGATATTGGTGGCGTAAGGTTATGGGATGTAAGGAGTGGGAAAATTGTTCAAA
CTTTGGAGACAAAATCCCCTGTCACTAGTTCTGAAGTGAGTCAGGATGGTCGCTATATTA
CCACAGCTGATGGTTCGACCGTTAAATTCTGGGATGCAAACCACTTTGGGTTGGTGAAG
AGCTACAACATGCCCTGCAATATCGAGTCAGCCTCATTGGAACCAAAGCTTGGGAATAAA
TTTATTGCTGGTGGAGAAGACATGTGGGTTCACATTTTTGATTTCCATACTGGAGAAGAG
ATTGGATGCAACAAGGGTCATCACGGTCCTGTCCATTGCGTGCGATTTTCACCCGGTGG
GGAATCCTACGCATCTGGATCAGAAGACGGCACTATAAGAATTTGGCAGACTGGGCCTG
CAAATAATGTCGAGGGTGATGCAAATCCAAGCAACGGCCCAGTAACTGGTAAAGCAAAA
GTCGGGGCAGATGAAGTTACTCGTAAAGTGGAGGACTTACAAATTGGCAAAGAAGGGAA
AGATTGGCGAGAAGGATAATGCTGCAGACGCATGATGTGCTCTTTAGGTTTGATCTGTCT
GTTTTGTCTATCCTGCGAGTTTCGAGCATGTGCGTGTGTGATGCTTGTGCTTTGCAAATA
AAGTGGACATGCTCTTGATAAGTTTCTTTCCTCTGCCATCCTTTTAAATGAATCTCCCTTG
GAAGCTCGCTTTCTCCTCTTGTTCCTCTTGTAACAGCAGACACCGCCGAAGTTTGGAGAC
TTCATTTTTATGCTTTCCGGATTGTGCTTAATGAAGAAGAATCTGGAATTTTGCTAAAAAA
AAAAAAAAA
SEQ ID NO:67
CCTAATTCCCAGGGGAAGGGGAGTATATAGCCTCGCTTCCATCCAGCCATTCATCTCTG
GTCGCGCCTTTTGCCTCCAGCTCGCCGAAGAAGAAAGCCCCCCCGGCAGAACCGCCGC
CCCCCTCCTCCTCCTCCTCGCCGGAACCACCGTGAGCAATGGCGGAGGGACTAATCCT
GAAGGGCACGATGAGGGCCCACACCGACATGGTGACGGCCATCGCCATCCCGATCGAC
AACTCCGACATGGTCGTCACCTCCTCCCGCGACAAGTCCATCATCCTCTGGCACCTCAC
CAAGGAGGAGAAGGTCTACGGCGTCCCCCGCCGCCGGCTCACCGGCCACTCCCACTTC
GTCCAGGACGTCGTCCTCTCCTCCGACGGCCAGTTCGCCCTCTCCGGATCCTGGGACG
GCGAGCTCCGCCTTTGGGACCTCGCCACCGGCGTATCCGCACGCAGGTTTGTGGGCCA
CACCAAGGATGTGCTTTCTGTGGCATTCTCCATCGACAATCGCCAGATCGTGTCGGCAT
CTCGTGATCGCACCATCAAGCTGTGGAATACATTAGGAGAGTGCAAGTACACAATTCAG
GAGGGCGAAGCTCATACTGATTGGGTCAGCTGCGTGAGGTTCAGTCCCAACACACTGCA
ACCCACCATTGTGTCTGCTTCATGGGAATAGAACCATCAAGGTCTGGAACTTGACCAATTG
CAAGCTTAGAAACACCTTGGCGGGGCACAATGGCTATGTGAATACAGTAGCTGTATCTC
CTGATGGGTCGCTCTGCGCTAGTGGTGGCAAGGATGGAGTGATTTTGTTGTGGGACTTG
GCAGAGGGCAAGAGGCTGTATAATCTGGAGGCTGGTGCTATAATTCACTCTCTTTGCTT
CAGCCCCAATAGATACTGGCTCTGTGCCGCTACTGAGAACGTATTAAAATCTGGGACC
TCGAAAGCAAGAGTATCGTCGAAGATTTGAGGGTTGATTTGAAAACGAAGCTGACAAAA
CTGATGGAACCACGACCGCTGCTTCAAATAAGAAGGTTATATACTGCACGAGCTTGAACT
GGAGTGCAGATGGAAGCACCTTGTTCAGTGGATACAACGATGGCGTGATCAGAGTTTGG
GGCACTGGGCGTTACTAATTCGAGGAACTGCCAGTCGGCCTTTTCTTGATCTTTTCTTGA
AGACTTAGGATGATGTCTTGAAAGTGTTGCAAGTAATTTTTGTTTCACATGACGTGCGGT
TGTTTCGGTTTTGTGGCCAAGTAATCTTTGGTTTTTTTCCCATAATTTTTAAAATGTACGCT
GGTCCTCAATGTAGTTGATTTGTAGCTGATTCTGTTCCAAAAAAAAAA
SEQ ID NO:68
CTCTTCTCTTCCTCTCCGTCGCCGCTCTCTCTCTCCTCCTCCTCCTCCTCCTCCCGCAGC
AGCAGAGAAACCCTAATCGAAGAGCAGACATGGCGGAGGGGCTGCACCTGAAGGGGAC
GATGAAGGCCCACACGGACATGGTGACGGCGATCGCGGTCCCCATCGACAACGCCGAC
ATGATCGTGACCTCCTCCCGCGACAAGTCCATCATCCTCTGGCACCTCACCAAGGAGGA
CAAGGTCTACGGCGTCCCCCGCCGCCGCCTCACCGGCCACTCCCACTTCGTCCAGGAC
GTCGTCCTCTCCTCCGACGGCCAGTTCGCCCTCTCCGGCTCCTGGGACGGCGAGCTCC
GCCTCTGGGACCTCGCCACCGGCGTCTCCGCCCGCAGGTTCGTGGGCCACACCAAGG
ATGTGCTCTCGGTGGCCTTCTCGATCGACAACCGGCAGATTGTGTCGGCGTCTCGTGAC
CGCACGATCAAGCTGTGGAACACTCTGGGAGAGTGCAAGTACACCATTCAGGAAGGTGA
GGCTCACAATGATTGGGTGAGCTGTGTCAGGTTCAGTCCCAACACCCTTCAGCCGACCA
TCGTGTCCGCATCTTGGGACCGCACTGTGAAAGTGTGGAATTTGACCAACTGCAAGCTG
AGGAACACCCTGCAGGGGCATTCTGGCTATGTGAACACCGTGGCTGTGTCTCCTGATGG
GTCACTTTGTGCGAGCGGTGGCAAGGATGGAGTGATATTGCTTTGGGACTTGGCAGAAG
GGAAGAAGCTGTACTCACTGGAGGCTGGTGCCATTATTCACTCACTTTGCTTCAGTCCAA
ATAGGTACTGGCTCTGCGCAGCCACAGAGAACAGCATTAAGATCTGGGATCTGGAGAGC
AAGAGCATTGTCGAGGACTTGAGGGTTGATTTGAAGAATGAAGCTGATATGAGTGATGG
AACTACGGGGGCCATGAGCTCAAATAAGAAGGTCATCTACTGCACGAGCTTGAACTGGA
GCGCTGATGGAAGCACCTTGTTCAGTGGTTACAATGATGGCGTGATCAGAGTCTGGGGA
ATCGGGCGTTATTAGGAAATTTTTGACGATCGCTACTGCTGAACTGTTTTTCAAAGATTCC
CTTTTTCGTATGTCTTTTTAGTGGTTTAAAGAACCTTTTAGGATCTTGTTTAAGATGCTTGG
AGCAATATATGCCTGCCTCTACTCTTATGTTTAGAAGATTTGGCCTTTGGGCAGTCATATT
TTTTTGAGAACAGAGATATAAAAAAAAAA
SEQ ID NO:69
CGCTCTCTCTCTCTCTCCTCCTCCTCCTCCCGCAGCAGCAGAGAAACCCTAAGCGAAGA
GCAGACATGGCGGAGGGGCTGCACCTGAAGGGGACGATGAAGGCCCACACGGACATG
GTGACGGCGATCGCCGTCCCCATCGACAACGCCGACATGATCGTGACCTCCTCCCGCG
ACAAGTCCATCATCCTCTGGCACCTCACCAAGGAGGACAAGGTCTACGGCGTCCCCCG
CCGCCGCCTCACCGGCCACTCCCACTTCGTCCAGGACGTCGTCCTCTCCTCCGACGGC
CAGTTCGCCCTCTCCGGCTCCTGGGACGGCGAGCTCCGCCTCTGGGACCTCGCCACCG
GCGTCTCCGCCCGCAGGTTCGTGGGCCACACCAAGGATGTGCTCTCGGTGGCCTTCTC
CATCGACAACCGGCAGATTGTGTCGGCGTCTCGCGACCGCACGATCAAGCTGTGGAAC
ACTCTGGGAGAGTGCAAGTACACCATTCAGGAAGGTGAGGCTCACAATGATTGGGTGAG
CTGTGTCAGGTTCAGTCCCAACACCCTTCAGCCGACCATCGTGTCCGCATCGTGGGACC
GCACTGTGAAAGTGTGGAATTTGACCAACTGCAAGCTGAGGAATACCCTGCAGGGGCAT
TCTGGCTATGTGAACACCGTGGCTGTGTCTCCTGATGGGTCACTTTGCGCAAGCGGTGG
CAAGGATGGAGTGATATTGCTTTGGGACTTGGCAGAAGGGAAGAAGCTGTACTCACTGG
AGGCTGGTGCCATTATTCACTCACTTTGCTTCAGTCCAAATAGGTACTGGCTCTGCGCAG
CCACAGAGAACAGCATTAAGATCTGGGATCTGGAGAGCAAGAGCATTGTTGAGGACTTG
AGGGTTGATTTGAAGAATGAAGCTGATATGAGTGATGGAACTACGGGGGCCATGAGCTC
AAATAAGAAGGTCATCTACTGCACGAGCTTGAACTGGAGCGCTGATGGAAGCACCTTGT
TCAGTGGTTACAATGATGGCGTGATCAGAGTCTGGGGAATCGGGCGTTATTAGGAAATT
TTTGACGATCGCTACTGCTGAACTGTTTTTCAAAGATTCCCTTTTTCGTATGTCTTTTTAGT
GGTTTAAAGAACCTTTTAGGATCTTGTTTAAGATGCTTGGAGCAATATATGCCTGCCTCTA
CTCTTATGTTTAGAAGATTTGGCCTTTGGGCAGTCATATTTTTTGAGAACAGAGATATAAT
ACCTGTTTTTACGGCAAAAAAAAAA
SEQ ID NO:70
TTTTTTTTTTGGTTTTTCTTTCAATACTTATCTTTTATTCATCAAAGCGAGCTGAGGCGGAG
AGGGAGGGAGGACATGGTTAAAGTTCAAAAGCTAAAGTACAACAGCGTTTACATGGAAG
ACACTTATTAGAGAGACCCATCCATGAGTGTAAGCTTCATGACCGAGTCTCCGGCACGA
CAAAAACCGCCGACGCGATCTTCTACTTCTTGCCACCGGTTGCCTTCTTCTTAGGCTGAG
CCGGCTTCGGCTCGGCCTTCTTGGGGGCCGGCTTGGATGTCAGGGGTTCCAGCGCCG
CCATTCGCGACAACCACGCCGGAGAACGGGACGATGAGCAGCAACTCGCCGGCCTTCC
ACCGCGATTCCGACGACGACGACGACCAGGGCGAGGTCTTCCTCGACGACTCCGACAT
CATCCACGAGGTCGCCGTCGACGACGAAGATCTTCCTGACGCCGACGATGAGGCAGAC
GAGGCAGAGGAAGCGGATGACTCTTTGCACATATTCACAGGCCACAATGGGGAAGTGTA
CAGCCTTGCATGCAGTCCCACAGATGCAACACTAGTGGCAACTGGGGCTGGAGATGATA
AAGGGTTTCTGTGGAGGATAGGTCATGGAGATTGGGCTGTTGAGCTCCAAGGTCATAAG
GATTCCATCTCTAGTTTAGCGTTTAGTCTGGATGGGCAGTTGCTTGCATCTGGAAGCCTT
GATGGAGTCATACAGATTTGGGATGTTCCATCTGGAAATCTTAAAGGCACCCTTGATGGA
CCTGGAGGGGGCATAGAGTGGATCAGGTGGCATCCCAAGGGACACATAATATTAGCAG
GTTCGGAGGATTCCACTGTTTGGATGTGGAATGCTGACAAGATGGCCTACTTGAATATGT
TTTCAGGGCATGGTAACAGCGTAACGTGTGGAGATTTTACTCCTGATGGTAAAACAATTT
GTACCGGCTCGGATGATGCAACATTGAGAATTTGGAATCCCAAGAGTGGGGAAAACATT
CATGTTGTAAAAGGTCATCCATATCATGCCGAAGGACTAACAAGCATGGCAATAAGCTCT
GATTCAGGTCTTGCTATTACTGGTGCCAAAGATGGATCTGTTCGCATTGTCAATATATCA
AGTGGAAGGGTTGTTAGTTCTCTGGATGCTCATGCAGATTCTGTCGAATTTGTGGGACTG
GCTCTAAGCTCCCCGTGGGCTGCAACTGGAAGCTTGGATCAGAAGCTCATTATATGGGA
TCTCCAGCATTCTTCTCCCCGCGCCACCTGTGATCACGAGGATGGAGTGACTTGCTTGA
GTTGGGTCGGTGCATCAAGATTTTTGGCCTCGGGTTGTGTTGATGGCAAAGTACGAGTG
TGGGATAGCCTTTCTGGTGATTGCGTAAGAACATTTCATGGGCATTCCGATGCTATTCAG
TCCCTGTCGGTGTCTGCCAATGAGGAGTTCCTTGTTTCTGTTTCAATTGATGGAACTGCT
AGGGTTTTTGAGATTGCAGAGTTTCACTAGGGATAACACCGGATGCACGATATTGTTTTT
GTCTGTACATTTTCTGTACTCTAAAACTATGATCTTTTCTTTTTGTAGCAGAAACACCCCC
CCCCCACCCCCCCACAAACCCCCCACAAAAAAGAAAAGCATATTATCTTTTTGAAGCGGA
AATATATATTTATGCTCATACATAAGTAATGTACTACTTGACAAGATGAGGAAAGTATTTG
TTTCGCGGTGTGATTTAGTTGATCAATTTACATTTTAAAAAAAAAA
SEQ ID NO:71
CAAAACTTCAACGAAGTTCCGTTGATTAAGGAATGGGAACCTCTCAGCATCAATTATCTT
CATGCCTCCAGCTTCTTCCAAGACGCCGAGGAAATAAGAACCTCATTTTCAGACGGACG
ATGGCCAGCGGTGGCGCCGCCGCCGTCGCCCCGCCGCCGGGCTACAAGCCCTACCGC
CACCTGAAGACCCTGACCGGCCACGTCGCCGCCGTCTCCTGCGTCAAGTTCTCCAACG
ACGGCACCCTCCTGGCCTCCGCCTCCCTCGACAAAACCCTAATCATCTGGTCCTCCGCC
GCCCTCTCCCTCCTCCACCGCCTCGTCGGCCACTCCGAGGGCGTCTCCGACCTCGCCT
GGTCCTCCGACTCCCACTACATCTGCTCCGCCTCCGACGACCGGACCCTCCGCATCTG
GTCCTCCCGCTCCCCCTTCGACTGCCTCAAGACCCTGCGCGGCCACACCGACTTCGTCT
TCTGCGTCAACTTCAACCCGCAGTCCAGCCTCATCGTCTCCGGGTCGTTCGACGAGACG
ATCCGCATCTGGGAGGTCAAGACCGGCCGGTGCCTCAACGTGATCCGGGCCCACTCCA
TGCCCGTCACCTCCGTCCACTTCAACCGCGACGGCTCCCTCATCGTCTCCGGGAGCCA
CGACGGGTCGTGCAAGATCTGGGACACCAAGAACGGGGCCTGCCTGAAGACGCTGATC
GACGACACGGTGCCCGCCGTCTCCTTCGCCAAGTTCTCGCCCAACGGCAAGTTCATCCT
CGTCGCCACTCTCAATGACACCCTCAAGCTGTGGAACTATGCAACTGGGAAGTTCTTGA
AGATTTACACGGGTCATAAGAACAGCGTCTACTGTTTAACTTCTACATTCTCTGTTACGAA
TGGGAAGTACATTGTTAGCGGCTCAGAGGATCGGTGCATCTGCATATGGGATCTCCAAG
GGAAAAACCTAATTCAGAAACTCGAAGGCCACTCTGATACGGTCATTTCCGTCACGTGC
CATCCATCGGAGAACAAGATTGCATCTGCTGGCCTTGATTCCGATAGAACTGTGAGAATT
TGGCTTCAAGATGCTTAACTGCTTAGGCTGTGATGGATAGATGGATTTCAAGCTGTTTCC
CTGGATTTTGGGGAGGGTAATTTGCAGGTGAAGTGGGCTCGAGCAAGCAAAGTGCTCTT
CCTTGTTCTGATGTGGGTTCAAGCTTTGCCATCCTCGCATGTGATGGATACAATAATCTA
CCAAAGGCTACTTAGACCGACAAGCAGGATGAATGGGTGAAGTCCAAAATAATTGGAAC
ACTTTTGGGGATAAGTTGTTACTAGATTATGATCCAGCATAGATACTCGATGTGGTATAG
AATTTATCCAATGTACTCCTAAATGTAGATACATCGTGTATTGATTTCTCCCCATGTTTGA
GGATACTCAGCCTGAATTACGTATCTAATGGGTTGTCCTGAAAGTGTAAGTCCTACGTCA
TTCCTGATAAGGCTCTTTTTCTGTTTCTCATAGGTGCCCCATTGGTTCTGAAGAAACACTT
CCATCAAAATAAGTTTCATTGAATGTGTACGGGTTCTTTTCTTAGTAACAGCTCCATCTAT
TTGTAAAAAAAAAA
SEQ ID NO:72
CTGATTCGCAGATTCGCGCAGACGAGGAAATTAGTGGTCAAGAGGAGAAGAGAACTTAG
TCATCATGCCTTCTCAAAAGATCGAGACAGGTCACCAAGACATAGTCCATGATGTAGCTA
TGGATTACTATGGAAAGCGTGTGGCAACAGCTTCGTCTGATACCACTATCAAGATAATAG
GCGTGAGCAATAGCTCTGGATCACAGCACCTTGCTTCATTGAGTGGTCATAAAGGCCCT
GTTTGGCAGGTTGCTTGGGCACACCCTAAATTTGGATCAATCCTTGCTTCTTGCTCATAT
GATGGACAAGTTATCTTATGGAAGGAGGGTAATCAAAATGATTGGGCTCAAGCTCATGTT
TTCAATGATCACAAGTCATCCGTGAATTCCATCGCTTGGGCGCCTCACGAACTAGGTCTC
TGCTTGGCTTGTGGTTCATCTGATGGGAATATCTCTGTCTTTACTGCCCGACCTGATGGT
GGTTGGGACACAACTAGGATAGAGCAAGCTCACCCTGTTGGTGTCACTTCTGTTTCATG
GGCCCCGTCCATGGCTCCGGGTGCTCTAGTTGGATCTGGTTTGCTAGATCCTGTTCAGA
AGCTGGCCTCTGGAGGGTGTGACAATACAGTGAAGGTGTGGAAGCTTTATAATGGAACT
TGGAAGATGGATTGCTTCCCTGCCCTTCAGATGCACTCTGATTGGGTCAGAGATGTGGC
TTGGGCACCTAATTTGGGGCTTCCGAAGTCCACAATTGCTAGTGCTTCTCAAGATGGGA
CTGTTGTAATATGGACTGTGGCCAAGGAAGGAGAGCAATGGCAGGGTAAAGTTTTGAAG
GATTTCAAGACTCCAGTTTGGCGGGTTTCCTGGTCGCTTACTGGAAATTTGTTGGCAGTT
GCTGATGGGAATAACAACGTAACTTTGTGGAATGAGGCAGTGGATGGTGAGTGGCAACA
AGTTACTACAGTTGAGCCATAGATTTGGAGTTTGTCTGTTTTGTTATCTACTTTAATGTGT
TTTGCCTTGCCTAGGACCTCTTTGATAGACTGTTATTGTTATGGTGTTTCTCTTTTTGATTT
TAGAGTTTGTGAATACAATTATTTTAACATAATCTTTTCTTGGGAGATGAATGAAGGGGTA
TTATCAAAAAAAAAA
SEQ ID NO:73
CCAACCAAACCCCAACACTCTCCAGCTCTACTCTCTCTCTCTACTTTCTCTCTCCATCTCC
TTCGCCGGAATTCGAATGAGATGAAGATCGCGGGCCTCAAGTCCGTCGAGAACGCCCA
CGACGAGTCGGTGTGGGCCGCCGCGTGGGTGCCGGCGACGGAGTCCCGGCCGGCGC
TGCTCCTCACCGGCTCCCTCGACGAGACCGTGAAGCTGTGGCGGCCCGACGAGCTCGC
CCTCGAGCGGACCAACGCCGGCCACTTCCTCGGCGTCGTCTCCGTCGCCGCCCACCCC
TCCGGCGTCATCGCCGCCTCGGCCTCCATCGACAGCTTCGTCCGGGTCTTCGACGTGG
ACACCAACGCCACGATCGCCACGCTCGAGGCACCGCCGTCGGAAGTCTGGCAGATGCA
GTTCGATCCCAAGGGCACCACTCTAGCGGTGGCAGGTGGAGGGAGCGCATCAATCAAG
CTTTGGGACACTGCCACATGGGAACTGAATGCAACCCTCTCGATTCCTCGTCCAGAACA
GCCCAAACCCTCCGAAAAAGGCAACAAGAAGTTTGTCCTCTCTGTCGCGTGGAGTCCTG
ATGGCAGAAGGCTTGCCTGCGGTTCAATGGATGGTACAATCTCTATATTCGACGTGGCT
CGGGCCAAGTTTCTGCACCACCTGGAAGGCCACTTCATGCCAGTGCGATCTCTCGTATT
TTCCCCAGTTGAGCCACGGCTGCTTTTTTCTGCATCCGACGATGCTCATGTGCACATGTA
TGATTCCGAAGGTAAGTCCTTGGTGGGGTCCATGTCTGGCCATGCTAGCTGGGTATTGA
GTGTTGATGTCAGCCCGGATGGAGCAGCACTTGCGACAGGTTCAAGTGACAGGACTGT
GAGGCTGTGGGATCTCAGTATGAGGGCTGCCGTTCAGACGATGAGTAACCATTCGGATC
AAGTTTGGGGGGTTGCCTTTCGACCAATGGCGGGTGCTGGTGTCAGAGCTGGTGGTCG
GCTTGCTAGTGTATCTGATGACAAGAGCATATCACTCTATGATTACTCATGAAGGTATATC
TGCAAATTGTGAAAAGATAAGTTCTTGGAGCCACTGATCGTTAACCTTTTGTTTGAATAGT
CTGCAAGAGATTTTCTTGGTTTCAGATCATTTTTATTTGTATAATTAGCTGACTATGATGAC
TTTCGGAGAAATGGCTCAATATAGGTCTGATCTGATGGGTGCCGTGTATTAACTTCGAGG
TCAATTTGATACCCGATATTTCTGAGAGGTTTTTTTGTGAAGTTATATTGACTATGGGGGT
TACTATCACTAATCAGCTCAGCTGTTGATATAAAAAAAAAA
SEQ ID NO:74
GCAAGATCGATTGCTCTGTAGAGCAAAGGAAAGGAGAAAAGCATGGAGATCGATCTCGG
AAACCTCGCATTCGACGTCGATTTTCATCCATCGGAGCAGCTCGTCGCCTCCGGCCTCA
TCACCGGCGACCTCCTCCTGTACCGCTACGGCGACGGCTCCTCGCCGGAAAAGTTGCT
GGAAGTGCGAGCGCACGGCGAGTCTTGTCGGGCTGTTCGGTTTATCAACGATGGGAAA
GCGATTCTGACCGGTTCTCCGGATTGCTCGATTCTCGCGACGGATGTGGAGACGGGAT
CCGTCGTCGCTCGAGTCGAAAATGCTCACGAGGCTGCTGTCAATAGGTTGGTCAATTTG
ACGGAGTCTACTATTGCCACGGGAGATGATAATGGGTGCATCAAGGTTTGGGCACCAG
ACAACGTTCTTGCTGCAACACATTTAGTGCCCATGAAGATTTCATATCGGACATGACATTT
GCATCTGATTCCATGAAACTTGTTGTCACAAGTGGAGATGGGACTCTATCTGTTTGCAAT
CTTCGAAGTAATAAAGTCCAAACTCGGTCCGAGTTCTCTGAAGATGAGTTACTATCTGTT
GTCATTATGAAGAATGGAAGGAAAGTTGTTTGTGGAACACAAAGTGGAACATTATTACTA
TATTCATGGGGATTCTTCAAGGATTGCAGTGATCGCTTTGTGGATCTCTCCCCCAGCTCA
GTTGATGCGTTGCTAAAGCTTGATGAGGATAGGATCATTGCAGGAACTGAGAACGGACT
TATCAGTCTGATAGGAATATTACCCAACAGAATCATCCAACCGATTGCAGAGCATTCAGA
TCATCCTATTGAGCGCCTTGCCTTCTCTCATGATAAAAAGTTTCTTGGCAGCATATCGCAT
GATCAGACATTGAAGCTCTGGGATTTGAATGACATACTAGGGTCTGAAGATTCTCCATCA
AGTCAAGCAGCCATAGACGACAGTGATAGCGATGAGATGGATGTGGATGCGAATCCTCC
CGATTCTAGCAAAGGGAACAAGAAGAAGCATTCAGGCAAAGGAAATGATGTTGGCAATG
CCAACAACTTTTTCGCCGACTTAGGCGATTGATGGCTCCAATCTTCATATGCACATGACA
AGTTGAGGTTCATACAAACATATCAAGGCTCGATTCTAGTCGGCACATTTGGCAAGCAAT
TCGTACATGTTTTCATTGACAGTGTGTTCTCAGGGGGGAATTTTGACAGGACTCCATGAC
CAGGGGCTTCTTATTGTTTATCTGACATTCCCAGGGATGTAATACCCCATCTTAAACTCAA
ACAAGTTTATGTATTTTGAGTGTTCTAGGACAACTATATATACAAAGTAGAAATCCAAATT
GCAAAAGGCATATTCCTCCTAAAAAAAAAA
SEQ ID NO:75
GGGGAAAGTCAAGCTGTCTTTCATCATGGTCATTTGTCCGATCACGATCGCCCGTTCCG
GCGATCGAGACGGCGACTTCAGCTGCTGATTAGCCACGTTCGATTCCGTCTGATTTGGT
CGGAAAAAAAAAGGGGTTTGAAAATGAGTCAGCAGCCGTCGGTGATCCTCGCGACCGC
CAGCTACGATCACACCATCCGATTCTGGGAGGCCAAGAGCGGGCGATGCTACAGAACG
ATTCAGTACCCGGATTCGCAAGTCAACCGGTTGGAGATAACTCCACATAAGCGGTACCT
GGCTGTGGCGGGAAATCCCAGCATAAGATTGTTTGATGTCAACTCAAACACCCCTCAAC
CGGTGATGAGTTTCGACTCCCATACCAATAATGTCATGGCTGTGGGGTTTCAGTATGATG
GAAATTGGATGTACTCAGGCTCTGAAGATGGAACTGTTAGAATATGGGACCTGAGAGCT
CGTGGTTGCCAAAGAGAATATGAAAGCCGTGGAGCTGTAAATACAGTTGTTCTGCACCC
AAATCAGACTGAACTAATATCAGGAGACCAAAATGGAAATATCCGTGTATGGGATTTGAC
AGCGAATTCATGCAGCTGTGAGTTGGTACCAGAGGTGGATACAGCGGTCAGATCCTTAA
CAGTTATGTGGGATGGGAGTCTGGTGGTTGCTGCAAATAATAATGGAACATGTTATGTTT
GGCGCTTGTTGCGTGGGAGTCAGACAATGACCAACTTTGAGCCACTTCATAAACTACAA
GCACATAATGGATATATTCTTAAATGTCTTCTTTCACCTGAGTTTTGTGAACCCCACAGAT
ATTTGGCTACTGCTTCTTCTGACCATACCGTCAAGATTTGGAATGTTGAAGGTTTCACTCT
AGAAAAGACTTTGATAGGACATCAACGCTGGGTGTGGGACTGCGTTTTCTCGGTAGATG
GTGCTTATCTTATAACAGCTTCCTCGGACACAACAGCAAGACTCTGGTCCATGTCGACTG
GTCAAGATATCAGGGTGTATCAAGGACATCATAAAGCAACTACTTGCTGTGCCCTCCACG
ATGGTGCGGAAGGGTCTCCAGGCTGATATTGGAAAAACGCTAGGGCTTAAACCGCATCA
GTTCGCAATTCCCCTAACGGAGGCCGACTGTTTTCCTTTTGAGTAAAAAAGTTCGTTGTT
GTCCGAACAATCACATTCTTCGCAGATTAAAGTGCACATAAGTTGCTAGCGTTGGACTAT
CCAAGTTGTCTTTTGAATATCGAAATGATGAATTTCCGAGGGAAACAGGGCAAGAGATGA
AAGCTTGAAGCTTTTTAGCACGGCCAGCTCATGACTCGTGGATCTCAGATGTCCCAAGTT
TTCCGGTCTTTCTACTTTATTGCAGAAAGTGGTGGGTTCACTGAATTGCTTGTAAATATAC
TCAATAATCAATGTAGTTGTTCGTCGATAAAAAAAAAA
SEQ ID NO:76
CCTCCTCGAAGCAGGCAGCTGAAACTACTAGCTGGACGGAGCTCGGGAGAGAGAGCAA
AGATGGAAGATGCCATGGACATGGAAGTGGAAGTGGAAGTGGAAGCGGAAGAACACTC
TCCTTCCAGCTCGAATCCGAGCGGCTCTTCCTTCCGCAGATTCGGGCTCAAGAACTCGA
TCCAGACCAACTTCGGCAGCGACTACGTCTTCGAAATTACTCCCAAGTTTGATTGGTCGC
TGATGGGGGTATCGTTGTCGTCCAACGCGGTGAAGCTGTATTCCCCAACGACGGGTCA
GTACTGCGGAGAGTGCAGGGGGCACTCCGATACCGTCAATGGCATTTCGTTCTCGGGA
CCGTCGAGTCCCCACGTCTTGCACTCTTGCTCTTCCGATGGCACCATCCGAGCTTGGGA
CACCAGGTCTTTTAAAGAGGTTTCTTGCATAAGTGCCGGGCCATCGCAGGAGATCTTCA
GCTTTTCGTTCGGTGGTTCAAGTGACAGTCTTCTTTCTGCTGGATGTAAATCTCAGATAC
TCTTTTGGGATTGGAGAAACAAAAAGCAGGTTGCATGTTTAGAAGATTCTCATGTGGATG
ATGTTACCCAGGTTTGCTTTGTCCCTCACCATCAAAATAAGCTTATTTCAGCTTCAGTGGA
TGGGCTGATATGTATATTTGATACGGCTGGAGACATCAATGATGATGAGCATATGGAATC
GGTGATTAATGTGGGAACTTCAATTGGCAAAGTGGGGATATTTGGACAGACATTTGAAAA
GCTCTGGTGTTTGACACATATTGAGACCTTAAGTGTTTGGGACTGGAAAGAGGGAACAA
ATGAAGCCAACTTTGAAGATGCCCGAAAATTAGCATCTGATAGCTGGTCACTAGATCATA
TTGATTATTTTGTTGACTGTCACTCAGCTGAAGAAGGTGAAGGTTTGTGGGTGATTGGTG
GCACAAATGCCGGGACTTTAGGATACTTCCCTGTAAAGTACAAGGGAGGGGCAGCAATA
GGATCCCCAGAGGCCGTTCTTGGAGGTGGCCACTCGGATGTGGTTAGGAGCGTTTTGC
CTATGTCGGGCATGGCAGGGACAACTTCCAAAACCCGAGGCATTTTTGGATGGACGGGT
GGCGAGGATGGTCGCTTGTGTTGCTGGCTTTCTGATGACTCTTCCGCTACAAGCCGATC
GTGGATGTCAAGCAATCTGGTCTTGAAGTCATCGAGAAGTCACCACAAGAAAAATAGGC
ACCAACCTTACTAGTTTTTTTGTCAGATTTGCTTCTTTTATTTCCATGTATCAAGCCGCATC
AATGTTTGTCGCTGCAATTAACATGTGTGCAGTCGATCCCGTATTTGATTGATTTTTTGTA
ATATATTACAGAGCGACCCGTGTTCTGTACTAAAAAAAAAA
SEQ ID NO:77
GAATCGACGAACAACGCATCGCCGATGCAGCGAAATCCTCTGGATCATTGATTCTCCCT
CCCTTCCCCCACGAGTTAAAAGAAAAATCAATCAAGGAATTTCTACAGGGAACGAACAAA
AATTCAACGAGCTCCCGGATTCGTTATTGCCTCGCCCCGCCCCGCCCGTCGATTTCGCG
GTTGGTTGGGTTACATGTCTCAGCACCAAGAGTATCCGATGGAATACGCAGCAGATGAT
TACGATGTGGGAGAAGTAGAAGATGATATGTACTTCCATGAAAGAGTAATGGGCGATTCA
GACACCGATGAGGATGAGGAATATGATCATCTGGATAATAAGATTACCGATACCTCTGCT
GCTGATGCTCGGAGAGGTAAAGATATCCAGGGGATTCCTTGGGAAAGACTGAGCGTCAC
TCGTGAAAAATACAGAAGAACTAGAATAGAGCAGTACAAGAACTATGAAAATGTACCACA
ATCCGGAGAAAGTTCGGAAAAGGATTGCAAACCCACAAGAAAAGGTGGAAACTATTATG
AGTTTTGGCGAAACACAAGATCTGTGAAGTCCACAATCCTTCATTTTCAATTGAGAAACTT
GGTTTGGTCCACGACAAAGCATGATGTTTACCTTATGTCGCACTTCTCGATCATTCACTG
GTCGTCATTGACTTGCAAGAAGACTGAAGTGCTTGATGTTTATGGACATGTAGCTCCTCG
TGAGAAACACCCTGGAAGTCTCTTAGAGGGTTTTACACAGACACAAGTCAGTACTCTTGC
AGTACGGGATAAATTACTGATTGCTGGTGGTTTCCAGGGGGAGCTTATCTGTAAGAACTT
AGATCGTCCTGGTGTTAGCTACTGTTGCAGAACAACTTATGATGACAATGCTATCACTAA
TGCGGTTGAGATTTACGATTATCCCAGTGGCGCCGTACATTTCATGGCATCGAATAATGA
CTGTGGAGTCAGAGATTTCGACATGGAGAAATTTGAGCTCTCCAGACATTTCACCTTTCC
TTGGCCGGTGAATCATACTTCTCTGAGTCCTGATGGTAAGCTCCTCGTCATTGTTGGTGA
CAACCCTGAGGGGATAGTGGTGGATTCTCAAAGAGGAAAGACCATTAGGCCACTACAAG
GGCACTTGGATTTCTCATTTGCATCTGCATGGCATCCGGATGGTCACATATTTGCCACTG
GAAACCAAGACAAGACTTGCCGTATCTGGGACATTCGCAACTTATCAAAGTCCGTTGCTG
TACTCAAGGGCAACCTTGGAGCCATCCGGTCCATTCGGTTCACCTCCGATGGGCGATTT
ATGGCCATGGCAGAGCCTGCTGATTTTGTCCATGTCTATGATGTGAAGAGTGGGTATGA
GAAGGAGCAGGAGATCGATTTTTTTGGCGAGATCTCTGGTGTGTCTTTCAGCCCTGACA
CTGAATCGCTATTTGTTGGGGTGTGGGATCGCACCTATGGCAGTCTCCTTCAGTATAACC
GATGCAGGAATTATTCATATCTCGACTCCATGTAGTGAGGAGATGCCTGCACCGTTTCTT
AACTAAATAACTGCAATTATATAGGATTCTTGAAATATGGCTAAAGTGGCTTCAGCGCATT
GTGTAAATGTAGATAGGTGATATATTTCTCGTTGCAATGTAGGGTAAGAGAACAAGTATG
GTGTTACTTGGTGGATCATTGTCTTCCTTCTTTAGGTCATAGAATGCTTGCTCTTCATTGG
AAGGTTGAGCGAGAAGAGAGAGAGAGAGAGAGAGAGAGAGAGCTTTATGGTGATGAAG
TGTTCTGTTGTTTGCAGAATAATGGAGATGTATTTAATAAAGGTTGGTGGTGTTTGGGTG
AAGATTGAAGCAAACTCTTCTCATTGCCAATAAGAACCCTATTCACTTTCTCTGTCAA
SEQ ID NO:78
CTCTCTCTCGGCGGCTCCACCTCAAAACCCTATCCCGAAACCTCCTCGCCGGGGAGAG
GTCGAATCGAGCGCCGTCGAGTGCCGATCCGACCGGGAGGGACGAAGGAAATGGGGG
CGAGCAGCGACCCGAACCCGGACGTCTCCGACGAGCACCAGAAGCGGTCGGAGATCTA
CACGTACGAGGCGCCGTGGCACATCTACGCCATGAACTGGAGCGTCCGCCGCGACAAG
AAGTACCGGCTCGCCATCGCCAGCCTCCTCGACCACCCGGCCGCCGCCGCCGCCGTC
CCCAACCGCGTCGAGATCGTCCAGCTCGACGACTCCACCGGCGAGATCCGGGCCGACC
CGAACCTCTCCTTCGACCACCCTTACCCGGCCACCAAGGCGGCCTTCGTCCCGGACAA
GGACTGCCAGCGCGCCGACCTGCTCGCCACCTCCAGCGACTTCCTCCGCATCTGGCGC
ATCGCCGACGACTCCTCCCGCGTCGACCTCCGCTCCTTCCTCAACGGCAACAAGAACAG
CGAGTTCTGCCGCCCCCTCACCTCCTTCGACTGGAACGAGGCGGAGCCCAAGCGCATC
GGCACCTCCAGCATCGACACCACCTGCACCATCTGGGACATCGAGCGCGAGACCGTCG
ACACCCAGCTCATCGCCCACGACAAGGAGGTCTACGACATCGCCTGGGGCGGCGTCAG
CGTCTTCGCCTGCGTCTCCGCCGACGGCTCCGTCCGCGTCTTCGACCTCCGCGACAAG
GAGCACTCCACCATCATCTACGAGAGCTCCGAGCCCGACACCCCGCTCGTGCGGCTCG
GGTGGAACAAGCAGGACCCCCGCTACATGGCCACCATCATCATGGACAGCGCCAAGGT
CGTGGTGCTCGACATCCGCTACCCTACCATGCCGGTCGTCGAGCTGCAGAGGCACCAG
GCCAGCGTGAACGCCATCGCGTGGGCGCCGCACAGCTCCTGCCACATTTGCACGGCG
GGGGACGACTCGCAGGCACTCATCTGGGACCTGTCATCAATGGCGCAGCCTGTGGAGG
GCGGCCTGGACCCGATTCTGGCCTACACTGCAGGGGCGGAGATTGAGCAGCTGCAGTG
GTCGTCCTCGCAGCCTGATTGGGTCGCGATCGCCTTCTCTTTGAAACTCCAATAATCACA
TTTACCATCAACAGGCATCAGCAACATACTGTTGTAGTGTAATTAATTTAATGGAAAAGTT
CATTTGTGTGGTGGGCCTTGGAAAGTGACTTATTGGTGCCCTGTTCATTGGACTTGAGTG
ACTGGTGAAACTGATGTTGCTGATGGCTGTTGATGGTAAATGTGATTATTGGAGTTTCAT
AAAAAAAAAAAAAAA
SEQ ID NO:79
GTCAACACTCAAAAGCAGCAGCTCAAGTGGCCAGCGCTTCCCGCGGATTAGAGAAATCC
GCAGATCGGCGATGCGCGGCGGCGGCGGCGGCGGCGACGCGACGGGCTGGGACGAG
GACGCCTACCGGGAGAGCGTCCTGAAGGAGAGGGAAGTCCAGACCCGGACCGTGTTC
CGCGCCGCCTTCGCCCCCTCGCCGAGCCCTAGCCCCAGCCCCGACGCCGTCGTCGTC
GCCTCCAGCGACGGCTCCGTCGCCTCCTACTCCATCTCCGCCTGCCTGTCCGACCACA
GATTGCAGTCGTTGCGGTTTGCTGACGCGAAGTCTCAAAACGTTCTAGAGGCGGAACCC
GCTTGTTTCCTTCAGGGGCATGATGGACCTGCCTATGATGTCAAATTTTATGGCGAGGGT
GAAGATTCTCTTCTACTGAGTTGTGGTGATGATGGACGGATTCGGGGGTGGATGTGGAG
AGACATTACGAGCTCAGAGGCGCATGACCATTCACAAGGAAACAGTGCAAAACCAGTAC
TTGATTTGGTGAATCCCCAAAGTAGAGGTCCTTGGGGTGCCCTTTCACCAATCCCCGAG
AACAATGCCCTGGCTGTAGATGTTAAGCGGGGATCCATTTATGCTGCAGCTGGTGATTC
TTGTGCATATTGTTGGGATGTGGAATGTGGTAAAATAAAAACTGTTTTCAAAGGGCATTCT
GACTACCTGCACTGTATAGCTGCACGGAACTCTTCTAGTCAGATTATAACAGGTTCTGAG
GATGGGACAGCAAGAATATGGGATTGCAGAAGTGGTAAGTGTGTCCAAGTAATTGATCC
AGACAAAGACCATAAGAAAGGTTTCTTCGCAAGTGTTAGTTGTCTTGCTCTTGATGCAAG
TGAGAGTTGGTTGGTCTGTGGTCGGGGTCGGGATTTATCTGTCTGGAGTATATCTGCTT
CTGATTGCATAGCAAAAATCTCAACCAATGCTCCTGCACAGGATGTCTTATTTGATGATAA
TCAAATATTACTGGTTGGAGCTGAACCTTTGATCAGTCGTTTAGACATGAACGGGGCGGT
TCTTTCTCAAATACATTGTGCACCTCAGTCGGTGTTTTCTGTTTCTCTGCATCAATCTGGT
GTAACTGCAGTCGGGGGTTATGGAGGCTTAGTCGACGTAATCTCCCAGTTCGGAAGCCA
CCTCTGCACATTTCGCTGCAAATGCATATGAATGAAAAACTGCATCGACATGATATTCTT
GTAACGAACCTGCAGTGCTAAGCCCATGCCTCGGAGGAAATATGGTTGCTCCAAAGAAC
CTTGTGCATTTGCCTTAGTTAACCTTATCTGGCAAGGAGTTGACCTGACTTTTGCTAAAAT
CTGAGGTTTTGGATCTGTGGGCTGCTCTTTTCTTTCACTAAGTTTGACAAAGAGGGTGCT
TGCTTGCAGCCTGAACATGCCGTGAAATGTAGTAGCGTAGAGAAAAAAGAGAGCTCGAC
TTCGTTATCGCAGGAGACTTTTGTGTGGACCATTATACTCCTTTTGATAGATTTTTCCTTC
TAAAATTTTGTATCTTGTAAAAGATTTGGTGATTTCAACGTTTAGAAGGCTAAT
SEQ ID NO:80
TCTCTCTCTCCTCTCCCTCCCAACCTTTCTCTCTCCTCCCGACCTCGACTGCGGATGGCG
TCGTCGCCGGAGTCACGCGCTGCCTGCTGAAGCGATACGGCGGCGACCATGGAGGCT
CCAATCATCGATCCGCTGCAGGGAGATTTCCCTGAAGTGATCGAGGAGTACTTGGAGCA
TGGCATCATGAAGTGCATCGCCTTCAACCGCCGCGGTACCCTCCTCGCCGCTGGATGC
ACCGATGGAAGTTGCATTATATGGGACTTTGAGACCCGGGGCGTCGCAAAAGAGCTCCG
AGACAAAGAATGCACTGCTGCAATTACGAGTGTCTGCTGGTCCAAGTATGGTCACCGCA
TACTTGTGTCTGCTTCTGATAAGTCTCTTATTCTTTGGGATGTTCTAAGTGGAGAGAAGAT
AGCACATACGACTCTTCAGCATACCGTTCTACAAGCTTGTCTTCATCCTGGTTCCTCAAC
TCCATCTATTTGCCTGGCCTGCCCTTTCTCATCTGCTCCCATGATTGTTGACTTGAACACT
GGAAGCACAACAGCTCTTCCAGTTTTGACTGCTGATGTGAGCAATGGAGCCACTCCTCT
GTCCCGCAACAAGACATCAGATACATCAGTAACTTACTCTCCATGCAATGCCTGCTTTAA
CAAGCACGGGGATCTGGTTTATGCAGGAACCTCGAAGGGAGAAATACTTATAATTGATC
ACAAGAATGTCAGAGTGTGTGCCATCGTCCTTGTTTCTGGTGGCGCAGTTATAAAAAATG
TCGTGTTTAGTAGAAATGGACAGTACATGTTGACAAATTCCAATGACCGGCTGATTAGAA
TCTATAAAAATCTTCTACCTCCTAAAGATGGATTAAAGATGCTTGATGAATTGAATGAGAG
CTTCAATGAATCAGATGATGTGGAGAAATTGAAGGCTATTGGATCGAAGTGCTTAGAACT
TCTCCATGAGTTTCAAGATTCCATCACTAGGGTCCAGTGGAAAGCACCTTGTTTTAGTGG
TGATGGAGAGTGGGTGATTGGTGGTGCTGCAAGCAGAGGAGAGCACAAAATTTATATAT
GGGGATAGGGCCGGGCATCTTGTTAAAATTCTTGAAGGCCCAAAAGAAGCATTGATGGAT
TTGGCATGGCATCCTGTTCATCCCATTATCATTTCTGTTTCTTTGACTGGCTTGGTTTATA
TATGGGCTAAAGACTATACTGAAAATTGGAGTGCATTTGCTCCAGATTTCAAAGAGCTCG
AGGAGAATGAGGAGTATGTCGAACGAGAAGATGAATTTGATCTGGTGCCTGAAACTGAA
AAGGTAAAAGGACTAGATGTTCATGAAGATGATGAAGTTGATGTTTTGACTGTGGAGAGG
GATTCAGTTTTTAGTGACTCAGATATGTCCCAGGAGGAACTATGCTTTTTGCCTGGAGTT
CCTTGTCTCGATATTCCTGAGCAGCAAGACAAGTGTGTTGGCAGTTGTTCAAAGTTGCCA
GATGGTAACCATTCTGGATCGCCACTTTCAGTAGAAGCTGGTCAGAATGGAAATGCAAG
CAACCATAATTCAAGCCCTCTCGAACCTATGGAGAATTCAACTGCTGATGACACAGACGG
AGTGCGTTTAAAAAGAAAACGCAAACCTTCGGAGAAGGGGTTGGAATTGCAGGCTGAGA
AGGTCAAGAAACCTGTGAAACCTTTAAAATCGTCTGGTAGATTGTCAAAAACTAATAAAC
CTGTGATTGATCCGGATTCTAGTAATGGTGTATATGGAGATGATGGTTCTGATTGATTTG
ATTAAATTGCAGTTCTCATTTCTTAAAAAAAAAA
SEQ ID NO:81
AAAAAACAGACGACCAGAGGAAGATTTCTCTTTCCTGATTCTAGGGTTTCCTTCCCCTTT
CGTCTTCTTCGTCTTCCGCTCCATCAATCGGCTCCGACACCCGCAGGCACCGAGGCAGC
TTCGTCGCCCCCGCGGCCGAAGAGGACGGGTCAAGGTTGCAAATTTTGCCCAATCCCG
CGCAGATTCCGCGCCCCTCGCCGGCGTCGCGTCGTGTCGGAGGAGGAAAATGGCACG
TCCGAGCAGGGCGGCCCGGAAATGGGGAGCGTGCTGATCGCCCGAGAGGGATCGGGC
GCGGCGGCGAGAGCTGGATTTGGGTTCCCTCGCAGGAGAATTGTGGATGCATGAGGCT
GCTCCGAGGATATAATGAGGGGTGTCTCGTGGCCCGAAGACGGCAATAATCCTTCGACC
TCGAGTTCTTCGCAGCGCAATCAGCAGCAGGCGCATGCGCCCCGAGCTGTTTCTGGGC
ACGCTGCAAGTCATCCAAGTGCTAGCAATATCTTTAAACTCCTAGTGCAAAGAGAGGTCT
CTCCACGGTCAAAACATTCATCAAAGAAATTGTGGAGAGAAGCTTCAAAGTGCCAGCCCT
ATCCATTCCAGCAAAGTTGTGAAGCAGTGAGAGATGTGAGACAGGGTCTCATATCATGG
GTGGAGTCAGCGTCACTGCGGCATTTGTCTGCCAAATATTGTCCACTTGTGCCTCCTCC
AAGATCAACAATTGCAGCTGCCTTTAGTCCTGATGGAAAAATACTTGCTTCTACACATGG
AGATCATACGGTGAAACTCATTGATTCTCAGACAGGAAGTTGCTTGAAGGTGTTAAGGG
GTCATCGGAGGACACCATGGGTGGTGAGATTCCATCCGCTGTATCCGGAGATCCTTGCA
AGTGGCAGTCTGGATCATGAAGTTCGCCTGTGGGATGCAAATACCGCAGAGTGCATAGG
ATCACGCAATTTTTACCGTCCTATTGCATCAATTGCATTCCATGCTCGAGGAGAGCTTCTT
GCTGTTGCATCTGGTCACAAGTTGTACATATGGCACTACAATAGGAGAGGAGAGACATC
ATCCCCAACTATCGTTCTGAGGACACAGCGATCTCTTAGAGCCGTGCATTTCCACCCACA
TGCAGCTCCATTTCTTTTAACAGCTGAGGTCAATGACCTTGACTCAGCAGATTCAGCTAT
GACTCTTGCAACTTCTCCTGGTTATTTGCACTACCCTCCTCCTACTGTATATTTTGCAGAT
GCTCATTCTCATGAAAGATCTAGGTTGGCGGATGAACTGCCTCTCATGCCTTTGCCATTA
TTGATGTGGCCTTCCTTTACTAGAGACGATGGAAGAGTACCCTTGCAGCGCATAGATGG
GGATGTTGGTCTTAATGGACAGCAAAGGGTAGATTCATCTTCTTCAGTGCGCCTTTGGAC
ATATTCAACCCCATCAGGGCAGTATGAGCTCCTTCTGTCTCCAGTTGAAAGTGGCAACTC
TCCTTCCATGCCTGAGGAAACGGGAAATAATGCTTTCTCAAGTGCAGTGGAAGCTGAAG
TAAGTCAATCTGCAATGGATACTGTGGAGGATATGGAAGTGCAACCTGAAGAGAGAAAT
ACTCAATTTTTCTCTTTTAGCGACCCCAGATTTTGGGAGCTGCCTTTATTGCATGGATGGT
TGGTTGGTCAGACCCAAGCTGGTCCACGAAGTGTACGTCAATCAAGTCCTGGAGATATT
GAAACTCAATCTGCTTTTGGTGAGGTTGCAAGTGTTTCACCAATCACATCTGGAGTGATG
CCAGTTAGCATGGACCCGTCACGGTTTGGTGGAAGATCTGGTTCTAGATATCGCTCTCC
TGGGTCTCGGGGGGTGCACGTGACTGGACCTAATAATGATGGACCACGAGATGAAAAC
GATCCTCAATCTGTTGTTAGTAAACTCAGGTCTGAACTTGCAGCCTCGCTGGCTGCAGCA
GCATCTACGGAGTTACCCTGCACTGTGAAGCTTAGAATATGGCCACACGACGTAAAAGA
TCCTTGTGCACAGCTTGATTTAGAAAGTTGCCGCTTAACTATTCCACATGCTGTTCTATGC
AGTGAAATGGGAGCCCATTTTTCTCCATGTGGGAGATTTTTAGCTGCCTGTGTGGCATGT
GTGCTGCCTCATTTGGAATCTGATCCCGGATTACATGGTCAAGTCAATCAGGATGTCACA
GGAGTGGCAACCTCACCTACAAGACACCCAATTTCTGCTCATCAAATCATGTATGAGCTA
AGGATATATTCCTTAGAGGAGGCAACATTTGGAATTGTGCTTGCTTCACGACCAGTAAGA
GCCGCTCATTGTTTAACCTCCATTCAGTTCTCTCCGACATCTGAACACTTGTTACTTGCCT
ACGGCCGTCGCCATAGTTCACTTCTTAAGAGTATTGTCATTGATGGAGAGAATACGGTGC
CTATTTACACCATTTTGGAGGTATACAGAGTTTCTGACATGGAACTTGTGAGAGTTCTTCC
AAGTGCAGAGGATGAAGTCAATGTTGCATGCTTTCACCCTTCAGTTGGAGGTGGCCTCA
TTTATGGAACGAAGGAAGGCAAGTTGAGGATTCTCCACTATGACAGCTCTCATGGCTTGA
ATCTAAAGTCATCTGGTTTTCTTGATGAAAATGTGCCAGAGGTGCAGACTTATGCTTTGG
AATGTTAGTAGCCAGCAACTGCAACATGTTTGTGATACTTACTGGTCAACCTCAAAGATC
TTGGGTCGAGGGCAGCTGTACCTATATCTTCACAACTGTAGACCCATCTGCGGCCAGGC
TCGTGGTTCCTCAATCAATTACTAGTTAAACGCAATTCTGTACCAACTACCATTCTTTGCT
GTTTGTGGGGATTTATGTTATCAACTGCAAGAGCATTAGGAAGTTCATCTAACTTGAGAT
AGATGGGTTAGTGCGAGTTGGAAATGTCGGGATTCGTGTTGTATATTTGTTTGTCCGCTT
TGATATTTACGGACTTTTGTAAAAAAAAAA
SEQ ID NO:82
CGAAGTAAACCTCTCGGAACCTGAATCTGCTCTCAAAATCTCGTCGGTGCCGGAGAAAG
TTTCCCCGATCGATCGCCTCCTCCGCCGAGGAGCGCATGGATTCCGCCGTCGCGATCG
CGGCGCTGTCGCTCGTGGTCGGAGCCGCGATCGCCCTCCTTTTCTTCGGCAATTACTTC
CGCAAGCGAAGATCCGAGGTCGTGGCCATGGCCGAGGCGGACCTCCAGCCGCATCCC
AAGAACCCGTCGCGGCCTCCGCCGCAGCCCGCCGCCAAGAAGGTCCACGCCAAGTCC
CATGCTCACGGCGCCGATAAGGATAAAAACAAGAGGCATCATCCTTTGGATCTGAATACT
TTAAAAGGTCACGGGGATTCAGTCACAGGGCTATGTTTCGCTTCTGATGGACGAAGTTT
GGCAACAGCTTGTGCTGATGGTGTTGTGAGGGTGTTCAAGTTGGACGATGCCTCAAACA
AAAGTTTCAAATTTCTGAGGATTAACTTGCCTGCTGGAGGCCATCCAACTGCTGTTGCAT
TTGGTGATGGTGTATCATCAGTAATTGTGGCATCTCAACATTTGTCTGGTTGTTCTCTGTA
CATGTATGGAGAAGAGAAGCCTACAAATTTGGACAGCAATAAGCAGCAGACTAAGCTAC
CTATGCCTGAAATCAAGTGGGAGCATCACAAAGTTCATGAGCAGAAAGCAATATTGACCC
TTTCTGGGGCTGCTGCAAATTATGACAGCGGTGATGGGAGTACAATAATTGCATCTTGTT
CAGAAGGAACTGACATCATAATTTGGCATGCAAAAACTGGGAAGATTTTGGGGAATGTC
GATACAAATCAATTGAAAAACACTATGTCTGCTATCTCACCCAATGGGCGATTTATTGCTG
CTGCTGCTTTTACTGCTGATGTTAAGGTCTGGGAGATTGTTTACTCAAAAGATGGTTCTG
TGAAGGGAGTTACAAAAGTCATGCAGCTTAAGGGACACAAGAGTGCAGTGACTTGGTTA
TGCTTCACTCCAAACTCAGAACAAATAGTTACAGCATCAAAGGATGGCTCTATAAGAATTT
GGAACATCAACGTTCGATACCACCTTGATGAGGATACGAAGACTTTAAAGGTGTTTCCAA
TCCCATTGCAAGATTCAAGTGGTACTACTTTACACTATGAGCGCCTCAGTCTATCCCCTG
ATGGAAAGATACTGGCAGCAACCCATGGTTCAATGTTGCAGTGGCTGTGCATTGAAACT
GGAAAGGTTTTGGACACAGCTGAAAAAGCTCATGACGGTGACATCACTTGCATGTCCTG
GGCACCACAGAGTATTCCAACAGGCGATAAAAAAGTTAATGTTTTGGCGACGGCCAGTG
GTGATAAAAAAGTGAAGCTTTGGGCAGCCCCACCACTTCCCTCATAGATGAGTTAATACA
GGAGGCTGAAATTCAGCATCAGAACAATGTGAAGCTTGCTCCTACGAATACGCTTTGCC
ATTTGAGATGCATTATGGGGTTCTAAATTATATCTGAGCCACTGTCACTCTTCACAAGAAA
GCCTCAGAGTGTCCTATGGAACGTTCCAACACTGTCCATTCTGACAGCATCCGGACATA
CAGTTCATACTTCGACGTTGTTAGCCTTTCCTTAAAAAATTTTGCCGATAAACTTGGCCTA
ATGCAGAATCCAACAGGAAGAGTTTTCTGTATTCCTTCTGTTTATTTGTCTTCTCTTTGGA
GGGATATTGAGAAGGTTGTTGTTGAGATGGTGGCTGAACTGCGAGGCGATGACCCAGTA
TAATCTATCTGAGCCGGACTGATCTGGACGGCTCAATCAAGTGCTTTTCCTTTTAATTTG
CTTGCTTTTTGGGACAGGCTATTACACTTTCAAATTTGCTTGTGTGGCTGAAGCACCATAT
TCTACTAAGTTAGTCAGCCTCAGAGTGGGAGGACGAACCTATGCGGATGCGCATTGGCA
AATCTGAACAATCATTCTGTAGAACACTAGAGTCTATATGCTTGACTGTATCGGTTAATTA
ATTCAAGATGACCACGATATGTTGTTGCACATGTGATCGTCAAAACTGGTCGTTCTTCAA
AAAAAAAA
SEQ ID NO:83
AGAAGAGGCCACCTAAAACCCTAGAAGCCAGAGACTGCAAACCTCATAAAACCTCCAAA
GCGCGCACCAAGAAACCCCCCGCAGAAGAAGAGCGCGAGCTTTCGCATTCGCATCAAT
GGAGGTGGAACCCAAGAAGGCGTCCAAGACCTTCCCGGTGAAGCCGAAGCTGAAGCCG
AAGCCGAGAACCCCTTCTGGCAAGACCCCAGAATCCAAGTACTGGTCCTCCTTCAAGAC
CACCCACCCCCTCGACAACCTCTCCTTCTCCGTCCCTTCCCTCGCCTTCTCCCCTTCCC
CTCCCCACCTCCTCGCCGCCGCCCACTCCGCCACCGTCTCCCTCTTCTCCCCCCACCG
CACCACCATCTCCTCCTTCTCCGACGTCGTCTCCTCCCTCTCCTTCCGCTCCGACGGCC
AACTCCTCGCCGCCTCCGACCTCTCCGGCCTCATCCAGGTCTTCGACGTCCGCTCCCG
CACCCCGCTCCGCCGCCTCCGCTCCCACGCCCGCCCTGTCAGGTTCGTCCGCTACCCG
GTCCTCGACAAGCTCCACCTGGTGTCCGGGGGCGATGACGCCCTCGTCAAGTACTGGG
ACGTGGCTGGGGAGAGCGTGGTGTCGGAGCTTAGAGGGCATAAGGACTACGTCCGGTG
CGGGGACTGTTCGCCCGCCGACGCGAATTGCTTCGTCACCGGGTCTTATGATCACGTG
GTGAAGCTCTGGGATGTGAGGGTGAGAGATGGGAACAGGGCGGCGACGGAGGTGAAT
CACGGGTCGCCGGTGCAGGATGTGATCTTCTTGCCGTCGGGGAGTTTGGTCGCCACTG
CGGGAGGGAATAGCGTGAAGATTTGGGACTTGATTGGAGGAGGGAGGATGGTTTATTC
GATGGAGAGTCACAACAAGACCGTGACCTCCATTTGTGTGGGTACGATGGGGGCGCAG
CAGAGCGGGGAAGAAGGTGTGCAGCTGAGGATCTTGAGCGTTGGTCTTGACGGGTATA
TGAAGGTGTTTGATTATTCTAGAATGAAGGTCACACATTCGATGAGGTTCCCGGCACCTC
TGCTGTCGATCGGGTTTTCGCCAGATAGCAACGTGAGAGCCATTGGGACTTCGAATGGT
ATTTTGTATGTGGGAAAGAGAAAGGCGAAGGAAAATGCGGAGGGTGGTGCCAATGGAAT
CTTAGGGTTGGGCAGTGTGGAGGAGCCGCGGAGGCGGGTCTTGAAGCCCTCGTTCTAT
AGGTACTTCCACAGAGGTCAGAGTGAGAAGCCATCCGAGGGAGATTATTTGGTTATGAG
GCCGAAGAAGGTGAAGTTGGCTGAGCATGATAAACTCTTGAAGAAGTTCCAGCATAAGA
ATGCTCTCATTTCAGTTTTAGGTGGAAATGATCCTGAAAAGGTGGTGGCTGTGATGGAG
GAATTGGTGGCTCGGAGGGCACTGCTAAAGTGCGTCCTGAACTTGGATGCGGATGAATT
GGGTCTGATTTTGACGTTCTTGCATAAGAACTCGACTGTGCCTAGATATTCAAGCTTGTT
GCTGGGGCTGGCGAAGAAGGTTATTGACTTGAGGCTCGAAGACATAAGAGCATCCGAT
GCCTTGAAGGGTCATATTAGGAACCTCAAGCGCTCAGTTGATGAGGAGATTCGGATACA
AGAGGGGTTGCAAGAGATTCAGGGTATGGTATCTCCTTTACTAAGGATTGCAGGCAGAA
GATAGCGATAGAGTTATACTGCATGTACTGAGGTAAATGTTTTGATTACTCCACCCAATT
GCGTTGGTTCTCTACTTTTTCCCCTTGAGGGATGAAATATGAGGAGATTCGGATACAAGA
GGGGTTGCAAGAGATTCAGGGTATGGTATCTCCTTTACTAAGGATTGCAGGCAGAAGAT
AGCGATAGAGTTATACTGCATGTACTGAGGTAAATGTTTTGATTACTCCACCCAATTGCG
TTGGTTCTCTACTTTTTCCCCTTGAGGGATGAAATTTTGCTGCAATGTATGAGTTTCACTA
AATTTATGGAACACTTATGTTTTTATACGGAGATGGTTGGAGTTGAAGGTCCATCTTCCG
GGTTTTATCTGTAACTGGATGCTCAAGTAAAACAATTTTTTTTCCCTTTAAAAAAAAAA
SEQ ID NO:84
CGAGTTCAGCTCAAGCGCCATCTCCGATCGCCGGTCACCGCATCGATGCAGGGAGGAT
CGTCGGGCGTCGGCTATGGCCTCAAGTACCAGGCCAGGTGCATCTCCGACGTGAAGGC
CGACACCGACCACACCAGCTTCCTCACCGGAACCCTCAGCCTCAAAGAAGAGAACGAG
GCCATTTGCTGAGGCTCTCGTCGGGCGGCACGGAGCTGATCTGCGAGGGCTTGTTCT
CGCACCCGAGCGAGATTTGGGACCTCTCTTCCTGTCCCTTCGATCAGCGCATCTTCTCT
ACTGTTTTCTCCACCGGTGAATCTTATGGAGCTGCTGTGTGGCAGATTCCTGAGTTATAT
GGACAGTTAAATTCTCCTCAGTTGGAAAAAATTGCCTCGCTTGATGCTCATTCACGCAAG
ATCAGTTGTGTTCTTTGGTGGCCGTCTGGAAGGCATGACAAGTTGGTTAGCATTGATGA
GGAAAACATCTTCTTATGGGGTTTAGATTGTTCGAAAAAGTCGGCTCAGGTCCAATCACA
GGAGTCTGCTGGCATGCTGCACAACCTCTCTGGTGGAGCGTGGGATCCACATGATGTAA
ATACTGTCGCTGCAACCTGTGAATCATCAATCCAATTTTGGGATCTTCGGACTATGAAGA
AAGCAAATTCACTAGAATCTGTCCATGCTCGTGACCTAGACTATGACATGCGAAAGAAGC
ACTTACTTGTTACCTCCGAGGATGAATCTGGTGTACGTGTATGGGATCTTAGAATGCCTA
AAGCTCCTATTCAAGAGTTTCCTGGTCACACACACTGGACTTGGGCTGTTAGGTGTAATC
CTGATTACGAAGGACTGATTCTGAGTGCAGGTACAGACTCAGCTGTAAATTTGTGGTGGT
CATCTACTGCGAGCAGTGATGAGCTGATATCTGAGAGGCTAATTGACTCGCCTACCAGA
AAGCTTGATCCATTGCTCCATTCATACAATGACTATGAAGACAGTGTTTATGGCCTGGCA
TGGAGTTCACGAGAACCTTGGATATTTGCATCGTTGTCATACGATGGGAGGGTGGTTGT
AGAATCAGTGAAGCCCTTCCTGTCCAGAAAATGACTTCAGGGACACTAAATTAGGTTCAT
TGCCCCTTTGCGGTCCTGGCCTTTTTACATTTGGAGATGCTACGAAGTATTTCTTCTTGC
TTTTTCGGTTGGTTGGGGACTTATAAATTCTCATTATAGATTTTTTCTCCAATCATTGATT
TTGAACAAGGTATGGTTGAGAGATCATGTTTCTTCTTCCCAGCCCTGGAAAGCTGTTTTA
TCTTTGGATCATTCGAGTGAAACTTCCAACCGATTAAGGCAAGAATTGTTAGGAGGTGTA
TACTTTCTGTAACTGTATTCAATGAGCATACACCTGACGGAAAAAAAAAAATCATCGAATC
ACTGTAAAACTTCCTTCACAAATTTTTAAAAAAAAAA
SEQ ID NO:85
GGGAGAGAGAGAATCCAGGTCCATGGCGGAAGAGGAGGGAAGCGCGGAGCTGGAGCA
GCAGCTGGAGGAAGAGTTCGCGGTGTGGAAGAAGAACACCCCGATCCTCTACGACCTC
CTCATCTCCCACGCCCTCGAGTGGCCCTCCCTCACCGTCCACTGGGCCCCCCTCCTTCC
CCAGCCCTCCTCCTCCGCCGCCGCCGCCGCCGGCGACCCTTCCCTCGCCGCCCACCG
CCTCGTCCTCGGCACCCACACCTCCGACGGCGCCCCCAACTTCCTCATCCTCGCCGAC
GCCCTCCTCCCTTCTTCCGAATCCGATCATTGCGGCGACGACGCCGTGCTCCCCAAGGT
GGAGATATCGCAGAAGATTCGCGTGGATGGGGAAGTGAATAGGGCGCGTTTCATGCCA
CAAAATCACAATATTGTGGGCGCTAAGACAAATGGTTGCGAGGTTTATGTTTTTGATTGTT
CTAAGCAGGCTGCGAAGCAGCACGATGGTGGGTTTGATCCGGATTTGAGGTTGACGGG
TCATGATGGAGAAGGCTATGGATTGTCCTGGAGCCCCCTGAAAGAAAACTACCTTTTAAG
CGCGTCCCATGATAAAAAAATTTGTCTCTGGGATATATCTGCTGCCGCTCAGGATAAAGT
GCTCGGTGCAATGCATGTTTTCGAGGCTCACGAGGGTGCGGTCGGCGATGCATCTTGG
CACTCGAAGAATGACAACTTATTTGGGTCTGCTGGTGATGATTGTCAATTGATGATCTGG
GACTTGCGAACGAATAAAGCACAACAATGTGTCAAAGCACATGAGAAAGAGGTGAATTCT
GTATCTTTCAACTCATATAATGACTGGATTCTGGCAACCGCGTCTTCAGACACAACAGTT
GGACTATTTGACATGCGGAAGCTGACCACGCCATTGCATGTCTTTAGTAGCCATGAAGG
AGAGGTCTTGCAAGTAGAATGGGATCCTAATCACGAGGCTGTGTTAGCTTCTTCCTCAGA
GGACAGAAGGGTGATGGTCTGGGACCTCAACAGGATCGGAGATGAACAGCAAGAAGGA
GATGCAAGTGATGGCCCTGCCGAACTGCTCTTTTCTCATGGGGGTCATAAAGCTAAGAT
CTCAGATTTCTCATGGAACAAAAACGAACCATGGGTTATCTCAAGTGTGGCTGAAGACAA
CTCTGTTCAGGTTTGGCAAATGGCTGAGAGTATCTGTGGAGATGACGACGACATGCAGG
CCATGGAAGGATATATATGAGATTTGGTACAATTGCTTCATCGAAATTTCTCGCTCTGGC
TTTGTTTCGGGGCCAACAGAGAATTTTTGCTGAATTTACTAATTCTACAATGCTTCACGAA
GCATTTTCTTTCAGTCCTTTTAGCAGCTAAGATGGAAGATAAACTCACTGGCATGATCCG
TTTGGTGTAAATCAGCTTATTTGCGAAAAAAAAAA
SEQ ID NO:86
AGAGAGGGAGACTTCCACCAAAAGGACAAACTAGAAGAATCACCTCCTCCAACCTCCGG
GACCTTACCTTCTCCTCCCTCCTCCTTGAGAGCACCATCGCTATTATCGAGGCCATTTTT
TTCGTTTGCCTCTTTTCCAATTCGTCTTCTCTCTCTCTCTCTCTCTCTCTCTGTGTTCTTTT
TTTATATGTTTAAATCTCGCCTTCATTTTTATCCCACTGTAATGGATTCCTTGCTTCCTCAT
GCGGGACTTCGGTAGACAGAGGAGGGAGAGGGAGAGAGGGAGAGGGAGCGGGGTGG
AATTTTCAGGAGTGGAAATCCCCAGGGCCGTTTCGATTTCTGGGTTTCGACCCGAATCC
CATTTTTGATTTCCAGGTTCGTATGGGTTGAAAGCTAAAAGGGTCATGGGGAATTACGGT
GAAGAGGACGAGGATCAGTATTTTGATGCCCTGGAGGAGACGGCCTCTGTTTCAGATCG
GGGTTCAAACAGCTCTGATTGTTGTAGTTCAGGTTCCGGGCTTGATGAGAACGTTTTGGA
TAGTTTAGGGTTCGAGTTTTGGACTAAATTTCCCGAGAGCGTGAGGGCGCGCCGCAATC
GGTTCTTGATGTTGACGGGGTTGGGAATAGAGGCAAATTCAGTGGACAAGGAGGATGCA
TTCCCGCCATCTTGTAACGAAATCGAGGTGTATACGTGTAAAGTAACAAGAGATGATGGG
GCGGTTCAAAGATCTTTGGATTCATATAATTGTATTTCACTGTTGCAGTCATCCACATCCA
TTCGGTCGAATCAAGAAGTTGAATCATTGCGTGGGGACTCGCTCCTCAGCAGTTTCAGG
GGTAGAAGTAAGGAGTCTGATGATTTGACGGAATTGTGTGGGATGGGTTGTCCAGAGTC
AAAGAGAAATGCGGTGAGTGAATTTGGTTCAGTAAGTCAGGGAAGCATTGAAGAACTGA
GGAGAATAGTTGCATCTTCTCCTTTGGTTCATCCGCTGTTGCATAGGAAATTAGAGTATG
AAAGGGAATTGATTGAAACCAAACAAAAAATGGGAGCTGGTTGGTTGAGGAAATTTGGCT
CTGCCACATGCATTAGTGGAAGACAGGGTGACACATGGTCAGACCCTGATGATCTTGAA
ATAACGGCAGGGATGAAAATGCGGAGAGTTCGTGCACATTCATCTAAGAAGAAATACAA
GGAATTGTCTTCCCTTTATGCTGCACAGGAATTTCTGGCACATGAGGGGTCAATTTCGAC
AATGAAGTTCAGTATGGATGGGCAATACTTGGCAAGTGCAGGTGAAGACACAGTAGTGA
GAGTTTGGAAGGTGACTGAGGAGGATAGATCAGAGAGGGTCAACGTTACTGTGGACCCT
TCCTGTTTATATTTTGCTCTGAATGAATCCACGCAGTTGGCGTCGCTCAATACGAATAAA
GAGCACATTGGTAAAGCAAAAACATTTCAGAGATCATCAGACTCATCGTGTGTCATTTTA
CCCTTAAAGGTCTTCCAGATAACAGAGAAGCCTTGGCACGAGTTCAAGGGACATAATGG
CGAGGTCTTAGACCTCTCCTGGTCTAGTAAAGGGTATTTGCTATCCTCCTCTACTGATAA
GACCGTGCGGTTGTGGAGGGTTGGATGTGATAGATGCCAGAGAGTATATTCTCACAATG
ATTATGTTACTTGTATAAGTTTCAACCCTGTTAATGAGAACTTCTTCATCAGCGGGTCCAT
AGATGGAAAAGTACGCATCTGGAATGTGTTTGGTGGTCAGGTTGTTGCTTACATCGATTG
TAGGGAGATTGTCTCTGCAGTGTGCTATCGATCCGATGGAAAGGGAGCCATCGTGGGTA
CCATGACAGGCAATTGCCTTTTTTATAGTATTAAAGATAATCATTTGCAAATGGATGCACA
AGTATACCTACATGGTAAGAAGAAGTCTCCTGGCAAGAGAATAACTGGCTTTCAGTTTCC
TCCCAATGATCCTGGCAAACTAATGATCACATCTGCTGATTCTGTAATTCGAGTATTAAGC
GGGCTAGATGTAGTCTGCAAACTCAAAGGCCCTCGAAATTCAGGGGGACCAATGATTGC
CACTTTCACTTCAGACGGGAAACACGTGATCTCAGCAAGTGAAGATTCAAATGTCTACAT
CTGGAACTACGCCGGTCAGGACAAGACTTCATCCCGGGTGAAGAAGATATGGTCTTGCG
AGAGTTTCTGGTCCAGCAATGCCTCTGTTGCTTTACCCTGGTGCGGCATAAGAACTGTG
CCTGAAGCACTTGCACCTCCTTCACGAAGTGAAGAGAGAAGAGCGAGTTGTGCAGAAAA
CGGGGAGAATCATCATATGCTGGAGGAATATTTCCAGAAAATGCCCCCATATTCCCCGG
ACTGTTTCTCTCTCAGCCGCGGATTCTTCCTGGAGCTCTTGCCCAAGGGATCCGCGACT
TGGCCGGAGGAGAAATTATCTGATACTAGCCCACCTACGGTCTCATCTCAAGCAATTTCT
AAACTGGAGTACAAGTTCCTGAAGAGCGCGTGCCACAGCGTGTTGAGTTCTGCTCACAT
GTGGGGCCTGGTGATCGTGACTGCAGGTTGGGACGGAAGGATCAGGACATATCACAAC
TATGGATTACCTGTACGTTCTTGAAATCCCGATTCACTAAAATTATGAAGGCGGTGAGTT
CAGTCGTCCGCCTTTTCTTAGTTTTACCCTACGAATACATGGTGATGCTCACGCTCAAGG
GGGGTTTGCCTCAAGTGTAAAAGGATGCCCCTAATAGATTATATGCCAAGTGTAGTATAT
ATAATAGTGCTTTTGTCACACACCAAAAAAAAAAAA
SEQ ID NO:87
GAGAAGTAAAGCCTTGACGCTGGTTTTTGGTCAGCTCCGTTTCCCTCACTCATCAACTC
TCTCCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCGTGCGCCATGGACATCGACTTC
AAGGAGTACCGACTTCGCTGCGAGCTGCGCGGCCACGAGGACGATGTCCGGGGCGTAT
GCGTGTGCGGGGACGGCAGCATCGGGACCTCGTCGCGGGATCGGACGGTGAGGCTGT
GGGCTCCGAGCGCCGGCGAGAGGCGCAAGTACGAGGTGGCGAGGGTGCTGTTAGGGC
ACAAGAGCTTCGTGGGTCCCCTGGCGTGGGTTCCGCCCACCGAGGAGCTTCCGGAGG
GCGGGATCGTGTCCGGCGGGATGGACACTCTCGTGATGGCTTGGGATTTGAGGAATGG
AGAGGCGCAGACGTTGAAGGGCCATCAGTTGCAGGTCACCGGCATCGTGTTGGACGGC
GGCGACATTGTTTCTGCCTCTGTTGATTGTACCTTAATAAGATGGAAGAATGGCCAGCTT
ACGGAGCACTGGGAGGCTCATAAGGCCCCCATACAAGCAGTCATAAGGTTGCCCTCCG
GGGAGCTTGTTACAGGTTCAAGTGACACAACCTTAAAACTCTGGAGAGGAAAGACGTGC
ACTCAGACTTTTGTTGGGCATACAGATACCGTTCGAGGCTTGGCAGTGATGCCTGATCT
GGGCATTCTATCTGCATCACATGATGGGTCTATCAGGTTATGGGCAGTGAGCGGTGAAT
GTCTAATGGAGATGGTTGATCATACTTCTATAGTCTATTCTGTTGATTCACATGCTTCGGG
CCTCATAGTCAGTGGTAGTGAAGATCGTTTTGCCAAAATATGGAAAGATGGAGTTTGCTT
TCAGAGCATTGAGCATCCTGGTTGTGTGTGGGACGTCAAATTTTTGGAAGATGGAGACA
TCGTGACAGCATGTTCTGATGGGACCATACGTATATGGACAAATCAAGAAGATAGGATG
GCTAACTCAACTGAGCTCGAGTTATTTGATTTAGAACTATCTTCTTACAAGAGAAGCAGG
AAAAGGGTTGGAGGACTGAAATTGGAAGAGTTGCCCGGGTTAGAGGCTTTACAAGTGCC
AGGAACTAGTGATGGCCAGACAAAAGTAATCAGAGAAGGAGACAATGGGGTGGCATATG
CATGGAATTCAACTGAACTAAAATGGGATAAGATTGGTGAAGTAGTTGATGGACCAGAAG
ATAGCATGAACCGCCCAGCCTTAGATGGTGTTCAATATGATTATGTATTTGATGTTGATAT
TGGAGATGGTGAGCCTACACGTAAGTTGCCCTACAATCGATCAGATAATCCATACGACA
CTGCCGATAAGTGGCTTCTCAAGGAGAACCTTCCTCTTTCCTATCGCCAACAGATAGTGG
AGTTTATACTTGCAAATTCTGGGCAGAGGGACTTCAATCTTGATCCATCATTTCGTGATC
CCTATACTGGTTCCAGTGCATATGTTCCCGGCGCACCTTCACAGCTGGCAGCTAAACAA
GCGAGACCTACTTTCAAGCATATACCCAAGAAAGGAATGCTGGTCTTTGATGCGGCTCA
GTTTGATGGAATTCTTAAAAAGATCAATGAGTTTAATAATACTTTGCTCTCTAATCAGGAA
AAGAAGAACTTATCATTGACAGATATAGAGATTTCCAGATTGGGAGCAGTTGTCAAAATTT
TAAAGGACACGTCACATTATCATTCTAGCAAATTTGCCGATGCTGACTTTGATTTGATGTT
GAAGTTGCTAGAATCATGGCCATATGAAATGATGTTTCCTGTTATTGATATTTTCAGGATG
GTAATCCTGCATCCAGATGGGGCAGATGGACTTCTGAGGCATCAAGAGGACAAGAAGGA
TGTTCTTATGGAATCAATCAAGAGAGCTACTGGAAATCCTTCAGTTCCTGCAAATTTCTTA
ACTAGCATCCGGGCTGTGACTAATCTATTCAAGAATTCAGCATACTACAGCTGGTTGCAG
AAGCATCGTAGTGAGATGCTTGATGCGTTTTCGAGCTGCTCTTCTTCTTCAAACAAGAAT
CTACAGTTGTCTTATGCTACTCTATTACTCAATTATGCTGTGCTATTGATTGAGAAGAAGG
ATGAAGAAGGTCAATCTCAAGTTCTTTCAGCAGCACTTGAGCTTGCAGAAAATGAATCTT
TAGAAGTTGATGCCAGATATCGAGCTTTAGTAGCTATTGGATCATTGATGCTTGATGGTC
TTGTTAAAAGAATCGCTTTGGATTTTGATGTTGAGCACATTGCCAAAGCTGCAAGGACTT
CTAAAGAAGCCAAGATTGCTGAAGTAGGAGCAGATATCGAACTCCTTATAAAGCAGAGTT
GAATGACTGTTATTTGAGCTGTGCAAATATTCAACCGCTCGTGATTCCGTCGTGGGACTG
CACACCTGTTATACAGAAGTCTTTCAGAACAGCATTCATGCTACCGTGGAGGAACAATGG
GTCACTTGGATTGAGAGATGGCCACATTTGGAAGGCATTGATTGGCCTTAAAGTATCATT
GTTGGGAATCGTATACTGTGTAGGAGATTACATGATAACAGATTGAGATTGCACAGGTCT
ACGATTTTCTTTCTGCCAGCAAAAGCTGAGACTTTTGTATGGACTAAAAACTCGCCATAA
GAAGCATGGAAACGCGGGGGGGCGGAAAACCAGTGCCGTGTGCCCTGCAATGCATACG
TCTTTCACTGTAAAAAAAAAA
SEQ ID NO:88
CAAAAGTTGCAGCTTCCTGCAAATGGAATGGCCGAATCAGCTGCGAGGAAGCAAGCTCT
CGCGGCCACCTTCGATGCATCAGACCGAGCCGTAGTCGGAGATCCACTCGCCTCCGAC
ACCTCCCGGCACGCCTCGATTCTCCTCGCGTGCCCCCGATCCCCTCTTCAATCGTGAAT
CGGGAGCTCGGGAACTTGCTCTTCCTTCGTCTGTTGCCGCGTGATTCGGGGTTGCGTTT
CAGCGGAATGGAGTTCACGGAAGCGTACAAGCAGTCGGGCCCTTGCTGCTTCTCTCCC
AACGCGCGCTTCATTGCCGTCGCTGTCGATTATCGCCTCGTCATACGCGACACATTGTC
GCTCAAGGTTGTGCAGTTGTTTTCATGCTTGGATAAAATAAGCTACATAGAATGGGCCCT
TGATTCTGAATACATACTTTGTGGTCTCTATAAAAGACCAATGATACAGGCATGGTCATTA
ATTCAACCTGAATGGACATGTAAAATAGATGAAGGTCCAGCTGGCATTGCTTATGCTAGG
TGGAGCCCGGATAGTCGGCACATACTGACAACATCTGATTTCCAACTCCGCTTGACAGT
TTGGTCATTGGTCAACACAGCATGTGTACACGTGCAGTGGCCAAAGCATGCTTCCAAGG
GGGTGTCTTTCACTCGAGATGGAAAGTTCGCTGCAATATGCACAAGACATGATTGCAAG
GACTACATCAATCTGCTTTCTTGTCATAATTGGGAGATAATGGGTGTTTTTGCTGTTGATA
CTTTAGACTTAGCTGATATCCAGTGGTCACCGGATGATAGTGCTATAGTGATATGGGATT
CACCTCTTGAATACAAGGTTTTGGTATACTCACCAGATGGTAGGTGTCTGTTTAAGTATC
AAGCCTATGAAAGTGGATTGGGAGTGAAAAGCGTTTCTTGGTCTCCTTGTGGACAATTTC
TAGCCGTTGGTAGCTATGACCAGATGTTAAGGGTCTTGAGCCACCTTACTTGGAAAACTT
TTGCAGAATTCACGCATCTATCTAATGTTCGTGCTCCATGTTGTGCTGCCATCTTTAAGG
AGGTGGATGAGCCCTTGCAAATTGACATGTCTGAATTGTCTTTAAGTGATGATTATATGC
AAGGCAATTCTGGAGATGCTCCAGAGGGACATTACAGAGTCAGATATGATGTTACAGAA
GTTCCAATTACTTTGCCTTGCCAGAAGCCTCCAGCGGACAGACCGAATCCCAAACAAGG
AATTGGTCTGATGTCATGGAGCAATGACAGCCAGTATATATGCACTCGCAATGATAGTAT
GCCGACCATTCTTTGGATCTGGGACATGCGCCATCTTGAACTTGCTGCCATCTTGGTTCA
AAAGGATCCTATCCGAGCTGCAGTTTGGGACCCTACAGGCACTCGCCTAGTCCTTTGCA
CAGGAAGCTCGCACCTGTACATGTGGACTCCTTCAGGTGCCTACTGTGTCAGCGTCCCC
CTATCTCAGTTTAATATAACAGATTTGAAGTGGAATTCGGATGGGAGCTGTCTTTTGCTCA
AGGACAAGGAGTCATTTTGCTGTGCTGCAGCACCACTGCCACCAGACGAATCTAGTGAT
TATAGCTCAGATGATTGAGGCATCGATGTGTACTCTCCAGAGTGACATGGACTGCAAACT
GCAGTTGTTCGTAGGACTGTATATTTAGGCATGTTAGATCCACCGGATAGAGAATAGAGA
TTGACTTTTTGGTGTTGTATAATTCTTTACAGTTGGGATGATTAGCTTCTGGGAGGTAAAT
TGATGACAATCGATTGGCCGCTCCTCATCTTTTCCTTTTGAACTGTAGAAAGCCGGACTT
TTGGGTAATGGTCAATCTGTTTGCCTGTGTTCGATAGAAGGGCAAAAATCTGTACGCTGA
TATGAATCGCGGTGAGAGCC
SEQ ID NO:89
GTCACCTCGAAGTGCTGCGCTCAAACTAAACTCCCCCAGGCCACCGTCGCCGCGTCTC
CTCTCGTTCGCTCGCGCGGGGGGCGCCCACCAGAGCAGCCGCGGAAGAACGCGATCG
ATCGACGGCGATGGCAACGATCGCGGCGCTTGACGACGACATGGTCCGCAGCATGTCG
ATCGGAGCCGTTTTCTCCGACTTCGTTGGGAAATTAAATTCGCTTGATTTTCACCGTAAG
GATGATATTTTGGTCACGGCCGGCGAGGATGATTCAGTGAGATTGTATGATATTGCAAAT
GCCAGGTTGCTCAAGACCACCTTTCACAAGAAACATGGCACTGATCGTGTGTGCTTTACT
CATCATCCAAACTCCCTCATATGCTCTTCAACCAAAAACTTAGATACTGGAGAGTCCTTAA
GATATATATCAATGTATGATAACCGAAGCCTCCGCTACTTTAAGGGACACAAACAGAGAG
TTGTTTCCCTGTGCATGTCTCCAATCAATGATAGCTTCATGTCAGGCTCTCTTGACCACA
GCGTGAGAATGTGGGATCTCCGTGTGAATGCTTGCCAGGGAATTCTGCGTCTACGTGGC
AGACCTACTGTTGCATATGACCAGCAGGGTCTTGTCTTTGCTGTAGCTATGGAAGGGGG
TGCTATCAAATTGTTTGATTCACGCTCTTACGACAAGGGCCCCTTTGATGCCTTTTTAGTT
GGTGGAGATACATCTGAGGTCTGTGATATCAAGTTCAGCAATGATGGAAAATCAGTGCTA
TTATCAACCACAAACAACAACATCTATGTTCTCGATGCATATGCAGGAGATAAGCAATGT
GGATTCAATTTAGAACCATCTCCAAGTACACCGATAGAGGCAAGTTTTTCACCAGATGGC
CAGTACGTTGTATCAGGCTCAGGAGATGGAACTTTGCATGCATGGAATATCAGCAGGCG
AAATGAGGTAGCATGTTGGAATAGCCATATTGGCGTCGCATCATGCTTGAAATGGGCTC
CTCGTCGGGCCATGTTCGTAGCGGCATCTACTGTCCTCACTTTCTGGATTCCAAATTCCG
AGCCCGAGCTTGCTTCTGCCAAAGGCGAGGCCGGAGTTCCACCAGAACAAGTATAACTG
TGAAGCCTCCATTTACAGCTCTAGACCCTGCCATGAGCTTCTTTAAGCGAGCAATAGAGT
TAATAATATGTTGTAAATCTTCTCATGTGCCTGGCGTAAATTTTGCAGTTATTACTAGACC
AAGATAGTTTCATGCTCCACAGAACAAATTGACACCCCCACCTGTACTTTAAAGTCACAA
TTTTCAAAAGCTTCTGTTTTCTTAAAAAAAAAA
SEQ ID NO:90
GCTTCGTCGAGCTGTCCAGACGCGTCCCAAAATCAGTTGCTCTCGATTCTCCTCCCGCG
ATCTCCTTCCCTTCCTTCCCTTCCCCCGTCTGTACATCTCGAAGCCCTAACCTCCGAAGC
TCCTTCCGATGCACCGCCGCCACCGCCGCCGTCATCACCGCCGCGACTGCTCCCGCCG
ATGAGCCGAGCCGGGCGATGACGGCTCCTGCCTAGGGGAAGAGGGAAACACTCTCGC
GCGCGCGCCCCCGACGACGACGATGTCCGTGGCGGAGCTGAAGGAGCGCCACAGGGC
CGCGACCGAGACCGTCAATTCTCTCCGGGAGCGCCTCAAGCAGAAGCGAGTGCAGCTG
CTCGACACCGACGTGGCCGGGTACGCGAGGACTCAGGGGAAAACTCCGGTCACTTTTG
GGGCGACGGATCTGGTGTGCTGCAGGACCTTGCAGGGCCACACTGGCAAGGTGTACTC
ACTAGATTGGACTCCTGAAAGGAATCGCATTGTCAGCGTGTCTCAAGATGGGAGATTTAT
AGTGTGGAATGCTCTGACCAGTCAGAAAACTCATGCCATAAGGCTTCCTTGTGCATGGG
TAATGACATGTGCTTTTGCACCGAATGGTCAGTCTGTTGCCTGTGGCGGTCTTGACAGC
GTATGCTCTATCTTCAACCTCAATTCCCCCGTTGACAGGGATGGGAACCTACCGGTATCA
AGAATGCTCAGTGGGCATAAAGGTTATGTGTCATCATGTCAGTATGTACCAGACGGAGAT
GCTCACCTGATTACAGGATCTGGTGACCAAACATGTGTCTTGTGGGATATTACCACGGG
TCTGAGAACTTCTGTCTTTGGAGGAGAATTTCAATCAGGGCACACTGCTGATGTTTTAAG
TGTGTCAATCAATGGATCGAGCCCAAGAATATTTGTTTCTGGTTCATGTGATTCAACTGCT
AGGATGTGGGATACTCGTGTTGCAAGTCGAGCCGTTCATACATATCATGGACATGAAGG
CGATGTAAATGCCGTAAAGTTCTTTCCAGATGGAAATAGATTTGGAACTGGGTCTGATGA
CGGTACTTGCAGGCTCTTTGACATCAGAACAGGGCATGAACTTCAAGTGTACTATCAACA
GCGTGGCATCGATGAGATCCCACATGTCACCTCCATTGCATTTTCCATTTCGGGGAGGC
TGCTAATTGCTGGATACTCAAATGGTGACTGCTTTGTGTGGGATACATTATTGGCCCAGG
TTGTGTTGAACTTGGGATCGTTGCAGAACTCACACGAAGGTCGGATCAGCTGTTTGGGT
GTGTCTGCTGATGGAAGTGCCTTGTGTACTGGCAGTTGGGATACAAACCTAAAGATTTG
GGCTTTTGGAGGGATTCGGAGGGTGACTTAATTTCTGCAGGAGCCGAACCCTTTGATTT
CTTCGAGGTCTCTCATTTCATGTTACTGCGTCCGTTTCGTTACATGCCCATAATTTGTGGA
AAAATAGATTGAGAAGCCCTATCCTGTAATTGCATCGCTGTCATTCTCATGATGAAAAAG
AAAATGACATGGATTCGATCAATCGCCACATGACAACTAAAACAAGCGGTTCACGTGATT
GTAATTTTATCCTCATTGCTTCTGTGTGCTTGCTTTGTTCCTAACGTCATTCTCATGAACG
TCTTGTAAAAAAAAAA
SEQ ID NO:91
GCGGCGAGAGCATCGAGCACTGACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCT
CTTCTTCATCACTTGTTCGACTCGGATTGGTCGATCAGCCCTCCCGTCGCGCCGCCGAC
CGTCCCTCGCTCCCCTCCGTTCGCGCGCTCCACCGGCTGATCGGGTCGTCCCGCGGG
GATGAAGAAGAGGCCGAGAGGAGCAAGCCTCGATCAGGCGGTCGTCGATATACGGCGG
CGAGAGGTCGGCGGCCTCTCCGGTCTGAGCTTCGCTCGCCGCCTCGCCGCTTCCGAG
GGTCTTGTGTTGAGGCTCGATATCTATAATAAACTGAAAGGGCATAGGGGATGTGTCAAC
ACTGTCGGTTTCAATCTTGATGGTGACATAGTAATATCGGGCTCGGACGACAGGCATGT
GAAGCTCTGGGATTGGCAAACCGGGAAAGTCAAGTTGTCATTCGATTCGGGTCATCTAA
GCAATGTCTTCCAAGCCAAGATCATGCCATACACTGATGATAGGAGCATTGTCACTTGTG
CAGCTGATGGACAGGCAAGGCATGCTCAGATTCTGGAGGGGGGACAAGTTCAGACAAT
GTTGTTAGCCAAGCATCGGGGAAGGGCTCATAAGTTGGCTATAGATCCTGGAAGTCCGC
ATATAGTTTACACATGTGGTGAAGATGGATTAGTTCAACGTCTTGATCTCAGGAGTAATA
CTGCCAGAGAACTTTTCACTTGCCGAGAAGTATATGGGACTCATGTGGAAGTTGTTCATT
TGAATGCAATTGCAATCGATCCAAGGAACCCAAATCTATTTGTGATTGGTGGGTCAGATG
AGTATGCTCGGGTGTATGATATTCGCAATTACAAGTGGAATGGATCGCATAATTTTGGTC
GATCTGCTAACTACTTTTGTCCTTCTCATCTCATTGGTGAGGCGCATGTGGGAATAACAG
GCTTAGCCTTCTCTGGTCAGAGTGAACTCCTCGTCTCTTACAATGATGAATCCATCTATC
TCTTCACCCAGGAGATGGGACTGGGTCCTGATCCGCTCTCTGCCTCCACCAAGTCCGTG
GACAGTAATTCCAGTGAGGTGACATCTCCCACAGCTGTGAATGTTGATGACAATGTTACC
CCTCAAGTCTACAAAGGGCACAGGAACTGTGAGACAGTGAAGGGTGTTGGCTTTTTTGG
GCCTAAATGTGAATATGTGGTGAGCGGGTCTGACTGTGGCCGCATATTCATTTGGAAGA
AAAAAGGGGGACAGCTTATCCGTGTCATGGCAGCTGATAAACACGTTGTAAACTGTATTG
AGCCTCATCCTCATATCCCTGCTCTGGCTAGTAGCGGAATAGAAAATGACATCAAAATCT
GGACTCCCAAGGCTATTGAAAGAGCTACTCTACCAATGAATGTCGAACAGCTAAAGCCA
AAGGCTAGGGGGTGGATGAACAGAATATCGTCGCCCCGGCAACTGTTGTTGCAGCTATA
TTCCCTGGAAAGGTGGCCGGAGCACGGTGGAGAGACTTCATCTGGCTTGGCTGCTGGG
CAGGAGGAACTCACGGAACTCTTTTTCGCACTGAGCGCTAACGGCAACGGGTCTCCGG
ATGGTGGTGGTGATCCTTCTGGTCCACTCCTTTAGGCGACTGATGTGGTGCGTTCAGAA
GACGACTAGCAACTGTGCATAATAATGTAGGGCAAAACCGTTTTGCTTCCCCATCCACAC
TGTTTTTTTCTGTTTCTTCTTCGGCTTTCTTCGGCTTCTCTTCCTCCTCTTCATGGATGTAT
ATGTTCATCCACATAACTTGTTCATGCTTGTTTATATTCTTTTATTTATTTTTCTCCTCCTTT
TCATGGATTAAAAAAAAAA
SEQ ID NO:92
GAAGACTGTCTTCCTCAAGAAAAAGCAAAGCGGGGAATCAAGAAAGCAAGGCCTTCCCG
TTCTTGGCCAGCACTGTCGAAATCTCACGCGCGAATTTCTCGCGAATCCGTCGTCCTTC
CTTCGTCCCTTCCGAATCTGTGAATCCGATCCCCATCTTTCGTCTTGTCCGGAAGAGCGT
GAGTTTGTTGTTCAGAGCTTCCTTCGGGCTGGAGGAAAGTCTCGGGTTCCGGCTGCTTA
ATCTTGGGGGTTCTGGGTTCTGGGTTTTGGCTCCCGAGTGGCTGGGAGGAAGATGTCC
AAGCGGGGCTACAAGCTGCAGGAATTTGTGGCCCATTCGTCCAATGTCAACTGTCTCAG
TATTGGAAAGAAGGCATGCCGGCTTTTTCTCACTGGTGGAGATGACTGCAAGGTCAATC
TATGGGCCATCGGCAAGCCGAATTCCTTAATGAGCCTTTGTGGTCATACAAATGCTGTAG
AGTCCGTAGCCTTTGATTCTGCAGAAGTGTTGGTGCTTGCTGGAGCTTCTTCTGGTGTCA
TTAAGCTGTGGGACGTGGAAGAAGCAAAGATGGTTCGTGGTCTTACTGGACACAGATCC
AATTGCACCGCTATGGAATTCCATCCATTTGGAGAGTTCTTTGCATCTGGTTCCACTGAC
ACAAATCTGAAGATATGGGATATCAGGAAGAAGGGATGTATACACACGTACAAGGGTCA
TACTCGAGGCATTAGCACCATCAGATTCTCTCCTGATGGTCGCTGGGTTGTTTCAGGGG
GAAATGATAATGTTGTGAAGGTGTGGGATCTAACTGCTGGAAAGCTTTTGCATGATTTTA
AGTTCCATGAAAATCATATCCGATCTATAGATTTCCATCCCTTGGAGTTCCTACTTGCTAC
AGGTTCGGCAGATAGAACGGTTAAATTCTGGGACTTGGAAACGTTTGAACTAATCGGATC
TTCCAGACCTGAGGCCGCAGGAGTACGTGCAATTGCCTTCCATCCTGATGGGAGGACCT
TGTTCTGTGGTTTGGAGGATAGCTTAAAGGTTTACTCATGGGAGCCTGTAATCTGCCATG
ATGGTGTTGACATGGGATGGTCAACCCTTGCTGATCTTTGTATTCATGATGGAAAACTCT
TGGGTTGCTCATATTACCAAAGTTCTGTTGGTGTTTGGGTAGCGGATGCGTCGCTTATTG
AACCGTATGGAACTAATGTAAAGCCTCAGCAGAAGGATAGTGGGGATGATGAAATTGAA
CACCAAGAAAGTCGTCCCTCGGCTAAAGTTGGGACCACCATAAGATCAACTTCAATCATG
CGCTGCGCCTCTCCGGATTATGAAACCAAAGACATAAAGAATATATATGTGGATACCGCT
AGTGGTAATCCTGTTTCTTCACAGCGTGTTGGTACCACAAACTTTGCAAAAGTGACTCAA
CCGCTGGATTTTAACGACACTCCTAATTTGACACTACGGAGGCAGGGTTTGGTAACAGA
AACGCCAGATGGATTAAGTGGGCATGTTCCTAGTAAATCTATTACTCAACCGAAAGTTGT
TAGTAGGGACAGTCCTGATGGAAAAGACTCATCTCGAAGAGAATCTATTACTTTTTCAAG
GACTAAACCGGGCATGTTGCTCAGGCCTGCTCATTCGAGGAGGCCGTCGAGCACCAAA
TATGATGTCGACAGGTTATCAGCATGTGCTGAAATTGGAGTGCTTAGCAGTGCAAAAAGT
GGTTCTGAAAGTCTTGTAGATTCATTTTTGAACATCAAAGTTGCACCTGAAGATGGAGCA
AGAAATGGCTGTGAAGACAATCATTCGAGTGTCAAGAATGTCTCTGTGGAATCTGAGAAG
GTGCTACCACTGCAGACACCTAAAACAGAAAAGTGTGACCAAACTGTCGGTTTCAAAGAA
GAAATCAACTCTGTCAAGTTTGTCAATGGAGTTGCAGTTGTGCCAGGCAGGACACGCAC
TTTAGTTGAGAAGTTTGAAAAAAGAGAAAAATTGAACAGCACTGAAGATCAAACAATCAAT
ACACCTGAAAATCCCACGTTAGATAAAACCCCTCCTCCCTCTCTTGCAGAGAACGAGGAA
AAGAGTGACAGATTAAACATCGTTGAACGAAAGGCAACAAGAATGTCTTCTCATATGGTA
ACCGCTGAGGATAGAACACCTGTTACCCTTGTTGGCAGTCCTGAAGATCAGTCAACTGT
AATGGCTCCTCAAAGAGAATTACCAGCTGATGAATCAAGTAAAACTCCTCCATTGCCGGT
TGAGGATTTGGAGATTCATCATGGATCAAATGTCAGTGAAGATAAGGCGACTATTTTGTC
TTCTCAGACAGTATCAGAGGAGGATAGCAAACGCTCTACCCTTATTCGCAACTTCCGTAG
AAGAGACAGATTTAAAAGTACTGAAGGTCGATCCCCTGTAATGGCTACTCAAAGAAAACT
ACCAACAGATGAATCAGGCAAAACTTCTTCTTTGCCAATGGAGGATTTGGAAATCAAAGG
TGGATTGAATGTGAGTGAAGATAAGGCAACTAGTTTCTCTTCTCGTGCACCACCGAGGG
AGGATAGAGCACACTCCGCTCTTGTTCGCAATGTTCGAAAAAGAGACAAATTTAAAAGTA
CTAATGATACAATTACTGTAATGGTTCATCAAAGAGGACTATCGACAGATGAAGCATCAA
CTGTCTCAGTTGAGAGGGTTGAAAGGAGACAATTATCTAACAATGTGGAGAACCCGCTA
AATAATTTGCCCCCTCACTCAGTACCTCCAACGACCACGAGAGGGGAACCTCAATATGT
GGGGAGTGAGTCAGACTCTGTTAACCATGAGGATGTCACTGAACTTCTACTGGGAAATC
ATGAAGTTTTCTTAAGTACTCTCCGTTCTCGCTTGACAAAACTACAGGTAGTTTAGCTCTT
TTTGAATGTTTACGTGGTTGAGTAAGGCTAATTTGTATATTGCATGTCCCTGTATACTTTC
GTTTTGAATGTTGGCCTCCTTACACTTTGGCTATTCCTCTAATAACTAGTCTTTTATTCCC
CACCACAATCGTACAACTTGCCAGGTAGGGTTAGCTAAGGGTTGCAATTATGTTGGCAC
CATTTCTAATACTTGAAAGACATAGTATTAGCAGAAAGTATTGCAGTTTCAGCAAAGTGTG
ATGACCAAATGCTTCTAAATTCTAATGCGTAAATCTAATCACCAAATGGATGGTTCAGAAG
TCATAGAAATGGCATTTTGACCGACTATATATTACTTTTCTTAGCTGCATTTCTACTTGGTA
GTTGCCTTGACAATTTTCTCTTGTATTTCCAGGTAGTCAGGCATTTCTTTGAAAGGAAAGA
TATGAAAGGTGCTATTAACGCATTGAGGAGGTTGCCGGATCATTCTGTTCAAGCAGATGT
AATCAGCATCCTTACGGAAAAAATGGAGATGCTGAACTTAGATTTATTTTCTTGCTTGCTT
CCTGTGCTTGTGGGTTTACTGGATAGCAAGGTAGAAAGGCATTCATGTGTGTCTCTCGA
GATGCTACTAAAGCTTGTAGCAGTTTTTGGTTCAGTAGTTCGGTCAACAGTTTCTGCACC
TCCGACTGTTGGTGTTGATCTACATGCTGAGCAAAGGCTAGAGTGCTGCAACCAGTGTT
TTACAGAGCTGCAAAAGATCCAGAAAATTATACCTATGCTTGTGAGGAGAGGTGGTGTAA
TTGCAAGAAGCGCTCAGGAATTGAATCTAGTCCTTCAACAGTGCTGATGGACCATACCAT
CTGCATCCCTTTTCTGCTTACCTGCAGAGTCCAGTTGGATGACATCTAACAGAATACTCG
TGTACATGGACACCGTCTGTCCTTAACTCGGAGTTTGGCTTTACGGCTACTGATTTTGCC
GCTTCCATCGATCAGTTTTGCTTGGTTTCTCATGAGGCAAATGCCCGAGCACTATGCGCA
TGGAAAATTAAGCTCGAAGACTCAAGAGGCCTTGTTCGGGGCATGACTCCATGATATTC
CGAAAGAGTAGGATCTCTCAATTCTTTGATTCTGTTGTATGGTGTATCTTATTGTATCTTC
TATCTGCCCCCCATGTAATTCTTCATGTGTACTTCTCGCTCCCGTGTAATTCATGTGTACT
TCTCGATGGTATAGTGAAAATCCACTTTGTTCAAAAAAAAAA
SEQ ID NO:93
CTGGACCTCAGTAGAAAGCACGAACCCGTCTTCTTCTCCCTCTTCTTCTTCTCCTCCTCC
TTCCATCGGCGAGCGAGAGAAGACGCAGAGAGAGAGAGAGAGAGAGAGAGAGAGAATC
GGAGGAACCTCCATGCGCATTGGCGAACGATGTCCACCTTCCTGACCGGCACCGCTCTT
TCCAATCCCAACCCGAACAAGTCGTACGAGGTCGTTCAACCTCCGAATGACTCCGTCTC
GAGCCTGAGCTTCAACCCCAAGGCCAACTTCTTGGTGGCCACTTCCTGGGACAACCAGG
TCAGGTGCTGGGAGATTGTGCGGAGTGGAACAAGCCTCGGCACTACGCCAAAGGCATC
CATATCTCATGACCAGCCAGTATTGTGCTCAACTTGGAAGGATGATGGAACAACTGTCTT
TTCTGGCGGCTGTGACAAGCAAGTCAAAATGTGGCCGTTGTCAGGTGGCCAACCAATGA
CTGTGGCCATGCATGACGCGCCAATAAAAGAGATTTCGTGGATACCTGAAATGAACCTTT
TAGTTACTGGAAGTTGGGACAAGACATTGAGATACTGGGACACGAGACAAGCCAATCCA
GTTCATATTCAACAATTGCCTGAGCGTTGCTATGCCCTTACTGTGAGACATCCCCTCATG
GTTGTTGGTACTGCAGATAGGAATCTGATAATCTACAATTTGCAAAGTCCTCAGACTGAA
TTCAAGAGAATCAGTTCGCCCCTCAAGTATCAGACCAGATGTCTCGCTGCCTTCCCTGAT
CAACAAGGCTTTCTGGTTGGCTCTATTGAAGGAAGAGTTGGGGTACATCATCTAGATGAT
TCACAACAAAGCAAGAATTTCACCTTCAAATGCCACAGAGAGGGCAGTGAGATATACTCA
GTCAATTCATTGAACTTCCATCCTGTACATCACACCTTCGCAACTGCTGGTTCTGACGGC
GCCTTCAATTTTTGGGACAAGGATAGCAAACAGAGACTGAAGGCAATGTCAAGGTGCAG
TCAGCCCATTCCCTGCAGTACCTTCAATAATGACGGGTCTATTTTCGCATACTCGGCGTG
CTATGATTGGAGTAAGGGTGCAGAGAACCATAATCCTGCAACAGCCAAAACTTACATTTT
CCTGCATTTACCACAGGAATCTGAGGTTAAAGGCAAACCACGACTGGGGACTACTGGAA
GAAAATGACTCCTAATTTCTAGTCGCAAAGCCCCATCATCATATTTCCAGTTGGAAGAGT
CGTATTCCCTGTCGGAAGGACGAGGGATTTGTACATGTATAGACTTTCCAGCTGCTTCTC
CCAAAAAATTTAGCAGTTGACCTGTTTTTGGCCGAAGTCAAGGATTTCGTTGTGTAGTAC
TGGGAGTTACTACTTGTATGTATGTAAATCATGTGGCGTCTGTCCAGTTACTATAGATGG
GAGTCCCCTGGGTGGATTTTGTGAAATATATTTTGGTGATCGGTTGTCTTTTCGTTGTATC
CACAAGACTACAGAAATTACATGCTAAAAAAAAAA
SEQ ID NO:94
CTTCCTTCCCTGTCACAAGACCCTGCCTTCCTTCTCTGTCTATCTACGCACTCCTGGCAT
CATCGTATCGATTAAAGCTTCGATTTTTATCGGGATTCGAGGAGAAAAGATTGGATTTTTG
CTGCTGTGAGGGAAGAAGATGGAGGTGGAAGCGCAGCAGCGAGACGTTAACAACGTGA
TGTGCCAACTGGTGGATCCAGAAGGAACGACCTTGGGTCCTCCCATGTACCTTCCCCAG
GACGTTGGCCCTCAGCAGCTTCAGCAGATGGTCAACAAACTCCTCAGTAACGAGGACAA
ATTGCCGTATACTTTTTACATATCGGACCAGGAGCTCGTTGTCCCTCTTGAATCTTACTTA
CAGAAAAACAAAGTTTCTGTGGAGAAGGTGTTGTCCATAGTCTATCAGCCACAAGCCATT
TTCCGAATTCGTCCTGTAAATCGCTGTTCAGCAACAATTGCTGGTCACTCAGAAGCTGTT
CTGTCTGTGGCCTTTAGTCCTGATGGGAAGCAACTGGCTAGTGGTTCAGGAGATACCAC
AGTCCGATTGTGGGACCTAAGTACTCAGACCCCGATGTTTACATGCAAAGGGCACAAGA
ATTGGGTCCTCTCCATTGCATGGTCGCCTGATGGCAAGCATCTTGTAAGTGGAAGCAAG
GCTGGGGAAATCCAATGTTGGGATCCGTTGACCGGGCAACCATCAGGCAATCCACTTGT
TGGCCACAAGAAATGGATCACAGGTATATCTTGGGAACCAGTCCATCTGAGTTCACCAT
GCCGTCGCTTTGTGAGTTCTAGTAAAGATGGTGATGCACGCATATGGGATGTAACACTAA
GGAGATGTGTAATATGTCTGAGCGGTCACACTCTAGCTGTTACATGCGTAAAGTGGGGA
GGAGACGGTGTTATATATACTGGCTCTCAGGACTGTACAATCAAAGTCTGGGAAACTTCC
CAGGGGAAGCTGATAAGGGAATTGAAGGGTCATGGACATTGGGTTAACTCCCTTGCTCT
GAGCACTGAGTATGTTCTTCGGACTGGAGCTTTCGATCACACTGGCAAACAGTATTCATC
TGCTGAAGAAATGAAGCAAGTTGCATTAGAAAGGTACAAGAAAATGAAAGGCAATGCTCC
TGAAAGATTGGTCTCTGGATCTGATGATTTTACCATGTTTTTGTGGGAACCTTCTGTCAG
CAAGCACCCAAAAACGCGAATGACTGGTCATCAACAGCTTGTGAATCATGTCTACTTTTC
ACCCGATGGGCAATGGGTGGCTAGTGCTTCGTTTGATAAATCTGTGAAGTTGTGGAATG
GTATTACTGGGAAGTTTGTTGCTGCATTCAGGGGACATGTCGGGCCTGTATATCAAATAA
GTTGGTCTGCGGATAGTAGACTTCTTTTAAGTGGGAGCAAAGACTCTACTCTGAAGATCT
GGGATATTCGCACAAAGAAGTTGAAACGAGATCTTCCAGGTCATGCGGATGAGGTTTTT
GCGGTCGATTGGAGTCCAGATGGTGAGAAAGTAGTTTCTGGTGGTAAAGATAAGGTGTT
GAAGCTATGGATGGGTTAATGCATTGGGAGTTATACAAGGCTTATCAGACCATTCCATAT
GCAAATGAAATGAGATGGCAGTTCAAATGCTTTCATCCGAAGTGAAGAATTGATTGTTTG
ATAGCCCATATTACGGCTGAAGACACAAAGTGCACACTCAACTATCAGGTACTGCAAGTT
CGGATGCAATCAGCCACAAAGTGCGCACTCAACTATCAGGTACTGGAAGTTAGGATGCA
ATTTGGTTGCGTAAAGCCCTCTTGCATCAGGATGGTAGTCGCAGCCGATATGCATTGTCT
ACTTATGATGCAGCTGATATTCTCAAAGGTTTTGTTAATCTAAGAAACACTGTTCAATGCT
GTGTTAATTGAGAGTATCATGAGTCACCGATATTTTGTTCATATAAGTAATTGGCAGAATC
AGTCTTTGATATTCATATCCAGTTATAGTGAGCCCTTCATTTTTCTATTGGCTTTATACATA
ATGGCAGTCAAGTATGTTGGCCAGAGAGGAGCCTTCTTGGATTAACCCCTAGCTTGAGT
CATTAAATCAATTTATCGCACAAGTTTCAATAAAAAAAAAA
SEQ ID NO:95
CATCTCTCTCTCTCTCTCTCTCTTTCTCTCTGTGATTGCCTGCGGATCGGCTCTCTGCAG
GCGCTTGAATTGTGAAGCGCAGTCTCTTGATCGCATTTTGCCAGGGCTCTCGCGAAGAT
TTCGCGTCTCCGTGCCTCTTTACTTAAAGATCTGATTTTTGTCTCGAATCACCTTTGTTTG
TGTTGCAGGGTTCCTTCTTTTATTTTTATATTCTTTTGCGAGGTTTCTATCTCTCGCTCCAT
TCGCAAAGAACTTTCCCTCCGCCACCGTTGCGTAACTCGAATAGCCGGATTTTCGTTTTC
GTTTTTATTTCCCCGTTAATTTTCTGCATATCGCCTTTCTTGTGCTTCCCTTTTTTTTTGGT
CTTTGTGTAGTATGGATGCAGGATCTGCGCACTCTTCATCGAATATGAAGACGCAATCCC
GTTCCCCGCTTCAAGAACAGTTTCTGCAGAGGAGGAATTCTCGCGAAAACCTGGACAGA
TTTATACCCAATCGCTCAGCCATGGATTTCGATTATGCGCACTACATGCTGACCGAAGGG
AGGAAGGGGAAGGAGAATCCGGCGGTGAGTTCCCCCTCAAGAGAAGCCTACCGGAAGC
AACTGGCGGAAACGCTCAACATGAACCGGACTCGAATCCTGGCTTTCAAGAACAAACCC
CCGACCCCCGTGGAGTTGATTCCGCATGAACTCACTTCTGCTCAGCCAGCCAAGCCAAC
AAAAACCCGCCGATACATTCCTCAGACCTCGGAGAGGACACTGGATGCTCCTGACCTTT
TGGACGACTACTACCTCAACTTGTTGGACTGGGGAAGCAGCAATGTACTTTCAATTGCTT
TGGGGAATACAGTTTATCTCTGGAATGCTTCGGATGGCTCTACCTCTGAACTTGTCACGA
TCGACGACGAAACTGGCCCTGTAACCAGTGTCAGCTGGGCCCCAGATGGTCGCCACAT
TGCTGTCGGCTTGAACAATTCTGATGTACAGCTATGGGATTCTGCTGATAATAGACTGCT
AAGGACTTTGAGAGGCGGCCACAGATCTCGGGTTGGGTCTCTGGCATGGAACAATCACA
TCCTTACAACGGGAGGAATGGATGGACTGATCGTCAATAATGATGTGAGAGTGAGATCT
CACATTGTCGACACCTACAGGGGTCACACCCAGGAGGTTTGTGGGTTGAAATGGTCGGC
ATCAGGGCAACAATTAGCTAGTGGAGGGAACGATAACATCCTCCACATCTGGGACAGGT
CCACGGCATCTTCCAATTCACCCACTCAGTGGCTTCACCGCCTTGAAGAGCACACTGCC
GCTGTTAAAGCCCTTGCCTGGTGTCCTTTCCAGGGGAATTTGCTGGCTTCCGGTGGAGG
TGGAGGTGATCGGACCATTAAGTTTTGGAACACCCACACCGGTGCGTGCTTGAACTCTG
TGGATACAGGTTCCCAGGTCTGTGCTCTGCTATGGAACAAGAACGAGAGAGAGTTGCTT
AGCTCTCATGGATTCACTCAGAATCAGCTCACCCTTTGGAAGTACCCTTCAATGGTAAAG
ATTGCAGAGCTGACTGGCCATACTTCCAGAGTGCTATTCATGGCCCAGAGTCCGGATGG
TTGCACTGTAGCATCAGCGGCAGGGGATGAAACACTGAGGTTTTGGAATGTGTTTGGTG
TTCCGGAAGTGGCTAAACCAGCTCCAAAAGCTAATCCGGAGCCTTTTGCTCACTTGAATC
GCATTCGCTAAATGAAAAGTCATCTTTGCTGGTTATGTCTATCGTTTTGATCTTCAAGAAT
GCCAAACAATGGGATTCACAAGTCAAGAAGATCGCCACATTGAACAAGTATTGCGAATAG
TTGAAGACAAAATATGCGCCTATACAAAACCATAGTAGCAGTTGCCGTGAGTGTTCCCAA
GTGTATATTTGTACATTTTATATAGTCAAACCATAATTTTTTTTTCCAATGGATGTAGTCAC
TGATGCGTGACCTGCCATGTATATTTTTTGTGAACTAAAATTTTGACTATTGTCACTCCTC
TTTTAGAGCAAACAATATGTTTTTGTAGCAAAAAAAAAAAA
SEQ ID NO:96
TACTTTCTTCCCCTCCTGCGTCGCTAGCAGCGCGAGAGCAAGCTTCCGCAAGCTTTCGA
TTCGCTCGCATGGAGGAGGCGATCCCCTTCAAGAATCTCCCAAGCAGAGAATACCAAGG
TCACAAGAAGAAGGTGCATTCGGTGGCCTGGAACTGCACGGGGACGAAGCTCGCGTCC
GGCTCCGTCGATCAGACCGCTCGCGTCTGGCACATCGAGCCTCACGGCCATGGTAAGG
TTAAAGATATTGAATTGAAAGGGCACACGGACAGTGTGGATCAATTATGTTGGGATCCGA
AGCATGCAGATTTAATTGCAACAGCATCAGGCGACAAGACTGTACGGCTGTGGGATGCA
CGAAGTGGAAAATGCTCGCAGCAAGCAGAACTCAGTGGGGAAAACATCAACATAACCTA
CAAACCTGATGGAACACATGTGGCAGTTGGTAATAGGGATGATGAGCTAACGATTTTGG
ATGTTCGGAAGTTTAAGCCAATTCACAAGCGGAAGTTCAATTATGAGGTAAATGAAATAG
CATGGAACATGTCAGGAGAGATGTTCTTTTTGACAACAGGAAATGGTACTGTTGAAGTAC
TAGCATACCCATCTCTTCGACCTGTTGATACCCTTATGGCTCACACTGCTGGTTGCTATT
GTATTGCAATTGACCCAGTTGGAAGGTATTTTGCTGTTGGAAGTGCAGATTCCTTAGTGA
GCCTGTGGGATATCTCTGAGATGCTCTGTGTGAGGACTTTTACGAAACTTGAATGGCCC
GTAAGGACAATAAGCTTCAATCATACTGGAGATTACGTTGCTTCTGCCAGTGAAGATCTG
TTCATTGACATTTCAAATGTTCAAACTGGACGGACAGTGCACCAAATTCCATGCCGAGCT
GCGATGAACAGTGTGGAATGGAATCCTAAATACAATTTACTCGCATATGCTGGGGATGAC
AAAAACAAGTACCAGGCAGATGAAGGTGTTTTCCGGATATTTGGATTTGAGAGTGCATGA
GATTTCAGAGCAGGGCGAAATGATTTTGCGCATGCACAAGAGTTTCCAATTTTGTGTTCG
CTTAGGATTTTTTGTACACATTCGAAATCTCGTGTATGGGACGGATCTATCTTTCTTGCAA
GGAGTTCTTTGCACGTCTGTCTATTTACGAAGGAAGAGCGTGTTCTTTCTTTGATCTTAC
CAAGAACTTATTTACTTCACAAAACTTTTTTTTTTAAAAAAAAAA
SEQ ID NO:97
GGCTCCGTCCCGAAAATCTCGAAAGCTTTCAACCGTCCCCCGCCTAGGGTTTCCACTTC
TCTTGGCGGGAACCCGAAAAATCCCCCGTTGAAAATCCTCGAGAGTCCAGAATTCCCCC
CGGCGGCGAGCGGCGGCGGCGGCGGCGGCGATGGGGAAGGACGAGGAGGAGATGAG
GGGGGAGATCGAGGAGCGGCTGATCAACGAGGAGTACAAGGTGTGGAAGAAGAACAC
GCCGTTCCTGTACGACCTGGTCATCACCCACGCCCTCGAGTGGCCCTCCCTCACCGTC
GAGTGGCTCCCCGACCGGGAGGAGCCCCCCGGGAAGGACTACTCCGTCCAGAAGCTC
GTCCTCGGCACCCACACCTCCGAGAACGAGCCCAACTACCTCATGCTCGCCCAGGTCC
AGCTCCCCCTCGAGGACGCCGAGAACGACGCCCGCCACTACGACGACGACCGCGCCG
ACGTCGGCGGCTTCGGCTGCGCCAACGGCAAGGTTCAAATTATACAGCAAATAAATCAT
GACGGAGAGGTGAACCGAGCCCGTTACATGCCACAAAACTCCTTTATCATAGCAACCAA
GACAGTTAGTGCAGAAGTTTATGTTTTTGATTACAGCAAGCACCCATCTAAGCCTCCACT
AGATGGTGCGTGCAGTCCTGACTTAAGGTTGAGGGGCCACAGTACTGAAGGCTATGGAT
TGTCGTGGAGTAAGTTCAAGCAAGGACACCTACTCAGTGGTTCAGATGATGCTCAGATAT
GCTTATGGGACATCAATGCAACTCCTAAAAACAAGTCTCTCGATGCCATGCAAATCTTCA
AGGTTCATGAAGGTGTTGTTGAAGATGTCGCGTGGCATCTCAGACATGAATATTTATTTG
GTTCAGTTGGGGATGATCAATACCTGCTAATATGGGATCTACGGACTCCATCAGTAACCA
AGCCTGTCCAATCTGTAGTGGCTCATCAGAGCGAGGTTAACTGTCTAGCATTCAACCCCT
TTAATGAATGGGTTGTTGCGACGGGTTCTACAGATAAGACTGTTAAGTTATTTGATCTAC
GCAAGATCAGCACTGCACTTCACACCTTTGATGCTCACAAGGAAGAAGTTTTCCAAGTTG
GGTGGAATCCGAAAAATGAGACAATCTTAGCTTCATGCTGCCTTGGCAGACGACTTATG
GTGTGGGATCTTAGCAGGATTGATGAAGAGCAGACGCCGGAGGATGCTGAAGATGGAC
CGCCCGAGTTGCTTTTCATTCATGGTGGCCATACGAGTAAAATTTCAGACTTTTCATGGA
ACACCTGTGAAGACTGGGTTGTAGCCAGCGTGGCTGAAGACAACATCCTTCAAATATGG
CAGATGGCCGAGAATATCTACCATGATGAGGATGATGTTCCAGGAGAAGAGTCCAATAA
AGGATCTTGACCTAATCATTAACAATTCAGAAGTTTAGGAGAAAGCCGTAGTAACTTGAA
CCCCTATTTATCTTTACCTGTACCTACGCGCACCATTATCTTTGATCTTGCAATGCTGTGC
TAGATGATGAGTTTTTACTGTCTTTAGAATTACCATTTGTAAAGGATTCACTTCTCTTTTTA
ATTATTTTTTTGAACTCCGAAAAAAAAAA
SEQ ID NO:98
AAAAATAAAAGCGACGGTTTGTGATTTTCGAACAAAAAGTATTTTGGGGAGACATTTCCC
TGAATCCCTCAACGGTTTGCCTCGCCCCTGTTCGGAAAATCCGAGGGAAAAATAGGTTC
GAAACCTGCGTCGCCTCAAAGATGGGATCGAACCGGAATGATTAGACCCTTGATTAGGG
TTCTCTCACATTCGATCCCCTGAAATTATCATCATCATCATCATCATCATCATCAAATCGA
ATCCATTTTCAGGATCATTTTGAGTTTCGAGTTCGATCTCCATTGCCGATCCGATCTCGG
GAGGACGAGAAGGGACGCGGCGCAAAGATTTCGGTGCGTGAAGGAGGTTGTTCCGGG
GACGACAAATGATGAGGGGCTTCTCGTGTACGGAAGATGGCGATGCGCCTTCGACCTC
GAGTACTTCGCCGCCGCCGCCGCCGCCGCCGCCCCATCGACAGCAGATGCAGGCGCC
TCGAGCTTCTTCTTCTTCTTCTGGGCAGCCGACGAGCCGGCGGAGCACTGGCAATGTCT
TTAAACTCTTAGCACGAAGAGAGGTCTCTCCACGTTCAAAACATTCTTTAAAGAAGTTTTG
GGGAGAAGCCTCAGAGTGTCAGCTCTGTCCCTTCCAGCAAAGTTATGAAGCAGTAAGAG
ATGTAAGACGAAGTCTCATCTCATGGGTTGAGGCATTTTCACTGCAACATTTGTCTGCCA
AATATTGCCCCCTTATGCCTCCTCCAAGATCAACGATTGCTGCAGCCTTTAGTCCTGATG
GGAAAATACTTGCTTCTACACATGGAGATCATACTGTCAAACTTATTGATTCTCAGACGG
GGAGTTGTTTAAAGGTGTTAAGGGGTCATAGGAGGACACCGTGGGTGGTAAGGTTCCAC
CCCTTGTACCCAGAGATCCTTGCAAGTGGTAGTTTGGATCACGAAGTTCACCTGTGGGA
TGCAAATACTGCAGAGTGCATAGGATCTCGCAATTTTTACCGTCCTATTGCATCCATTGC
GTTCCACGCCCAAGGAGATCTTCTTGCTGTTGCATCTGGTCACAAGTTGTACATTTGGCA
CTACAATAGAAGTGGAGAGACATCATCACCAACTATTGTTTTGAGGACACCACGATCTCT
TAGGGCTGTGCATTTCCATCCACATGCAGCTCCATTTCTTTTAACAGCTGAGGTCAATGA
CCTTGACTTGACAGATTCGGCTATGACCCTGGCAACTTCTCCCGGTTATTTGCACTACCC
TCCTCCTACTATATATCTTGCAGATGCTCATTCTAATGAAAGATCTAGGTTGGAAGATGAA
CTGCCTCTCATGCCTTCACCATTGTTGATGTGGCCTTCCTTTACTAGAGATGATGGAAGA
GCAACCTTGCCACACATAGGTGGGGATGTTGGTCTTAGTGGACAGCAGAGGGTAGACTC
GTTGTCTTCAGGGCAGTATGAATTCCATCCATCTCCAATTGAACCTAGCAGCTCTACTTC
CATGCATGAAGAGATGGGAACTGATCCTTTCTCTAGTGTAAGGGAATCTGAAGTAACTCA
GTCTGCAATGAACATTGTGGACAATACGGAAGTGCAACCTGAAGAGAGAAGTACTTATA
GTTTCTCTTTTAGCGACCCAAGGTTTTGGGAGCTGCCTTCGGTGTATGGATGGTTAGTTG
GTCAAACCCAAGCTGCCCCTCGAACTGCACCAAGTCCTGGAGCTCTTGAAACTGCATCT
GCTCTTGGTGAGGTAGCGAGTGTTTCACCTGTCAGATCTGAATTTATGCCAGGTGGCAT
GGACCAGCCACGGCTTGGTGGAAGATCTGGGTCTGGATGTCGGTCTTCTGGGTCTCGG
ATGATGCGCACAGCTGGACTTAATGATCACCCACATGATGAGAACTATCCTCAATCTGTT
GTTAGTAAACTCAGGTCAGAACTTGAAGCCTCGCTGGCTGCAGCAGCATCTACAGAGTT
GCCATGCACGGTGAAGCTTAGAGTATGGCCATACGACATGAAAGATCCTTGTGCACTTTT
TCGTTCAGAAAGTTGCCGCTTAACTATTCCTCATGCAGTTCTATGCAGTGAAATGGGCGC
CCATTTTTCTCCTTGTGGCAGATTCTTTGCTGCCTGTGTTGCATGTGTGCTACCTCAGTT
GGAAGCTGATCCTGTATTACACGGCCAGGTCGATCCCGATGTCACTGGAGTGGCAACCT
CACCTACTAGACACCCTGTTTCTGCTTATCAAATCATGTATGAGCTTCGGATTTATTCCTT
GGAGGAGGCAACATTTGGAATGGTGCTTGCATCACGATCCATAAGAGCTGCTCATTGTT
TAACCTCCATTCAGTTCTCTCCAACATCTGAACACTTACTGCTCGCCTACGGCCGTCGAC
ATAATTCGCTTCTTAAAAGTATAGTCATTGATGGAGAGAATACAGTGCCTATTTACAGTAT
ATTGGAGGTCTACAGAGTTTCTGATATGGAACTTGTGAGAGTTCTTCCCAGTGCGGAAGA
TGAAGTAAATGTTGCATGCTTTCACCCTTCAGTTGGAGGTGGCCTGGTTTATGGAACAAA
GGAAGGGAAGTTAAGGATTCTCCAAATTGACAGTTCTGGGGGCTTAAATCCAAAATCAAC
TGGTTTTCTTGATGAAAACATGGCAGAGGTGCCTACATATGCTTTAGAATGCTAGTAGCT
TGGAAGTGCAACACGCTCGTGAAATTTACTGCTCGACCTTGAAGCAGTCTTTTGAAGCAT
ATTATGTCAAAACTGGGAAGAAGTTCATCAATTATGACATCTGAGGTAAACTCAAAAAAAA
AA
SEQ ID NO:99
CTTCGAGAGAGAGAGAGAGAGAGATGGGCGAGGGCGATCTCCCCCGGACGGAGGCGG
GCGTGCTCCGGGGACACGAGGGCGCGGTGCTGGCCGCCCGGTTCAACGGCGACGGCA
ACTACTGCCTCAGCTGCGGCAAGGACCGCACCATCCGCCTCTGGAACCCCCACCGCGG
CATCCACATCAAGACCTACAAGTCCCACGGCCGCGAGGTCCGCGACGTCCACTGCACC
TCGGACAATTCAAAGTTGATATCTTGCGGTGGTGACCGGCAGATATTCTATTGGGATGTG
TCGACTGGCCGAGTCATTAGGAGATTCCGTGGGCATGATAGCGAGGTGAATGCAGTAAA
GTTTAACGACTATGCATCTGTCGTAGTATCAGCCGGCTATGATCGTTCAGTGCGTGCTTG
GGATTGCAGATCTCATAGCACTGAGCCGATTCAGATTATCAACACGTTTCAAGACAGTGT
GATGTCTGTTTGCTTAACAAAAACTGAAATTATTGGTGGCAGCGTTGATGGTACTGTTCG
AACATTTGACATTCGTATTGGTAGAGAAATATCTGACGACTTGGGGCAACCTGTCAACTG
TATTTCAATGTCAAATGATGGTAACTGCATATTGGCGAGTTGTTTAGATTCAACTTTGCGT
CTTGTAGATAGGTCTGCCGGTGAGCTATTGCAAGAATATAAGGGCCATACTTGCAAGTC
CTACAAATTGGATTGCTGTCTTACTAATACTGATGCACATGTGGCTGGTGGATCTGAGGA
CGGTTATGTCTTCTTCTGGGATTTGGTCGATGCATCAGTGATATCCAAATTTCGAGCTCA
TTCCTCTGTGGTAACAAGTGTAAGTTATCATCCAAAGGAAGACTGTATGATCACTGCCTC
TGTGGATGGCACAATTAAGGTGTGGAAGACATGAAACTGTTCAGCTGAGCCTGAGGGAT
CATTCCTTTGTCTGAGATACGATATAGGGAATTCACCGAGTCAATTCTTAGGATATCTTGA
ATCAGACCATTTGTACTTACTTTTGCCTCAGGCAATGCATCTTTTCATGCTTCATTAGGAG
GTAATATCTTTTGGGTAGCAAAACTTTTTGCAGAATTTCGGAAGCTGAGGTTCATGTATAT
TATAGTCTAGCTCGGGAGGATTGCACTTCCAGCAAATTTCTTAGGGGTTGTGTACCATTG
ATGGGATGGATTGATAATCTATGAGCTGAATGTGCATTTCGATTTCTTGTCTGTGCCAGA
GAAAAAAAAAA
SEQ ID NO:100
GACACGCCCAAAATCCACTCCTTCTCCCTCCCTTCCCTTCCCTTCCCTCTCATGCACAAG
ATCAGATCACGCAAACCCTAGATCACCACCGCCAGATCACCACCGCCATCGCCCCATCT
GAGATTCATTCTTCCCATTCCCCGATTGATCCGATTGATTTTATACTGTACCAATAAATCC
GTAATCTTGAGAGCTTTTGCGCTCTGTGCAATTCCCGGCGATGGCGTGCATAAAGGGGG
TGGGCCGGTCGGCGTCGGTTGCCATGGCCCCCGACGGCGGGTACTTGGCGACTGGGA
CCATGGCCGGGACTGTGGATCTCTCCTTCAGCTCCTCCGCCAGCCTCGAGATCTTCGGC
CTCGACTTCCAGTCCGACGATCGGGATCTCCCCCTCATCGCCGAATCCCCCAGCTCCGA
GCGCTTCAACCGGCTCTCCTGGGGCAAGAACGGATCCGGCTCCGACGAATTCTCCCTG
GGCCTGATCGCCGGTGGCCTCGTCGACGGGACCATTGGCCTTTGGAACCCGCTCTCCC
TGATCCGTTCTGAGGCTGGTGATAAGGCAATCGTAGGACACCTTTCTAGGCATAAAGGA
CCTGTTCGTGGTCTTGAGTTTAACGTCATTGCACCAAACTTACTTGCTTCTGGGGCTGAT
GATGGTGAAATTTGCATTTGGGATTTGGCTGCACCAAGAGAACCTTCTCATTTCCCTCCT
CTTAGGGGTAGTGGTTCTGCTGCTCAAGGTGAAATTTCATTTTTATCTTGGAATAGCAAA
GTTCAACATATATTAGCCTCCACTTCCTATAACGGGACAACAGTTGTATGGGATCTCAAG
AAACAAAAACCAGTCATAAGTTTTTCAGATTCAGTTAGAAGGCGTTGCTCAGTTTTGCAAT
GGAACCCTGACCTGGCTACTCAACTTGTTGTTGCATCAGATGAAGATAGTTCTCCTACTT
TAAGGCTTTGGGATATGAGAAACATAATGTCACCTGTTAAAGAATTTGCGGGGCATACAA
GAGGTGTTATTGCAATGTCCTGGTGTCCTAATGATAGTTCCTATTTGGTCACCTGTGCTA
AAGACAATAGAACCATATGCTGGGATACAGTTACTGGAGAGATTGTTTGTGAATTGCCTG
CTGGATCCAACTGGAATTTTGATGTGCATTGGTACCCTAAGATACCCGGAGTTATATCAG
CATCTTCATTTGATGGAAAGATTGGCATCTACAATGTTGAGGGTTGTAGCCGCTATGGTG
TCAGGGAAAATGAATTCGGGGCTGCAACTCTGAGAGCTCCAAAATGGTTTAAACGACCT
GTTGGCGCATCCTTTGGTTTTGGAGGCAAGGTGGTTTCATTTCATACTCGGTCAACAGGA
GGTCCTTCAGTCAATTCCTCAGAGGTTTTTGTACACGATATCATTACAGAACAAACTTTGG
TGAGCCGCTCGTCTGAATTTGAAGCTGCAATCCAAAGTGGGGATAGACCTTCATTGAGA
GCATTATGTGAAAAGAAGTCTCAACATTGCGAATCTACAGATGACCAAGAAACATGGGGA
TTCTTGAAAGTTTTGTTAGAAGATGATGGAACCGCAAGGTCAAAGCTACTTGCTCACCTT
GGTTTCGATATTCCTACGGAGACAAATGATGGTTCACAAGAGGATCTCTCCCAGCAAGTT
AATGCTCTTGGGCTCGAGGATGTGACTGCAGATAAAGTAGTGCAGGAGGACAACAACGA
AAGCATGGTGTTTCCTACTGATAATGGCGAAGATTTTTTTAACAATCTTCCTAGTCCCAGG
GCCGATACACCAGTATCAACTTCTGCTGATGGCTTTCCTACTGTGAATGCTGCCGTGGAA
CCATCGCAAGACGAAGTAGATGGACTTGAGGAGAGCTCTGACCCATCGTTTGATGATAG
TGTTCAACGTGCTTTGGTTGTTGGAGACTACAAGGCTGCTGTTGCATTGTGCATGTCTGC
TAATAAATTGGCTGATGCTTTGGTTATTGCACATGTTGGAGGTGCCTCCCTATGGGAGAG
TACTCGTGATAAGTACCTCAAAATGAGCCGCTTACCTTACTTGAAGGTTGTTTTTGCAATG
GTGAATAATGATCTCCAGAGCCTTGTGGATACCAGGCCCCTCAAATTCTGGAAAGAGAC
GCTCGCTATTCTATGTAGTTTTGCACAGGGGGAGGAGTGGGCGATGCTCTGCAACTCTC
TAGCCTCAAAACTTATGGCAGCTGGTAACATGTTGGCAGCGACACTCTGTTTCATTTGTG
CTGGGAATATTGATAAAACAGTGGAAATTTGGTCGAGAAGCCTGGCAACTGAACATGAT
GGGATGTCCTACATGGACCTTCTTCAGGATTTGATGGAAAAGACTATTGTTCTTGCATTG
GCTAGCGGCCAAAAACAATTCAGCGCTTCAGTGTGCAAGCTTGTGGAAAAATATGCAGA
GATTTTTGGCTAGTCAAGGGCTTTTGACAACAGCGATGGACTATTTGAAGCTGTTGGGGA
CAGATGATTTGTCACCTGAACTCGCGGTCTTGAGAGATCGTATCGCCTTCTCTGTAGAAG
CTGAAAAAGGAGCCAACATCTCTGCTTTTAATGGCTCTCAAGATCCAAGAGGTGCAGTAT
ATGGTGTCGATCAATCCAATTATGGCATGGTTGATACTTCTCAGCACTATTATCCGGAAG
CTGCACAACCACAGGTGCCGCATACTGTTCCTGGTAGTCCTTATGGTGAAAAACTATCAG
CAGCCTTTTGGCTCTTCATTTGGGAAAGGATACAATACTCCCATGCAATATCAGGCTCCT
TCACAGGCGTCTATGTTTGTTCCATCCGAGCCACCTCAGAATGCCCAGCCAAGTTTTGTT
CCAACTCCTGTCACCAGTCAGCCCACGACGAGATCTCAATTTATACCAGCGCCTCCTCTT
GCCCTGAGGAACCCAGAGCAGTATCAGCAGCCAACATTGGGTTCTCATTTGTATCCGGG
AAGTGTTAATCCTACTTTTCAACCTTTGCCTCATGCACCTGGTCCGGTTGCCCCTGTGCC
ACCACAAGTGAGCTCTGTTCCTGGCCAGAATATGCCACAAGCTGTGGCTCCTACTCAAA
TGCGGGGGTTTATGCCAGTTACTAATCCGGGGGTTGTTCAAAATCCCGGGCCAATTTCC
ATGCAGCCGGCTACTCCCATTGAATCAGCAGCTGCACAACCAGTTGTCTCACCTGCTGC
ACCCCCACCTACTGTGCAGACTGCAGATACTTCAAACGTTCCTGCCCCCCAGAAACCTG
TCATCGCAACATTGACGAGACTCTACAATGAGACATCAGAAGCATTGGGAGGCTCACGT
GCAAATCCTGCCAAGAAGCGTGAGATAGAAGATAATTCAAGGAAAATAGGTGCCTTGTTT
GCAAAGCTCAACAGTGGAGACATATCTAAAAACGCAGCTGATAAGCTTGTCCAGCTATGT
CAGGCTTTAGACAACGGTGATTATAGCACTGCTCTGCAAATACAGGTACTTCTCACGACT
AGCGAGTGGGATGAGTGCAATTTCTGGTTGGCAACGTTGAAGCGGATGATCAAGACAAG
GCAGAATGTGAGACTAAGTTGAGAAGAGGCCAATTCTTCTCAACGCATCTCCTTTGTTCT
TTTTTCTCTCCTTAGCCTTGTGTGGTGGATTCTGTAGGGGAATTGATCATTTCTTTGGAAA
GCATGGCTCTCTTAGATTGCTACTGGAGTAATTTTTGGTCTTTTGTTATAGAATGTCTTTT
TTCATTGATGCTGACTTCTGAAAGCCGTTAGGATAGTCTTTAAAGGAGTTGGTGATTATT
GATTTCCACCCAATATATGTAGCGTCAAGGAATTAGAGAGCTTTTTACTAGCCCTGGTGG
ATGTTTTTGACACTATTTTTCACTATAAGTTACTCTCCCAGGCTGGTCTCTTCTGAGATGC
CCGACGTCTGTTCATGACTTAGTGTTATAGTGATCATTTATGAATTTTTGGTATGCTTTTG
GTGAGTGATAGCTTGTCAAAGAGAGAGAGAGAGGCTTGTCACAAGAAAACAATCAATTAT
CTTGTTGGTGTGGGGCTGAAGATTCTTCGTTTTGTTTGTATTTTCTTTTTCCATTATCTGG
ATAAGGAAGACAGCAGCAAATGTTGTCGCTGAAAAATAACCTTGCTTTTGACACTGTTGA
GAGAAGACTCGGTAGGATGTATTATTGGATGCTGTTATTCCAAAAAAAAAA
SEQ ID NO:101
GGAAACGCCGAGAGAGGGAAAGGAAGGAAGGAAGGAGAAGGAGATGAAGGAGAGAGG
GAAGGGGGCGGGGAGATCGGTGGACGAGAGGTACACCCAGTGGAAGTCGCTCGTCCC
CGTCCTCTACGACTGGCTCGCCAACCACAACCTCGTCTGGCCCTCCCTCTCCTGCAGAT
GGGGTCCACAGCTCGAGCAAGCTACTTACAAAAATCGACAGCGTCTTTACCTATCTGAA
CAGACAGATGGCAGTGTTCCAAATACACTTGTTATTGCAAATGTCGAGGTTGTCAAACCA
AGAGTTGCAGCTGCAGAGCATATCTCTCAGTTTAATGAAGAAGCGCGCTCTCCCTTTGTC
AAGAAGTTCAAAACTATTATACATCCTGGGGAGGTAAACAGGATCAGGGAGCTGCCCCA
AAACAGTAAGATAGTAGCAACTCACACCGATAGCCCTGATGTCCTCATCTGGGATGTTGA
GACCCAGCCTAATCGCCATGCTGTTTTAGGTGCTTCAACTTCACGTCCAGATCTGATACT
CACCGGGCATAAAGATAATGCTGAGTTTGCTCTTGCAATGTCCCCAACCGAACCTTTCGT
TCTTTCTGGAGGTAAAGACAGATATGTAGTTTTATGGAGCATTCAAGATCATATCTCAACT
TTGGCTGCAGATCCTGGGTCTGCAAAATCTCCAGGGTCTGCTGGAACCAACAATAAGCA
GTCTTCTAAGGCTGCTGGTGGCAATGATAAGACTGGGGACAGCCCTTCAATTGAACCTC
GTGGAGTCTACCTGGGGCATGGCGACACTGTTGAGGATGTGACTTTCTGCCCGTCGAGT
GCACAGGAGTTCTGTAGTGTGGGGGATGATTCTTGTCTCATCCTATGGGATGCAAGAAC
TGGCTCTTCTCCAGCTATCAAGGTTGAGAAAGCTCATCATGCTGATCTCCATTGTGTTGA
TTGGAATCCTCATGATGTAAACCTGATTCTAACCGGGTCAGCTGATAATACAGTTCGCAT
GTTTGACCGCCGAAACCTCACTTCGGGGGGAGTTGGATCACCTGTCCATACTTTTGAGG
GTCACAATGCTGCTGTGCTCTGCGTACAGTGGTCTCCAGACAAGTCATCTGTCTTTGGAA
GTTCTGCTGAGGATGGCATTTTAAATATCTGGGATCATGAAAAGATTGGTAGAAAGATAG
AGACCGTGGGTTCAAAAGTACCCAATTCTCCTCCAGGGTTATTCTTCCGTCATGCTGGG
CACAGGGACAAGGTTGTTGACTTCCATTGGAATTCATCCGATCCCTGGACAATTGTTAGT
GTTTCTGATGATGGTGAAAGTACTGGTGGAGGTGGTACACTGCAGATATGGCGGATGAT
CGATTTGATTTACAGACCTGAGGAAGAAGTTCTGGCTGAGCTGGACAAATTCAAGTCTCA
CATTCTCTCTTGCACCTCTTGAACTTCTCATAATAAGAAGAAAGATCAGCAAGGTTGATGA
CTAACTTGAAGTGGGAAGCAAGTTAACCGTGTTCGATTGTTTTGTGTAAAAAGGAGCTCA
TCATCCTCGAGTTTCATCCTTTTTTATGATGCTAGGGATTATTTAGTGCGTCTGCTTGAGG
TTTAAGTTTTGTCTCGGTGTGGACATGGAGTCTGACAAATGTGTTCTCCAGCATTCCTTAA
GGCCATGGCTGAGAGGCTCGTTTTGGTTGCTTCATTGGATTATGAAGAGATTGCTTAATA
CTTAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:102
CTAATTTCAAGGTTCTCTTTCCTTTCTTGCAGACAACACTCGCCCCTCGCTTCTGGTCTTT
TCCAAAGCAATAAAATTCCCTCGTTCGCCGTTCGGCGTCTTCACCGGCGGCGCGCGCC
GCACTGCGTACCCACCGGCTGTCGCGTTCTCGCGGATCGAACTCGAGGAAAAGGCATC
GGCGGCGGATCGGGGCAAATGGCGAAGATCGCGCCCGGGTGCGAACCGGTGGCGGG
GACGCTGACCCCGTCGAAGAAGAGGGAGTACAGGGTCACCAACAGGCTCCAGGAGGG
GAAGCGTCCCCTCTACGCCGTCGTCTTCAACTTCATCGACTCCCGCTACTTCAACGTCTT
CGCCACCGTCGGCGGCAACCGGGTTACTGTTTATCAGTGTCTCGAAGGTGGAGTAATAG
CTGTGTTGCAGTCATACATTGATGAAGATAAGGACGAGTCGTTTTACACGGTCAGCTGG
GCGTGCAATATCGATAGAACCCCGTTTGTGGTGGCGGGAGGAATCAATGGTATCATCCG
TGTAATTGATGCTGGCAATGAGAAGATACACAGGAGTTTTGTAGGCCATGGGGATTCAAT
AAATGAAATCAGGACTCAACCATTGAACCCATCCCTCATCGTGTCTGCTAGCAAAGATGA
ATCCGTTAGGCTCTGGAACGTTCATACGGGAATTTGTATCCTGATATTTGCTGGAGCTGG
GGGTCATCGCAATGAAGTTTTGAGTGTGGACTTCCATCCTTCCGACAAGTACCGTATTGC
AAGTTGTGGTATGGACAATACGGTTAAAATCTGGTCAATGAAAGAGTTCTGGACATACGT
GGAGAAGTCATTTACATGGACAGATCTTCCATCGAAGTTTCCCACCAAATACGTGCAGTT
TCCAGTTTTCATAGCTCCAGTTCATTCAAACTATGTTGACTGCAACAGGTGGCTTGGTGA
TTTTGTTCTGTCAAAGAGTGTTGACAACGAGATTGTGCTTTGGGAACCCAAAATGAAGGA
ACAATCTCCGGGAGAGGGGTCGGTGGATATCCTTCAGAAATATCCAGTTCCAGAGTGTG
ACATTTGGTTCATCAAATTTTCCTGTGACTTTCATTATCACTCAATTGCTATAGGAAATAG
GGAAGGGAAGATCTACGTATGGGAGCTGCAGAGTAGCCCTCCTGTTCTAATTGCAAAGT
TGTCTCATCCCCAATCAAAATCCCCAATCAGACAGACTGCCATGTCATTTGATGGGAGCA
CAATCCTGAGCTGCTGTGAGGATGGTACTATATGGCGCTGGGATGCAATTACGGCATCA
ACATCCTAAGCCTTCCCTGGCAGATGGACTGGAGAACTCCGTTAGTAATTATGAATCCCT
CTTGTGTGGGCATGTTCCCCACCATGTATCAGCTGAATGGGAGCTGCTTCAACCTCTTAT
CTCGATGGAGACTCGAATAGCATCACCGCACAGATGCAAGCGGACAACTGCTTTTTCGT
AACGAAGAAAGCAAGTGGATGATTTGGTTGTGCATCAGTCTGAACGATTTATGAAGTTAC
TTTTTGGTGTCAAATGTACTCTCCGTGAATCATTTCACTTCGCAAACTGGGATTTGTACCC
TAGAAACATCCAGTTTAATCTACCTTAACTTCCCAGAAAAAAAAAA
SEQ ID NO:103
ATTTCGGGAGGAGTAAAAAAGGAAAATGAAAAGAGAGGGAGAGACAGAGAGGTCGGGC
ATTATCCATTCAGCATAGGAGGCGGGGGAGAAGGAGACAAAGCCCATGAGATCATCAG
GATTGGTGGTGGGATTACTGTTTAACTGCGACATGGATCGAGCAATGTCGCTGTTCACT
CTGTTGCCACTAGCGGTATGAATACGGCAATGCATTTTGGTGCTGGTTGGCGATCGATT
GCTGAGATGGGGTATACGATGAGCAGACTAGAGATTGAGCCTGAGTCGTGTGAGGACG
AGAAGAGCTTGGATGGGGTTGGTAACAGCCAGGGACCGAATGAGTTGCCGAGATGCTT
GGATCATGAGTTGGCGCATTTGACGAATCTGAAGTCGAGGCCCCATGAACATTTGATCC
GAGATTTCCCTGGGAGGCGGGCTCTGCCTGTTTCCACCGTTAAGATGCTGGCGGGTCG
AGAGTGTAATTATTCACGAAGAGGGAGGTTCTCCTCCGCTGATTGCTGTCACATGCTGA
GCAGATATGTGCCTGTTAATGGTCCTTCGCCCCTGGATCAGATGAATAGTCGAGCTTATG
TTTCGCAATTTTCAGCTGATGGTTCTCTATTTGTTGCTGGCTTTCAGGGTAGCCACATTAG
AATTTATAATGTTGATAAAGGATGGAAATGTCAGAAGAACATTCTTACCAAGAGTTTACGG
TGGACGATCACTGATACATCTCTTTCTCCTGACCAACGTTACCTTGTGTATGCCAGTATG
TCACCCATCGTCCATATTGTTGACATCGGCTCCGCTGCTATGGATTCTCTTGCAAACATC
ACGGAGATCCATGAGGGTTTGGATTTTTCCGCTGACAGTGGACCATATTCTTTTGGAATC
TTCTCTGTTAAATTTTCTACCGATGGACGAGAAGTCGTCGCTGGAAGCAGCGACGATTCT
ATATATGTCTATGATCTTGTGGCAAATAAGCTTTCCCTCAGAATTCCAGCACATGAGTCTG
ATGTGAACACAGTATGCTTTGCTGATGAAAGTGGTCATATAATTTATTCTGGGAGTGATG
ATACATACTGCAAGGTGTGGGATAGACGTTGCCTGAGTGCCAGAAATAAACCTGCAGGA
GTTCTAATGGGACACCTTGAAGGCATTACGTTCATTGATAGCCGTGGTGATGGTCGTTAT
TTCATATCAAATGGCAAAGATCAGACGATCAAACTTTGGGATATCCGGAAAATGGGCTCT
GATATCTGTCGTCGAGGCTTTAGGAATTTCGAATGGGATTACAGATGGATGGACTACCCA
CCCCGGGCTAGGGATTCGAAACACCCTTTTGATCTGTCAGTGGCAACATATAAAGGCCA
TTCGGTGTTGCGTACTCTTATTCGGTGCTACTTCTCCCCAGTACATAGCACTGGTCAAAA
GTATATCTACACTGGATCCCATGATTCCTGTGTTTATATCTATGATGTGGTGACTGGAGC
TCAAGTTGCGGCCCTCAAGCACCATAAATCGCCGGTCAGAGACTGCAGTTGGCACCCG
GAGTACCCGATGATTGTGAGCTCTTCTTGGGATGGGGATATTGTGAAATGGGAATTCTTT
GGGAACGGAGAAACTGAGATCCCGGCGATGAAGAAGAGGATCCGGAGGCGGCATTTGT
ATTAAAAGATGCTTACTATTTTTCACTCGATGACGGTTGGCCGGATAAATAATCGCTTATA
TAGTCCTAATAAGTTCCATCTCATGTACAGATAAGGTGTATCATTCAAGCTATAGTCCCAA
ACGACACGCGATTCGTGTGAATTTCGTTGTACATAATCGACCTTTCGAATGTAATTGTGA
CTGTGAGCCATTGTTGCTTGTTACAAGCTTCCAAAAAAAAAA
SEQ ID NO:104
ATTCACTGGCCTAACTACTACCCGACCGGGAGCAACCGAAGCGCACGGTGCCCGTCGA
GCGCGCTCTCTCTCCGGCGATGGAGCCGCAGCCGCAGGCCCCGAAGAAGCGCGGCCG
GAAGCCGAAGCCGAAGGAGGACAAGAAGGAGGAGCAGCTCCACCAGCCGCCGCCGCC
GCCGCCGCCGCAGCAGCAGGCGGCTCCGGCGCCGGCACCGGCGGCCACCAGGTCGT
CGACGTCGGGGTCGGCGGGGGGCCGGGACCGGAGGCCGCAGCAGCAGCACGCGGTC
GACGAGAAGTACGCGCGGTGGAAGTCCCTCGTCCCAGTCCTCTACGACTGGCTCGCCA
ACCACAACCTCCTCTGGCCTTCTCTCTCTTGCCGGTGGGGCCCGCAACTCGAGCAAGC
GACTTATAAGAATCGGCAGCGGCTCTACATTTCTGAGCAGACTGATGGCAGTGTTCCAA
ATACTTTGGTGATAGCAAACTGTGAAGTTGTGAAACCTAGAGTTGCGGCTGCAGAGCAC
GTGTCCCAGTTTAATGAGGAAGCTCGCTCTCCCTTCATAAGGAAGTACAAGACAATTATA
CATCCTGGAGAGGTTAACAGAGTCAGGGAACTTCCTCAGAATCCCAATATTGTGGCAAC
TCACACTGACAGCCCAGATGTTCTCATTTGGGATGTGGAATCTCAGCCTAACCGGCATG
CTGTCTATGGAGCTACAGCTTCTCGTCCAAATCTGATTTTAACTGGACATCAAGAGAATG
CTGAATTTGCCCTTGCAATGTGTCCAGCTGAACCCTTTGTTCTCTCTGGAGGGAAGGATA
AGACGGTGGTTTTGTGGAGTATCCAAGACCATATAACAGCATCTGCAACAGATCAAACAA
CTAATAAATCTCCAGGATCTGGAGGATCCATCATTAAGAAGACTGGGGAAGGTAACGAG
GAAACTGGAAATGGCCCTTCTGTTGGACCACGAGGAATCTACTGTGGACATGAGGATAC
TGTTGAAGATGTGGCTTTTTGTCCATCCACTGCACAGGAATTTTGTAGCGTTGGTGATGA
TTCATGCCTTATATTGTGGGATGCACGAGTTGGGACTAATCCCGTTGCTAAGGTCGAGAA
GGCACATAATGGTGACCTCCATTGTGTGGATTGGAATCCCCATGACAACAACCTAATCTT
AACTGGGTCGGCAGATAACTCTGTTAACATGTTTGATCGGCGAAATCTCACTTCTAATGG
AGTTGGTTCACCAGTCTACAAGTTTGAGGGGCATAAGGCAGCTGTTCTTTGTGTGCAGT
GGTCTCCAGACAAGCCTTCCGTCTTTGGGAGTTCTGCTGAAGACGGTCTCTTGAACATTT
GGGATTATGAGAGGGTTGATAAAAAGGTTGACAGGGCTCCAAATGCTCCTGCGGGATTG
TTTTTCCAGCATGCTGGTCACAGGGACAAAATTGTCGACTTCCACTGGAACGCAGCTGAT
CCATGGACTATGGTTAGCGTATCTGATGACTGTGATACTGCTGGAGGAGGTGGTACATT
GCAGATATGGCGAATGAGCGATCTGATCTACAGGCCGGAAGAAGAGGTTTTGGCTGAG
CTGGAGAATTTCAAGGCGCATGTACTGGAATGCTCGAAGGCATGAGAGTGCCTCGAGAA
CAGGCCCTCCGGGTCTCAAAACACTAACTAGACAAAGCGGGTTTTTGCTGGCTGTTACT
GCTGTAAAATCTGTAGGTACTTAGCCATGGTTTAGACTCATCTGTGAGCGCCAGGACTCC
CCTCTTTACGCAGATGGTGACTGAAGCTGGTTCCGAGATCGGCATATGTAGCTGGTAGA
GGTGTGGATATTGCATAGACCGAACCTCCGCAGGTCCGCATTCTCGAGTGAGAAACAGA
GATAAATTTTAAGGGGGTTCCCAAAAAAAAAA
SEQ ID NO:105
TCCTTACCCGCCCCCGAACTCGCGAAGTTCCCAACTTCAAATCCTTCGGTCACCACCAC
CGCCACCTCCTCCGTCGCGGGAACGATGGGGATATTCGAGCCGTACAGAGCGGTCGGA
TACATCACCACCGGCGTCCCCTTCTCCGTCCAGCGCCTCGGCACCGAGACCTTCGTCAC
CGTCAGCGTCGGCAAGGCTTTCCAAGTGTACAACTGCGCGAAGCTGAGCTTGGTGCTTG
TTGGTCCCCAATTGCCAAAGAAAATACGGGCCCTCGCATCGTACCGTGAATACACTTTTG
CTGCTTATGGAAGTGACATTGGCATATTCAAGCGTGCTCACCAGTTAGCAACTTGGAGC
GGGCATACTGCTAAGGTTTGTTTGCTGCTGCTGTTTGGAGAGCACATATTGAGTGTAGAT
GTTGATGGCAATGCATACATATGGGCGTTCAAAGGAATGAATTACAACCTTTCCCCAGTG
GGACACATCTTGTTGGATAGCAATTTTACTCCTAGCTGTATAATGCATCCAGACACTTACT
TAAATAAGGTTATTCTGGGAAGCCAGGAAGGGCCACTGCAGCTTTGGAACATAAGCACT
AAGACAAAGCTCTATGAGTTCAAGGGATGGAATTCCTCTGTTAGCAGTTGTGTTTCATCA
CCTGCTTTAGATGTTGTTGCGGTTGGCTGTGCTGATGGCAAGATTCACGTTCACAATATT
CGCTATGATGAAGAATTGGTTACATTTTCTCATTCGATGAGAGGTTCTGTGACCGCCTTA
TCTTTCAGCACAGATGGGCAGCCTCTTCTAGCTTCTGGTAGTTCATCTGGTGTTGTCAGC
ATATGGAATCTCGATAAAAGAAGGCTCCAGTCAGTCATAAGAGATGCTCATGACGGTTCT
ATAATTTCTCTCCACTTCTTTGCTAATGAGCCTGTGCTAATGAGTTCATCTGCAGATAACT
CAATTAAGATGTGGATTTTCGACACAAGTGATGGAGATCCTCGTCTATTACGCTTTCGAA
GTGGTCATAGTGCTCCTCCACTCTGCATAAGGTTCTATGCTAATGGTAGGCATATACTAT
CAGCTGGTCAGGATCGTGCCTTCCGGCTTTTCTCGGTTGTCCAGGATCAGCAAAGTCGA
GAGCTATCTCAACGTCATGTGTCCAAGCGAGCAAAAAAATTGAAGTTAAAGGAAGAAGA
GATAAAGCTGAAGCCTGTTATAGCATTTGATGTTGCTGAAATTAGGGAGCGGGACTGGT
GCAACGTAGTGACAAGTCATATGGATACCCCGCAGGCATACGTGTGGAGACTTCAAAAT
TTTGTCATAGGGGAGCATATTCTGAGACCATGCCCCAACAAGCCCACACCTGTTAAGGC
TTGCATGATTAGTGCATGTGGAAATTTTGCCATTTTGGGAACAGCAGGTGGTTGGATTGA
GCGATTCAACCTTCAATCAGGAATTAGTCGAGGAAGTTACATAGACCAGTTAGAAGGAAC
AAACAGTGCTCATGATGGTGAAGTGGTTGGAGTGGCTTGTGATGCCACAAATACTCTCAT
GATAAGTGCCGGTTATGCTGGGGACATCAAGGTCTGGGATTTCAAGGGACGTGAATTGA
AATCTAGATGGGAAATTGGTTCTTCTTTGGTAAAAATATCTTACCATCGTCTGAATGGTCT
TTTGGCTACGGTTGCAGATGATTTTATCATTCGCCTTTTTGATGCTGTCGCCCTAAGAAT
GGTCCGAAAGTTTGAAGGTCATACTGATCGAATTACAGATTTGTGCTTTAGTGAGGATGG
GAAATGGCTCTTGTCTTCCAGTATGGATGGCAGTCTAAGGATTTGGGATATTATTTTGGC
CAGACAAGTAGATGCTGTATTTGTTGACGTTTCCATCACAGCTTTATCTCTGTCACCAAAT
ATGGATATTTTAGCCACGACACATGTTGATCAAAATGGGGTCTTTCTCTGGGTTAACCAA
TCAATGTTTTCTGGAGATTCTGACATCAATTTGTATGCAAGTGGGAAAGAAGTTGTGACC
GTCAAGTTGCCATCAGTATCATCAGTGGAAGGTTCTCAAGTTGAAGAATCTAATGAGCCT
ACTATTAGACATTCAGAGTCTAAAGATGTCCCTTCCTTCCGGCCTTCGCTTGAGCAAATT
CCAGATCTAGTTACCCTCTCACTTTTGCCAAAGAGCCAGTGGCAGAGCTTGATTAATCTA
GACATTATAAAGGTTCGCAATAAGCCAGTTGAGCCCCCCAAGAAACCTGAGAAAGCTCC
TTTCTTTTTGCCTTCTATTCCATCTCTTTCTGGAGAAATACTATTTAAGCCAAGTGAGATGT
CTGATAAGGGAGACATGAAAGCTGATGAAGACAAATCTAAGATAACACCTGAAGTGCCAT
CATCACGGTTTCTGCAACTGCTTCATTCATGTTCAGAGGCGAAAAACTTTTCACCCTTTAC
CACCTACATCAAAGGGTTATCTCCCTCAACTTTGGATTTGGAACTTCGGATGCTTCAGAT
AATAGATGATGATGCTGTTGACGCTGATGCTGATGATCCACAAGATGTTGACAAGAGACA
AGAGTTGCTTTCCATTGAACTACTTATGGATTACTTTATACATGAAATATCATGCCGGAGT
AATTTTGAGTTTGTACAAGCTCTAGTCAGGTTGTTTTTGAAGATACATGGGGAGACAATTA
GACGCCAGTCAGTGTTGCAGAATAAAGCAAAGGTCCTTTTGGAGACTCAGTGCTCTGTA
TGGCAGAGAGTGGATAAGTTATTTCAAGGTGCTAGATGCATGGTTGCCTTTCTTAGCAAC
TCACAGTTTTAGCAAAAGGAAGATGTTATAGACGGAAAAAGTAGCTGGATCAACACTGAG
GCATGAGTTTAAATGTTGGATGGGTTCCTCTTAGAGCTATAAATTCGCTCATTTTGTAACT
GAGCAATCATCCTGTATCACTTCTTTCAGAAATTTTCAATGGAAGCCTTTTTTTAAAAAAAA
AA
SEQ ID NO:106
CCGTTTGCGCGTGCTTTCTCTCTCTCTCTCTCTCTCTCCTCTGCTCACTTCTTCGGGAAC
TTCGATTCCGGCCATGGAGGAGACGAAGGTGACGTGCGGGTCGTGGATCCGCCGCCC
GGAGAACGTTAACCTCGCCGTGCTGGGGAGGTCGCCGCGGCGCCGCGGCTCCGCCGC
GCTCGAGATCTTCGCCTTCGATCCCAAGTCCACTTCCCTCTCCTCATCCCCGCTGGTTG
CTCATGTGATCGAGGAAATTGAAGGCGATCCGTTGGCGATCGCGGTTCACCCCAACGGA
GAGGATATCGTGTGCTTCGCGAGCTCCGGTAGTTGCTTATCATTTGAATTATCTGGTCAA
GAATCAAATTTGAAGCTGTTGACCAAGGAGCTTCCTCCTCTTAGAGGGATTGGTCCTCAA
AAATGCATGGCCTTTAGTGTTGATGGATCTAGATTTGCCACTGGTGGAGTGGATGGTCG
TCTAAGAATCCTGGAATGGCCTAGTCTGCGCATCATTCTAGATGAACCAAAAGCACACAA
AAGCATTAGAGACCTGGATTTTAGTCTAGATTCAGAGTTTTTGGCTACGACATCTACTGAT
GGATCGGCTAGAATATGGAAGGCCGAAGATGGTTTGCCTTGCACTACTTTGACTCGCAG
ATCTGACGAGAAGATTGAATTGTGTCGGTTCTCTAAGGATGGAACTAAACCCTTTCTATT
CTGTACTGTTCAAAGAGGTGATAAAGCTGTTACTGGTGTTTGGGACATAAGTACATGGAA
CAAGATTGGGCACAAACGACTCCTCAGAAAGCCTGCTGTTGTAATGTCCATCAGTTTGGA
TGGAAAATATCTTGCTCAGGGAAGCAAAGATGGTGATATGTGCGTTGTTGAAGTGAAGAA
AATGGAGGTTAGCCATTGGAGCAAGAGGCTACACCTGGGAACCTCCCTTACATCACTAG
AATTCTGTCCCATTGAAAGGGTTGTGATTACTACTTCTGATGAGTGGGGTGTGTTAGTGA
CCAAGTTAAATGTTCCTGCCGATTGGAAAGCATGGCAAGTGTATCTACTACTTTTGGGTT
TGTTTTTGGCGTCTCTTGTTGCGTTCTACATCTTCTATGAAAACTCCGACTCATTCTGGGG
CTTCCCTCTTGGAAAGGATCAACCTGCAAGACCTAAGATCGGCAGTGTTCTGGGGGATC
CCAAGTCTGCTGATGATCAAAATATGTGGGGTGAATTTGGGCCATTGGATATGTGAGCAA
ACCGCCGGTTGTGTCGTGCTGCAAGTCTCCAGAAGGAGCTGCAGCAGCCAGTGGTCTG
AAGAACTCCATTTGATTATTGTTACACAAAGGCAATTTGTTTATAGGTCTTTTTGGAACAA
AATGTCCGCCTGGGCGTGAACAAGCGGCATGGGACTGACACCGTCGCTTTCATTAAAGA
AGTCTCATCAAAGTCGAACTAATACGAATTTTGATTTTAAAAAAAAAA
SEQ ID NO:107
CTCTCATGAACCTCACGAAGCGCGCGCACACCAGCAGCTCCGCGAGCACGAGCGGCGA
GCACCACGGCAAGAGCCAGAGCTCCGCATTTCGCAGCGCAAATGGCGGACCCCGTAGA
GCACCAGCACCAGCAGCACCAGCAGCACCAGCTTCAGCAGCAGCGGCGGCGCGGATG
GCGGATTCAGGGCGGGCAGTACCTCGGGGAAATCTCCGCTCTCTGCTTCCTCCACCTC
CCTCCTCCTCCCCTCTCCCTCTCCTCCTCGCCGGTACTCTCTCTCTCTTCTGGACTCGAC
TCGGAATCGCGAGACCGACCGGCGTGCTCCTTCCGTTTCCCGTCTGCAGGTTCCGGCT
CGCAGGTGTCGCTGTTCGACCTGGCGTCCGGGGCCATGGTGCGGACGTTCTACGTCTT
CCGGGGGATCCGCGTGCACGGGATCGTCCTCGGCTGCGCCGATTTCCCCGGCGGATC
GTCCTCGTCTTCTTCGACGCTCGATTACGTCATCGCCGTGTACGGTGAGAGGAGAGTGA
AGCTGTTCCGCCTGTCCGTGCGGCTCGGAAGGGGAGCTGGCGAGGGGAGCGGAACTG
TGCTGTCCGCGGATTTGGAGCTGGTTTCCGCGGCGCCGAGGTTGAGCCACTGGGTTAT
GGATGTTCGCTTCTTGAAGGAGAATGGAACTTCTGAGGATGAGCTGCAGAGGTGTCTTA
CCGTTGCTATAGGATGCAGTGACAACTCCATACGCCTTTGGGACGTCGATAAATGTAGC
TTCGTTCTTGCAGTTTCTTCCCCTGAGAGATGCCTTCTGTACTCCATGCGGTTGTGGGGT
GACAATCTTGAAGATCTCCAAGTTGCATCTGGAACAATTTACAATGAGATTTTGATCTGGA
AAGTGGTTCCCAATCATGATGCTCCATCTTCAAATGAGCTCACAGAAGAAGGCCTGACAA
ACTCTTGTGCTGGCAACAGCGTCCACGAATGTCTTCGCTATGAAGCCTATCACATCTGTA
GACTTGTTGGTCATGAAGGTTCAATATTTAGAATCGCATGGTCCTCTGATGGCTCCAAAC
TCGTCTCTGTATCTGATGATCGTAGTGCACGTATTTGGGAAGTTCATTGCAAGGTGCAAT
ATTCTGAAGATGCGGGGGAGGTTGGATTGCTTTTTGGACACAGTGCTCGAGTTTGGGAT
TGCTATATATCTGATAATTTGATTGTTACTGCTGGCGAGGATTGCTCATGTCGTGTGTGG
GGACTGGATGGACAACAGCATGATGTCATCAAAGAGCATATTGGAAGGGGTATATGGCG
GTGTCTCTATGATCCATGGTCTTCACTCCTTGTCACTGGTGGTTTTGACTCTGCAATTAAA
GTGCATAAACTGGATGCTTCTTTAGCTGAGGCTTCCGCAAAACAGTCCAACATAAAAGAC
TTGAGTGACGGAACTGAGCTATTTACTACACATCTTCCGAATTCATCAGGTCATAGCGGA
CACATGGACAGCAAAAGTGAGTATGTCCGATGCTTGAGCTTCTCATGTGAAGATGTGAT
GTATATTGCCACTAATCACGGTTATCTCTATCATGCTAATTATGCAATGATGGCGATCTA
AGGTGGACTGAACTTGCTCAAGTAAGTAATGAGGTGCAAATTATTTGTATGGAACTGTTG
CCCTCAAATCCATATGATCCTCGGATTGATGCTGATGATTGGGTTGCTGTTGGAGACGGT
AAAGGATGGACAACAGTTGTCAGAGTTGTGAAGAATTCTGATTCTCCTAAAGTGAGCACC
AGCTTCTCTTGGGCAGCTGAAATGGATCGACAGCTCTTGGGAATCCATTGGTGCAAATC
ACTAGGACATAGGTTCATTTTCACTGCTGACCCTAGAGGAGCCTTGAAACTTTGGAGATT
CTTCGAGGTTTCACAATCTAGTTCTCTTTATCCAGAAAACAGTCCACGGATTTCCTTGATT
GCAGAGTTCAAATCAGATTTGGGTGCTCGAATTTAGTGCTTGGATGTGGCATTTGAGAGC
GAGCTGCTGATCTGTGGGGATCTACGGGGTAATCTGGTTCTGTTCCCTTTATTGAAGGA
CCTGTTGCTGGATACCTTTGTCGTGTCAGCAGCTAAAATCTCTCCAGTAAACCATTTTAAA
GGAGCCCATGGCATCTCAGCAGTTTCCAGCATTTCAGTGGCTCACATGAGTTTCAATCAC
ATTGAATTACGTTCTACTGGAGCTGATGGATGCATATGCTACATGGAATATGACAAAGGT
CTGCAATCTTTAAATTTTGTAGGGATGAAGCAGGTGAAAGAATTAAGCATGATTGAATCT
GTTTCCACTGAAAATGAATCTACCGGTTACAGAACAAGTGGCAGTTATGCATCTGGTTTT
GCATCCACAGATTTTATAATATGGAACCTAGTAACTGAGGCTAAGGTCCTGCAAGTTTCA
TGTGGTGGTTGGCGGCGTCCACATTCTTATTATCTGGGTGATGTACCAGAGATGAAGAA
CTGCTTTGCTTATGTCAAGGATGATATTATTTACATCCGTAGACACTGGATAAAGGACTCA
AAGGACAAGATACTCCCTCAAAATCTACGTTTGCAGTTCCATGGAAGGGAGGTGCATTCT
TTATGCTTTGTCACTGGAGATTTCCAGCTCCGAAAAAATAAACAATCAAGTTGGATTGTGA
CTGGTTGTGAGGATGGGACAGTGAGGCTGACTAGGTATACTCAGTGCACTGACAATTGG
TCTTCATCCAAATTACTTGGAGAGCATGTTGGTGGATCAGCTGTCAGATCAATATGTTGC
GTTTCAAATATCCATACAACCTCATCAGGTACTAGCGTGTCTGATGTAAAGGGTATAGAG
AACCTCCCAAAGGATATAAAGGGAACACTTATGGAGGATGAATGTAATCCATCATTGCTT
ATTTCTGTTGGTGCAAAGCGTGTTCTGACTTCTTGGCTGCTGAGAAGAAGAAAACAGGAT
GGGAAAGAAGATGACGTTACTGATTTACAAGAAGCCGAAAATAGTTCATTGCCTTCATCA
GCTGGATCCTCTACATTTTCATTCCAATGGCTTTCCACGGACATGCCAGTTAAGTACTCA
GTGCCTTCCAAAAAATCAGGAAGCATTAAAAAGCTAATCGGTGTCTCTGACACCAATGTA
AGATGTAAATCACTTCTGCCTGATAGTGAGGCTCTGCAGTCAAAAGTATCTGCAGTTGAT
AAGAATGAAGATGACTGGAGATACCTAGCTGTCACTGCTTTTCTCGTTAGACATTCTGGT
TCTAGGTTAATTGTCTGTTTTATTATCGTTGCTTGTTCAGATGCTACACTTGCAATACGAG
CTCTTGTCTTACCCTATCGTCTATGGTTTGATGTTGCTCTGATGGTTCCTCTATCATCACC
AGTTCTGTCATTGCAGCATGTCATCATTGGAAGATGTCAATTACCAGATGAAAATGTGCA
GATTGGGAATGTATATGTCGTGATTAGCGGAGCCACTGATGGGAGCATTGCTTTCTGGG
ATCTGACTGAAAGTGTTGAAGCCTTTATGAGGCGGTTGTCAAATATCCATCTAGAAAAGT
TCATGGATTGTCAGAAACGGCCACGGACTGGGAGAGGAAGCCAGGGTGGAAGATGGTG
GAGATCCCTGAGCAAAATTGCTTGTAAGGAACAACCAATCAATGATCCTGTTACTGCGAA
AGCTATAAAGGAACTAAACAGGAAATTGACTGGCGGTGTTGCCTGCGGGTCTTCATCTT
CTATGCTAGATGCCTCCCCTGAGTTAGATAGCAATGCAGCTAATTCCTCATTTGAAATTAT
TGAAGTAAATCCTTTCCATGTTCTTAATGGTGTTCACCAATCTGGCGTGAATTGCCTTCAT
GTTTGCGAAACAAAACATGGCCAAAGTTCCGATGGTCGTTTTCTGTACCAGTTAGTCAGT
GGTGGTGATGATCAAGCACTTCATCTTCTTAAATTTGAGGTGTTGGTGCAGCCCCCAGTC
CAAGTTCCAGATGTTCCAAATTCAGACATCAGAAATTCTATACTCGTTGAGGAATTTCTTC
TTGATGAGCAGAACCAGAAAACCAAGTGTACAATTGAATTCATTTCTCAGGAAAAAATTG
CTTCTGCTCACAACTCTGCTGTAAAAGGCGTTTGGACAGACGGCACTTGGGTTTTCTCGA
CTGGTCTTGATCAGCGTGTCAGGTGCTGGATTAGCAAGGATCGTGGTACACCAACGGAG
CTTGCTCATTTTATCATTAGTGTGCCAGAGCCAGAAGCATTGGATGCAAGATCCATTTGC
TGGGACCAGTACCAAATAGCAGTAGCTGGAAGAGGGATGCAGATGATCGAATTTCATGT
GCCTTCTTCTGAGATTCGGTAACGACAATCAAAGGTTTTCCCTTTTTTTCTTTCTTTGCCC
ACTAAAAGTTCTTCAGGGCAAAGCCAAAGGTTTATCCCTCATTGGATTTGATATATAAACT
GAGAGTGTCTTGCACACCATTAAAATGGCCCATCAACAGTGAGTTGCATCTAAAAAAAAA
AAAAAAAAAAAAAA
SEQ ID NO:108
CTGATTTGTCCAATTTGAACATCACATCTTTGTATTCGTACTTGACACTCCTTCGACATGC
CATATAAACTTTCAGCAACGCTATCGAACCACTCATCTGATGTTCGTGCTGTCGCGTCGC
CTTCAGATGATTTGATCCTTTCTGCGTCGCGTGATTCTACTGCGATTTCATGGTTCAGAC
AATCGCCATCTTCCTTCACCCCCGCCTCCGTCATACGAGCAGGCTCCAGGTTCGTCAAT
GCGATAGCCTACCTGCCACCGACACCGAGAGCCCCTCAAGGGTATGCGGTAGTGGGCG
GTCAAGACACCGTCGTTAACGTCTTCGCTCTGGGTCCCGGCGACAAGGAAGAGCCCGA
ATACACACTCGTCGGCCATACCGACAACGTCTGTGCCCTCAGCGTGAACTCGGACGATA
CCATCATTTCTGGATCATGGGACAAGACAGCCAAAGTTTGGAAGGATTTCGCTTTGGTTT
ATGACTTGAAAGGTCACCAACAGTCTGTCTGGGCTGTGCTGGCTATGAACGAAAAAGAG
TTTCTGACCGCTTCTGCGGATAGAACCATCAAGTACTGGGTCCAGCACAAAACAATGCA
GACATATGAAGGGCACCGAGACGCTGTGCGGGGGTTGGCGCTAATTCCTGACATCGGT
TTCGCTTCGTGTTCCAATGACAGCGAAATTCGGGTGTGGACAATGGGAGGGGACGTGGT
GTATACGTTATCGGGACACACGTCTTTTGTATACAGTCTGTCTGTTCTGCCGAACGGCGA
TTTGGTCTCTGCAGGAGAAGATCGCTCTGTACGGGTGTGGCGTGACGGAGAGTGCTCT
CAGGTTATCGTTCATCCAGCCATTTCGGTGTGGGCGGTATCTACGATGCCGAACGGTGA
CATCATCAGTGGATCTAGTGATGGAGTAGTTCGGGTCTTCAGTGAATCTGAGAAGCGTT
GGGCTACCGCCAGTGAACTCAAGGCACTTGAAGACCAAATTGCAAGCCAATCCCTTCCG
TCACAGCAAGTTGGTGACGTCAAAAAGACCGATTTGCCAGGGCCTGAAGCCCTTTCCGT
TCCAGGGAAAAAGGCTGGGGAAGTGAAGATGATTCGGAGTGGAGACGTCGTCGAAGCA
CATCAGTGGGATAGCTTGGCCTCTAGCTGGCAGAAAATAGGCGAGGTTGTGGATGCAAT
TGGCTCAGGACGTAAAACAACTGCACGACGGCAAGGAGTATGACTATGTGTTTGATGTTG
ATATCCAAGAAGGTGCTCCTCCATTGAAACTACCTTATAATGTTTCCGAAAATCCCTATAC
CGCTGCTCAGCGTTTCCTCGAACAGAACGACTTGCCTACGGGCTATCTCGATCAAGTGG
TGAAGTTTATAGAACAAAACACTGCAGGAGTCAAGCTTGGCAATGATGGTTATGTTGACC
CATTTACAGGGGCATCTCGATACCAGCCAGCGACACAGTCCACTTCGAATACGGCGTCG
TCGTCTTACATGGATCCTTTTACGGGTGGATCCCGCCACATTGCAGAGTCAGCTCCTTCC
AATGTACCACAGGGCTCTCATGCAACTGGGATCATTCCGTTTTCTAAGCCTATATTCTTC
AAACTGGCCAATGTATCTGCAATGCAAGCCAAGATGTTCCAATTTGACGAAGTTCTTCGC
AATGAAATATCCACAGCGACTCTCGCGATGCGTCCTGATGAAGTGATCATGGTCAACGA
GACATTTACATATCTTTCTAAAGTGGTTACTTCCACATCCTCGGCACGGACTTCTCTCGG
ATGGATCCATATTGAAACAATCATGCAAATACTGGACAGGTGGCCTGTTCCTCAAAGATT
TCCGGTCATAGACCTTGGTCGTCTGGTAACGGCATACTGTATGAATGCTTTTTCTGGTCC
CGGCGACCTCGAAAAGTTCTTCAGTTGCCTATTCAGGACATCGGAGTGGACTTCCATCA
CCTCTGGAAGCAAGGCGCTGACCAAGGCGCAAGAAACAAATGTACTACTTCTATTCCGC
ACGATTGCGAACTCGTTAGACGGTGCACCTTTAAACGACATGGAGTGGATCAAGCAGAT
ATTTAGGGAATTGGCACAGACACCGCAATTGGTTCTCAACAAGTCCCATCGGTTGGCAC
TAGCCTCCGTTTTGTTCAACTTTTCGTGCATCGGCCTCAAAGGCCCTGTCCCTGCGGAC
GTGAGGACATTGCACCTGACTATAATTTTGCAGGTGTTGCGATCTCCAAACGATGACCCC
GAGGTTGCTTATCGGACCTGTGTTGCCCTGGGAAACATGCTCTACTCAGACAAAACGCG
AGGCACACCGCGAGACGCACAATCGCCATCACCAACTGAACTCAAGAGTGCCGTTGCC
GCAATCAAAGGGGGATTCTCGGATCCAAGGATAAACGATGTTCATAGAGAAATCATGTC
CCTCATCTGAGCGCGTTCAACCTGTTGCAAAGGTTTGGGGGCTGTACAGCGTGTATTTC
TTGTTACGATACTTGAGGGGTTAGAGGCACCTACGAATTAGGAATGTTGGATTGTCGCTG
TCAAGCGCTCGACAGCATCAAAAAAAAAA
SEQ ID NO:109
TTTTCCAAGTCAAAATCTCTCTCTCTCTCTCTCTCTCTCTCTGTGCAGCAAGGCACGGCA
GCTCTGGGGGCGGACGCCGGCGGAGCAAAGCGGCGACCAGGGGAGCGGCTGCAAGA
CGACGTCCACGATCTCCATCCTCTGTAAGAATTACTTGAAAGATGCCTCCACAGAAGATT
GAAAGTGGACACAAGGACACGGTCCATGATCTGGCGATGGACTACTATGGTAAGAGACT
GGCGACAGCGTCGTCGGACCACACCATTAACGTTGTTGGAGTCAGCAGCTCGGGATCT
CAGCATCTTGCCACGCTGATCGGCCATCAAGGACCCGTTTGGCAGATATCTTGGGCTCA
TCCGAAATTCGGGTCTTTGCTTGCTTCGTGCTCGTATGACGGACGGGTTATCATATGGA
GAGAAGGTAATCCAAATGAGTGGACGCAGGCACAAGTATTTGAAGAGCACAAATCATCG
GTCAATTCCGTTGCTTGGGCTCCTCATGAGCTCGGCCTTTGTCTGGCTTGCGGTTCATC
CGATGGGAATATTTCGGTGTTCACTGCTAGACAGGACGGCGGTTGGGATACTTCAAGGA
TCGATCAGGCTCATCCGGTCGGGGTCACCTCCGTGTCGTGGGCCCCGTCGACTGCCCC
TGGTGCACTTGTTGGTTCTGGCATGATGGAACCTGTTCAGAAGCTCTGCTCAGGTGGTT
GTGATAACACCGTGAAGGTGTGGAAGCTCTACAACAGGGTCTGGAAGTTGGACTGCTTT
CCGGTGCTTCAGATGCACACTGACTGGGTGAGGGACGTTGCCTGGGCACCCAACTTGG
GCCTTCCGAAATCGACCATCGCAAGCGCATCGCAGGACGGGAGGGTCATCATATGGAC
CTTGGCCAAGGAAGGGGATCAATGGCAGGGGAAGGTTTTGTACGATTTTAGGACTCCGG
TTTGGAGGGTCTCGTGGTCCCTGACCGGTAACATTTTGGCGGTGGCAGACGGGAATAAC
AATGTCTCCCTTTGGAACGAAGCCGTTGATGGTGAGTGGATCCAGGTTTCAACAGTCGA
GCCATAGGATTCAGCTTGTGGTTGTCCTGTCGAACTGCTTTTAGCTGCGCAATGTGCATA
GAGAACGATTTGCCAACATAGTAGAACCCGCGAACATGCTTGGGTGTACTTCATAGATG
AATTTATGTGCTTTTTGGGCCGGTATAGATGCTTTAAATTGTTCTTGCTTTGCGGCTAGAT
ATCCTTATGAATGAAGTTTGGATGATAAGTGGCGCCAGACTTTCTACTCACCCTTTTTTGT
CAGCCATGTTTTGAGCGATGCGGTGACCAGTTTGGCCTGAATAGTTGGTCTTAATGGCTT
GTGGATTTGCACCGGCTGTTTACCATTGAAACTTGTATGACCTATTGCTGATGAACCTCT
TAATCTGCTCAGAAGGGTGATCGTTTAGAAAAAAAAAA
SEQ ID NO:110
GCTTCCATGGCGCACCCCGAATTCTACTGATCGAATTCGCCTTCTCTTCTCACTGCAAAA
CCCTAAAACGCACGCCCTCTCCCTCTCCCTCTCCCTCTCCCTCTCACAGCTGCCAAAAT
GTCTGCTCCTATGCTGGAAATCGAGGCCCGCGACGTGGTCAAGATCGTGCTCCAGTTCT
GCAAGGAGAACTCGCTGCACCAGACGTTCCAGACCTTGCAGAGCGAGTGCCAGGTCTC
CCTCAACACCGTCGACAGCATCGAGACGTTCGTCGCCGACATCAACAGCGGCAGGTGG
GACGCCATACTGCCTCAGGTCGCCCAGCTTAAGCTCCCCAGGAATACGTTGGAGGATCT
GTATGAGCAGATTGTATTGGAAATGATTGAACTTCGTGAGTTGGATACTGCTCGTGCAAT
TCTAAGGCAAACTCAGGCAATGGGTGTCATGAAGCAGGAGCAACCTGAAAGATACTTGC
GACTTGAGCATCTCCTTGTTAGGAACATATTTTGATCCCAATGAGGCTTACCAAGACTCCA
CAAAAGAAAAACGACGTGCACAAATTGCCCAAGCTCTTGCTGCGGAAGTTACTGTAGTG
CCACCTTCAAGATTGATGGCCCTAGTGGGGCAGGCGCTCAAGTGGCAGCAGCACCAAG
GATTACTACCTCCGGGAACCCAATTTGATCTATTTCGAGGAACTGCTGCTATGAAGCAAG
ATGTGGATGATATGTACCCTACAACTCTTTCACACACCATTAAGTTTGGAACCAAAAGTCA
TGCAGAGTGCGCTAGGTTCTCGCCAGATGGACAGTTTCTTGTTTCTTGCTCTGTCGATG
GATTCATTGAGGTCTGGGATTACATGAGTGGGAAGCTCAAAAAGGATCTTCAGTATCAG
GCTGATGAGACCTTCATGATGCATGATGATCCCGTTCTTTGTGTTGATTTTAGTAGAGAC
TCGGAGATGCTTGCTTCTGGGTCGCAAGATGGCAAGATCAAAGTTTGGCGAATAAGAAC
AGGTCAATGCTTGCGTCGTCTTGAACGTGCACATTCTCAGGGTGTCACAAGTGTCCTCTT
TTCCCGTGATGGCAGTCAGTTACTCAGTACCTCCTTTGATGGCTCAGCCAGAATCCATG
GCCTTAAATCTGGGAAGCAGCTGAAAGAGTTTCGAGGCCACTCATCTTATGTGAATGATG
CAATATTCAGCAATGATGGCAGTCGTGTTATTACCGCCTCAAGTGATTGTACTGTAAAGG
TCTGGGATGTTAAGACTTCAGACTGTCTTCAAACATTTAAGCCTCCACCTCCATTGAGGG
GAGGTGATGCTTCTGTTAATTCTGTTCATCTCTTCCCAAAGAATGCTGATCACATCGTTGT
TTGCAATAAGACGTCATCAATTTATATCATGACTCTACAGGGACAGGTGGTGAAGAGTCT
TTCATCTGGTAAAAGAGAGGGTGGAGATTTTGTGGCAGCCTGTGTATCACCAAAAGGTG
AATGGATTTACTGTGTCGGTGAAGACAGGAATTTGTACTGTTTCAGCTGCCAGTCTGGGA
AATTAGAGCATCTTATGAAGGTTCATGAAAAGGATGTAATCGGCGTAACACACCATCCCC
ACCGTAATCTCGTTGCAACATACAGCGAAGATAGCACGATGAAACTATGGAAGCCTTGA
CGTATTATTTCTCATCTTGCGACTTTGCAATTGTTAGACTGTTGTAGATGTAATTGTCATTT
CATTTATTGACCGGTGAAAGATTCCGAGTGTATTTTAGGGAAAGAGCAGTTATAATCTCC
TCATTGGCCACCAAAAAAAAAA
SEQ ID NO:111
GAATCCCCGAAATCGCACTGGAAAATTCCACTGCGCTGCTTCTCCGAATCGCCCCCATG
GATCTCCTGCAATCTTACGCGGAGGATAACGACGGCGATCTCGGCCGTCACTCCTCGCC
GGAACCCTCGCCGCCCCGCCTCCTCCCCTCCAAATCCGCCGCCCCGAAGGTTGACGAC
ACCACGCTCGCCCTCACCGTCGCGCAGACCAACCAAACCCTAGCCCGCCCCATCGACC
CGTCCCAGCACGCCGTCGCTTTCAACCCCACCTACGACCAGCTCTGGGCCCCAATCTG
CGGCCCCGCCCACCCCTACGCCAAGGACGGCATCGCCCAGGGCATGCGCAACCACAA
GCTCGGCTTCGTCGAGGACGCCGCCATCGGCTCCTTCCTCTTCGACGAGCAGTACAAC
ACGTTCCAGAGGTACGGGTACGCCGCCGACCCCTGCGCCTCCACGGGGAACGAGTAC
GTCGGCGACCTCGACGCGCTCAAGCAGAACGACGGCATCTCGGTGTACAATATCCGCC
AGCAGGAGCAGAAGAAGTATGCTGAAGAGTACGGAAGAAGAAGGGCGAGGAGAGGG
GCGAGGGCGGCAGGGAGAAGGCGGAGGTGGTTTCCGATAAGAGCACATTCCATGGTAA
AGAGGAGAGGGATTATCAGGGCAGGTCGTGGATCGCGCCGCCCAAGGACGCCAAGGC
AACGAATGATCATTGTTATATACCCAAGAGATTGGTGCACACCTGGAGTGGGCACACCA
AGGGCGTGTCTGCCATTAGGTTCTTCCCCAAGCATGGTCACTTGATCCTCTCAGCTGGG
ATGGATACTAAGGTGAAGATCTGGGACGTGTTCAATTCGGGTAAGTGTATGAGGACTTA
CATGGGTCATTCGAAAGCTGTTAGGGACATATCGTTTTGTAATGACGGGACTAAGTTCTT
GACTGCCGGTTATGATAAGAACATCAAATATTGGGATACTGAAACCGGGAAGGTTATCTC
TACTTTTTCCACTGGGAAAATTCCTTATGTGGTTAAGTTGCATCCGGACGATGAAAAGCA
GAATATTTTGTTGGCGGGAATGAGCGATAAAAAGATTGTTCAGTGGGATATGAATACGGG
GCAGATTACACAGGAATATGATCAGCATTTGGGTGCGGTGAATACGATTACCTTTGTTGA
TGATAACCGGAGGTTTGTGACGTCAAGCGATGATAAGTCCCTTCGTGTTTGGGAATTTG
GGATCCCTGTGGTTATTAAGTATATAAGTGAGCCTCATATGCATTCTATGCCCTCGATCT
CGCTGCATCCAAATACGAATTGGCTTGCAGCACAGAGTCTGGACAATCAGATTCTTATTT
ATAGCACTAGGGAAAGATTTCAGCTCAATAAGAAGAAGAGGTTTGCTGGCCACATTGTG
GCTGGCTACGCCTGTCAAGTTAACTTCTCACCAGATGGTCGTTTTGTTATGTCTGGAGAT
GGAGAGGGTAGATGCTGGTTCTGGGACTGGAAGAGTTGCAAAGTCTTCAGAACTCTGAA
ATGTCATGAGGGGGTATGTATTGGGTGTGAGTGGCATCCACTGGAACAAAGTAAAGTAG
CAACCTGTGGCTGGGATGGCTTAATCAAGTATTGGGACTGATCTTCTTGCACCTTTCATG
CAAAGGCGTTTGATACTTCTTTGCTTCTCACATTGTGGACACTGAACAGCCTGTGGAACA
CAAGGCATACTTTCCGAGGTCAATCAGTTGGAATCTGCTCAGACTAGGGAATCTGCATG
ACAGTGGTGTACTTACAAACTCCTTTTGTAACTAGATGCAAAATCTTCACCAGCCTGTTGT
ACTTCAGTTTATTTTGCTAGCAGAACATTCACCACCAATTTTCCCAGTGTACTGCGAGAGT
GATGCTACATAAGTTTACTCTTGTGTCTAACTTTTCCAATTAGATCCAGATTGAGTTGACA
TAAAAAAAAAA
SEQ ID NO:112
GGGGCAAACACCATGGAAAGATTTTGAACCTGCTAGAAAATGGCGGCAGTCATTCCGCA
GGCAATTAGGGTTTCAGCATACCTCCGGAAACGTCCGCAAGAATCAATCATCTCTCCCTC
GCCATGCGAACGGAGCTCGATTATAGGGTGTGATCGTCCCAATTTCGAAAGCAAAAGAC
TTGTCTTTAATCGAGAAATTGGGGCTTTTTAGCTGGCGGGTGGAAGTTAGGGTTTTGATT
CGGTGCAGAAGATGGAAAGCAACGGCAATTTGGAGCAGACTTTGCAAGATGGGAGGAT
ATACAGGCAGCTCAATTCGCTCATCGTCGCTCATCTTCGCGACCACAACTTCCCGCAGG
CGGCAAGTGCGGTTGCTCTAGCAACAATGACGCCCTTGAATGTTGAAGCCCCAAGAAAT
AGGCTTCTTGAGCTGGTTGCCAAGGGTCTTGCAGTGGAAAAGGGTGAACTATTGAGAGG
TGTTTCTCATGCTGGGACAAATGATCTGGGTGGATCAATACCTGCTTCATATGGCTTGGT
TCCAGCTCCGTGGACCGCTATTGATTTCAGTTCTCTGCGAGACACAAAGGGCATGTCCA
AGAGTTTCACTAAACATGAGACCAGGCATCTTTCAGACCACAAGAATGTTGCCAGATGTG
CAAGGTTTAGCACTGATGGAAGGTTTTTTGCAACAGGAAGTGCAGACACTTCAATTAAGC
TCTTTGAGGTCTCAAAAATAAAGCAAATGATGCTACCAGATTCTACAGATGGCGCTATAC
GAGCTGTTATACGGACATTTTATGACCATACACATCCTGTAAATGACTTGGATTTTCATCC
TCAAAATACTGTCCTGATATCTGCAGCCAAAGACCATACAGTAAAGTTCTTTGATTATTCA
AAGGCTACAGCAAAGAGAGCATTCAGAGTTATTCAGGATACTCACAATGTACGTTCAGTT
GCTTTCCATCCTTCTGGCGACTTTCTTTTGGCTGGGACTGATCATCCAATTCCACACTTG
TATGATGTCAACACATTCCAATGCTATCTTTCTGCAAATGTCCCAGAATTTGCAGTTAATG
CAGCGATAAACCAGGTAAGATATTCATCTAGTGGCGGCATGTATGTTACAGCATCTAAAG
ATGGTACTATACGATTTTGGGATGGGGCATCGGCCAACTGCGTTCGCTCCATTGCTGGT
GCACATGGGGCAGCTGAAGTAACTAGTGCCAACTTCACGAAGGATCAGAGATATGTACT
CTCTTGTGGGAAGGACTCTACTGTGAAACTCTGGGAAGTTGGCACAGGAAGATTGGTTA
AACAATATCTTGGAGCCACTCACATGCAGTTGCGATGCCAGGCTGTCTTCAACAATACAG
AAGAGTTTGTTCTATCCATTGATGAACCGAGCAATGAGATAGTGGTCTGGGATGCCATGA
CAGCAGAAAAAGTGGCAAGATGGCCATCCAACCATAATGGCCCTCCTCGTTGGATTGAG
CACTCACCCACAGAAGCAGCTTTTGTATCCTGCGGAACTGACAGATCGATCCGATTCTG
GAAGGAAACTCACTAGGGTGATCTATGGAGATGATCGTGATATTTGCAGCCTCGGACAT
CTTAGAAGTGATCACAATTTGATTCAGAAGGATGATGGCCCCATACACTGCGCCTTCCAC
AAATTCTTTCCGTCTTTGTAAGGAACTTGTGAGGTATTCGAGGGGACATTGTCAATAACTT
ATATCCTACCTCTAATGGAATT
SEQ ID NO:113
ATTTCCGGTTTCCAGATCGATCGCCCAGCCCCGTCGCACCGCCCCCCGCCCTCTCTCTC
TCTCCAATCCGCGCCGAGGTGAATCGCGTTCATTGTTGTGATAAAGATGTCCAATTTCCA
AGGGGAGGATGGTGAGTATGTAGCAGATGACTTCGAAGCAGAAGATGGCGATGAAGAG
CTCCATGGCAGAGAATCGGCAGACCCAGAATCTGATGTTGATGAAATAGACACTCCAAG
TAATAGATTTACCGATACTACTGCTGATCAAGCAAGAAGAGGAAGAGATATTCAGGGAAT
TCCTTGGGAAAGGCTTAGCATCACCAGGGAGAAGTACCGGCGGACTAGGCTAGAACAAT
ATAAGAATTATGAAAATGTTCCTCAGTCTGGAGAGAAATCGGGGAAGGATTGCACAGTTA
CAGAGAAAGGTAACTCATTTTATGAGTTTAGACGGAACTCAAGATCTGTCAAATCTACTAT
TCTTCATTTCCAGTTGAGGAATTTGGTTTGGGCAACATCAAAGCACGATGTTTACTTGAT
GTCAAACTATTCTGTCGTCCATTGGTCTTCATTGACGGGCAAGAAGTCTGAAGTTCTTAA
TCTTGCAGGACATGTGGCACCAAATGAGAAACATCCTGGAAGCTTGTTGGAAGGATTTA
CGCAGACTCAAGTCAGTACTCTAGCAGTAAAGGATAGATTTCTAGTTGCAGGCGGGTTT
CAGGGAGAGCTCATATGCAAGTTTTTAGATCGACCTGGAATTAGCTTCTGTTCCAGGACA
ACCTATGATGATAATGCCATAACAAATGCGGTTGAGATATACGTCAGCCCCAGTGGTGG
AATTCACTTTATAGCATCGAACAATGACTGTGGAGTCAGAGACTTTGATATGGAAAACTTT
GAGCTGTCTAAACATTTTCGTTTCCCTTGGCCTGTGAATCATACTTCTTTGAGTCCAGATG
GGAAGCTTCTTGTCATAGTTGGCGATGATCCTGAGGGTATTTTGGTTGATGCAAAAACTG
GAAAAACAATCATGCCGTTGCGCGGGCATTTGGATTTCTCCTTTGCATCGGAGTGGCAC
CCAGATGGTGTCACTTTTGCTACGGGAAACCAGGACAAAACTTGTAGGATTTGGGACAT
ACGAAACTTATCTAAGTCAATCGCTGTTTTGAAAGGCAACCTTGGAGCCATACGGTCTAT
CCGCTATACATCCGATGGCCGGTATATGGCGATAGCAGAACCTGCAGATTTTGTCCATG
TCTACGACACGAAAACGGGGTACAAGAAAGAGCAGGAGATAGACTTCTTCGGCGAGATA
TCCGGCATGTCGTTCAGTCCCGACACAGAATCGCTCTTCATTGGAGTGTGGGACCGAAC
ATATGGTAGCCTCCTCGAGTATGGCCGGCGACGGAACTTCTCGTACCTCGACTGCCTCG
TCTGAATAGAATGTGCGGTCACTGTTAATAATTTTCTTTCGACTGTGCAGCTTTTTGGGTG
ATAAATGTGTGAGTACGTGGACAGAGCATGGTGCACAGGGGAGTTCCATTTTGTGTGGG
TGAAATGTTTGTATATATGGGTCGAGTAGAATCTTGATGTCTTATATGATTTTAGTTGGCA
CAG
SEQ ID NO:114
GCTACTGCACTGCGCTCTCGGCGCCCTCCAAACGACCGAGGAAAGCTTCAGCTGAAAAA
AGCTTCGGCTACAATCCCTTCAGGTTCGGCTTGATATATATGGTTTTCTGCTAGCGACGC
GCGCCTGGAGTGTTGATTCGAGTAGGACCCGTATCGCGTCGATTGCCTAGCGTTCCGTC
AATTCGCCGTCATGGGTGTTGAAGAGGATCTCGAAGATTTGAATGCCCTCGCGGAGTCT
ACCGATGCCGCCGTCGATGGCCAGGCGGCTCTGGCTTCCGCCGTCGATAGCGTTACCC
TTCAGCCGGCCCCGCCAATACTTCCCCCCGTAATCCCTCCTCCCGCCGTGCCTGTGGTT
GCTCCCGTTCCGACAATTCCTCCGGTCCTTCGACCTCTGGCGCCTCTACCCATCCGCCC
TCCCGTTTTGAGGCCTCCCGCACCCAAAAGAGATGAGGCTGGAAGTAGCGACTCGGATT
CAGACCATGATGGTACCGCTGCCGGGTCGACTGCAGAGTATGAGATTACGGAAGAGAG
TAGGCTGGTGCGGGAGCGGCATGAAAAGGCGATGCAGGATCTGATGATGAAGCGGCGC
GGTGCTGCTCTGGCGGTGCCCACTAATGATAAGGCTGTGCGTGCCCGTCTTCGTCGGC
TCGGAGAACCTATGACCCTCTTTGGGGAAAGAGAGATGGAGAGGCGGGACCGGCTGCG
GATGCTTATGGCTAAATTAGATGCGGAAGGACAGCTCGAGAAGCTCATGAAAGCCCACG
AGGATGAGGAGGCTGCTGCTTCCGCTGCACCGGAGGATGTCGAGGAAGAGATGCTTCA
GTATCCATTCTATACTGAAGGATCGAAAGCTCTTTTCAATGCTAGAATAGATATTGCAAAA
TTTTCGATCACAAGGGCTGCACTCCGTCTTGAACGTGCAAGAAGGAGAAGGGATGACCC
TGATGAGGATGTGGATGCAGAGATAGATTGGGCATTGAAAAAGGCAGAGAGCTTGTCCT
TGCATTGCAGTGAGATTGGCGATGATCGACCACTTTCAGGCTGCTCTTTCTCCCACGAT
GGAAAGTTGCTTGCCACATGTTCCATGAGTGGAGTTGCTAAGTTATGGGACACATGTCG
CATGCCTCAAGTGAATAGAGTGTTGACATTGAAGGGTCACACAGAACGTGCTACTGATG
TGGCCTTTTCTCCGGTGCAAAATCATATAGCAACTGCTTCTGCTGACCGGACTGCAAAGT
TATGGAATACTGAAGGAACTATCTTGAAGACTTTTGAGGGGCATCTGGATCGCCTTGGTC
GTATTGCATTTCATCCATCAGGGAAGTACCTGGGTACGACTAGCTTTGACAAGACATGGA
GGTTGTGGGATATAGAAAGTGGTGAGGAACTACTTCTTCAAGAAGGCCATAGCAGAAGT
ATCTATGGGATAGACTTCCACCGGGATGGATCCTTAGTGGCATCTTGCGGACTGGATGC
TCTTGCACGTGTTTGGGACCTCCGCACCGGTAGAAGCATCCTTGCTTTGGAAGGACATG
TTAAGCCGGTTCTGGGAGTCAGCTTTTCACCCAATGGATATCATTTAGCCACCGGTGGT
GAAGATAATACCTGTCGTATTTGGGATTTAAGAAAGAAAAAGTCATTGTATACTATTCCAG
CTCATGCAAATCTTATATCAGAAGTGAAATTTGAGCCTCAAGAAGGATATTTTCTGGTGAC
TGCATCATATGATACTACGGCAAAGGTCTGGTCGGCCCGGGACTTTAAGCCAGTGAAAA
CTTTGTCAGTGCATGAAGCCAAAATAACATCAGTGGATATCACTGCGGATGCAAGTCATA
TAGTTACAGTGTCGCATGACAGAACAATTAAGCTCTGGACTAGTAACGACGACGTGAAG
GAACAGGCCATGGATGTTGACTGAATTCAATTTTTGTTAGCATGTGCTGTTTTAGCGGGA
GCACGGAAGTTGATCCTGTCAAGCGTTTGTAAACATTTTGGCAGATCTTTCCCCGTTTAA
AAGCGTTTCTAGCTTGCTTGTACTATAGTTTTTCAATTGGCTGCCCGTAAGGTATTCAAAA
GTACCAAACCATCCTAGCAGTAGTTTTTGCCCTTGACATGTTTTGGCCTGTGATCCAAGG
AAAGTTCTGATGTGGAGTGTCTTACGTCTTACGAACAAGGAGAGGACACTTTATGAGGTA
TCGCTTCCTCATTTTTCATATCTATCGTTATTGCCATTAGAATCAAAGCCCATTTGCTTGA
AAAAAAAAA
SEQ ID NO:115
ATAGTTTCAACAGCCTGCGAAGCTCTTCCCCCCCTGAACCAGACACAGTCTCTCTCTCTC
TCTCTCTCTCTTCTCTCGCGCCACGGCAGCTGCCAAATGCGATGGTGAAGGCTTACCTG
AGGTACGAACCGGCGGCGGCCTTCGGGGTGATCGCGTCGGTGGAGTCCAACATCGCG
TACGACGCCTCCGGCAAGCACCTCCTCGCCCCGGCGCTCGAGAAGGTCGGCGTCTGG
CACGTGCGGCAGGGCGTCTGCACCAAAGCCCTAGCCCCTTCCGCCTCCTCCGCCGCCG
GACCCTCCCTCGCCGTCACGGCCATCGCCTCCTCTCCTTCGTCTCTGATTGCGAGTGGA
TATGCTGATGGTAGCATACGGATATGGGATTTTGAGAAAGGTTCTTGCGAGACGACACT
GAATGGCCATAAAGGGGCTGTCTCTGTACTCAGATATGGCAAGCTTGGATCTTTACTCG
CATCTGGAAGCAAGGACAACGACATTATATTGTGGGATGTAGTCGGAGAGACTGGTCTT
TACCGCTTACGAGGGCATCGCGATCAGGTTACCGACCTTGTCTTTCTAGATTCTGACAAG
AAACTCGTTAGTTCCTCCAAAGACAAGTATCTCAGAGTGTGGGATCTTGAAACACAGCAC
TGTATGCAGATTGTTGGTGGTCATCACAGTGAAATCTGGTCCCTGGACACTGATCCAGAA
GAGAGATATCTCGTCACAGGGTCTGCAGATCCAGAACTTCGATTTTACACTGTTAAGAAT
GATTCATCTGATGAACGATCTGAAGCAGATGCAAGTGGGGGTGTGGGCAATGGTGACTT
AGCTTCTCATAACAAATGGGATGTACTAAAACAATTTGGCGAAATTCAGCGACAAAGCAA
GGATAGAGTTGCAACAGTGAGATTCAACAAGAATGGGAATTTGCTGGCTTGTCAAGCGG
CAGGTAAACTTGTAGAGGTGTTCCGTGTACTAGATGAAGCTGAAGCAAAGCGTAAAGCA
AAACGCAGGCTTCATAGGAAGAGGGAGAAGAAAGGGGCAGATGTGAATGAAAATGGAG
ATTCCAGTCGTGGTATTGGAGAAGGACACGACACCATGGTGACGGTTGCTGATGTTTTT
AAGCTCCTCCAGACTATTCGAGCTAGCAAAAAGATTTGTTCTATTTCTTTCTGCCCTGTAG
CTCCTAAGAGTTCGTTGGCCACACTGGCATTGTCATTAAACAATAATCTTTTGGAATTTCA
CTCTATTGAAGCTGATAAAACTAGTAAAATGCTAACTATTGAACTACAAGGACACCGCTCT
GATGTCAGAAGCGTCACCCTTAGTTCTGATAATACCCTTCTTATGTCTACCAGTCACAAC
TCGGTAAAGATTTGGAACCCGAGTACAGGTTCCTGCTTGCGAACAATTGACTCTGGGTAT
GGGCTTTGTGGTTTAATTGTTCCTCAGAATAAGCACGCACTTATTGGAACAAAAGATGGA
GCCATAGAAATATTTGATGTTGGAAGTGGCACTTGTATTGAAGTGGTAGAAGCTCATGGA
GGCTCTATCCGGTCAATTGTGGCTATACCAAACCAAAATGGTTTTGTCACTGGCAGTGCA
GATCACGACATTAAATTCTGGGAGTATGGTATGAAGCAGAAACCTGGTGATAATTCCAAG
CACCTAACTGTGTCAAATGTCAGAACCCTGAAGATGAATGATGATGTTCTTGTGGTTGCT
GTGAGCCCAGACGCTCAAAAAATTGCTGTTGCACTATTAGACTGCACAGTGAAGGTTTTC
TTCATGGATTCTCTGAAGCTTATGCATTCCTTATATGGGCACAGGCTTCCTGTGCTATGC
TTGGATATCTCGTCAGATGGAGATCTAATTGTAACTGGCTCTGCAGATAAAAATCTGATG
ATATGGGGATTGGATTTTGGTGACCGCCATAAATCCATTTTTGCACATGGAGATAGTATC
ATGGCAGTGCAGTTTGTGGGCAACACACATTATATGTTTAGTGTAGGTAAAGATCGGCTT
GTAAAATACTGGGATGCTGACAAATTCGAGCTTCTTTTGACGCTTGAGGGACATCATGCT
GATATTTGGTGCCTTGCAATTAGCAACCGTGGTGATTTTTTGGTCACTGGATCTCATGAT
CGTTCAATACGCCGTTGGGATCGTACGGAAGAACCATTCTTCATTGAGGAAGAAAAGGA
GAAGAGGTTGGAGGAGATGTTTGAGTCTGACCTTGATAATGCATTTGGGAACAAGTATGT
ACCCAAGGAAGAAATTCCAGAAGAGGGTGCTGTGGCCTTAGCAGGGAAGAAAACTCAAG
AGACACTTTCAGCAACTGATTCAATTATTGAGGCGTTGGATATAGCAGAAGTGGAGTTGA
AACGAATTGCCGAACATGAGGAGGAGAAAAACAATGGAAAGACTGCAGAATTCCACCCA
AATTACGTGATGCTGGGGCTTTCTCCCTCTGACTTTATTCTTCGTGCTCTTTCGAACGTTC
AAACTAATGACCTTGAGCAGACATTACTGGCTTTACCTTTCTCAGATGCTTTGAAGCTCCT
ATCTTACCTGAAGGATTGGACAACATACCCTGATAAGGTTGAGCTTGTTTCAAGGATTGC
TACAGTGCTTTTACAGACACATTACAATCAATTAGTTTCAACCCCGGCTGCCAGGCCTTT
GTTGACTACACTAAAGGACATTCTTCACAAGAAAGTCAAGGAATGTAAGGACACGATTGG
ATTTAATCTTGCAGCAATGGACCATCTTAAGCAATTGATGGCCTTAAGATCAGATGCACT
CTTTCAAGATGCCAAAGTGAAGTTGCTGGAAATTCGCTCACAACTTTCCAAACGACTGGA
AGAAAGGACGGACCCGAGAGAAGCAAAGCGCAGGAAGAAGAAACAGAAGAAATCCACT
AACATGCATGCCTGGCCATGAGGTCTTGGTAATGAGAATTCGCTAGTTGAAGAATTCGG
GAATTTTTTGCCACATCGTAACCATCATAGCACTTATCATCTAATTATGGTGAAAGGGAGT
TATATATATGTCAGTTTTGGCGGATGTTGAGTGTATAGGATGTGAAATAGCAAATGATAAT
CTCTCTTCTATCTTTTGGGCAATTGAACTTTTCATTCCCATAAAAAAAAAA
SEQ ID NO:116
GCCCCCGGACGTCTCCAGACCTCTCGCTGCGTCTGCACAGCATCTCCGTCGGAAGTTC
CCGTGCGACAATCGACATGGGCGGCGTGCAAGCCGAGAGAGAAGACAAGGACAAGGTC
TCTCTGGAGCTCACCGAGGAGATCCTCCAGAGCATGGAAGTCGGCATGACGTTCAGAG
ACTATAGTGGTAGAATCAGTTCCATGGATTTTCACCGGGCCTCGAGCTATCTAGTGACAG
CCAGCGATGATGAGTCTATTCGCCTTTATGATGTGGCAAGCGCAACATGTCTGAAGACA
ATTAATAGCAAAAAGTATGGGGTTGATTTGGTCTCATTCACTTCTCATCCGATGACTGTTA
TATACTCCTCAAAGAATGGTTGGGATGAGTCACTGAGGCTGTTATCCTTGCATGACAACA
AGTACCTGCGCTACTTTAAAGGTCACCATGACAGGGTTGTCTCCCTTAGCTTGTGCCCAC
GCAATGAATGCTTTATCTCTGGTTCTCTGGATCGCACTGTTTTACTCTGGGATCAACGAG
CTGAGAAGTGTCAGGGTCTTTTACGTGTACAAGGAAGGCCTGCCACAGCTTATGATGAT
CCGGGCCTGGTGTTTGCAATTGCTTTTGGAGGATGTGTTAGGATGTTTGATGCTCGCAA
ATATGAAAAAGGTCCTTTCGAAATCTTTTCTGTTGGGGGAGATGTGTCGGATGCAAACGT
TGTCAAGTTTAGCAATGACGGAAGGCTTATGCTTTTGACTACCACAGATGGGCATATACA
TGTTCTCGACTCATTCCGAGGCACACTGTTATACACTTTCAATGTCAAGCCAACATCAAG
CAAGTCCACCTTGGAGGCATCTTTTAGCCCTGAAGGAATGTTTGTCATTTCTGGTTCTGG
AGATGGCAGTGTCTATGCGTGGAGCGTAAGGGGCGGGAAAGAGGTCGCAAGTTGGTTA
AGTACTGACACGGAGCCTCCTGTCATAAAATGGGCCCCAGGAAACCTCATGTTCGCAAC
AGGATCGTCGGAATTATCATTCTGGATTCCTGATCTCTCTAAATTGGGAGCTTATGTTGG
AAGAAAGTAGAAGGGTCGGGATAGTCATTGCTTAGAGGCAACGCAGTTGCAAGTCTACC
ATTCCCACCAATTATAATTTGGAAAGAGTTTAAGATGCTGACTTATCCAGGAAAAGGTTGT
TTATACTTATAAACAACAGAGAGACAACTGTACAGGTGTTGTAAACACTCCCAGTGTGAG
GGTAATTTTGGAAGTTGTGCCTTAAAAAAAAAA
SEQ ID NO:117
ACTTGAAAAATCTCTACTTTTTTTCCCTCTTCTGAGAGAGAAGTTCTCCGCCTCGAAGCC
GCGACGGCGCCCGCCCTCCTCCTCGCTCCGCCGCTGCAATGGCCGCTTTCGGAGCAG
CTCCCGCCGGGAACCATAACCCTAACAAGTCTTCCGAGGTGATTCAACCTCCCAGCGAT
TCCGTTTCGAGCCTGTGTTTTAGCCCGAGGGCCAATCACTTAGTTGCTACTTCGTGGGAT
AATCAGGTACGGTGCTGGGAACTTACGAAGAATGGGGCTTCTGTAACTAGCGTGCCCAA
GGCGTCGATGTCCCATGACCAACCGGTACTTTGTTCAGCTTGGAAGGATGATGGAACGA
CTGTCTTTTCTGGCGGCTGTGATAAACAAGCAAAAATGTGGTCTTTAATGTCTGGAGGTC
AGCCAGTGACAGTTGCCATGCATGACGCACCCATTAAGGAGATTGCTTGGATCCCAGAG
ATGAATGTCTTAGTTACAGGAAGCTGGGACAAGACCCTGAAGTACTGGGACACGAGGCA
GTCAAATCCAGTACATACTCAACAGCTCCCAGAGCGCTGCTATGCAATGACAGTGAGAT
ATCCCCTGATGGTTGTTGGCACTGCAGATAGGAATCTTATTGTCTTTAATCTGCAGAATC
CTCAGGCTGAGTTCAAGAGATTTTCTTCACCCCTAAAGTACCAGACAAGATGTGTTGCAG
CTTTCCCTGACCAACAAGGTTTCCTGGTTGGATCCATCGAGGGGAGGGTTGGCGTTCAT
CACCTGGATGATTCCCAGATCAGTAAAAACTTCACATTTAAGTGTCATAGAGATAACAAC
GACATTTATTCTGTCAACTCCTTGAATTTCCATCCAGTGCATCATACCTTTGCCACTGCTG
GATCTGATGGTACTTTTAACTTTTGGGACAAGGATAGTAAGCAGAGACTTAAGGCAATGT
CAAGATGCAGCCAACCAATACCTTGTAGCACCTTCAATAATGACGGCACAATATATGCAT
ATTCGGTCTGTTATGACTGGAGCAAGGGGGCAGAAAATCACAATCCTGCTACAGCAAAG
ACCTACATCTTCTTGCACTTACCACAGGAATCGGAGGTCAAAGCAAAGCCACGGGTTGG
AACAACTAACAGGAAGTGAAGTTGCTGATTAGTTTAGAATATGAAGTTTTGAGGAAAACA
GTGGCAGTGAAGAAAATGGTGCTATTCATTTGCCAGAAATTGTAGAATAGGAATGGTCCT
TCCCTCTCATTGTGGTTTTTTGAGTCATAGTAATCCATTGTTGATATTTTTTTTTTCCTTTTT
TTGCTCATTTTCATATACCAGTAACTTTTGCGGTCTCTGATAGGCCAAGTTCTATTAACTG
TGGTAGATGGGAAGAGTGGCGCTTATTTCAGTACTGTTTTTGTCAAGTCATGTCTATCAC
ATTGATGTTTGAAGATTAAATACATTTTGTACTTTTCCGGAAAAAAAAAA
SEQ ID NO:118
GACACGCGCCCTCTCTCTCTCCCTCTCTCTCCCCCTCCCTCTCTCCCTCCCCTTGCCGC
TGCTCTCCAAAATCTTCCACCAAATCTCTTCTTCTTCAAAGACGCAGCAGCAGCAGCAGC
AGCAGCAGCCGCCCCAGCAGCCCTCGCTACACGCTCCCTCTCTCGCCCTCCCCCAAGC
CGCGTGCGGCGGAAACCGAAGCCCTAGCTCTCGCTCCCTCTCCTCTCTCGCGAAATGA
ACTGTTCCATATCCGGCGAGGTGCCGGAGGAGCCCGTCGTCTCGACCAAGTCCGGCCA
CGTCTTCGAGAGGCGCCTAATCGAGCGATACGTTTCGGATTATGGAAAGTGCCCAGTTT
CCGGGGAACCACTTACCATGGATGATGTCCTCCCTGTGAAAATGGGAAAGATAGTCAAG
CCCAGACCTTTGCAGGCTGCCAGCATACCTGGGTTGCTCAGCATTTTCCAAAATGAATG
GGACAGCTTGATGCTTTCCAACTTTGCACTGGAACAACAACTGCACACAGCTAGGCAAG
AGCTAAGTCATGCATTGTACCAGCATGATGCTGCTTGCCGTGTTATTGCAAGACTTAAGA
AGGAAAGGGATGAAGCCCGATCATTACTTGCACTGGCTGAAAGACAGATACCCATGACA
GCATCTTCTGATATTGCAGTGAATGCTCCTGCCATGAGCAATGGGAGAAAAGCTTCTCTA
GATGAAGAGCCAGGCTATGCCGGGAAGAAAATGCGACCTGGTATTTCTGCAAGCATCAT
TGCTGAGATAACGGATTGTAACTTAGCACTTTCTCAACAGCGTAAAAAGCGACAGATTCC
CTCTACACTGGCACCTGTTGAGGATTTGGAGAGGTACACCCAACTTTCTAGTTATCCACT
GCACAAAACTGGCAAACCAGGCATTACATCTCTTGATATTTGTCATTCCAAGGATATCATT
GCAACTGGTGGGATTGACACATCTGCTGTACTTTTTGATCGATCATCTGGACAGATCATG
TCTACACTATCAGGGCACTCAAAGAAGGTTACTAGTGTGAATTTTGATGCCCAAGGTGAT
ATGGTTTTAACTGGTTCTGCAGATAAGACTGTGCGGATTTGGCAAGGTTCTGAAGATGG
GAGCTATAACTGTCGGCATATATTGAAAGATCATACTGCTGAGGTGCAAGCAATCACAGT
TCATGCTACAAATAACTACTTTGCAACAGCATCTCTCGATAATACATGGTGCTTTTATGAG
TTTTCAACTGGTTTATGTCTTACTCAGGTTGAAGGCGCTTCAGGATCTGAGGGTTATACA
TCTGCAGCTTTCCATCCCGATGGTCTTATCCTGGGTACTGGCACCTCGAATGCCGATGTT
AAAATATGGGATGTAAAAACACAGGCGGAATGTTACAACATTTTCTGGCCATACCGGGGCA
ATTACTGCCATATCTTTCTCTGAGAATGGATACTTCCTTGCGACTGCAGCTCAGGATGGG
GTTAAGCTGTGGGATCTGCGGAAGTTGAAGAACTTTCGCACGTTTTCGGCATATGACAAA
GACACCGGGACAAATTCTGTTGAATTTGATCATAGTGGATGTTATCTTGGACTTGCCGGC
TCAGATATAAGAGTATACCAAGTTGCCAGTGTAAAATCAGAGTGGAATTGTGTCAAGACT
TTCCCTGATCTATCTGGGACAGGTAAAGTGACATGCGTTAAGTTCGGTCCGGATTCAAAA
TACATTGCTGTCGGATCGATGGATCACAATTTGCGAATCTTTGGATTGCCTTCGGAGGAC
GGTGCTATGGAATCATGAAGTGTTGGGATCACATCTAGCAGGTAGGATCAGCTTTTGTG
AGAGCAAAGTAGAGTTTAAGTTTCGTTGTGCTTGGACCGGAAAACTCACATGCTTAGAGT
TTAAGCTTTGTGCCAGCTGAATCAGAGTATGAAACATCCTGTACTGCTGCAGTGAACGAT
TAGCGGCCCTTTGTCTAACATGATTACGATTCTTTGTTTTTCTTTTCGAATTTTGCCTTCTT
GAGCACTAAGTCGTGTATACTATTGTTGTACAAGATTTTGGGACTGACCCATTCTCTAAAA
AAAAAA
SEQ ID NO:119
AAAAATTCTCTCTTCAGTGTTTTTCTTTGTGCTCCCTGCACCTTTGCACATTAACCTTTGG
GAAGGCAATAATGGCGGCGCCTGGAGTCGAGACTTTGAAGAAAGAGATTAAGGAGCTTA
AAGAGAAAATCGCACAACATAGGCTCGACACTGATGGGGAGCAACCATTGCCCGCTGCT
GCCAAGTCTAAGTCAGTGCCTGAAGTTTCTGCAGCATTGAAGCAGAGACGTATTCTGAA
GGGTCACTTTGGCAAAATCTATGCTTTGCACTGGTCCGCAGACTCCCGACACCTCGTTA
GCGCCTCACAGGATGGCAAACTCATTATTTGGAACGGCTTCACGACCAACAAAGTGCAC
GCGATCCCACTGCGGTCGTCTTGGGTCATGACCTGCGCCTATTCTCCCAGTGGCAACCT
CGTTGCCTGTGGTGGCTTGGACAACCTGTGCTCGGTGTACAAGGTTCCGCATGGAGGA
AACAAGGAATCTTCTTCGGCTCAGAAGACGTATGGTGAACTCGCACAACATGAAGGATA
CCTTTCTTGCTGCCGATTCATCAAGGACAACGAGATTGTGACGTCCTCCGGCGACTCTA
CTTGCATTTTGTGGGATGTGGAAACCAAGACGCCCAAGGCCATCTTCAATGATCACACA
GGAGACGTCATGTCTCTCGCTGTTTTTGACGACAAGGGCGTTTTCGTTTCTGGCTCTTGC
GACGCTACCGCCAAGCTGTGGGATCACCGCGTGCACAAGCAATGCGTTATGACCTTCCA
AGGCCATGAATCCGACATCAACAGTGTGCAGTTCTTCCCGGACGGCGACGCTTTCGGCA
CTGGCTCTGACGACTCTTCTTGCCGTCTCTTTGACATTCGCGCCTACCAACAAATTAACA
AATATTCCAGTGACAAGATCTTGTGTGGAATCACATCCGTCGCCTTCTCGAAGACTGGCA
AGAGCTTGTTTGCTGGCTATGATGACTACAACACTTATGTTTGGGACACTCTGAGTGGAA
ATCAGGTTGAGGTTCTGACCGGACACGAGAATCGTGTGAGCTGCTTGGGTGTCAGCGAA
GATGGCAAAGCACTAGCCACTGGCAGCTGGGATACATTGCTCAAGATTTGGGCATAACT
TGTATGAACTTTTTCTGTTGTCGACACTGTAATTACACGAGCTCCCCTTCTTTTGCTGTGT
ATGTCGAGGTTGTTTGTTGCTATTTGATGGATTGTCCGATAAAGCTGACAACACGAAAAA
AAAAA
SEQ ID NO:120
AAGCTGTTTCTTCTTTTTTCCCCTCGCCTCCTGAGCACAGGAAGGATCGTATCTTTTTTCG
ACCCTCTTGCCGCTTCCTTCTCGTGCGCCCCCAATTCGCGTTCATCTGCGGCTGTATTAT
CAAATGACTCTGGTGGATGGTAGGTGAAGAGTTTATAACCCACAAACCCTACCCGTCCTA
CAGCAGCCAACATTCTCCTTACTCTATCTCCGGAGTAGTCCAAAGCAACTTGTCCATTAC
TTTGGATCATAAACTTCTCCTAAGATCTGGCCTTTTACAAGCTCAATAATTTATGGGCTGA
GGTGCACTTTTCATGGGAGGTGTTGAGGATGAGAGTGAACCAGCCTCAAAACGCATGAA
GTTATCATCCAGAGTTTTGAGAGGTCTTGCAAACGGTTCATCTCGTACAGAGCCTGCAGC
TGGCTCTTCACTAGATTTAATGGCTCGGCCCCTACCAATTGAAGGGGACGAAGAAGTTAT
TGGTTCAAAGGGTGTGATCAAAAGAGTTGAATTTGTACGACTTATAGCGAAGGCATTATA
CTCCCTCGGTTATGAAAAAAGTGGTGCTCGCTTGGAGGAAGAGTCTGGGATACCGTTGC
AGTCCTCTGTGGTAAATTTGTTCATGCAACAGATATCTGATGGGCTTTGGGATGAAAGCG
TGGTGACGTTGCATAAAATTGGTCTCTCTGATGAAAATTTAGTAAAGTCAGCCTCTTTCTT
GATATTGGAGCAGAAATTCTTAGAACTTCTGGATCAGGAAAAAGCTATGGATGCTCTGAA
GACGTTAAGGACGGAGATCACACCTCTTTGCATAAAAAATAGTAGGGTACGTGAGCTCT
CGTCGTGCATCATCTCTCCATCATCATGTGGGCTTCTTAACCAGAATAAAAGAAATAGTA
CAAGAGCAAGGTCCCGTTCAGAGCTTCTGGAGGAATTGCAAAAATTACTTCCTCCAGCA
GTTATTATTCCAGAAAGAAGGTTGGAACATCTGGTGGAGCAGGCCCTTGTCCTGCAAAC
AGATGCATGTATGCTTCATAACTCTATAGATATGGAAATGTCACTGTACACTGATCATCAA
TGTGGTAAAGAACACATCCCCTGTCGAACTTTGCAGATTTTACAATCACATAATGATGAA
GTTTGGCTTGTGCAATTTTCACATAATGGGAAATATTTAGCTTCTGCATCCAATGATCGAT
CAGCAATCATTTGGGAGGTTGATGAGAATGGCAGCGTCTCATTGAAGCATAAATTGACTG
GTCACCAGAAGCCGATTTCTTCTGTCTGTTGGAGTCCAGATGACCGACAGCTTCTCACTT
GTGGGGTTGGGGAGACAGTGAGGCGCTGGGATGTCTCTTCTGGTGAATGCCTTCGTGT
TTATGAGAAAGCTGGCCATGGCCTCATTTCATGTGCGTGGTTTCCAGATGGAAAATGGAT
ATGTTATGGTGTTAGTGATCGGAGCATATGCATGTGCGACTTGGAGGGGAAAGAGATTG
AATGCTGGAAAGGGCAGAGAACTCTTAGTATTTCTGATTTGGAAATTACTAGCGATGGAA
AGCAGATCATAAGTATATGTAGAGAAACTGCAATACTTTTACTTGACCGGGAAGCAAAAT
ATGAAAGAATGATAGAAGAAAATCAAACGATAACTTCTTTCTCATTGTCAAAGGATAATAG
ATACTTGCTTGTTAATCTCTTGAATCAAGAGATCCATCTTTGGGATATAAAAGGGGATTTC
AGGCTGGTTGCAAAGTACAAAGGTCTTAAGCGCAGTCGGTTTGTAATCAGGTCTTGTTTC
GGTGGACTCAAACAGGCCTTTGTTGCCAGTGGTAGTGAAGACTCACAGGTTTACATTTG
GCACAAAGGCTCAGGTGAGCTGATCGAGCCATTGCCAGGTCACTCAGGAGCTGTGAATT
GCGTGAGCTGGAACCCAGCAAACCACCACATGTTGGCATCGGCCAGTGACGACCGTAC
CATCCGGATATGGGGCTTGAATGAGCTAAACACGAGGCACAAGGGTGCACGCCCCAAT
GGTGTCCACTACTGTAATGGCAATGGCACCAGCTGAAGAAGAAGAAGAAAGATGCATGG
CTCTAGCTGAGAGCCCTGATGCTGCTTTGTTAGCCATTGCTTTGGTGTAAATACGTATGT
TCATCTAAAAAGCAGACACATTCATGTCAGACCAAGCATGTGAATCTTCAGCAAACTTAC
TGGTAATAAATTTTTACTATTTCATCATAAAAAAAAAA
SEQ ID NO:121
CCTTTCAAAATTCCCTCTCTCTTTCTTTTCTCTCTCTAGCCAGATCTCATCTCCTTCTTCCC
CTTTCCCCTTTCATCTCTGCATCTGTACATGCCCTAATTTCTCTCTCCCTCTCTCTCTCTC
TCTTGTTTCTCTCTCTAGAAGATGACGCAGCTGGCGGAGACCTACGCGTGCATGCCCTC
GACGGAGCGCGGCCGCGGGATCCTCATCGCCGGCAACCCGAAGCCCGGGTCCAACTC
CGTCCTCTACACCAACGGCCGATCCGTCGTCATCCTCAACCTCGACAACCCGCTCGACA
TCTCCGTCTACGCCGAGCATGCCTACCCCGCCACCGTCGCCCGCTTCTCCCCCAACGG
CGAGTGGGTCGCCTCCGCCGACTCCTCCGGCGCCGTCCGCATCTGGGGCGCCTACAA
CGACCACGTCCTCAAGAAGGAGTTCAAGGTCCTGTCCGGCCGGATCGACGATCTCCAG
TGGTCCCCCGACGGCCTCAGGATCGTGGCTTCCGGCGATGGGAAGGGCAAATCGCTCG
TCCGCGCGTTTATGTGGGACTCAGGCACCAATGTGGGAGAATTCGATGGCCATTCACGT
CGAGTTCTGAGCTGTGCTTTTAAGCCAACTCGGCCCTTTCGCATTGTGACTTGTGGAGA
GGATTTTTTGGTGAACTTTTATGAAGGACCGCCTTTTAAATTCAAGCTGTCTCGCAGGGA
TCATTCCAACTTTGTCAACTGCTTGAGATTTTCTCCAGATGGCAACAGGTTCATTAGCGT
GAGCTCTGATAAAAAGGGAATCATCTATGATGGTAAGACTGGTGAGAAGATAGGAGAGC
TGTCATCTGATGGTGGTCACACAGGTAGCATTTATGCTGTCAGTTGGAGTCCTGATAGTA
AGCAGGTTATAACTGTGTCTGCTGACAAGTCAGCGAAGATATGGGACATTTCTGAGGAT
GGCAGTGGTAACCTAAGGAAAACATTGACTTCTTCTGGTTCAGGCGGGGTTGATGATAT
GTTAGTCGGTTGTCTTTGGCAAAACAACCACCTAGTCACTGTCTCTCTTGGTGGCACAAT
CAGCATATACACAGCAGGTGATCTTGATAAAGCACCTGTCTCTTTTTCTGGACACATGAA
GAACGTCTCTTCCTTATCTGTGCTCAAAGGTGATCCAAAAGTGATCCTCTCCAGCAGCTA
TGATGGTCTTATAATAAAATGGATTCAAGGAATTGGATTTAGTGGCAGAGTACAGAGGAA
AGAGTCTACACAAATCAAATGTTTGGCTGCAGTAGATGAGGAGATTGTAACCTCTGGATA
TGATAATAAGGTATGCAGAGTTTCTGGCAGTGGAGATGCAGAATTCATTGACATTGGCTG
TCAACCCAAGGACTTGAGTCTTGCCCTTCAGTGTCCTGAATTTGCTTTGGTTTCAACCGA
TACTGGAGTTGTCTTGCTCCGCGGTGCAAAAATTGTGTCAACCATCAATCTTGGGTTTGC
TGTGACAGCATCAACTGTCGCACCAGATGGAACTGAAGCAATTATTGGTGCACAGGACG
GAAAATTGCGCATATATTCTATTTCAGGTGATACATTAACAGAGGAAGCTGTTCTGGAGA
AACACAGAGGTGCTATTAGTGTCATACACTACTCGCCTGATTTATCCATGTTTGCTTCTG
GGGATCTGAACAGAGAGGCTGTTGTTTGGGATCGTGCCTCAAGAGAGGTGAGGCTGAA
GAATATCTTGTACCACACAGCCCGCATCAACTGTTTGGCTTGGTCCCCTGATAGCAGCA
CAGTAGCAACTGGATCACTTGACACTTGTGTCATCATCTATGAAGTTGACAAGCCAGCAT
CTAACCGTCTAACCATAAAAGGAGCTCACTTGGGTGGGGTTTATGGATTAGCTTTCACCG
ATGACTTCAGTGTGGTCAGTTCTGGTGAGGATGCGTGCATTCGTGTCTGGAAGATAAAC
AGACAGTGATCTTGGGGAGATTATGTGAAGTTTCAGAAATCGCATCTACTTGAGGGTAGA
TTTGCTGTAAGAAGTGGGAGGCTTTTTTTCGGCTGCGTCAACCATTAGAGTCCGAGCTG
CAGGAGTCACTGCCTCTTATCTTTTAAGAGAGGTTTTTTCATCATTGAGGCTTGTGTTTGT
AAGTGTTAGCAGATAAATCTTTGCTGTATTTTCTCTTCTCCTCCTTTTCTCCTCGAGATTTT
TATTGTGGGAATGTTTTTCAAGTTTTACTGTATTCTACTGTTTATTGCTTG
SEQ ID NO:122
TTTTAGTTCGCTTCTCCAGCGCCCCATTCTCTTTTTAGGGTTTTTCGGCGATCCAGAGAA
ACGGTCACCGCCGGCGAGAGCTTCTGAAACCGGCTTACTTCTGGCCGAGCAATATCGG
CAGACGCCGCGATGAAGGTGAAAGTGATATCGCGTTCCACGGATGAGTTCACCCGAGA
GCGCAGCCAGGACCTCCAGAGGGTATTCCGCAACTTCGACCCCAACCTTCGGACCCAA
GAGAAGGCGGTGGAGTACGTCCGGGCTCTGAATGCAGCTAAATTGGACAAGGTTTTCG
CAAGGCCCTTTGTTGGAGCCATGGATGGACACGTCGATTCGGTATCGTGTATGGCCAAG
AACCCGAATTACTTGAAGGGAATATTTTCTGGCTCTATGGATGGAGATATCCGCCTTTGG
GACATTGCTTCGAGGCGAACAGTATGTCAATTTCCTGGTCATCAAGGCCCTGTCAGAGG
CTTGGCGGCATCTACAGATGGTCAAATTCTGGTATCCTGCGGAATTGATAGCACGGTTC
GATTGTGGAATGTTCCTGTAGCTACTCTTGGGGAGTCTGATGGCACACATGAGAACTTG
GCAAAGCCACTGGCAGTTTATGTATGGAAGAATGCATTCTGGGCAGTTGATCACCAATG
GGATGGCGAACTTTTTGCTACAGCTGGTGCTCAAGTAGATATTTGGAATCAAAACAGGTC
TCAGCCAATAAGCAGTTTCGAATGGGGAACGGATACCGTCATATCTGTAAGATTCAACCC
TGGAGAACCTAATGTATTGGCAACATCAGGGAGTGACCGCAGCATAACCCTCTACGATT
TGCGTATGTCGTCCCCAACAAGAAAAGTTATCATGAGGACAAAAACTAATGCTATTTCTT
GGAACCCAATGGAGCCAATGAACTTCACTGCTGCAAATGAAGATTGCAATTGCTACAGCT
ACGATGCTAGAAAGTTGGAAGAAGCCAAATGTGTACACAAGGATCATGTTTCTGCTGTGA
TGGATATTGATTACTCTCCCACTGGCCGGGAATTTGTAACTGGATCCTATGATAGAACAG
TAAGAATTTTCCAGTATAATGGAGGTCACAGTAGGGAAGTTTATCATACAAAAAGAATGC
AAAGGGTATTTTGTGTCAAGTTCAGTTGCGATGCGAGTTATGTCATATCCGGGAGTGATG
ACACCAACTTGAGGCTTTGGAAAGCTAAAGCCTCTGAACAGCTGGGAGTTGTTCTTCCAA
GAGAGCGCCGCAAGCATGAGTATCATGAAGCTGTCAAGAGTCGATACAAGCACCTCCCT
GAAGTGAAGCGTATCGTGAGGCACAGACACTTGCCTAAACCAATATACAAGGCAGGTAT
TCTAAGGCGCACCGTGAATGAAGCAGATAGAAGGAAAGAAGAAAGAAGAAAAGCACACA
GTGCCCCTGGATCTAGTTCAGCAGAGCCATTACGTAAAAGAAGAATCATCAAAGAAATTG
AGTGATGCTTCTTCTTTTAGCGTGTTGTGCTGCATGGCATAATATTTGCATTTCTCAAATC
TGTATTGTTGGATGGGGCTCTATCTTTCTCAAGAGGCTGCTTTTGAGTGAAAAGTTAACT
TCACCTGAGAGATGAGATAGGGCGACGATGCACAAGCTCCTTTCAGTCTTAACAATTTTG
GTAGAATACGGCGGTCCTTTCCTATGAAGGGTGGCCCGAGAGTTAGCCAATGTTTGGTC
ACGCCAGCTTTTGCCTGGAGCATCACAAGTAGTCAGTTTCTGATCTATTGGCGATTTATC
AAGACATGATTTGTTTGTATCATAAGCTTTGATCTTTAACTTGAGTTCAAAAAAAAAAAA
SEQ ID NO:123
AAGCATAAACCCTAGCAATCAGACATCCCAAAAACCCTCTCTGCTCAACCCACTCGATCC
TCTTCCCCAGGGATCCCGGGAACCCTAAGAGAGCCGCCGGCTCCAACTACCGTCCATG
TACGATCCATACTAGATCTGACCAACTGTCCGCCGTTCACCGGGGCTTCGGTCGCCGGC
TCCCCTGTCATCAGGTCACCAACTTCCTTGACCTTCCGGCACTGTTGCCAGTCGCGTTTC
TCTGCACGCGTACCATCGAACGACCACGAGGATGGTTCGAAGCATAAAGAACCCCAAGA
AAGCCAAAAGAAAGAACAAGGGATCAAAGAACGGAGATGGGTCGTCTTCGTCTTCCTCA
ATTCCTTCAATGCCTACGAAGGTGTGGCAGCCAGGCGTGGACAAGTTGGAGGAAGGAG
AAGAGCTTCAGTGTGACCCGTCTGCTTACAATTCCCTTCATGCCTTCCACATCGGCTGGC
CTTGCCTCAGTTTTGACATTGTCCGTGATACACTGGGTTTGGTTCGCACTGAATTTCCAC
ATCAAGTTTACTTTGTTGCTGGAACTCAGGCAGAGAAACCTACTTGGAACTCAATTGGGA
TATTTAAAGTAAGCAACATTACTGGTAAAAGGCGTGAACTGGTGCCATCCAAACCTACTG
ATGATGCCGATGAGGAGAGTGACAGCAGTGATAGTGATGAAGATAGCGACGATGAAGTT
GGTGGATCTGGGACGCCAATTTTGCAGCTAAGAAAAGTAGGCCATGAAGGATGTGTTAA
TCGAATAAGAGCTATGAATCAAAATCCCCATATCTGTGCATCTTGGGGAGACTCTGGACA
TGTGCAAATATGGGACTTCAGCTCCCACTTGAATGCATTGGCTGAATCAGAAGCAGACG
TCAGCCAGGGAGCTTCCTCAGTTTTTAATCAGGCTCCATTAGTGAAGTTTGGCGGCCAC
AAAGATGAAGGTTATGCCTTAGATTGGAGTCCTCTTGTACCTGGGAGGCTCGTATCTGG
GGACTGCAAGAATAGCATCCATCTTTGGGAACCGACATCCGGTTCAACATGGAATGTTG
ATTCCACTCCTTTTATTGGACATGCTGCAAGTGTTGAAGATCTTCAATGGAGCCCCACAG
AAGAAAATGTCTTTGCCTCTTGTTCAGTCGATGGAACTATTGCGATATGGGATACCCGTT
TAGGGAAGACACCAGCTGCTTCTTTTAAGGCACATGATGCTGATGTGAATGTGATCTCAT
GGAACAGGCTGGCTACCTGTATGTTGGCATCTGGATGTGATGACGGGACATTTTCAATT
CATGATCTTAGATTACTCAAGGAAGGTGATTCTGTGGTTGCTCATTTCGAGTATCATAAAC
ACCCGGTTACCTCAATCGAATGGAGCCCACATGAAGCCTCCACATTGGCAGTATCATCG
GCTGACTGCCAGCTCACAATCTGGGATCTTTCCTTGGAAAAGGATGAGGAAGAGGAGGC
AGAGTTTAAAGCCAAAACGAAAGAGCAAGTGAATGCCCCAGAGGATTTACCTCCGCAGC
TCCTCTTTGTTCACCAGGGACAAAAGGACTTGAAAGAACTGCATTGGCATGCTCAGATTC
CGGGAATGATTGTATCTACTGCAGCAGACGGCTTTAACATCCTGATGCCCTCGAACATAC
AGAGCACTCTTCCTTCAGATGGTGCCTGAAATTTTGACCGAGACCGAACAATACCCAACA
CCCGGCCTAATTTTTCAAAGAACAGAGGCCTAAGGGCTGTCCAACCATGGAGTCGCACA
AACCGCAGTGCAAATTTCTGTGTACATTGTGATGCAATGATGAGCAAGTTTGAGACAGTA
GATGATGCAGCAACATATCGTGCTCATAACTGCGAGTTGCGTCTTTTTTTTTTTTTTTTTC
TTTTTTCGTTTGTACGGTTCGTCTTCCTTGCTATGCTTAGCTTTGTTGTCCCGCCCTTGTA
ATGTTTTCCCCATGCGAAGGTTTCTGGGAATTTTCAGTAGAAAATTCGGTCGTGGCGGCC
ATCCTCGATATTTTGGAATGTTTGGTAACTTGAGAGATTTTTTTATCATGAGCATGGGCAT
AAGTTTAATGCACATACGGATACTTTAGAGTCAAAAAAAAAAA
SEQ ID NO:124
GGGTCATGGTCATCGCTCTCATACTGCCCAGGTGAAGGAAAGCGCTAAGCATAAGCATA
TACCATAAGCATATAAGCAAATGGGTCCTATCCAAAATGCAAGAGGTTGGAAGCCTTGGA
GGTCATGGTCACCGCTCTCATACTGCCCAGGTGAAGGAAAGCGCTAAGCATAAGCATAT
ACCTTTGGCACCTCCTCCTTTAATGGTCATGCAAATAATCACGGTGAAGGAAGCTGAATC
TATATGAAGAGATATTGGCTTGCATAGTCACGGTGAAGGAAGCTGAATATATATGAAGAG
ATATTGGCTTGCATAGTCACGGTGAAGGAAGCTGAATCTATATGAAGAGATATTGGCCTC
TCTAGATTGTCTTGGGTTTGAACATTCTATTATAAGCAATCCATGCAGTGGAACTCCTTCA
ATGGTCATGCAAATGGTCACGGTGGAGGAAGCTGAATCTATACAAAGGGATATTGGCTT
CTTTATATTGGATTGGGTTTGAACATTCTATTATAAGCAATCCATGCATAGAGCTTCAGAA
CATTAAAAGCTTACAATCTTATCCCATATCCTTTATATCTATCCTCTCCCTCTTTAATTTCG
CCCCTGTAATTCCTTTTTCTTTCTCTGGTCTTTTGATCTAGTGTTTGAGGAGAATGGTTTA
GAGCAATTAGGCATGGAGAATTTCATTCGCTTCCATGGAGAGCTTCACGGGGATTTCAG
AGCTTTAGATCGTTTCCGGTTTCAGGCTATAATTATAAGGTATTTTTTTTAAATAGATCGA
GCTATTTGAAAAGGCCATGGAGTGGGGATTTAATCACATTACAACCCAGACTCCTCTTCA
GGTTAAATTGCATGCCACTGCATTGGAAATTGCAGTGGAAGAGTTGCCAGATAAACTGGT
TTTTAGTGGATGATTGTCGAGGTGCTGCAGCTTATAAAGGCTGCACTCAATGATGCCGAT
CTACTCTTTCCTTCTAGAACTCCAGAGTTACTGTTATTCTTGTTTTACTAGGGTGGACTTT
TGTGGTTGTGCTTCCTTTTATCTCAGTGGATGGTACCAGGTCATGGGTTTAAAGAATTAT
CTGCAGAGTGTTCGCTGTCAATTTTCAAGCAGTGTTTTACCAGCTTGTTGATCTCCTATTT
GATTATCTAGACAAAGCCATGGAGCGTTACAAGGTCATAAAAGAGTTGGGGGATGGCAC
TTATGGAAGTGTATGGAAGGCTTTGAATCAGCAGACACATGAGATAGTAGCTATAAAAAA
AATGAAGAGGAAATATTATATCTGGGAAGAGTGTATCAATCTCCGGGAGGTCAAGTCTCT
GCGGAAGTTGAACCATCCCAACATTATCAAGCTGAAGGAGGTCATTAGGGAAAATAATG
AACTCTTTTTCATATTTGAATACATGGAATGTAATCTGTATCAAATAATGAAAGAGAGATCT
ACTCCTTTTTCGGAAACAGCAATTATCAAATTTTGCTATCAAATACTGCAAGGGTTATCCT
ATATGCACAGGAATGGTTATTTTCACCGAGACTTAAAACCAGAGAATTTGTTGGTAACTA
GCGACTTGATTAAAATTGCGGATTTTGGGCTGGCAAGGGAAGTTCTCACTAGCCCACCT
TATACAGATTATGTTTCAACAAGATGGTATCGTGCTCCAGAAGTCCTGCTGCAATCTCCG
ACATACACTACTGCAATTGACATGTGGGCAGTTGGGGCTATACTGGCGGAACTCTTCAC
TTTGCATCCTCTCTTTCCTGGTGAAAGTGAACTGGATGAAATTTACAAGATCTGTGGTGT
GCTTGGCACTCCAGATTATGAGACTTGGCCTGACGGCATGCAGCTTGCAGCTTTTAGGA
ACTTCATCTTTCCCCAGTTTCTACCAGTCAATCTTTCAGTTCTTATTCCCCATGCTAGCCC
AGAAGCAATCGATTTGATTACGCGTCTATGTTCTTGGGACCCTCAGAAGAGGCCAACAG
CAGAGCAGGCACTGCATCATCCTTTCTTCCGTATTGGCATGTCTATTCCTCTTTCTCTTG
GGGGACATTTCCAAGATAATACATGTGCAGCAGAGGTAGATACAAATTTTCATTCCAAAA
AGGCATGCAAGGGGCGTGGAATGGGGGAAAAAGAATCAAGCTTGGAATGCTTCCTTGG
TTTGTCTTTGGGGCTCAAGCCAAGCCTTGGTCATCTGGGCGCAATGGGATCTCAAGGTG
TGGGAGCAGTGAAGCAGGAAGTGGGGTCCTCTCCAGGGTGTCAGAGTAATCCAAAGCA
GTCCCTATTTCAGGTTTTAAACTCAAGAGCAATTCTACCACTATTTTCCTCAAGCCCCAAC
CTGAATGTGGTTCCAGTCAAGTCCTCTCTACCTTCAGCATATACAGTAAACAGTCAAGTC
ATGTGGCCAACAATAGCAGGTCCACCTGCTGCTGCAGTTACTGTTTCTACTCTACAGCCA
AGCATACTCGGCGATTTTAAGATCTTTGGAAAATCCATGGGGTTGGCTTCACAGTATGCG
GGAAAGGAAGCTTCCCCTTTCAGCTAGATAATCAAATTTAGTTCATGCTTTCTGAACAAA
GTAAATTTTTAATGAGTGAAATTCACCATTTTAAGCTGATAGCTTTAGTTCCTACGTGGAA
TGTATAAATGCACCATTGTCCATAAGGCAAGAGCTTTAATCTGGATTCCAGTATATATACC
CAGTTTGGTTTCTAAATTCATTTGTAAATAGTTCCAACCGAGAGAGGATATTTTTGAGGTA
GATTGGCAATGGAATTACTGCTTGCCATTAAAAAAAAAA
SEQ ID NO:125
TGTTCAATTTAATTGGATGTGAAAGTCATTTGCAGGTTCTTTGCTTAACGAACGATCGTGT
CACGTTAGTGTCAGCGCTGTGTGAAGAATGAGTTTGAGGCGAATATATAAATGAATTAT
AAACGCCAGCTGAACTGAAAGCATTCAGTCATGGGCGAGATGGGTAGGGGCATCAACAA
TAGCAGCAATAACAATAATAGCAACAGGCCGGCATGGCTCCAGCATTATGATTTGGTGG
GCAAAATCGGGGAGGGCACCTACGGCCTTGTATTTCTCGCGCGGAGCAAATTACCCAAT
AACAGAGGTCTCCGCATTGCAATTAAAAAGTTCAAGCAATCTAAGGATGGGGATGGCGT
TTCCCCCACTGCTATCAGAGAGATAATGCTACTCAGGGAGTTTTCCCATGAAAATGTCGT
TAAACTCGTGAATGTGCATATAAATCACGTAGACATGTCTCTGTACTTGGCATTTGACTAT
GCGGAACATGATCTTTATGAGATTATCCGACATCATCGGGAGAAGCTAAATCATCATAAC
ATTAATCAATACACCGTTAAGTCACTACTCTGGCAATTATTGAATGGATTGAATTATCTTC
ACAGTAATTGGATTGTACATCGTGACTTAAAGCCATCTAATATCCTGGTGATGGGTGAAG
GTGAAGAGCACGGGGTTGTGAAAATAGCTGATTTTGGGCTTGCCAGGATATACCAGGCT
CCCTTAAAACCTTTATCTGATAATGGGGTTGTTGTTACGATATGGTATCGAGCACCGGAG
TTACTCTTAGGGGCCAAACATTACACGAGTGCTGTTGATATGTGGGCTGTAGGATGTATT
TTTGCAGAACTGATAACACTGAAACCACTGTTTCAAGGTGTGGAGGTCAAAGCTTCACCA
AATCCTTTTCAGCTTGACCAACTTGACAAGATATTTAAGGTCTTAGGACATCCAACAATAG
AAAAGTGGCCAACTCTAATGAATCTACCACATTGGTCAAAGAATTTGCAACAAATTCAACA
GCACAAATATGACAATGCAGGGTTGCATATTGGTCCCATTCCTGCAAAAAGCCCTGCTTA
TGATCTTCTTTCAAAAATGCTTGAGTATGATCCCCGCAAGCGTATCACAGCAGCACAAGC
TTTGGAACATGAGTATTTTCGAATTGATCCTCAACCAGGACGCAATGCACTTGTTCCCAG
CCAGCCTGGCGAGAAAGCTATTAATTATCCACCTCGTTTGGTCGACGCCAATACAGATTT
TGATGGAACAATCGCTCCTCAGCCTTCTCAGGTATCTTCTGGGAATGCACCATCTGGTTC
AATTGCATCAGCTGCAGTACCTGCTGTTAGACCACTTCCTCAGCAAATGCAGCTAATGGG
TATGCAAAGAATGCAGAATCCAGGCATGGCTGCTTTCAACTTAGGTGCACAGGCAAGCA
TGTCAGGACTCAATCACAACAATATTGCTTTGCAACGTGGTTCTTCTCAACAACAAGCTC
ATCAACAGGTTCGGAGAAAAGAACCCAATAGTGGGTTTCCAAATACAGGATATCCACCAC
CACCAAAATCAAGGCGCTTATAAGGATGAACTTCATGTGGAAAGTTGTATAAAATGCATG
TGAAGTCACTTCATATGTCCCTTGGTACGGCTGCCAAGCACTTCATCCTCTAATCAGTTG
GTGTCTTGGATGTTGTCAATCCAACATGTGAGGCCAGGCCTTGATGTTGGCCATATATG
GTGCTGATTGGTGTTCAATGGCTGGACTAAGAGACTCTTGTCCAAATTTAACAAGAGGCT
ATATTCTCTCTCCCAGAGAGGTCCTTGCCTATGCCAGAAATTTCAAAATGATGTCAGCCA
ATTCATCAGGTTATATTTCTGAATTAGCAGAAGCATTACATGAATACCTGTTACCTGATGG
AAACTAATTTTTACCTGATGGAATGGAGTGATCAGATTAGGGATCCTTTAGGGATTTGTAT
GGATTTAAGAACGCAGATCCTTGGATGCTCTGGTTACATGACTACTCCTTAGGGAATCAG
TCAGACATTTTAAATAACTTCCATATCTTTGAAGGTTCTTTTCTCTTCAGGCAGTAGATAC
CCAATTATATACCAACCTTGTATTAATTGCTGAGCATTCAGCATGACTTCATTTTTCTTCA
GACGTTTAGGTATCATATATGTTTTGACAATAAATAATGTGCGTATGATTCTTTGGATGCC
CTGGATATAAGACCAGTTATAAGACTACCTACCCTGGGGTAACTTGTCATGCATCATTAA
CCTTGATATCTTTGAAGGTTCTTTTCATCTTGGGCAATGGGACCCCAATCATATAACAAGT
TATGATTAGTTGTGAAGCAAAAAAAAAA
SEQ ID NO:126
GGTGGAATTTCCATTCAACACTGGAAAAACTCCAAATCAAAGGTCAGATATCATAACCTA
TATGCTACAATGGGATTATATAGATTAATAATGAATAATACAGCCCTGAACCCAATTCTGT
TCAAAATTTGAAACAATGCTAGCTGTTGGATGGAGATTGGGTATTTAAATATGGGCCGTC
AGATGGGGTTTGAAACAACCACAGCAACGGTTCCATTGAGTTTATTCAGATCCATTCCAT
CGTCTTACTTTTTTTTAGCTCAGTGGGGTGGGCAGGGTGTGTATTGTGGGCTCGTTGGT
GGGGTGAATTGATTACAGAGGAAGCCCAGGAAGGCACTGGAAGCATTTCAATATAAATA
ATCTCTGCAATTGATTGAATCGGCCCGCCATGGACAAGTATGAAAAGCTTGAGAAGGTC
GGGGAGGGCACCTATGGGAAGGTGTACAAGGCAAGAGACAAGATGACTGGACAGCTCG
TTGCTCTCAAGAAGACTCGCCTTGAGATGGACGAGGAAGGCGTCCCACCCAGTTCTCTT
CGTGAAATCTCCCTCCTGCAAATGCTGTCTCAGAGCATATATGTTGTTCGGTTGCTTTGT
GTGGAGCATGTGACGAAGAAGGGAAAACCACTGCTTTACCTAGTCTTCGAGTACCTTGA
TACAGATCTGAAGAAATTCATCGACTATCGACGCAGTGTCAATGCTGGTCCTCTGCCGCA
AAATGTTATTCAGAGTTTCATGTATCAACTGTTGAAAGGTGTAGCTCATTGTCACAGCCAT
GGAGTGTTGCACAGGGATTTAAAGCCACAGAATCTATTGGTCGATAAAAGCAAAGGCTTA
CTTAAAGTTGGAGATCTGGGACTTGGGAGGGCTTTTACTGTGCCTTTAAAGTGTTACACC
CATGAGGTTGTAACCCTATGGTACAGAGCTCCAGAGGTGTTGTTGGGATCAACTCACTAT
TCCACACCTGTGGACATTTGGTCTGTAGGATGTATTTTTGCTGAAATGGTGAGAAGACAA
CCACTTTTCCCTGGGGATTGTGAAATACAACAACTGCTTCATATCTTCACGTTGCTTGGA
ACCCCAACTGAGGAAATGTGGCCTGGAGTAAAACGTCTAAGGGACTGGCATGAGTATCC
TCAGTGGAAACCTGAGAACCTTGCTCGGGCAGTTCCAAATCTATCACCAACTGGTCTCG
ATCTTATCAGTAAAATGCTGCAGTGCGATCCTGCAAAGAGGATTTCAGCGAAGGCAGCT
ATGAATCACCCTTACTTTGATGATCTGGACAAGTCTCAATTCTGAAGTATTCAAATTTCAC
TATTTATGGGTGTTCAAGGATGCCAGAGACTTTTAGGTGATCATAGTTAAGGAACCGTTC
CTGCCAATTTGGAGGTTTAAGCAGCCATAGTAAAATAGTTTTTCTTGCTATAGATTGCAAG
TCTCATCTGTTGTTTGCAAAAGCAAACAGAAATTCCAATTTTTGCAGTTGATTCTACTCAG
CCTTTCATTCATTTTTTCATTAAGCGGTACTGGCAGAGGACATGTCTATTTATACAAGCAA
ATGGTCCTATTGGCTGTTTAAAACAGTTCTATTTAACTCGATCAAAATCTGACTTATTTTGA
AATTCTTCTAAAAACCAAA
SEQ ID NO:127
GTTCGTCCCACCCATATCTCTGAACACCAGAGTAGCAATGGATGGTTATGAGAAAATGGA
TAAGGTGGGAGAAGGAACTTATGGGAAGGTGTACATGGCCAGGGACAAGAAAACAGGG
CAACTGGTCGCCCTTAAAAAGACCAGGCTAGAGAACGATGGTGAGGGAATTCCTCCCAC
TGCCCTCCGGGAGATTTCTCTTCTGCAGATGCTTTCTCAGGATATCTACATTGTAAGGCT
GTTAGATGTGAAGCACACTGAGAACAAGCTTGGGAAGCCCCTTCTGTACTTAGTTTTTGA
ATACATGGAGTCTGATCTCAAGAAGTACATCGATAGCTATCGCCGCAGCCACACTAAAAT
GCCTCCTAGTATGATTAAGAGCTTCATGTACCAGCTGTGCCGTGGAGTTGCTTATTGCCA
CAGCCGTGGCGTGATGCACAGGGACTTAAAGCCTCATAATCTGCTGGTGGACAAGGAAA
AGGGCGTGTTAAAAATAGCAGATCTTGGACTGAGTAGGGCATTTACTGTTCCTGTCAAAA
AGTACACTCATGAGATTGTGACCCTTTGGTACAGGGCTCCTGAAGTTCTCTTGGGGGCT
ACTCACTACTCATTGCCTGTTGACATCTGGTCTGTTGGCTGTATATTCGCTGAAATGTCC
CGAATGCAAGCTCTCTTTACTGGGGATTCTGAAGTACAACAACTTATGAACATTTTCAGG
TTCCTAGGAACTCCGAATGAAGAAGTGTGGCCAGGAGTGACTAAATTGAAGGACTGGCA
TATCTACCCCGAATGGAAACCTCAAGATATAAGCCATGCTGTTCCAGACTTGGAACCAAG
CGGTTTAGATCTGTTGTCTCAAATGCTGGTTTATGAGCCATCCAAGCGAATCTCAGCTAA
GAAGGCACTAGAACATCCTTATTTTGATGACCTGGATAAATCACAATTCTGAGTCCCTTTA
AGGATTTAAGCAATTCAGGATAGTTTGTAGGTATCACCATTATTATATTGATTGGTTAATT
GATTTGTGCTTTTCTGGGGGTTCTATAAATGGCCTGTTGTCTGAGAAATAATCTCTGCAA
ACTTGTTGGCCGGTAAATAAGTGTTTTGTTGTGTTTGCACAAGCGAACGGACAATGTTGG
TCAGACCTCAAATATTGTACTCCCCACACTAGGGAGCATTTACGGTGAATATAATTTTTCA
TATTGTGTGTAAAAAAAAAA
SEQ ID NO:128
CCACTTTCGAAAAACCCGTTTCAAGCCTTTCACGAAAGTCCAACGGTCAGAAAATTCAAA
ATGACTGTTTGAGGCAGAGCCAATCTAGGACCACACTCCATTTATATATGCCCTCTGCTC
CTCTCGACCCTTAGAGTCCTCTGCGAATCTTGTTGTTAGTTACTGTGTACGCTGTAACAA
TGGATGCCTATGAGAAGTTGGAGAAGGTGGGAGAAGGAACCTATGGGAAGGTGTACAA
GGCCAAGGACAAGAACACAGGGCAATTGGTCGCCCTTAAGAAAACAAGGCTGGAGAGC
GACGATGAGGGTATTCCTCCCACCGCTCTCCGCGAGATTTCCCTTCTGCAGATGCTTTC
TCAGGACATCCACATTGTCAGGCTGTTGGATGTAGAACACACAGAGAACAAGAACGGGA
AACCCCTTCTTTATTTGGTTTTTGAATACATGGACTCTGATCTCAAGAAGTACATTGATGG
TTATCGCCGAAGCCACACAAAAGTCCCTCCCAATATTATTAAGAGCTTCATGTACCAGTT
GTGCCAGGGGGTTGCTTACTGCCACAGTCGTGGTGTAATGCACAGAGACCTGAAGCCT
CACAATCTCCTGGTGGATAAGCAAAGAGGTGTAGTAAAAATAGCAGATCTTGGCCTCGG
AAGAGCATTTACAATTCCTATTAAGAAATATACACATGAGATTGTCACTCTCTGGTACAGG
GCTCCTGAGGTTCTTCTGGGGGCTACTCACTACTCTACACCTGTAGACATCTGGTCTGTT
GGTTGTATATTCGCTGAAATGGTCAGATTGCAAGCCCTGTTTATTGGAGATTCTGAAGTA
CAACAACTTTTCAAGATTTTCAGTTTTCTAGGAACTCCCAATGAAGAAATCTGGCCAGGA
GTGACTAAATTCAGGGACTGGCATATCTATCCTCAATGGAAACCCCAAGATATAAGCTCT
GCTGTTCCAGACTTGGAACCAAGTGGTGTAGACCTGTTGTCTAAAATGCTGGTTTATGAG
CCATCCAAACGAATATCAGCTAAAAAGGCATTGGAACATCCCTATTTTGATGATCTGGAT
AAATCTCAGTTCTGAGATCGCTTTTAAGGATTTAAGTAATTCAGGATAGTATATATAAATTA
CTGCATGCATTGGTGGTTTTGGAAATGTCCCATTTTCTACGAATTCATCTTTGCAAAGGAA
AATATATTAGCCAGTATGTGTTTCTTTTAATGTGTATGGGCAGGCAAACAAGCAATGTTTG
GTTGCTGTTGTGGACCTTTTAAATATTGTATTCCCACACTGGAGAGTAACTTTCTTTGATG
ATCTGGATAAATCTCAGTTCTGAGATCGCTTTTAAGGATTTAAGTAATTCAGGATAGTATA
TATAAATTACTGCATGCATTGGTGGTTTTGGAAATGTCCCATTTTCTACGAATTCATCTTT
GCAAAGGCAAATATATTAGCCAGTATGTGTTTCTTTTAATTTGTATGGGCAGGCAAACAA
GCAATGTTTGGTTGCTGTTGTGGACCTTTTAAATATTGTATTCCCACACTGGAGAGTAACT
TTCTTTAAAATTAATGTTAAGATGTGATATTAAAAAAAAAA
SEQ ID NO:129
GATCGATCTATATCTCTGTGCTATAAACAAAGCAAAACAATGGATTCTTATGAGAAACTGG
AGAAAGTGGGAGAAGGAACCTATGGGAAGGTCTACAAGGCTAAGGACAAGAAAACAGG
GAAACTGGTTGCCCTTAAAAAGACCAGGCTGGAGAATGATGGCGAGGGAATTCCTCCCA
CAGCCCTTCGCGAGATTTCTCTCCTGCAGATGCTTTCTCAAGATATGAACATTGTCAGGC
TGCTGGATGTGGAACACACTGAGAACAAGAATGGGAAGCCGCTTCTGTACTTGGTTTTT
GAATACATGGACTCTGATCTCAAGAAGTACGTTGATGGTTATCGCCGCAGCCACACAAAA
ATGCCCCCCAAGATTATCAAGAGCTTCATGTACCAGTTGTGCCAGGGGGTTGCCTACTG
TCACAGTCGCGGTGTGATGCACAGGGACTTGAAGCCTCACAACCTGCTGGTCGACAAG
CAAAGGGGTGTGCTAAAAATAGCAGATCTTGGCCTGGGAAGGGCTTTCACAGTTCCTAT
CAAGAAATACACACATGAGATTGTGACCCTTTGGTACAGGGCTCCTGAAGTGCTCTTGG
GGGCTACTCACTACTCTACACCTGTTGACATTTGGTCTGTTGGCTGTATATTTGCTAAA
TGTCCCGAATGCATGCTCTGTTCTGTGGAGATTCTGAAGTGCAACAACTTATGAGCATTT
TCAAGTTTCTAGGAACTCCAAATGAAGGAGTTTGGCCGGGAGTGACCAAATTGAAGGAC
TGGCATATCTATCCTGAATGGAGACCTCAAGATTTAAGTCGTGCTGTTCCAGACTTGGAA
CCAAGTGGGGTAGACCTATTGACTAAAATGCTGGTTTATGAGCCCTCCAAAAGAATCTCA
GCTAAGAAGGCATTGCAACATCCTTATTTTGATGACCTGGATAAATCTCAATTCTGAGATT
CCTTTTAAGGATTTAGGCATTTTAAGGATTTGTCATATTTGGGGGTTTTGGAGATCTTCCA
TTTCTGAGATTTTCATCTTTGCACAAAGGCAAACATAAGCCCATATAAATAACTTGAAGTG
TTTGCACAAGCAAACAAGTAGTGGGAGCTTTCCCAAATATTGTTTTCCCACACCTGGGAG
CTTTTGTCATGAGCATTTATGCTCAGATTTAACATGGCCCTTCATGCTTAAACCTGTTGTT
GTTTGAAAATAATTTTAGAAAATGTGAAGTTGAAGCATGTTTTGAATTTATGGTGGTGGCA
TGTGGATATTTGAACTTGGTTGAGAAAAATTGAAACATCTTTGTTAGGGAAAAAAAAAAA
SEQ ID NO:130
TAATACGACTCACTATAGGGCAAGCAGTGGTATCAACGCAGAGTACGCGGGGGTATTCC
AGATATCTATATGTACGGAACACCCTCCCCGCCGAGAAAATTAAGGATGATCGTCGACT
GCGTAAGAGGCGGCAAAATTCGAAATTTGTTTATATCATCGCTATTCAGATGCATGCCTG
CACTAGCTTACGAGAAATAAATTTCTTAGCTCTGCATCAACAAACTAAACATGGAGAAATA
CGAGAAGTTAGAGAAGGTAGGGGAGGGAACCTATGGTAAAGTGTACAAGGGAAGAGAC
AAACGCACTGGAAGACTGGTGGCCCTCAAGAAAACCCCCTTTCACCAGGAAGAGGGCAT
TCCTCCCACTGCCATTCGGGAGATTTCTCTTCTCAAAAGCCTCTCGCAATGCATATACAT
TGTCAAGTTGTTGGATGTAAAGGCTTCATTTAATGGCAAAGGAAAGCACGTACTGTTTAT
GGTATTTGAATATGCAGATTCTGATCTCAAAAAGCACATTGATGCACACCGCCAATGCAA
TACCAAGTTGTCTCCAAGGTCTATTCAGAGCTATATGTTCCAATTATGTAAGGGTATTGCC
TATTGCCACAGCCACGGGGTGCTCCACAGGGATCTGAAGCCACAGAATATTTTGGTAGA
TCAAAAAATAGGGTTGCTGAAAATTGCAGATCTTGGACTTGGAAGAGCTTGCACAGTACC
TATCAAGAGCTATACTTTTGAGGTTGTTACTCTTTGGTACAGAGCTCCTGAAGTGCTGTT
GGGTGCCAAGCGCTATTCTATGGCATTAGACATATGGTCTCTTGGTTGCATCTTTGCTGA
ACTATGTAATCTGCAAGCACTTTTTGCTGGAGATTCTCAAATACAGCAGCTTATAAACATA
TTCAGGTTGCTGGGAACTCCTAATGAACAGCTATGGCCAGGTGTGACCCAGCTAAGCGA
CTGGCATGAATTTCCTCAATGGAGGCCTCAAGATCTTTCCAAAGTCGTGTTCAATCTGGA
TCCAAATGGTGTGGATCTTCTTTCTAAAATGTTGCAGTATGATCCTGCGAAGAGGATCTC
GGCAAAAGAAGCACTAGACCATCCATACTTCGACAGTTTAGACAAATCACAATTTTGACA
TCAGTTTTTTTATCCATGGAGTTGATTGGAAGCGGTTAAAAGGCATGCATCTGAGTGGTC
AGCATAAGCACATAATAAAGGAGAACAATAGGTAACTTCTGTACTGTTCATGAAACTTAAT
GGGAGGAAAGGTTTTCCAGTGTTTTCTGACATCTCGATTTCAGCACTGAATTGTGGCATT
ATATAAATTCAGGGAGGTTCATAGAGTTGCATTTTTGCATCTCTCAAGGCTATTTTTAGAA
AGTAGTCATTCCTATTGAAGGGTCAACCTTTAATTTTGGCTAGCAGGACTGTATAGGATT
ATATGCATACAAGATGACAAATAATTGTTATAAATCCACAATGTGAACAGTTATTTTTGAT
GGTGCCAATGAAGCTGTGCGCCTATGGAAAAAGCATACAGGGCGACAAATAATTATTAC
AAGTTCACAATGTAAGACACTTATTTTTGATGGCTACTGAAGCTGTGCGCCTACTAGAAA
TCGTTGATATGCTACACATGGCCAGAAAGCTTACCATAAGATTTCTTTTTTGGTTAAGATT
CTGGAATGGTTTAGGAATCTCTAGCTAGGGAGCTAAGCCTTAAATATGCTGGCAATTGTT
GTAATTGTGAACAAACACAGGCAGTTTCCACATTATTATTGTCATTTATCTTTCAATTCCC
ATTTGGTCTTCAATATTATTTAGTAATTTCCTTGTGTAAACTACGTTGTGGTGATCTTCTAC
CTGGCAGTCATTAGGGATATAGAAGTCAACAGGCTAAGGCCGGATTCTGCATATCTTTTT
CCATCTTAGAATGTTATTAAACTTGGCCTCATCCAAAGGACTCGGTATATTTGAGGGAAT
CGCAAGGACCCTGCCCCTAGTGGGAAGAGCTTTTGGTTGGTGAAAGATGCCATTCCCAA
CTGCAGAGTGATGTAAGGTTTGAATCACTTAGTGAAAGATTCTCATTAACTTAAAAAAAAA
A
SEQ ID NO:131
ATTTGTTTTCTCTTTTGGGTTTAAATTCCATGTTTTCGAGGATTTTTTGGGGGTTTTAGGG
TTGTCAGACGAGCGCTTTTGGAGTGGGTTTTGTAATTATTCTAATGGGGTGCGTTTGCGG
CAAACCCTCTGCGAGGGCCGCGGATTATGTAGAGAGCCCTGCAGAGAAGGGGGCATCC
TCCAATAGCCGATCTTCTTCAATGGCGTCTCGGCGGTTGGTAGCCCCCGCTGTCATGGA
CCAGGGCATTGATGCCGAGAACGGACACGAGGGGGATTATAGGACTAAATTGAGAGGA
AAACAGAGCAATGGTGCTGACCCAGTTTCATTGTTGTCGGACGACGCTGAAAAGCAACG
GCACTCTCGGCACCATCAGCATCAGCAGCATCATCCTATTCGTCCTCATCATCTGCGGC
CACAGGGGGAATTCGTTCCCAACGCCAATTCTAATCCCAGGTTCGGGAATCCTCCCAGG
CACATCGAAGGCGAGCAGGTTGCTGCAGGATGGCCAGCCTGGCTTACTGCTGTGGCTG
GCGAAGCGATCAAAGGCTGGATCCCACGCCGGGCCGACTCTTTTGAGAAGCTCGATAA
GATTGGACAAGGAACTTACAGCAATGTGTATAAAGCACGTGATTTAGATACTGGAAAAAT
TGTTGCCCTAAAGAAGGTGCGGTTTGACAATTTGGAGCCTGAAAGTGTGCGCTTTATGG
CCAGAGAGATACAGGTTTTACGTAGACTTGACCATCCAAATGTAGTGAAGTTGGAAGGAT
TGGTCACATCAAGGATGTCCTGCAGCTTGTACCTTGTATTTGAATACATGGATCATGACC
TTGCTGGCCTTGCTGCATGTCCCGGTATAAAGTTCACAGAACCGCAGGTGAAATGTTATA
TGCAACAATTGCTTCGAGGTCTTGATCATTGCCATAGCCGTGGTGTGCTACACCGTGATA
TCAAGGGTTCAAATCTTTTGATTGACAATGGCGGTATTCTAAAGATAGCTGACTTTGGCC
TAGCTACCTTCTTTCACCCTGATCAGAGGCAGCCCTTGACAAGTCGTGTTGTAACACTTT
GGTATCGACCTCCGGAACTTTTACTGGGTGCTACAGAGTATGGAGTTGCCGTGGATTTG
TGGAGCACAGGTTGCATACTTGCAGAGTTGCTTGCTGGAAAACCTATCATGCCAGGAAG
AACAGAGGTAGAACAACTGCACAAAATTTTTAAATTGTGTGGTTCACCATCTGAAGATTAT
TGGAAAAAGTCAAAATTGCCACATGCAACTATCTTCAAGCCACAGCAGCCATATAAACGC
TGTGTTGCAGAGACGTTTAAAGATTTTCCACCATCGGCTCTGGCACTGATGGAGGTTCTT
CTTGCCATAGAACCTGCTGATCGTGGAACTGCCACTTCAGCATTAAAGAGTGATTTCTTT
ACCACCAAACCACTCGCTTGTGATCCTTCAAGTTTGCCAAAGTACCCACCAAGCAAGGA
GTTTGATGCAAAAATTCGTGATGAGGAGGCAAGAAGGCAAAGAGCAGCGGGAGGAAGA
GGGCGTGATGCAGCTAGGCGCCCATCACGAGAATCAAGAGCAATTCCAGCACCAGAAG
CAAATGCTGAATTAGCAATTTCCATACAGAAGCGGCGCTTAAGTTCACAAGGGCCTTCTA
AAAGCAAGAGTGAGAAATTCAATCCCCAGCAGGAAGATGGTGCTGTGGGATTTCCTATT
GAGCCTCCAAGGCCTATGCATATTGGCATTGATGCAGGTGCCACTTCTCGCATGTATTCT
CAACAATTTGGGCCTTCTCATTCTGGTCCATTATCAAATCAAATTTCTAGTTCAATATGGG
GAAAGAATCAGAAAGAGGACGAGATACAAATGGCTCCAGGTCGTCCATCGCGGTCCTCA
AAAGCCACAATATCTGATTTCAGAAAACCAGGGGCCTGTGCACCCCAACCTGGAGCAGA
TTTGTCACATTTATCCAGTTTAGTCGCAACAGCAAGAAGTAATGCTGGTATAGATACACAT
AAGGACCGTAGTGGCATGTGGCAACATAATCGTATTGATGCAATAGATGGTGTACATAAT
AATGGGAAGCATGAATTTCTTGAAGTTCCAGAACATCCAAACAGACAAGATTGGACTCGG
TTCCAGCAGCCAGAATCATTTAAAGGTTTAGATAATTATCACTTGCAGGATCTGCCAGCA
ACTCACCATCGCAAAGATGAAAGGGTTGCTAGTAAAGAAGCTACCATGAACTGGCAGGG
TTATGGTGGTCAAGGGGGGGACAAAATACATTACTCGGGCCCATTGCTTCCGCCTTCTG
GAAATATTGATGAAATTTTGAAAGAGCATGAACGACACATTCAGCATGCTGTGCGTAGAG
CTCGGCAGGACAAGGGTAGACCACAGAGAAGCAATTTATCACAGAACGAGAGGAAAGC
ATTTGAACACAGAAGTTTTGTTTCTGGGGTGAATGGAAATGCAGGGTATTCTGATCTTGT
AAATGAATTGCCCATTTCAGTAGGTAGTAATAGGTTGAAAGTGAGCAAGACAAGAGGGA
CTGAAGAAATAGTTGAGCTGAGGGAGTTGGAGAGGGAACCCCTCTCATCAGTAATGGAG
AAGTATGAAAGAGAGCATGAAATGTGATACCTCCTGTGTAGTCTTGTTTTTCTTCATTTTA
GGTTCTGCAGGTGAGATAGAAGAGATGAGACTGTTCTGCTGTGCATATACCTCAACTGG
TTAATTGCATCTCAGAAAGAACTTTCAACATCTGGTTTTGTATGAAAGAGGCAGCGGAGT
TGAGAGCTTAATTTAGTTTAAGCTCAAGGTTCAGAGAATTCTTGCAGGACTGGGACCTGG
GGTATTTGAAAAACAACATATAGGTTTTCCCTATGAAAATGTACGGAATTTATTGTATTTTA
GATTCTGGAGGGCCATCTAAACTTCTGGCTGCTTGGTGCAACATTGATGGACTGATATAG
TTAAGGTCTTGACCTTTGTTGCATTTTGCTTTGCATGCAACCATGGTTGTAGATCTCATGC
ATGTACGAATGTAGAGGACCTGGTCTCAAAATGTTATTTGCTGCTTTGTTTTACCCTTGAA
GTGAAATGCAGGCAAGCTTCACCAAGGTATTCCCCCTAATATCTTTTCAGAACTTTTGGA
TTAAAAAAAAAAA
SEQ ID NO:132
TCTTTCCATACTCAATGTTTTTCCATACTATGGAATTGAATTCTGTTGAAAAATATTGCTAA
ATACAAAATAGTATTAGTAATAATTATTTTAAATTCCGCTTTTATTCGTATTTATAACTATTA
GCCAGCAGAGCCGAACGCATGCAGGAGTCTCTCTCCTTTTGCCTTCGAAGGTTTCGCTC
AGATGGGCTGCGTCTGTGCCAAACAATCCGACATTCTCGGTGAACCAGAATCTCCCAAG
GTCAAGGGTTCGAATCTCGCCTCCAGCAGGTGGTCGGTCTCCTCCGAAACAAAACAACT
GCCGCAACATTCTGATTCTGGAATCCTGCATCATCAGCATTATTACCACCCTCGAGACGA
ATCCGACGAAGCCAAATTGAAAGAGAGCAACTATGGTGGATCGAAGAGGAGAACAAGGC
AGGGAAGGGATCCCGCTGACTTGGATATGGGCATCTTCGTCCGCACTCCTTCCAGCCAA
TCAGAGGCCGAGCTGGTGGCAGCTGGATGGCCGGCCTGGATGGCAGCTTTTGCAGGG
GAGGCCATCCATGGCTGGATCCCTCGCAGGGCGGAGTCCTTCGAGAAATTGTACAAGAT
TGGACAAGGGACTTACAGTAATGTGTATAAAGCTCGTGATCTTGATAATGGAAAAATTGT
TGCCCTGAAGAAGGTACGTTTTGACAGTTTGGATGCTGAAAGTGTGCGATTTATGGCAC
GAGAAATACTGGTTTTACGCAAACTTGATCATCCAAATATTGTCAAATTGGAAGGACTTGT
TACTTCAGAGGTATCCTCTAGTCTGTACCTTGTATTTGAGTACATGGAGCATGACCTTGC
TGGACTTGCTGCTTGCCCGGGGATCAAGTTCACTGAACCACAGGTTAAATGTTATATGCA
ACAATTACTTCAAGGACTTGATCACTGTCACAGACATGGTGTACTCCATCGTGATATCAA
GGGTTCAAACCTTTTAATTGACAATGGAGGCATTTTAAAGATAGCTGACTTTGGTCTAGC
AACTTTCTTTTATCCTGATCAGAAACAGCTCCTGACAAGTCGTGTTGTAACACTTTGGTAC
CGGCCTCCAGAACTTTTGCTTGGTGCTACAGATTATGGAGTTGCTGTGGATATATGGAGT
GCTGGTTGCATACTTGCTGAACTGCTTGCTGGCAAGCCTATCTTGCCCGGAAGAACAGA
GGTGGAACAACTGCACAAAATATTTAAATTGTGTGGATCACCATCTGAGGACTATTGGAA
GGAGTCAAAATTACCACATGCAACCATATTCAAGCCACAACACCCTTACAAAAGTTGCAT
TGCTGAGGCTTTCAAAGATTTCTCTCCATCAGCTTTGGCCTTGTTAGAAACTCTCCTTGCT
ATAGAACCTGGTCATCGTGGAGAAGCAAGTGGGGCCCTTAAGAGTGAATTCTTTACAAC
GGAGCCGCTTTCTTGTGATCCATCAAGCTTACCTAAATACCCGCCAAGCAAAGAGTTTGA
TGCAAAATTGCGTGCTCAAGAAACAAGAAGGCAAAGAGATGTGGGTGTGAGAGGTCATG
GATCTGAGGCAGCAAGGAGAACGTCCCGACTATCTAGAGCAGGTCCAACACCAAATGAA
GGTGCTGAATTAACAGCATTAACTCAGAAGCAGCATTCGACTTCTCATGCAACTTCAAAC
ATTGGAAGTGAAAAACCAAGCACTAAGAAGGAAGATTACACTGCTGGATTGCATATCGAT
CCTCCAAGGCCTGTCAATCATTCTTATGAAACAACTGGTGTTTCACGTGCATATGATGCA
ATTCGTGGGGTTGCTTATTCTGGCCCATTGTCACAGACACATGTAAGTGGTTCAACATCA
GGAAAGAAGCCAAAAAGAGATCATGTAAAGGGACTTTCAGGTCAATCATCTTTGCAACCA
TCAAAACCTTTTATAGTTTCTGACTCAAGATCAGAGAGAATCTATGAAAAAAGCCATGTAA
CTGATTTGTCAAATCATTCAAGACTAGCAGTAGGAAGAAACCGTGATACTACAGACCCAC
ACAAAAGTTTGAGTACTCTGATGCAACAAATCCAGGATGGTACATTAGATGGAATAGATA
TTGGCACACATGAATATGCAAGGGCTCCAGTTTCTTCAACAAAGCAAAAATCAGCTCAAT
TGCAAAGACCGTCAGCATTGAAATATGTAGATAATGTTCAACTTCAGAATACACGTGTAG
GAAGTCGCCAAAGTGATGAAAGACCTGCCAATAAAGAATCTGATATGGTATCTCATCGTC
AGGGGCAGAGAATTCACTGCTCGGGACCTCTGCTGCACCCATCTGCCAACATTGAAGAC
CTTTTACAAAAGCATGAGCAACAAATCCAACAGGCTGTACGCAGAGCACACCATGGTAAA
CGTGAAGCTCTAAGTAACCAAATCATCTCTCCCTGGAAAGAAACCAGTGGACCATAGAGCT
TGGGTTTCTTCTGGAAAAGGAAACAAAGAATCACCATATTTTAAAGGAAAAGGGAACAAA
GAATTGTCAGATCTTAAAGGGGGACCAACCGCCAAAGTAACAAACTTTAGGCAGAAGGT
AATGTAAAGTATAGCTAAGGAAATTGCAGATGAAGGGATTCAGAAAGAGAACCCCTCCA
GTCAGGCACAAAAAGATATGAACAAAGGAAAAATACTTGCTACATGTCTTCTAGGGTCATA
TTCTGGTCTCTCTAGTTGCTGACGTCAATTTATGCATGGATTGGTTGATTTGGGATGGGA
GTTTTTTATTTTCTAAGCACTGGGATTGCTTCAGCGGGCAAATCAATAATCACGTACCTTA
TAAAATAGGGTATCCTCAATTTTTTTTTCTTTATTTATAACTGCGAGGGTTTATGGGATCTT
TTAACTCTGCAGAAAACTTATACAGGGAGTTTCAAACCATCAGATAAGTCTTTTGCATTTA
AGATTATTGATCACCTTCTAAAGAAGTCACAATTGTCTTGCAACTGCCCATTAAAACGTTG
AAGGACTATTTGGTTTTATCTCGAGATCTCTGTGATGCAACAACTGACTATATGGTCACTC
TTATGTTGCAGCGAATGCGAATGGCTCTAGCCACATTGTAATGATCTGTTTCAGTATTTTC
TGTACAGTGGAAGAATAAAGTTATGCCAAAGTTCTTTCAAGAACTTCCATTAGACTTTTGT
ATTTGTTCATACATGATTCGATAGAATAAGAAATCATGCATAAGAGACTTTCTTTTAAGGG
AAACATCTATGCCTGGAAGATTGAGATGAAGGGTTCTATTTTAATAGATATGGGATTGGT
TGCAAATGTAAGGAGCCAGGTTCTTCCATTGGCCTGGAGGATTTTGTATGGGACCCTTA
CATGCTGCTGATGAAGACCTATCACTTGGCTATGCTTGGCATTAAAACGAACTGAAAAGA
GAGGCGGCACTGAGGAGTCAAGCCACATCTGAGAAAAGACTTTAAATTCAGCATGCACT
CATTGTGAAGATTGTTATTATGGGTGAGTCCTAAATGGTTATGAACGCTGTATTTGTCTGC
AAGGATAGCGTTCTTGACAAGGGCCAAATCTTTCCATGTTTGGTTTACAGAAGTTTGTATT
TTAATTTTGTTCATTGCATTCTGTTGCAGTGTTTCAATTGAATCCGTGTCAGAGGAAGCAG
CACAATATGTAGGTTTTGTGGAGTTAATCTTTTAGTTTTAGGAAGCAAAGTTTTCTACATTT
CAGAGTAGGGCTTTCCCTCTACAGTTTTGAGTGTAGTGTGTTTCTTTTATATCTTGCAATC
AAAGAAATCATAGCTAATGATTCCATGCTATCCATGGTATCTACTTCACGATAATAAAGGT
CTTAGTCCACAAGTTTGATGTGGTATACAATGGAGCAAGAGAATTGCACAATGAATATCA
CATGACCTTGATTATTTCTAATGAACAATAAATGGTATGGTGGTTTTCATTATGAAAACCT
CAAAACTTTTATATGGAGTATCTTCATAACATTTTAAATGAGCAATAAATGATATGCTTCAA
AACTTTTATAAAAAAAAAA
SEQ ID NO:133
GTCCTTCATTGTTTAAAGCCGATTGATCTATATTTGTTAAGAACAAAGAAGATTCTGTACT
TCTGCATATTTATACAAATGCTGGAGCTCAAGTATTGATTGCATCGGAGACAACCCATCT
GTTGAATAGGCTGTGAATGTGCCAGCGGACCTTCCCTCGCTTGAATTTTTTCGTTTAGCA
GTTGAGTCTGAGGCCGATTTGGAAGAAAGCCGAGGAGGATGGCAGTCGCAAATCCTGG
TCAGCTGAATCTGCAGGAGGCGCCCTCATGGGGTTCTCGCAGCGTCAATTGCTTTGAAA
AGCTTGAACAGATTGGAGAAGGCACATATGGGCAAGTTTACATGGCCAAGGAGATCGAG
ACTGGGGAAATTGTTGCCCTGAAGAAAATTCGTATGGACAATGAAAGAGAAGGGTTCCC
AATAACCGCCATTCGGGAAATCAAGCTTCTGAAGAAGTTGCAGCATGAAAATGTTATCAA
GTTAAAGGAAATTGTGACCTCTCCAGGTCCTGAGAAGGATGAACAAGGGAAATCAGATG
GTAATAAGTACAATGGAAGCATTTACATGGTCTTTGAATATATGGATCATGATCTGACTGG
TTTAGCTGAGAGACCAGGAATGCGCTTTAGTGTTCCCCAGATTAAGTGTTACATGAAGCA
ATTGTTAATCGGGCTGCACTATTGTCATATCAACCAAGTTTTGCACCGGGACATCAAAGG
ATCTAATCTGCTGATCGATAATAATGGGATCCTAAAGCTTGCCGATTTTGGCCTGGCAAG
ATCATTCTGCAGTGACCAGAATGGAAACCTGACCAATAGAGTAATAACATTGTGGTATAG
GCCCCCAGAGTTGCTGCTAGGCTCAACAAAATATGGTCCAGCTGTTGACATGTGGTCAG
TGGGATGTATATTTGCGGAGCTTTTATATGGAAAACCTATTTTACCAGGAAAGAATGAGC
CAGAGCAACTCACTAAAATTTTTGAGCTTTGTGGATCACCGGATGAGTCCAACTGGCCG
GGTGTTTCCAAGCTTCCATGGTACAGCAATTTCAAGCCGCAAAGGCAAATGAAGAGGCG
TGTTAGGGAATCTTTTAAAAATTTTGACAGACATGCTTTGGATCTTGTGGAAAAAATGCTC
ACTTTGGATCCTAGTCAGAGAATAAGTGCAAAGGATGCACTTGATGCTGAATATTTCTGG
ACAGATCCGGTTCCTTGTGCACCAAGCAGCTTGCCAAGGTATGAACCATCACATGATTTT
CAAACAAAGAGGAAACGGCAGCAACAAAGGCAACATGATGAAATGACCAAGAGACAGAA
AATTTCACAGCATCCTCCACAACAGCATGTTAGACTACCTCCTATACAAAATGCTGGACA
AGGTCATCTTCCTCTAAGGCCAGGTCCAAATCCAACTATGCATAATCCACCTCCCCAGTT
TCCAGTTGGACCTAGTCATTATACGGGAGGACCTAGGGGAGCTGGTGGACAGAATAGG
CATCCTCAGAATATACGCCCACTCCATGCAGCTCAAGGGGGAGGATATAATGCAAATCG
AGGATATGGGGGCCCTCCTCAACAGCAAGGAGGAGGATATCCCCCACATGGGATGGGA
AATCAAGGACCAAGAGGTGGACAATTTGGGGGTAGAGGTGCAGGCTATTCTCAAGGAG
GTCCTTATGGTGGACCTGTCGGGGGCCGAGGTCCAAATGTTGGTGGAGGCAATCGAGG
TCCACAGTTTTGGTCGGAACAGTGAATCGATTGTGGAGTACACATTTAGCTTGCATGTGA
TACTATTAAATGCAAGTGTCTTTAGTTTGAAATTGTGAAATCATCCGAGCATGGCCGGATA
CACCTTTTTAAATTCAGGAATTATTCACTTGATCAGTACGTTCTGCTCACCCCTACAGCAA
AGCTACAGTTTTTTCTTACATCTAAGATTTTTCTTCTTTTTAGCGGGGTTACAAACCCAAC
AGGTGGTGTTCCTTATGGCTGTCAAGCATACATGCAGCAGCTCATTCAGCCTACTGATTT
GGGTGGTGCATATGCTCATTACTCCATTACAGGGATTCAAATGCCAACCATGTCACCACC
TACCACCTAATAGGCCTGAGTATTGCTCACCACTATGCTGATATGGGGAGCAATAACGTT
AGTAAATTTGTGAATTTTGAAGATTTTCTTCTAGCAAGTCATATTGTTGGGGCCTGTTAAA
TGAACTTTCTGGTGTATCCGGACAATTTAGGGATGTGTGTTTCGATTACTACCAAATGTA
CATTAGAATGCCATTTTCTTGATTTAGGTTGTTTTTTTAAGAAGTGACCAAATTCACTTTGA
CATTGTCTTGGCTAATCATGATGACTGATATTCTCTCTGTATTATTTAAAGACAACTCTCC
AGAAATATGACTCGAAGGGATTCCGTAAGAAAAAAAAAA
SEQ ID NO:134
CTTACAGTTTTGACATTCTCATAACGGGAATGTTCGAACTGTTAATACCATTTAATCACCG
AGTGACTTTAACGCCATTTCAGTCAGCAAAGATTTCTCAATTTGTTTATCAGAATTGGGTC
GGTTCGGTCATAACTCCTGCATTTCTTGTTTGTGTTTGATCTATAAGGCTACTCAGATGTT
AAATATATTTGATTCAGTCTACTCTGTTCTGAGGGCAAATGAAACTCTCCGGGCTTATGAA
GGTTGGTTTGAATTGAGGAATTTGGGATATATAAACACTGGCAATGTTCTCCGGACCATG
GAATTGGGGGTTTCTCCATAGGGTGTTTCAGGGTTTTGAGACTTGAATACCTCGAAAAGA
TTTGATGCGCGGGTTTTGAGAGGTAAAATCTCGAAAAGATTTGATGGGAACAGGTCTTCC
GGAGATTTGAGGAATGCAAAACATGGAGGACAATGTGCAGAGCAGCTGGAGTCTCCAC
GGCAACAAAGAAATATGTGCCCGTTACGAGATTCTAGAGAGGGTCGGCAGCGGAACTTA
TTCGGATGTGTACAGAGGGCGGAGAAAGGCGGACGGTCTAATAGTAGCCCTTAAGGAG
GTGCATGACTATCAAAGCTCTTGGAGAGAGATTGAGGCACTGCAGAGGCTTTGTGGGTG
CCCCAATGTTGTGAGGCTCTATGAGTGGTTTTGGAGAGAAAATGAAGACGCAGTTCTGG
TTTTGGAGTTTCTGCCTTCCGATCTCTATTCTGTCATTAAGTCTGGTAAGAATAAGGGAGA
AAATGGTATTCCCGAGGCCGAGGTTAAGGCCTGGATGATTCAGATCTTGCAAGGGCTGG
CTGACTGCCATGCCAATTGGGTTATCCATCGTGACCTAAAGCCCTCTAATCTGCTGATTT
CGGCTGATGGAATTCTCAAGCTCGCTGATTTTGGACAGGCAAGGATACTTGAAGAGCCT
GAAGCGATCTATGAAGTAGAGTATGAACTTCCTCAAGAGGATATCGTTGCTGATGCCCCA
GGAGAAAGGTTGATGGAGGAAGATGATAGTGTGAAGGGAGTGCGGAATGAAGGGGAGG
AGGATTCATCCACTGCAGTTGAAACTAATTTTGGTGATATGGCAGAAACTGCGAATTTGG
ATTTGAGCTGGAAAAATGAAGGTGATATGGTGATGCAGGGATTCACATCTGGTGTTGGA
ACTCGATGGTACAGAGCTCCGGAGTTGCTCTATGGAGCAACGATCTATGGAAAAGAAAT
TGATTTGTGGTCGTTAGGTTGCATTCTGGGGGAGCTCTTGATTTTAGAACCTCTCTTTTCT
GGGACTTCAGACATTGATCAACTTAGCAGGTTGGTTAAAGTTCTTGGGACTCCAACAGAA
GAAAATTGGCCTGGATGCTCCAATCTTCCTGATTATAGGAAACTTTGTTTTCCTGGTGAT
GGAAGCCCCGTTGGTCTGAAGAACCATGTCCCCAGTTGCTCAGACAGCGTGTTTTCTAT
TTTGGAAAGACTTGTTTGCTATGACCCTGCAGCTAGGCTAAATGCTAAAGAGGTACTTGA
GAATAAGTATTTTGTTGAGGATCCTTATCCTGTCCTTACCCATGAATTGAGAGTTCCCTCA
CCTCTGAGGGAAGAAAACAATTTTTCAGAGGATTGGGCGAAATGGAAGGATATGGAAGC
AGATTCTGACTTGGAAAACATTGATGAGTTCAATGTTGTTCACTCAAGTGATGGTTTCTGC
ATTAAATTTTCATAAACTCTAATTGGCTGATCATGTTTCATCTTCCTCAAATTGCCTTTTAT
ATGGATTAGTATTCTAAATTAGCTTTGGGGGCTGCATGTCTTCAAGCATTCACCACGACA
GTAAAGTAATCATTATGATTACTGATGTATTGCTGTCACTGGGTGAGAAACATTATAGCCA
TGAACCTGCTGTTAAATTTGTAGAATTGTCTTATACTTGCTCAAAATAGTGACTGCATGTT
TGAGAATAAATTTCTCTGCAAAGACCAACTCCAAATTCTTTCTTTTCACTTTGTACTAATGA
TCATTGTGACCACAAAATCTTTATACACAATACAGAAAATGCAAATTTAGAGAAAAAAAAA
A
SEQ ID NO:135
CTCCGGCACAGAAAGCCAAGATGTTGATCGCCTCCCACGCTCGCACTTGACCGTTCTCA
CGAGAAATCTCATGTCCATTCCTTCCGTGCGAGCCGACAGAGGACCCCAAGAGAAGCAC
AATATTTTTGCTATAACTAATAATGCCATTAACATCCGCTGTTGGGACGCCCGGAAGTAA
ATGATCCATCGTTGCGCGAAGAGCAAAGTCAGTTAAAGCGCTAACAACCATATCAGTAAC
GATGGACTTGAACCAGTACCCTGAAGACTTGAACCCGGAACTACCCGAGGGTACAGACA
ACGTAGATAATCCTGATAATAACAAAGGCTCCCCTGTTCCTTCTCCCCATCCTCCTCTGA
AGCCACTCGATCCCTCTGAACGGTACCGCAAAGGTATCACCCTTGGGCAAGGTACCTAT
GGGATCGTCTACAAGGCCTTCGATACTGTGACAAACAAAACTGTGGCAGTAAAGAAAAT
CCATCTGGGCAAAGCCAAAGAAGGTGTTAATGTAACTGCTTTAAGAGAAATCAAATTATT
GAAGGAGCTTTCACATCCAAATATAATTCAGCTAATAGATGCATATCCACACAAACAGAAT
TTACATATTGTCTTTGAATTCATGGAGACTGATCTTGAGGCTGTTATAAAAGACAGAAACT
TAGTTTTCTCCCCAGCTGATATAAAATCCTATTTGCAAATGACGTTGAAGGGGCTTGCTG
TTTGCCATAAAAAATGGGTTTTGCACAGGGATATGAAACCAAATAACTTGTTGATTGCTG
CTGACGGCCAACTGAAGTTAGGAGATTTTGGCCTGGCTCGCTTGTTTGGTAGTCCAGAC
CGCAAATTTACTCATCAGGTTTTTGCAGTTTGGTACAGGGCACCGGAGTTACTGTTTGGA
GCTAAACAGTATGGTCCAGCTGTAGATATTTGGGCAACAGGTTGTATTTTTGCAGAACTT
CTGCTCCGAAAACCGTTTCTTCAGGGTGTGAGTGATCTCGACCAGATAGGGAAGATTTTT
GCAGCCTTTGGAACTCCTAGACAGTCACAGTGGCCAGATGTGGCCTCTCTTCCAGATTT
TGTTGAATTCCAGTTTGTTCCTGCACCATCGCTACGTTCTTTATTTCCCATGGCTAGTGAA
GATGCCTTGGATCTTCTGTCAAAGATGTTTACGTTGGATCCAAAAAATAGGATTACTGCA
CAGCAAGCATTGGAACACAGATATTTTTCTTCTGTGCCTGCTCCCACGAGGCCAGATTTG
CTTCCCAAACCCTCTAAGGTGGACTCATCAAGGCCCCCAAAGCATGCATCTCCAGATGG
TCCTGTAGTGCTGTCTCCTTCTAAGGCAAGGCGAGTAATGCTCTTTCCAAATAATTTGGC
TGGAATTCTACCAAAGCAGGTTTCCCAATCTACAACTGGAGGGACACCAATTGAATTTGA
CATGCCGACACAAAAGCTACGTGAAGTTTGCCCTAGGTCCAGAATTACTGAATCTGGCA
AGAAACATTTGAAGAGGAAAACAATGGACATGTCTGCTGCATTGGATGAATGCGCAAGG
GAGCAAGAAGGGCAAGAAGGTAAAACCATTTTGGACCCCGACCATCAGCGTTCTGCCAA
AAAAGAGAAGCATATGTAATTTCACTTGATTTCACTTTGGGTGATCATATTGTTAGGCAAG
GAGACTAGGCATTTTCCTTGTCACTATCCTCATATTGATATCACCTCGTGTATGTTGTGG
GGTGGCAAAATTACTTCCATTATTGTTGATATCCTTTATGTTGAGAAGGTGGATTCATTTT
TAAGCAGTGGTCTCAAAAAATGACCCTGCAAGAACTGGAACTGCCAAATTTGTAAGCAAT
ATACTGTTGAGAAAAATGTTTTGCATCATCTTGAGTTCTCATTATATGACAAATTTCGGAG
GAGAAAAAGATTGGTTTCTAAAAAAAAAA
SEQ ID NO:136
GTAAAGATCCTCTATCTCTTCCCCTATTTGTTCCTCGCCACAGTACGTTATTTCGGCGAAA
TTCAGATGATCGGTCTCAACTATTACAGTGACCGCCATTACTGTCTGCTGTTCCTGTGAA
TGACAAAAGGTTCTTGATGTCAAGAAGTTAAGAAGTCGTGGAAACGGATTACGCGTTAAT
GCTGTTTACAAGGAAGGATTGCTAGTGCTCCGTCTGTTAGCGGATCCTGTTCTTGGTGG
GGAGATTAAAGATGGCAGGGGGACAGGAAAATTGCGTCCGTATTACTCGAGCTCGAGCT
GCCTGTGTTTCGAAAGCTTCTGCCCCTGTGATTCAATCTCAAGTAGATGAGAAGAAATCC
AGGAAAAGAGCTCCTAAGCGAGCTGCAGTGGATGATCTTGCTGCCAATGCAAGCGGTTC
TCAGCCTAAAAGGCGCGCAGTTTTAGGCGATGTCACAAATCTTCATGCTGCTGCTACCG
ATTGCTTGTCAACAGCGGAAGACCAGGTTGATGCTCCCAATCCTAGCATTAAAGGGAGG
GCGCGTAACAAGAAGAAAGAGGCAAGGACTTCAACGAAAGTAGTTAAAGATGAAATTCA
TCCGGAATCGAATCCTCTTGCTGATCATAGTTCGAATCTTTCGGAATGTCAAAAGCCTCC
CGCTGCAAAATTAGCAGAGCAACGATCATTAAGAGGCGTACCTTCAAAGGCAAAGCAAG
GGGGAAGCTCCAATTCACAAAGCTGTTCGAAGCACACAGACATAGACAAAGACCACACA
GATCCTCAAATGTGTACTACTTATGTCGAAGATATTTACGAGTATTTACGAAATGCAGAGT
TGAAGAATAGGCCTTCGGCAAATTTCATGGAGACTGCACAGAATGATATCACCCCAAATA
TGCGAGCAATTCTCGTTGATTGGCTGGTAGAGGTTAGCGAGGAGTATAAGCTTGTCCCT
GACACACTATACTTGACAGTATCTTACATTGATCGTTATCTGTCCGCAAATCCAACTAGTA
GGCACAAACTGCAACTTCTGGGAGTCTCTTGCATGCTAATTGCCTCGAAATACGAGGAA
GTCTGCCCTCCTCATGTGGAAGAATTCTGTTATATCACAGATAATACATACACAAGAGAC
GAGATGTTGTCAATGGAGAGAAAAATTTTGATCTTCTTGAACTTTGAGATGACAAAGCCT
ACAACAAAGAGTTTTCTCAGGCGTTTTGTACGAGCTTCTCAAGCAGGGAATAAGGCCCC
GAGTTTGCATATGGAGTTTTTAGCCAATTATTTGGCAGAGTTGACACTTATGGAATGCAG
TTTTCTCCAATACCTACCATCCTTGATAGCAGCCTCAACTGTATTTCTGTCTCGATTGACA
CTTGATTTTCTAACAAATCCATGGAATCCAACTTTGGCGCATTATACAGGATACAAGGCAT
CACAGCTTAAGGACTGTGTAATGGCAATCTATAATGTACAAATGAATCGAAAAGGTAGTA
CATTGGTTGCAATAAGGGAAAAAATATCAACAACACAAGTTCAAATGTGTAGCAAGCTTGC
CTCCACCTCCTTTTATTGCTGAAAGGTTCTTTGAAGACACCCCAAACTAAAGCTATCCATC
ATTGGTTGTGCACAAGCGTTTTGCTGAAGCTTGTTTGGAAGTACAATCTCATGTAATAGTT
GTAATTGAATCCGGAGTCTTTTACTTATTGTTGGTTAATTTCCATTTAGATGTGTCAATTTG
TTCAGACCGTCAGTGCCCATTCTTTTGGTGCTCATATTTTAACTCAGCGACTTACCAGCC
TAGTAAGCAATGGGGAGCTTGCATGTATTAGTTTTGAGCACTCAAAGGAAGTGCCCATTT
ATCAAACTGTATCATATTTTTTGGAAATTATTACGTGGAAACAGACGACTCCATGTGGTCA
TGGAGTAATGATTAACAAAAAAAAAA
SEQ ID NO:137
GAAAGGAGGGAGTGGTACGAGCTTCGGAAGAGGAGCTTCCAACCCGACACCAAATCTT
CTCACGGGGCTCCTCTATATGTACAGCAAACACTGGCGACTCTTTCCCTCAGTTCCGTTG
GTCCTACTGACATTAGCTGGAACAGGCTGATCGGCCATGGTGATCCTTTGGGCGGGCTG
CAAGAACTGGTAAACTTCCGCGGCTGGATCTCTGAATATTGCTGTTCGATTTACACATAT
TCAAAGGTGTAATAAAAATAACGATGACTGGAACCCAGGCATCCAATGTTCGCATTACAC
GGGCGAGGGCTGCAAAGTCCACGTTGAACAATGCCCTTCCTCCTCTTCCGCCAGCCCAA
GGTAAACCGAGAGGAAAACGTGCAGCAACGGAGTCAAATATTTCCGGATTCAGTGTTGC
TGCTGAACCCCTTAAACGACGTGCAGTGTTATCAGATGTGTCCAATATCTGTAAGGAAGC
GGCCGCGGTTGATTGCTTGAAAAAACCAAAAGCAGTTAAGGTTGTATCTCAGAATGCCAA
TGCCAAGGGAAGAGGCCGAGGAATTCCTCGGAATAACAAGAAGATTACTCAAGAGGCAG
AGATTAAAAAAGAGACTTCACCAGCTATTTGTAATGTTGACGATGCATCTGCTGGTAATG
CAATCGGGGATGATAAGCAAAACAATAATGTGAACCCTCTAAAAGAAGTACAGGACAATC
CTAAAGAACTTAATCCAATTGCTGAGCAAATTTCAGTACACCCACATTGCAAACAGTCAG
TGGAAAAACCGAATGAGAAAGAGATTGTGGTAAGTGATAATAAAGCAGCAATTGCTTCTC
TGAAGCAACAATCAACATTGCAGTCTCTACGTATACCAAAACAACCAAAATATTCTTTGAA
GCAAGGTAATCCTGTTCCTCTGGCAAACCTTCATGAAGATGTGGGACGGTCTAGCTGTT
CGGATTTCATTGACATTGATTCGGAATACAAAGATCCTCAGATGTGTACAGCCTATGTTA
CAGACATCTATGCAAATATGCGAGTTGTAGAGCTAAAGAGACGGCCCTTGCCCAATTTTA
TGGAAACCACTCAGCGTGATATCAATGCAAACATGAGGAGTGTACTCATTGATTGGCTTG
TTGAGGTATCAGAGGAGTACAAGCTTGTCCCAGACACACTTTATTTAACCGTATCTTATAT
TGACCGGTTCCTCTCTGCAAATGTAGTCAATAGACAAAGGCTACAGCTTTTAGGGGTTTC
TTGCATGCTTGTTGCCTCGAAGTATGAAGAAATTTGTGCTCCTCCAGTAGAAGAAATTTTG
CTATATTACCGATAATACCTACAAAAAGGAAGAAGTGTTGGAAATGGAAATTAGTGTTTTG
AACCGCTTGCAATATGATCTGACAACACCAACCACCAAGACCTTTCTCAGGCGTTTCATC
CGAGCAGCTCAAGCATCATGTAAGGTTTCAAGCTTGCATTTGGAATTCATGGGCAACTAT
CTTGCAGAGCTAACACTTGTAGAATATGATTTTCTCAAGTATCTACCATCTCTAATAGCTG
CAGCAGCTGTATTTGTGGCAAGAATGACCCTTGACCCTATGGTTCACCCTTGGAATTCTA
CTCTGCAACATTACACAGGTTATAAAGTATCAGACATGAGAGATTGTATTTGCGCCATAC
ACGACTTGCAATTGAACAGAAAAGGTTGCACATTGGCTGCAATAAGGGAGAAGTACAAC
CAACCTAAGTTCAAATGTGTGGCGAACTTGTTTCCACCTCCAATAATTTCTCCTCAGTTCC
TCATAGATAATGAGGTGTAGTTGATTCTGAATATGTGGCACTCATCTTCATTTGGCTACAT
CTCAAATGGATGCAGATAGCTGTGTTTTTCAGATTTGTGAATTGGGAATTCATCACTGAAA
AAGAATTCGTCCTGGTCCTTTAGGACATGTACTTATGTCCATGCAAGTGCTTCTTGCCTA
AGCTATGTTGATATTTCACCTTTCTTTAGATAAGGTGCTTGGTTGATGCTGTCATATAAAT
GGCCATGTCTACCATGCACATAAATTTAAACATTTTCTTGAAAAATCAAGATTATGACTAT
GGGTATCAGTGACCACTTTTCACTTGTATTTTATATGCAGAGTTAATCTGGAGGATCATTA
TCCATGAGAGTGGCCCATTCTGATTTTAAACAATGAACTTACCTGGTTGTCTATAACATTT
TTAGTTAATATTGAAATCATGGCTTAAAAAAAAAA
SEQ ID NO:138
CTTGAACAGGAACAGACAGATTGTTTCAAGGCCACATGCAGCAGCAAACATAGAGATTG
TTTCAAAGCCACCTGCAGCAGCAAACAGACAGATTGTTTCAAAGCTACCTTCAGCAGCAA
ACATACAGATTGTTTCAAAGCCACCCGCAGCAGCAAAGGGTCCTGCAATGGCAGCACCT
AACCAGAATGCGTTATTAATCAACAACAACAATAGGAGGCCCTTAGTAGACATTGGCAAT
CTAGTGGGAGCTTTGAACGCCCAATGCAATATCAGCAAAAATGGTGCCAGGAAGAGGGC
CTTCGGAGACATTGGCAATCTCGTGGAAGATCTGGACGCTAAATGCACTATCAGCAAATA
TTGGGTTAGAAAGAGGCCTAGAACAAACTTTGGTGTCAATGCTAACAAGGGAGCCAGTT
CTAGCACTCAGGGACAGGGTATTGTTGTCAGGGGAGAACAAAAAGCATGGGACAGGATA
GTCTGGGGAAACAAACAAAGCTGTGCTATTAAAATGAATGCTCAACACGTCACTGCTACT
CAGAGGGGTACTGCCATTTCAATCAGTGATATCATAGATTCAAGTGTACAAGATGGGGG
AATAAAAGCACCTAGCCAGCTGAAAGCAAGGAAACAAACAGTACGAACAGTGACTGCAA
CACTAACCGCTCGAAGCGAGGATTCATTGAGGGACGTCCTTGAGGTTCCTCCTGGCATT
GATGATGGAGACCGAGACAACCCACTGGCAGTTGTTGAATATGTTGAAGATATATACCAC
TTCTACCGGAAGATTGAGGTCAGGAGCTGTGTACCTCCAGATTATATGACAAGGCAGCT
TGAAATCAAGGACTCCATGAGGGGAGTTATAATTGATTGGTTAATAGAGGTTCATCGGAC
ATTTCTACTGATGCCAGAGACATTGTATCTCACTGTAAACATCATCGATAGATACCTCTCT
ATTCAGAGTGTTACAAGGAATGAATTACAATTAATGGGTATCACTGCAATGTTTATTGCTT
CCAAATATGAAGAAATCTCTCCTCCAAAGATCAACGATTTAGTCTACATCACGAAAGACG
CATACACATCAAAACAGATAGTGAATATGGAGCATACAATATTAAATCGACTGAAATTTAA
GCTTACGGTTCCCACTCCATATGTATTCCTTGTCAGGTTTCTTAAGGCAGCTGGTCCAGA
CAAAGTGATGAAAAATCTGGCATTCTTCTTGGTTGACCTGTGTTTGCTTCATTACAAAATG
ATCAAATACAGTCCATCAATGCTTGCTGCTGCTGCTGTCTACACTGCTCAATGCACTTTG
AAGAAGCATCCATATTGGAATAAAACACTTATCCTTCACATTGGCTACTCGGAAGCACAC
CTCAGGGAATGCGCACACTTAATGGCTGATTTACACCTGAAGGCAGAAGGAAGTAACCT
CAAAAGTGTTTATAAGAAGTATTCATATCCAATTTTTGGCTCTGTTGCTTTTTTAAGCCCT
GCCAAGATACCTGCGGGAACAGTTGCAGCTCCAGCAATTGATAAATGTGCACACCAGAT
TTACTTGAGGAACCTTCGCTGAAGGTTGCCCATTCTTCCGATCGGTCGAAAACCTATCTC
TGGTGATGTCGCTAAGAGAATGTTTTCGGTTGTGTGTATGTAAAATTGTAGATAGCCCTT
GAACAAGACATGTAAATTTATGGCTGATTATTTTTCATACATTGAGATTCTCATTATATTCT
GAACAACTAGTTTTCATTAGCAAGATAAATGCTAGATGCTTTGAACTGGCATAGACTGCT
AACATCCCCGGCCTTCTTAATTTCTAGGCGATATATATCACCGTAACTTTGGATTTGTAAA
GAATATAGGGGATCATTAGCTTTATGTAATTATATTTTCAAGTATGATTCTTCATTCAACAA
TCTTTGTTGATGGATACTTTGGTTCATAACATCTCTATAAGTGAATATGGAAGCCAATAGG
CTTGTAGTTCAAATGTAATTTTGCCAGTCTCGAAAATTAAATAGGCC
SEQ ID NO:139
CTAGAACAGTGAAGAAATTTGCGAGCACAAGATCCCTCGTTTCTGTGGTCGTTGTGATTT
CAGTTTATTTGGTTGCCAGAGCTGCCACCCCTGGGGCTCTGGGATACTGCCTGATTCAA
CCTTGTTATACAGGGTTTTATAACTTGTTTCTTGCGATTAGCCAGCAAATTTAATTTGCTTA
AATGTTCCCCAACAAGCAAACCCAAGGGCTTGTACAGAACAAGAAGATGGCTTCTAAGG
CAGCCCAACCAAAAGCAATGGTCCCTCCCCAGAGGGTTCCTCCAGCTGCAAACAATAGG
AGGGCTCTGGGAGATATTGGTAACATTGTGGCAGATGTCGGTGGTAAATGCAATGTCAC
CAAGGACGGAGTCAATGGGAAACCTCTTGCTCAGGTCTCTCGCCCTATTACAAGGAGTT
TTGGTGCTCAGTTACTTGCACAGGCAGCTGCAAACAAGGGGATCTCCGCTGCCAATAAC
CAAACTCAGGTACCGGTTGTGATTCCAAAGGCAGATGTGCGGGGAAACAAGCAAAGGA
GGACTAGCAAGAGCAAGGACATTCCTCCGACAACTGTAGTTACCAATGAATCAGATGATT
GTGTTATTATTGAACAAGCACAGAGAATAAAACCAACTTGTAATCACAATGTTGGAGCTG
TAGGGAACAAAGAGAAACCTCAATTACTAACAGCAAAGCCTAAATCACTAACGGCATCAC
TTACATCTAGAAGTGCAGTTGCGTTACGTGGGTTTAGATTTGATGATGAAATGACTGAGG
CAGAGGAGGACCCCTTGCCAAATATTGATGTGGGGGACCGCGATAACCAGTTAGCGGT
GGTTGAATATGTTGAGGATATATACAAGTTCTATCGTAGAACAGAGCAAATGAGCTGTGT
GCCAGATTATATGCCCAGACAACAAGAGATTAATCCCAAAATGAGAGCAGTACTAATAAA
TTGGTTAATTGAGGTACATTACAGATTTGGGTTGATGCCAGAGACTTTGTATCTTACCACA
AATCTTATTGATAGATATCTTGCTACTCAACTTGTATCGAGGAGCAACTATCAATTAGTGG
GTGCCACTGCAATGCTTCTAGCTTCAAAGTATGAGGAAATTTGGGCTCCAGAGATGAATG
ACTTTCTTGACATCTTAGAAAATAAATTTGAGAGGAAACATGTACTGGTTATGGAGAAGG
CAATGCTAAACAAACTGAAATTCCATCTCACAGTCCCAACTCCATATGTATTCCTTGTCCG
GTTCCTCAAAGCTGCTGCCTCTGATGAAGAGATGGAAAATTTGGTATTCTTCTTGATGGA
ACTGAGCTTGATGCAATATGTGATGATAAAGTTCCCCCCATCAATGCTCGCTGCTGCTGC
AGTCTACACCGCTCAAATCACTTTGAAAAAGACAACTGTCTGGAACGATGTACTTAAACG
CCATACAGGCTACTCTGAAATAGACCTCAAGGAGTGTACAAGACTGATGGTGGCATTCC
ATCAGAGTTCAGAAGAAAGCAAATTGAATGTAGTTTTTAAAAAGTACTCAATGCCAGAGTA
TGACTCCGTGGCACTTATCAAACCTGCCAAGCTCCCTGCATAGGCTTTCGATTGGAGTT
GCAGTTGCAGAGTGTGTAGCAACTGATGAGCATAGTTGTTATGTTTCTCAACTCAGTTGC
ACATCCAGCAACTGAGATTGATATGTCAATCAAAATTTTTAACATAGAAGATCATTCATTC
TCTTGATATTCCCCTTTAGGTGGATGGATTTTTTGGTCATTCTTTGTAACTGTACATAGTT
GTGGCAAAATCATGATTAAGAGAAGATATAGCCCACTTCATTTTTGGCATAGGCCTGCCC
ACATGGGAAGTGGTGGCTTAAAGCCAAGGGATTATTACTTGTCAATTGTAATAAATGCTT
GAGGGCATTTATGTGTTTTTTAATTTAAAAATTTGAATTATAGAACACTGAAATTGATTTCT
CAAAAAAAAAA
SEQ ID NO:140
GGGCTTTTGGGATTCGGAGATGTGAAGAAAATGAGGTGGGGAGGGAGTAGATAACCAG
AAGAAGATGAACAAGATCTGGATTTGATACCAGCCAGCCATGGCGCCAAGCTTCGATTG
TGTGGCCAATGCTTACATCGAATCCTGTGAAGACCAGGAGAAGCTCAGACAAAATGCCC
AAATCTTGGCCCAGTCTGGTGAGAACGATGTTGATGAACCAGTCTCCATGTTGGTACAG
AGAGAGACCCATTACATGCTTCCAGAGGACTATTTGCAGAGGCTCCGGAATCGAACTCT
CGACGTCAATGTTCGTCGAGAGGCCGTGGGCTGGATTCTGAAGGTGCATTCTTTCTACA
ATTTCGGAGCTCCAACGGCCTACCTGGCTGTCAATTACCTGGATCGATTTCTCTCGAGG
CACAGAATGCCGCAAGGTGTGAAGGCTTGGATGATTCAGCTCATGGCCGTGGCCTGCC
TGTCTCTCGCGGCTAAGATGGAGGAGACCCAGGTTCCACTCCCTTCGGACTTGCAGAG
GGAGGACGCCAGATTCATCTTTGATGCTCGCACTATTCAAAGAATGGAGCTTCTGATCTT
GAGTACATTGCAATGGGGAATGCGCTCCATCACGCCATTTTCATTCATCGATTACTTTGC
TTACAGGGCTGTTCAAGGTCATGGCCATGGCCATGATGCTACTCCAAAAGCTGTGATGT
CCAGAGCCATTGAACTCATTCTCAGTACCACAGAGGAAATAGATTTCATGGAGTATAGGC
CATCGGCAATAGCTGCAGCAGCCCTGTTATGTGCAGCTGAAGAGGTGGTGCCTCTGCAA
GCAGTCCATTACAAAAGGGCTTTGTCCTCCTCCATTACTGATGTTGATAAAGACAAAATG
TTTGGATGTTACAATCTGATTCAAGAGACCATCATAGAAGGAGGCTGCTACTGGACTCCA
ATGTCCCTACAATCCACTGAGAAAACTCCAGTGGGAGTGCTGGATGCAGCAGCCTGTTT
AAGCAATACACCCACCTCAAGCTACAGCGTGAAGCCATATGCCTCTGTTACAGCAGCCA
AGAGGAGAAAATTGAATGAAATCTGCAGTGCATTGTTGGTATCTCAAGCCCATCCCTGTT
GACACAGTACGAGGTCCCCAGCTTCGGTTGTCGAGATTGGAGAGTGGGATTTGGTAGG
GTTTTAGTGGAGCAGCAGATGCAAAACCCTATATAGATATATAATCGACTGTAAATGGGG
TATAGAGTACTTAGCCTGCAAATTGCTGTCCCTTCCTCGCCTATTTGACCAACTGGTTTG
GCCCAGACTGGATTTTGTCTTGTTTTAGAGAAGGGTTTAATTCAGTTTGAATAATGGAGG
ATGATGATGACAGATCACCCATAATATCAGTGAAGAAACTCATACACTGGACAGGCCAAC
CTTCCAAATATGTGTTTAGAAAACCTTTGTCTTCTTTTTATGTCATACACTCATCCAATCAA
ACAGGCCCACCTTTCCATAAAAAAAAAAAAAAAAAA
SEQ ID NO:141
GTTTATGCAATTCCAATATTTATCACATTCACCTCATATCATATATTTGCGTATTGTGACTT
CTATACAACAAAAAACAGGTACCAGAGCAGGCACGAATAACCTGCAGGTCTATCTATCTT
GTTTTTTTTAAGACGGGGTTGCAGAATGGCAGCAAATTTTTGGACGTCTTCTCACTGCAA
GGAACTTCTGGACGCGGAGAAGGTAGGAATCGTCCATCCGCTCGATAAGGACCAGGGG
CTTACTCAAGAAGATGTCAAAATTATAAAGATCAATATGTCAAATTGTATACGAACATTGG
CTCAATATGTGAAGCTGCGACAAAGGGTAGTGGCAACAGCTATTACATATTGCAGACGT
GTTTATACCAGAAAGAGCTTCACAGAATACGATCCGCAGTTGGTAGCTCCTACTTGCCTG
TACCTAGCATCCAAAGCAGAAGAAAGCACTGTTCAAGCTAAACTAGTAATCTTCTATATG
AAAAAATACAGTAAGCATCGTTATGAAATCAAGGATATGCTCGAGATGGAAATGAAGCTT
CTCGAAGCTCTTGATTACTATTTAGTCATCTACCATCCATACCGTCCTCTAATTCAGTTCT
TGCAAGATGCAGGTTTGAACGATTTGAAAGTTACGGCTTGGGCTCTAGTAAATGATACAT
ACAGGACGGATCTGATTTTGACTTATCCCCCTTATATGATAGCCCTGGCATGCATCTATT
TTGCATGCATTATGGAAGAAAAGGATGCACAGGCATGGTTTGAGGAACTTCGGGTTGAC
ATGAATGAGATCAAGAACATATCAATGGAAATAGTGGACTACTATGATAACTACCGAGTC
ATCCCAGACGAAAAGATGAATTCAGCTTTAAATAAACTCCCACACAGATTTTAAAGTATCA
AGCATACCTTTGTATTATGGTGAAACCCATTTGAAAGCATTCTATTTCTTTTTCTCATTTCC
ATTGTCATGGGAAATATAGAAGACTCGGGCTCTTGTAAAAGTTGGGAAGGGGTGCTATC
CATATCTAGAATCTACCATGCTCAATGAGGTATCTTCATTAGTATACTAATTGAATTCCAG
TTCATCCCTGTGTTATTCAAGCTTTATGGTAATTTAAAATTTTACTGCGTAGTGGAAGCTTG
AAAGCCTAGTTATTCAAGTGACAATATTTAAAAAGGACAATCTTATGTATTTTGTATTTCTG
TATTTTACATATAAGAACACATAATCTATAATCATCAGATAAAATATTATTGCGTTGAACAT
TAAAAAAAAAAA
SEQ ID NO:142
CTGGAGTTGTCTGCATTCTCGGAGCTTCAATCAATCCATGCTCCATCTCCTGTGCAGTTT
GAAGTCTGTGAAACCATCTGTTGAATGTGGGATTCAATAGGGCATTTGGGCACTTGTTCT
GCTAATTTGGGGGGGCCTTGGCGGCGGTGTTATGGAGCGTAAACTGATGTGCAAAGGG
CCTTAAACTATGCTGCCATTCCTGTATTGTTAGCACACTGAAAGGGGTTTTCATGGGTTTT
GATTAGATGAGTAGCAGTTGTAAAACATTGGTCTGTGCTGCTAAATTATGGCCCCTGCAC
TATCCTCCAGCTATGAATGCCTGTCCCATTTGCTCTGTGCGGAAGATGCCAGCAATGTAG
TGGGCTGCTGGGATGAGGATGAGAGTAAAATCTTTTGTGAGGAAGAAGAGGGTTTTGGG
ATTCAGCATTTCCCTGATTTCCCAGTTCCAGATGATGATGAAATAAGGGTCTTAGTGAGG
AAGGAAAGCCAGTATATGCCTGGGAAGTCCTATGTACAGTCCTATCAGAACCTCGGACT
GGATTTCACCGCAAGGCAAAACGCCATCGGATGGATTCTCAAGGTCCATGGCTCTTACA
ACTTTGGTCCACTGACTGCCTATCTGTCTATAAACTATCTAGATCGGTTTCTGTCCAGGA
ATCCACTACCGAAAGCCAAGGTCTGGATGCTGCAGCTCCTATCTGTGGCTTGCTTGTCT
CTGGCTGCTAAAATGGAGGAAACCCAAGTTCCTTTACTCCTGGACTTGCAGGCTGAAGA
GCCCGACTTCCTCTTCGAGCCGCGTACTATCCAAAGAATGGAGCTTCTGGTTCTGAGCA
CTCTGGAATGGCGGATGCTGTCTGTTACACCGTTTTCGTTCGTCGATTACTTCTTGCAGG
GTGGAGGGGGCAGGAAGCCACCGCCGAGAGCTATGGTGGCGCGAGCCAACGAGCTCA
TATTCAACACACACACAGTGTTGGATTTCCTTGAGCACCGACCATCTGCCATAGCTGCTG
CAGCTGTTATCTGTGCAGCCGAGGAGGTTTTGCCCCTGGAAGCAGCCCAGTACAAGGA
GACCATCCTCTCCTGCTCTCTTGTAGACAAGGAATGGGTGTTCGGCTCTTATAATCTGAT
ACAAGAAGTCCTGATTGAGAAATTCTCGACGCCTAAGAAGGCAAAATCAGCATCCTCCTC
AATCCCGCAGAGCCCTGTTGGAGTGCTGGACGCCTTCTGTTTGAGCAACAACAGCAACA
ATACTTCACTGGAAGCTAGCTTAAGCGTTAACCTCTACGCCTCTGTGGCTGCCAAACGCA
GGAAGCTCAACGATTACTGCAATACATGGCGAATGTTTCAGCACAGCACCTGTTAATCAA
ACATGAGATTGAAGGCCAAGGACTCAAAAACCAGTCCAGCATTTGTGTGCTGTTTTAGAG
GGTTTTAATACTTCAAGTCCTCTGTAGAATTTGATACCTCTGGACGCTACAGTGATCAGG
GATTTGATCTAATGCTAGTTTATTGATTTCTATGATCCAAGACCTCGTCATAGATCAAGTG
CCTAGTTTATTGATTTTTATGACACAGTCGAAGCTTTTCTAAGCTTTTCCGAGACTCCTAA
CATATGTAACAACAGAATTCGTGAGCCAAGAGAATTTGTTGAATTCGTTCTCTATACTTAT
TTTGTAACTCTGGACGGACCAGTTAAATTATAAATCCTGGAAGTTTTGATCAATTATATTC
AAAATCAAAAAAATCCTGCGTTGAAGGTTCTGTGAAAAAAAAAA
SEQ ID NO:143
GCTAACCTGTAGATATCCTTTGCTTCCCATTCCCTCTCATGGATTTGGATTTCCCCTGATT
TGATTCAGGACAAAAGCCCTTGTGGCTCCCCTACAAATATGTATTTGCATATCTGAAATTT
CCCTGGCCATAATAGCTGAGAGGCTTTTTCTTTTTTTTCTTTCATTCAATTTCATGGCGGG
GATTCCTGAGTTGGGTTGTTGATGAATAACAGGATCAGCAGCTCAGGGGCCCGGGGCTT
GTAGGGGGATGGCACCCAACTGCATAGACTGTGCCCCTAGTGATCTGTTTTGCGCGGA
GGATGCTTTTGGAGTTGTGGAATGGGGCGATGCAGAGACTGGAAGTTTGTATGGAGATG
AGGATCAGCTGCATTATAATTTGGACATTTGTGACCAGCATGATGAGCATTTGTGGGATG
ACGGTGAACTTGTAGCTTTTGCAGAAAAAGAGACCCTCTATGTTCCTAACCCAGTTGAGA
AAAACAGTGCTGAAGCTAAAGCTAGGCAGGATGCTGTGGATTGGATTTTGAAGGTTCAT
GCACATTATGGCTTTGGTCCTGTGACTGCAGTGCTCTCAATAAACTATCTTGATCGGTTT
TTGTCTGCAAATCAATTACAGCAAGATAAGCCATGGATGACTCAACTGGCAGCTGTGGCT
TGCCTCTCCCTCGCTGCCAAGATGGATGAGACAGAGGTTCCCCTTCTCCTGGACTTTCA
GGTTGAGGAGGCTAAGTATATATTCGAATCTCGCACCATTCAGAGAATGGAATTACTGGT
GCTTAGTACCCTTGAATGGCGAATGAGTCCTGTGACACCTCTTTCCTACATTGATCATGC
CAGTCGTATGATTGGGTTGGAGAATCACCATTGTTGGATTTTCACAATGCGCTGCAAGGA
GATACTGTTGAATACACTCAGAGATGCAAAGTTTTTGGGCCTTCTGCCCTCTGTTGTAGC
TGCTGCAATAATGCTGCATGTGATCAAGGAAACAGAGCTTGTTAATCCATGTGAGTACGA
GAATCGCCTGCTCAGTGCCATGAAAGTTAACAAGGACATGTGTGAAAGATGCATAGGAC
TACTCATAGCCCCTGAATCATCATCCTTGGGCAGTTTCTCTTTGGGTTTGAAAAGAAAGA
GCAGCACCATCAATATTCCTGTTCCTGGCAGCCCAGATGGAGTGCTGGACGCTACCTTT
AGCTGCAGCAGCAGCAGCTGTGGTAGCGGACAGAGCACCCCAGGGTCATATGATTCCA
ATAACTCCAGCATTCTTTGCATCTCACCAGCGGTGATAAAGAAGAGAAAGCTTAATTATG
AGTTTTGTAGCGATCTTCATTGTTTGGAGGATTAGTAGATTACCATAAATACAAGTGCTCC
TCTATGAAGAACGCTTTGGAAGCAAGCTTTGCTATTAAAACTTCATTTCAACTATACCTAT
CCAGAATTAAGTGCAAGTTCAGGCCTATAATAGGCAACTATTATAGCTCTATGCCATTAT
GTCTCGCATGGGAGAAACATATTGTTATTAAATACCATTCAATATGCTTATGATTCATGAA
TGCTTAAGAGATTCTGCTGCTGTGTGAAAAAAAAAA
SEQ ID NO:144
TTCTGCCCTTGCGAGCTGCTTAGTTTATTCCTCGTTAACTTCGTCCTTTCGACCTTTTACG
CGCCGTCTCGTTCGCCCAATCTCAGGTTCTCGCTCACTAATTTTCGTTTCAAAGGGTTTT
ATCAGAGCTTTCCACCGCTGTTTGCAGGGCATTTCAAGTTCTCAGGGCGGAATTCGACC
TGTCTGTTATCCGCCAGATCATTCAAGCCTTTAGTTCGCTCGACCAGGATGCCTCAAATT
CAGTACTCAGAGAAATACACTGATGATACCTATGAATACAGACATGTGGTTCTCCCTCCG
GAAACTGCCAAATTGCTTCCCAAGAACCGACTTCTCAATGAGAATGAATGGCGAGCCATT
GGAGTTCAGCAGTCTCGTGGATGGGTGCACTATGCCATTCATCGTCCTGAGCCACACAT
CATGTTATTCAGAAGACCTTTGAATTACCAGCAAAACCAGCAGCAACAGGCTGGGGCTC
AATCTCAACCTATGGGTTTGAAAGCCCAGTGAGTTTTATTGTGGGTTGTTGAAAGCAGTT
TCAATGTTCTGTTTGAAACTAATCAGAATAGGTTCTCCAGGGTGTTTGACTTTTTCCTTTG
CAGGTAGTTCTGCAGTTTTAGTATATTAGGGTGATGACTTTCTTTATCAAGGCTAGTCTGT
TGTTTAGTTAATACGGTTGACAATGAATGTCTAGTACATATTTTTGTGAACTATTATGAACT
ATTGCTTCTAAACTGTAGAAGCCTGTTATCTTTAGACTCGTGGTTATGTGAACTACTTTTA
CAGTAAA
SEQ ID NO:145
GGCGTGTCAATGTTTCAATACCAAATAGCATTTAGAATTATACTGCAGCGTTTTATCTGAT
ACACGAAGGATTTTTAAATTTGCCTGTAAAATGGATCAAATAGAGTATTCTGAGAAATACT
ACGACGATACCTATGAGTACAGGCATGTTGAGCTTCCGCCTGATGTTGCCCGGCTACTT
CCCAAGAATCGCCTTCTAACCGAGAATGAATGGCGAGGAATCGGGGTTCAGCAGTCTCG
TGGGTGGGTGCACTATGCTATTCACTGCTCTGAACCACACATTATGTTATTCAGAAGGCC
TTTGAATTACGAGCAAAACCACCAGCACCCTGAGCCACACATTATGTTATTCAGAAGGCC
GTTGAACTGCCAGCCAAACCACCAGCCACAAGCACATCATCCAACATAGGCTGTGGGGA
TTCGAGCCTGATGGTTATGCACTGTGGCCAGCAAGATGTTGAAGTTTTAGCTGAGTAATT
TGAAAGTTCCTTTTTTCCTTTTCACCATAGCTATTATTTGTGTACGTATTTCCCAGGCTATG
TACAGATTTAAATTGAAATCTAGCCATGACTATGGGCCTTGAGATATGACTATTGTATATG
TATCCCCTATCTTGTGAATTGTGAAATTTATATGTTTTTTTCTTCTGCGAATGGTTCAAAAT
TAGACGAACAGAAATTTTGTTCCCAGTACAGTAAAGCCCAAATGCAAGCCCAAATGCAAG
AGGGCGGACGCAGGCGTTGTGGGTTACATCATGTCATGTAATTTGTCTGATCAAGTTCTA
AGGCTGGCTTGGTATCAATGAACTTTTAACTTCTAATTTTGAAGACATACATTTATTCTTTT
AATTTGATGTCTTTCTGTGAGTAAAAAA
SEQ ID NO:146
GTTTTATCTGATACACGAAGGATTTTTAAATTTGTAAGTGCTCAGTTTTTGCAGGCCTGTA
AAATGGATCAAATAGAGTATTCTGAGAAATACTACGACGATACCTATGAGTACAGGCATG
TTGAGCTTCCGCCTGATGTTGCCCGGCTACTTCCCAAGAATCGCCTTCTAACCGAGAAT
GAATGGCGAGGAATCGGGGTTCAGCAGTCTCGTGGGTGGGTGCACTATGCTATTCACT
GCTCTGAACCACACATTATGTTATTCAGAAGGCCTTTGAATTACGAGCAAAACCACCAGC
ACCCTGAGCCACACATTATGTTATTCAGAAGGCCGTTGAACTGCCAGCCAAACCACCAG
CCACAAGCACATCATCCAACATAGGCTGTGGGGATTCGAGCCTGATGGTTATGCACTGT
AAGTGATCTGATTTGATTAACTATTTTATCAATTAATTTTTCATTTGAAATTCAACCAATCAA
TTTCTCTCGACCAATTCCATCTCTGTGCCTGATTCGTAGGTGGCCAGCAAGATGTTGAAG
TTTTAGCTGAGTAATTTGAAAGTTCCTTTTTTCCTTTTCACCATAGCTATTATTTGTGTACG
TATTTCCCAGGCTATGTACAGATTTAAATTGAAATCTAGCCATGACTATGGGCCTTGAGAT
ATGACTATTGTATATGTATCCCCTATCTTGTGAATTGTGAAATTTATATGTTTTTTTCTTCT
GCGAATGGTTCAAAATTAGACGAACAGAAATTTTGTTCCCAGTACAGTAAAGCCCAAATG
CAAGCCCAAATGCAAGAGGGCGGACGCAGGCGTTGTGGGTTACATCATGTCATGTAATT
TGTCTGATCAAGTTCTAAGGCTGGCTTGGTATCAATGAACTTTTAACTTCTAATTTTGAAG
ACATACATTTATTCTTTTAAAAAAAAAAA
SEQ ID NO:147
CTTTTGCAGTTTACCGTAGTCTTTCAGCGTTTCTACTTCAAAGAGACTATTTTTGCAGGCC
GGAAAGATGCCCCAAATACAGTATTCTGAGAAATACTACGACGATACCTATGAGTACAGG
CATGTTGTGCTTCCGCCTGACGTTGCCCGACTTCTTCCCAAGAATCGCCTGTTGAACGA
GAATGAGTGGCGAGGAATTGGGGTGCAACAGTCTCGTGGGTGGGTGCACTATGCTATC
CACCGTCCTGAGCCACACATTATGCTATTCAGAAGGCATTTGAATTACCAGCAAAACCAA
CAGCAGCAGGCACAACAGCAACCAGCACAGGCTATGGGGCTTCAAGCCTGATTGGTTTT
ATGTTGTGGCAAGCGTTGACAAAATCTCCTTGTTAAGGATTCTTGGATTCTTCCCGAGCA
AGCAGGCTTTCGTACGTAGTGTTAGATGGTTGATTTTTCCATCATAAAAGTAATTAGAATT
AATGTGAGAGTCCCATTTGCATTTAAAAAATGGTCATTATCCGAGATAGTGCGCTTTGTC
ATGGGAAAATGACTATTGAATGTGAGTTTCCTTTGCCCAGGGAATATGTAAAGGGATTTT
GAATTCTCATGTGAAAACGGCTATTACATGTGAAAATGGCTGTTATATGTTCAAAGTGATT
TCGAATTGATTTTTGTTACTTCAGGAATGAAATTATTTCTATCTTTTTTTGCTTTTTTAAAAA
AAAAA
SEQ ID NO:148
GGCTACGCGAACTGAACAGTGACAAAATATCACTCGGACGCCATGCTCTGCAAGTCATT
CAAAACTGTGTAAAACCGTAAACCATCCCTCTGTCCTCAAACTGTAGGTCGCCTCGATTG
CTGTCATGGCGCTGGTAGAAACGGAGCCTGTAACCTTAATACACCCGGAGGAGCCCAAA
AAATTCAAGAAAAAGCCCACTCCGGGCCGCGGCGGTGTGATTTCCCATGGGCTAACAGA
GGAAGAGGCAAGGGTTAAAGCCATAGCCGAGATAGTCGGAGCGATGGTGGAAGGATGC
CGGAAGGGCGAGGACGTCGACCTGAACGCCCTCAAGGCAGCAGCATGCCGGCGCTAC
GGGCTATCGAGGGCGCCGAAGCTGGTGGAGATGATCGCAGCTCTGCCCGATGGCGAG
AGGGCAGCGGTGCTTCCCAAGCTCAAGGCCAAGCCGGTGCGTACGGCCTCGGGCATC
GCCGTTGTGGCTGTGATGTCGAAGCCACATCGTTGTCCGCACATCGCCACCACTGGAAA
TATATGCGTGTACTGCCCCGGGGGCCCCGACTCAGATTTCGAATACAGCACGCAGTCTT
ATACTGGCTACGAGCCCACGAGCATGCGTGCCATTCGAGCCAGATATAATCCATATGTT
CAGACAAGGAGTCGGATAGATCAATTGAAGCGACTGGGTCATACTGTGGACAAGGTGGA
GTTCATCTTGATGGGAGGGACTTTCATGTCTTTGCCAGCTGACTATCGCGATTATTTCAT
TCGAAATCTGCATGATGCTTTATCTGGTCACACCTCTTCAAATGTGGAGGAAGCTGTGTG
CTATTCAGAGCACAGTGCCACTAAATGTATTGGCTTGACTATTGAAACGAGACCTGATTA
CTGTCTTGGACCTCATCTAAGACAAATGTTATCATATGGTTGCACACGTCTAGAAATTGG
AGTCCAGAGTACTTATGAGGATGTTGCACGTGACACAAATAGGGGTCACACTGTGGCTG
CAGTTGCTGATTGCTTTTGCCTGGCAAAGGATGCTGGTTTCAAGGTGGTAGCACATATGA
TGCCTGACTTGCCAAATGTTGGTGTGGAGCGAGATATGGAAAGTTTTCGGGAGTTCTTT
GAGAACCCCGCATTTAGGGCTGATGGTCTTAAGATCTATCCCACCCTAGTAATTCGTGGA
ACTGGACTTTATGAACTCTGGAAAACTGGCAGGTACCGGAATTATCCACCAGAGCAGCT
TGTGGACATAATTGCAAGGGTTTTAGCTTTGGTACCACCCTGGACCCGAGTCTATCGTGT
TCAGAGGGACATTCCAATGCCGCTCGTAACATCTGGTGTTGAAAAGGGCAATCTTCGGG
AGCTGGCTTTGGCACGTATGGATGACTTGGGACTTAAATGTCGTGATGTTAGAACCCGT
GAGGCTGGTATTCAGGATATTCATCATAAAATAAGACCGGAAGTAGTGGAGCTTGTACGA
CGTGACTATTGTGCTAATGAAGGTTGGGAGACATTCCTTTCATATGAAGATACACGGCAG
GACATTTTAGTTGGTTTATTGCGTTTGCGCAAATGTGGCCACAACACAACATGTCCAGAG
CTCAAGGGTAGGTGTTCAATTGTTCGTGAGCTTCATGTTTATGGAACTGCCGTGCCTGTT
CATGGCCGTGATGCAGACAAATTGCAACACCAGGGTTATGGGACGCTTCTTATGGAACA
AGCAGAGAGGATTGCTTGGAAGGAACATAGGTCAATCAAAATAGCTGTCATATCAGGTG
TAGGAACACGCCATTATTATCGAAAGCTTGGGTATGAGCTTGAAGGGCCGTACATGATG
AAGTATTTGAACTAGTTGTACATAGTAAGAGTGAAATCTTGTTTCATATCATTGGATATCG
TCCCTTCTTGGGTGCAATATGCATTATTTTCTGGTGCATCCTTAACACAGCTTGGTTACAT
GGTGAATTACAGTATTTGAAGGAGTCAAGAATGTTCTACCTGTTCCTGATTACAAGATTTA
GATGCTTTATGGAAAGCAGTATCATTTAGATGTAAAAGGATTGAGGCCAATTATACCTTGT
AACATAACATGTTATTTTTGACATTTATTCCTAATAGATTAATTTTCCTGCATTTAATATATA
TATATATATAATTTTATGAGATTTTTCACTTTCTTCAAAAAAAAAA
SEQ ID NO:149
CTCTCCTGAAAGCAAACATCTTGAAAGCAAACATCTCGGAAGAAGCCAAGCTCAATTGGA
TGTACACAGATGCTCGGATTCAGGGATTTATATACATCTATATGCGAGCATCTGCAGAGA
GCCTCGGGCAGACTCCCAATTATTGCAGCGGCCACTAGTTTGATCAGCACCCCTGAAAT
AGCAGCAGTAGAAAAAGAAAACAAGGCGCCAAATTCTGTTGACAAGATGGGCATGGGCT
CTGCGGACGAAAGTGGGAGGTTCAGTACAAGCAATGGCCAATTTATGAATATGAACAAT
GGGGTTGTCAAGGAGGAGTGGAAAAGGCGGGGTTCCAGTTGTCCCGTCAGCGCCCACGA
CAGTCCCTGTAATCACAAACGTTAAACTCGAAACCCCATCGTCGCCCGATCATGACATG
GCGAGGAAGAGGAAATTAGGGTTCCTTCCGCTGGAGGTCGGTACTCGGGTGCTCTGCA
AATGGCGAGACGGCAAATTCCACCCTGTCAAAATCATCGAGCGTCGCAAGCTGCCCAAC
GGAGCCACCAACGACTACGAGTATTATGTCCATTACACTGAATTCAATAGACGACTTGAT
GAATGGGTGAAGCTTGAACAACTTGAACTGGACTCTGTGGAGACTGATGCGGATGAGAA
GGTTGATGACAAGGCAGGAAGCTTGAAAATGACCCGCCATCAAAAAAGAAAGATAGATG
AAACTCATGTGGAGGGTAATGAAGAGCTTGATGCCGCGAGTCTTCGTGAGCACGAAGAA
TTTACAAAAGTGAAGAACATTACAAAAATAGAGCTTGGAAGATATGAGATTGAGACATGG
TACTTTTCACCATTTCCATCTGAATACAATAATTGCGAGAAGCTGTATTTCTGTGAATTTT
GCCTCAATTTTATGAAGCGGAAGGAGCAACTTCAACGACATATGAGGAAGTGTGATCTTA
AACATCCACCTGGAGATGAGATTTATCGAAGTGGAACACTTTCTATGTTTGAGGTCGATG
GAAAGAAGAACAAAGTGTATGCACAGAATCTCTGCTATTTGGCCAAGTTGTTCTTAGACC
ACAAGACACTTTACTACGATGTGGATTTGTTCCTATTTTATATTCTTTGTGAATGTGATGA
GCGGGGATGTCACATGGTTGGATATTTTTCAAAGGAGAAGCATTCCGAGGAGTCATATA
ATTTGGCATGTATACTTACACTTCCTCCATATCAACGGAAAGGATATGGAAAGTTCCTAAT
TTCCTTTTCATATGAGTTATCCAAGAAGGAAGGAAAAGTTGGGACCCCAGAGCGCCCCTT
GTCTGATCTTGGATTATTAAGCTACAGAGGTTACTGGACAAGGGTCCTGCTGGACATTTT
GAAAAAACACAAGAGCAATATTTCCATGAAGGAACTTAGTGATATGACAGCTATCAAGGC
AGACGATGTGCTCAGCACTTTACAAGGCTTAGATCTTATTCAGTATCGGAAAGGACAGCA
TGCTATCTGTGCAGATCCTAAGGTTTTGGATCGACACTTAAAAGCTGTAGGGCGAGGTG
GTTTGGAAGTTGATGTTTGTAAACTTATTTGGACCCCCTACAAGGAACAATGATGAAATAT
CTGTTTTTTTTCTTGTTTTTTTTTTCCTGTTCAATATGCCCCTTTCTAAATGAAGGGCACTG
TAATATTATCGATTATTCCTTGTTTTGTACCCCTAGAATCTGGGTAGGTGATCAGTAAACG
TTGTACATAGATGGTGGCGACGTTTTTTATTTGAAATTTTAGATTTTTTGAAAGAAGCAAC
TGGAATTTATCGAGAATAGGAATCATCCAGATTTAATGCCACTTAGGTGATCGGTGACCC
ACTTGTACATATAGATGTTGGCGATGTTTTTTTCCTGGAAGATGCAGCTGGAATTATAAAG
GACGGGATTCATCCACAAAAAAAAAA
SEQ ID NO:150
ATTTATGTGTGCACGCTAAGTTCCAACGATACAGTCCTTTGAAGCACGTGGAAAGAAACT
CGGCCTCTGCAGTGGACATAAACCCAAACCCTAGTGAGTTTTAGGGTTTCATGTAGAGA
AGGTTTGAAATCTGTTGGCGATGGGATCCTTGGATGAATCCACTTGCAGCGAAGAGATC
AGAGATGAAGGCAAGGATTCAATTAGGACTAAATTTAAAGTCGAGTCCACTGTAAATAAT
GCACAAAACGGAGGCAATGACAATTCAAAGAAGAAAAGAGCGGCCGGCCTTCCCCTTGA
AGTTGGCATTCGTCTCCTCTGCAAGTGGAGGGATTCGAAACTGCACCCTGTTAAAATCAT
TGAGCGTCGTAAGCTTCCTAATGGGTTTCCTCAAGATTACGAGTATTACGTTCACTACAC
TGAATTCAATAGAAGGCTTGATGAGTGGGTGAAACTTGAGCAATTTGAACTTGATTCTGT
GGAGACTGATGCTGATGAGAAAATTGAGGACAAGGGAGGAAGCTTGAAAATGACTCGCC
ACCAGAAACGCAAAATTGATGAAATCCACGTTGAAGAGGGTCAGGGTCATGAGGATTTT
GATCCTGCTAGCCTTCGAGAGCATGAGGAGTTTACGAAAGTTAAGAACATAGCAAAGGT
AGAGCTTGGGAGGTATGAGATTGAGACGTGGTACTTTTCACCTTTCCCTCCTGAATACAG
CCATTGTGAGAAGTTATTCTTTTGCGAATTTTGTCTCAATTTCATGAAGAGGAAAGAACAG
CTTCAAAGACATATGAGGAAGTGTGATCTGAAGCATCCACCTGGAGATGAAATATATCGC
AATGGAACCCTCTCCATGTTTGAGGTTGATGGAAAGAAGAACAAGATATATGGGCAGAA
CCTCTGCTATCTGGCAAAGTTATTCCTCGATCACAAGACACTTTATTATGATGTTGATTTG
TTCCTTTTTTACGTTCTTTGTGAATGCGATGATCGTGGATGCCATGTTGTGGGATACTTTT
CAAAGGAGAAGCATTCTGATGAAGCATATAATTTGGCCTGCATTCTTACTCTTCCCCCAT
ATCAGAGAAAAGGATATGGAAAGTTCCTCATAGCTTTTTCATATGAATTGTCCAAAAAAGA
AGGGAAGGTTGGGACCCCAGAGCGCCCTCTGTCTGATCTTGGGTTATTAAGTTACAGAG
GATACTGGACAAGGATCCTTCTTGACATTCTTAAGAAACAAAGGGGTAACATTTCAATCA
AGGAACTTAGTGACATGACAGCTATCAAGGTAGAAGATGTTATCAGTACACTGCAAGTGT
TGGATCTTATTCAGTATCGGAAAGGACAGCATGTTATATGTGCCGATCCGAAGGTTTTAG
ATCGTCACCTGAAAGCCGCTGGAATAGCTGGGCTGGAAGTTGATGTATCTAAGCTTATCT
GGACTCCCTACAAAGAGCAATGTGGATGATTAATTTTTTAGACATGTTTCTTCATGGTTAT
GCTGAGCGAAATGAGAAAATCTAGCAATTCCATTCTTTTTTGCATCCTTTGGAAAGGTAG
CTAGAGAAAATTGCTCCTGGTGAAGAGGCAGTTGTATATTTGTTTCCATGGTGGTTTTGA
TATTCTTTTTCAAATCTGAGAACATCTCTTACTCATTATTGCAAGTTTAAGGTTTCATCA
TGCATCCCTGCGTGCAACAAGTTGTGATAATTTAAGGCTGAATGAACAGCATTCCAGCAC
ACAAGTGTCACTTAAAGAAATTCATCAATTCTTTGAAATTATTGTTCCCTTTTGATGCGGC
CCCTTTCTGGAGGGGCAATCAATGTCTGCGTTGCCACTCTTTTGATGAGGATTGTAAAGG
AACGGTACCATTGCCAATAAAGTAATGAAAATTTTCTAACGAAAAAAAAAA
SEQ ID NO:151
ACGACTCCCCGTTCTTCGTCAGATTTACTTCATGGTTTTTCTCAAATCCATCCGAAGCAGT
AGCTGCTTATCAGTCTGCAACAAACCCTAAGAGAGAGAAAAATCGTTATTGGTTCTCATGG
CCTTAGTGGACCTTGCAATTTACTGCAAACGTATTTCCGGGTTTGTTCTGCATCGATTATT
GAGGGATACCATCTGCGGGTTTAATTTATTTTAGTCAGGGATTTAGCTGGTTTGTTTTAAG
CTGGGGTTTCTAGTTTAGTAGGGTTATTAATGGATCCGGTGTGAGATATTTGTTTACTTTT
AAGGGCTGTTACGATGGTTTAGGAGCCGAGCTACTGCTTGAACTATGGCTTAGGTGTTG
TGGAAAATGATATACATCATATTTCGTCAGTTTTTTAAGAAAGAAGTGAAGAACCAAGTAT
TTTGAAGTTGTCATATTATATAATACACTCTATAACAAGTTAAAATCCATTGGATTTGGCGT
GAGCTCCTCTGGCAGTGGAAATTGATGTCATCTTTAATGACAGTTGTTTTAGTAATGAAG
CTGTTAGGGTGTTTATAGGTTTTGGAGTCAAGGAACTGGGGTACTCAGTTGGTATCGTG
CTTTTCGTTGAAGATTTTGTGTATGGCATCTGCACCTATGGTTGGGTGTGATGATTCTCG
AGACAAACATCGTTGGGTAGAGAGCAAAGTTTACATGAGAAAGGGTCATGGCAAAGGAT
CTAAAGGAAATGCTGGTTTTAATGCTACAAATTCGACAGCACAAGTGCGGAGAGAGAAT
GATAACATGGGCAATTCGATTGCTGATAATGGAAAATCTGAAGCAGCATCGGAGGGACT
GTCGAGCCTTAGCAGGAAGCAAATAACTGTAAATCAGGATCATCCTCCGAATGAGACTA
GTAGCATGCCTGCTGTTGGAGGATTGCAGAATATAGATACTCATGTTACTTTTAAGTTGG
AGGGGTGCTCCAAGCAGGAGATATGGGAACTTCGCAAGAAATTGACTAACGAATTAGAA
CAGGTTAGGGGCACATTCAAGAAACTTGAAGCTAGAGAATTGCAGTTGAGGGGATATTC
TGTTTCGGCAGGAGTAAACACTAGTTACAGTGCCTCTCAGTTCTCTGGAAATGACATGAG
GAACAATGGCGGCAAGGAGGTCACATCTGAAGTGGCATCTGGTGGAGCGATCACGCCT
AAGCAGGCTCAACGAGAGTCTAATCCCCCTCGTCAGTTGAGCATTTCTCTCATGGAAAAC
AATCAAGCTGCAAGCGACATGGGGGAAAAGGGGAAGCGCACACCAAAGGCCAACCAGT
ATTACCGCAATTCGGAGTTTGTTTTGGGAAAAGATAAATTCCCACCTGCAGAGAGTAAGA
AATCTAAATCTACTGGTAACAAAAAGATTTCACAATCCAAAGTATTTTCTAAAGAAACAAT
GCAGGTAGGAAAAGAGTTCATGCCGCAGAAGTCTGTGAATGAGGTCTTCAAACAGTGCA
GCTTGTTGCTTACAAAGCTGATGAAACACAAGTATGGTTGGGTGTTTAACTTACCAGTTG
ATGCACAGGCGCTGGGACTTCATGATTATCATACCATAATAAAAAGGCCAATGGACCTTG
GTACTGTAAAGTCCAAGCTGGAGAAAAACCTATACAACTCACCTGCATCATTTGCCGAAG
ATGTGAAACTGACATTTTCTAATGCCATGACGTATAACCCAAAAGGGCATGAAGTTCATA
CAATGGCTGAGCAGTTGCTGCAATTATTTGAGGAGCGCTGGAAGACTATATATGAAGAA
CACCTTGATGGTAAGATGAGGTTTGGAAGTGGGCAGGGACTTGGGGCAAGCAGCAGTA
CTAAGAAACTGCCTTTCCAAGACTCGAAGAAGAATATAAAGAAATCAGAACCTGCTGGAG
GGCCATCTCCCCCAAAACCCAAATCCACAAATCACCATGCCAGTAGAACCCCAAGTGCC
AAAAAACCTAAGGCCAAAGATCCCCATAAACGTGACATGACTTATGAAGAAAAGCAGAAG
CTAAGTACAAACCTTCAAAATTTACCTCAAGAGAGATTGGAACTTATAGTTCAGATCATAA
AGAAGAGGAACCCATCTCTATGCCAGCATGATGAAGAGATAGAGGTCGACATTGACAGT
TTCGACACTGAGACTCTTTGGGAACTTGATAGATTTGTCACCAACTACAAGAAAGCTTG
AGCAAGAATAAGAAAAAAGCGTTGCTTGCTGATCAGGCTAAAAGAGCCAGTGAACATGG
TTCAGCCAGAAATAAGCATCCGATGATTGGAAGAGAGCTACCAATGAACAATAAAAAGG
GAGAACAAGGTGAGAAGGTGGTTGAAATAGATCACATGCCACCAGTAAACCCGCCCGTT
GTTGAGGTCGAGAAGGATGGTGTTTATGCAAAAAGATCGAGCAGCTCTAGTAGCTCCAG
TAGTGATTCTGGGTCTTCTTCCAGTGACTCTGATTCTGGGAGTTCCTCAGGAAGTGAATC
CGATGCCTATGCTGCAACTTCACCTCCTGCTGGTTCGAACACATCAGCTAGGGGTTGAT
TCTGAGAATACTGTGGCAGAAGGTGAAGATGGTTTATCTTATTTGGAGTCAAATGCAGAT
GTTTGGTTTTCATTCGATGGAACCCAATTGCTTCAGAGGCGACAATGTAGGAATGCAATG
CATGTATTCTGTTTATTCACCATTGACTGTGGCATTGTTATAACCAAAAAGAGAGTAGTAT
ATTTTTAATGCAACCACATTCCTCTGGTAAGCTGCTTTCTCGATGGTCATATTATAGGATA
TGATCAGAAATCCCATCTGTATATTTTGAAAGGACATTGGTAAGAGTGTAAGAGAGACCA
CTATTGATGCTACTATAAAAAGGGATTATCTTTAGGCAGTTGCACTCTGCAGGTATGTTCT
TACCATCGTAGCCACAGGATAGGAACTACTGACTTAGTGCAAAAGACATGATAGACAGTA
AAAGAGCAGCTAAGCTCATACAATCCTATCGTGTCCTTTTTTATTGTGGCTGTATTGAAGA
AAGTAGATTTTAGCCCATTGTTCTGTACTTTGAATTATTGATTCAAAGGGTGTATCATAGG
ACTTGCTGTGACAAGTTTTGACAGCCCGTCTTATAAACTGTTTAGATGTAAAGTATATTTT
AGCCGCTGTTGTTGTAAATTTATGTTTTTCATTGCTATCAACATTTAGTTTTCAAAAAAAAA
A
SEQ ID NO:152
GCTGCGATTGAAATTTAATCAGAAGTGAAATTGAAATTTAATCAGAACCCAAAGATGGAG
GGCCACTCGGGAGCTCTGGGATTTGGGCAAGGCTTCTCCAGGAGCTCTCAGTCTCCAA
ACCTTTCGCCCTCACCCTCGCACTCCGCCTCGGCATCAGTGACGAGCTCGGGCCAAAA
GCGCAAACGCAATGAAGTTGAGCACGCAGGGGTCGCTTCTAATTCCACGGGAATGTTCG
CAGTGCCCCCTTCCCACATATACTCCCACTTGCATCCCATGTCGATGTCGATGCCCATG
CCAATGCACAATTCGCACCCCTCCTCCCTCTCTGAGAGTCGAGACGGTGCCCTCACCTC
CAATGACGACGATGACAACCTCACCGGGGGCAACCAGTCGCAGCTGGACAGCATGAGC
GCAGGCAACACCGACGGCAGAGAGGATTTCGACGACGAAGATGACGATGATGACGATG
AAGAAGATGACGATGAAGTAGAAGGAGACGAAGAGGATCAAGATCATGACCCTGACGCC
GATGACGACTCGGACGATGGCCACGATTCCATGCGTACCTTCACCGCTGCTCGCCTGG
ACAATGGCGCTCCTAATTCCCGCAATTTAAAACCCAAGGCTGATGCTGCTGGAGTTGCA
ATTGCTCCCACTGTTAAAACTGAGCCAATTCTCGACACAGTCAAGGAAGAAAAAGTTAGC
GGTAACAACAACAACAACAGTGTCTCCGCTAACAATGCCCAAGTTGCCCCTTCTGGTTCC
GCTGTTTTACTCAGTGCTGTGAAAGAGGAGGCTAATAAGCCTACTTCTACCGATCACATC
CAGACCAGCGGAGCCTATTGTGCTAGAGAAGAATCCCTCAAGAGAGAGGAAGACGCAG
ATCGGCTTAAATTCGTGTGTTTCGGTAATGATGGAATTGATCAACACATGATCTGGCTTAT
TGGATTAAAGAATATATTTGCAAGGCAACTTCCAAACATGCCAAAAGAGTATATTGTTAGA
CTTGTAATGGACAGGAGCCACAAGTCAGTGATGATCATCAAGCAAAATCAAGTTGTTGG
GGGCATCACTTATCGCCCTTATCTTAGTCAGAAATTTGGAGAAATAGCTTTCTGTGCTATT
ACAGCGGATGAACAAGTAAAAGGCTATGGGACTAGGCTGATGAATCATTTGAAACAACAT
GCCCGTGATGTGGATGGGCTAACACACTTTCTAACATATGCTGACAATAATGCAGTTGGC
TACTTCATTAAACAGGATTTTACAAAGGAGATCAAGCTGGAGAAGGAGAGATGGCATGG
GTACATCAAGGATTACGATGGGGGAATCCTCATGGAGTGCAAAATAGACCCTAAGTTGC
CTTACACAGATCTACCAGCTATGATCCGCTGGCAGCGGCAGACAATAGATGAAAAGATA
AGGGAGCTCTCCAACTGCCATATTGTTTACTCAGGCATTGACATTCAGAAGAAAGAAGCA
GGCATTCCTAGGAAGCCCATCAAGGTTGAGGACATTCCTGGTCTAAAGGAGGCAGGCTG
GACAACAGATCAGTGGGGTCACTCGCGTTTTCGTCTACTGAATTCCCCATCCGAGGGCC
TGCCAAATCGCCAGGTCCTCCATGCCTTTATGCGGTCTCTACATAAGGCTATGGTCGAA
CATGCTGATGCTTGGCCATTCAAGGAGCCAGTAGATCCTCGTGATGTGCCTGATTACTAT
GATATCATAAAAGATCCAATGGATGTGAAGAGGATGTTTACAAATGCACGGACATATAAT
ACCCATGAAACTATTTATTATAAATGTGCAAACAGGTGATTATCTCACAGGTTGGATTCCT
ACTTTTTTAACAAACTTCAAGCTGGAACTCAGGTTAATAACAAGAGTCAGCAGTGATGGA
ATTGTCATCATAATTTCCTTTATTCAAATCAAACAAGCACTGGCTGTGCCCGCATGGTGCT
GTGCTTTATGCTTTTGCATGGTTTTCCTATAAGATGTATGAATTCGCACTGTGGTGCAATT
TTATGAATTAAACTCAAATGGAAAGGTTGCCGGCTTTTAACTATAGCTCTTGTAGATTTCC
TCAGTTGGCATGGAAGCAATTGGTGTTGCCACAGAAGAAGTTGGATTTTCCTTATATTTTT
TTACAAAAGCTGTTGTGTTTATAATTCTGCTTGTTAGTGTGAATAATGTTAATAGTTTGATC
TTCCTTGAATTTGTGGGGATACTGATATTTCTTTTTTGGGAAAGCTATGTGCTATAACAAT
AAATGCACCAATTGGTTTTTGGTTTGTCAAAAAAAAAA
SEQ ID NO:153
GATCGCTCGAGAGTTCCTCTCTCAGCTGTTTTCCAATTCTGGACTAAACAAGGGTGCAG
GCGGCAGCCAGTTGAAACAGTGGCAGAGTGGGGGTGAACATGCAACTTAGTGGGAGCA
TGGACCCGCTGTGCAAACACACGCATAGCTGGAAGTGCTATTAACGAGTAAATACTACG
AAGCCATTCGGTCTCTGGATTCATCCGACTAATTTTGGGGTGCAACCATTTGCCAGGGG
CCCTCGATTAGACGGAGGGTGGAATGGAAGAATCGGGCAATTCACTGACCTCTGGTCC
GGATGGAAGTAAGCGAAGAGTATCTTATTTTTATGATTCAGACATCGGGAACTACTACTA
CAGTCAGGGCCACCCGATGAAGCCACATCGAATCAGAATGGCTCACAGTCTGATAGTTC
ACTATGCCCTGGATGAGAAGATGGAGGTTTGTAGACCTAATCTTCTACAGAGCAGAGAG
CTTAGAGTATTTCATGCAGATGATTACATATCTTTTCTTCAGTCGGTCACTCCTGAAACTC
AACACGAACAGCTACGACAATTGAAGAGATTTAACGTGGGGGAGGATTGCCCTGTTTTT
GATGGGCTATACAATTTCTGTCAGACCTATGCAGGGGGCTCTGTGGGAGCAGCTATCAA
ACTTAATAACAAGGAAGCAGATATTGCCATTAATTGGTCTGGTGGTCTCCACCATGCTAA
GAAGTGCGAGGCATCTGGGTTTTGCTATGTAAATGACATTGTCTTGGCTATTCTTGAACT
TCTCAAGGTCCATCAACGAGTGCTTTATATTGATATTGATATTCATCACGGGGATGGTGT
GGAGGAGGCCTTCTACAGTACTGACCGAGTCATGTCTGTGTCATTTCACAAGTTTGGAG
ACTACTTCCCAGGCACAGGCCACCTTAAAGATGTAGGATACGGGAAAGGGAAGTATTAT
TCATTAAATGTTCCTTTGAATGATGGAATTGATGACGAGAGCTACAAGAATCTCTTTAGAC
CTATAATCCAGAAGGTAATGGAAATATATCAGCCGGAAGCTGTTGTCTTGCAATGCGGAG
CAGATTCTCTGTCAGGCGATAGGCTAGGTTGTTTTAATTTATCAGTCAAAGGACATGCAG
ACTGTGTTCGATTTCTTCGGTCATTTAATGTTCCATTGGTGCTTGTTGGAGGCGGTGGTT
ACACTATTCGCAATGTTGCCCGCTGTTGGTGTTATGAGACTGCAGTTGCTGTAGGCGTC
GAACCCCAAGATAAATTGCCTTACAATGAGTACTATGAATATTTTGGGCCAGATTATACTC
TCCATGTGGCTCCTAGCAACATGGAGAATCAGAATTCAGCCAAGGAATTAGCAAAAATAA
GAAACACATTGCTGGAACAATTGAAACGAATACAACATGTTCCCAGTGTGCCATTCCAAG
AAAGGCCACCAGACACCAAGTTTCCTGAAGAGGATGAAGAGGATTATGAAAAGCGACCT
AAAGGTCATAAATGGGGTGGAGAATACTTTGGTTCGGAATCCGATGAAGAACAAAAACCT
CAGAATCGTGATATAGATATTTCGGACAAACCTGGCATAAGAAGGCAGTCACCTCCTAAT
GTTGAAGCGGCCAAAAAAATAAAAGTAGAAGAAGAAGATGGAGACATTGGAATAGTTAAT
GAGAACGATGGTGCAAAGTGGCCATTGGGTGAAGCTGGATAAAAAGTATATTATGGAGG
AAATAAGCAGGAGTTTCAGATCATTTGCACAATTCCAAGAAGATTATTTTTCTCCGGAAGG
CAAAGAGCTACCGGGAATGGGAATTTCAAATGGTCATTTCTATTTCTCTAATTTCCAGTCA
GATGGAATTGTACTTCTATCTTCATAGACTTCAAGGCTATTGTGTACATAAAGAAAGCAGA
TAATAATAAAATTTTGTCGGTTTTATTGAATGAAATTCCCCATAATGTTTTACTATTCCGTC
TGGGCTTAGAGATGTACGTTAATTGGTCATTTAAGACGACTCAGTTACATAGAAATCCAC
AAGTGGCATAGTACAAACATAAATATAAAGCTTTTTTTACCAAAAAAAAAA
SEQ ID NO:154
CTCGAGAGTTCCTCTCTCAGCTGTTTTCCAATTCTGGACTAAACAAGGGTGCAGGCGGC
AGCCAGTTGAAACAGTGGCAGAGTGGGGGTGAACATGCAACTTAGTGGGAGCATGGAC
CCGCTGTGCAAACACACGCATAGCTGGAAGTGCTATTAACGAGTAAATACTACGAAGCC
ATTCGGTCTCTGGATTCATCCGACTAATTTTGGGGTGCAACCATTTGCCAGGGGCCCTC
GATTAGACGGTAAATAGTTGCTGAGATTTTTACTCGAATCTGCCAGTATTCAAATCTAGTC
AATATCCGTGTTGAGCTAAACAAGCGCTGAAAGTTTGCTCGAATCAGCCAAGAGGGTGGA
ATGGAAGAATCGGGCAATTCACTGACCTCTGGTCCGGATGGAAGTAAGCGAAGAGTATC
TTATTTTTATGATTCAGACATCGGGAACTACTACTACAGTCAGGGCCACCCGATGAAGCC
ACATCGAATCAGAATGGCTCACAGTCTGATAGTTCACTATGCCCTGGATGAGAAGATGG
AGGTTTGTAGACCTAATCTTCTACAGAGCAGAGAGCTTAGAGTATTTCATGCAGATGATT
ACATATCTTTTCTTCAGTCGGTCACTCCTGAAACTCAACACGAACAGCTACGACAATTGA
AGAGATTTAACGTGGGGGAGGATTGCCCTGTTTTTGATGGGCTATACAATTTCTGTCAGA
CCTATGCAGGGGGCTCTGTGGGAGCAGCTATCAAACTTAATAACAAGGAAGCAGATATT
GCCATTAATTGGTCTGGTGGTCTCCACCATGCTAAGAAGTGCGAGGCATCTGGGTTTTG
CTATGTAAATGACATTGTCTTGGCTATTCTTGAACTTCTCAAGGTCCATCAACGAGTGCTT
TATATTGATATTGATATTCATCACGGGGATGGTGTGGAGGAGGCCTTCTACAGTACTGAC
CGAGTCATGTCTGTGTCATTTCACAAGTTTGGAGACTACTTCCCAGGCACAGGCCACCTT
AAAGATGTAGGATACGGGAAAGGGAAGTATTATTCATTAAATGTTCCTTTGAATGATGGA
ATTGATGACGAGAGCTACAAGAATCTCTTTAGACCTATAATCCAGAAGGTAATGGAAATA
TATCAGCCGGAAGCTGTTGTCTTGCAATGCGGAGCAGATTCTCTGTCAGGCGATAGGCT
AGGTTGTTTTAATTTATCAGTCAAAGGACATGCAGACTGTGTTCGATTTCTTCGGTCATTT
AATGTTCCATTGGTGCTTGTTGGAGGCGGTGGTTACACTATTCGCAATGTTGCCCGCTGT
TGGTGTTATGAGACTGCAGTTGCTGTAGGCGTCGAACCCCAAGATAAATTGCCTTACAAT
GAGTACTATGAATATTTTGGGCCAGATTATACTCTCCATGTGGCTCCTAGCAACATGGAG
AATCAGAATTCAGCCAAGGAATTAGCAAAAATAAGAAACACATTGCTGGAACAATTGAAA
CGAATACAACATGTTCCCAGTGTGCCATTCCAAGAAAGGCCACCAGACACCAAGTTTCCT
GAAGAGGATGAAGAGGATTATGAAAAGCGACCTAAAGGTCATAAATGGGGTGGAGAATA
CTTTGGTTCGGAATCCGATGAAGAACAAAAACCTCAGAATCGTGATATAGATATTTCGGA
CAAACCTGGCATAAGAAGGCAGTCACCTCCTAATGTTGAAGCGGCCAAAAAAATAAAAGT
AGAAGAAGAAGATGGAGACATTGGAATAGTTAATGAGAACGATGGTGCAAAGTGGCCAT
TGGGTGAAGCTGGATAAAAAGTATATTATGGAGGAAATAAGCAGGAGTTTCAGATCATTT
GCACAATTCCAAGAAGATTATTTTTCTCCGGAAGCCAAAGAGCTACCGGGAATGGGAATT
TCAAATGGTCATTTCTATTTCTCTAATTTCCAGTCAGATGGAATTGTACTTCTATCTTCATA
GACTTCAAGGCTATTGTGTACATAAAGAAAGCAGATAATAATAAAATTTTGTCGGTTTTAT
TGAATGAAAAAAAAAA
SEQ ID NO:155
GGGGAAAACAAAGGCAGCATAAACCCTAAACCCGAAATTCCCTCTAGCCTCTGAACCTA
AACCCTAAATGTCTTCTCGACTTTGAAGCCTATAGAGTTCTCCAAAAGCTCAGTTTATCTA
TTTAGTTCATATTTTTCGTTGTGTGGTTTTACGGTTTGCGGCCAACGTTATTGTTCTCTTT
CGTGGAGCGTAATCGATCTGCAATCTTCGGGCGTCGTTCGTGCTCTCTAAGGGAGGAAG
AAGATATTTCATTCATCGTCATGGAGTTTTGGGGCGTTGAAGTGAAGCCTGGAGAGGCTT
TGACATGTGATCCTGGGGATGAAAGATACCTTCATATGTCCCAGGCTGCCATTGGGGAC
AAAGAAGGTGCTAAGGAAAATGAAAGGGTCTCCTTGTATGTCCACGTGGATGGTAAAAA
GTTTGTGCTTGGGACCCTGTCCCGTGGAAAATGTGACCAGATTGGACTGGATTTAGTGT
TTGAGAAAGAGTTCAAGCTCTCTCATACATCTCAAACTGGGAGCGTATTCGTATCTGGCT
ACACTACAGTAGATCATGAAGCCCTGGATGGATTTCCGGATGATGAAGACCTAGAGTCA
TCGGAAGATGAGGAAGAAGAATTAGCTCAGATAACGACTTTAACTGCAAAGGAGAATGG
TGGCAAAACCGGAGCCAAGCCTGTGAAGCCTGAATCTAAGTCATCTGTAACTGATAAAG
CAGCAGCTAAAGGGAAGCCAGAGGTGAAGCCACCAGTCAAAAAGCAAGAAGATGATTCT
GATTCTGATGAAGATGAAGATGAGGATGAAGATGAAGATGAAGACGATGATGATGAAGA
TGATGAGGATATGAAAGATGCGAGTGCAAGTGATGATGGGGATGAGGAAGATGACTCCG
ATGAAGAAAGTGATGATGATGAGGAGGAAGATGAGGAAACCCCAAAGCCGGCTGCTGG
GAAGAAGCGACCTATGCCAGCCCTCGACAACAAGTCACCAGCAACAGATAAGAAGGCAA
AGATCACCACACCAGCTGGAGGTCAAAAGCCAGGTGCTGACAAAGGCAAGAAAACAGAA
CACATTGCAACTCCTTATCCAAAACACGGTGCAAAGGGCCCTGCTTCAGGTGTTAAGGG
CAAAGAAACTCCACTTGGGTCAAAACAAACACCTGGTTCAAAAGTGAAAAATTCGTCTAC
ACCTGAGTCTGGCAAGAAATCTGGCCAGTTTAAATGCCAGAGTTGTTCAAGGGACTTTG
CAACAGAAGGAGCATTGAGCAGCCACAATGCTGCGAAGCATGGTGGAAAGTGAAGTATT
TCGTTTTTACATTTTCACAACTGTTTTTGCACTCTCAGTGCATCGTTGCCAGAAAATTCCT
AAACTAAGGTCAACTGTGTATCAGATCTCTAGTCAGTGTTTAGAGATTTACAGAAAGTTGT
GTACTAATTTGTATTGTAACGTCCATTTATCCAACGAGTCCTCCATTCATGCTATTTTTGT
GAGTTGCTTCTAAGGGAGGATTTCTCAAGGTAGCACAATTTTGTATTAGAGGTTAAATCT
AGTAGATGTCTGTTATGTGCGGCATTGTTGAATTGAATGATTTTATTTTTTCAAATTTGTC
GGAGGCTACTGATTTAAAAAAAAAA
SEQ ID NO:156
ATTGATTTAAGTGGGGTTCAAGAAAATCGTATCCCCAATCCGCATCTTGGGTGCTTACAT
CAATCTGTTTATATCAACCAATCCTCGCAACAAATTGGGGAAGAAAGGCCGTTCAAATGT
GGCATGGTATATTCTCTGTGTATAATCTTCGTGTCTGGATTTTTTCCGGCCCAAGCAGCC
CAAGCAGACCATGTAGGCTTGTTGTTAAACCATGATTAATTGACTTCATGGGTGTTTTCC
AGTCTGAAATTTGGTGGAGGACTAGAACCAGTTTTTAATAGTTTGTTTTGGCAGATGAAC
TTTATCCTTCTGAAAGTTTTTGAGAACAAAAGGAATTAGCGGTTGTTTATGAAGAAGAGG
CTCCATGATGGAGACTGGTGGCAATTCTCTTCCATCCGGACCGGATGGAGTGAAGAGGA
AAGTGGCATATTTTTATGACCCAGAGGTGGGGAATTATTACTATGGACAAGGGCATCCGA
TGAAGCCTCACCGCATAAGAATGACTCATGCCCTCCTTGTTCAGTATGGTTTGCATAAAG
AAATGCAGATTCTTAAGCCCTACCCTGCGAGAGATAGAGATCTCTGCCGTTTCCATGCAG
ATGACTATGTAGCATTTTTACGAGGAATAACTCCGGAGACAATACAGGATCAAGTTAAGG
CGTTGAAACGGTTTAATGTTGGGGATGATTGCCCGGTTTTTGATGGCTTGTACCAATACT
GTCAAACTTATGCTGGGGGGTCTGTCGGAGGTGCTGTGAAACTCAATCATAAACTCTGT
GACATTGCAATTAATTGGGCTGGGGGTCTTCATCATGCTAAAAAATGTGAGGCATCCGG
ATTTTGCTATGTGAATGACATTGTGCTGGCCATTCTGGAGCTCTTGAAATATCATAAGCG
TGTTTTGTATGTTGATATTGACATTCACCATGGTGATGGAGTTGAGGAGGCTTTCTACAC
AACGGATAGGGTTATGACAGTATCATTTCACAAATTTGGGGACTATTTCCCAGGGACAGG
TGACATTAGAGACATCGGGTGCGGGAAAGGGAAATATTATGCTGTAAATGTCCCTTTAGA
TGATGGAATTGATGATGAAAGTTTTCAGTCTCTGTTTAAACCTATTATTCAACAAGTGATG
TTGGTTTACAATCCTGAGGCAATTGTTCTCCAGTGTGGAGCTGATTCTTTGTCTGGAGAC
AGGCTAGGTTGCTTTAACCTTTCTGTGAAAGGGCATGCAGAGTGTGTTCGATACATGAG
GTCATTCAATGTGCCACTTCTGATGGTCGGTGGTGGTGGGTACACAGTTCGTAATGTTG
CTCGTTGTTGGTGCTATGAGACAGGGGTTGCTGTTGGTGTGGAGATTGATGACAAAATG
CCTCAACATGAATACTACGAATATATTGGACCAGATTACACTGTTCATGTGGCTCCAAGT
AATATGGAAAATAAGAACACGAAGCAGTATCTGGATAAAATTAGGTCCAAAATACTTGAA
AACATTAATTCATTGCCATGTGCGCCAAGTGCTCAATTCCAAGTGCAGCCTCCAGATACT
GATTTCCCTGAACTGGAAGAAGAGGATTATGATGAACGTACTAGAAGTCATAAATGGGAT
GGAGCAAGTTGTGATTCTGATTCAGAGAATGGTGATTTAAAGCATCGTAATCATGATGTT
GAGGAAAGTGCATTCCCACGGCATAATCTGGCCAATATTAGTTACAATACCAAGATTAAG
CTAGAAGGTGTTGGTACAGGTGGTCTTGACATGGCTGCTGGAACAGACACAAAAAAGAA
TGACGAGTCCTTTGAAGCCATGGACTATGAAAGTGGAGAAGAGTTACGACAGGATCATT
TTGCTTCTACAATAAATGCTTCTCAACCATGTGATCCTGCTCTTTTGACAGGAGTTCAAAA
TCAGTTGCAGTCAACAGATACTGTAAAGCCCATAGAACAAAGTGGCAATGCTCCAGGAAT
ACCACCACCTTCAGTGGCAACTGTAAGCACGGGCACTCGCCCGAGTTCAATTAGCCGGA
CATCATCTTTGAATTCAATGTCTTCAGTAAAGCAGGGTTCCATTTTGGGACCAAATCCTCC
TCAAGGGCTGAATGCATCTGGTCTACAATTTCCAGTTCCAACTTCTAATTCACCTATAAG
GCAAGGTGGGTCTTATTCTATAACTGTACAGGCCCCAGACAAGCAAGGTTTGCAAAATCA
TATGAAGGGACCTCAAAATATGCCAGGGAATTCTTGAAAATTATATATCCTTCTGCAATTT
GAAACACAGGTTACTTTGTATAATGATAATGTAATGTCATCCCCTTGGTCGCAATGTGCTT
GAGATCAGTTTAGAAGCTAATTATACACTTAGAATGCTAATTGGTGAGGTGCTAAAACTTC
TCTGTATTTTATTCTACAGTACTGTATTCGAAGATCCTGAAAATTTACTAAAACAAATGGAA
TATCAACAACCTAGGATCATTATAGCAAAAAAAAAA
SEQ ID NO:157
AATTACCATACACCACAGCATAATCAACTGGGCGGCTTAATTCTGGAACGCGAAACGAC
CGCAGGCCGAATCGAATTTCATACAGTACATGGAAGAGGAAGCCACGAAAGCCGGGAA
GCAACGCGTAGGGTAGGTGTCGTTTTTGCCCGTTGGAAATGCCTCCGAAAGATAGAGTC
GCCTACTTCTACGACGGGGATGTGGGAAGTGTCTACTTTGGCCCAAATCATCCTATGAA
GCCACATCGACTGTGCATGACACACCATCTGGTTCTTTCTTATGAACTTCACAAGAAGAT
GGAGATCTATCGACCACACAAGGCATATCCAGTGGAACTTGCTCAGTTTCATTCAGCAGA
CTATGTGGAGTTCTTGCATCGAATAACGCCTGATACACAGCACTTGTTCACCAAGGAGCT
AGTAAAATATAATATGGGAGAAGATTGTCCTGTATTTGAGAACCTCTTTGAGTTTTGTCAA
ATTTATGCTGGTGGCACTATAGATGCTGCTCATCGACTGAACAATCAGATTTGTGATATT
GCAATCAATTGGTCTGGTGGTCTTCACCATGCTAAAAAGTGTGAGGCATCAGGTTTCTGT
TATATTAATGACTTGGTGCTGGGAATTTTGGAGTTGCTTAAACATCATGCCAGAGTTCTCT
ATGTTGATATTGATGTCCACCATGGTGATGGAGTTGAAGAAGCCTTTTATTTTACTGACA
GGGTGATGACAGTGAGCTTTCACAAGTACGGTGACATGTTTTTTCCTGGAACAGGTGAT
GTTAAGGAGGTTGGAGAAAGAGAAGGAAAATATTATGCTATTAATGTTCCTCTCAAAGAT
GGAATTGATGATGCTAGTTTCACGCGACTTTTCAAAACTATTATTACCAAGGTAGTTGACA
TATACCAGCCTGGTGCCATTGTTCTTCAGTGTGGAGCTGATTCACTTGCTGGGGATCGC
CTTGGTTGCTTTAACCTATCCATCGATGGTCATGCACAGTGTGTGAGAATTGTGAAGAAA
TTTAATCTGCCACTATTGGTTACTGGGGGAGGAGGCTACACAAAAGAAAATGTTGCTCGT
TGTTGGTCCGTGGAAACTGGAGTACTTCTTGATACAGAACTTCCTAATGAAATTCCTGAT
AATGATTACATCAAGTACTTCGCACCTGACTATTCCTTAAAGATCAACACTGCTGGGAAC
ATGGAGAATCTGAATAGCAAGACATACTTGAGCGCAATTAAAGTGCAGGTTATGGAGAAT
CTGAGGGCCATTCAACATGCACCAAGTGTACAGATGCATGAAGTGCCTCCTGACTTTTAT
ATTCCTGATATTGATGAAGATGAACTTAATCCGGATGAACGTATGGATCAGCATACTCAA
GACAGACAGATCCAGCGAGATGATGAATATTATGATGGAGACAATGACATTGATCATGAC
ATGGAAGAGGCAAGCTGATAGCCGAGAAAGATATTTGCCCCTTTGATGCAATATTAATGA
TAACGCTTTGTTGGATGTCTTTTCTTTCTACAATGACAATAAATTAGTTTTTGTAATTTATA
AAGTATTCTCGAGTGTACTTTCAGAAATTTTGAGGAGCCCTTTTCCGCCAGAAATGATTTA
TGTTGCTTTTCTGTAATAGTTCTTCCCTCCAGGGATTCTGGCTGTGTTTCTCCCCCAGTTT
TCTGAAAATCTTGGTGATAACACCAGAAGTAGAAAACTTTGCTCTATATAATTTGTGCTCG
TGTGTGTACTTGAAGATCCATCCTCACATAGTCCAATGAAGTTTTGCAGTATCATATGCC
CAATTTTTTTTGGAAAAAAAAAA
SEQ ID NO:158
GCTCCCTTACATCGCTGGTCCGATCGTTGATAGTCACCCCACCTAGAGCCTCCCGGGTT
GTTGCATGAACTGAGATGCCTAACAATCGAAAATCGAACCCCACCATTTCAAGATCAAGA
CTCCCACATTCGCACCCAGCATAAAACGTCTCCCATCCCCAACAGATCTCGTTACCTCAG
AACAAATTAACGCCACAATCGCCATGGATTCTTCCAAGTCAGAAGAAGCAAACATTTTGC
ATGTGTTTTGGCACGAAGGAATGCTCAATCATGATTTAGGCACAGGGGTATTTGACACAT
TGGAGGATCCAGGCTTCCTGGAGGTGCTGGAAAAACATCCTGAGAATGCAGATAGAGTG
AGAAACATGCTCTCAATTCTCAGAAAGGGACCCATTGCTCCCTACACCGAATGGCACACT
GGCAGGGCTGCATATCTCTCAGAACTGTACTCCTTTCACAGACCAGATTATGTAGACATG
CTTGCCAAAACTAGTACAGCAGGTGGAAAGACCTTATGCCATGGCACACGCCTGAATCC
TGGTTCTTGGGAAGCTGCACTTCTTGCTGCTGGGACAACACTTGAAGCTATGCGCTATAT
TTTGGATGGACATGGAAAGCTATCTTATGCATTAGTGCGACCACCAGGCCACCATGCAC
AGCCAACTCAAGCTGACGGGTACTGCTTTTTGAATAATGCTGGTCTTGCAGTTGAACTAG
CTGTGGCATCTGGCTGCAAGCGAGTTGCTGTGGTTGACATCGATGTGCATTATGGGAAT
GGAACTGCAGAGGGATTCTATGAACGAGATGATGTGCTCACAATCTCTCTCCACATGAAT
CATGGTTCTTGGGGACCTTCACATCCTCAAACTGGATTTCATGATGAAGTTGGTCGAGGA
AAAGGTCTGGGGTTTAATCTAAATGTTCCTTTGCCTAATGGGACAGGAGATAAGGGATAT
GAGCATGCCATGCATGAATTGGTGGTACCAGCCATTAGCAAATTTATGCCTGAAATGATA
GTCTTGGTTATTGGTCAAGACTCCAGTGCATTTGATCCTAATGGAAGGGAGTGCTTGACA
ATGGAAGGTTATAGAAAAATTGGTCAAATAATGCGTCAACAAGCTGATCAATTTAGTGGC
GGCCGCCTTGTGGTGGTACAGGAAGGTGGCTATCATATTACTTATGCAGCATATTGTCTT
CATGCCACTCTTGAAGGTGTCCTATGTCTGCCACATCCACTTTTGTCAGATCCAATTGCT
TATTATCCAGAGCATGATATTTATAGTGAAAGAGTAACATTTATAAAAAATTACTGGCAAG
GAATCATCTCGACCACAGATAAACGTAACTGATTAGTTTACGATATGTGGTTGTGACTCT
GAGTATTTGAAGCTGTTTTACAGCTGTATATGGATAAATGTTATTAGTGTGTTGCAACAAT
ACGCTTTACCTTGTGTGTATAGTTTTATAACACTCTATGGTATCACTACCACTATGGGCCT
GTTTAGTCCAAGTTTCCTTTTTAATATTTGCCGGTATAAAGTATATGATTTTCTTAGTGTAA
AAAAAAAA
SEQ ID NO:159
GAAACACTGATATATATACCTAAATCCCAGGTTTAAATAAAAGCTACATGTGTTCGTCGCA
TCCTTCTCTTCTCTGATTGGGGATTTCTCTGGATTTTGCTATACAGTCAGATAGCAACGAA
ACATAGCAGAGGCACGTAAGCTTTGGTTTAACCTGTAGGCATTCGTTCTACTGTGTATTT
ATCTAGTCCATAGGAACGAGACATCGGAGATTAGAGGTGAGAGGCCATGGAGGAGTCT
GGCAATGCTCTGGTATCAGGGCCTGATGGGAGTAAGAGAAGAGTTACATACTTCTACGA
TGCAGACATTGGTAATTATTACTATGGGCAAGGCCACCCAATGAAGCCACACCGCATGA
GAATGGCCCATAACTTGATTGTCCACTATGGCCTCCACCAGAGGATGGAAGTTTGCAGG
CCTCATCTAGCACAGAGCAAGGACATCCGTGCCTTTCATACTGATGATTACATACATTTC
TTGAGTAGTGTAGCACCAGACACTCAGCAGGAGCAGCTGAGGCAGTTGAAGAGGTTCAA
TGTGGGGGAGGACTGTCCTGTGTTTGATGGGCTTTTCAATTTTTGCCAGTCTTCTGCAGG
AGGCTCAATTGGGGCAGCCCTTAAACTCAATAGGAAGGATGCAGACATTGCCATCAACT
GGGCGGGGGGCCTCCACCATGCCAAGAAGTGCGAGGCTTCGGGTTTCTGCTATGTCAA
TGATATTGTGCTTGGCATCCTGGAATTGCTAAAGGTCCATCAGCGTGTACTTTATATTGAT
ATTGATATTCATCATGGTGATGGAGTTGAAGAAGCCTTCTACACTACAGACCGTGTGATG
ACTGTATCTTTTCACAAATTTGGGGATTACTTTCCAGGCACAGGTCATATTAAAGATGTAG
GATATGGGAAAGGGAAGTACTATGCTTTGAATGTTCCTTTGAATGATGGAATTGACGATG
AGAGCTACAAGCACCTTTTCAGGCCCATTATCCAGAAGGTAATGGAAGTCTATCAGCCA
GAAGCAGTTGTTCTACAATGTGGAGCAGACTCACTCTCAGGGGACAGGTTAGGGTGCTT
CAATTTGTCAGTCAAGGGACACGCAGATTGTGTTCGTTTTGTTCGGTCATTTAATATACCA
CTTATGCTGGTTGGTGGTGGTGGTTATACTATTCGTAATGTGGCTCGCTGCTGGTGCTAC
GAGACTGCTGTTGCTGTAGGTGTTGAACCCCAAGATAAATTGCCTTATAACGAGTATTAC
GAATATTTTGGGCCAGACTACACCCTCTATGTAGCCCCTAGCAATATGGAGAACCTGAAC
ACAGAAAAAGATTTGGAGAAAATGAGAAACGTGTTGCTAGAACAATTGAGTAAAATACAA
CATACACCAAGTGTACCCTTTCAGGAAAGGCCCCCTGATACAGAATTTAATGATGAGGAG
GAAGAGGACATGGAAAAACGCTCAAAATGCCGTATCTGGGATGGGGAGTACGTTGGTTC
AGAACCTGAGGAAGATGGAAAGCTTCCAAGATTTGATGCAGATACTTATGAGAGATCTGT
TCTCAAGCATGAAAACAAAAGGTTAGTGCCTGTTTCAAATGTTGAACCTCTGAAGAGAAT
AAAACAAGAGGAAGATGGAGCAGCTGTTTAAGTACTAGATGATCTTAATTGCACTGCCAT
TTCCTTCAGATGGTCACTTCTCTCATGTTCATTCAATCATTTTAATTTTAACATTCTAATATGG
TTCATGCCCTAGATGAAGAAAGAAGGCCTGTGAACTTCTTTTAATTTTGATCCTGGCTTTA
AGGCTCATTTTAGAGAAGGAACTTTCATGCACTACATGGCTTAAAGATATTGATTTTGTGC
TGTGAAAGCAGAACGAAAATGAAAATGGCACTGCCACCCGGACATTTGTCCAATCTGCA
TGAGCACACTGAGGGAAAAGCTGAAGCAGAATCAGCTTTGACCAGTATTTAGTGTCTTGT
ATACAATTCTTGTTTCAGTGAATGCTGCTCCCCCTTCTCGATTACCAACCTGTACTTAAAC
ATTGTTGCTATCAGAAGAGTATTATTTTTCAATGAATGCAAGATAAATGTAAAAAAAAAA
SEQ ID NO:160
AATTACCATACACCACAGCATAATCAACTGGGCGGCTTAATTCTGGAACGCGAAACGAC
CGCAGGCCGAATCGAATTTCATACAGTACATGGAAGAGGAAGCCACGAAAGCCGGGAA
GCAACGCGTAGGGTAGGTGTCGTTTTTGCCCGTTGGAAATGCCTCCGAAAGATAGAGTC
GCCTACTTCTACGACGGGGATGTGGGAAGTGTCTACTTTGGCCCAAATCATCCTATGAA
GCCACATCGACTGTGCATGACACACCATCTGGTTCTTTCTTATGAACTTCACAAGAAGAT
GGAGATCTATCGACCACACAAGGCATATCCAGTGGAACTTGCTCAGTTTCATTCAGCAGA
CTATGTGGAGTTCTTGCATCGAATAACGCCTGATACACAGCACTTGTTCACCAAGGAGCT
AGTAAAATATAATATGGGAGAAGATTGTCCTGTATTTGAGAACCTCTTTGAGTTTTGTCAA
ATTTATGCTGGTGGCACTATAGATGCTGCTCATCGACTGAACAATCAGATTTGTGATATT
GCAATCAATTGGTCTGGTGGTCTTCACCATGCTAAAAAGTGTGAGGCATCAGGTTTCTGT
TATATTAATGACTTGGTGCTGGGAATTTTGGAGTTGCTTAAACATCATGCCAGAGTTCTCT
ATGTTGATATTGATGTCCACCATGGTGATGGAGTTGAAGAAGCCTTTTATTTTACTGACA
GGGTGATGACAGTGAGCTTTCACAAGTACGGTGACATGTTTTTTCCTGGAACAGGTGAT
GTTAAGGAGGTTGGAGAAAGAGAAGGAAAATATTATGCTATTAATGTTCCTCTCAAAGAT
GGAATTGATGATGCTAGTTTCACGCGACTTTTCAAAACTATTATTACCAAGGTAGTTGACA
TATACCAGCCTGGTGCCATTGTTCTTCAGTGTGGAGCTGATTCACTTGCTGGGGATCGC
CTTGGTTGCTTTAACCTATCCATCGATGGTCATGCACAGTGTGTGAGAATTGTGAAGAAA
TTTAATCTGCCACTATTGGTTACTGGGGGAGGAGGCTACACAAAAGAAAATGTTGCTCGT
TGTTGGTCCGTGGAAACTGGAGTACTTCTTGATACAGAACTTCCTAATGAAATTCCTGAT
AATGATTACATCAAGTACTTCGCACCTGACTATTCCTTAAAGATCAACACTGCTGGGAAC
ATGGAGAATCTGAATAGCAAGACATACTTGAGCGCAATTAAAGTGCAGGTTATGGAGAAT
CTGAGGGCCATTCAACATGCACCAAGTGTACAGATGCATGAAGTGCCTCCTGACTTTTAT
ATTCCTGATATTGATGAAGATGAACTTAATCCGGATGAACGTATGGATCAGCATACTCAA
GACAGACAGATCCAGCGAGATGATGAATATTATGATGGAGACAATGACATTGATCATGAC
ATGGAAGAGGCAAGCTGATAGCCGAGAAAGATATTTGCCCCTTTGATGCAATATTAATGA
TAACGCTTTGTTGGATGTCTTTTCTTTCTACAATGACAATAAATTAGTTTTTGTAATTTATA
AAGTATTCTCGAGTGTACTTTCAGAAATTTTGAGGAGCCCTTTTCCGCCAGAAATGATTTA
TGTTGCTTTTCTGTAATAGTTCTTCCCTCCAGGGATTCTGGCTGTGTTTCTCCCCCAGTTT
TCTGAAAATCTTGGTGATAACACCAGAAGTAGAAAACTTTGCTCTATATAATTTGTGCTCG
TGTGTGTACTTGAAGATCCATCCTCACATAGTCCAATGAAGTTTTGCAGTATCATATGCC
CAATTTTTTTTGGAAAAAAAAAA
SEQ ID NO:161
GGAGAGGTTCTGTCTGAGAGAAGAAGATGGATTTGAACTTGGTGAGCCATGGAGAAGAA
GAAGAAGGGGTAAGGCGTCGGAAAGTAGGAATTGTATACGACGAACGAATGTGCAAGC
ATGCTACTCCTGAGGATCAACCACATCCCGAACAACCAGATCGCATTAGGGTGATATGG
GACAAGCTCAACTCCGCCGGGGTCCTCCATAAATGTGTTATGGTGGAGGCGAAAGAAGC
ATCGGAGGAGCAATTGGCGGGGGTCCATAGTCGGAAACACATTGAGGTAATGAAAAGCA
TTGGCACTGCTAGATATAATAAGAAGAAGCGGGACAAGTTGGCAGCGTCTTACAGTTCC
ATTTATTTCAGCCAAGGCTCCTCGGAAGCCGCCCTTCTCGCTGCCGGATCCGTGGTAGA
GATATCTGAAAAAGTAGCTTCAGGGGAATTGGATGCCGGAGTTGCTATTGTTAGGCCAC
CAGGTCATCATGCAGAGGCTGACAAAGCCATGGGGTTTTGCTTGTTCAACAACATAGCT
ATTGCAGCAAAACACCTCGTCCATGAAAGGCCAGAGTTAGGTGTACAGGAAGTGTTGAT
TGTTGACTGGGATGTTCACCATGGCAATGGGACACAGCACATGTTTTGGACTGATCCAC
ATGTTTTGTATTTTTCTGTTCATAGATTTGATGCAGGAACATTCTATCCAGGAGGAGATGA
TGGGTTCTATGACAAAATTGGAGAAGGGAAAGGAGCGGGATACAATATAAATGTTCCTTG
GGAGCAAGGAAAATGTGGAGATGCAGATTACCTTGCTGTTTGGGACCATGTTTTGGTTC
CTGTTGCAAAGAGTTATGATCCAGATATGGTTCTTATTTCTGGAGGGTTTGATGCAGCAC
TTGGTGACCCATTGGGTGGATGCCGACTTACACCTTATGGGTATTCACTAATGACAAAGA
AGTTAATGGAGTTTGCAGGTGGGAAGATTGTCTTGGCCTTAGAGGGTGGCTATAATCTTA
AGTCGTTGGCAGATTCATTCCTGGCTTGTGTAGAAGCTTTGCTTAAAGATGGACCTAGTA
GAAGCTCTGTTTTGACTCATCCATTTGGATCAACATGGCGTGTAATACAGGCGGTGCGC
AAAGAACTTAGCTCATTTTGGCCAGCATTAAATGAAGAACTACAATTGCCAAGACTGCTG
AAGGATGCCTCAGAATCATTTGACAAGCTAAGTTCCAGTTCAAGTGATGAAAGCTCTGCT
TCTGAAGATGAAAAAATTGCTGAAGTGACGTCAATCATGGAAGTCTCTCCTGATCCA
TCCAGCATTCTTGCCCTAACTGCTGAAGACATTGCTCAACCACTGGCTGGGTTGAAAATT
GAAGAGGCTGGTACTGATAGCCAGAGATCATCAGATCACACTTTGTTAGATTTAACTAAT
GATGACACCCAAAAGTTGAAACAGTTCGAGGGGGAGATTTTTGTCATGATTGGTGATGAA
GAATCAGTTCCATCAGCCTCAAGTAGCAAAGATCAGAATGAGTCCACTGTAGTTTTATCA
AAAAGTAATATTAAAGCTCATAGCTGGAGATTGACTTTCTCCAGCATTTATGTTTGGTATG
CAAGTTATGGGTCAAATATGTGGAATCCAAGGTTTCTCTGTTATATTGAAGGGGGGCAGG
TTGAAGGCATGGCAAAGCGCTGCTGTGGTTCAGAGGATAAAACTCCTCCTCAAAGGATA
CAGTGGAAAGTTGTTCCTCATCGAATGTTTTTTGGGAGATCATACACAAATACATGGGGT
TCAGGAGGAGTGTCCTTTCTTGATCCAAATTGTAGTGATACGAGTGAAGCGCATGTCTGC
TTGTATAAAATAACGCTGGCACAGTTCAATGATTTGCTGCTTCAAGAGAATAATTTGAATT
GTGGGACTGAGCATCCATTAGTGGACTTATCTTCCATTGATGCAATTAGAAATGGGAATT
CTATACTTGAACTTATCAAGGATAGTTGGTATGGCACTCTCATTTACCTTGGCATGGAAG
GAGGCCTTCCAATTGTGACTTTCACGTGCTCCGTGTGTGACGTTGAAAAAGTTCAAGCATG
GACAACTACCGCTTTGTCCTCCATCTTCAAGGTACGAAAATATTCTCATTCGAGGTCTCG
TCCAAGGGAAAAAACTTTCTGAAGATGATGCAACGGCCTACATCCGTGCAGCATCAACAT
CACCCCTGCTGTAAATAAAACATGAACTGATCATTTATATTCATATCTATTATTTCTTTCCC
TTGGCACGCTACCAGTTCTGCCACTCTATAATAGCAGCATAGATTCAGTAGAACCACACT
TACATGCTCATGATTAATCGATTTTAAAGAATGCCCAAGGCAATCAATTCATGTCCGTGTC
CATATGGGGTCATTCTTTCTATTCAGGTGCATGCAGTACATTGAGATGATGTGCAGGATA
TTCAACCGAGTGCAGTCATTCTTTCAAACAACTTGACAGGTGTAAACTCGTGCCGAATCG
G
SEQ ID NO:162
TCCAGTCAATTTATAGGCATTGGCCGTTGCAGTTCCTGTACTCTCTGTTGTTTCTCTTGAA
GGAAAATCTATGGCGGACGAAGATCTGGATCTTTCCGATGTAGGGGAAGTAGAAGATGA
ACCAGGTGAGGAGATCGAATCCACTCCACCCCTCGCTGTAGGGCAGGAGAAGGAGATA
AACAGCTTGGCTTTGAAGAAAAAACTTCTGAAGGTTGGCACAAGGTGGGAAACGCCAGA
AAATGGAGACGAAGTAACAGTGCATTACACTGGAACTTTACCAGATGGTACCAAATTCGA
TTCCTCGAGAGACAGAGGAGAGCCCTTTACTTTCAAGCTTGGCCAAGGCCAAGTTATCA
AAGGATGGGACCAAGGCATTGTTACCATGAAGAAAGGGGAACGTGCACTATTCACAATA
CCTCCAGAATTGGCCTATGGTTCTTCTGGCGTGCGACCCACTATACCTCCCAATGCCAC
CCTCCAATTTGATGTGGAATTACTTTCGTGGACTAATATTGTTGATGTATGCAATGATGGA
GGCATTCTCAAGCGAATAATATCTGAAGGAGAAAAATATGAAAGACCCAAAGACCCAGAC
GAGGTGACTGTGAAATATGAAGCAAAGCTTGAGGATGGAACACTCGTGGCAAAATCCCC
AGAAGAAGGCGTGGAGTTTTATGTCAATGATGGACACTTCTGTCCAGCTATTGCGAAGG
CAGTCAAAACAATGAAGAGAGGAGAAAGTGTGATTCTCACCATAAAACCTACGTATGCCT
TTGGTGAGCGGGGTAAGGATGCTGAAGAGGGGTTTGCTGCGATTCCTCCAAATGCTACT
CTCACTACGAGTTTGGAATTAGTTTCATTTAAAGCTGTGATAGCAGTAACGGAAGATAAG
AAGGTTATCAAGAAAATCCTGAAGGAAGCTGATGGGTATGATAAACCAAGTGACGGGAC
GGTAGTGCAGATCAGGTATACTGCTAAGCTGCAAGATGGTACAATCTTTGAGAAGAAGG
GATATGAAGGCGAGGAGCCTTTCCAATTTGTAGTTGATGAAGAACAAGTCATTGCTGGTC
TTGATAAGGCAGTGGAAACCATGAAGACAGGGGAGATTGCCCTGATTACAATAGGAGCT
GAATATGGTTTTGGAAATTTTGAAACCCAGAGAGATCTGGCTGTAATTCCTCCAAATTCAA
CTCTCATCTATGAGGTGGAAATGATATCATTTACCAAGGAAAAAGAATCGTGGGATATGG
ATACAACAGAGAAAATAGAGGCCTCTAAACAGAAGAAGGAGCAAGGAAATTCTCTGTTCA
AGGTTGGAAAGTACCAGCGAGCTGCAAAGAAATATGAGAAGGCTGCAAAATATATTGAG
CACGACAGTTCCTTCAGTGCCGAGGAGAAAAAGCAATCAAAGGTTTTGAAAGTATCGTGT
AACCTGAATCATGCAGCATGCCGTCTTAAGTTGAAAGACTTCAAGGAAGCAGTTAAACTA
TGTTCAAAGGTGTTAGAGCTTGAATCACAAAATGTGAAGGCATTGTATAGAAGAGCACAA
GCATACATAGAGACAGCAGATTTGGATTTAGCTGAATTTGATATAAAGAAAGCTCTAGAG
ATAGAGCCACAAAATAGGGAAGTGCAACTGGAGTATAAAATTTTAAAGCAAAAGCAAATT
GAATATAATAAGAAAGATGCAAAGCTATATGGAAACATGTTCGCAAAGCTGAACAAATTA
GAAGCCTTTGAAGGAAAGGTATTGTCTTGAGAGCCATCGAACCTAACAGAGCAAGGGAC
AATTACTAAACGAAAGGCTTTTTTTGTGTATTTACTATTTTTATAGTTTGTAAACTGAATCA
CTGAAGCAATGGTTCGAACATAACTGTTCAATTTTTGCCAACTCTTGGGCAGATACTCTTT
TTAGGCCTTACTTTCACACCTAGTTTTGCATGGTTTATGGAAACTTATATTGCTGTTTTTCT
TTGCAACTAGGATTTTTTTTTTCTTTTTAATGAAAGCTGTTCCATGTTACAAGTTGCCAGC
GTGTCTACTGTCGTTGGCTTGCAAAGACATCTTCTTATCTTATTCCGTGTCTCCATAATCA
CCGGCTTCTTTAGTTCACCTTTGCAATTATAAGCAAGAAGCTTGACTTTGGTGAAAGTCTT
GGAAATTTTGTAGCTTCGTTGCTTCTGTATTAAATTCCAAGCAGAAGGGCACATGTTGTG
ACATCAAGTAGTAGATTGTTCTGCAGATTCTGGTTTCTATTCTTTTCTTTCGTTATACTATA
TGACAATCGAATGGACAAAGGAATTTCATCGAAAAAAAAAA
SEQ ID NO:163
ATTTTCCATTCACTCTGAATATCAATCTCCCCCAACAAAAGCCCATGGAGATTTCACAGGT
CACCTTTACCTAGGGAAGTTCCTTCAGCTTTGGCAGTTTAACTTAAAACTACAAGGCTCC
ATGATATAACAACGTGGCAATCCTAACATGTAATTTCATTGCCAAAATCCGACGTGGATC
AACTGAACTGGACAGCCCCTGCTGTAAATAAAGCTCTCGGAGTCTTGAATCGTTTACAAA
TATTGGCAGACGAATGGTCACACCAGGCAGTTTATCGGCCTTTTACGGTCTCAATTTGTA
GGCGTTGGGCGTTGCAATACGTGTATTCTCAGTAGTCTCATTTGAAGGAAAAATCAATGG
CAGACGAAGGTCTTGAACTCTCCGATGTAGCAGAAGTCGAAGATGAACCAGGTGAGGAG
TTCGAATCCGCTCCACCCCTCGTTGTAGGGCAGGAGAAGGAGTTAAACAGCTCGGGTTT
GAAGAAAAAACTTCTGAAGGCTGGCACAAGGTGTGAAACGCCAGAAAATGGAGACGAAG
TAACAGTGCACTACACTGGGACTTTACTGGATGGTACCAAGTTCGATTCTTCGAGAGACA
GAGGAGAGCCCTTTACTTTCAACATTGGCCAAGGCCAAGTTATCAAAGGATGGGACCAA
GGCATTGTTACCATGAAGAAAAGAGAACATGCACTATTCACGATACCTCCAGAATTGGCC
TATGGCGCTTCTGGCATGCCACCTACTATACCTCCCAATGCCACCCTCCAATTTGATGTG
GAATTACTTTCTTGGACTAATATTGTTGATGTATGTAAGGATGGTGGCATTCTCAAGCGAA
TAATATCTGATGGAGAAAAATATGAAAGACCCAAAGACCCAGATGAGGTGACTGTGAAAT
ATGAAGCAAAGCTTGAGGATGGAATGCTCGTGGCAAAATCCCCAGAAGAAGGCGTGGA
GTTTTATGTCAATGATGGAAACTTTTGTCCTGCCATTGTGAAGGCAGTCAAAACAATGAA
GAAAGGAGAAAATGTGACTCTTACCATAAAACCTGCGTATGCCTTTGGTGAGCAGGGTA
AGGATGCTGAAGAGGGGTTTGCTGCAATTCCTCCAAACGCTACTATCACTATAAATCTGC
AATTAGTTTCATTTAAAGCTGTTAAAGAAGTAACAGAAGATAAGAAGGTTATCAAGAAAAT
CCTGAAGGAAGCTGATGGATATGATAAACCAAGTGATGGAACGGTAGTTCAGATCAGGT
ATACCGCTAAGCTGCAAGATGGCACAATCTTTGAGAAGAAGGGATATGCAGGCGAGGAG
CCTTTTCAATTTGTAGTAGATGAAGAACAAGTGATTGCTGGTCTTGACAAGGCAGTGGAA
ACCATGAAGACAGGGGAGGTTGCTCTGATTACAATAGGACCTGAATATGGTTTTGGAAAT
ATTGAAACCCAGAGAGATCTGGCTGTAATTCCACCATATTCAACTCTCATCTATGAGGTG
GAAATGGTATCATTTACCAAGGAAAAAGAATCATGGGATATGAATACAACAGAAAACATT
GAGGCCTCTAAACAGAAGAAAGAGCAAGGCAATTCTCTGTTTAAGGTTGGAAAGTACCT
GCGAGCTGCAAAGAAATATGATAAGGCAGCAAAATATATCGAGCACGACAATTCCTTCAG
TGCTGAGGAGAAAAAACAATCAAAGGTTTTGAAAGTATCATGTAACCTGAACCATGCAGC
ATGCTGTCTTAAACTGAAAGACTTCAAGAAAGCAGTTAAACTTTGTTCAAAGGTGTTAGA
GCTTGAATCACAAAATGTGAAGGCCTTGTATAGAAGAGCACAGGCATATATAGAGACAG
CAGATTTGGATTTAGCTGAATTTGATATCAAGAAAGCTCTAGAGATAGAGCCACAAAACA
GGGAGGTGCGACTGGAGTATCTTATTTTAAAGCAAAAGCAAATTGAATATAATAAGAAAG
ATGCAAAGCTATATGGAAACATGTTTGCAAGGCAGAACAAATTGGAAGCCATTGAGGGA
AAGGATTAGATTATCCACTCAGATAGTTGTATATCAACTAGAAACTACTGTTGGGAATGG
CCTGAGGAGACTGATTAAATGTTCCTAGCACGCAAAAAGCTGCATTTAGTTGATTAGCTT
CCTGTTTTGACTGACATTTGCGGAAGGGTGAAGGTGAACCGATGGAGGTGGAGAAGCC
AACAAAGAGTTGATGAAAACAGAAAGCCAGAGGCAACGGGTGTAGATTCTAATTAAATAG
CAGCATGTTAATGTAATACATTTAGTTTTTAGATAACTGTTAATGTGTAGTAAAGCACTAG
GAAGAGAAAGATTTGGACATGTCATTGTATTTTGACACTTGAACGGAAGCATGTTAAAGA
ACCTGTGTTATTGTTTTAAAAAAAAAA
SEQ ID NO:164
GTCTTTTCTTGGCTGTATTTTCTGTCTTTGAAGCGGTATTTTGTACGCGAAGCTTGTTTTT
GTTGCTGGTTTTGGTGGCTTTAAGCGGGTTTTGTATGCTAAGCTTGTTTTTATTGGCGGT
TTTGGTAGTCTTTAAAGCGGTTTTGTCCAATCGCTTCTGTGCAGGTTGATCAATTTGGCG
GAAAGTCAGGCTTTTGGTGCAGCTTTTTGCAGAATCTCAGAGGTTTCGAACTCAGGATGC
CGAACCCAAAGGTTTTCTTTGATATGCAGGTCGGCGGTGCCCCAGCCGGCCGGATCGT
GATGGAGCTCTATGCGGATGTGGTGCCAAAGACGGCTGAGAACTTCCGCGCGTTGTGT
ACCGGCGAAAGGCACCGGCCGCTCAGGCAAGCCTCTGCACTTCAAGGGTTCGTCGT
TCCACCGTGTGATCCCAGGGTTCATGTGCCAGGGCGGTGACTTCACAAGGGGCAACGG
TACCGGTGGAGAGTCGATCTACGGCGAGAAGTTTGCCGATGAGAACTTTGTAAAGAAGC
ACACGGGGCCTGGCATCCTCTCCATGGCCAACGCCGGCCCTAACACTAACGGCTCCCA
GTTCTTCATCTGTACCGCCCAGACCTCGTGGCTGGATGGTAAGCATGTCGTATTTGGTC
AAGTTGTAGAGGGCTTGGAGGTCGTGCGCGATATCGAGAAGGTTGGATCTGGATCTGG
CAGAACTTCAAAGCCGGTTGTCATTGCCGACTCTGGACAGCTCGCTTGAATTTTTATTAT
TTACCTTCGCCTTTACGCTGCATACGTTAATAGGTTATTATTTCCTTCAACCATTACGCTG
CATAGGTTGTTAGCGTATTGTTTCCCTTTACCATTACGCTGCATGACTCCCTAGGGTTTG
TCAGCATAGGCGTTTTAAGGGTTTTTTGCATCTTTCTACTCAAGATAGTCGCTGCATAAGT
TCCTAGGGTTTGTCAGCAAAGAGGCTTCAAAGGTTTTTGTGTCTTTTCTAGTTATTATAAA
CGCTTCATAGGTTCCTAGGGTTTGTCAGCATATAGATTTCAAGGGTTTTTTTAGATCTTTG
GAGTTGAGATAAACGCTATGGCAATAACCCAGTAATGTTTGTTTTTATCATATGAAATTTT
TACATCTGGAGTTGCATTCGCAGTAAAAAAAAAA
SEQ ID NO:165
GAAGAAAGTAAGGTTTTGTAGCGAAGAAAGTAAGGTTGATCAATTTGGCGGAAAGTCAG
GCTTTTGGTGCAGCTTTTTGCAGAATCTCAGAGGTAAGCGGTCTTTGAAAAATGAAAATA
AACAAATCGTCTGAGAAAGTAAAACCTTAAAGCCTTTCAAGGAAAAGAACTGAACTTTTTG
GCTTTGAAGAAAAATGATCTTTTTGTCTCTGAACGGCCTGATATGAAAGATCCATGTAAA
GCTCGGCAGGATTTTGTTTGTGCCTTGCAGGTTTCGAACTCAGGATGCCGAACCCAAAG
GTTTTCTTTGATATGCAGGTCGGCGGTGCCCCAGCCGGCCGGATCGTGATGGAGCTCTA
TGCGGATGTGGTGCCAAAGACGGCTGAGAACTTCCGCGCGTTGTGTACCGGCGAGAAA
GGCAACGGCCGCTCAGGCAAGCCTCTGCACTTCAAGGGTTCGTCGTTCCACCGTGTGA
TCCCAGGGTTCATGTGCCAGGGCGGTGACTTCACAAGGGGCAACGGTACCGGTGGAGA
GTCGATCTACGGCGAGAAGTTTGCCGATGAGAACTTTGTAAAGAAGCACACGGGGCCTG
GCATCCTCTCCATGGCCAACGCCGGCCCTAACACTAACGGCTCCCAGTTCTTCATCTGT
ACCGCCCAGACCTCGTGGCTGGATGGTAAGCATGTCGTATTTGGTCAAGTTGTAGAGGG
CTTGGAGGTCGTGCGCGATATCGAGAAGGTTGGATCTGGATCTGGCAGAACTTCAAAGC
CGGTTGTCATTGCCGACTCTGGACAGCTCGCTTGAATTTTTATTATTTACCTTCGCCTTTA
CGCTGCATACGTTAATAGGTTATTATTTCCTTCAACCATTACGCTGCATAGGTTGTTAGCG
TATTGTTTCCCTTTACCATTACGCTGCATGACTCCCTAGGGTTTGTCAGCATAGGCGTTTT
AAGGGTTTTTTGCATCTTTCTACTCAAGATAGTCGCTGCATAAGTTCCTAGGGTTTGTCA
GCAAAGAGGCTTCAAAGGTTTTTGTGTCTTTTCTAGTTATTATAAACGCTTCATAGGTTCC
TAGGGTTTGTCAGCATATAGATTTCAAGGGTTTTTTTAGATCTTTGGAGTTGAGATAAACG
CTATGGCAATAACCCAGTAATGTTTGTTTTTATCATATGAAATTTTTACATCTGGAGTTGC
ATTCGCAGTACTCTCCTGGTCATCGAGAATTTTTTATGTTTTTTTTAATGCGTTTAAAATAT
GCTTAAATGGGTCCGGAAAACAAAAAAAAAA
SEQ ID NO:166
GAAGAAAGTAAGGTAAGAGGATTTTGTTAGCGAGGGGTATTCTCTGGTGGGCTTGGTAG
TCTTAAAAGCGGTGCTTTTATTCGTCTTTTCTTGGCTGTATTTTCTGTCTTTGAAGCGGTT
GATCAATTTGGCGGAAAGTCAGGCTTTTGGTGCAGCTTTTTGCAAAATCTCAGAGGTTTC
GAACTCAGGATGCCGAACCCAAAGGTTTTCTTTGATATGCAGGTCGGCGGTGCCCCAGC
CGGCCGGATCGTGATGGAGCTCTATGCGGATGTGGTGCCAAAGACGGCTGAGAACTTC
CGCGCGTTGTGTACCGGCGAGAAAGGCACCGGCCGCTCAGGCAAGCCTCTGCACTTCA
AGGGTTCGTCGTTCCACCGTGTGATCCCAGGGTTCATGTGCCAGGGCGGTGACTTCACA
AGGGGCAACGGTACCGGTGGAGAGTCGATCTACGGCGAGAAGTTTGCCGATGAGAACT
TTGTAAAGAAGCACACGGGGCCTGGCATCCTCTCCATGGCCAACGCCGGCCCTAACACT
AACGGCTCCCAGTTCTTCATCTGTACCGCCCAGACCTCGTGGCTGGATGGTAAGCATGT
CGTATTTGGTCAAGTTGTAGAGGGCTTGGAGGTCGTGCGCGATATCGAGAAGGTTGGAT
CTGGATCTGGCAGAACTTCAAAGCCGGTTGTCATTGCCGACTCTGGACAGCTCGCTTGA
ATTTTTATTATTTACCTTCGCCTTTACGCTGCATACGTTAATAGGTTATTATTTCCTTCAAC
CATTACGCTGCATAGGTTGTTAGCGTATTGTTTCCCTTTACCATTACGCTGCATGACTCC
CTAGGGTTTGTCAGCATAGGCGTTTTAAGGGTTTTTTGCATCTTTCTACTCAAGATAGTC
GCTGCATAAGTTCCTAGGGTTTGTCAGCAAAGAGGCTTCAAAGGTTTTTGTGTCTTTTCT
AGTTATTATAAACGCTTCATAGGTTCCTAGGGTTTGTCAGCATATAGATTTCAAGGGTTTT
TTTAGATCTTTGGAGTTGAGATAAACGCTATGGCAATAACCCAGTAATGTTTGTTTTTATC
ATATGAAATTTTTACATCTGGAGTTGCATTCGCAGTAAAAAAAAAA
SEQ ID NO:167
CTCTGGTGGGCTTGGTAGTCTTAAAAGCGGTGCTTTTATTCGTCTTTTCTTGGCTGTATTT
TCTGTCTTTGAAGCGGTATTTTGTTGATCAATTTGGCGGAAAGTCAGGCTTTTGGTGCAG
CTTTTTGCAGAATCTCAGAGGTTTCGAACTCAGGATGGCGAACCCAAAGGTTTTCTTTGA
TATGCAGGTCGGCGGTGCCCCAGCCGGCCGGATCGTGATGGAGCTCTATGCGGATGTG
GTGCCAAAGACGGCTGAGAACTTCCGCGCGTTGTGTACCGGCGAGAAAGGCACCGGCC
GCTCAGGCAAGCCTCTGCACTTCAAGGGTTCGTCGTTCCACCTGTGATCCCAGGGTTC
ATGTGCCAGGGCGGTGACTTCACAAGGGGCAACGGTACCGGTGGAGAGTCGATCTACG
GCGAGAAGTTTGCCGATGAGAACTTTGTAAAGAAGCACACGGGGCCTGGCATCCTCTCC
ATGGCCAACGCCGGCCCTAACACTAACGGCTCCCAGTTCTTCATCTGTACCGCCCAGAC
CTCGTGGCTGGATGGTAAGCATGTCGTATTTGGTCAAGTTGTAGAGGGCTTGGAGGTCG
TGCGCGATATCGAGAAGGTTGGATCTGGATCTGGCAGAACTTCAAAGCCGGTTGTCATT
GCCGACTCTGGACAGCTCGCTTGAATTTTTATTATTTACCTTCGCCTTTACGCTGCATAC
GTTAATAGGTTATTATTTCCTTCAACCATTACGCTGCATAGGTTGTTAGCGTATTGTTTCC
CTTTACCATTACGCTGCATGACTCCCTAGGGTTTGTCAGCATAGGCGTTTTAAGGGTTTT
TTGCATCTTTCTACTCAAGATAGTCGCTGCATAAGTTCCTAGGGTTTGTCAGCAAAGAGG
CTTCAAAGGTTTTTGTGTCTTTTCTAGTTATTATAAACGCTTCATAGGTTCCTAGGGTTTG
TCAGCATATAGATTTCAAGGGTTTTTTTAGATCTTTGGAGTTGAGATAAACGCTATGGCAA
TAACCCAGTAATGTTTGTTTTTATCATATGAAATTTTTACATCTGGAGTTGCATTCGCAGTA
AAAAAAAAA
SEQ ID NO:168
GGAATCGCTTGATGCATAATATAAAATCACTCGCTTCCCCCTCTTTCTTTCGCCTAGCCG
TTTCATTGGTTGTTTGCTAAGAATTCAGAGTGTCGTACTGCGCATCTGCTTGGCTTTTGAT
TCTGTTTATCTTCGTTAACATTTCAAGTGTGTAATTTTCCTCAGCAACGTTCCAATGGCGG
ACGATTTTGAACTCCCTGAAAGTGCTGGAATGATGGAAAATGAGGACTTTGGCGATACTG
TTTTCAAGGTTGGTGAAGAGAAAGAAATTGGGAAGCAGGGGTTGAAGAAATTGTTGGTTA
AAGAAGGGGGATCGTGGGAAACACCCGAAACCGGTGATGAAGTTGAAGTTCATTACACC
GGAACACTTCTGGACGGTACAAAGTTCGATTCTAGTCGGGATAGAGGAACTCCATTCAA
ATTCAAGCTTGGTCAAGGTCAAGTAATCAAAGGATGGGATCAGGGGATTGCAACGATGA
AAAAGGGGGAAAATGCAGTCTTCACCATTCCTCCTGATCTGGCATATGGGGAATCTGGA
TCGCAACCTACAATTCCACCCAATGCGACTCTGAAATTCGATGTTGAATTGCTCTCTTGG
GCTAGTGTAAAAGATATCTGCAAGGATGGGGGCATTTTTAAGAAAATTATCAAGGAAGGG
GAGAAATGGGAGCACCCAAAAGAAGCTGATGAAGTTTTGGTTAAATATGAGGCTAGACT
GGAGGATGGAACTGTTGTATCAAAATCAGAGGAGGGTGTGGAGTTTTATGTAAAGGATG
GATATTTCTGTCCAGCTTTTGCCATAGCTGTCAAAACAATGAAGAAGGGAGAAAAAGTAT
TGTTAACAGTGAAGCCTCAGTATGGTTTTGGACACCAAGGACGCGAAGCAATTGGAAAT
GATGTTGCCCGTTCCACAAATGCAACATTGTTGGTGGATCTTGAGCTTGTGTCTTGGAAG
GTTGTCGATGAAGTTACTGATGATAAGAAAGTGCTGAAGAAAATCCTGAAGCAAGGAGAA
GGTTATGAACGACCTAATGATGGGGCAGTGGTCAAAGTGAAATACACGGGGAAGTTAGA
GGATGGCACCATATTTGAGGAAAAGGGATCTGATGAAGAACCATTCGAGTTTATGGCTG
GAGAAGAGCAAGTTGTTGATGGATTAGATAGAGCGGTTATGACAATGAAGAAAGGAGAA
GTTGCCTTGGTGAGTGTGGCAGCTGAATATGGTTACCAAACTGAGATTAAGACAGACTT
GGCAGTTGTTCCACCAAAGTCCACCTTGATTTATGAAGTGGAGCTGGTTTCATTTGTAAA
GGAGAAAGAGTCATGGGACATGAACACCGCGGAGAAGATAGAGGCAGCTGGGAAGAAG
AAAGAAGAAGGGAATGCATTATTTAAGGTCGGCAAGTATTTTAGGGCATCCAAGAAATAT
GAAAAGGCTACAAAGTATATTGAGTATGACACTTCATTCAGCGAGGAAGAGAAGAAGCA
GTCGAAGCCCTTGAAGGTTACTTGTAATCTGAACAATGCAGCTTGTAAACTCAAATTGAA
AGATTACACACAAGCAGAAAAATTATGTACAAAGGTTTTGGAGGTTGAATCGCAAAACGT
TAAGGCTTTATATAGAAGAGCACAAGCATACATTCAGACAGCAGATTTGGAGCTCGCAGA
GCTGGACATAAAGAAAGCACTCGAGATTGACCCCAACAACAGGGATGTGAAGCTCGAGT
ACAGGGCTCTTAAAGAAAAACAAAAGGAGTACAATAAGAAGGAAGCCAAATTTTATGGCA
ACATGTTTGCACGAATGAGCAAGTTGGAGGAATTAGAAAGCAGGAAATCTGGGAGCCAA
AAAGTGGAGACTGCTAATAAAGAGGAAGGGTCTGATGCCATGGCTGTAGATGGCGAGTC
TGCCTGAAGTGCTTTTGCAAGTTTAGCGATATATTTCTATGCAGTCCTTTATAGATGAGTC
TACCTTTTAGCCATTCACAATGAAGATTGTAAGTTGGGTGAACTTTTTTACCACGCTAGGT
TGATCTATTTTAAGACTCTTTTGCAAGATGCTTCTGCATTCATAGCAAAATTGTATGAGGG
ACTGGCAAGCTGAATTTGCTTGAAAGTGATGAAAGAGGATGTATTTTTGAAGTATTGCCT
TGCTCTTAGTCTAATTGACTTAGATAATGAATCTCTTGGTTTTTGTTGAAAAAAAAAA
SEQ ID NO:169
AACAAAATAGCTGCGCGTACCACAAAGGTGACAAACGCCGGATTTCTCTTATCAGACTTG
TCAATGGCCGCCTCTCTCACTCCACTGGGTGCAGGCCTGGCTTATGCAACAATTTATGAT
CAAGCTAAAGTGAGGAAATTGGAACCCACAAAACGGTCCTTGATAGCGCTGTGCCAACA
TTCCGATTCTCAACATAGAAGATTTATTACTAGAAAATATCATGTGAACGTTCAAATTCTTA
ATCGAAGAGATGCAATCAGATTAATTGGTTTAGCAGCTGGACTATGTATTGATCTTTCTCT
GATGTATGATGCTAGAGGAGCTGGTTTACCTCCACAAGAAAATGCAAAGTTGTGTGATAC
AACGTGTGAGAAGGAACTTGAAAATGCTCCAATGATCACAACTGAATCTGGTTTACAATA
CAAGGATATCAAGATTGGGAATGGGCCAAGTCCACCTATTGGATTTCAGGTGGCAGCAA
ACTATGTAGCAATGGTACCATCTGGACAAGTTTTTGATAGTTCATTGGATAAGGGTCAAC
CTTACATCTTTCGAGTAGGTTCTGGCCAGGTGATCAAGGGTCTTGATGAAGGCCTTCTGA
GCATGAAAGTTGGTGGAAAACGTCGCTTGTACATCCCTGGCCCTCTTGCATTCCCAAAG
GGCCTTAACTCTGCTCCAGGAAGGCCGCGGGTAGCACCTAGCAGTCCTGTTATTTTTGA
CGTGAGTTTGGAGTTCATACCTGGCCTTGAAAGTGAAGAAGAGTAAACTGCTTGCATGA
CAGATTAGCAAGTCAAGTCATTGCGTAGAGTGATGGTCACCATAG~CTCTTTAGTTTCAT
TCATTTGATTGATTATTCTTTTTATAAAATTTGTTTCCGTTTTCCTTGTTTGGAAGAAGGCT
TACTGTTTTGAACTTGAAAATATGTTGAAATAGGTTAAATGCACTGTTAATTCTGGACGGA
TGGAAGGATTTCCTTTCCAATCATTTCCTTGCAACAAAGAGCTGACCACGGCAAGCTTAA
GCATCTGATGAATGATGCCACATGTATATCTTTTCTTAATATAATATATATCAAGCTTTTTT
CGTAAAAAAAAAA
SEQ ID NO:170
GTGTGAATCCATTTAAGTCTATCCAAAGGCCAAACAAATCTGAAAGAATTTTGTTTAACAC
ATTATCCATTTACGGGACAGGGCAAGCAGCAATGTCGGCCGCATCACTTTCTGCAGACA
TGGCGATCCGTGGCACTATTCTGGGGAAGACAGCATTACATGTTTTGGGACCTCAGGTT
GTATCCCAATGTCGCCAACCAGTGATGTTCAAATGTCCACCCCATACTCTCAGGAAGATG
AGATTTTCTGCTCAAGATTTGCAGTCAAAAAATTTCTATTCAGGATTTACACCATTTAAATC
AGTTTTTATTTCTACTTCAAAAAGGAGTTGGCAAGCAGGTTCTGCCAGAGCTATGTCACA
GGATGCTGCATTTCAGTCAAAGGTGACAACTAAGTGCTTTTTGGACATTGAAATTGGAGG
AGATCCTGCAGGAAGGATTGTGCTTGGACTTTTTGGTGAGGATGTACCAAAAACAGCTG
AGAATTTCCGCGCACTATGTACAGGAGAGAAAGGGTTTGGATACAAAGGATCTTCATTTC
ACCGAATCATAAAAGATTTTATGCTTCAGGGTGGAGACTTTGACAGGGGAGATGGTACA
GGAGGAAAGAGCATATATGGTCGCACTTTTGAAGACGAGAACTTCAAATTAGCCCATGTT
GGACCTGGAGTTCTAAGTATGGCTAATGCTGGCCCCAACACAAATGGCAGCCAATTTTTT
ATATGTACAGTCAAAACCCCATGGTTGGATAAACGCCACGTTGTGTTTGGACAAGTTATT
GAGGGGATGGAGATTGTTAAGAAACTCGAGTCGGAAGAAACAAATCGTACAGATCGGCC
AAAAAGGCCCTGCAGAATTGTTGATTGTGGGGAGCTTCCCTGAAGTTGCTGAAACAGAG
GCTTTAATTGTTTCAATTGAGTGATCATCTGAAAATAATTTTGTTTCGTCTGAAAAGAGGGC
TTTAATTATCATAGTTTTATTCCGGCTATCTTGATCATTCACGGAAGTCCCGAGAGTCAAT
TTGCCAAGTATTCCTATTAAACTCTATTGCTCTCAGAATTGATTTAAATGCCTTTAATGGG
TGCAATGTAATAGTCTTATTATGAGTCAATATGAACTGTTAGGGTAATGCCAAAAAAAAAA
SEQ ID NO:171
GCTAGATTTCGTCGAGGGAAGTCGAGTCGGAGACAAATTCGGGTATAATATGGAGGATG
GGTGAGTCAGAATTGGGTAAAAATAGAAGGGAAGACAGAGGATCACCCCAAAACATATT
TGTCTGATTGATCGATTGATTTCTAAAAGGTCCAACGTTGTTTTCTGCTCCTCCACAACCA
AATAGTTCATCTCCCCATCTCCTGCTTTTCAGATTCACTATACATTCAAAATTCGAAATTC
GATATTCGGGACTGTGTAATCGAAGGTCTATCTATTTCAAGGTCCCCAGATCGATCTTAC
TCCGAAGGAGAATCGTTTCGATCATCTTTGCTGAAAGTGAGAGCAGAAACCTTTGAGACA
AGTCAGGGGCAATGGGGAGAATTAAGCCACAGACATTGTTGCAGCAGAGCAAAAAGAAA
AAAGTACCCGGCCGTATCAGCGTTTCCACCTTAATAGTGTGCAACTTGATCATCATCTTT
CTTATGTTTTCTCTAGTGGGCATCTACAGGCAGAGGGCCAAGCGTAATCGGGCAACATC
TCGGTCTGATGGTGATGAGGAAATGGAGAACTTTGGGAGGTCCAAAATAAACAGCGTTC
CTCACCAAGCAATCGTAAATACTACAAAGGGTTTAATCACATTGGAACTTTTTGGTAAAAG
TTCTGCGCATACCGTTGAAAAGTTTGTGGAGTGGAGTGAACGTGGTTACTTCAATGGATT
ACCCTTCTATCGTGTCATTAAACACTTTGTAATTCAAGTTGGAGATCCAAAGTTTGCTGGA
AACAGGGAAGACTGGACTGTTGGTGGTCAGCTCAATGTTCAACTTGAGTTCAGTCCAAA
GCATGAAGCATTCATGCTGGGGACCTCCAAGTTGGAGGATCAGGGAGATGGATTTGAGC
TTTTCATAACAACAGCACCCATTCCGGATTTAAACGATAAGCTTAATGTCTTTGGGCGTGT
TATAAAAGGTCAAGATGTAGTTCAGGAAATAGAGGAGGTGGATACAGATGAACATTTTCA
GCCGAAATCCCCCATCATCTTAAATGATGTCCGTTTAAAGGATGAGCTCTGAGACATACG
TGCAGTCCTGTTTCAAGAAATTGTGATCGATAATCAGTTTTTCTTCCAAGTATTGCTTTTC
GCCTGTTCTGTCTCGAGCATTTTATATACCCGGAGGAAACAGCGCTGTGGATACCAGGA
AGATTGAAGTGTATGTATATTTAGGCAAATACTTCGTAAGTTTTGTGATGTCTCAGTTGTT
GGCTTTTACCTGTAGAGAAGTTGAGTTCTAAGCTGGAGATTTATTTCTTTACCACCTGAAT
ATTCTCATAGTTCTAGGCCAATTATAATTCTTTAGTCCCTTCTTATGTTGGGGAATGAAAT
TTACTGTTATTACTTTACACTGAATGATCATTAATGTGCCTCAGGATTTTAGCATCTCAGTT
TGCCGCTGAACATTTGACAAGAGATTTAATGATTAAGATTTCTTTTGACATTGGGAGAGAT
TTTTTCTAACTTGCGTTGAGACAAAATGACAATGAACTGCAATCCACTTTCGACAAAAAGG
AAGAGATCAAGAATCAAGTTTGATGGGCTGACCCCATGTTAACTTATTAGTTATTATAGTG
GAGGTTCCCATTTCCAAGTCAAACAGTTTGGATACCCACTTAATGATACCAGTATAACTAT
TAATTTTTGTAAAGCCACCAAGCTTGTATAAATGATGCCTTTCTTTTCAAAAAAAAAA
SEQ ID NO:172
CTTTCAAAGGGACGGGTTGCGAGAAATATGGCGAGGCAAAGTACTCTACTGCTCTTTTG
GAGCTTAGTATTCTTGGGTGCCATCGTATTTACTCAGGCCAAGCATGAGGAACTGGAAG
AGGTAACACACAAGGTATACTTTGATGTGGATATTGCTGGAAAACCTGCTGGTCGAGTTG
TTATTGGACTCTTTGGGAAAGCTGTACCCAAAACTGTAGAGAATTTTCGGGCGCTTTGTA
CAGGTGAGAAGGGCGTTGGAAAAAGTGGAAAGCCACTTCATTACAAGGGAAGTTTTTTC
CATAGGATCATTCCCAGCTTCATGATTCAAGGTGGTGATTTCACTCTTGGAGATGGCAGA
GGGGGAGAGTCAATCTATGGAACTAAATTTGCCGATGAGAACTTCAAACTGAAGCACAC
AGGACCAGTTTTTATTACCACAGTAACAACCGACTGGTTGGACGGCAGACATGTTGTCTT
TGGAAAAATTATTTCTGGAATGGATGTTGTATATAAAGTAGAAGCAGAAGGAAGACAGAG
TGGTCAACCAAAAAGAAAAGTAAAAGATTGCAGACAGTGGAGAGCTTTCTATGGATTAGCT
AACTCTTCTAGTTGAGATCTCCATCAATTAATGGATACAAACATTGAGTTTCACTTTGGCA
AACCGACATTCCCAAATTTAATGAGCCAATATGTTTGGCTCGTCATAAAGCCTGGGATAA
TCTTAAATAATAAATATCGAGTTGGTGACTCAAAAAAAA
SEQ ID NO:173
GAACGACTTTCAAAGGGACGGGTTGCGAGAAATATGGCGAGGCAAAGTACTCTACTGCT
CTTTTGGAGCTTAGTATTCTTGGGTGCCATCGTATTTACTCAGGCCAAGCATGAGGAACT
GGAAGAGGTAACACACAAGGTATACTTTGATGTGGATATTGCTGGAAAACCTGCTGGTC
GAGTTGTTATTGGACTCTTTGGGAAAGCTGTACCCAAAACTGTAGAGAATTTTCGGGCGC
TTTGTACAGGTGAGAAGGGCGTTGGAAAAAGTGGAAAGCCACTTCATTACAAGGGAAGT
TTTTTCCATAGGATCATTCCCAGCTTCATGATTCAAGGTGGTGATTTCACTCTTGGAGATG
GCAGAGGGGGAGAGTCAATCTATGGAACTAAATTTGCCGATGAGAACTTCAAACTGAAG
CACACAGGACCAGGATTCCTCTCCATGGCAAATGCCGGTCCTGACACGAATGGCTCTCA
GTTTTTTATTACCACAGTAACAACCGACTGGTTGGACGGCAGACATGTTGTCTTTGGAAA
AATTATTTCTGGAATGGATGTTGTATATAAAGTAGAAGCAGAAGGAAGACAGAGTGGTCA
ACCAAAAAGAAAAGTAAAGATTGCAGACAGTGGAGAGCTTTCTATGGATTAGCTAACTCT
TCTAGTTGAGATCTCCATCAATTAATGGATACAAACATTGAGTTTCACTTTGGCAAACCGA
CATTCCCAAATTTAATGAGCCAATATGTTTGGCTCGTCATAAAGCCTGGGATAATCTTAAA
TAATAAATATCGAGTTGGTGACTCAAAAAAAAAAA
SEQ ID NO:174
GTGGCCAATGAGATTACTGAAGCTGAAGACGGATATACAACCCGGACCACATTCCCAAT
TGAATTTGTAGCAGCTCTTGTGCATGCCCACGTAGTTCTGACATCAAATTTAGGACGCAA
CATAATGTGATTCAACTTCGCATTTCTTCTTCTAACGACCAGCGGTTTTCTCCCTTGAAGTG
AGGCTTCGTGTCATTTAATAACTAGGTAGGGTTTTCAGAAGTGGGGTTCATCGCATTTGC
CAGTACTTGCAATGGTACAGCGGAGAGAAGGGGGTGCGAGGTACAATATTGAAAATGGC
GGATCGTGATTATGCGAGGACAATATTGGTCAAATTGAATTGACTGGATCTCATGGTTAG
AGAGCTTAGTGGGATCGAGACGCTACAGAGACAATATACAGGAGGGGTCTCTCGTTCAT
CACTCTAGACAGTGTGTAACTGCACTTCAACAAATTAATTTTAGAAAGAATCTCTGCTATT
TATGGAGATGGACGAGATTCAGGAACAGTCCCAACCCCAATCCAGTGAGAAGCAGGATA
TTTCTCAAGAATCTGACACAGGCAATGATAAAACAATTAATGCTGAAAAGATCACATCCG
AGAATGCTGAAGTTGAGGAAGATGATATGCTTCCTCCAAAAGTTAATACTGAAGTGGAAG
TTCTTCATGATAAAGTTACAAAACAAATCATCAAGGAAGGGAGTGGAAACAAACCTTCAC
GGAATTCGACATGCTTCTTGCACTACAGAGCGTGGGCTGAAAGCACTATGCACAAGTTT
CAGGACACCTGGCAAGAACAACAGCCACTCGAACTGGTTTTGGGCAGAGAGAAAAAAGA
ATTGAGTGGCTTTGCCATTGGTGTGGCTGGCATGAAAGCTGGTGAACGTGCCTTGCTTC
ATGTGGACTGGCAATTAGGTTATGGGGAAGAAGGGAACTTCTCTTTTCCAAATGTACCAC
CTAGAGCCAATCTTATATATGAAGCTGAGCTTATTGGTTTTGAGGAAGCCAAGGAAGGTA
AGGCACGTAGTGATATGACAGTAGAGGAGCGAATTGAAGCAGCAGATAGGAGACGGCA
GCAAGGTAATGAACTTTTCAAGGAGGACAAATTAGCAGAGGCTATGCAACAGTATGAAAT
GGCCCTAGCATACATGGGAGATGATTTTATGTTTCAGTTGTTTGGAAAATACAAGGATAT
GGCAAATGCTGTGAAGAACCCTTGCCATCTTTAAATGGCTCAATGTTTGTTGAAGCTGAA
CCGTTATGAAGAAGCCATTGGCCAATGTAACATGGTATTAGCTGAGGATGAGAAGAATAT
CAAGGCTTTATTCAGACGTGGTAAGGCAAGAGCTACTCTAGGGCAGACTGATGATGCAA
GAGAGGACTTTCAGAAGGTTCGGAAATTTTCTCCTGAAGATAAGGCAGTGATACGAGAG
TTGCGTCTCCTTGCTGAACATGATAAGCAGGTTTATCAGAAACAGAAGGAGATGTTCAAG
GGTCTTTTTGGGCAAAAACCAGAACAGAAACCAAAGAAGTTACACTGGTTTGTTGTGTTT
TGGCAGTGGCTTCTGTCGATGATAAGAACTATTTTTAGGATGAGATCCAAAACTGACTAA
TTAATTATGGTTCAATTTGCCGATCTTGTTAATAATAAATATAACTTAATTTGTCTGTTGCC
AATAAGATCTATGCAACAATGCTATTCGACTTGGAAGTTGTGAAGTGGATCACTACTGGA
TTCCGTTACATTAGTTATTGCAAGTTGGTTATTATGTACGTTTATATCACAGAACTTTGACA
ATTGTCTGTGAACTACAGTATTAAATAGTTTTGGTGCTTGTTACTCTTAAAAAAAAAA
SEQ ID NO:175
CTTTTTTCATTAATTTGGTTATGAGTTCGCCATCTTGTATACTATAACCGGGCAATGATGT
TTGTCACTCAGTGGTGAGGAAAGGGAGAACAATGGCAGGAGCAGGAGAAGGAACTCCC
GAGGTCACCCTCGAGACCTCCATGGGTCCTATCACAGTGGAGCTCTACCACAAGCACGC
CCCCAAGACCTGCAGGAATTTCTTGGAGCTCTCAAGAAGGGGCTATTACAACAATGTCA
AATTCCACCGCGTTATCAAGGATTTCATGGTGCAAGGCGGAGACCCAACTGGCACAGGA
AGAGGGGGGGAGTCAATTTATGGCCCAAGATTTGAAGATGAAATTACCCGAGATTTGAA
ACATACTGGAGCGGGAATATTGTCCATGGCTAATGCAGGTCCTAACACAAATGGCAGTC
AATTCTTCATTTCACTGGCACCGACACCATGGCTTGATGAAAAACACACTATTTTTGGAA
GGGTATGCAAAGGGATGGATGTTGTCAAACGGCTTGGAAATGTTCAGACTGATAAGAAT
GATCGACCTATTCATGATGTGAAAATCTTACGAACAACAGTCAAAGACTGATAAATTCCAT
ATCAAGAAGGTCAACAGCTCCTATCAGACTACGTTGGATCATCTGTTGCAGGTTACTGCA
ATTTTTAGACTTCTTTTTATTATGCTTCATCCTTTAATGCATCCTTCAGTGAAGCAACTACT
ATTGATGTTTTGGCTCTAAAAATATAGACTTCGTTTATTTAAATTTTCCCAACATAAAATA
TGAAAGACCTAAAACAGAATCTCGTGAAATTTGCATTAGAAGATGTTACCTATATTTTGTA
TATGGTTTCAGTGGAGTTTCCTGATGTGCAGATTACACCATGAACAAATGCAATTACCCT
GTTTTATTCTATCCCGCTTTAATTAATATTGGTCATGTTTTTCTTATTCTGGCTTGGGAATT
TGGTTGGCAGATTTACAAGGTATTGTCATT
SEQ ID NO:176
GCTCTATTATAGACCAGCTTCATCATAAGGTAAAGTTTGGCGAGCCGCGTGGACTTATTC
TTTGACCTTTTTGGCCAATAAATTTCAATCCCCGAACCATTCGGTCATCGAAATTCTCCCC
GAGATCGCTCTGTAATTCCATGCCATTTGAGCCCTTAATTTGAATTGTTTCTTTCAAGCAC
TGTATTTTAATTTTTCCCGGTGAAGTGTTAGTGTTGGTGGATTTTCCGCTGACAATCGAAT
CAGGGAATGATGGATCCGGAGTTGATGAGGCTGGCACAAGAGCAGATGAGCAAGATTT
CCCCCGATGAACTCATGAAGATGCAACGACAGATAATGGCCAATCCAGACTTAATGCGT
ATGGCATCAGAGAATATGAAGAATTTGAAGCCTGAAGATATAAGGTTTGCAGCAGAGCA
GATGAAAAATGTACGCAAGGAAGAAATGGCTGAAATAAGTGAGCGCATATCAAGGGCTT
CACCAGAAGAGATTGAAGCAATGAAAGCTCGTGCAAATCTTCAGTCTGCATATCAATTAC
AAGTAGCTCAGAACCTAAAGGATCAGGGAAATCAGCTTCATGCTCGAATGAAATACAGTG
AAGCAGCAGAGAAATATTTGCAAGCAAGGAACAATCTGACAGGAATACCCTTCTCCGAA
GCTAAGAGTTTGTTATTAGCTTCTTCCTCGAATTTAATGTCCTGCTACCTAAAGACTGGGC
AGTACGAGGAGTGTGTACAAACAGGTTCAGAGGTTCTGGCATATGATGCAATGAATGTTA
AAGCATTGTATCGCAGAGGCCAAGCCTACAAACAAATTGGAAAGCTAGAATTGGCTGTT
GCAGACTTAAGAAAAGCGGTGGAGGTTTCTCCTGAGGATGAAACAATAGCACAGGCTTT
AAGAGAAGCAAGCACAGAATTAATGGAGAAAGGGGGCACTCAAGATCAAAATGGCCCTC
GTATTGAAGAGATAATTGAGGAAGAAGCTGTTCAGCCAACTGCTGAAAAGTATCCGCAGT
CGGCTCCTATGGTGACTTCTGTTACAGAAGATGTAAGTGATGATGAGCAGGGGTCAGAA
GATCAAAATGGGTTCTCTAGAGATAGCTTTCAAGCAACTAATGCGCCCGATGGGCAGAT
GTATGCTGAGAGCTTGAGGAACTTGACTGAAAATCCTGACATGTTAAGAACTATGCAGAG
TTTGATGAAAAATGTCGACCCAGACTCCTTGGTTGCACTAAGTGGAGGTAAATTGAGCCC
TGATATGGTCAAAACTGTATCTGGTATGTTTGGTAGGATGTCACCAGAGGAGATCCAGAA
TATGATGAAAATGTCGAGCACCTTATCAAGACAAAATCCATCTACTTCATCTAGATTTGAT
GACATCACACGTGGACATTCAAACATGGATTCATCTCCACAATCTGTTTCAGTAGATAAT
GACCTTTTTGAAGAAAATCAGAATAGAGTTGCGGAGTCATCTACAAATTTGAGTTCCAGT
GCAGCCTTCTCAGGCATGCCAAACTTTTCTGCAGAAATGCAGGAGCAAGTGAGAAATCA
AATGAATGATCCAGCCACCAGACAGATGTTTACGTCAATGATACAAAATATGAGCCCTGA
AATGATGGCAAGTATGAGCGAGCAGTTTGGGGTTAAGCTTTCACCTGAGGATGCAGTTA
AGGCACAAAATGCTATGGCCTCTCTTTCTCCCAATGACTTAGACAGATTGATGAATTGGG
CAACAAGGCTTCAGACTGCAATTGACTATGCTCGAAAGATCAAGAACTGGATACTAGGAA
GGCCAGGTCTTATTTTTGCAATTTCCATGCTTCTTCTTGCTATTATTCTTCATCGGTTTGG
ATATATTGGAGACTAAGAAACGATACATCATTTGGATTTTCAAATATTGATGTAGTTGATG
CGTGCTTAAACCAATTGAGGAAGCTACTTCAGGCATGCATTTTGATCCTTTATGACCTTAT
TTCCAGCAATGCCAGCCGTATAGTTAACTGAGAAATGCTGGGAACTCTGCTGGAATTATT
TGTTAACAAGGAAGCAGTTGACTTTTGCTTGTGGATTGTACTGTGGTACATGGTATAAAT
CTATAGGCTATGTCGATTATTTTTGGCATCCTCATGCCTTTTATTAAAAACACACGGATGC
TTGAGAATGATGTTTTCCTTTGTGTTAGGTCAGTTTTTGTCCAAATCATGTGGCACCTTTG
GTTAATAACATTATCCACATCAATTTGTACATGGGATCTTCTTACGTTTATTTTTATTTTTTT
AAAAAAAAAAAAAAA
SEQ ID NO:177
ACGGACGACTTCCATTTTTTCCACGTGAATCCAATTGTTCGCTAGCTTTTAAAGGAATTG
GTTTTACGCACATAAAAAGAAGAATGGGAGTAGAGAAGGAGATTTTGAGGCCTGGTAAT
GGCCCCAAGCCGCGTCCTGGCCAGAGCGTGACGGTCCATTGCACAGGCTACGGTAAGA
ATGAGGATCTGTCTCAGAAATTCTGGAGCACAAAAGATCCAGGACAGAAGCCTTTCACAT
TCACAATTGGGCAAGGGAGAGTTATTAAAGGATGGGATGAAGGCGTTTTGGACATGCAA
CTAGGAGAGATTTTTAAACTTCGGTGTTCACCGGATTATGGATATGGGTCAAATGGATTT
CCAGCATGGGGTATACGGCCGAATTCTGTTTTGGTTTTTGAGATTGAAGTTTTGAGTGTA
AACTAGAGTACTATATCATTATAGAATATATGATATATAAGATATAAGATATTGCCAGCAAA
CTATTTGACAGGTTATTTAATAAAGTGTGCTACATTTTCATTGTATTTTGATTAAGATTTCT
AAGGTATGATGGATGGTGTTTGTAGTAAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:178
AATTAACGAAGGAATCAGGCCCTAAGCCTCAAGCTTCGGCCAACCATAGTCATAACAACA
CAACTAGGGTTTTAAGCTTTTGGCATGTTAAGTCTGGTCCACAAGTAAGAGGTTCGAATC
CGCGGAGATGCCGAATCCAAGGTGTTACCTGGACATCACTATTGGGGAGGAGCTGGAG
GGGAGGATCTTGGTGGAGCTTTACAGCGATGTGGTGCCCAAAACGGCCGAAAATTTCAG
GGCTTTGTGCACAGGAGAAAAGGGCATTGGTCCTCATACCGGCGTCCCCCTGCATTACA
AGGGGCTTCCTTTCCATAGAGTTATCAAAGGTTTTATGATACAAGGTGGAGACATCTCAG
CTCAAAATGGTACAGGGGGTGAATCAATATATGGATTAAAGTTTGATGATGAAAACTTTC
AACTGAAACATGAGCGCAGAGGAATGTTGTCAATGGCAAACTCTGGACCTAATACAAATG
GCTCTCAATTTTTCATCACAACAACAAGGACCTCCCATCTGGATGGAAAGCATGTTGTCT
TTGGGAAGGTAATTAAAGGCATGGGAGTTGTGCGCGGCATTGAGCATACTCCCACAGAA
AGTAATGACCGCCCTTCTCTAGATGTTGTAATTTCAGATTGTGGGGAGATCCCAGAAGGA
TCAGATGATGGAATAGCTAATTTCTTCAAAGATGGAGATTTGTATCCAGACTGGCCAGCT
GATCTTGATGAGAAATCTGCAGAAATTTCATGGTGGATGAATGCAGTAGATTCTGCGAAA
TGTTTTGGGAATGAAAATTACAAGAAAGGGGATTATAAGATGGCTTTAAGAAAATACAGA
AAGGCATTGCGCTATCTTGACATTTGCTGGGAGAAAGAAGAAATTGATGAAGAGAAGAG
CAATCATTTAAGGAAGACCAAGTCGCAGATATTTACAAATAGTTCTGCTTGTAAATTAAAA
TTAGGAGATTTAAAAGGTGCCTTATTGGACACAGAGTTTGCAATGCGTGATGGGGAAGAT
AATGTAAAAGCATTATTCCGTCAGGGTCAGGCATACATGGCTCTGAAAGATGTTGATTCT
GCTGTAGCAAGCTTCAAGAAAGCATTGCAGTTAGAACCTAATGATGCCGGGATTAGGAA
AGAGCTTGCAGTTGCCACGAAGATGATTAATGACAGACGTGATCAGGAACGGAGAGCCT
ATGCAAGGATGTTTCAATAGCCTATCTTATCATTTTTGTCCGCATGTAGTTTAGGATATTA
GGAAGAGTTTAGGGAAAGGACAGGCATTTGATATCTCTCATTGGACTATATCGGGATTCC
TTTTTAAAAGTACCCAGTGATATAGGAACTGAGCAAGAGGCAAACAAATTCAAATCAGGG
TTGAGCCACGATTTTACATTTTAACGAATATGATGATAACATGGTTTGTCAAGGTCTTGCA
GCAATGTCTTGCAGCATCGCCAATGTCAAGTGTGAATAATGAAGGAAAATATTTTTTCAAT
TGATTGAACATGGGCAGCATTTTGAAATTAATTCCCTTTTAAATGTGGACAGAGGCACTAT
AAGAATGCGAAATATCGTCGGAGCACGACTAATTGTTTTTTAACTTGCGCTTTTCTTTTTT
AGGCGAATGTGAAAAATTCCTTGCTGGTAGATATTGTTGAGTTATAATGGACAAAAATAG
AGCCAGTGAGCAATGGTGATTTTTCTTAATGAAAAATTGATATTATTGCC
SEQ ID NO:179
CGAAATAGTGGAATGCGTATTGATATGGCTAAGAGTTCTCTCTGCAATTCTCAGAAACCC
TGGCAGATAATTAAGCTCAACGCTGCTACTGTTAGCGTCAGTTACCTTGGTGGTAGACAA
GAACAAGAAGCGCAAGTAGTATCTAGAGCAACAGAAGCTGTGGATTCACACGGCCGTCA
TTTGACAGTGCCTTCCAGTGTGTTCACCTCTGCATCATCTCAGATTTGAGCCCAGTCTTA
ATTACCTGAATTATAGACAGAAAATATGGGGGATCGTAATAGATCTAAATGGAGATGGAGG
TGTGCTGAAAACCATAATAAGGTCAGCTAAGCCAGGTGCTATGCAGCCAACTGAAGATC
TTCCCAATGTCGATGTACATTATGAGGGCACCCTCGCAGATACTGGTGAGGTATTTGATA
CTACCCGTGAAGATAATACTCTCTTCTCATTTGAGTTGGGCAAGGGCACAGTAATCAAGG
CTTGGGATATAGCTGTTAAAACCATGAAGGTTGGTGAGGTTGCAAGAATTACTTGCAAAC
CTGAATATGCTTATGGAAGTGCTGGCTCTCCTCCAGATATTCCAGAAAATGCAACACTCA
TTTTCGAGGTGGAGTTGGTAGCATGCAAACCACGGAAAGGTTCAACTTTTGGCAGTGTTT
CAGATGAGAAGGCCAGACTAGAGGAGCTGAAGAAACAGAGAGAAATTGCTGCAGCTAG
CAAAGAAGAAGAAAAGAAACGCAGGGAGGAAGCCAAAGCAACAGCGGCTGCTCGTGTT
CAAGCAAAATTGGAGGCAAAGAAAGGGCAAGGTAGAGGGAAAGGGAAAAGCAAGGGAA
AATGATAGACTAGTTCTACAAAGCCCTAGGATGATGGACTTCATTTCTTTTGCATTAAGAT
GAAGTTATAAATATAAACTTCAATATATTATTTTTTGGTTTGCTTCCCACTTGATTAAAAAC
CAAAAAGGTCCATCTAAATTTTTGTTTTTTGGATGATATGTTTGTGCAATGTAACTGTTTAC
ACTTCTCGGTTTTTGCTTAAAAAAAAAA
SEQ ID NO:180
AACACATTGAAAATAATATTTGTATTTTTCCACTGACATGGGACTCGGTTTAAAAATTGCC
TCTGCATCGTTTCTTCCCATATTCAACATCATGGCGACTCGTTCTCTGTGTATCCTGCTG
GTCTGCTTCATCCCTGTTCTCGCACATGTTTTATCGCTTCAAGATCCAGAGCTTGGCACA
GTCCGTGTATATTTTCAGACTACATATGGTGATATTGAATTTGGATTTTTCCCTCACGTGG
CTCCTAAAACTGTGGAACACATTTATAAACTTGTACGGCTTGGCTGCTACAATTCCAATCA
CTTTTTTCGGGTTGACAAGGGCTTTGTTGCTCAAGTGGCTGATGTTGTAGGTGGGAGAG
AAGTTCCTCTAAATTCAGAACAACGGAAGGAAGGGGAGAAAACTATTGTTGGTGAATTTA
GTGAAGTAAAACATGTTCGGGGAATTCTTTCAATGGGAAGATATAGTGATCCAGATAGTG
CATCATCCTCGTTCTCTATTCTTCTAGGAAATGCACCTCATCTAGATGGTCAGTATGCGG
TTTTTGGAAAGGTAACCAAAGGGGATGATACTCTTAAAAGATTAGAGGAAGTGCCTACTC
GCCAAGAGGGAATTTTTGTGATGCCATTGGAGCGGATCAGGATTCTTTCAACATACTACT
ATGATACTAATGAAAGAGAATCAAACTTGACCTGCGACCATGAAGTGTCAATTTTGAAAA
GGAGATTAGTAGAATCTGCTTATGAAATTGAATATCAGAGAAGAAAGTGCCTTCCATGAG
AAGTTGAACCGTCTTTTTATAGATCTTTTCAAAGATGAGTGTTGGGCTTTTCTAGCATACT
TGATTCTTCCATTTATATTCGAACATAAATTTTGTTTGTTTATTTGATTCCAATTATGACATT
AGTTCCATGTTAAATGCTAGATTTCTTATGGGGTTGGAACATTCCTCGCTGCCTTCTGGT
AATATTAGGTTATGCGTTTTTCATTCTGAAAAAAAAAA
SEQ ID NO:181
CATAACTGTAAAGCCAGAGCAGAGAGAGAGTACAACGATGGCGAGCAAGAGGAGCCTG
AGAACGATGAATGTTTGGCCAACGATACCGCCATTGGTCCTCCTGATACTCTTATGCTTC
TCTTACATGTCAAGTTCAGTGGTTGCTAAGAAGAGTGATGTCTCAGAGCTACAAATTGGA
GTGAAGCATAAACCAAAATCATGTGATATTCAAGCCCATAAAGGTGATAGAATCAAGGTG
CACTATCGGGGATCTCTTACAGATGGGACAGTTTTTGACTCAAGCTTTGAGAGGGGTGA
TCTAATAGAATTTGAACTTGGTAGTGGCCAAGTAATTAAAGGATGGGACCAAGGACTTCT
AGGAATGTGTGTTGGAGAAAAGCGCAAGTTGAGGATACCTTCCAAATTGGGTTATGGAG
CACAAGGCTCCCCTCCCAAGATTCCAGGTGGAGCAACGCTTATATTTGATACAGAACTTG
TTGCTGTTAATGGGAAAGGCATTAGTAACGATGGTGATAGCGATCTGTAAATAAGATTGC
TCCATTTTGCTTTAAAGGGCGGAGTTCATACTTTAGAAAGCGAACTTGAAGCAACTTAATT
TTAACAGGTTACATAATCTATTGTAGCATTCTCTTGAAATGGGAAATTGAGGTTGACTGTG
TACTTCTCCAGTGGACAGGAGAAAGCGATAAAATTCAAACGTTGTTGTCATTATTTCCTTC
GCTGAGTGACATGTGATACAATGAAAGGGAGTCTGAACCGTCTTTGTTGACCTTTATTAT
TACTCGATGTATATTTCTTCCCACTAAAGAGGCAAAATATTCTATTTTAAATTGAAAGACTT
TAAATTAAAAAAAAAA
SEQ ID NO:182
CGAAACCTTCTGAAATTTCAGTTGTCATCGCGCGCTATGTCTGGGGCACCTGCAGAGCG
TCCTATCTCTTACTTTGACATCACCATCGGCGGCAAGCCCATTGGCCGCATCGTCTTCTC
ATTATATGCAGAGCTAGTGCCCAAGACAGCTGAAAACTTTCGTGCATTGTGTACGGGTGA
AAAGGGTATCGGAAAGTCTGGCAAACCCCTTTGCTATGCTGGGTCTGGCTTTCACCGGG
TAATCAAGGGTTTCATGTGTCAAGGTGGTGACTTCACAGCGGGAAACGGCACTGGGGGA
GAATCGATCTACGGCGAGAAATTCGAAGACGAGGCGTTCCCCGTGAAGCACACGAAGC
CTTTCTTGCTGTCCATGGCAAATGCAGGAAAAGACACCAATGGGTCGCAGTTCTTCATCA
CTGTGAGCCAAACCCCCCACCTTGATGACAAGCACGTGGTATTCGGTGAAGTGATCAAA
GGCAAATCAATAGTCCGTGCGATTGAAAACTACCTTACTGCTTCGGGAGATGTACCTACT
TCCCCTATCATCATTTCTGCATGCGGCGTTCTCTCCCCCGATGATCCTTCTCTTGCAGCT
TCTGAGGAAACGATTGGCGACAGCTACGAGGACTACCCTGAAGACGACGATTCAGATGT
GCAGAATCCTGAAGTTGCGCTGGATATCGCACGAAAAATCCGTGAGCTTGGTAACAAGC
TCTTCAAGGAAGGGCAAATAGAGCTCGCGCTCAAGAAATACCTTAAATCGATACGGTATT
TGGATGTGCATCCTGTGTTGCCGGATGACTCTCCACCGGAGCTTAAGGACTCGTATGAC
GCACTTCTTGCGCCCCTTCTATTGAACTCGGCGCTTGCTGCTCTGCGCACGCAACCCGC
GGACGCCCAAACTGCGGTGAAGAACGCCACGCGAGCTTTGGAGCGTCTGGAGCTGAGC
GATGCAGATAAAGCCAAGGCCCTTTATCGCAGGGCGTCAGCGCATGTTATCCTCAAGCA
GGAAGATGAAGCTGAGGAGGACCTTGTTGCTGCTAGCCAGCTTTCACCTGAAGACATGG
CGATCTCCAGTAAGCTCAAGGAAGTGAAAGACGAGAAGAAGAAGAAAAGGGAAAAGGA
GAAGAAGGCATTCAAGAAAATGTTTTCGTCGTGACATACATCGTAATCATTGGAAGTATA
CCAGT
SEQ ID NO:183
AAATGGAGGGAATGAAGTTGAAAGAATACTGCGCAAACAGAGAACAAAAATTGGTGGAC
AATGGCTTCTTCACTCCGCAGTTCTCTTTTTTCGTCATGGGCTTTGGATTCCAAGTCTGTA
TGCTCCCTATTTAACCTCAATCCAGGGAAGATGGGGCTGCCTTCCATTTCTACGCCACTG
AATTGGAGGACTTGCTGCTGTTCACACAGTTCTGAATTGTTGGAGCTAAATGAGGGGCT
CCAATCTTCTAGAAGGAAAACTGTGATGGGGTTATCCACCGTAATAGCTTTAAGCCTTGT
TTATTGTGATGAAGTAGGAGCTGTATCCACAAGCAAAAGAGCTCTAAGATCACAGAAGGT
TCCAGAGGACGAATACACAACTCTTCCAAATGGGCTCAAATACTATGATTTGAAAGTTGG
AAGCGGGACGGAAGCTGTAAAAGGTTCACGAGTTGCGGTACACTATGTAGCCAAATGGA
AAGGTATCACTTTCATGACTAGCAGGCAAGGCATGGGTATTACTGGTGGCACTCCTTATG
GATTTGATGTGGGTGCTTCTGAAAGAGGAGCTGTCTTAAAGGGATTGGACTTGGGAGTT
CAAGGGATGCGAGTTGGAGGCCAGCGCATACTTATAGTACCCCCGGAGCTTGCATATG
GTAATACAGGAATTCAAAGATCCCACCAAATGCGACTCTTGAGTTTGATGTGGAGTTGA
TTAGCATCAAGCAAAGTCCATTTGGGTCAAGTGTAAAAATCGTTGAAGGGTAGTTGTGAA
ATTGATTAAATTGCTAATAAATATTGTTGTAATTCATTATCATCATGTTTCTTATAATTTAAG
AGCTATGAAACAACTACCTTTTGGAATGGTTTTGCTTTTTATAATTTAAGAGCTATGAAAC
AACTACCTTTTGGAATGGTTTTGTTTTTAGCATCCCAATTATATTTACAATGCTTTGAAAAA
TCAATAAAATAAGTTTCTATTTAAAAAAAAAAA
SEQ ID NO:184
GCGAAGCAAGCAAACCAAATACCAGCAGCTCGAGGGTTTGGTTGTCGTTCCTCTTTCCC
TTTGCATTTTCAAACGATCTGGGCGCCTCCTTAACAATAACATATCAGTTTCTGCTTTCCC
TATACATTTTCGAAGTTCGGGAAATCTGTATGGCGCAAATCTGAACCTTGGCCGTGGAAA
ATCCACAGAGCTCCGGCGGAAAACAGAGGAGGCGGCCTTGAAACGCTTATAAACAGGC
TTGGATAGCCATATTGATCGGTGTTCAGGCAAGGAGGAATTCCAGATCCTGCACTTCTTA
GAAGGTTTCCGTCTTTTGTTACATTTTAATTTTCACCAGAAGGATTAATCACCTGCGAGGT
GCATTTCACCCTCTTTTGGTTGGAAACATTTATGGGCTAAATTGTGGCGTGTGAACGTTT
CATGGGAGCCATTGAGGATGAAGAACCACCCCTGAAGCGTTTAAAGGTGTCCTCACCCG
GCTTGAGAAGAGGTTTGGAGGAGGAGGCCCCTTCATTGTCTGTGGGGTCAGTGAGCAT
CTTAATGGCCAAATCATTGTCTTTGGAGGAGGGTGAGACAGTAGGTTCCAAAGGACTCA
TAAGGAGGGTGGAATTCGTAAGGATAATAACACAAGCTCTATATTCTTTGGGTTACCAGA
AAGCTGGTGCGCTTTTGGAAGAGGAGTCAGGAATACTTCTCCAGTCTTCAAATGTAGCTT
TATTTAGAAAGCAGATACTCGATGGAAAGTGGGATGGAAAGTGTTGTTACACTGCGGGGT
ATTGATCAAGTGGAGGTGGAAGGAAACACACTGAAGGCTGCATCCTTTCTGATCTTGCA
GCAGAAGTTCTTTGAACTATTGGACAAAGGTAACATCCCTGAAGCTATGAAAACACTTAG
ATTAGAAATTTCTCCAATGCAATTGAACACAAAAAGAGTACATGAACTTGCAAGCTGCATT
GTGTTTCCTTCACGGTGCGAGGAGCTTGGATATTCAAAGCAGGGGAATCCAAAGTCCAG
TCAACGGATGAAAGTTTTGCAAGAAATTCAACAATTATTACCACCTTCTATAATGATCCCA
GAAAAGAGGCTGGAACGTTTAGTGGAGCAGGCCCTTAATGTGCAGCGAGAAGCATGCAT
TTTTCACAACTCCTTGGATCCGGCCCTTTCTCTATATACTGATCATCAATGTGGACGAGAT
CAAATTCCTACAACGACATTACAGGTGTTAGAGTCACATAAAAATGAAGTGTGGTTTCTA
CAGTTTTCAAACAATGGGAAGTATTTGGCTTCTGCATCAAAAGATTGTTCTGCAATAATTT
GGGAGATAACTGAAGGGGATTCTTTCTCCATGAAACATAGATTAAGTGCACACCAGAAAC
CAGTTTCATTTGTTGCTTGGAGCCCTGATGATAAACTGCTGCTTACATGCGGTATTGAGG
AGGTTGTTAAGCTTTGGAATGTTGAGACTGGTGAATGTAAACTGACCTATGATAAAGCCA
ACAGTGGATTCACTTCCTGTGGTTGGTTTCCAGATGGTGAACGGTTTATCTCTGGCGGA
GTTGATAAATGCATCTACATATGGGATCTTGAGGGAAAGGAGTTAGATTCATGGAAAGGA
CAAGGAATGCCCAAAATATCTGATCTCGCAGTCACATCTGATGGTAAAGAAATAATCAGC
ATATGTGGGGACAATGCTATTGTGATGTACAACTTGGATACAAAAACTGAAAGGTTAATT
GAGGAGGAAAGTGGAATAACTTCCCTATGTGTATCAAAGGACAGTAGATTTCTCCTCCTA
AACCTTGCAAATCAAGAGATACACCTCTGGGATATTGGAGCTCGTTCAAAGTTATTGTTA
AAGTATAAAGGTCACAGACAAGGTCGCTATGTGATAAGGTCCTGTTTTGGTGGGTCTGAT
CTTGCATTCGTTGTTAGTGGTAGTGAAGACTCACAGGTATACATTTGGCACCGAGGTAAT
GGAGAACTTTTAGCTGTTTTGCCTGGTCACTCTGGTACAGTAAACTGTGTGAGCTGGAAC
CCTGTGAACCCACATGTGTTTGCATCTGCTAGTGATGACTATACTATTCGTATATGGGGT
GTAAACAGAAACACTTTCAGGAGTAAGAATGCTAGTTCTAGTAATGGCGTCGTTCACCTT
GCAAATGGAGGGCCATAGCCAAGAGAGATAAAAGACCAATTTTGAGCAGCAGTTTAAGC
ATTGTAATAACGCTCATTATTCATTTCTGCACTTGTAAATTCTGATCTTACTTCATTTATTT
CTATTCCAAAGACCAAGCTTGTAAATTATGCTGGTTCCATATGGGGGTTAATCAGTATCC
TCGTTATTTGTGACACCAAATTATCAAATCTCACAAAATTTGTGAATGGGACTAAAAAAAA
AAAAAAAAAAAAAAAAAAA
SEQ ID NO:185
GGCATGGTGCAGATCCTTCGCCGTCAAACACATGTACATGTAATTTTCAAGCTGGACTGA
TATTCCCCTCCTCATTTTCCATTAAGAAAACGAATGGAAATGCAGTAGAAATTGTTCAGAG
ACTCGTAGACAGAGGTTTAAGGAACATAGATAGATTTAGAAATGCCGGGAACAACTGCG
GGTGCAGGTATCGAACCCATCGAGCCTCAGTCCCTGAAGAAGCTCAGTCTCAAGTCTCT
TAAGCGCTCCTTCGATCTCTTTGCTTCCCTTCATGGCGAACCTCAACCTCCCGATCAACG
CAGTCAACGAATAAGAATTGCCTGCAAGGTGCGAGCTGAATATGAAGTCGTAAAGAACTT
GCCAACTTTGCCACAACGGGAAGTTGGCAGTTCAGTATCAAACTCAAATGTTGGAGAAA
CTCATTCATCTCTGACAACTAATCAAGCTCAAGGATTTCCAACTGACACATCAGGAGATTT
GTCGAAAGATGAAGGGAAAGAGATTACTAGCATTGCTGTTCATTTGCAGCCACAAACAG
GATTGATTGATGGAAAGGCAGGAGCAATAGCTGGAACAAGTACTGCTATTTCTTCTGTGG
GTTCTTCAGATCGGTACCAGCCAAGTGCGGCCATCATGAAAAGGCTACCCAGTAAATGG
CCACGTCCTATATGGCATCCTCCTTGGAAGAACTATCGGGTTATCAGTGGGCATTTGGG
CTGGGTGAGATCTGTTGCTTTTGATCCTGGTAATGAGTGGTTTTGTACTGGCTCTGCTGA
CCGGACAATAAAGATTTGGGAAGTAGCCACAGGAAAACTCAAGCTCACATTAACAGGGC
ATATTGAGCAGATACGAGGTTTGGCTGTAAGTTCTCGGCATCCATATTTGTTTTCTGCTG
GTGATGATAAGCAGGTCAAATGCTGGGACCTTGAGTATAACAAGGCTATTCGCTCTTACC
ATGGACATCTAAGTGGAGTTTACTGTTTGGCACTTCATCCGACATTGGATATTCTTTGTAC
TGGTGGTCGTGATTCTGTTTGCCGAGTATGGGATATTCGCACCAAAGCTCAAATATTTGC
ACTCTCTGGCCATGAGAACACTGTGTGTTCAGTTTTTACCCAAGCAATAGATCCTCAAGT
GGTGACCGGGTCTCATGATACTACAATCAAGTTATGGGACCTTGCTGCAGGAAAAACAA
TGTCTACTCTTACGTATCACAAAAAATCTGTTCGGGCAATTGCAAAACATCCTTTTGAGCA
CACTTTTGCGTCTGCATCTGCTGATAATATAAAAAAATTCAAACTTCCCAAGGGAGAGTTC
CTGCACAATATGCTATCACAGCAGAAGACAATTGTTAATGCCATGGCCATCAACGAGGAC
AATGTTCTTGTATCTGCAGGTGACAATGGAAGCTTATGGTTCTGGGATTGGAAGAGTCGT
CATAATTTTCAGCAGGCTCAAACAATTGTACAGCCTGGATCTCTGGACAGTGAAGCTGGA
ATATATGCACTCCAATATGATATAACTGGCTCTAGGCTTGTTTCTTGTGAGGCTGACAAAA
CAATTAAAATGTGGAAAGAAGACGAGACAGCCACTCCAGAGAGTCACCCAATAAATTTTA
AAGCGCCTAAAGATATCAGGCGTTTCTAAATCTTTAGTTAATTAATTGTATTTCTATTAAAA
TTGGGCATTTTAGTTGTATATGATCTATAGCCTTTTATAGAGTGCTAAATGAAGAGGTAAT
CTTTGTATTCAGCTTTGTTGTGAACTATCAATAGACGGGGATGGTCCTTTTTAGCTGCTC
CTTAAGCAGCTCAAATGATCTGTAAGAGGTAGAGACGTAGTTGTTGCTGTATTCAAGTAT
GATACATCAGCACAGTAAAGCATCTTCCTTACAAAGCTATTTTATTCAAAAAAAAAA
SEQ ID NO:186
GCATAAGTGAGTTCATAGTCTTTTGCTCATAAAATCTATGAACAAAAACCCTTGGAACACT
TACATGGAGGGCCTTATATATTCATTTCTTTACTCATCATCGCTTCTTTTTCGTCTTTCTTC
ATCGGTGCCAGTGCTCGCGGCTTGTCTACACTGAAGTCCGATAAGGAACCTAATACATT
GCAAGCAAGATGAGGCCGATTTTGATGAAGGGCCACGAGAGGCCTCTGACGTTTCTCAA
GTATAACAGGGACGGGGACTTGCTGTTTTCTTGTGCGAAGGATCACACGCCCACCGTCT
GGTATGGACACAACGGGGAGCGCCTTGGGACCTATCGAGGACACAACGGTGCTGTCTG
GTGCTGTGATGTCTCAAGGGACTCTACACGTTTAATAACCAGCAGTGCAGATCAGACTG
CCAAGTTATGGAATGTGGAGACAGGAGCTCAACTATTTTCTTTTAACTTTGAATCCCCAG
CTAGAGCGGTGGACCTTGCCATAGGGGATAAGCTTGTTGTAATAACCACAGATCCATTTA
TGGAGTTGCCTTCAGCAATTCATATCAAACGTATTGAGAAGGATCTGTCAAAGCAGACTG
CTGATTCTGTACTTACAATAACCGGGATCAAAGGAAGGATAAACAGAGCTGTTTGGGGC
CCTCTAAATAGTACAATAATCAGTGGTGGAGAAGATTCTGTTGTTCGTATTTGGGACTCA
GAGACAGGAAAGCTGCTCAGGGAAAGTGACAAGGAAACAGGCCATCAGAAACCTATAAC
ATCTTTGTGCAAGTCTGCAGATGGATCCCACTTCCTGACAGGGTCCCTAGATAAATCAGC
CAGGCTATGGGATATCAGGACTTTAACTCTTATTAAGACGTATGTCACAGAGCGCCCTGT
AAATGCAGTTGCAATTTCTCCACTACTTGACCATGTTGTGATAGGTGGAGGTCAGGAAGC
ATCTCATGTGACTACTACTGATCGTCGTGCAGGAAAATTTGAAGCAAAATTCTTTCATAAG
ATTCTTGAAGAAGAAATTGGCGGGGTTAAGGGACACTTTGGACCTATAAATTCACTGGCA
TTTAATCCTGATGGAAGGAGCTTTGCGAGTGGTGGAGAGGACGGTTATGTAAGATTGCA
TCATTTTGATCCTGATTACTTTCACATCAAGATGTAATTTGGTACTTTGGAAAGAAGGCTT
GGTCTGTAGCGTTTCTCATTACATGATGGATTGCCAGGAATTCACAAAAAATATCGCACA
ATTGGTGTCTTGATTATGTAAGATTCTCGAGAAGTAGGAAATTTTAAGAGGTTTTAACAAA
TTTTGGGGAGGCTTGATTGATTTTTCAGACAGAATTGTTTGTATCTTGTTGGGAATTATTA
CTATTTATATACTTCAGCTGAGCTGGGTCGTGGTAAAAAAAAAAAAAAAGGGAAAGCAAG
GTCCTACAACACCACTGCCCGACTTGGTCACCATTCATACTCCGAAAGAAGAGGAGGAA
TTCGCTGCCAAGCCAGTGCTTGGCAAGGAAACTGAGATACTTGTCTAGGTCACTTTTAGT
TCATTACATACAACATTTATATTTAAGCTTCTAATATATTCTTGACCCATCCACATGAATTC
AATTCCGGTCATATGTAGACGACTATAATGTTGTTTGTGTCCTATAACTATAGTGTTGGAT
CAACGTCGTTCAATTTTTGACTATTTTTTAACTTCGATATGATTGTTGACGTGTTAAGGGA
GGCTCTTTAGTTCTCAAGGTGTTAAAGCTTGGGGAATTGTAGTTTCTCAAGGATGTTAGT
TTATTTGGCAACGACGCTACTTGTTCATATTTGAAATTAATTAAAATTTCAAGCGTGGCAT
TTAATTTGCCAGTTTTCTCCTTTTAAAAAAAAAA
SEQ ID NO:187
GAGGATACCGAGTTGATTACTTGCAATATTTCTTGTTTTGGGCTTCCGCGGTTCTTCAGA
ATTGAAGAGCGCTTAGGGTTTGTTATTTATTGCAGAAGTCCGATAAGGAACATAGTACAT
TGCAAGCAAGATGAGGCCGATTTTGATGAAGGGCCACGAGAGGCCTCTGACGTTTCTCA
AGTATAACAGGGACGGGGACTTGCTCTTTTCTTGTGCGAAGGATCACACGCCCACCGTC
TGGTATGGACACAACGGTGAGCGCCTTGGGACCTATCGAGGACACAACGGTGCTGTCT
GGTGCTGTGATGTCTCAAGGGACTCTACACGTTTAATAACCAGCAGTGCAGATCAGACT
GCCAAGTTATGGAATGTGGAGACAGGAAATCAACTATTTTCTTTTAACTTTGAATCCCCA
GCTAGAGCGGTGGACCTTGCCATAGGGGATAAGCTTGTTGTAATAACCACAGATCCATT
TATGGAGTTGCCTTCAGCAATTCATATCAAACGTATTGAGAAGGATCTGTCAAAGCAGAC
TGCTGATTCTGTACTTACAATAACCGGGATCAAAGGAAGGATAAACAGAGCTGTTTGGG
GCCCTCTAAATAGTACAATAATCAGTGGTGGAGAAGATTCTGTTGTTCGTATTTGGGACT
CAGAGACAGGAAAGCTGCTCAGGGAAAGTGACAAGGAAACAGGCCATCAGAAAGCTATA
ACATCTTTGTGCAAGTCTGCAGATGGATCCCACTTCCTGACAGGGTCCTTAGATAAATCA
GCCAGGCTATGGGATATCAGGACTTTAACTCTTATTAAGACGTATGTCACAGAGCGCCCT
GTAAATGCAGTTGCAATTTCTCCACTACTTGACCATGTTGTGATAGGTGGAGGTCAGGAA
GCATCTCATGTGACTACTACTGATCGTCGTGCAGGAAAATTTGAAGCAAAATTCTTTCATA
AGATTCTTGAAGAAGAAATTGGCGGTGTTAAAGGACACTTTGGACCTATAAATTCACTGG
CATTTAATCCTGATGGAAGGAGCTTTGCGAGTGGTGGAGAGGACGGTTATGTAAGATTG
CATCATTTTGATCCTGATTACTTTCACATCAAGATGTAATTTGGTACTTTGGAAAGAAGGC
TTGGTCTGTAGCATTTCTCATTACATGATGGATTGCCAGGAATTCACAAAAAATCGCACA
ATTGGTGTCTTGATTATGTAAGATTCTCGAGAAGTAGGAAATTTTAAGAGGTTTTAACAAA
TTTTGGGGAGGCTTGATTGATTTTTCAGACAGAATTGTTTGTATCTTGTTGGGAATTATTA
CTATTTATATACTTCAGCTGAGCTGGGTCGTGGTACAAACATTTTACACCCTATAACAAAA
TATAGTGTCATAAGTTTACACCAGGTAACAACTCTATATCCAAAGCTGCAGTTAAATTGTC
CCTGCCCTTCTGTGGAAGGCTTAATGAACATGTTTAAGGTTCAAAAAAAAAA
SEQ ID NO:188
CTGCGTTATGTCATCGTCGGTCATAAACCCCAGGGTTTAAGTTGTCTTTATCACCCTCTG
CCTGGCTCGTTAGTGTAGTGAACACCATTATTTCACTGCTACCATAATATACAGAAAATCT
CCTAGAATATACAGAAAAGACCTCGAAATGGCGGAGAATAACGTCGGCGACTTCATACC
ACTTGATCGACAAGAGTACCCTTCAAAACCCGCTCCAGGCGCCGTTGACTCGAGCTTCT
GGAAATCCTTCAAGAAGAAGGAAGTTTCCCGGCAGATTGCAGGCGTTACTTGCATCAATT
TTTGCCCTGAACCTCCTCACGATTTCGCAGTTACATCGTCTACCCGGGTACATATCTATG
ACGGTAAATCTTGTGAGCTCAAGAAGACAATCACGAAGTTTAAAGATGTGGCATATTCAG
GAGTATTTCGTTCTGATGGACAGATCATTGCTGCTGGGGGTGAGACAGGGGTCATTCAG
GTCTTCAATGCAAAGTCTCAAATGGTTCTCCGGCAACTGAAGGGGCATGGCAGGCCTGT
TCGAGTTGTAAGGTATTCTCCACAGGACAAACTTCATCTACTTTCAGGTGGTGATGATAG
CATGGTCAAGTGGTGGGATATCACAACCCAGGAAGAGTTACTAAATCTAGAAGGGCATA
AGGACTATGTTCGCTGTGGTGCTGCAAGTCCATCGAGTGTTAATTTATGGGCTACAGGTT
CATATGATCACACAGTACGGCTGTGGGACTTGCGCAACTCAAAGACTGTTTTGCAATTAA
AGCATGGGAAACCACTGGAAGATGTTTTATTTTTTCCATCTGGTGGATTGCTGGCCACCG
CAGGAGGTAACGTAGTGAAGGTATGGGACATTCTTGGAGGAGGACGACCCATACACACA
ATGGAAACCCATCAAAAGACTGTCATGGCTATGTGTATTTCAAAAGTACCCAGGTCGGGC
CAGGCTTTGGGTGATGCACCTTCTCGTCTTGTTACTGCATCACTAGATGGATATATGAAA
GTTTTTGATCTTGATCATTTTAAGGTCACCCATTCTGCAAGGTATCCAGCTCCAATTCTAT
CAATGGGCATATCCTCTTTGTGTAGGACAATGGCAGTAGGAACCTCATCTGGGCTTCTTT
TCATCCGCCAGAGGAAGGGGCAAATAGAAGATAAGATACATTCAGATTCATCACGTCTTC
AAGTAAATCCAGTTAATGATGAAAAGGATTCTGCTGTTTTGAAACCCAATCAATATCGTTA
TTATTTACGTGGTCGCAGTGAAAAACCTTCTGAAGGAGATTATGTTGTGAAAAGAATGGC
AAAAGTGTATTTCCAAGAGTATGATAAAGATTTACGTCACTTCAATCATTCAAAAGCTTTG
GTTTCTGCTCTAAAAGCTGCAGACTCCAAGGGTACAGTAGCTGTCATAGAAGAACTAGTT
GCTAGAAAGAGGCTGATACAAACTTTGTCAATTCTCAATTTGGACGAGCTCGAGTTGCTA
ATCAATTTTTTATCCAGGTTTATTCTTGTACCAAAATATTCAAGATTTCTCATTTCCCTTAC
AGATAGGGTTCTGGATGCACGTGCAGTAGATCTTGGAAAGTCAGAAAATTTAAAAAAACA
GATTGCAGACCTGAAGGGAATAGTCGTTCAGGAGCTTCGAGTGCAGCAGTCCATGCAAG
AATTGCAAGGGATTATTGAGCCACTAATTCGAGCTTCTGCTCGATAAATGATCTAAATGG
AGAGTTTTATTCATTACATGAAAGAGTATGTCACCTTTCGTGCTCCATCTATTGATTTTTAT
TTTTAAAGCCTAGAGAAAAAAGTATTTATCACAAAATGCTGAAGCATTTTTGGTTCAAGGT
TTCGTGTTCCCTTGTAATATTTCTGTTTCAGTGGGGCTAGCTGGAATGACATGTTCATCTA
TGGAAGAAGAAACAGAGGAATTTATGAGGGATCGAGAAAGAGAATTTATTTCATTATTTC
TCATCTCTTGTTTTTGATGTGCCGCTCATTCAGTTTGAGCCCTTGTAACATAATGTTCGGT
CAATGGCATGGCCATTGAAACAATTTTGGTCATGTGACGGGCATACAGAAAATTTGTATC
TAAGTATCTATACTTATCTCCAAGTATAGAAGCTTAGTGAGTCTTATTAGTTCTTAAAATAC
TAATTTTTCTGCGATATTTTGCATAAATATCTAAATTTATAAGGATTTAAAAAAAAAA
SEQ ID NO:189
GTTCCCCAATTCCCATCAACGTTTTATAAGAATTGGTACGACTTTGCGGCCTGCGGAGAC
AAGGTGGGTCTCGCATCATGGCGTTGGAAGAAAGTTCACGAGGCAGAGCAAACAGAGA
AGAGTTCGGAGAGCATATTCATTTCTTCGATTGTTTATTCGTCTAAGACGAACAGTCCGA
GGACATGTCCGGCGGATGACTTGGTGAATTTGTTATCAAAATTTTCTTAATACTCCGAAA
GGTTCGGTCTCAGCATTTCCCTTTAACATCTTTTATATTTTATTTCAGTAGCAGACACGAA
CCAAATCAAAATACCCTTCATACGGCTTAATTCTCAGTCCCGTAATTTGGCGGTAATTTGT
TTTCCCTTTTCTGCTCGATGGTTTAATGTCGATGAATGAAGCAGTTGTGTTTCTGGGCTG
CTTTCGGGCAGTTCTCATAGGGCGGTTTTTAAGATATCCAGGGTTAGAAAACTGGTTTTC
GACAGGAAATCTGCTCACGATTTCGACCCTTAAATCGAAAGTGTACCTGGTTTCATCCAG
TTTCTTTTTTCAAAAATAGAAATTCCGTATTATTCTCCAGGTGAAGCTTACAATATCAGGTT
TCTTCTCGTCTGGACTATATCACGGCAACACTGAAATGTTTGTTAGATGGGAAGCCGAGA
AAAGTTTCGGCTATTTGATAGTTAGTTTACATTTAAAGGCAGCGGAGCAGTTTTTTTTCAC
GAAGGGTTTGATGTTTAAAGGGAGACAAGCAGTTAGTTCGAAAGGATTGGGTAACAGTA
ACAGTTTGTTTGGTAGACAGGTTCAGGCTTTATTTCGTATCGACCCAGATAGCTTCCGGG
AATTGAGAATTTGAAGAATTTAAACAGACAATTATTGCTGTCAAATTACAGGTTTCTGGCA
TCAGGGTTTCCTGCAATTAGGGTTTACTGATATCAGGTAAAGCCATGGACGTCGAAACCT
CAGGGAAACCTACAGGAAACAAAAGAACCTACACACGTCTGCCAAGGCAAGTATGCGTG
TTTTGGCAGGAAGGCAGATGCACTAGAGAATCATGCAATTTTCTGCATGTCGACGAACCT
GGATCAGTCAAAAGGGGGGGAGCAACTAATGGGTTTGCTCCAAAGAGATCATATAACGG
ATCGGATGAGAGGGACACGCTGGCCGCAGGACCTCCCGGAGGATCGAGAAGGAATATT
TCTGCTAGATGGGGAAGAGGACGAGGTGGTATATTCATATCCGACGAGAGGCAGAAGAT
TCGCAATAAAGTATGCAATTACTGGCTTGCAGGCAATTGCCAGCGTGGAGAGGAATGCA
AATATTTACACTCCTTTGTTATGGGTTCGGACGTGAAGTTCTTGACGCAGCTGTCCGGCC
ACGTTAAGGCTATTCGAGGGATTGCTTTTCCTTCTGATTCTGGTAAACTCTATTCTGGAG
GCCAAGACAAGAAGGTTATAGTGTGGGACTGCCAAACAGGGCAGGGTACAGACATCCC
ATTAAATGACGAGGTTGGATGCCTCATGAGTGAAGGCCCATGGATTTTTGTTGGCCTCC
CAAATGCGGTCAAGGCTTGGAATATTTTGACCTCAACAGAACTAAGTTTAGTTGGTCCCA
GGGGACAAGTACATGCTCTGGCTGTTGGTAATGGAATGCTTTTTGCTGGAACACATGAT
GGCAGTATATTGGCTTGGAAGTTTAGTCCTGCTTCCAATACCTTTGAGCCTGCAGCTTCC
CTTGTAGGGCATACCCAAGCTGTTGTTTCATTAGTCTCAGGAGCAGACAGACTTTACTCT
GGTTCCATGGACAAGACAATAAGAGTTTGGGATTTGGGAACTTTTCAATGTCTACAGACC
CTGAGGGATCACACATCAGTTGTAATGTCTCTTTTATGCTGGGATCAGTTTCTGTTGTCAT
GCTCTTTAGACAATACAGTGAAGGTCTGGGTTGCTACATCAAGTGGAGCCCTTGAAGTG
ACATATACTCATAATGAAGAACATGGTGTTCTTGCGCTTTGTGGAATGAACGATGAACAA
GCCAAGCCTGTCTTGCTTTGTTCTTGCAATGATAACACTGTGCGTCTTTATGACCTGCCA
TCGTTCAGCGAACGAGGTAGAATATTCTCAAGAAACGAGGTACGAACATTCCAGATTGCA
CCTGGTGGATTATTTTTCACCGGTGATGCGACAGGTGAACTTAAAGTTTGGAACTGGGC
AACCCAGAAATCCTGATGCAACTTAAAAAATTCCTGTTGCAATCAGGGCACTCTGCTTGA
AATGCAAATCTCTCTGGGAACTCCTTCACATGGCAGAAAGGTGCAGAGTCTGGAGAAGT
TGAAATTTCACGTCCTGTATACTCACTCAAGCAACTTTAGGATGAAGAGCTAAAGTATATC
AAAGATATATAAGTGGGTAATATTAACACTGAATAGCTGCAAGTGATGCATGCTTGATTTC
CGTCAGGAAAATCAGAATTTTAAA
SEQ ID NO:190
GTCCTAATGACTGATACTGGTTTCTGGTTTACCGTCATGTTTCCCGGACCCAAGCCGCC
GGTTTTCAGAACGGCTGATGCTTTCTGAACCCAGAGCGGAGAGAGCGAGACCAGGGTC
TTTGGATTTCGATCTATGGTCGAGTCCTTCCCTTATGCGAAGGTTATGTTCCCTAAATCTG
CCAAGAAAATAATTATTTGGCATATGACATAGAGCTCGGTTTACTGATTTGCATACCGAAC
ATCTCTGGTTTTGCTTATTTTGCTGGTTCAAATCAGGGTTTCGATGCATTCTCCATAACAT
TAACACTCCTGCTTCTGTAAATTATCGAGCTATGTCGGTGCAAGAGCTTAGGGAACGTCA
TGCAGCGGCAACGGCAAAGGTTAACGCTCTCAGAGAGCGGATAAAGGCTAAGCGCCTC
CAGCTGTTGGATACAGACGTTGCAACGTATGCCAGCAGTAATGGAAGGACCCCAATCAG
CTTCAGCTTCACGGATCTGGTTTGCTGTAGAACACTTCAAGGGCACACAGGAAAAGTTTA
TTCATTGGACTGGACATCTGAGAAAAATCGCATTGTGAGTGCATCTCAAGATGGACGGTT
GATAGTTTGGAACGCTTTGACAAGCCAGAAAACACATGCAATTAAACTCCCCTGTGCCTG
GGTGATGACATGTGCATTTTCTCCTAGTGGGCAGGCAGTTGCATGTGGTGGGCTTGACA
GTGTCTGCTCCATTTTCCAGCTTAACAACCAGTTAGACAGAGATGGACATCTACCTGTTT
CCAGAATTCTTAGTGGTTCATAGGAGCTATGTCTCTTCTTGTCAATATGTTCCAGATGGGG
ATACACATGTAATAACTGGCTCTGGTGATCGGACATGCATTCAGTGGGACGTAACAACA
GGACAAAGGATTGCAATCTTTGGAGGTGAATTTCCATTGGGTCACACTGCTGATGTTATG
AGTGTATCCATCAGTGCAGCAAACCCCAAGGAATTTGTTTCAGGCTCTTGTGATACAACA
ACCCGTTTATGGGATACGCGTATTGCTAGCAGGGCCATACGAACATTTCATGGACATGA
GGCAGATGTGAATACTGTGAAGTTTTTTCCTGATGGACTAAGATTTGGATCTGGCTCAGA
TGACGGTACCTGTAGGTTATTTGACATACGAACAGGGCACCAACTGCAAGTCTATCGTCA
ACCTCCCAGGGAAAATCAGTCCCCAACTGTGACTGCAATTGCATTCTCATTTTCTGGTAG
GCTTCTATTTGCAGGATATTCCAACGGGGACTGCTTTGTGTGGGACACAATATTGGAAAA
GGTAGTTCTAAACTTGGGAGAGTTACAAAATACACATAATGGCCGGATAAGCTGTCTTGG
ATTGTCAGCCGATGGGAGTGCTTTATGTACGGGGAGCTGGGACAAGAATCTAAAGATTT
GGGCATTTGGAGGCCACCGAAAAATAGTCTGATTGGTGAGAGATTATAAAAACGAAATTC
ACTTGAATGATTAGCCTTGCGAAAATGCACTCTTTATAAAGTGGGATGAGGTATGTGTTT
CCTTCCTATTGGCTAACCTGAATTCCTCCTGCAGCCCCAATTATTAAATGCTATCCTGCAA
CAGTCATGATTCGATGGGCCTAAGCTGTATTGATATTCTCTTGATCGTTGGAGAAGGCAG
ATCATTTGTAACTTTGTAATATGTATTGATATTTTATTTGTGTTTTCTCATTGCATTTGAGCT
GTGCTACACAAAAATGATCCTTGGCTCTATGATTCCCAATTCTCAGTTCAAGTACATGTGA
AGGATCTGGAGTTTACAGTCACATTTCCATATTCAATTATACATGAAACCTTGTGGATTTG
AATGAAAAACAATGTAATTGGTTTAGTATGTTACAAGTTTATGATTACAAGGTTTAAAAAAA
AAA
SEQ ID NO:191
CTGTCGACCCCGGTCTGGGATGATTTCTTAAACCCTCTCGACGCCCTTTCTTACGCTTAA
AACCCTTTCTTCTTTTGTTTTCATCTTCACAACCACTGGACAGAGATGTTCTGACCGCGG
ATTCACTGAGCTATTGCTGAACCAAAGATTTTTGCTTCGCGGCTACCCAGGGTTGTGTGT
CCAAATTGATATTTTCCACCACGTTTCTGGGTTTTTTTGCCCTGTGAAATAATGAAGGTGA
AGATAATATCACGTTCAACAGATGAATTTACAAGGGAACGGAGTAACGATCTTCAAAGGG
TTTTTAGAAACTTTGATCCTAACCTACATACTCAGGCAAGGGCTCAAGAATATGTCCGTG
CTCTCAATGCAGCAAAATTAGACAAGATTTTTGCAAAACCCTTTCTGGCAGCAATGAGTG
GACACATTGATGGCATTTCTGCCATGGCAAAGAGCCCACGTCATTTGAAAAGCATATTTT
CGGGTTCCGTGGATGGAGATATCCGGCTCTGGGACATTGCGGCCAGGCGCACAGTTCA
ACAATTTCCAGGTCATAGAGGAGCGGTGCGTGGTTTGACTGTGTCTACAGAAGGAGGGC
GGCTTATTTCTTGTGGAGATGATTGCACAGTTAGACTATGGGATATCCCAGTTGCTGGCA
TTGGGGAATCAAGTTATGGCTCAGAGAATGTTCAAAAGCCACTAGCAACCTATGTGGGA
AAAAATTCTTTCAGGGCTGTAGATTACCAGTGGGATAGTAATGTGTTTGCAACTGGTGGG
GCTCAAGTTGACATTTGGGATCATGACAGGTCAGAACCGACAAACAGTTTTGCTTGGGG
GTCTGATACTGTCATTTCTGTACGGTTTAATCCTGCAGAGAAAGATATATTTGCGACAACT
GCCAGTGACCGCAGTATTGTTCTATATGACCTTCGAATGGCATCACCTTTGAACAAGTTA
ATTATGCAGACAAGGAATAATGCCATTGCATGGAATCCAAGAGAACCTATGAATTTCACT
GCAGCTAATGAAGATTGCAACTGCTATAGCTATGATATGAGAAGAATGAATATTTCAACAT
GTGTGCACCAAGACCATGTTTCCGCTGTGATGGATATTGATTATTCTCCATCTGGTCGGG
AGTTTGTGACGGGATCTTATGATAGAACAGTGAGAATATTCCCCTATAATGCAGGTCACA
GCAGAGAAATCTACCATACGAAACGAATGCAAAGGGTGTTTTGTGTGAAATTTAGTGGTG
ATGCGACTTATGTTGTCTCTGGAAGTGATGATGCTAATATTCGTCTGTGGAAAGCTAAAG
CATCAGAGCAGTTAGGAGTGCTTCTCCCGAGAGAGCGCAAAAGGCACGAATATCTAGAT
GCTGTAAAGGAACGTTTTAAGCATCTTCCAGAAATTAAACGCATTGAGAGACATCGACAC
CTACCAAAGCCTATATACAAAGCAGCGCTTCTACGGCATACTGTAAATGCCGCAGCAAA
GAGGAAAGAGGAGCGTAAGAGAGCGCACAGTGCTCCTGGATCTGTGGTCACTAATCCT
CTTCGGAAAAAGCGAATTGTAGCTCAATTGGAGTGAATATCTAATTCTAGCACGCCAATG
CAGGTTATACTATGTATTGGAGTGAAAATTTAATTATAGTATATGCTAATTCAGGTTATACT
ATGTATTCAAAATATGTTATTGGGCAATCGTTATTGATTTTACCTATCGCTATCTCACTGTC
CGCCAATTTAGTGTAATTCTATGTACCAAAAAAAAAA
SEQ ID NO:192
TCTAGGTGGGGTGTTAGGGGAATTGCTCACCGGATTTTGATTTTCTGGTCTTTTTATGTG
CGTGGCTCGTGTAGTTTGATACTGAACATTCAGGAATTGAAGCTTCTTCACAAGGATTCG
GATTTTGATTTGTTTCAGAAGGTGGGGTACAAGATTCAGTAGCGAAGTTCAGTGTTTTGC
CGAAGGGCTCTTTCAAATTAAGTAAGATGGATCACTACTACCAAGATGACTTTGATTATTT
GGTGGATGACGAGATGGTTGATTTTGCTGATGACGTCGAAGATGATGTACGCACCCGGC
GCAGGAGTGATATAGACTCAGATTCTGAAAATGATTTCGATTTAAACAATAAGTCACCTG
ACACAACAGCTCTTCAAGCCAAGAGGGAAAAGATATTCAGGGTATTCCATGGAATCGAT
TGAACTTTACTAGGGAAAAGTATAGGGAAACTCGTTTACAGCAGTATAAGAACTACGAGA
ACCTTCCACGGCCGCGACGCAGTCGCAATCTGGACAAGGAATGTACTAATTTTGAAAGG
GGATCCAGTTTTTATGATTTTCGCCACAATACGAGATCAGTCAAGGCCACTATTGTCCAC
TTTCAGTTGAGGAATTTAGTGTGGGCAACATCAAAGCATAATGTTTACCTCATGCAGAATT
ACTCTATCATGCACTGGTCTTCACTAAAACAGAAAGGGGAAGAAGTTCTTAATGTTGCTG
GTCCGATCGTTCCATCTGTGAAACACCCCGGTTCATCACCACAAGGTCTGACAAGGGTC
CAGGTCAGTGCAATGTCTGTGAAGGACAATTTAGTTGTTGCTGGTGGTTTTCAGGGAGA
ACTTATTTGCAAGTATTTGGACAAACCGGGAGTGAGCTTCTGTACAAAAATATCTCATGAT
GAGAATGGCATCACTAATGCAGTAGAGATATACAATGATGCAAGTGGTGCAACACGACT
AATGACTGCAAATAACGACTTGGCAGTACGAGTATTTGACACTGAAAAATTTACAGTGCT
TGAACGCTTCTCTTTTCCTTGGTCTGTAAATCATACGTCTGTCAGTCCAGACGGTAAACTT
GTTGCAGTTCTTGGAGATAATGCAGATTGTTTGCTTGCAGACTGCAAGACTGGAAAGACC
GTGGGAACTCTAAGAGGTCATTTGGATTACTCTTTTGCAGCTGCCTGGCATCCAGATGGT
TATATTTTGGCTACGGGGAATCAGGACACCACCTGCAGACTTTGGGACGTTAGGAAGCT
TTCATCTTCCCTAGCTGTCTTAAAGGGTCGAATGGGTGCTATAAGATCGATACGGTTTTC
ATCAGATGGCCGCTTTATGGCCATGGCAGAGCCAGCCGATTTTGTGCATCTATATGATAC
TAGGCAGAATTATACTAAGAGCCAGGAAATTGATCTTTTTGGGGAGATTGCTGGAATTTC
ATTCAGCCCCGACACTGAAGCATTTTTTGTTGGGGTGGCTGATCGAACGTATGGAAGTC
TTCTTGAGTTCAACCGTCGACGAATGAATTACTATCTTGATTCCATCCTCTGAGTTTTGAA
AGTTAATGGGAGTGGTGTTTTCTTGAAGTGAAATGGCATCATTCTGTTGAACCAATTCTT
GTATATTAGATATGTAACATGTATGAATGTCCATAGAGCAGAGCTTTGCTCAATCAGAGG
CTTCAAAACCCAAATTCCAAGGTCTATCGAAGCCTCTTTGTTTATAAATGGTGTTGTGGTG
GATACAGCTTTCTGGTTCACGCTGTAGATTTTTTAATGTATGTAACAATTTCAGCGGATAT
AAAGTCTTCCAACTTGTAAACCAAAAAAAAAA
SEQ ID NO:193
GGAGGATTTGGCACACGATCACCTGATCATTTTCCCAACCCTAAAATTCTTATAGCTAGG
GTTTATCCGAATAAAACCCTAAACACGGCGCGCGGTCTTCTTCTTTTATTCGGGACTTTTT
CGCAGCAGAGCATTACGATAACGAAACACGAGCAGATCGGCAGCGGCCACCCGCCGAA
GAAGAAGAAAACCCAGTAGTGCGATCAGAGTCTGAGCAGCCATGGCGGAAGCACTGGT
TCTGCGCGGCACAATGGAGGGCCACACCGATGCGGTGACGGCCATCGCGACCCCGATT
GACAACTCGGACATGATCGTCTCTTCATCCCGCGACAAGTCGATCCTCCTGTGGAACCT
AACCAAGGAGCCCGAGAAGTATGGTGTCCCACGGCGCCGCCTGACTGGCCACTCACAC
TTCGTCCAGGACGTGGTCATCTCCTCCGACGGCCAGTTTGCCCTCTCTGGCTCCTGGGA
CAGCGAGCTCCGCCTCTGGGATCTCAACACAGGCCTCACCACCAGGCGCTTCGTCGGC
CACACCAAGGACGTCCTGAGCGTTGCGTTCTCCATCGATAACCGACAGATCGTTTCCGC
CTCCCGTGACCGCACCATCAAGCTCTGGAACACTCTTGGCGAGTGCAAATACACCATCC
AGCCGGATGCAGAGGGCCACTCCAACTGGATCTCCTGCGTCCGCTTTTCCCCTTCGGC
CACCAACCCCACCATCGTATCGTGCTCATGGGACCGCACGGTAAAGGTATGGAATTTGA
CAAACTGTAAGCTCAGGAATACGCTCGTCGGACACGGAGGCTATGTGAACACGGCCGC
CGTTTCGCCCGACGGTTCGCTTTGCGCCAGTGGCGGCAAAGACGGGGTTACCATGCTC
TGGGATTTGGCGGAGGGGAAGCGGCTGTACTCGCTGGATGCCGGGGATATTATTTATG
CTCTGTGCTTTAGCCCCAACAGATACTGGCTCTGTGCTGCCACTCAGCAATGTGTGAAG
ATCTGGGATCTGGAGAGCAAGAGCATTGTGGCCGACCTTCGGCCCGATTTCATTCCCAA
CAAGAAGGCTCAGATACCTTATTGTACCAGTTTGAGTTGGAGTCCAGACGGAAGCACAT
TGTTTTCTGGTTATACAGATGGAAAAATCAGGGTGTGGGGAATTGGACATGTTTAAGCTT
TAGAGGCAATGGTAGATTATGAAGTCAACACCAGGGAGTTTGACCGTTTGGGACATTAAT
TTTGTCGAGCATGTGTTCTTTTTTCACAAAAATTAGTATAATGCATTTATGATTTCAGTTTT
GATACTATGTTAGAATTTATTTTTCTGCACGGAGGGAAGGCACTCCAAGCTTTGGAAGCA
ATATTTAATCAATGAGCATATTCTATTTCATTAAAATGGATATTTAAGGATCTATTGTATGA
AGTACAATGTTTCAGTTTTGGCATTTTAAAAAAAAAA
SEQ ID NO:194
GCATTTTTCATTTTATATCCTTCGAATCCGTCATTCGGCTGAATGATCAGACAAACCCTCG
CAGATCCTGCTCTGTTCTGAAGCATAAACCTCCGCCAAATCCATGCGTTGAATTTCCTGA
AAAAATTCGTGGCTATTCAGTGTCGAACTGCTATCTCCACAGTCAGATCCAGTGGCGATC
AAGGTGAATTAGGGCAGAAGAACGTGAGATCGTGTACAAATGGCTCCCATAAAATCCAC
GTCTCGATCCGCGTCGGTGGCGTTTGCCCCCGACGCGCCCCTGTTGGCAGCGGGTACG
ATGGCCGGAGCCATCGACTTGTCATTCAGCTCCTTAGCCAACTTAGAGATCTTCAAGTTG
GATTTCCAGTCCGACGATCCGGAGCTCCCCGTCGTCGGAGAATGCCCAAGCAACGAAC
GACTGAATCGGCTTTCGTGGGGGAGCGCCGGTGGCTCCTTCGGAATAATTGCCGGTGG
ACTAGTTGATGGAACCATCAACATCTGGAATCCAGCTACTCTTATCAATTCGGAGGACAA
TGGAGATGCACTTATTGCACGGCTTGAACAACACACAGGACCGGTTCGTGGCCTGGAGT
TCAATACAATTTCTACAAATTTGTTGGCCTCCGGAGCTGAGGATGGGGAGCTTTGTATAT
GGGATCTTGCAAATCCTACAGCACCAACCCATTTTCCACCATTGAAGGGTGTGGGATCA
GGTGCTCAAGGGGAAATTTCTTTTTTAGCTTGGAATCGGAAGGTTCAGCATATACTGGCT
TCCACTTCATATAGTGGAACAACAGTGGTTTGGGATTTAAGACGCCAGAAACCTATCATT
AGCTTTCCAGACGCAACTCGAAGACGATGCTCTGTTTTGCAATGGAACCCTGATGCTTCA
ACACAGCTTATTGTAGCGTCAGATGATGATAATTCACCCACTTTACGGGCTTGGGACTTG
CGAAACACAATTTCTCCATACAAGGAATTTGTAGGGCATAGCAGAGGAGTCATAGCTATG
TCGTGGTGCCCAAGCGACAGTTTGTTCTTGCTAACTTGTGCAAAAGATAATCCCACTCTT
TGCTGGGACACTGGATCTGGGGAGATAGTGTGTGAATTACCAGCTGGGGCAAATTGGAA
TTTTGATGTTCAGTGGTCGCCAAAGATACCTGGAATTTTATCAACATCTTCTTTTGATGGG
AAAATTGGCATTCACAATATCGAGGCCTGCAGTCGAAATGTTTCTGGTGAGGTTGAGTTT
GGTGGTGCTATAGTACGCGGGGGACCCTCAGCTCTCTTGAAGGCCCCCAAGTGGTTAG
AGAGACCTGCTGGTGTAAGTTTTGGCTTTGGCGGAAAGCTTGCTTCATTTCGACCCAGTA
CGGTGGCTCAGGCTGCCGATCATAGACATTCTGAGGTCTTTATACACAATCTAGTTACCG
AGGACAATCTGGTTATTCGATCAACAGAATTTGAAGCTGCAATAGCAGATGGTGAAAAAG
TATCATTGCGGGCTTTGTGTGATAGAAAAGCCGAAGAGTCTCAGTCTGACGAAGAGAAA
GAAACATGGAACTTCTTAAGAGTCATGTTTGAAGATGAAGGCACGGCTAGGACAAAGCT
ACTCGAACACCTTGGATTTAAAGTGCAGTCAGAAGAGAATGGGGATTTACAGGAAACTCA
TTCCAGCAAAATTGATGACATTGGTAGTGAAATACGAAAAACTCTTACTCTTGATGATAAA
ACAGAAGAGGATGTGTTGCCTCAGCTGAAAGGTGGACAAGATGCTGCTATCCCTCAAGA
CAATGGAGAAGATTTTTTTGATAATTTACACAGTCCCAAAGAAGAGGTCTCTCTTTCTCAT
GTAGGAAATGATTTTGTTGGAGAAAAGGATAAGGATATGGTAGTAAATGGTGCAGAGATT
GAACATGAAACTGAAGACTTGACTGAATATTCAGACTGGAATGAAGCTATTCAACATTCC
TTAGTTGTTGGTGATTATAAGGGAGCTGTTCTACAATGCCTGTCAGCAAACAGGATGGCT
GATGCTCTTATTATTGCTCACCTTGGTGGGAATTCTTTGTGGGAGAAAACTCGTGATGAA
TATTTGAAGAAAGCTAAATCTTCCTATCTCAAGGTGGTTTCTGCCATGGTCAATAATGATC
TTACAGGCCTTGTCAACAGCAGGCCATTGAAATCTTGGAAAGAAACTCTTGCTATGCTTT
GTACATATTCACAGAGGGAGGAATGGACTGTTTTATGTGATATGCTTGCTTCTAGATTGA
TAGCAGCAGGAAATGTGATGGCAGCAACACTTTGCTACATATGTGCTGGGAATATTGAAA
AGACTGTGGAAATATGGTCTCGAAGTCTTAAGTGATGATTATGACGGGAGGAGCTTTGTTG
ATCATTTGCAAGATGTGATGGAAAAGACTGTTGTTCTTGCTTTAGCAACTGGACAAAAAC
GGGTTAGTCCCTCTCTGTCAAAATTGGTTGAGAATTATGCAGAATTGCTAGCAAGTCAAG
GACTTCTAACCACAGCTATGGAGTATCTAAAGCTCCTGGGCACTGAAGAATCTTCTCATG
AGCTGTCTATTTTACGAGATAGACTATACCTCTCTGGGACAGATAATAAGGTGGAGGCTT
CATCTTTTCCTTTCGAAACAAGGCAGGATCTGACAGAATCCCAGTACAATATGCATCAAA
CAGGTTTTGGGGCACCTGAAACTCAAAAAAATTATCAGGAAAACGTGCATCAGGTTTTAC
CTAGTGGGTCATACACTGACAATTATCAGCCGACTGCTAATACTCATTACATAGCAGGCT
ACCAACCGGCACCACAACAGCAACCTTCCTTCCAAAACTATTTTACTCCTGCATCTTACC
AGCCAGCACCATCTCCAAATGTTTTCTATCCATCACAAGTTTCTCAAGCTGAGCAGAGTA
ATTTTGCTCCACCTGTGAATCAGCCACCAATGAAAACATTTGTTCCATCTACACCACCAAT
CTTGAGAAATGTAGATCAATATCAGACGCCTTCTTTGAACCCTCAACTCTATCAGGGAGT
TTCTTCAGCTACAGTTGAAACACATCCCTACCAAACTGGTGCACCTGCATCTGTTTCAGT
GGGGACAACCCCAGGGCAGCCATCTGTAGTTCCTAATTTCATGGTCCCGGGACCAGTTA
CTGCACCTACAGTTACTCCCCGAGGGTTCATGCCAGTTACAACACCAACTCAACACCCG
CTAGGTAGTGCAAATCCTCCAGTTCAACCCCAAAGTCCCCAGTCCAGTCAGGTGCAGTC
TGTGACAGCTGCAACCACACCACCTCCAACAATACAAAATGTTGATACATCAAATGTGGC
AGCTGAAATTAGACCTGTAATTGGTACATTAAGAAGGCTTTACGACGAAACATCTGAAGC
TCTTGGAGGGGCAAGGGCAAATCCAGCTAAACGGCGTGAAATAGAAGATAATTCCAGAA
AAATTGGGTCTTTGTTTGCAAAATTGAACTCTGGTGATATATCTTCCAATGCTGCATCTAA
ACTGGTCCATCTCTGCCAAGCTCTGGAGTCTAGGGACTATGCAACTGCTTTTCAAATACA
GGTGGGACTGACAACAAGTGACTGGGATGAATGCAGTTTCTGGCTTGCTGCCCTTAAGC
GAATGATCAAAGTAAAACAGAATATGAGATAAATTTCTGATGGTGCTACAGTTAGATTTTT
TCCAAGTGCCATTTTTACATAATATGTGATTAGTTATTCTGTTCATTATTCTGTTTTTCATT
CAATTTGACATTGGAGTTTCAAGGCATTCCAAGGATAGCATGTACACAAGTTGAATTTTTG
CTTGAGAGTTATTGCAATTTTTGTCTTTTTGAGGACATTGTACAAAAAAAAAAAAAAAAA
SEQ ID NO:195
GCACAGGCTATCTTATTTTGAAATCATTTTGAAATTCGTACCGTGGCACGAGGGTGTTTC
AAACCTAGCGTCCCAGGCCAGCCTCCACACGTTGTGTAGCAAAAACAACATAGGTGTAA
GCGGTGACACGATGAGCTGTCTTGTTTCATAGGACTGCAAGGGCAAAGGATATCAGCGT
TGGAATCAAATGGCGGGAGCCGGAGACTCGCAATTACAAACGAGGGCTGCAAGGGCAA
AGGATATCAGCGTCTAGTGGAATTGGAATCAAATGGCGGGAGCCGCAGACTCGCAATTA
CAAACGCTATCGGAGAGAGACAGCACCCCCAACTTCAAGAATTTGCACACCAGAGAATA
TGCTGCTCACAAGAAGAAGGTGCATTCAGTGGCATGGAATTGTACTGGTACCAAGCTTG
CTTCTGGTTCTGTTGATCAAACAGCACGTGTTTGGAACATTGAGCCACATGGTCATAGTA
AGACCAAAGATTTGGAGTTGAAGGGGCATGCTGATAGTGTTGATCAATTGTGCTGGGAC
CCAAAGCATTCAGAGTTGCTTGCTACTGCATCAGGAGACAGGACAGTGCGTCTATGGGA
TGCTCGAAGCGGAAAATGTTCTCACCAGGTTGAACTTAGTGGGGAGAACATCAACATCA
CGTTCAAACCCGATGGGACACATATTGCTGTGGGAAACCGGGATGATGAATTGACTATA
ATCGATGTCCGCAAATTCAAGCCTCTTCATAAGCGTAAATTTAGCTATGAGGTGAATGAA
ATAGCTTGGAATACAACTGGAGAGCTTTTCTTTCTCACAACAGGAAATGGCACTGTTGAA
GTACTGTCATATCCTTCTCTTCAAGTTCTTCATACACTTGTTGCTCACACTGCGGGTTGCT
ATTGCATTGCCATTGATCCTATAGGAAGGTACTTTGCTGTTGGAAGTGCAGATGCCCTTG
TAAGCCTTTGGGATCTTTCTGAGATGTTATGTGTTCGAACATTTACCAAACTCGAGTGGC
CAGTGCGGACAATCAGCTTCAATCATGATGGGCAATACATAGCTTCTGCCAGTGAGGAT
CTTTTTATTGATATTGCTGATGTTCAGACTGGACGGACTGTACATCAAATATCTTGTCGAG
CAGCCATGAATAGTGTTGAGTGGAATCCCAAATACAATCTCTTGGCGTTTGCTGGAGATG
ATAAAAATAAGTACATGCAAGATGAAGGTGTGTTCAGAGTTTTTGGTTTTGAGACTCCATA
AATGCTCTTTAAGTTGACTTGGCTCATGTGGTTATGCTTCATCAAGGACGGCATTTTGAG
GCTAATAATTATTGTCAGATTTACACCATAAAATTACTATGGAAGTTGGATCATTATCTATG
CCATAGTGGAGTAGAACTAGATTTTGGGGAACATTGTTTTTGACTCGGACATTGAAAAAG
TGACTGCAGATTGCACATGGCCTCTTCTGTACCTAAATCTTTTCAAATGGAGTCATTGGTT
GGTTATGTCCAAGCCCTGTTAGATGGGGCACTTTAAAATCAATGCAGGCTGCAAATGG
SEQ ID NO:196
GATGGGGGGAAGTGGGTTCGTATTCAAATTGGTGAGCTAGCAGGCCCGCCATTTGTTGA
TACATAAGCAAGGTCCGAGGGTATCAAAACTGGAGCTACAGTGTTAAAAATCACTTTTTT
AGATATTTAAAACAGTTCGGCTTATATTGGACAGCGATCAAAGTCTGCTAGAAGAAGCGT
CGATTTAACCTCAGGAGGAAAGGGGGAGGAAATCATGGCGGCAACGTCTCCCGTTGGT
GCCGGCTCGGGACGGGAGCTCGCGAATCCTCCCACTGACGGCATTTCTAACTTGAGATT
TTCCAATCACAGCGACCATTTGCTCGTCTCCTCATGGGACAGAAAGGTGCGATTATATGA
TGCAAGCGCGAATTCGTTAAAGGGTCAATTCGTACACGGAGGTCCGGTTTTAGATTGCT
GTTTTCACGACGACGCATCGGGTTTCAGTGGAAGTGCTGACAACACTGTGAGAAGGTAT
GATTTCAGCACCAGGAAGGAGGATATATTGGGAAGGCATGAAGCTCCTGTTCGTTGTGT
TGAGTATTCATATGCAGCAGGGCAAGTGATAACGGGGAGTTGGGACAAGACATTGAAAT
GTTGGGACCCTAGAGGTGCAAGTGGGCAGGAGAAGACTCTTGTTGGTACATATTCACAA
CTGGAGCGCGTGTATTCTATGTCACTAGTTGGTCACCGGTTGGTGGTAGCCACAGCTGG
CAGGCATATAAATGTTTATGATCTCCGTAACATGTCACAACCTGAACAACGAAGGGAATC
GTCCTTGAAGTATCAAACTAGATGTGTGCGTTGCTATCCAAATGGAACAGGTTTTGCCCT
TAGTTCTGTGGAAGGTCGCGTGGCTATGGAGTTTTTTGATTTATCTGAAGCTGGTCAAGC
TAAAAAATATGCCTTTAAATGTCATCGCAAGTCAGAGGCTGGCAGAGATACTGTTTATCC
AGTAAACGCAATTGCCTTTCATCCCATCTATGGTACTTTCGCCACTGGGGGCTGTGATGG
CTATGTTAATGTATGGGATGGCAACAACAAAAAGAGGTTATACCAGTATTCAAAATATCCT
ACAAGCATTGCTGCTTTATCATTTAGTAGAGATGGTCGTTTGTTGGCTGTGCCATCAAGC
TACACATTTGAGGAGGGGGAAAAACCGCATGAACCAGATGCTGTTTTTGTACGAAGTGT
AAATGAAGCTGAAGTGAAACCAAAGCCCAAAGTTTATGCTGCACCCCCATGAGGCCCGT
CTACATTAATATTCTTTCGTAGAGTATGCTTTCTAGTACATCTTCCTTCTCTTCCTGATGTA
CTTGATTGTTACATTGTTACTGTGTTGGGTATATAAGATACGAGATTCTCTGCTTTTGTTG
GCTGCTATGGTGTTTCTTTTATAATTTAATACTCTATGGATTAATCTCTTGATTCTAGAATC
TAAACTACTACCTTGCGGACATGACTGAGCATCTCTCTAACAGCTTGGGTGGGACACCT
CACTTTTTTTAGGGGTTTTGTTTGTACTAAAT
SEQ ID NO:197
CCGGACTAATTGACTTACGCCGCTAAACTCTATACTCACATTTATTCTGTTTGAAGTCATT
TGGCTGGGAAAATCGATCATCATCAAAAGCTTGCAGCAGTTGAGTCCTCGAGAATTCATG
GCTTCAGACGACGAAGAGGGGTTTAAAAACGAAGAGGCTCCGGGTGTTGTCGATGAAG
CAGAAGTACAGGAGGGTCTGCGGGCCTGTTTTCCTTTGTCGTTTGGTAAACAAGAGAAG
AAACAGGCCCCTTTGGAATCCATCCACTCCGCCACCAAAAGGCCAGAGGATCCACGGC
CCAGACGACAGCTTGGTCCTCCAAGACCTCCACCCTCTATTTTGGCAGAACAAGAGGAT
TCTGATAGATTTGTAGGCCCTCCAAGGCCGCCCCAATTCGTTAGAGACGACAATGACGA
TGGTGAGGCGGAGATCATGATAGGACCTCCAAGACCGCCTGCTCAGTATAGCGATGAC
CATGATAACGAGGAGACCATAGGTCCTCCAAAGCCTTCATATTTAGAAAAGGGAGAGGA
GACTGACCAAATGGTTGGACCTTCCAAACGCGGAAGCGACGACGAAACCAGCGGTGAT
TCTGATGATGGAGACGATGCTGTCGATTTCAGGGTGCCTTTAAGTAATGAGATTGTGCTC
AGAGGCCATACCAAGGTTGTTTCAGCCTTGGCTATTGACCAGACAGGATCACGGGTTCT
TACAGGTAGCTATGACTATTCTGTTCGTATGTATGATTTTCAAGGTATGACTTCTCAGCTG
AAGTCATTTAGACAACTAGAACCAGCTGAAGGTCATCAAGTTCGTAGTTTAAGCTGGAGC
CCAACTTCTGATCGATTTTTGTGTGTAACTGGATCAGCCCAGGCAAAGATTTTTGATCGT
GATGGCCTCACTTTAGGAGAGTTTGTCAAAGGTGACATGTATTTGCGTGACCTTAAGAAT
ACAAAGGGGCATATATCAGGCCTGACATGTGGAGAATGGCACCCTAAGGAGAAGCAAAC
AATATTGACATGCTCAGAGGATGGTTCTCTTCGCATTTGGGATGTAAATGACTTTAATACT
CAGAAGCAGGTCATTAAGCCTAAACTTGCGAAACCTGGAAGAGTACCAGTGACTGCTTG
TGCATGGGGTCGTGACGGGAAATGCATTGCAGGTGGTGTTGGTGATGGATCAATACAG
GTGTGGAATCTTAAACCAGGATGGGGAAGCAGACCAGATTTATATGTTGCCAACGGACA
TGATGACGATATCACAGGTCTTCAGTTTTCTGCTGATGGAAATATCCTTCTGACCAGAAG
CACTGATGAAACACTCAAGGTTTGGGATCTACGGAAGGCTATAACTCCACTGCAAGTATT
CAGAGATCTTCCAAACAACTATGCCCAAACTAATGTAGCATTTAGTCCAGATGAGAGGTT
AATATTTACAGGGACCTCTGTTGAAAGGGATGGTAATTCTGGAGGCTTATTATGTTTCTAT
GATAGGCAAACACTGGAATTGGTGTTAAGAATTGGTGTGTCCCCAGTGCACAGTGTTGT
ACGTTGTACTTGGCATCCACGGCATAATCAGGTCTTTGCTACAGTTGGGGATAAAAAAGA
AGGTGGAGCACATATACTATATGATCCAGCTTTGAGTGAGAGGGGGGCACTTGTTTGTG
TTGCAAGAGCACCCAGGAAAAAGTCCTTGGATGATTTTGAACCCAAACCTGTTATTCACA
ACCCACATGCCCTTCCCTTGTTTCGAGATGAACCAAGTCGTAAGAGGCAACGGGAAAAA
GCACGCATGGATCCCATGAAGTCACAGAGGCCAGATTTGCCAGTCACAGGTCCTGGTTT
TGGTGGGAGGGTTGGATCAACGAAGGGCAGTTTGTTGACACAATACCTTCTCAAGGAAG
GGGGGCTGATTAAGGAGACATGGATGGAGGAAGACCCAAGAGAGGCTATACTGAAATAT
GCTGATGTTGCTGCAAAAGATCCCAAGTTTATTGCACCTGCTTATGCACAGACACAGCCG
GAAACGGTTTTTGCTGAAACTGATTCTGAGGAGGAACAGAAATGATATATAACTGATGAT
GTGGGTCATATTCATCCGGGTCTACCAATTTCCAATTTCAGGGCAACATACTATTGAATA
GAGTGCAGTTGACAAGGCAAAAATAAGAGAGCTGGTCCACATTATCAAATTTGCATTATG
ACAAGGCCAAACAAGAGATCTTGTCTATAATGTTAAATTTGTATATTATCTAACTGATTCTT
GTAATGTCTGCAATTGTATACCAAACAGGGTTGTGCTAGTTTAACATTTTAACTTAATGTA
ATCATGTAAGCTTTAGAGAGGTGGCAAGACATTTTCGGTTTATATTATTTATAATTTGTTTA
TGAAATACTTAGTTTATTATATTATCAATCCTTTCTCTATTT
SEQ ID NO:198
GCGAGGGTCCACGATATTTCCTATCGTCCTGTTACTCGATAAAGCATAAACCTTGCAAGC
TCTGAACTAGGTTGCTTTCCATCAGACTGAGCTATTTGAGTTACAGTGACCAGGAACAGC
AACAAGTGATATCTATAGTGTTGAAATTGTGCGTGGAGTGAGGGTTTAAGGGAACAGAAA
GGACAATGAAGGAGAGGGGGCAGAGTCATGCAGGGCAACCGTCGGTCGACGAAAGGT
ACACTCAATGGAAGTCACTGGTGCCCGTTCTCTACGATTGGCTTGCTAATCACAACCTCG
TCTGGCCTTCTCTTTCCTGCCGTTGGGGTCCTCAAATGCACCAAGCAACATACAAGAACA
GTCAACGCCTGTACCTTTCGGAACAGACTGATGGCACTGTGCCAAACACATTGGTAATT
GCTACCTGTGAAGTTGTCAAACCAAGGGTTGCAGCACCTGAACATATATCTCAGTTTAAT
GAAGAGGCACGTTCTCCTTTTGTAAAGAAAGTTCAAGACGATCATACACCCTGGTGAGGTT
AACAGAATCAGAGAATTGCCACAAAACAGCAAAATTGTCCCTACACATACAGATGGCCCA
GATGTCCTCATCTGGGATGTTGATACACAACCAAACCGTCAGGCTACTCTTGGGGCAGC
AGATTCTCGTCCAGACTTGGTTTTGACTGGACATAAAGATAATGCTGAGTTTGCACTTGC
GATGAGTCCCAGTGCACCATTTGTGCTTTCTGGAGGAAAGGACAAATGTGTTTTGCTTTG
GAGTATACAAGATCATATATCTGCTGCAACTGAGCCTTCCTCTGCAAAAGCATCAAAAAC
ACCATCTTCAGCACATGGAGAGAAAGTACCAAAGATCCCTTCGATAGGTCCTAGAGGTG
TTTATAAAGGGCATAAAGATACGGTTGAAGATGTGCAATTTTGTCCTTCAAATGCACAGG
AATTCTGTAGCGTGGGTGATGACTCTGCTCTTATATTGTGGGATGCTCGAAATGGCAATG
AACCAGTCATTAAGGTTGAGAAGGCACACAATGCAGATCTTCATTGTGTTGATTGGAACC
CCCATGATGAGAATCTAATCTTGACTGGGTCTGCTGATAACTCAGTGCGCATGTTTGACC
GCCGGAATCTTACATCCAGTGGGGTTGGCTCACCTGTCCATAAATTTGAAGGTCATTCTG
CTCCTGTGCTTTGTGTGCAGTGGTGTCCAGACAAAGCATCCGTGTTTGGAAGTGCAGCA
GAGGATAGTTACTTAAATGTGTGGGACTATGAGAAGGTTGGAAAGAATGTTGGTAAGAA
GACACCGCCTGGACTTTTCTTCCAGCATGCAGGGCATAGGGATAAGGTGGTTGACTTTC
ATTGGAACAGTTTTGACCCATGGACCATTGTCAGTGTATCTGACGATGGTGAGAGTACTG
GTGGAGGAGGAACTTTACAGATTTGGCGAATGAGTGATTTGATATACAGGCCTGAAGAT
GAGGTTTTGGCAGAACTCGAAAGATTTCGAGCTCATATTCTATCTTGTCAGAACAAGTAG
AGAATGTTACGACTACCTCAGTACTTCTTTTTTTATAATTGTCTATAAACTATCCAATGTAA
ATGTTTACATTGAGGTCATGCATGAGTGTTAATTACGCTTTCACTACTGTTCACTTTTTTCT
ATAGACAGTGGAAATGGAATGATGTTGATCTGGCCATGTATAGGAAATCTAATATAAGGA
TGTCCAAAGTTGTGATTGGCATCAAGTCCAATCCAAAGTTGTGAACAATATTATATCGGTA
TTAACCACTAAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:199
CTGATCTTGAGTGTCGAAGAAGAAGATTAAATGGCGAGCAGTTTCGAGGCGTGAGGGCG
TAGGCTGAAATCTTTAGCCTGAAACCCTTGGCTGAAGTGTTCGTTCGACGAAGCTCTGC
GGCCGTTAAAGCAATCCTCCGGAAGTTGTAAGATTTTGTCGGAAATCCCGGTATGCAGG
GCTTGCTGTTTGATTGAATGTTTTGGAACGGACTTGGATAGTTAATGATAAAGTAAAACTG
GCATCATGTCATCTCTTAGTAGAGAATTGGTGTTCCTTATCCTACAGTTCTTGGATGAGG
AAAAATTCAAGGAGTCGGTGCATAAGCTTGAGCAAGAGTCGGGTTTCTTTTTCAATATGA
AATATTTTGATGAGAAGGCTCAGGCTGGTGAATGGGATGAAGTTGAGCGTTATCTCTCAG
GCTTTACAAAGGTTGATGATAACCGGTATTCTATGAAAATATTCTTCGAGATCAGGAAACA
GAAGTATTTGGAGGCCTTGGACAGACAAGACCGAGCTAAAGCGGTGGATATTTTGGTGA
AAGATCTGAAGGTATTCTCAACATTTAATGAAGAACTATACAAGGAGATAACTCAGCTTCT
AACACTGGATAACTTCAGAGAAAATGAACAACTGTCCAAATACGGTGATACAAAGTCTGC
CAGAACCATTATGATGTCGGAACTTAAAAAATTGATAGAGGCCAATCCCTTATTTCGAGA
AAAGCTCATTTACCCAAACCTAAAAGCATCAAGGTTACGCACATTAATTAATCAAAGCTTG
AACTGGCAACACCAACTTTGCAAAAATCCTAGACCAAACCCAGACATAAAGACATTGTTC
ACTGATCATGCATGTGGACCTCCTAATGGAGCCCGGACACCTACGCAGCCTACGGCTTC
TCTGGGGGTGCTACCCAAAGCAACCACATTTACACCAATTGGACCCCATGGGCCTTTTC
CATCATCTTCAACCGCTACTAGTGGCTTGGCAAGTTGGATGTCAAATCCCAACATGGTGA
CATCTCCCCAAGCTCCTGTTGCTGTGGGACCTAGTGTTCCAGTTCCACCTAATCAAGCTA
CTCTTCTAAAACGTCCCAGAACCCCTCCAGGCAGTTCATCTGTGGTGGATTATCAAACTG
CTGATTCTGAGCAATTAATCAAGCGCTTGCGTCCTGTATCCCAGTCCATTGATGAGGCAA
CCTATCCAGGTCCTACTCTGCGAGTACCATGGTCAACAGATGACCTTCCAAAGACACTA
GCTCGGGCACTTAATGAACCTTATCCTGTCACAAGCATAGATTTCCACCCCTCTCAACAA
ACATTTTTACTAGTGGGCACCAAAAATGGAGAAATAACATTATGGGAGGTTGGTTCAAGA
GAAAAGCTGGCTACAAGGTCCTTTAAAATTTGGGATAATGCAAATTGCTCCAATCATTTG
GAGGCTGCATTTGTCAAAGATTCTTCAGTTTCTATCAATCGAGTATTGTGGAGCCCTGAT
GGAACATTGATAGGTATTGCTTTTACAAAGCATCTTGTTCACACATACACTTTCCAAGGAC
TTGATCTGCGACAACACTTGGAGATTGATGCACATGTTGGTGGAGTGAATGACTTAGCAT
TTTCTCATCCAAATAAGCAACTTTGCGTTGTAACTTGCGGAGATGATAAGATGATTAAGGT
CTGGGATGCTGTAACTGGTCGCAAGCTTTATAACTTTGAGGGCCATGATGCACCTGTCTA
TTCTGTTTGTCCTCATCACAAGGAAAATATCCAGTTTATATTTTCGACTGCCGTTGATGGA
AAAAATTAAAGCTTGGTTGTATGATCATTTGGGATCGAGAGTAGATTATGATGCTCCTGGG
CATTCCTGTACAACGATGATGTATAGTGCTGATGGGACAAGGTTATTTTCTTGTGGCACA
AGCAAAGAAGGAGAATCCTTCCTAGTTGAATGGAATGAGAGTGAAGGACCAATTAAGAG
AACATATTCCGGGCTTAGGAAGAAGGGTTCAGGTGTTGTGCAGTTCGATACAACGCAGA
ACCATTTTTTAGCTGTGGGGGATGAACACCTGATAAAGTTTTGGGATATGGACAGTACCA
ATATGCTTACGAGTTGTGATGCTGAGGGTGGTCTACTGAACCTGCCTCGTCTAAGATTCA
ACAAGGAAGGATCTCTTCTTGCAGTGACCACAGTAAATGGAATCAAGATTCTTGCTAATG
CAGATGGACAGAAATTACTTAAAACAATGGAGAATAGGACCTTTGATTTGCCATCAAGGG
CACACATTGATGCACCTGCAATTAAGCCATGTTGTAACTGGTCATTCAGATGCCTGGCGG
TATTCCCATTGTCGCAGCAAGTGCAACTTCGAGTCCAGCCACTGGAAGAATGGAGCGCA
TTGAGAGAACCTCCTCAGCCAATACTGTTTCTGGAATTAATGGAGTTGATCCAGCACAGA
GTTCAGAGAAACTAAGGTTATCAGATGATTTATCTGAGAAAACTAAAATATGGAAGCTGA
CTGAAATTACTGATTCTATCCAATGTCGGTGTATAACATTGCCGGAGAATGCAGCAGAGC
CTGCAAGCAAGGTTTCACGGCTTCTATACACAAATTCTGGAGTCGGATTACTAGCTCTTG
GATCTAATGCAGTGCATAAATTGTGGAAATGGAACCGCAGTGAGCAGAATCCCAGTGGC
AAGGCAACTGCAAGTGTTCATCCACAGCGCTGGCAACCAACCAGTGGTCTTCTCATGAC
CAATGATATAACTGATATAAATCCACAAGAGGCTGTTCCATGCATAGCTCTGTCAAAGAA
TGACTCATATGTCATGTCAGCATCAGGTGGAAAAGTATCTTTATTCAACATGATGACATTT
AAGGTTATGACTACTTTCATGCCCCCACCCCCAGCATCAACGTTCTTAGCATTCCATCCA
CAGGATAACAATATCATAGCTATTGGAATGGAAGATTCAACTATCCACATATATAATGTTA
GGGTAGATGAGGTTAAAACTAAGTTGAAAGGACATCAGAAGAGAATCACAGGTCTTGCC
TTTTCCAGTACACAGAACATTCTAGTGTCTTCAGGTGCAGATGCACAGCTGTGCGTATGG
AACACAGAGACATGGGAAAAGCGTAAGTCGAAGACTATCCAGATGCCAGTTGGAAAGAC
AGTTTCTGGTGATACACGAGTTCAATTTCATTCTGACCAGCTTCATATACTTGTTGTGCAT
GAAACACAACTTGCCATATATGATGCCTATAAATTAGAACGACAATACCAGTGGGTGCCA
CAAGATGCTCTTTCAGCACCAATATTGTATGCAACATATTCATGTAATAGGCAACTTATTT
ACGCGACATTCAGTGATGGTAATATTGGTGTATATGATGCTGAAATACTTAGACCAAGAT
GCCGCATAGCCCCAACCACCTATTTAAGTTCAGGGACTAGCAGTTCTACCTCTTTACCCT
TGGTTGTTGC
SEQ ID NO:200
GGGGACTTCAAACAAGCCGTCTTTCACTGCATGCTCCCTTTCCCGCAAAACACAAACCAA
ACACGTTAAACTCTAACCCAAACCTTCTCGCAATTCTTACCAAATCCAGTTACTCTGAGCC
AAAACCCTTGGCGGGAAAACCCCAGTTAGGAGCTTCCGGCCATGGCGAAGGACGAAGA
AGAAATTCCGCGGCGAGATGGAGGAGCGCCTGGTGAACGAAGAGTACAAAATCTGGAAG
AAAAATACGCCGTTTCTTTACGATCTGGTGATAACGCACGCCCTCGAATGGCCTTCACTC
ACTGTACAGTGGCTCCCGGACCGCGAAGAGCCCCCTGGAAAAGATTATTCCGTTCAGAA
AATGATACTGGGGACTCATACTTCCGACAACGAGCCCAACTATCTGATGCTGGCCCAAG
TTCAACTCCCGCTAGAGGATGCAGAGAATGACGCCAGGCAGTATGACGACGAGCGCGG
GGAGATTGGAGGGTTCGGCTGCGCCAATGGCAAGGTACAAGTAATACAGCAAATAAATC
ACGATGGAGAGGTCAATAGAGCCCGATACATGCCACAAAATCCTTTCATTATTGCCACGA
AAACAGTTAGTGCAGAAGTCTATGTGTTTGACTACAGCAAGCATCCTTCAAAGCCTCCTC
AAGATGGTGGATGTCATCCTGATCTCAGATTGAGGGGTCATAATACAGAAGGTTATGGTT
TATCATGGAGCCCTTTTAAGCATGGCCATCTTTTAAGTGGTTCAGACGATGCACAGATCT
GTTTGTGGGACATTAATGTACCTGCCAAAAACAAAGTGCTTGAGGCCCAACAAATATTTA
AGGTGCACGAGGGTGTTGTAGAAGATGTTGCATGGCATTTAAGGCATGAGTACCTTTTT
GGGTCTGTTGGAGATGACCGCCATTTGTTGATATGGGATTTGCGTACATCTGCAACTAAT
AAACCACTGCACTCAGTAGTAGCTCATCAAGGTGAGGTTAACTGTCTTGCATTCAATCCT
TTCAATGAGTGGGTACTGGCTACAGGATCCGCAGACAGAACGGTGAAACTTTTTGATCT
GCGCAAGATATCCAGTGCTTTGCATACCTTTTCCTGTCACAAGGAAGAGGTTTTCCAAAT
AGGCTGGAGCCCCAAAAATGAAACAATATTGGCTTCTTGTAGTGCAGACAGAAGGCTTAT
GGTGTGGGACCTCAGCAGGATTGACGAATTCCAAACACCAGAGGATCCTTTAGATGGAC
CACCTGAGTTGCTGTTTATTCATGGTGGACATACTAGTAAGATATCAGATTTCTCATGGAA
TCCATGTGAGGATTGGGTTATAGCTAGTGTAGCTGAAGATAACATTCTCCAAATCTGGCA
AATGGCTGAAAATATATACCATGACGAGGAGGACGATATGCCTCCTGAAGAAGTAGTGT
AACTTTTATCTAGCTAGAAGTTGTGAAATTAAGAGGGATGTGAGGATTGGGTTATAACTA
GTGTAGCTGAAGATAACATTCTCCAAATCTGGCAAATGGCTGAAAATGTATATCATGGTG
AGGGGGATATGGCTTCCAAAGAAACATTGTAAGAGCTAGCTAGAATTTGTGAAATTAAGA
GGTGTATTCACTTTCAGAGTTTCTCAACAAATGACATGGTTCTCATTCCATTTTCTTTTATA
TAATGAGAAGCAAAACTTGGCTTAAAAAAAAAA
SEQ ID NO:201
AAATAAGACCACCATTTGTTACCCTCTGCGAATCACCGTTTATGCTACTAGGAAGCAGAT
CGATTGGGATAAATTCATCCTCTTGGCTCGCATACAGCTTCCCTTCCAACATTCATCTCG
ATGTAAATTACAGTCACAGAACTCATCAAGCAATATGTCTCCAGGAGTGAAGCAAACGGG
CAGTCAGAAGTTCGAATCCGGCCACCAAGATGTTGTCCATGACGTCACAATGGATTACTA
CGGGAAACGCATAGCGACCTGCTCTGCAGATCGGACTATAAAGCTTTTCGGCCTGAACG
CCTCCGATACCCCAAGCCTTCTGGCCTCGCTTACGGGTCACGAAGGCCCTGTCTGGCA
GGTCGCTTGGGCTCACCCAAAGTTCGGGTCCATGCTCGCTTCCTGCTCATATGACGGAC
GCGTAATAATCTGGCGAGAGGGTCAGCAGGAGAATGAATGGTCGCAGGTTCAAGTCTTC
AAAGAGCACGAGGCCTCTGTGAATTCGATCTCATGGGCACCCAATGAATTGGGACTCTG
CCTGGCCTGTGGTTCCTCTGACGGCTCCATAACCGTCTTTACATGCCGCGAGGATGGGT
CCTGGGACAAAACGAAGATAGATCAAGCTCATCAGGTGGGCGTGACGGCAGTTTCATGG
GCGCCGGCTTCCGCTCCCGGTTCCCTTGTGGGTCAGCCCTCAGATCCCATTCAGAAGCT
CGTTTCCGGTGGGTGTGATAATACAGCTAAGGTTTGGAAATTTTACAATGGTTCTTGGAA
GCTAGATTGTTTTCCTCCTCTTCAAATGCATACAGATTCGGTTCGGGATGTTGCTTGGGC
GCCCAATTTAGGGCTTCCCAAGAGTACTATTGCTAGTTGTTCTCAGGATGGGAAGGTTGT
TATATGGACACAGGGGAAAGAAGGGGATAAATGGGAAGGGAGAATTTTGAATGATTTTA
AGATCCCCGTCTGGAGGGTCAACTGGTCCTTGACTGGAAACATCTTAGCTGTCGCTGAT
GGGAACAACAGTGTGACTCTTTGGAAGGAGGCAGTAGACGGGGATTGGAATCAGGTGA
CCACAGTACAGTGACAAATTCCTAAGAGGGAGAATTCAACATTGGATAGCTGATGAAAAA
AGACATCGAAGGACTATCAGAATGTATGAATTGCATGCAAGAGCAAGAATGAATCAGGC
ATTAAAGCGGGAATCATTTATGACTTGGCAACCTGAAAATTCTATTAAATTCACAAAAGTC
ATGACTTGGATGTTTAAAGAAAACAGTTTTAAGTAAGCTTTGGTCCTCTCTTGTGAGAATG
GTATGGTGGTACCTTTGGCTCAAGCAAAACAGTTTTATTTTTAAAGAGGGTAAGAACCAA
TCCTGGTATCAGCTAAAGCCCTTTTAAATTAAGGCTTTTATACACGATGACTCAGCTGCC
AACCGAAATCCAACTGCTTAATCAACTTTTAGCTCACCCGGAGGATCAATAAAGAGTCTC
TTCCAACCAATTACAGTTCCTCCCCTTTTGATGAAAGGTTTCCTTGTTTTTTTGATGATCG
TTCCAAGGTAGTCCACATCCCCATTATGCTACTAGAGTGGTTCTTACTTGAGTGGGATGT
GTTTTTTTTTGTAAACAAAGTTGAGTAAGTATTTCATTAAAATCTGTCCCCAGATGCAACA
AATACCCGAGAAATCAGATAAAAAAAAAA
SEQ ID NO:202
TTTAGCTCCACTTGAAACCTCGACCAAATATAATTTTAGTTTTAGATCTATACTTGCGCAT
CTCATTCCATCTTCTCAACAAGGATACAGGCCTCGACCAAATATAAATTTAGTTTTAGATC
TATACTGCGCATCTAATTCCATCTTCTCAATAAGGATACAAGGATACAGGCCGACGATGG
ACGGTTGAATAGATAAATTTCTATAGCTCGAAATCATCTGCTTGTTGTCGGTTGAATAGAT
AAATTTCTATAGCTCGAAATCATCTGCTTGTTTCTCCTGCGAGCTTGCCTCTTTTTGATCC
ACCTCAGCTGGTCCAGAGAAAAACACGAGATCCGTGACAAGGGAAGGGCAGCAATTGC
AAACTGGATCGAATTTTTACTGCCCAAGCTGCTTTCTTCCGGATTTGCGGATAAGGTTCT
AGGGCTTCCATCGCTTGGCAAATACGAAGTGGCTTGGTCTTCCAGTTTGTTCCTTCTCCT
CGGGCTGGCAGAAGAAGAAGGGCGACAACAAATATACAGAACTCATCGATCAATATGTC
TTCAGGAGTGAAGCAAACGGGGAGTCAGAAGTTCGAATCCGGGCACCAAGATGTTGTCC
ATGACGTCACAATGGATTACTACGGGAAAAGGATAGCGACGTGCTCCGCAGATCGGACC
ATAAAGCTTTTCGGTATGAACACCTCAGATACCCCAACCCTTCTGGCCTCGCTTACTGGT
CATGAAGGCCCTGTCTGCCAGGTTGCCTGGGCTCACCCAAAGTTCGGATCCATGCTTGC
TTCCTGCTCGTATGATCGGCGCGTAATCATCTGGCGAGAGGGTCAGCAGGAGAACGAG
TGGTCGCAGGTTCAAGTCTTCAAAGAGCACGAGGCCTCCGTGAATTCGATCTCGTGGGC
ACCCCACGAACTGGGACTCTGCCTCGCCTGCGGTTCTTCTGACGGCTCCATAACCGTCT
TCACAGGCCGCGAGGATGGATCCTGGGACAAAACGAAGATAGATCAAGCTCACCAGGT
GGGCGTGACGGCAGTTTCATGGGCGCCCGCTTCCGCTCCCGGTTCCCTTGTGGGCCAG
CCCTCAGATCCCGTTCAGAAGCTTGTGTCCGGTGGGTGTGATAACACAGCTAAGGTTTG
GAAATTTTACAATGGTTCTTGGAAGCTAGATTGTTTTCCTCCTCTGCAAATGCATACGGAT
TGGGTTCGGGATGTAGCTTGGGCACCCAATTTAGGGCTTCCCAAGAGCACTATCGCTAG
TTGTTCTCAGGATGGGAGGGTTGTTATATGGACACAGGGTAAAGAAGGGGATAAATGGG
AAGGGAAAATTTTGAATGATTTTAAGACCCCTGTCTGGAGGATCAGCTGGTCCTTGACTG
GAAACATCTTAGCTGTTGCTGATGGGAACAACAATGTGACTCTTTGGAAGGAGGCAGTG
GACGGTGAGTGGAATCAGGTGACCACAGTACAGTGACAAATGTAAGCTTCTTTTTACGAT
TTTGATTTCTTGACGTTTTAATATGGTATGGTATTAAATTTGGAAGGCCTATTCGATTGTTT
GCAAAATAATAAGTTTGTCTCGAAATTGGGTTATCCTATCCAACTTGTTGTGTCTATTGTT
CTAAAGTTTGTCTCGAAATTGGGTTATCCTATCCAACTTCTTGTGTCTA
SEQ ID NO:203
GTCTTCTTCATGGTAGAGATAGTGTCCAACACTAAAAATAATCCAATTTTCTGGTTCCAAC
CTCAGGATATTTTACTTCAGTCTTTTGTCAGAATATTGTCTTCTCCTCGTGAAAACAGTGT
CTACCACTATAAAATCTCCAGTATTCTCTGTGCCTACTATAAATTTCCAGTCTTCTCTGTT
CTGGCTTTGTTGATATGTTGCTTAAATCCTCAAGTGTCAAAGGATTTTGCATAGTGTTTAG
CAGCATTTCCACAAGAAGAAGAAGAGGTGTTTCAATAAGATGAAGAAACGGTCACGACC
CAGCAATGGGCACCTATCGACGGCAGCGAAGAATAAAAGCCGAAAAACTGCCCCTATAA
CAAAAGACCCGTTCTTTGACTCCGCTCATAATCGAAACAAAAGTAAAGGCAAAGGCAAAA
GCAGAGGCAAAGGAGAAGAGATTTTCAGCAGTGATGAAGACGACGACGCTATTGGCAG
AGATGCTCCAGCAGAGGAAGAAGAGGAAATTGCCGAGGAAGAAAGGGAAACCGCAGAT
GAAAAGCGCCTCAGGGTGGCTAAGGCCTATCTGGATAAGATCAGGGCCATAACTAAAGC
CAATGAAGAAGATAACGAGGAGGAGGCGGGCGAAGACGAGGAAACTGAGGCAGAGCG
AAGGGGTAAAAGGGATTCTCTTGTGGCCGAGATTCTTCAGCAGGAACAGCTCGAAGAGA
GTGGGCGAGTGCAGCGCCAGCTAGCCTCCAGAGTTGTGACACCATCAAAGCTGGTAGA
GTGTCGAGTTGTTAAAAGGCACAAGCAGTCTGTCACAGCTGTGGCGCTAACAGAAGATG
ATTTGAGAGGATTTTCAGCATCAAAAGATGGCACTATTATTCACTGGGATGTTGAAACTG
GTGCAAGTGAAAAATATGAATGGCCTAGCCAAGCAGTATCTGTTTCAAGTTCGAATGAAG
TCTCCAAAACACAGAAGGGCAAGGGTTCAAAGAAACAGGGTAGCAAACATGTTTTATCAA
TGGCTGTGAGTTCTGATGGCCGTTATTTGCAACTGGGGGTTTAGATCGTTATATTCATT
TATGGGATACTCGAACACAAAAACATATTCAGGCTTTTCGGGGTCATAGAGGAGCTGTGT
CTTGTTTAGCCTTTCGTCAAGGCACACAGCAACTTATATCAGGATCTTTTGATCGTACAAT
CAAGTTGTGGAGTGCAGAGGATAGGGCCTATATGGACACTCTTTACGGTCACCAAAGTG
AAATCCTTGCAGTTGATTGTTTGCGAAAAGAACGAGTTCTGTCTGTTGGACGCGATCACA
CTTTGAGACACTATGGAAGGTCCCTGAGGAGACGCAGCTGGTCTTCAGAGGGCATGCAGC
ATCTTTAGAATGTTGCTGTTTCATCAATAATGAAGACTTCCTATCTGGCTCTGATGATGGA
AGCATTGAGCTTTGGAGTATGTTGAGAAAGAAGCCAGTTTTTATGGCAAAAAATGCACAT
GGGCATGCTATTGTAGAAAATCTTTCAGAAGATACGAGTACTAGGGAAGAGCCAGATGA
GGAAGTGACAACAAGACAATTACCTAATGGAAATAGTATTGGGAATGGTATGACAAATCA
AATGGGAATCACCCCTTCCGTAGAGTCATGGGTTGGAGCTGTCACAGTGTGCAGAGGAA
CTGATCTTGCTGCATCAGGAGCTGGTAATGGGGTGGTACGGTTATGGGCTATTGAAAAT
AGCAGTAAAAGCCTTAGAGCTCTACATGACATTCCTCTGACTGGATTTGTCAATTCACTTA
CTTTTGCACGGTCTGGACGCTTTCTTATTGCTGGAGTTGGCCAGGAACCTCGACTGGGT
AGATGGGGCCGTATTCAGGCTGCTCGCAATGGTGTGACTTTATGCCCAATTGAGCTTTC
ATGATGGATTTTGGGGGCAAATCAAGATGATGGTATCTGTTTGCATTTCCAGCACAGTTC
TGATTCTTTACTAGGGTAACTGTCACATTCAACAGAGAATCCCAGAATGGAAAGCCTCGC
ATACTTAATTATTGTTTTGATGCCTTTTCTTATAACCTGTACGATTGCCGATATATCACCAA
TTTTGCTGATTTTAATCTGAGTTTGTAGTGCGTACAAAAGGCTGCTATAAAAATATTGAAT
TTTTAGATTCCAAGATCAAAAAAAAAA
SEQ ID NO:204
GGTGAATATGGCAAAACATACGTTAAAAAATCGAGGGGGAAGATGGATGGATGTTAAAC
ATTTGAACTTTCAGACAAACAGCCTTATAAACCCTTCGTGAGCTATATTAAGATGGATGTA
TTCTACATCCCTGAATACCCAAACCTACTCAGGCGTATCAAGCGTACGATAAACCATCTC
TCAGTCACCTACCGTCCGAGGAGAACAGCGGATTTCTATCAGTAGAATAGTTCCATTGAC
AGACGACATTTGCGACTCGGAGCACAGGGCTAGAGCAATTTCAAGGTTTTAAGCAGTTTT
CTGGAGGCTCGGATTTGGCCAGCAATACTTGCAATACTCAATTGGAGGCAAAGCTAAGG
CAGGAGCCCAGGGAGACTTGTACATGAAAAACTTCGCAGCCTGCGGTTCTGGGAGATTT
GAAGGTTGAGCAGTTTTAGCTATTAAACATTTCTTCTAAAGCAAGCTTCGACAGATTCTGT
ACAGTGATAGTTTCTGCGGCCTTTTAAACGGGGCAGAATTGAGAGATTCTAGGTCGCCG
TAAGTGCGATTTTAGGCATTCGGCTGTGCTTCCTCGTATTGCGAAAAGTTAATTAACAGA
GAGTTTCTTCAATGGCTGCTACCTTCGGCACCATTAATACTGCCACAAGTCCTCACAATC
CCAACAAATCCTTCGAGATCGTTCAGCCCCCAAATGATAGTATCTCAACCCTATCTTTCA
GTCCAAAAGCTAACTATTTGGTGGCTACATCCTGGGATAATCAGGTTAGATGCTGGGAG
GTCCTTCAAACTGGGGCTAGCATGCCAAAAGCAGCAATGTCACATGACCAACCGGTTCT
CTGCTCGACATGGAAGGATGATGGCACTGCTGTTTTCTCTGCTGGCTGTGATAAACAAG
CTAAAATGTGGCCACTGCTGACTGGGGGTCAGCCAGTGACTGTTGCCATGCATGATGCA
CCTATTAAGGACATTGCATGGATTCCAGAAATGAACTTGCTTGCAACTGGAAGCTGGGAC
AAGACACTTAAGTATTGGGATACACGACAGTCAAATCCAGTGCATACACAGCAATTACCA
GAGCGTTGTTTTGCATTGAGTGTTCGGCATCCTCTTATGGTTGTTGGGACTGCTGATAGG
AATCTTATAATTTTTAATTTGCAGAATCCTCAAACTGAATTTAAGAGAATTTCATCTCCTCT
AAAATACCAAACAAGATGTGTTGCAGCATTTCCAGACAAGCAAGGATTTTTGGTGGGTTC
CATAGAAGGAAGAGTTGGAGTGCATCATGTAGAAGAGGCACAACAGAGTAAGAACTTCA
CTTTCAAGTGTCACAGGGATTCTAATGACATATATGCAGTTAATTCATTGAACTTCCATCC
GGTTCATCAAACCTTTGCTACAGCAGGATCTGATGGAGCATTCAATTTCTGGGATAAGGA
CAGTAAGCAGAGGCTGAAGGCAATGGCAAGGTCTAATCAACCTATTCCTTGCAGCACAT
TTAATAGTGATGGTTCTCTATATGCTTATGCGGTGAGCTATGACTGGAGTAAGGGTGCAG
AGAATCATAATCCAGCCACTGCTAAACACCATATTCTTCTCCATGTTCCACAGGAAAGTG
AGATTAAAGGGAAACCCCGAGTTACAACAAGCGGGAGGAAATAAAGTCACTTATGTCTTC
TGTTGCAATGTTGAGTGCCCTATTCTCGGTGCATGGATTTTAGCTTTATTAGTTCAGGAAT
TTCATGGAGCCCCTGTTGTTTGTGTGATGTAACTTTTTGATATATGATGTAATATACGATG
TTATATTACAAGGTATTGATATTCAATCCAATTTCATATTCGGGTTCAATGTAGTGCCTCT
CATTTTAGGGTGATAGCATGAGTTTTTTTTTAGTGTTATGACATCGACATCTACGTAGAGT
TGCATGAACAAAAAAAAAA
SEQ ID NO:205
GATAAATATATGACATTTGCTCTTCTGTCGAGAACCTGTGGACCGGGGTAGTAGCATATA
CATATGTAGGTTGAGAGCGAGAGGATAGCGAAATCCTTACCTATAGCCACCACAAGCAG
TGGAAGTAGCACTGGCAGTGAAACTAGAAGCATAAGTAGTTACGGAAGAAGCACCACAA
GCATCAAAGCTAGAAGCAGAGTACTCCGATTTCCATTTTTCCTGGAAAGGGCCATTGAGT
GTATGGTCGTAATGGATAAGGGGACACATCAGACTAACGAGGACGAGAGCGAATCAGA
GTTCATCGACGAGGACGATGTGATTGATGAAATTTCCATAGATGAGGAAGATCTTCCAGA
TGCTGATGTGGAAGGCGAAGATGTGCAAGAAGACAACAAAAGAAGTGAGCCAGATGAGA
ACTCAAGTAGTTTGGACGATGCTATACACACTTTCGAAGGTCATGAAGATACTCTGTTTG
CTGTAGCATGCAGTCCAGTTGATGCAACATGGGTTGCATCAGGTGGTGGAGATGACAAA
GCTTTTATGTGGAGAATAGGCCATGCAACGCCATTTTTTGAGCTAAAAGGGCATACCGAT
TCTGTTGTAGCCTTGTCTTTCAGTAATGATGGGCTTTTACTTGCATCTGGTGGTTTGGAT
GGAGTAGTTCGCATCTGGGATGCTTCTACGGGAAATCTCATACATGTACTAGATGGTCCT
GGAGGGGGGATTGAGTGGGTCAGATGGCATCCAAAAGGGCATTTAGTCTTGGCAGGAT
CAGAGGATTACAGTACTTGGATGTGGAATGCTGATCTTGGAAAGTGCCTTTCAGTATATA
CTGGTCATTGCGAATCTGTCACATGTGGCGATTTTACACCGGATGGAAAGGCTATCTGTA
CTGGGTCTGCAGATGGTTCCTTGCGGGTATGGAATCCACAGACACAAGAAAGTAAACTC
ACTGTAAAAGGGTATCCATACCACACAGAGGGTCTAACATGCTTGAGTATTAGTTCAGAT
TCCACATTGGTTGTTAGTGGCTCCACAGATGGCAGTGTTCACGTGGTTAACATAAAAAAT
GGAAAGGTGGTTGCTTCCTTAGTTGGTCATTCTGGATCAATAGAGTGTGTCAGGTTTTCT
CCTAGCTTGACTTGGGTTGCAACTGGTGGGATGGATAAAAAACTAATGATTTGGGAATTA
CAAAGCTCATCACTGCGATGCACTTGTCAGCATGAGGAAGGCGTGATGAGGTTATCATG
GTCATTGTCATCCCAGCACATAATAACTTCTTCCCTTGATGGAATTGTCCGCCTTTGGGA
TAGTCGGTCAGGGGTTTGTGAAAGAGTTTTCGAGGGTCACAATGATTCAATTCAGGATAT
GGTGGTGACAGTGGATCAACGGTTTATCCTTACAGGATCAGATGACACGACTGCGAAAG
TTTTTGAAATTGGGGCATTTTGATTTTATATAATGTTTCATTCAAAATTATTGCCATGTTTG
ATTCTGAGTCACAGCAGAAGATTAGAGGAGCTGAGACTGAGGTAATAAATTGCTTTCCAC
AAGTTAACATAGGTAACTATCGACTGAAGTGAACTGGGGGGCAGAAGCTAACTATGTTG
GTTTCTTTTTCTGTAAGGTTCTTTTAACAATAATAATGTTGTGTCTACAATTTTTTTGTAATT
ATCTTTTAGCAGTTGACATTAGTATTTGAGTAAACTTTAAAAAAAAAA
SEQ ID NO:206
CCGACGTCATTATAGAAACCCTTCATTTTTAAGTAGTTACAGAGTGATGGCTGCTATCATA
TCGTTGTTCGCAAATCGGGGCACGGTTAGGTACATGAGCATCAGTTCATTATCATAATCG
CACGCTTCCAAAGTTCATCTCTGATTTTCTTCTTCTTACACAGGAAGCAAAATCTGAGATA
ATTGTAAAGACCATACAAGCTTGAATTTGGATTTGCCACGATGCCTGTATTTAGGACTGC
ATTCAACGGCTATGCTGTCAAGTTCAGTCCATTTGTGGAAACTCGTTTGGCTGTGGCAAC
GGCCCAGAATTTTGGGATAATTGGGAATGGTCGGCAGCATGTGCTAGAACTTACACCCA
ATGGGATCGTTGAGGTCTGTGCTTTTGATTCATCTGATGGGCTTTATGATTGTACATGGT
CCGAAGCTAATGAAAATTTAGTTGTTTCAGCCAGTGGTGATGGCAGTGTTAAAATTTGGG
ATATAGCACTTCCACCTGTAGCAAATCCTATTCGAAGCTTGGAAGAGCATGCACGTGAG
GTTTACTCTGTTGATTGGAATCTTGTTAGAAAGGATTGTTTTTTAAGTGCGTCCTGGGATG
ATACTATCAGACTTTGGACGATTGACAGGCCTCAGTCCATGCGTTTGTTCAAAGAACACA
CTTATTGTATATATGCAGCAGTTTGGAATCCAAGACATGCAGATGTTTTCGCCTCTGCTTC
AGGTGATTCCACTGTGAGGATTTGGGATGTCAGAGAACCCAATGCAACCATCATAATTCC
AGCTCATGAACATGAGATTCTTTCTTGTGATTGGAACAAGTATAATGATTGTATGCTGGTG
ACGGGGTCTGTGGATAAACTAATTAAAGTATGGGATATTACGACCTACAGGACTCCAATG
ACAGTTTTGGAAGGACACACATATGCAATCCGGAGAGTTAAATTTTCACCTCACCAGGAA
AGCCTTATTGCATCATGTTCCTACGATATGACAACATGTATGTGGGATTATAGAGCTCCA
GAGGATGCTCTTCTAGCTCGATATGATCATCATACTGAGTTTGCTGTGGGAATTGATATA
AGTGTTCTTGTGGAGGGTCTGTTGGCAAGCACTGGATCGGATGAAACTGTCTATGTTTG
GCAGCATGGAATCGATCCTCGAGCTTGTTGAGACTGGGTGTACTAGCTTGGCCTTTCCC
TCTCAGATTACTGGTTTGGTCTTTTACTTCCTCTCCGAGGATGCAGGAACTGTTTTGCAT
CCATTTTAGATAGCCATTTACATTTTACTTATTATTGGACTTGTAAAGATTTTTGTACCCTT
GTAAATTGCAAATTTAATTTACATTTCTTTTGCATTAAAAAAAAAA
SEQ ID NO:207
GATTTAATGATCTTCCTCCTCCTTCTCATATCAGCTGATCAAAATTCATCAGAAGAGGAGA
GAAAAGAGCTATCGGCATTAAATTTTCAGACAAAGTCTTGGGATTTAGATTCGACTACTCT
TCTTCTGATACTAGCTGATCCAAATCTCAGCAGAAAAGGAGAGAAAAGAGCCGTTGGAAT
TAATTTTTTCAGACGAAGTCCTTGTGATTCTATTTTGCAGAGACTGAACAACAGTGTTACT
CAGTGCAATGGATTCCCGAAACACGCGATCGCGCTTGAATTTGCCTCCTGGAATGTCTC
CGAGCTCCTTGCACTTAGAAACAACAGCAGGCTCTCCTGGCCTCTCGAGAGTAAATTCC
AGTCCCAGTACCCCTTCCCCCAGTCGCACAACAACGTACAGTGACCGGTTCATTCCCAG
CAGAACAGGGTCCAGGCTCAACGGATTTGCACTTATCGACAAACAGCCTCAGCCTTTGC
CTTCGCCCACTCGCTCCGCTGCCGAGGGCCGTGATGATGCATCTTCATCCTCTGCCTCT
GCATACTCCACATTGCTGAGAAACGAGCTCTTTGGGGAAGATGTTGTTGGTCCAGCCAC
TCCTGCCACTCCGGAGAAGTCCACTGGACTCTATGGCGGTTCCAGGGATTCGATCAAGT
CACCTATGAGTCCAAGCAGGAATCTTTTCAGATTTAAGAATGATCATGGAGGGAACAGTC
CCGGTTCGCCTTACTCTGCCTCTACTGTTGGGAGTGAGGGGCTTTTCTCGTCCAATGTC
GGAACTCCACCCAAGCCGGCTCGGAAGATTACTCGTTCTCCTTACAAGGTTTTGGATGC
CCCTGCACTTCAAGATGATTTTTACTTGAATCTCGTGGATTGGTCATCCAATAATGTACTT
GCAGTAGGCTTAGGCACCTGTGTCTACCTATGGAGTGCATGCACCAGTAAGGTGACAAA
ATTGTGTGATCTAGGAGTCAATGATAGTGTCTGTTCAGTTGGATGGACACCACAGGGCA
CACATCTTGCTGTGGGTACTAATATTGGAGAGGTCCAGATCTGGGATACATCTCGCTGTA
AAAAAGTTCGGACCATGGGTGGTCACTGCACTAGAGCAGGAGCACTAGCTTGGAGTTCT
TACATCTTATCATCTGGTAGCAGAGACCGGAATATTCTTCATCGTGATATTCGTGTTCAG
GATGACTTCATAAGAAAGCTTGTTGGGCATAAGTCAGAGGTCTGTGGATTGAAGTGGTCT
TATGATGATCGAGAGCTGGCATCGGGTGGAAATGATAATCAGCTCCTAGTGTGGAATCA
ACAATCGGCACAACCCTTGTTGAGATTCAATGAGCATACTGCTGCTGTTAAGGCCATTGC
ATGGTCACCCCATCAACATGGAATTCTTGCATCTGGAGGTGGGACAGCAGACCGGTGTC
TTCGCTTCTGGAACACAGCAACAGATACACGTTTGAATTGTGTAGACACTGGTAGTCAGG
TTTGTAATCTTGTATGGTGCAAAAATGTCAATGAACTAGTTAGCACGCATGGATATTCTCA
GAACCAGATAATGGTCTGGCGATATCCATCAATGTCAAAGCTGGCTACTCTAACAGGCC
ATACACTTAGGGTTCTCTATCTTGCCATCTCACCTGATGGGCAGACGATTGTCACAGGTG
CAGGGGATGAAACACTGAGATTTTGGAGTATTTTTCCATCTCCAAAGTCACAGAGCGCC
GTTCATGATTCTGGATTATGGTCTCTAGGACGAACTCATATTCGATGATATTTTTATTAGA
GGAAACATATCTTTTCCCTCCCATGAGAAAATATTTCTTGTCCAAATCCATTTTATGCTGC
TAGGTAATCCAACTGGGAACCATAAATCATAGGAATCCCTGTGAGGTGAAGGCAGGAAT
AAATTATGTCCTGAACTTATGTGTCTTCCAACAAAAAACCATAAATCGAAGCATTATTTGT
AAGATCCCCTGTGAGTTGAAGGCAAGAGTAAATTATGTCCTGAAATTATTATTCTTTGTAC
CAAAAGTTTTCAATACAGCTTGAGGTTTTACTGTCTGTTAAATATGAAGAGCGAAGCCAC
AATAGGATTTTCCCTCATTTTTGTTTGGCAACATCAAGTCACACAATCTTTCGAGAATTTA
GGCAAGCTTCATCAAAGTTGATTTGTTTGATAGTTGTGTATTCTTTGGTTTCCAAATATTTT
TTGGGTTTTCTTGTAGGCAGAAGTGAGGCTGGTTGGAAATGTGATCTCTTTAAAGGTTGT
TTCTTTTCGATGGTTAGCTATGGGTTACTGATTTCAGATTGTAATAGTTTATTGCTGAGTG
AACATCTGCATCTCATTTTTCATGCTTGTCTTTATTCAGCTATTCTGCTGCAATTGCTGAA
ATATTTCAAGCTGAAAGTTATGATTCTGGCCAAGAAGTCTACTGAAAATTTGTATTGTGGA
AATATATATATACATCAGAGTCAGTTATTTAAAAAAAAAAA
SEQ ID NO:208
GGTTTTCAGCCCAAACCGTCAAATAATAAATCCATCGATACATCATCCATCCATCCTGATT
CGGCTTCGGCTTCGACTCCCTTTTGTACCCTTTTCCCACCCGGCCATTCATTCACATTTT
CACCCGTTTCTACGCACTGTCTAGCCGCCTTTTAATTTCCATTTAATTAAAAATCCGCTGC
AAACTCTCAGTTCCAGGTATATCTGTACGTTGGCATCTCAAGCGCCGAAAATTGTACAGT
TTGAAAGAGGGCAGGGAAGCCCTTCCCTAAATAGAAATTTATTCTCTGGTAGAGAAGTAT
AGTTTTTGGTGTGCATTTTTTTCTTCTTTTAATTCTTATTTAGAAGAGGGTAAAAGGTGGC
AGTATGGAGAAGAAGAAGGTGGTGGTGCCAATCGTCTGCCATGGACATTCAAGGCCTAT
TGTGGATTTGTTTTACAGCCCCGTCACCCCCGACGGCCTCTTCCTTATCAGCGCTAGCA
AAGATTCTAGTACAATGTTGAGGAATGGTGAAACAGGGGACTGGATCGGGACATTTGAA
GGGCATAAGGGTGCTGTGTGGAGTTGTTGTCTTGATAATCGTGCTTTACGTGCAGCATC
AGGATCAGCTGATTTTAGCGCCAAAATATGGGATGCGTTGACAGGAGATGAGTTGCACT
GTTTTGTGCACAAGCACATTGTTCGAGCATGTGCTTTTTCTGAGAGCACAAGCCTTTTAC
TCACTGGAGGACATGAGAAAATACTCCGTATTTTTGATTTGAACCGTCCTGATGCACCTC
CAAAAGAAGTTGATAATTCTCCTGGTTCAATCAGGACGGTAGCATGGCTTCACAGTGACC
AGACTATATTGAGTTCTAATTCTGATGCCGGAGGTGTGAGGCTGTGGGATTTGAGGACA
GAAAAGATTGTTCGTGTTCTGGAGACAAAATCACCAGTCACAAGTGCTGAAGTGAGTCAA
GATGGACGGTACATTACAACTGCTGATGGCAATAGTGTAAAATTCTGGGATGCTAACCAC
TTTGGAATGGTGAAAAGCTACACAATGCCATGCATGGTGGAGTCTGCTTCATTAGAACCG
ACTATGGGCAACATGTTTGTGGCTGGTGGAGAGGATATGTGGGTTCGTCTTTTTGATTTT
CATACCGGAGAAGAAATTGCTTGTAACAAGGGACATCATGGCCCGGTTCATTGTGTGCG
TTTTGCTCCTGGTGGTGAATCATATTCATCTGGATCTGAAGATGGAACAATCAGAATATG
GCAGACCTTGAATATGAATTCTGAAGAGAATGAAAGTTATGGTGTAAATGGATTGAGTGG
CAAGGTCAGAGTGGGTGTTGATGATGTGGTACAGAAGGTTGAAGGGATTCCAAATTACAG
CAGATGGCCACTTGAATGACAAACCAGAGAAGCCAAATCCATAATAATGCAGTAGTGTAT
ATCCATTGCTGGGAAGCCATTGATCTGTTTGTTGAAGTTTGCTTGCCTGAGGTGTACAAT
TTTTTGTCGCTGATCAAGTGGCAAATCAAACTGCACACAGGATCCTGCTTACTGTACATA
GTAGATTATGAGGTCATATCACGTGGTTCTTTTACATCTGCAGGTGGTGCTCCTTTTTTTC
AAGCTCCTGGAGATTGGTATGGTTGAAGGATGGCAGTAAGGGTCCAGTATTAATTAGATT
ACCTTAATGATTCAGTTCTTTATGTTATCACTGTCTTGACTTTTGGATTAGATGTTTAACTA
GGAAGGTAGTAAGGTAACTAAAATGAATGTTGTAAAAAAAAGGAAGAAAATGAAAAAACA
TAAGTTTGGCCCAGATTCGGTTTATCATAAAATCTGGCTGCATATAAGGTGTCAGTTCAA
ACACAGGCAGGTCAGAATAAATAAATTTTATTTATATAGTTTTTGTTAAAAAAAAAA
SEQ ID NO:209
CTAAGGGATACATTAAAGGCTACTAATCGCCGACAGCATTGACCAGATAGACTCTAAAAC
ATTCCTTCATTTTGTTCTGCTTCGTAAACCACCGTCCATGACTCCAGGGATTTATGGCAC
AGACGTGTTGGTGGCTATGCAAAATTTTTAGTCCGGTATGTACTGTGGTCGGATTGAGCA
AGAATTTCTGGATATTTAGGGTTTATGAATTCGTGAAGCAGAGCGAATCAACATAACCAG
GCGACTGGTATTTAGAATAATATAAAACAAGGGTAGTGTTTCAATGGAAAGGTATTCCCA
AGGCACACAGAAAAAATCAGAAATCTACACATATGAAGCTCCCTGGCAAATATATGGCAT
GAACTGGTCTGTGAGGAAAGATAAGAAATTTCGTCTTCGGATTGGTAGCTTCCTGGAGG
AGTATAACAATAGAGTAGAAATCATTGAGCTGGATGAGGAATCTGGGGAATTCAAGAGTG
ATCCAAGGCTTGCTTTTGACCATCCATATCCCACTACCAAGATAATGTTTGTTCCTGATAA
AGAATGCCAGAGGCCTGACCTTCTGGCCACTACTGGCGACTATCTGAGAATCTGGCAAG
TTTGCGAGGACCGAGTAGAGCCCAAAAGCCTCCTCAATAATAATAAGAACAGTGAATTCT
GTGCACCCTTGACTTCTTTTGACTGGAATGATGCCGATCCCAAGCGAATTGGGACATCG
AGCATCGATACCACTTGTACTATCTGGGACATAGAAAAGGAAGTGGTGGATACACAGCT
GATTGCCCATGATAAGGAGGTCTATGATATTGCCTGGGGGGAGGTAGGGGTTTTTGCTT
CTGTATCTGCAGATGGGTCTGTGAGAGTGTTTGACTTGAGGGATAAGGAGCATTCTACTA
TTATATATGAGAGCTCACAGCCGGAGACCCCATTGCTTCGATTAGGTTGGAACAAGCAA
GATCCGAGGTTTATTGCTACTATTTTGATGGATAGTTGTAAGGTGGTTATCTTGGATATTA
GGTTTCCAACCTTGCCTGTTGCAGACCTCCAGAGGCATCAGGCTAGTGTGAATACTATT
GCATGGGCTCCTCACAGCCCTTGCCATATTTGCACTGCAGGAGATGATTCTCAGGCACT
CATATGGGAATTATCCTCCGTTAGTCAACCACTGGTGGAGGGTGGTGGTCTAGATCCAA
TACTAGCTTACACCGCAGCTGCTGAAATTAATCAATTGCAGTGGTCTTCCATGCAGCCTG
ACTGGGTTGCAATTGCGTTTTCTAACGAAGTACAGATACTAAGGGTTTGAAATTTATTGCT
TTGTAGTTTTTCATTCAAATGTTCTAGAATTTGTCTAAGCTAGCTACTGGTGTTTAACTGAT
ATGGAAAACTTTTGCCATTCTCTTATCTGGATAGTTCCATAATGGTAAAGTACATTATGCA
AAAAAAAAA
SEQ ID NO:210
CCAAACCAACTTTCTTCTTTTCTTTGTTCTCTACCTGTTGAAGAGGATGACCCAGGCACT
GTCAATTGAACGGATTTTGGGCTTTGTCAAATCAGAACTCAAGTCTGCTCGTCCAACAAG
TACGATATAAAAGGGTTGTTCTATTCAATTAGATTCCAGTATTGGCCAGACCATCTACTCT
CTTTCACCTACCTTTTGTTCCCATGAACATCCCTCTTTCTCATCTCCATTCTCATCATTTAG
TCTGAAAACTTCATTTTCATCACACCCACATGCTAATTTACTTTTTTGGACCATACCCACTT
CATCTTCTGTCCTATGTCATTTAAGACGTCACTCAAATGTCTTTCAAGAGCATACCGGCAT
ATTATATCCGTCCTTCTATAAGGTCTCTGGTTTGATTATTTCTTGGAGATGTCGTATATCT
GGTTTCTAGCTAGAGGAGTTTAGGGTTCCCCTGTAACTCTCTAGGTACTCGCTTTCTGTA
GGTGGGTTTGAGGTCTACACATATTTCAATCATTGCGGATTCTCCTGCTTAAGATTTTGAT
CATAATAATTTTGGTGCCAAACACTCTCAAGATCAGTTTACCGTGGCGTTCATAATAAAAG
CCTATTAGTTCACCTCTTCCGTCCAAGGGATAAGACAAACCCTAATCATATCGGGTGTCG
GCTCATCCGGAAGTATGCAATCGGAAAATAATTTGGATGAGTCTTTGCACCTAAGGGAG
GTTCAAGAGTTGCAGGGGCATACTGACACTGTGTGGGCGGTGGCATGGAATCCTGTCA
CTGGAATTGATGGAGCTCCATCTATGCTGGCCTCTTGCAGTGGTGACAAGACTGTTAGG
ATTTGGGAGAATACACACACCCTCAATTCAACATCTCCCTCGTGGGCGTGTAAGGCTGTT
CTGGAAGAAACACATACAAGAACTGTTAGATCATGTGCATGGTCACCTAATGGGAAGTTA
CTTGCAACAGCAAGTTTTGATGCAACTACTGCAATCTGGGAAAATGTTGGTGGTGAATTT
GAATGCATTGCCTCCTTAGAGGGCCATGAGAATGAAGTCAAAAGTGTTTCTTGGAGCGC
GTCTGGTATGCTACTTGCCACATGTGGTCGTGATAAGTCTGTCTGGATTTGGGACGTTCA
ACCGGGGAATGAGTTTGAGTGTGTTTCAGTACTGCAAGGCCATACACAGGATGTCAAGA
TGGTCCAGTGGCACCCTAATCGTGACATTTTGGTCTCTGCTAGTTATGATAATTCTATCAA
GGTTTGGGCAGAGGATGGAGATGGTGATGACTGGGCATGCATGCAAACATTGGGCAATT
CTGTCAGTGGTCATACATCAACAGTGTGCGCAGTATCTTTCAATTCTTCCGGAGACAGAA
TGGTCTCCTGCAGCGACGACTTAACGTTGATGGTTTGGGATACAAGCATAAACCCAGCG
GAGAGAAGTGGAAATGCTGGGCCCTGGAAACATCTTTGCACCATTTCTGGATATCATGAT
CGAACAATATTCTCAGTTCACTGGTCAAGGAGCGGTCTTATTGCTAGTCGAGCGTCCGA
TGATTGTATCAGGCTCTTTAGTGAGAGCACTGATGATTCTGTGACACCGGTTGATGGCAC
ATCGTATAAGTTGATTTTAAAAAAGGAAAAGGCTCACTCAATGGATGTGAATTCAGTCCAA
TGGCATCCCTCAGAGCCTCAGCTACTGGCTTCAGCAAGTGATGATGGTCGTATTAAGAT
ATGGGAAGTTACTCGGATAAATGGACTAGCAAACAGCCATTGACATAGTATCATTCCACA
TGACATCTAAAGGATGGGAGATTGGTATATCTTACCTGCAGGGGAAGCAAATTTCTTTAT
TGTAGCTTCTTGTTGACTCTCAGCTTCTCTGTGGGTTTGCTTTTGGGGAGTACTTTAGTC
AATAAAAGTGAAGTGAATCATGATATAAAGGGTTTAAGTAAAGAGTTGAGAATAAGCAGA
TTTTAGTGAAATACAACTGGTATGGCATTTATTGTGGCTGATTTTCCCTGTATGGCCTCTG
AAGCAGATTTCATGACTAAATTTTCATTAGGGGTTTCTGTAGTTTAACATAAATCAACATC
AAGTCTGATTTGGAATTTTTGTTTTGTTCGTTAAAATTAATTTGATTTCTCTGGTATTGTAA
AAAAAAAAAAAAAAAA
SEQ ID NO:211
TTTTTTTGAACTAGCCACGACACAGAAATCTTCCGCTTCGTGTACGTCATAAAGCGGT
TCGGAAGGCCGAACCAGACGCAGATGCAGAGTGGAAGGCGATTCCAGGATTGATTCAA
GTCACAGAGAAGAAAACCGAGGAGCAGGCGAGGCAGGTTTTTTTGAATGGTAATGGGC
GGTTTTTGATGTCAGATTTGGTGCTGTGAAAACATAAGAGCCGGCATTGTGCATTTCATA
GTTTTTGGTGGTGGGGATTGGACTGAGAAGAGTTATGCGAAACCAGCGATATAGAAGTC
CAACACTGATTTCAGAGCTTTTCTCCGGTCACCAAAACCATGAAGCGGGCTTACAAATTG
CAGGAGTTTGTTGCGCATGCTTCCAATGTCAACTGTCTCAAGATTGGGAAGAAGTCTTCC
AGAGTTCTGGTGACGGGCGGGGAAGACCACAAAGTGAATATGTGGGCTATTGGAAAAC
CGAATGCCATTCTGAGTTTATCTGGTCATTCAAGTCCTGTGGAGTCTGTGACTTTTGATT
CTGCAGAAGCTTTAGTTGTCGCTGGAGCTGCTAGTGGTACAATAAAGCTATGGGATTTG
GAACAAGCAAAAATTGTTCGGACACTCACTGGTCATACGTCCAATTGTATATCAGTGGAT
TTCCATCCATTTGGGGAATTTTTCGCATCTGGCTCCTTGGATACAAACCTAAAATCTGG
GATATTAGACGTAAGGGTTGCATTCACACTTACAAGGGGCACACTCGTGGTGTTAATTCA
ATCAGATTTAGTCCAGATGGTCGTTGGGTGGTGTCAGGTGGGGAGGACAATATTGTAAA
GTTATGGGATCTAACTGCTGGAAAGCTCATGCACGACTTCAAATGCCATGAGGGTCAGA
TACAGTGCATGGATTTCCATCCTCAAGAGTTTCTTCTTGCTACAGGCTCAGCAGACAGGA
CTGTGAAATTCTGGGACCTTGAGACTTTTGAGCTTATTGGATCAGCTGGTCCTGAGACAA
CTGGAGTTCGTGCCATGATTTTCAATCCGGATGGAAGGACTCTGTTAACTGGATTGCATG
AAAGTTTGAAGGTGTTTTCCTGGGAACCTTTGAGATGCTATGATGCAGTCGATGTTGGTT
GGTCTAAATTGGCTGACCTCAACATACACGAAGGAAAGCTTCTTGGTTGTTCATACAATC
AAAGTTGTGTTGGTGTATGGGTTGTGGACATTTCGCGGGTGGGGCCATATGCTGCTGGA
AATGTATCAAGAACAAATGGCCATAATGAAGCAAAATTGGCTTCCAGTGGTCATCCATCT
GTCCAGCAATTAGATAATAACTTAAAGACCAATATGGCGAGGCTTTCCTTGTCACACAGT
ACAGAGTCAGGAATCAAGGAACCAAAGACTACCACATCGTTAACTACCACTGAAGGTCTT
TCTAGCACACCTCAACGAGCTGGAATAGCCTTTTCTTCAAAGAATCTTCCTGCAAGTTCA
GGTCCACCGTCATATGTCTCGACTCCAAAGAAAAATAGTACATCAAGAGTGCAGCCTACA
ACAAATTTTCAAACCTTAAGTAGACCTGATATAGTGCCTGTCATTGTCCCTAGAAGCAATT
CATTAAGACCGGAAACAACATCAGATGTTAAGAAAGAAATGAACAATTTTGGAAGAGTGG
TTCCATCTACAGTATCAACCAAATCAACTGATGTGATTAAATCTGGCAGCAACAGAGATG
AATCTGACAAGATAGACTCCATAAATCAGAAGCGCATGACAGGCAATGACAAAACAGAC
CTAAACATTGCTAGGGCTGAGCAACACGTTTCCTCTAGACTTGACAATACAAACACTAGT
TCTGTTGTTTGTGATGGAAATCAACCAGCGGCAAGATGGATTGGTGCAGCCAAATTCAG
AAGAAATTCACCAGTAGATCCAGTTGTAAGCCCACATGATAGAAGTCCTACTTTTCCATG
GTCTGCAACTGATGATGGAGTTACATGTCAGCCAGATCGACAAGTTACTGCACCTGAATT
ATCAAAAAGAGTGGTAGAGCCTGGTCGTGCTCGTGCTCTGGTTGCAAGTTGGGAAACAC
GAGAGAAGGCTCTCACCGCAGACACACCTGTGCTGGTCAGTGGTCGCCCCCCCACAAG
TCCTGGAGTGGACATGAACTCATTCATCCCGAGAGGAAGCCATGGGACTTCGGAAAGTG
ACTTGACAGTTAGTGATGACAACAGTGCTATAGAAGAGCTCATGCAACAGCACAATGCAT
TTACAAGCATTCTTCAAGCTCGCTTGACTAAGTTGCAGGTAATAAGGAGATTTTGGCAAA
GGAATGACTTGAAAGGTGCTATTGATGCTACGGGAAAGATGGGAGATCATTCGGTATCT
GCTGACGTTATTAGCGTACTGATTGAGAGAAGTGAAATCTTCACGCTGGATATTTGCACA
GTCATACTTCCATTGCTTACTCGGTTGCTTCAGAGCGAGACTGACAGGCACGTTACTGTC
GCTATGGAAACCCTGCTTGTGCTAGTGAAGACTTTTGGTGATGTTATCCGGGCAACTATA
TCAGCAACCCCAACAATTGGAGTTGATCTCCAAGCAGAGCAAAGGCTTGAACGCTGTAA
TCTGTGTTATGTTGAACTGGAAAACATCAAACAGATTCTTGTTCCTTTAATCAGGAGAGGT
GGAGCTGTTGCCAAGTCTGCACAAGAGTTGAGTTTAGCTCTTCAAGAAGTGTGACCCAC
TCTATTTAGGTTGACATTTTTTTTTTGTAATAAATCTTCTTTGACGGGAGGCTGTTTGTATA
TTGGGAAACACTGCATTGTTTGCAATGATAGTGGCAGTGTGGTTGAATATGCACGCCATC
AATTGCAGTGGAATTCTTATCTCAGGTCATTGGAGTGTTTAACCATTCCTATGTTGTCAGC
CACATCTTGTCAGCCACACCTTGATTCTTAGATTAAGAAGTTACTAATTTGTAGATAAATT
CTAACGAAGGTGATGATAGCATACACGTAATGAAATCACAGAGAATGTCTACGCAGTTGA
CACTGCAACTAGTCTTGCGTGAACCTGTAGTTGCGGCATGCTTCAACCTTTGTTGTAATA
TTTAAAATATTTGAGTGGCTTAAGCACTCAATTTCCTGTTTGATCAAGTCAGTTGCTAGAT
TCTTCAGTTGCTTTTCTCCGTTCATACAATCAGCAATTGGATATTGATGAATTTCATTACA
GAGTTTGATAATCTGTGAGCCATCATATATTAATTGTACATTCATAAGCA
SEQ ID NO:212
GTTGATTCTGAACTGAAGGTATACAAAGGCGAAGAATAACAAAGTAGGAGCGCCGTCGG
TGCTCCGTCAAGCCGGCGGTCATGTCCACTCTCGAAATCGAAGCTCGCGATGTTATAAA
AATTGTGCTACAATTTTGCAAAGAGAATTCTTTGCACCAAACATTTCAAACACTGCAAAAC
GAATGCCAGGTTTCTCTTAATACTGTGGACAGCCTTGAAACATTTGTAGCAGATATCAAT
AGCGGAAGATGGGATGTGATACTGCCACAAGTTGCACAGCTTAAGCTTCCTAGAAAGAA
GCTAGAAGACTTGTATGAACAGATTGTATTGGAGATGATTGAACTCCGAGAATTAGATAC
TGCAAGAGCTATTCTAAGGCAGACTCAAGCAATGGGTTTCATGAAGCAGGAGCAACCTG
AGCGCTATCTACGCCTGGAACACCTTCTAGTTCGAACATATTTTGATCCTCGTGAGGCTT
ATCACGAGTCATCAAAAGAGAAGAGGCGGTCACAAATAGCTCAAGCTTTGGCATCCGAA
GTAACTGTTGTGCCGCCTTCTCGATTAATGGCCTTGATTGGTCAATCTTTAAAGTGGCAA
CAACATCAAGGATTACTCCCTCCAGGTACACAGTTTGACCTCTTCAGGGGCACTGCAGC
AGTGAAGGCAGATGAGGAAGAAATGTATCCAACAACACTGGCACATACAATTAAATTTGG
TAAACAAACCCACCCAGAATGTGCTCGCTTCTCTCCAGATGGTCAATACCTTGTCTCTTG
CTCAGTTGATGGATTCATAGAGGTCTGGGACTACATCAGTGGGAAACTTAAGAAGGATCT
TCAGTATCAAGCTGATGATTCCTTTATGATGCATGATGATGCTGTCCTCTGTGTTGACTTC
AGTCGAGATTCAGAGATGCTGGCTTCTGGTTCACAGGATGGGAAAATTAAGGTATGGCG
TATACGAACAGGTCAGTGTTTAAGGCGTCTAGAGCGTGCACATTCTCAGGGTGTTACAA
GCCTTTCCTTCTCTCGTGATGGCAGTCAGCTTTTAAGCACATCATTTGACAGTACTGCAA
GAATACATGGGCTCAAATCTGGGAAAGCATTAAAGGAATTTCGTGGTCATACATCTTATG
TAAATGATGCAATATTCACAAGTGATGGGGGTCGTGTTATTACTGCTTCTAGCGATTGTA
CTGTAAAGGTTTGGGATGTGAAGACAACTGATTGTATTCAAACATTTAAACCTCCACCTC
CTCTCAAGGGAGGTGATGTATCAGTGAATTCTGTCCACCTTTTTCCAAAGAACTCAGAGC
ACATTGTAGTCTGCAACAAGGCATCATCAATCTATATCATGACACTCCAAGGGCAGGTTG
TCAAAAGTTTCTCATCTGGTAAAAGGGAAGGAGGAGATTTTGTTGCTGCATCCATTTCTC
CAAAAGGAGAGTGGATCTATTGCGTTGGGGAAGACAGAAATATCTACTGCTTCAGTCAA
CAGTCTGGGAAGCTTGAACATCTTATGAAGGCACATGACAAGGACATAATTGGTGTAACA
CCTCATCCACATAGGAATTTGTTGGTAACATATAGTGAAGATAGCACAATGAAGATATGG
AAACCCTAATATGGATTTTATACTGCTATTGGATTTGTATAGAAATAATATTTTTTTGAATT
TTGATGGTAGCGTATGGTTGAAGGAAAACTTGGATATATCATGTAAACATTTTTCCCCCAT
AAAGGAATGTATATTTTTTTTATTTACTGCACTTTATATTTCTCTGACCCTCTCTCACACAC
ATGCATTGGCACTACACAATTACACAGAAATATAAGCACTCTGCTTTTACATATGTTTAAA
AAAAAAA
SEQ ID NO:213
GCAACATAGTACTCGATGCCGTCGTTTTGGTTCCACACTGACCAACGAGCTAAGATTGAA
GTTCCGAAGTACGATTCTTATTTCACATATTCTTGCCGTCTTCAGGTTCAGCTCGCAAAG
GGTACCGAGGAGCTCCCAGATGCAGAATTGGAGAACACTTATCTATCTATCCGACTAAC
AGAGAAGTGGTATCTCTCGCTCTAAGCTTCCATCCTGAATTATCTCCGTTTCGGCTCTCC
CACCGAACTAGCGCTACGGGCTTGTATTTTAGTAAGTAATACCGAAAAGCCGCCGAAAG
GGTTTTGAATACGTAGAGAAGAGGAAGGGAGGGATAGAGACAGAGGTGCTCGCACCAG
GAAGGGAGGGATAGAGACAGGGAGGGATAGAGACAGATACAGATGGATATTGAACTGG
AGGATCAACCCTTTGATTTGGATTTCCATCCTTCTGCTCCTATTGTTCCAGTGGCACTTAT
TACAGGCCGCCTCCAGTTGTTTCGTTATGTAGACATTTCATCAGAGCCCGAAAGATTATG
GACTGTGACTGCTCATACTGAGTCCTGTCGAGCTGCTCGATTTATTAATGCTGGTAGTTC
TGTGTTAACGGCGTCTCCAGACTGCTCAATTCTTGCCACAAATGTTGAAACTGGACAGCC
AGTAGCCAGACTTGACAATGCTCACGGGGCTGCAATAAATTGCTTAACCAATTTAACAGA
GTCAACTATTGCATCTGGAGATGAGAATGGAATCATTAAGGTTTGGGATACACGACAGAA
TTCCTGTTGCAACAAGTTTAAGGCTCATGAAGACTATATTTCGGATATGGAGTTTGTTCCT
GATACTATGCAGCTACTTGGAACAAGTGGTGATGGGACCCTTTCTGTTTGCAATCTTCGT
AAAAATAAGGTCCATGCCCGATCTGAATTTTCAGAAGATGAACTGCTTTCAGTGGCCTTG
ATGAAGAATGGGAAGAAGGTAGTATGTGGTTCACAGGAAGGAGTTCTGTTGCTTTATTCA
TGGGGTTATTTTAAGGACTGCAGCGACCGCTTTGTTGGACATCCACACTCAGTTGATGCT
TTGCTAAAGTTAGACGAAGACACAGTGTTAACTGGATCAAGTGATGGAATCATCAGGGTT
GTCTCCATTCTTCCCAACAAAATGATTGGTGTTATAGGAGAGCACAGCAGTTATCCTATT
GAGCGCCTTGCATTTTCACATGATCGGAATGTTCTTGGCAGTGCTTCACATGATCAGATC
CTAAAGCTCTGGGACATACATTATCTTCATGAAGATGATGAGCCTGAAACTAACAAGCAA
GAAGCTGTGAATGACGAAAACGTAGACATGGATTTGGATGTTGATACTGAAAAAAGACCA
AGAGGATCCAAACGAAAGAAGAGAGCAGAGAAGGGCCAAACTTCATCTCAAAAGCAATC
ATCTGATTTTTTTGCGGATATATAACATTGACAGTGTTCAAGCTTATGTATTCCTATTATTT
AAAATTTTTATTCTCACGGGTGACCTGATTTCTAAAACATTGATCCCTTGTTTTCAATTATA
TGTTTGTAATATGTAATATTTTAACGATATCTTCCTCCCAATTTTGCAATAGTGAGAGTGG
GTATTCCTTTATATATACAGGGGCCAAATGCTTTTGCAAATGCCCTTCTGATCCATCGATT
CTAAATGTATGAATTTTTCCCACCGTATACTTTTTAATGAACCGCTTTTTCCTTGAGAGGC
TATGAATGCCTGTAGAACTAATCCTTTAAGTAATTGTATATATTGAATGAGCAGTATATTTT
CTTTTTAAAAAAAAAA
SEQ ID NO:214
GCGATACGTAGGGTCATGGCAACTAATGAAATCTACAGGAAAAGCCACCAGAAAACCAC
GTCCCTTCGCCACTAAAACTCCCATGAAAACAATAAACGCAGAATTCCCATTACTCATAA
AAACCCAACACACATCACCAAGAAATTTGTATCAGGGACGCAATTCCGAAGGGCTCCAG
GGGTTTCAGGGCCGGATCCTCTCTAAATATGGATAGGATCCAGCAAATCCCCCACACCT
GCGTAGCTCGAAAGATTAATTTGCCCCTGGGAATGTCCAAAGAGTCTCTAGCTCTGAAC
CTCCCTGCTAATTTGGCGCCCACTATGTCTCCGCCGAGTATTACTTACAGCGACAGGTTC
ATCCCCAGCAGGAAGGCCTCCAATTTCGAGGAATTCGCACTGCCGGATAAAACGTCGCC
TTCGCCGAATTCGGCTGGGGGTCAGTCCTCGTCCACCAATGGCGAAGGGCGCGATGAT
GCCTGTGCGGCTTATTCGGCTCTGCTGAGGACCGAGCTCTTTCCAGCTACCCCCGATAA
GACCGAGGGTTGCAGGAGGCCCGTGATTGGGAGCCCCAGTGGTAATGTGTTCAGGTTC
AAGTCCCAGCAGTGTAAATCCCAGAGCCCTTTTTCTTTATGTCCTGTTGGGGAAGATGGG
GATCTTAGCGAAACAGGGGCGGTTGCCAGGAAGACTACGAGAAAAATCCCTCGTTCCCC
TTTCAAGGTCTTGGATGCTCCTGCTCTTCAAGATGATTTTTATCTAAATCTTGTGGATTGG
TCATCTCATAACATACTTGCAGTGGGGTTAAGTGCATGTGTTTACCTGTGGAGCGCTAGT
AGTAGTAAGGTGACAAAACTATGCGATTTAGGTTTGGACGACAATGTCTGTTCTGTTGCG
TGGACTCAGCGGGGGACATATCTTGCTGTTGGAACAAACAATGGTGGTGTTCAGATTTG
GGATGCGGCTCATTGTAAGCAGGTTCGGACCATGGAAGGTCATTGTACCAGAGTAGGGA
CTTTAGCTTGGAATTCTCACATATTGTCATCTGGTGGTAGAGATAGAAACATTCTTCAGC
GTGACATTCGAGCTCAAGATGACTTTGTCAGTAAATTTTCTGGCCACAAGTCAGAGGTTT
GTGGGCTGAAGTGGTCCTATGACAACCGGGAACTTGCATCGGGTGGAAATGATAACCAG
CTTTTTGTATGGAACCAACAATCTCAACAACCAGTGCTGAAATATAATGAGCACACAGCT
GCTGTAAAGGCCATTGCCTGGTCTCCCCATCAGCATGGTCTTCTTGCTTCTGGAGGTGG
AACAGCAGATCGCTGCATTCGTTTCTGGAACACAGCAACCAATACTTCCTTAAATTGTGT
GGATACTGGTAGTCAGGTTTGTAACCTTGTATGGTCTAAGAATGTCAATGAGCTTGTGAG
CACCCATGGTTATTCTCAGAACCAAATTATTGTGTGGCGATACCCAACTATGTCGAAACT
TGCGACTCTAACAGGCCACACCCTCAGAGTGCTTTATCTTGCAATTTCTCCTGATGGGCA
GACTATTGTAACGGGTGCAGGTGATGAAACATTGCGCTTTTGGAATGTTTTTCCATCTTC
CAAAACACAGCAGAACACTATTCGTGACATGGGAGTTTGGTCTTCGGGTCGAACTCATAT
TAGGTAACATCAAAACTTCCTCTTCTTAGAGGAAGATGTGAAGTGGCATTGAGCAGTTGT
TGGGGAATTTGGTGGGAAACCCCATACAAGTTGACTGTATTAAGTCTCAGCTGAACAAG
GATCTGGCGTGAGCAGAAGTTGTCAACAGGTTATCTACCTGTTGTAGCTTTTAAGTAGTG
CTTTCTTAAGAGACAACCAGTTGAGGCAGTGGAACAAACTACAGGCTGTGCAGATGCTT
CCCCTTGCCTGGCTGGACAAATGTATTGGTAAAAATGCTAGATCATCTTAGAAAATAGGC
ACGCTTAGTCAAACTCATCCTTGGTGGGGATGACTTTTTCAAAGGTTTGAAAGTGTCTTG
TATCTTTGTTAGGTTCAATGAATTAATTATTTTCTCTAACACTATATTTTCTGGTATGACCG
CTCTACATTGTATATTAACCCTTCCAAATGAGATCACATTTGCAGTCCAAACAATATCCTC
TGTCCATAAGGTTGTACTTGTACTTGAACATACTTTTTGTAGAACATACTACCAAGAGAAA
TATTCAGCAAATATCATTTGAGAGTTTTCCCTGCAAGTTTGGGGATTTCTCAATCAAAAAA
AAAA
SEQ ID NO:215
AATACGGCATACACTTACGAGCATATAGCCATTCTCGCTTCCCGGGGTGTTGGGGCTAT
ATCAAATTCTCAAGCGCAATTGACTGGCTTTCAAAATCATGAATTTGACGTTTCCCTATCG
CGATTGGAGTCGCTGAAATCTTCTGGCTCTTTGAATACCCTTAGACATTAGACGTTTAATA
CACATCGTGAAAGGATCCTAGCCACTTTAAGAATTCGCAGGGTTCCAAGGAGTTCTGGA
TATTCAACCGTGCCAATAAAAAATGGCAGGAGGCCAAGGTGAAGGGGAGGAGAAAGTA
GATAAATTATCGATGGAACTCACAGAAGATGTCATGAAAAGCATGGAGATTGGTGCTGTC
TTTAAAGATTATAATGGGAAAATCAATTCGTTGGATTTTCACAGAACGAACAATTATTTGG
TTACGGCTAGTGATGATGAGGCTATTCGGCTTTTCGACACTGCAAGTGCAACATGGCAA
AAGACCAGTTATAGTAAAAAGTATGGTGTTGACTTGATTTGCTTCACCAACCATCAAACAT
CAGTACTGTACTCTTCGAAAAATGGCTGGGATGAATCATTGCGGCATCTTTCCCTCATGG
ACAATAAATATTTGCGTTATTTCAAAGGTCACCATGATAGGGTGGTTTCCCTTTGTATGTC
ACCAAAGGGTGAGTGCTTCATGTCAGGTTCCTTGGATCGCACTGTGCTTCTTTGGGATCT
ACGAATTGATAAGTGTCAGGGTTTGATACGAGTACGAGGACGACCTGCTGTGGCATATG
ATGAACAAGGGCTTGTCTTTGCTATTTCCAATGAAGGGGGTTTAATAAAAATGTTTGATG
CTCGCTTATATGATAAGGGACCATTTGATACCTTTGTTGTGGAAGGAGACAAATCCGAGG
CATCAGGGATAAAATTTAGCAATGATGGGAAGCTAATTCTTTTGTCTACCATGGACAGCA
ACATACATGTCTTGGATGCATATCAAGGGACAACGGTGCATAGTTTCAGTGTGGAAGCT
GTTCCCAACGGTGGTGAAGCTGTTCCCAATGGTGGAACTCTGGAGGCATCTTTCAGCCC
AGATGGCAAATTTGTGATTTCAGGTTCAGGGAATGGTAATATCCATGCTTGGAGTGTCAA
CTCTGGCAAAGAAGTTGCTTGTTGGACGACTGAAGGAGTCATTCCTGCAGTTGTAAAATG
GGCCCCAAGACGCTTAATGTTTGCTAGTGGATCATCTGTGCTGTCATTATGGGTACCAGA
TTTATCAAAGTTGGCTTCTCTTACTGGTTCAAATTCTAATAGTGCTTATTGATGACATTCA
GTGGAGCCCAAGAGAGACAGATCTGACATTCCTCCAGTGACCCTAGGATGACAACAGCT
CTTAATTTCTTTCTTGGATATATATTTTTCCTTACATCTAGTGTCTTCTGTCTTCTCTGCCT
TTTGTAATTTTATATTCACTGTGCTGGGATTATCCTCTCCCCTTTTTGACCCACTGTTGTG
TGTATTTGAATTTGTACGTATAATTTTTTTTGTTTTTGTTGTTTAATATAATTAGCTCTTAAT
CACAAAAAAAAAA
SEQ ID NO:216
GCGAGGAAGAAGAAGCGTCGAGGGTTTAGTCAAAATCCCAAAACCTTGATCCAATAGAT
GCTCATATGCACCCAGATTGTACCCCCGGAATTCGAAACCCGCTGACGCAGATGTATAT
TTGGGAAAAATTGGCCTTGGGGATGCTGATCTGCTCGAGCCTTGAAAGACTTCTGTGAG
GATCTTCATTATCGGAAATCACCCGAGGACTGCTGGAGATCAGGTGTTTCTGCAATGCA
CCGTGTGGGCAGTACAGGGAATACTTCAAATTCGTCCCGTCCTAGAAGAGAAAAGCGGT
TAACTTATGTGTTAAATGATGCAAATGATTCCAGACATTGTTCAGGGATAAACTGCTTGGT
GATATCAAAGTTATCTTTGCTTGGTGGAAATGACTATTTGTTCAGTGGTAGTCGAGATGG
AACACTCAAACGATGGGAGTTGGCTGATGATTCTGCTGTCTGCAGTGCAACATTTGAATC
TCATGTTGACTGGGTAAATGATGCGGTCCTCACAGGCGAGACACTTGTTTCTTGTTCTTC
AGACACTACTCTCAAGACCTGGCGTCCTTTCTCTGATGGTGTCTGCACCAGAACTCTTCG
TCAACATTCTGATTATGTTACATGCCTTGCAGCGGCATCAAAAAATAGCAATATTGTTGCT
TCAGGAGGTCTTGGTCGTGAAGTGTTCATATGGGATATTGAAGCAGCAATGGCTCCAGT
CTCACGGACTAGTGAAGCAATGGATGATGATACTTCGAATGGAGTTTTGAGTTCTGGGAA
TTCTGTTCTATCTACAACTGTTCGTTCTACTAATGCCACCAACAGTGCTTCTTTACACACT
TCACAATTACAAGGCTACACTCCAATTGCGGCCAAAGGCCACAAAGAATCAGTTTATGCA
TTGGCTATGAACGATGTTGGTACCTTACTCGTATCTGGAGGAACTGAGAAGGTAGTGAG
GGTGTGGGATCCAAGAAGTGGGGCAAAGCAAATGAAGCTGCGGGGGCATACTGACAAT
GTGCGAGCACTCATTTTGGATTCTACTGGCAGGTTCTGTTTATCTGGGTCTTCTGATTCT
ATCATACGGCTCTGGGATCTTGGTCAGCAGCGCTGTGTACATTCATATGCTGTGCATACA
GACTCTGTTTGGGCACTTGCAAGTACGCCAAATTTTAGTCATGTATACAGTGGTGGGAGA
GATCTTTCTTTATACCTGACAGATTTGACTACGAGAGAGAGTTTACTGCTTTGTATGGAGA
AGCATCCTCTTCTACGGTTGACATTGCAGGATGACTCAATATGGGTTGCTACAACAGATT
CTTCTTTACATAGATGGCCAGCAGAAGGACAAAATCCGCCAAAGATGTTTCAAAGGGGT
GGGTCTTTCCTGGCTGGAAACCTATCCTTTACCAGGGCAAGGGCTTGTTTGGAAGGATC
AGCGCCTGTACCTGTGAACACACAACCTTCATTTGTTATACCAGGTTCTCCAGGAATTGT
ACAACATGAAATACTGAACAACAGGCGGCATGTCTTGACGAAGGATGCTGAAGGTACTG
TAAAGCTATGGGAAATTACTCGGGGAGCAGTGCTTGACGACTATGGAAAGGTTTCCTTT
GAAGAAAAGAAAGAGGAATTATTTGAAATGGTCAGTATTCCTGCCTGGTTCACAATGGAT
ACCAGACTTGGAAGCATGTCTGTGCATTTAGATACACCCCAATGCTTTACTGCTGAAATG
TATGCAGTTGACTTAAATGTTCCAGATGCACCAGAGGAACAGAAGATAAATCTTGCACAG
GAGACTCTCCGTGGACTTTTGGCACATTGGTTGTCGAGGAGACGACAACGGTTGGCAAC
TCAAGCTTCTGCTAATGGGGATTTTCCAGCAGGGCAAGAAAATGCTCTCAGGAATCATAT
TTCATCCAGAATAGATGTCCATGATGATGCCGAAACCCATATTGCTGGAATTCTTCCAGC
ATTCGATTTCTCAACTACGTCACCACCATCAATAATTACCGAAGGTTCTCAAGGAGGCCC
ATGGCGTAAAAAGATAACAGATTTAGATGGAACAGAGGATGAAAAGGATTTCCCATGGTG
GTGTCTAGAATGTGTACTGCACGGCCGACTTTCCCCTAGAGAAAGTTTAAAGTGTAGCTT
TTATTTGCATCCATACGAGGGAACAACTGTGCAAGTGCTTACCCAGGGAAAACTCAGTG
CACCCAGAATATTGCGCATACAGAAAGTTATAAATTATGTTTTAGAGAAAATGGTCCTTGA
TAGACCACTGGATTCTAGCAACTCAGAAACAACTTTTACTCCAGGATTAAGTGGAAACCA
ATCACATGCAGCAGTAGTTGGAGATGGTTCTCTTAGATCTGGTGCACGAGTCTGGCAGC
AGAAAGCAAAGCCATTGGTGGAAATTCTTTGTAACAATCAGGTGCTCTCACCTGATATGA
GTTTGGCAACGGTGCGAACATATATCTGGAAGAAACCGGATGACTTGTATCTCTACTACA
GACTAGTGCAAAATAGATGAATCTTCGCACATCCTTTTGAGATGTGTTTAACCTTGTAATA
AAGTAGTCTGATACAATGTTTTGCTTGCTTTCTGAATTGTGTCTTTTTCAGATGGCAATCT
TCATAACTTGTTTTCTTCATCATATGCAAAATCTGTTGGATAATTAAGAGGAACAAAATGA
AAGGTGTAGTATTCATGTATTGATTTCTGGTATTGATACTTTTTCTAAGTGGTGCCATGAG
GAGGTACCTCTTTTTCATTGTGTAGACTAACCATGGAAAAAAACAGAAGAGTTCACATGT
AAAGGCTGCCTCTAACAAGGTGTGGAATAAAATTTGTCTCCAAGCAAGGAGAAAATTCGA
GGGTTATCCTCATTCCTGAGAGCATACAGCGTTATCTTTGAGACGAGTCATCAATGATAA
TATCCTCGTAAAAGGTTACAGGATCTGTTGAAGAGAGTGAATTGATTATCACTGCATGTA
AAACATCTGTCCACAGAATGTTCTTGTGTTTTAATGTAAAATTGGGCACTTCTAAGTGTTC
TAAAAGTAGCTCATATATTGATTTTGATGTAAGCAAAAAAAAAA
SEQ ID NO:217
CCGCAACTGTCACTGATCGAAGAAAAGAAGCAGCAGCAGCAGGAGAGCAAGATCAATGA
TGAAGGGCAAAACAATCCAGATGCAAGCTGCCCATCAAAATCATGATGGCGAAACATCA
GTAGCTTGCGTTCTATGGGATTGGCATGCCAAGCATCTCATCACTGCAGGAGCCGATAA
TACCATTCTCATCCATTCTTATCCTTCATCCTCATCCTCTAAACCCATCACTCTTCGCCAT
CACAAGAATGCAGTCACTGCCCTCGCCATAAATTCCAATGTGAGAAGTCTTGCCTCTGGA
TCCGTCGATCACTCTGTTAAGCTCTACTCTTATCCAGGGGGTGAATTTCAGAGCAATGTA
ACACGGTTTACTCTGCCAATACGATCTCTGGCCTTCAACAAGTCTGGTGAGTTACTAGCT
GCTGCAGGTGACGATGAAGGCATTAAGCTAATTAGCACCATAGATAATTCCATTGCAAGG
GTGCTGAAGGGCCACAATGGCCCCGTTACTAGTATTTCTTTCGATCCGAAAAACGAGTTC
TTGGCATCTTCAGACAGTGATGGCACTGTCATATATTGGGAGCTTTCAACTGGGAAACCT
GTACATACATTGAAAAAGATTGCACCCAATACGACCTCCAATCCGACGAGTTTAAATCAA
ATCAGCTGGCGTCCTGATGGTGAGATGCTGGCTGTCCCAGGCAGGAAGAGTGAGGTTT
CAATGTATGACAGGGATACTGCAGAGAAGCTTTTCAGCCTTAAAGGCGGGCATTCAGAT
ACTATTTGTTCTCTGGCCTGGTCGCCTAATGGGAAGTATATTGCCACAGCTGGGACTGAT
CGGCAGGTCATGGTTTGGGATGCCGATAGGAGGCAGGATATTGATAAGCAACGTTTTGA
CAACCCAATTTGTTCAGTTGCCTGGAAGCCCAGTGACAATGCCTTGGCGGTGATTGATG
TACTGGGCAGATTTGGGGTATGGGAATCACCTATTGCATCTCATATGAAGTCCCCTGCA
GACGGTGCAGAGCGCTACGATAACATGGAAGATGAAGAGCCCTTGATGGCTAGATATGA
AGAGGAATTGGAAGATAGTGTATCTGGAAGCCTGAATGAGATAATAAATGACGACGATG
ACGATGATGAAATGGGTAAGATTCCAAGGAAGATCTTACAGAAGAAGCCTTCTGTTAAAG
TTGAAAAAGGCAAAGAAGAGAGCAACGCTAAGGCTTTTAAGAGTGGTCAAGACTCCTTC
AAATTAAAATCAGCAATGCAAGAGGCGTTTCAGCCTGGTGCAACGCAGCGGCAATCTGG
AAAGCGGAATTTTCTTGCTTATAACATGCTTGGAAGTGTTATCACCTTTGATAATGATGGC
TTTTCTCATATTGAGGTTGACTTCCATGATATAGGGAAAGGTTGTCGTGTGCCCTCCATG
ACTGACTATTTTGGTTTCACCATGGCCTCCCTCAGTGAAAGTGGGAGCGTTTTTGGGAGT
CCGCAGAAGGGCGAGAAGAATCCTAGTACACTCATGTATCGACCTTTCAGTAGTTGGGC
GAACAATAGTGAGTGGTCCATGCGATTTCCAATGGGAGAGGAGGTGAAAGCTGTTGCTC
TTGGTTCAGGTTGGGTTGCAGCAGTGACAAGTCTTAATTTCCTTCGGGTTTTTTCAGAGG
GCGGCTTGCAGAAATTTGTTCTTTCTATGGATGGGCCAGTAGTCACTGCAGCAGGATAT
GAGAACCTCCTTGTTGTTGTATCACATGCTTCAAATCCTCTTTTATCTGGAGATCAGGTG
CTCAGCTTCACTGTGTATGACATTTCTCAAAAAACTTGCCCTCTCTCTGGTCGGCTTCCTT
TAAGCCCTGGCTCTCATCTCACATGGCTTGGATTCAGTGAAGAAGGCTTATTAAGTTCAT
ATGATTCTGAGGGAAATTTAAGAGTGTTCACCAATGACTACAATGGCTGCTGGGTGCCAA
TTTTCAGTGCTGCAAGAGAGAGAAAGTCAGAGACTGAAAGTATCTGGATGGTGGGGCTA
AACAGTACACAGGTTTTTTGTGTTGTGTGCAAATTGCCCGATACCTATCCACAGGTAGCC
CCAAAGCCAGTCTTGAGTGTCCTGAACTTGTCATTACCTCTAGCATGTTCTGATCTTGGA
GCAGATGACCTAGAAAATGAATACCTTAGAGGCAGCTTACTTCTTTCACAGATGCAAAAG
AAAGCTGAAGATGCAGTGGCATGTGGTCGTGAAAGTAACATGGAAGAGGATAGCATTTT
TAAAATGGAAGCTGCACTTGATCGATGCCTTTTACGTCTTATAGCAAATTGTTGCAAGGG
AGATAAATTGGTTAGAGCAACAGAGCTGGCAAGACTATTATCATTGGAAAAGTCTTTACA
GGGTGCCATAAAACTTGTCAGTGCTATGAAGCTCCCAATGCTTGCAGAACGGTTTAATAC
TATACTGGAGGAGAAGATACTTCAGGAAAATATGGAGACGATCTCTTGTAGGAGATTAAC
TTCGGAGGCACAAGACATGGACACTCCAATTTCCATTAGTGTCAAACAAGTTTCATACGG
GGCAAACCTTGGTGATAGCCCCTTTCTTCCAAATCGTCAAGTCGAACCAAAGCATTCCAC
TCCAGTGTTTTCAAAGCCTGATACAAAAATTGAAGTAGATACCTCAGAAGCTATAGCAAA
GGGCTGTGATGCACAAAATGGAAATATAAAGAGTGGTGATGCAGAGGTCCAGCCTGCAA
GCCACAATGACTCTATTCAGAAGCCAAGTAACCCCTTTGCTAAGGCATCCAATACTTCTG
CAAACCAAGCTGTACAGCGCAATGCATCTTTACTGAGTTCCATCAAGCAAATGAAGACTG
CCACAGAAAATGAAGGGAAGCGAAAAGAAAGGGCAAGGTCAGGTTCCTTGCCACAGAAA
CCAGCAAAACAAAGTAAGATTTCATGATAAAAGCATGTGCTCAATAGTGGAAACATAAGA
TACAGTCAGGCTGAAAACCCTGACAAGTAAATGCTCTATGCCATGTAGGTGACATGGAT
GTCTGATAATGATGACTAAGTGCCTCATTTATTCAATTACGACGGATTCAGTTGGCCTTTT
GTAAAAAAAAAA
SEQ ID NO:218
AAATACCTAGTATACATAACAAGATATTATACTCTGTTCCTCGCCCTAGATTGCGATTGAA
AGCAAGCGAGACAGTCGGGGGGCGGGGTTGCTGCTGTTGAAGGCGCCTTGGGCTGTAT
AGTAACAACACAGCCACTGAAATCATTGCTGGGGCGTCGGTAAAGTTGCAGAAGCAGCA
GGACGACTAACAGGATGAAGCAGAAGCGAAAGGGCCATCAAGTTGATGACCCAAAATAT
TCTGTCCAAACCCCCCAAGAGGACGACACACCTAACGAAAGCGGCCCTGCTTCGGAAG
AAGTGGAGAGTAGTGACGAAGAAGGCGGCAATAGCAGCAACATCGAAGATGATATTATT
TATTCTTCTTCCGAAGAAGACCCCGTTGTTAGTTCTGATTACGAAGAAGACGAAGACGCA
GAAAGTGATGCAGAGGGTGTGACTGCAGAGCAAGAGTTGGAAGGGGACATCGACAATG
CCCTCCAAAATTATATGGGAACCCTTACTGTTCTTTCCAATTTTCATGGAGAGAACCTTAA
AAATGCCGAAGGTGAAGATACTTCTGGGGATGATGATGACGAGGAGGAGATGCCTAAAA
GAGCTGAAGAGTCTGATTCTCCGGAGGACGAGAATGATGAGAGGCCTAAAAGAGCTGAA
GAATCTGATTTTTCGGAGGACGAGGATGAGGAGAGGCCTAAAAGAGCTGAAGAGTCTGA
TTCTTCGGAGGACGAGGTTCCTTCCAGAAACACTGTTGGAGATGTTCCCCTCCGTTGGT
ATAAAGACGAACAACACATTGGCTATGATATTAAGGGTAAGAAGATCAAGAAGCAGCCTA
AAAAGGACCAGCTAGATTCATTCCTCGCTAGTACAGATGATTCCAGTGACTGGCGTAAG
GTATATGACGAATATAATGATGAAGAGGTTGAGCTAACAAAGGATGAAATCAAGTTTATTA
GTAGATTACGTAAAGGCACAATACCTCATGCTGATGTCAATCCATATGAGCCTTATGTTG
ATTGGTTTGACTGGAAAGATAAGGGTCATCCCTTGTCTAATGCCCCAGAACCAAAGCGG
AGATTTATCCCATCAAAATGGGAAGCTAAAAAGGTTGTGAAGCTTGTGAGAGCAATTCGG
AAAGGATGGATAACGTTTCAGAAAGCTGAAGAAAAGCCTCGCTTCTATTTGATGTGGGGA
GATGATCTCAAACCATCAGAAAAAATGGCAAATGGGCTGAGTTATATCCCGGCGCCAAA
ACCAAAGTTACCTGGGCATGAAGAATCATACAATCCTCCACCTGAGTATATTCCAACACA
GGAGGAGATCAATTCATACCAGCTTATGTATGAGGAAGACCGTCCTAAATTCATCCCTAA
AAGGTTCGACTCATTGAGAAATGTTCCAGCTTATGACAGATTCCTTTCCGAGATATTTGA
GCGATGTTTAGACCTATACTTGTGTCCAAGAACTCGGAAAAAGCGTATTAACATTGACCC
CGAATCATTGATACCTAAGCTTCCGAAGCCTAAAGATCTGCAGCCTTTTCCCTCTATTTG
CTTCCTTGAATATAAAGGCCATACAGGCGCTGTTTCATGTATATCTCCAGAGTCATCAGG
GCAGTGGCTGGCCTCTGGTTCAAAAGATGGTACTGTGCGTATTTGGGAGGTGGAAACTG
CGCGTTGTCTTAAAGTTTGGGATATTGGGAGGCCCATACAGCATATTGCATGGAATCCA
GTTTCTCAGCTTTCTATTCTGGCAGTTGCAGTGGATGAGGAAGTGCTTGTACTAAATACT
GGACTTGGAAGTGAGGATAGTCAAGAAAAAGTTGCTGAACTATTGCATGTGAAATCAAAA
CCTGTATCAGCTGATGACTTGGGTGATAACACTTCTCTGACTAAGTGGATTAAACACGAA
AAGTTTGATGGGATCAAACTTACCCATTTGAAGCCGGTGCACTTGATATCTTGGCATCAT
AAAGGAGACTACTTTGCGACTGTTGCACCCGATGGTAATACTAGGGCAGTACTTGTGCA
CCAGCTTTCTAAGCAGCAAACCCAAAACCCTTTCAAAAAGATGCAAGGGCGTGTTGTTCA
TGTTCTTTTCCACCCTAGCCGAGCAATTTTCTTTGTTGCTACAAAGACACATGTCCGGGT
TTATGACCTTGTCAAACAACAGCTTGTTAAAAGGCTTGTGACAGGTCTGCATGAGGTTTC
ATCCATGGCAGTGCATCATAAAGGAGATAATCTTCTTGTTGGGAGCAAGGAGGGAAAAG
TGTGCTGGTTTGATATGGATCTTTCAACACAACCTTACAAGACTTTAAAGAACCATTCCAA
AGATATTCATTCTGTTGCATTCCAGATTCTTATCCTTTGTTTGCATCATGCTCAGATGATT
GCAAAGCATATGTTTTTTATGGGTTGGTGTATTCTGATTTGCTTCAGAATCCATTAATTGT
CCCTTTAAAGGTGCTTCAAGGTCACCAGAGTGTAAACGGCATGGGTGTTTTAGATTGCCA
GTTTCATCCTAAGCAGCCGTGGTTGTTTACAGCAGGGGCTGATTCAGTTGTAAAGCTATA
TTGTAATTAATTTATCATCCCTTCAGCAGAATCAGTCAATATGAAGGCGACATTATTGTGC
CAAAATTGGGAGGGTAAAAGACTACCGTGTTAATAATTTTGCATATGTTCAGGGGTATTA
AAAATTCAGAGGATAAATTTCCTCACTCTCAAGTGTTAGATGGTTTTATTCGAGAAAAAAG
ACAGTCAATTCTATATCAATATTTGAAAGTGGAGGCTTGTTGGAGGTTGTCATTTATAGAA
AACAGCTCTAATATGTATCGTTATATTGAAGGGAGAGATCATTAATGCTTTTGTATCAAGC
ATAGGGTGATATTTTGACGTGATGTTCAAAATTATTGTAGGCATATGACGGGATTTTTGAC
AAGGCCTCAGGTTGTTTCTCGAGATGCTACATGAGAATTTATGTTTCAAGTTCCAAATTTT
TTTTTGGTTTAAAGTTCCGGTATTTAGATATGCCGAGAGCTTGATTGGACCGAGAGCTTG
AATATATTCCGAATCTTGATCTTAAAAAAAAAA
SEQ ID NO:219
ACATTTCTCGCTAAACAGAGCCAAATTCACCTTCTGGCTGACGCTGCAACAGGTTGGGT
GATCCAGAGCTTCAATTTGAATATTCAAAGCATTTTTTAGCTCAAAGTTCTACCGTTTCCA
GATTAAGAAGTTAGGTGAAGAGAAATTCTCTTCCTTTTTCTTAAGTACCTTGATTTAAGAA
ACAGAATGATGTCTCTAAAAAGAGGATTCGAAGAATCCTTGGTACCAGCCAAAAGACAGA
AAACAGAGTTGTCCACTGTCACATATGGTGATGGGCCTCGACGGACATCCAGTTTGGAA
TCCCCAATCATGCTGCTGACTGGGCACCATGCAGCTATATATACAATGAAGTTCAATCCC
ACAGGAACAGTGATTGCATCTGGATCACATGAGAGGGAGATATTTTTGTGGAATGTGCAT
GGGGATTGCAAGAATTTTATGGTCTTGAAAGGTCACAAGAATGCAGTTTTGGATCTGCAC
TGGACAACCGATGGCTGCCAAATCATATCAGCAAGCCCTGATAAGACTCTTCGTGCCTG
GGATGTTGAGACAGGAAAGCAAATAAAAAAAATGGCAGAGCACTCATCCTTTGTTAATTC
TTGTTGTCCTTCACGGCGTGGGCCCCCTCTTGTAGTTAGTGGATCTGATGATGGGACTG
CAAAGCTCTGGGATTTACGTCACAGAGGAGCTATTCAAACCTTTCCAGACAAATACCAAA
TCACAGCTGTTGGGTTCTCTGATGCCGCAGACAAGATATACTCTGGTGGAATAGATAATG
AAATTAAGGTGTGGGACCTTAGAAGAGGTGAGGTCACAATGCGACTCCAAGGCCACACA
GACACAATTACAGGCATGCAGTTAAGTTCTGATGGCTCTTATCTCCTGACTAATTCGATG
GATTGTTCTCTTCGTATTTGGGATATGCGTCCATATGCCCCTCAGAATAGATGTGTGAAA
ATCTTAACAGGCCATCAACACAACTTTGAAAAAAACCTTTTGAAATGTAGTTGGTCTTCTG
ATGGAAGTAAAGTCACGGCTGGTAGTGCAGATCGTATGGTTTATATATGGGACACAACC
ACTCGACGTATATTGTACAAGCTCCCAGGCCACACTGGGTCTGTAAATGAGACTGGTTTC
CATCCTACACAGCCAATTATTGGATCATGTAGTAGTGACAAGCAGATATACTTAGGGGAG
ATTGAGCCTAATGTTGGGTATCAAGCTGTAATTTAGACATGATTCAAAGTCTAGACATTAA
TGTTTTGGAACTCTTTTTTCGAAAAAAAAAGAAAAAAAAAA
SEQ ID NO:220
GGAGTTTTTTGCTCTGTTGGAGATTGTGAAAAGGCAGAGGAGCAGAGGCTATGGAGTTC
AGCGACACATATAAGCACACAGGGCCTTGCTGTTTCTCTCCTGATGCACGCTATTTGGC
CATAGCCGTTGACTACAGGCTGGTGATTCGAGATGTCGTTACCCTCAAGGTTGTACAGTT
GTATTCATGCATGGATAAGATCAGCAATATTGAGTGGGCTCTTGATTCAGAGTATATTCTT
TGTGGTCTATATAAGAGAGCAATGGTTCAGGCATGGTCATTATCACAACCTGAATGGACA
TGCAAGATAGATGAGGGGCCTGCAGGAATCGCCCATGCAAGATGGAGTCCCGACAGTC
GCCACATTATTACAACATCTGACTTCCAGTTGCGTCTCACAGTCTGGTCTCTTGTCAATA
CAGCCTGCATACATATACAGTGGCCTAAACATGCATCTAAAGGTGTTTCATTTACCCAGG
ATGGCAAGTTTGCAGCTATAGCTACACGGCGAGATTGCAAGGACTATGTGAATCTTCTTT
CTTGTCATACGTGGGAAGTCATGGGCACATTTACTGTTGACACTATAGATCTTGCAGATC
TTGAATGGTCACCAAATGATAGTGCAATTGTTGTTTGGGACTCCCCACTTGAGTACAAGG
TTCTTATTTACTCTCCAGATGGGCGGTGTTTATTTAAATATCAAGCTTATGACAGTTGGCT
AGGTGTGAAGACTGTTGCATGGTCTCCATGTAGCCAGTTTCTGGCAGTAGGCAGTTATG
ATCAAACACTGAGGACTTTGAATCACCTTACTTGGAAACCTTTTGCAGAATTTGTGCATGT
GAGCACTGTTCGAGGTCCTGCCAGTGCTGTTGTTTTTAAGGAAGTAGAGGAACCATGGA
ATCTTGATGTGTCTGGTCTTCACTTGAATGATGACAATGCTCATGACATCCAAGATGGCA
AGCCAGCTGAAGGCCATTCTAGGGTCCGGTACAAGGTAGTGGAATTTCCTGTCAATGTA
TCTTCACAAAAGCATCCCGTGGATAAACCAAATCCAAAGCAAGGCATTGGCTTGCTAGC
GTGGAGTCGAGATAGCCAATATTTGTTCACTCGTAATGACAATATGCCAACGGCACTTTG
GATATGGGATATTTGTCGCCTTGAGCTTGCTGCACTTCTGATACAGAAAGAGCCCATTCG
TGCAGCTGCCTGGGATCCAGTATATCCTCGTGTGGCTCTTTGTACAGGAAGCTCGCATT
TGTACATGTGGACACCCTCTGGTGCTTGTTGTGTGAATATTCCTCTGCCACAATTTGTCG
TATCAGATTTAAAATGGAACCCAGATGGAACTTCTATGCTTCTAAAGGATCGTGAGTCGT
TCTGTTGTACTTTTGTTCCAATGCTTCCTGAATTCAACGACGATGAAACTAATGAGGAATA
GTACACACAAAGCAAAAAGTGTATCCAACATCACTTGTGACAAGGCCATCGTTGTCTGCG
AATCTTCTTGGTCAACTACAAATGACCTCTGTATACTTCTCCTGGTCGTTGACACTCTAGT
GCTCACAACAGTGATTGTTTACTAGTTTCAGCTATGTCTTAAGTGTCTCCTACATTCTGCA
ACAAGTTATTTAGCTGTACCATTGTTGAAATTGTGCAAAGTTGATCCTTGGAGAGTTTATT
CCAATGTTTAAGATACAACTTGCTGCAGTGCCTTAGGAGTTCCAATTCTTCCCATTCAGC
TATATGAATAGCAACTAAGTGGTTATAGAGCTTCTCTCCATTCTTATTACTTCACTCGAGG
ACGTGATGTCTACTTCAATTTTTATAATATAATGTTGGATTTATCTATCGACCTTTTATTCA
TTCTCTTAGTCTTTGCTAGTGTAGGCATCAATTCTTATTATAAATATATTGTACTGGGGAT
CCAAGACATGGCAATATATGTCGAGATTTTCATTTTCTCAAAAAAAAAA
SEQ ID NO:221
CTGCCACCCACATGGTAGCTTTCTTGTAAATTGCAGAAGTGTTACCATATTGTGATTAAAT
TAAATTGTGTGCTGTTTTTATAGTCATTCCCTGCATTTTTCCCTGAACGTTTCAATCCATTA
CAGGCCCTTTTTCTTGGTGTCAGTTGGTCGAATCGCGTGAGCCCTTGCTAGATTCCAATC
CCGTCCACTGCTGCGTAGTCTATTCGTCTCTCTCATTTTGCTGTTTGATATGATTATGAGG
CTTTGAAGTCTATCGCACTTAAAAAGCTAATCGCCAATTTAGCAGAAAAAAACTGACCCAT
CTTAGATAAGCCCTCCAAATTCTACTCATGAGCCCTGTTTTGCACAAAGTTTAGCATACAA
ATAGACCCCTCTGAGACAAGCCATCAAAGGTCCAATCCCTCTGGTTTCCTTTTCCTTCAA
ATTTAACCGAAAATCGGCAGATAAGGCCCCATAATCCTACCCATAAACCTGTTTTCTCTTA
ATTTTCGCATACGACTTGATTGATTTCGGATATTAGTATGGCCAAGCTCATTGAGACTCAT
TCCTGCGTCCCTTCAACGGAGAGGGGCCGTGGGATTCTCATAGCAGGAGATGCAAAGA
CCAACTCTATCATATACTGCAATGGCCGATCCGTGATCATGAGAAACCTGGACAATCCCC
TGGAAGCTTCTGTTTATGGAGAACATTCATATCCTGCAACTGTGGCTCGCTTCTCACCAA
ATGGAGAGTGGGTTGCATCAGGGGACACTTCTGGCACAGTTCGAATCTGGGGACGAGG
TTCGGATCATACTCTCAAATATGAGTACAAGGCTCTCGCTGGAAGGATTGATGATCTCGA
GTGGTCTGCCGATGGACAGCGGATCGTTGTATGTGGAGACAGTAAAGGCAAATCAATGG
TCCGGGCTTTCATGTGGGACTCTGGCACAAATGTTGGTGAATTTGATGGGCATTCAAGG
CGTGTTCTAAGTTGTTCCTTTAAACCAACGCGACCATTCCGTGTTGCCACATGTGGAGAG
GACTTCTTGGTGAACTTTTATGAAGGACCACCCTTTAGATTTAAGACATCACACAGAGAT
CATTCAAATTATGTGAACTGCGTAAGGTTCGCTCCAGATGGAAGCAAGTTTATTACTGTT
GGTTCGGATAGAAAAGGAGTGATTTTTGATGGCAAGATGGGTGAAAAGATTGGGGAGTT
GTCTAAAGAAGGTGGACACACGGGCAGTATTTATGCTGCTAGCTGGAGCCCTGACAGTA
AACAGGTACTTACTGTGTCTGCAGACAAGTCTGCTAAAATATGGGAGATTAGTGAAACTG
GTAATGGAACTGTGAAGAAAACATTGACTTTTGGAAGCCAAGGAGGAGCTGATGACATG
CTAGTTGGGTGCCTTTGGTTAAATGATTATCTGATTACTGTTTCTCTTGGTGGCATCGTCA
GCTTACTCTCTGCAGTTGATCCAGATAAACCACCAAAGACAATTTCTGGTCACATGAAAA
GTATAAATGCAATTGCATTATCTCTTCAAAGTGGACAAAGTGAGGTTTGCTCAAGCAGCT
ATGATGGTGTAATTGTTAGATGGATTCTTGGAGTTGGCTATGCTGGTCGGGTAGAGAGA
AAAGATAGTACTCAAATCAAATGCCTGGCAACAATTGAAGGAGAGCTGGTAACTTGTGGT
TTTGATAATAAGGTGAGAAGGGTACCTCTGCTATCAGAGCAACATAAAGAGTCAGAACCA
ATTGACATTGGAGCACAACCAAAGGATCTAGATGTTGCAGTTGGTTGCCCTGAGCTTACT
TTTGTTTCCACCGATGCTGGGATAATAATTATTCGTGCGTCAAAAATAGTATCAACTACTA
ATGTTGGGTATGCAGTGACTGCAGCTGCAATATCACCTGATGGAACAGAGGCTGTGGTT
GGTGGTCAGGATGGGAAATTGCGTGTGTACTCCATCAAAGGTGACACTCTTCTGGAAGA
GTCAGTCCTTGAAAGACATCGTGGTCCAATAAATGCAATCCGCTTTTCTCCTGATGGATC
CATGTTTGCATCGGGAGACCTGAACAGAGAAGCAGTTGTGTGGGACCGTATTACTAGAG
AGGTGAAACTTAAGAACATGGTCTACCACACTGCACGTATCAATTGCATTGCATGGTCTC
CAGATAGCTCTAAAGTGGCAACAGGCTCTCTTGACACCTGCATATTGATATATGAGGTGG
GAAAACCAGCATCCAGCCGAATTACAATTAAGGGGGCTCATTTGGGAGGGGTTTATGGT
CTAGCTTTTAGTGACCAGAGTACTGTTATAAGTGCAGGAGAAGATGCATGTGTCCGTGTG
TGGAGTCTCCCATAGCTCTGTAATGTCTGGATCTAATAATTTATATTTAGGGTATAGGGAT
AGATGTCTCGACATACGAGCATGGTTCTTGATGCCATCTTGTGCCATGTTTGTAGTGCTT
TTGCATGAGTTCAAATGTCTTTGTGACATATTGTCTTGAACCACCGAGGATATATCATATT
TATCTTGTAGATGGTTTCAAACGGTGCTGCTTATCTGGGGGTAATGGGGAGCATGGAGT
TTTTCAAAAAAAAAA
SEQ ID NO:222
CTTGGGCCACCACCTTCTCTAGTAGCTGAATCTTGCAGGGGGTCGAAGTCTGTTTTTGAA
GACCAGATTGCAGCCTATTACCCAACCAAATCAGGCCTTGTTGTTGGGTTGCTACAAGTG
ACAATATCTAAGTATTGACCAGGGTGAAGAGATGCCACAACCGTCAGTTATCCTTGCAAC
AGCAGGTTATGATCACACAGTAAGGTTTTGGGAAGCCACTAGTGGCCGCTGCTATCGGA
CTCTTCAGTACCCTGATTCACAAGTCAACCATCTAGAGATAACACCTGACAAGCAGTACT
TGGCTGCAGCTGGGAATCCACATATTCGTTTATTTGAAGTCAATTCCAATAATCCTCAAC
CTGTAATTAGTTATGACTCCCACACAAACAATGTTACGGCAGTGGGATTCCAGTGTGATG
GAAAATGGATGTACTCAGGTTCGGAAGATGGTACTGTGAAGATATGGGATCTGAGAGCT
CCAGGTTTCCAGAGGGAGTATGAAAGTCGGGCTGCTGTTAATACTGTTGTCTTGCATCC
AAATCAGACAGAATTGATATCCGGAGACCAAAATGGAAATATTCGTGTGTGGGATCTCAA
TGCAAATTCTTGCAGTTGTGAACTGGTTCCAGAGGATACAGCTGTAAGATCATTGACAGT
TATGTGGGATGGAAGTCTGGTTGTAGCTGCAAATAATCATGGAACATGCTATGTTTGGAG
GTTGATGCGTGGAACACAGACAATGACGAACTTCGAGCCATTGCATAAGCTTCAAGCAC
ATAATTCATACATTCTGAAATGTCTTCTTTCACCCGAGTTCTGTGAACATCATCGGTATTT
GGCAACAACATCATCTGACCAAACTGTAAAGATATGGAATGTTGATGGCTTCACTCTGGA
GCGTACACTAACAGGACATCAACGTTGGGTTTGGGATTGCGTATTTTCTGTTGATGGAGC
TTTCCTTGTTACTGCTTCCTCAGATTCAACTGCAAGACTGTGGGACCTATCAACTGGAGA
AGCCATCAGAACATATCAAGGGCATCATAAGGCGACTGTATGCTGTGCTCTGCATGATG
GCACAGATGGTGCTTCTTGCTGATGCCTTGTTTGTATGTCCAATAGATTATAACCTATTTA
CTGTGACACTATTCTTCACACCCATGTCAATTGTCCAATGTAGCATGCCCCGCACCTTGT
ACAGAATTGTTAGAGCAAGTTGTACTTCAGGTCGTAGAGTTGCTTTTCACTTGTTATTTGT
GTAAGTGTCATTAGATTAATATGAAATTAATATGAAAGATTGTACGCTAGCTGTGATGGAC
ATTAGGAGAGAAAT
SEQ ID NO:223
ACTTCTCAAATACCTATAGTAGAATCGGAATTGCCATGTATTTACTTCACTTCTTTAATCAT
AAATTCTCATAAACACCTGTCGCATTTTCCATTTCATCCGATTGGGCTCAATCGGAGGTC
ATTCAGGCTGATTATTTGGATTCTGAAAAGAGGGATTCCAGCCTTAAAGAGCCTTCAAGC
TCCCGAAGTTCTGCTTCAGAGAGAGGGATTCGGGAATTTCACAAGGCCAAATTCCGGAA
AATAGGTTCAGAGCAGAGGATCTGGATTTAAATTCGATTTAAAAGTCGATTCTTGTTTGAA
TTCAAACCTCCTTCGTAGATCTGAGATTTAGAGGTTTATTCTTTTGGAGCTTTCTAGTTTG
AGGGTATATGTGAGATCATATATCGGATCCAGTTGATAGTCCACTAAAGTTTCTTTTTTCC
TTCTACTGTTTAATTCATTTTGCAACATTGTGAAAATTGGTTTCAATGTTAACCAAATTTGA
AACCAAGAGCAATAGGGTAAAGGGTCTCAGTTTTCATCCAAAGAGGCCATGGATCCTGG
CAAGTCTTCACAGTGGAGTTATTCAGTTATGGGACTATCGGATGGGAACTCTGATAGACA
AATTCGATGAACATGATGGACCTGTTCGAGGAGTTCATTTTCACAAGACACAGCCACTCT
TTGTGTCTGGAGGTGATGATTACAAGATTAAGGTCTGGAACTACAAGATGCGTCAATGCC
TCTTTACATTTGTAGGGCACCTTGACTACATCCGGACAGTTCATTTTCACAATGAATATCC
TTGGATTGTTAGTGCAAGTGATGATCAGACAATAAGGCTCTGGAACTGGCAATCACGGG
TTTGCATCTCTGTCCTTACCGGTCACAATCACTATGTGATGTCTGCTTCCTTTCATCCTAA
GGAAGATCTAGTTGTTTCAGCATCGTTGGACCAAACTGTTCGTGTATGGGATATAAGTGG
CCTTAGGAAAAAAACTGTGTCTCCAGCTGATGATCTTTCGAGGCTGGCACAGATGAATAC
GGATCTTTTTGGCGGTGGTGATGTTGTAGTGAAGTATGTACTCGAGGGACATGATCGTG
GTGTGAACTGGGCTGCTTTTCATACCAGCTTGCCATTAATTGTTTCTGGTGCAGATGACC
GCCAAGTCAAGCTGTGGCGAATGAATGATACAAAGGCTTGGGAGGTTGACACTCTACGG
GGACATACAAACAACGTGTCCTGTGTGATTTTTCATGCACGACAAGACATTATTGTTTCAA
ATTCTGAAGATAAAAGTATCCGAGTGTGGGATATGTCCAAGCGAACTAGTGTTCAAACTT
TTCGTAGAGAGCATGATCGATTCTGGATTCTTGCAGCACATCCTGAGATGAATCTCCTAG
CAGCTGGGCATGACAGTGGGATGATTGTGTTCAAATTGGAGAGAGAGAGGCCTGCTTAC
GTTGTTTATGGGGGCTCGCTGTTGTATGTCAAGGATCGTTATCTGCGGACTTATGAGTTT
GCAACCCAAAAGGACAATCCACTAATACCAATTAGGAAGCCTGGTTCCATTGGTCCAAAC
CAAGGGCCAAGATCTTTGTCATACAGTCCAACAGAGAATGCTATCTTAATTTGTTCAGAT
GCTGATGGTGGTGCTTATGAACTTTATGCTGTTCCAAAGGATAGTCATGGCAGGAGTGAT
ACAGTACAGGAGGCAAAGAAAGGATTAGGAGGATCTGCTGTCTTTGTGGCTCGCAATCG
ATTTGCTGTTCTTGACAAGAATCACAATCAAGTCACTATCAAGAATTTAAAAAATGAGGTG
ACAAAAAAGTTCGATCTTCCAGTCACAGCAGATGCACTTTTCTATGCAGGGACAGGCAAC
TTGCTTTGCAGATCAGAAGATAGTGTATTTTTGTTTGATATGCAACAGAGGACGGTTTTAG
GAGAAATTCAAACCCCTAATGTCAGGTATGTAGTTTGGTCAAATGACATGGAGAATGTTG
CCTTACTGAGTAAGCATACAATCATTATTGCAAGCAAGAAGCTGTCAAGTACGTGTAGTC
TGCATGAAACTATTCGTGTTAAAAGTGGGGCTTGGGATGACAATGGCATTTTTATGTACT
CAACTTTAAATCACATAAAGTATTGTCTGCCAAATGGAGATAGTGGCATTATCAAGACGC
TGGATGTTCCAGTATACATAACAAAGGTTTCTGGAAAGTCTCTCTATTGCCTCGATAGAG
ACGGCAAGAACCGTGTTATACAGATAGATATTACCGAGTGTCTTTTCAAGCTAGCACTCA
GCAAAAAGAAATATGATTATGTTATAAACATGATTAGAAACTCTCAGCTTTGTGGTCAAGC
AATCATAGCTTACCTACAGCAGAAGGGATTTCCAGAAGTGGCTCTTCACTTTGTCAGAGA
TGAGAGAACTCGATTCAACTTGGCAGTAGAGAGTGGAAATATTGAAATAGCTGTTGCTTC
TGCAAAGGAAATTGATGAGAAGGATCACTGGTACAGGCTAGGAGTGGAGGCCCTTAGAC
AAGGTAATGCTGGAATTGTAGAGTATGCCTATCAAAGAACAAAGAATTTTGAAAGGCTTT
CTTTTCTATATCTCATAACTGGTAACCTAGACAAATTATCAAAGATGTTGAGGATTGCTGA
AATGAAAAATGATGTCATGGGCCAATTTCACAATGCACTATATTTGGGTGATATTCAAGA
GCGGATCAAGATTTTGGAGGAGTCTGGCCACCTGCACCTTGCTTATGCCACTGCATCAT
TGCATGGTCTTGCAGACATTGCTGACAGGCTTGCAGCTGATTTGGGTGGCAATATTCCA
GTTTTACCTCCAGGGAAAAAATCATCACTTCTAATGCCACCTGCCCCCATTCTGCATGGT
GGTGATTGGCCTTTGCTCAGGGTTACGAAAGGTATTTTTGAGGGTGGTTTGGAGAATTC
CACTTCTGCAGCTTATGAAGAAGAGGATGAAGAAGCTGCTGCTGACTGGGGTGAAGACA
TAGATATAGAAAACATTGAAGGGGAAAATGGTGAGGCTACAGTGTTGGATGATCAAGAA
GTTAAAGGTGGAGAGGATGATGAGGGAGGATGGGACATGGAGGATTTGGAACTTCCTC
CAGACGTAGCTGCTGCTAATGTAGGAACCAATCAGAAGACATTGTTTGTAGCCCCAACAT
TAGGTATGCCGGTAAGCCAAATTTGGATGCAAAAGTCTTCTCTCGCAGGTGAGCATGCA
GCTGCAGGCAACTTTGAAACAGCTTTACGTCTACTGACTCGCCAACTGGGCATTAAAAAT
TTCTCTCCACTGAAACCTCTATTTTTGGAACTTTATATGGGTAGTCATACATTCCTTCCTT
CCTTTGCTTCTGTACCTGCTTTTTCCTTGGCACTACAGAGAGGATGGAGTGAATCTGCTA
GTCCGAATATTAGAGGTCCCCCAGCCTTGGTCTATAGGCTTTCGGTGCTAGAGGAGAAA
CTAACGGTAGCCTACAGGGCCACGACAGAAGGTAGGTTTAGTGAAGCACTGAGGCTATT
TCTGAACATTCTGCATACAATTCCAGTTATTGTGGTAGACTCAAGAAAGGAAATTGATGA
GGTAAAAGAATTGATTGGAATTGCTAAAGAGTATGTTCTTGGTCTTCGGATGGAGGTCAA
GAGAAAAGAAATAAGGGATGATGCTGTTCGGCAGCAGGAGCTTGCCGCTTATTTTACTC
ATTGTAATCTGCAGAAAGCTCATTTGAAGCTGGCTTTGCTAAATGCAATGGGTATTTCTTA
TAGATGTAAGAACTATAATACAGCAGCAAACTTTGCTCGAAGGCTTCTGGAGACTGACCC
TTCTTCAAACCATGCAACAAAGGCTCGACAAGTTCTTCAGGTCTGTGAGAGGAACTTGCA
AGATGCGACACAGCTCAACTATGATTTCAGAAATCCTTTTGTTGTCTGTGGGGCAACTTT
TACTCCAATATACCGTGGTCAGAAAGAGGTGTCCTGTCCGTATTGCATGGCTCGTTTTGT
TCCTGACATTGCTGGGAAACTTTGTTCGATATGTGATCTTGCAATAGTGGGTTCAGATGC
ATCTGGACTATTTTGTTTTGCTACTCAAACAAGATGATCTTAGCTTTACAGAACAATGATT
TTGTGATGCCTTTTATATTCTTTGAGAGGTAAATTGTTTGCTGCCATTTCCCATCAGTATA
TGCAAGAAATGTTTCCATTTATGTCAGTTGATACCAACACGGGCGTCAGAATAATCGTAC
AATGCATGCCAGGCTTACGTAAGCAACATGTGAGGATATTTATTTAAACATTATAAGAAG
CAGGACGCATGCATGTTTTATTTGCTGTTTCAGTTCCAGAATTGAAGCATGCACGCTGTC
TTATTAGCTGTTTCAGTTCTGGATTTTGTTTGCTGTTCTTCTTAGTAAGGCAGAAAGCTTT
ACTCACTTTGTAAGATGTGATGTAGGAATTTTGTTTACAGTGCAGGATGGCTATGCCCTC
TATTGTTCTCTATTCTGTGCATTGTGGCAAATGGTTTTGTTATACCATTTTTTTGGCCTTGT
AGGTGAATAATTGGATTTGCTTGAAAAAGTCAATCATCTTAATTCGAGGACCTAATAAAGG
TTACTACTTGAAGCTTTTCAAAAAAAAAA
SEQ ID NO:224
CCCGAAGGGCAACTTTGACGTCACAATAGTCCTAAGTGACGGGGCCGCACAACCAGACT
GCTTATGAATACCAGGCTCCACTACCCATTGCAAGACATTTAGTCAGTTCAAAGGGAGCG
CTCTATTAAAAAAATTCCTCTTGTAAATCGTTGCAAGAGAGCTTGCCATCCAACAACAACA
ACGACTCCAGGACGAATGGATTTACTGCAGAATTACCAGGATGACAGTGAAGATTCGAA
CCCAGAACTTAGAAATCATCCACCGCTGGAAGACGCCACGGCCACTTCGGCGCCCGCA
GGAGTCGAAAATGAGACCTCTTCTTCACCAGACTCCTCCCCTTTGCGCCTTGCATTGCCA
GCAAAATCTTGCGCACCCGATGTGGACGAAACCCTAATGGCCCTCGGCGTTCCAGGCTC
CGAGAAAAAAAACAATCACAACAAGCCCATCGACCCTACGCAGCACAGCGTGACATTCA
ATCCAAGCTATGATCAGCTCTGGGCCCCGCTTTACGGCCCTGCTCATCCTTACGCCAAG
GATGGTATCGCTCAGGGCATGAGAAACCACAAGCTCGGTTTTGTGGAGGATTCGGCCAT
CGAACCGTTCATGTTCGACGAGCAGTACAATACTTTTCACAGGTATGGTTACGCTGCCGA
CCCTTCTGCTTCTTTGGGTAGTACTATTGTTGGTGATTTAGAGTCTTTGAAAAAAAACGAT
GGGGCATCTGTGTATAATTTACCTAAACGTGAGCATAAGAGGCAGAAACTCGAGAAAAAA
ATGATACAGAAGGACGAGAACGAGGAGGAGGAAAAAGAAGTTGGAGAGGAAGTTGACA
ATCCTTCCACGGAAGAGTGGCTGAAGAAGAATAGGAAAAGCCCTTGGGCTGGTAAGAAG
GAGGGTTTGCAGACTGAATTGACTGAGGAGCAGAAGAAGTATGCTCAGGAACATGCTGA
GAAGAAGGGTGACAGGGAGAAGGGTGAGAAGGTTGAAATTGTAGATAAGACTACTTTCC
ATGGCAAGGAGGAGAGGGACTACCAGGGGAGGTCTTGGATTGATCCTCCCAAGGATGC
CAAGGCAACCAATGATCATTGTTATATTCCGAAGAGGTGGGTCCATACATGGAGTGGGC
ATACAAAGGGGGTTTCCGCTATCCGATTCTTCCCCAAGTATGGTCATCTGTTGTTATCTG
CTGGCATGGATACGAAGGTGAAGATCTGGGATGTTTTTAACAGTGGGAAATGCATGAGG
ACTTACATGGGTCATTCGAAGGCGGTGCGTGACATATCATTCAGCAATGATGGTTCTAG
GTTTTTGAGTGCTGGGTATGATAGAAATATCAAATTGTGGGATACGGAGACAGGGAAGG
TGATTTCTACATTCTCCACAGGGAAAATACCATATGTTGTGAAGCTCCATCCGGACGAGG
ATAAGCAAAATGTGCTACTGGCAGGTATGAGTGATAAGAAGATCGTGCAGTGGGACATG
AATAGTGGTGAGATCACTCAAGAGTATGATCAGCATCTTGGTGCTGTGAATACCATTACT
TTTGTAGATAATAATAGGAGGTTTGTGACATCGAGCGATGACAAGTCCCTGAGGGTTTGG
GAATTCGGCATTCCTGTGGTCATTAAATACATCAGTGAGCCTCACATGCATTCAATGCCT
TCGATTTCTCTTCATCCAAACACGAATTGGCTTGCTGCACAGAGTTTGGATAATCAATTC
TGATCTACAGTACAAGGGAGAGGTTCCAACTCAATAAGAAGAAGAGGTTTGCGGGGCAT
ATTGCAGCAGGTTATGCTTGCCAAGTTAACTTTTCACCCGATGGACGATTTGTTATGTCC
GGAGATGGCGAGGGCAGATGTTGGTTCTGGGATTGGAAAACTTGTAAAGTCTTCAGAAC
TTTAAAATGCCATGACAACGTTTGCATAGGCTGTGAGTGGCATCCTCTGGAACAGAGCAA
GGTTGCTACATGTGGCTGGGACGGAATGATTAAATATTGGGATTAAATTCAGTACAATAA
ATGCAAGATCTCTAACTCAACTTGAGTATGTATGAAGGTGCCGTATCAAAAGATTGGTAC
TTCCTTATGGACACACAAGATCGTAAGCATGGCTGAATTGACTTGAGGAAAATGAAACGA
CCTTGCTAATTGTAATGATGTAGAGGCCTCATGTGGTACCTGGTTGTCTACCAGACTATA
CTATACAGTCTGTCCTACCTTTCGAAGGAAACTTTTGAGTGACTCATGTCAATTTGTTCTT
GTATTTGACCTCCAAGAGATGTAACATTGAATTTGATTTCACCTCCATGGTTTCCAAGATG
TGAATCTTTTATTGCTTATTTGTTCACACTGGAAAAAAAAAA
SEQ ID NO:225
GATTGCTCTAATATGAACCAGCTCAACTGGACGCTCTAAACTTACGTATGATTGAACGAA
GTGTTTGGCGCTTGTTTCGCCTTATAAATAGTGCTTCTTGTATCAGTGCGTATTGCATCA
GTGTAGTGAACACTTTGAAAGGAGTTTGGGCTTCGACTCTCTACCCTGTAAACAAGTTCA
GCTTAAACGCTGTTGCAGTAAAAGGTGTTGAAGATGGCAAGGAAGGGTTTGGGTACTGA
TCCGGCAATAGGCTCGTTAATGTCATCAAAGAAGAGGAAAGAGTATAAAGTCACCAATAG
GTTTCAAGAGGGCAAGCGGCCTCTTTATGCCATTGCTTTCAATTTCATTGATGCTCGTTA
TCATAATATTTTCGCAACAGCAGGAGGAACCCGGGTGACCATTTACCAATGCTTGGAAG
GAGGTGCTATTTCTGTTTTGCAGGCATATGTGGACGATGATAAGGATGAATCATTTTACA
CATTAAGTTGGGCATGCGATGTGAATGGTTCACCATTATTAGTTGCTGGTGGTCACAATG
GTATAATTCGAGTGCTTGATGTTGCAAATGAAAAGGTCCATAAGAGTTTCGTTGGCCATG
GAGATTCTGTGAATGAAATAAGGACCCAAGCACTGAAACCTTCCCTTATTTTATCTGCTA
GCAAAGATGAATCTGTACGACTATGGAATGTTCAAACTGGAATATGCATTTTGATATTTGC
TGGAGCTGGGGGACATCGCAATGAAGTTCTCAGTGTAGATTTTCATCCTTCAGATGTATA
TCGTATTGCAAGCTGTGGCATGGACAACACGGTCAAGATCTGGTCTATGAAAGAGTTTTG
GACCTATGTTGAGAAGTCATTTACCTGGACTGATTTGCCATCGAAATTCCCCACGAAGTA
TGTGCAATTTCCTGTTTTTATTGCTGCAGTGCATTCAAACTATGTGGATTGCACTAGATGG
CTTGGCAACTTCATCCTATCAAAGAGTGTGGACAATGAAGTAGTTTTGTGGGAGCCTTAC
AGCAAGGAGCAAAGTACTAGTGATGGAGTTGTGGACATCCTGCAGAAGTATCCAGTGCC
AGAATGTGACATTTGGTTTATCAAGTTTTCATGCGATTTCCACTATAATTCTATGGCAGTT
GGCAACAGGGAAGGAAAAGTTTATGTTTGGGAATTACAGTCAAGCCCTCCTAATTTAATT
GCCAGGTTATCACATGCACATTGCAAAAATCCTATTAGACAGACAGCTATATCTCATGAT
GGAAGTACCATTCTCTGCTGCTGTGACGATGGTAGTATGTGGCGCTGGGACGTGGTTCA
GTAAAATCAGCAATTTCCTTGCTGAATGTGCATTCGTATATGTTTCAGATATTTACACCTG
CACACTAATGACTTTGATTGTTATTAGCTTCTGGTTGATGTTGTGGAAAATCCTGGCTTTA
GCCAAGCTTTTTGTTTTGCCAGAATAGCAATCAGAGTTTTCTGTTTGCTAGAATACAGCAT
GCCCATTTTTATAAAGCTACACTTTGCGGGACATTTGAAACTAGTATGCTCCCCAGGGCC
TTGTGTTCATAAAGATGCCTTCAATGATTCAGTTGTTATTATCATCTTTAATGGAGGCAAT
GGTCTATTCACATCAGATAGTTGATGGCCACATGAGTTGTTTATACAAGTCGTTGTTTTAT
GAGAGAACCTTCTTCAGATGAATTAAAGGTATCAATATAACTATCTCAGTAATTTGAAAAA
CCTAGGATGTTGTTCTGGCAAAAAAAAAA
SEQ ID NO:226
GCAGCTGAAAGCGCTCAAAGCGCTCGCCATGGCAATTCCACAGTGAAGAAGACCAGAG
CAGGGAGTAATGGAGTCTGGGGCAGGAGGTTCAGTTGGGGCTCGCGTGCCGTCTGCGA
AGCCAGAGATGCTGCAACAGCCACCGTATTCTAACGGTGACGATGATAATGATATGGAG
CGTGGAACGGCACCCGTCCCGTCTTCGAATCCCAATACTGTCTCGAAATGGGAGCTGGA
CAAGGATTTTCTGTGCCCTATCTGCATGCAGACGATGAAGGATGCGTTCCTCACGGCTT
GCGGCCACAGCTTCTGCTACATGTGCATCATGACGCATCTCAATAACAAGAGCAATTGC
CCCTGCTGTAGCCTCTATCTCACAAACAATCAGCTCTTTCCCAATTTCTTGCTAAATAAGC
TTTTGAAGAAGACATCTGCATGTCAAATGGCAAGCACTGCTTCACCGGTAGAGAATCTCT
GCTTGTCGCTACAACAGGGAGCAGAAGTCTCAGTAAAGGAGCTGGATTTTCTCTTGACT
CTTCTTGCTGAGAAGAAGCGGAAGATGGAACAAGAGGAAGCTGAAACTAATATGGAGAT
ACTCCTGGACTTCTTGCAACGATTGAGGCAGCAGAAACAGGCAGAGTTGAATGAGGTGC
AAGCTGATCTCCATTATATCAAGGATGACATATTAGCATTGGAGAAACGAAGACTGGAAT
TATCTAGAGCTAGAGAACGATATTCTAGGAAATTACATATGCTTTTAGATGACCCAATGGA
CACTACATTAGGTCATGCTGCAATTGATGATGGAAATAATGTTCGCACAGCTTTTGTACG
TGGTGGCCAAGGTGATGCTATTTCAGGGAAATTTCAGCAGAAAAAGGCTGAAATCAAAG
CACAAGCTAGTTCTCAAGGGATGCAAAAGAGAGCTAATTTTTGTCATTCTGATTCCCAGG
TTTTGCCTACTTTGTCAGGATTGACAATCGCAAGGAAAAGAAGAGTCCTTGCACAGTTCG
ATGATTTACAAGAGTGCTACTTGCAAAAAAGGCGACGCTGGGCCACTCAATTACGAAAAC
AATGTGATGGTGGCTTGCGAAAAGAAAGGGATGGGAACAGTATTAGTAGAGAAGGCTAT
CACGCAGGCCTTGAGGAATTCCAATCTATTCTCACAACATTTACTCGTTACAGCCGTTTG
CGGGTCATTTCAGAACTTCGACACGGGGATCTTTTCCACTCTGCAAATATTGTATCGAGC
ATTGAGTTTGATCGAGATGATGAACTATTTGCAACTGCCGGTGTTTCCAGGCGGATCAAA
GTGTTTGACTTTGCAACAGTAGTGAATGAACCTGCAGATGTGCACTGTCCTGTTGTGGAG
ATGTCCACTCGATCTAAGCTAAGTTGCTTGAGCTGGAACAAGTGTATCAAGTCTCAGATT
GCCAGCAGCGATTATGAAGGCATTGTTACTGTCTGGGATGTAAATACTCGTCAGAGTGT
CATGATGTATGAAGAACATGAAAAACGGGCATGGAGTGTGGATTTTTCACGCACAGAGC
CGACCAGGCTCATCTCAGGGAGTGATGATGGAAAGGTCAAGGTGTGGTGTACAAGGCA
AGAAACAAGTGTTCTCAATATTGACATGAAAGCAAATATTTGTTGTGTCAAGTACAATCCT
GGATCAAGCTATTATGTTGCGGTTGGTTCAGCAGATCATCACATTCATTATTATGACTTGA
GGAACCCTAGTGTTCCATTATACGAATTTAATGGTCATAGGAAAACAGTTTCCTATGTTAA
ATTCATTTCAACAAATGAACTAGCTTCTGCATCTACAGACAGTACACTGCGCTTGTGGGA
TGTGAGGGATAATTGCCTTGTACGGACATTTAAGGGGCACACTAATGAGAAGAACTTTGT
GGGGCTTACAGTCAACTCTGAATACATAGCTTGTGGCAGTGAAACAAATGGGGTGTTTG
TGTATCACAAGGCTATTTCAAAGCCTGCTGCTTGGCATCAATTTGGAAGTCCAGATTTGG
ACGACAGTGACGATGATACATCACATTTCATAAGCGCTGTTTGTTGGAAGAGTGAGAGC
CCTACAATGTTAGCTGCTAATAGCCAAGGAACAATTAAAGTTCTTGTACTTGCACCGTGA
ATTTCAAGTTTCTCATATAATTCTTCAAGGCCCTTCCTAGATATCAGAGTTGGAGAGAATT
TTGAAATTTCTTAAATTAAGATTCTTTCTGGAAATTAGCTCTCTGGATTTCTTGCTGCCTGA
GAATGTTGAGAAATGAGGTTGATGTGGATTATATGCGCAATTTTCAATCTACTCATATTTC
TATAGTGCCATATGCTTGTCGGTTGTCATTGACCTCTAATAGAATAGCCAGAGTATCTTC
CAGTGATGCGGCATGTTATTGCATGTGCTATATGCCCCTCTTAACACTAATCTGAAGGAA
TATCGGCCTCAATTTGGCAAGTTCTTTGGTCTGGGGCGAGCCAACTGGTGAAGTCTAAG
ACTATGAAGGCTAAATTTTTTAAGGGTGTCAGAACTCGAGGAAAGATTCAGGAGATGGAG
AATGCCCAGGAGTAACATTTTGTACTGATACTAAATGTATTATATCTTGGTCTTTTATCGTT
CTTGGAATACTTGCTAGCTGTCATCACACATTTTTATGCTTGAACTACGAGGAATAGTGAA
GCGTTGCCCCAACTTATGGGCTTCCCTCTTCAACTATTATTTTGTACTGATGCTAAATGTA
TTATATCTTAGTCAGTCTTTTATCGTTCTTCAATACTCGCCTGCTGTCATCAAACTTTTCTA
GGCTTGAACTACAAGGAATAGTGAAGCGTTGCTCTAACTTGTGGGCTTCCCTCTTCAACT
ATTATTGGGATTTGCTTCTAGAGCCAAATAGTTGGACACTATCAGAGCTTAGAGAATTGTT
TATATCGGTTCCCTGTAAAAGTTTGTATTTGAGGGCAGTCCGTGTCCCTTGTCAATACTTA
TGAGTTGCAGCTCATAGATTTTTTCAAGTCCCAAGATCAGCAACACGTAAATGAACATGA
AATAAATATTGTAAAATTGTGTATTGGTAAATTTGTTGAAGAAATAACAA
SEQ ID NO:227
TTCCTTCCCGCTGAGTTGCTCGACTTCTACTAGGAAGATTTTAGGTATCCCGCAAAGTTT
AACGCATATCAGTCTCTCCTTATTCCCAATGAGTTGCTCGACTTCTATAAGACGATTTTCG
GCATCCCCCAAAGTTATGCGTTATGTTAACGCATATCAGTATCTCCTTGCTATCACAGTT
GGACGCCCAGAGGCAGCTTCCCATTAGTCGAAGCCGCCCTGGAAAAAAAACGGGTAGG
TTTTAAGGTGTTTTTTTTGTTTTTTGTATTACGAAAGTTTTTGAACTTCATATGGTTCCAGA
CCAACCTTGGATGAATTTAATCAAATTTCAAGTGAACGCGTCTCAGCCTCTCCCCGTGG
TCACAGTTTGACGCTTAGACACAGAATCTCTGCTAGTCCAAGCCACCCCGGAAAAAGCG
GGTTCAGCTTGAAGTTCGGTGGACCTCATCAAATAAATTCGACCGTCAATGAAGTTCTGC
GAATTTCCAATTTAATTGATTTTGTTCGCCGAACGCGGTCACTTATGCTCAAAGCACCAG
TTTAACCATCAACGGCAGAGTCAAAGGCAGCATCACAACCAATTTATATTTTATACCTAGC
AGATCCAATTGATCAAGTAGCATAAGAAGCGTAGGTAGTGGAGGAATAAACAAGGGTGA
AAAAACAAGTTATCGGGACTGTGAATACGTTAGCTGACCAAGTGTATCCATTACGAGTAG
ACCCAGTATCATTCATTGGAGATTGCGGTCCAGCAGTTTCATACCCTATTTTACGGGAAA
AAGCAATAATCCCTACTTACTGGAACAGCAGCATCAGGCCAAGCCCTAGATAAGCTTTCT
GAAGTATTTGGGTAATTGAAACGGTGCGTAGCCATGGCCAACTATGTGGACTCGAAGAA
GAATTTCAAGTGTGTTCCTGCTCTGCAGCAATTTTACACAGGGGGTCCATTCAGGTTGTC
TTCGGACGGTTCATTTTTAGTGTGTGCGTGCAACGATGAAGTAAAAGTGGTAGACTTAGC
CACCGGTTCTGTGAAAAATACGTTGGAGGGAGATTCGGAGCTTATTGTTGCGCTCGCAC
TCACTCCCGATAACAAGTATTTGTTCTCAGCAAGCCGCAGTACTCAAATCAAATTCTGGG
ATTTATCCTCCGCCACCTGCAAACGAACCTGGAAGGCACATAATGGTCCTGTGGCAGAC
ATGGCCTGTGATGCTTCTGGAGGATTGCTTGCTACAGCTGGTGCAGACAGAAGTATTCT
TGTCTGGGATGTGGATGGAGGCTACTGCACACATTCATTTCGAGGGCATCAAGGAGTTG
TAACTACAGTGATATTCCATCCGGATCCCCATTGCCTTTTGCTTTTTTCTGGAAGTGATGA
TGCAACGGTTCGAATATGGGATCTTGTTGCTAAGAAGTGCATTTCTGTGCTGGAAAAGCA
TTTTTCTACTGTGACGAGTTTGGCAATCTCTGAAAATGGTTGGAATTTGCTCAGTGCTGG
TAGAGATAAGGTTGTAAATATATGGGACCTGCGTGATTATCACTGCCGGGCAACAATACC
GACATATGAACCACTAGAAGCTGTTTGTGTTCTCCCAACTGGTTCTAGGCTGGTGTCTGT
TATGAACCAAAGCCGGGCATTGCCTGAAAATCGAAAGAAAAGTGGTGCGGCTCCAGTGT
ATTTTTTGACAGTAGGGGAACGTGGCATTGTTCGTATATGGTATTCAGAAGGTGCTCTGT
GCTTATATGAGCAGAAGTCCTCGGATGCAATAATTAGCTCTGACAAAGACGAGTTGAAG
GGAGGCTTTGTATCTGCTGTTCTACTACCCCTCACTCAAGGAGTGATGTGTGTTACTGCT
GATCAGCGTTTTCTCTTTTACAATCTGGATGAGAGCGATGAAGGAAAATGTGATTTAAAG
GTCAGCAAGCGTCTTATAGGTTACAATGAGGAGATAGTGGACTTGAAATTTCTTGGGGAT
GAGGAGAAGTTTCTTGCAGTTGCAACAAATCTGGAGCAGGTGCGCATGTATGACCTTTC
ATCGATGACATGTGTATACGAGTTGTCAGGACACACAGACATAGTTCTTTGTCTAGATAC
TGTTGTTTTTTCTGGACATAGTCTGTTAGCTTCTGGGTCCAAAGATCACACAGTAAGAATT
TGGGATACAGAGAGCAAATCCTGCATATGTGTAGCAGCAGGGCACATGGGTGCTGTAG
GAGCTGTTGCCTTCTCAAAGAAAGCTAAGAACTTCTTTGTGAGTGGGAGCAGTGATCGC
ACAATCAAAGTATGGAGTTTTGCTAGCGTACTAGATTTTGGTGGTATTAGTAAGTCAATCA
AGCTTTCATCACAGGCTGCAGTTGCAGCTCATGACAAGGACATCAATTCTGTGGCTGTTG
CTCCAAATGATAGTCTTATCTGTACTGGTTCTCAGGATCGAACAGCTCGTATATGGAGAT
TACCAGATCTGGTACCAGTACTAGTGCTAAGAGGCCACAAAAGAGGAGTTTGGTGTGTG
GAATTTTCACCTGTAGATCAATGTGTGATGACAGCATCTGGTGACAAGACAATCAAAATA
TGGGCATTATCGGATGGCTCTTGCTTGAAGACATTTGAAGGTCATACTGCAAGTGTACTG
CGAGCTTCCTTCCTCACTCGTGGGACACAGTTTGTTTCTTCTGGTGCAGATGGCTTGTTG
AAATTGTGGACAATCAAGTCCAATGAATGTATTGCAACATTTGATCAGCACGAGGACAAG
ATCTGGGCAATGGCTGTGGGCAAGAAAACAGAAATGCTGGCAACAGGTGGCAGTGATTC
CCTTGTGAATTTGTGGCATGATTGCACTACCACAGATGAGGAGGAGGCTCTTCTCAAAG
AGGAGGAAGCGGCATTGAAAGACCAGGAATTACTAAATGCTCTTGCAGATACCGACTAT
GTGAAAGCAATTCAACTTGCATTTGAATTAAGAAGACCTTATAAGCTCCTTAATGTCTTCA
CTGAACTGTACAGCAAAGGACATGCCCAGGACCAGATACAGAAAGTGATACGTGAACTA
GGAAATGAAGAGCTAAGATTACTTTTAGAATATGTGCGAGAATGGAATACAAAGCCAAAA
TTTGCCCATGTTGCCCAGTTTGTATTATTTCAACTTTTCAACGTCCTTCCTCCAAAGGAGA
TCATTGAGGTACAAGGTATTAGTGAACTTTTGGAGGGTCTTATACCATATGCACAGCGAC
ATTATAGCAGGATAGATAGACTTATGAGAAGCACTTTTTTGTTGGATTACACTCTTTCATC
AATGTCAGTTCTCTCTCCCACGGAGACAGATTTGTCATCATCTAATTTATTAGCTAGAACA
GCTGATCCACTGCATGCACAGATTGATCAGTTTCACCCAACCCATTTCCCTGAGCCAAAT
TTGACTCCTATACAGTCTCTTTTAGATTCAGGCAATACAGATTCTGTGGAAGTCACAGCA
CGACGAGCCAAGAAGAAAAGAGTGTCAGGTAATGATTCAGAGAAAACAACTGTTGCTGA
AGTTAAAATTGGTGACATGGAGAATGCATTTGATGAGCCTGACGTGGCAGATCAAGGTT
CATCCAGGAAGCATAAACCTGCGAGTTCAAAGAAACGGAAGTCTATAGCCGTTGGAAAT
GCCAGTATTAAGCGTATTGCAAGTGGAAATGCAGTCACTATTGCATTGCAAGTATAAGGC
CTTGTTGATGAAGTTTGATCTACCATTGGTTTTGGAATTAATGTGAACTCTGTAGCAGGAT
TGCTGAGCAGGATTTAGAGGAACGGCTGTATGTTAACTTGAAATTTGTGTGTCGGTCAAG
GGTTGCGGGTGCAACATAGAGAAATGTGTGCCAACATTTGGTGTACCGAATGTCTTCAAT
CTGTAAGATTTTCACGGCAGTTGAACTAGTCATAGTGGAATATTATTTAAATGGTGTATTC
TAGTCACATTATTTCCAATTTTGATAGTATCTGGGCATCTGCTCATTAATTCAAAAAAAAAA
SEQ ID NO:228
CCACATCAATTGACCTGGTTTGTCTGATTGTGCAGATAGCTTTGCTCCGGGGCAAGGAG
AACGCCAAGAGGACACCAGGGCTTCTGATACTTGGCTCGCTCGCTAACTTTTTGGAGAA
CCAGACGTCCTCATACTTTGAGGAACCTAAGCTTTTAGAATTGGAATTCATTTGGAACTA
CTGTTTTTTTTTTTTTCGTTTTGCAGAACCAAACATCCTCATTCTTTGAGGAACCCTAGCTT
TTAGAATTAAAATTCCTTTGCGACTTTTGATTTTTTTGGTTTTGCGAGTTTAAATTTAGGCA
CATTGATTGTTTGTCATATTTTGTGTTTTGTCTGGTTATTTTACTTGAGTAGGAGTAATGGA
ATCGTCTTGCAGTTCAATGAATTCCAACAGGCATTCCACAGAGAAGCGATGTCTACGTCC
TCTGCAGAAACAGGGTGCTTCAATGAATAAACACAGCAGCGATAGATTTATTCCGGCAAG
AGGATCCATCGATTTGGACGTAGCTAGATTCATGGTCACGCAAAAGCAAAAGGACAATAA
CGATATCCATGCCCTCTCCCCTTCCCCCTCCCCTTCCAAAAAAGCATACCAGAAGGAGAT
GGCAGACACTTTATTGAAGAATGCCGGTGCTGCTGACAACAATTGCAGAATTCTTTCTTT
CAATGGAAAATCATCGACTGTTTCCCAAGGTTCACAAGAAAATGTGTTGGCTAATCTCTC
TATTTCTAGAAGAGCACGCAGATATATTCCTCAGTCTGCAGATAGAACCCTGGATGCGCC
CGACCTTCTCGATGATTATTACTTGAATCTATTGGATTGGAGTTCCACCAATGTTTTGTCC
ACAGCCCTGGGCAACACTGTATACTTGTGGGATGCTTCTAATAGCTCTATATCTGAGCTT
CTGATTGCAGATGAAGAAGAGGGTCCCGTTACCAGCGTTAGTTGGGCACCCGATGGGA
GCCAAATTGCCGTTGGGTTGAATAATTCTGTTGTACAGTTATGGGATTCGCAATCTAATA
AAAAGCTAAGAGCCTTGAAAGGTCATCATGACCGTGTTGGTGCACTTTCTTGGAATGGTC
CTATTCTCACCACAGGAGGGCTGGATGGGATTATAATCAATCATGATGTTCGCACCCGT
GACCACATTGTTCAAACATACAAAGGACATACCCAGGAGGTTTGTGGATTGAAATGGTCT
CCTTCCGGCCAACAACTTGCAAGTGGGGGAAATGACAATCTACTCTATATTTGGGACAA
GAGCATGGCATCCCATAATCCGTCCTCTCAATACTTCCACCAATTAGATGAACATTGCGC
AGCAGTCAAGGCCTTGGCATGGTGCCCCTTCCAGACCAATCTTTTGGCATCTGGTGGAG
GGACTTCAGATGGTTCTATAAAGTTTTGGAACACTCAAACAGGTGCCTGCCTGAATACCG
TCGACACCCATTCCCAGGTCTGTTCATTGTTGTGGAATAGACATGAGCGTGAGTTATTGA
GTTCCCATGGGCTAAATCAGAACCAGTTAACCTTATGGAAATACCCATCCATGGTGAAAA
TTACTGAACTTACAGGGCATACTGCTAGAGTTCTTCATATGGCACAGAGTCCTGATGGGT
ACACAGTTGCATCAGCTGCAGCAGATGAAACACTCAAGTTTTGGCAAGTTTTTGGGGCTC
CTGATGCCTCAAAGAAGACTAAGACTAAAGATACAAAAGGGGCTTTCAACATGTTTCATA
TGCACATCCGCTAAGAAGTATCTGTCAAATTTCATGGAATCAGGATGGATGTTCAACTGA
AGAGGGGCTTTCAGCAAGTTTCACGCTCACATCCGCTAAGTATTTGTCCAGATTCATGGA
ATCAGAGCGGATGTTCAACTCAAGGTAGTAATTAGATGTTTTATTTTCTGCTTGCAAAGGT
CTTTTGCAGGCGCTCTATAGTTCTGTTCTCTAGCATGAAGTGTGTATTTTATCTATTGTGG
ACCTTGATGGAATGATGTCGCAACAGCCTTGTGAGACAATGATCCATTTCTTCAGCTGCG
TGATAAGATTTAAAGTGGTTTTAGCAAGTCTTTGGAGCTTCAAAGACGGTTCCAAAAATG
ATACTTTATGGCTGTTAGCAAGCATCTGATTCACAACTATAAAAAGGCCTCCATGCTTCTA
CACATGGAATGATAATGAGGGGATGCTCAATGTTGCAGAAGATCGTTCAAGGTGCCTTTA
ACGAACATGGGATCTTGCCAAGGTGCCTTTAAGGAACATGGGATCTTGCCAGCATTCTA
AGATGTTCGAGAATGGGTTCATACAGAGAAGAGCAGCATGGTTTGTCAATTGCATTTTTT
TAATATCAAGATCTTAATGTACGCTCTTTTGTTATCCATTCTGTACTTAGCTAGTTGCACTA
CAATTTTCCAGCAAAAAAAAAA
SEQ ID NO:229
TGAGTTTTCAAGCAACTGTTAGGCCTTATACTTATGTTGGGTGCCGGAGTCTAGTGGAGT
TATTTTGGCGAGATACAGGGGAAGAGCGCAAAAACAAGTTAGGCGGCACTGTGATGGTG
TGGTGGCTGTATGCAAGACACTAGAAGAAGAAGAAGATCGAGAGAGGAGAGCAACGGC
AACGAGAATGCTAGACGAAATTGTGGCAGACGAGGAGGAGGAATTCAACATATGGAAGA
AGAACACCCCCCTTTTATACGATGTAGTGATAACGCATGCGCTGGAATGGCCTTCCCTCA
CGGTACAGTGGCTCCCCGACCGCCACCAATCTCCCACAAAGGACTACTCTCTGCAGAAA
ATGATAGTGGGCACTCATACATCCGGGGACGAGCCCAATTATTTGATGATAGCAGAGGT
CCAGATGCCCCTGCAATACTCTGAAGACGGTAACGTTGGAGGTTTTGAATCCACCGAAG
CCAAGGTACACATAATCCAGCAGATTAATCATGAAGGAGAAGTTAATAGGGCTCAATACA
TGCCACAGAACTCATTTATTATTGCAACCAAGACAGTGAGCTCAGATGTTTATGTGTTTGA
TTACACTAAGCATTCATCAAATGCCCCTCAAGAAAGGGTTTGCAATCCTGAATTGATATTG
AAGGGTCATACTAATGAAGGATACAGTTTGTCCTGGAGCCCTCTCAAAGAAGGTCAACTA
CTAAGTGGTTCAAATGATGCGCAAATATGCTTCTGGGACATTAATGCTGCATCTGGCAGG
AAAGTGGTTGAGGCCAAGCAAATATTTAAGGTCCATGAGGGGGCAGTTGAGGATGTTTC
GTGGCATTTGAAACATGAATATCTTTTTGGATCTGTTGGAGATGATTGTCATTTGCTTATA
TGGGACACACGTACGGCTGCACCTAACAAGCCTCAACATTCAGTTGTAGCTCATGAGAG
TGAGGTTAACTCTTTGGCTTTTAATCCCTTCAATGAGTGGCTTTTAGCAACAGGATCAGC
GGACAAAACTGTTAAATTATTTGATCTCCGGAAACTCTCTTGTTCACTACATACATTTTCA
AATCATACGGAAGAAGTCTTTCAAATAGAGTGGAGTCCTATGAATGAGACAATATTAGCT
TCGTCTGGTGGTGACAGAAGACTCATGGTTTGGGACCTCAGAAGAATTGGCGACGAGCA
AACATCAGAAGATGCAGAAGATGGACCACCTGAATTGATTTTCATTCATGGAGGGCACAC
AAGCAAGATATCTGATTTCTCATGGAATCTCCATGATGATTGGTTAATTGCTAGTGTAGCA
GAAGATAATATTTTACAAATTTGGCAGATGGCAGAAAATATTTACCATGATGATGCGGATA
TTCTATGATGAAATTTCATAACTCTTACCTACTATGGTTGGTGTGCCTTGGAGAAAATTTC
CAATGCCTTCATGGGTGTGTAATCCATGATAATTACAGGAACATTGTGGATCAAATATTGT
AATTTGTCTTTAATCTTCAGGGTTCGTTACTAACAATTGAGCTCAAATCTCTATTCTGACC
AGCAATCCAGAAGTTGGATGCATTAAATATAAGGTGGATTTTAGTACCTTAAGTTAATGTA
TTTTATAAAATTATGAAGCACTCTGGACATCTGCTCTGTTCATCAAGGCAGACATCACAAG
GAAGTAGAAATTGAAGTTATGTTTCAAAAAAAAAA
SEQ ID NO:230
CACAAGCACAGCAGTGTGCGTTAAAGAAAGGGCGGTTTGTATTGTATAAATGGTAAATCC
GAGCGCAGAGGATGGAATGGCTACAGGTATTTATGCAGTATGCAGTGAACAGAGATGGC
AGGCTGGCCGTGATATTTTTGTGTTTATACCACATGGTCTGAGGGCGATTGATTTTGTCA
GCTGATATCTCTCAGGGCCCTGAGCCTGCCTCCCTGCCTGGCCGCCTGTGTATGAATAT
CAATGACGAAAGAAGATCATGGGGAGTCGAGGGATGAGATGGGGGAAAGAATGGTGAA
TGAGGAATACAAATTATGGAAAAAGAACACCCCATTCCTGTACGATTTGGTCATCACGCA
TGCTCTGGAGTGGCCTTCTTTAACTGTTCAATGGCTGCCTCCCAGCTGCAAGCAGCAAC
AGGACATTATCAAGGATGATGACATTGATCATCCCAACACCCAAATGGTTATTCTCGGTA
CCCATACATCAGATAACGAACCCAACTATCTAATTTTAGCTGAGGTCCAGCTCCATGATG
GAACTGAGGATGAAGATGGCGATGGCGATGTCAAACGGCCCCAGGATAAAATGAAACC
GGGAACCTCTGGAGGTGCAATGGGGAAGGTTCGAATACTTCAGCAGATAAACCATCAGA
AGGAAGTGAACCGGGCGAGATACATGCCTCAAAAACCCACAATCATTGCAACAAAGACT
GTGAATGCGGATGTTTATGTCTTCGATTACAGTAAACACCCGTCCAAGCCTCCCCAGGA
GGGACGTTGCAATCCGGAACTGCGGCTTCAAGGTCACGAGTCTGAAGGCTATGGCTTGT
CTTGGAGCCCTCTCAAGGAAGGTCATCTGTTGAGTGCTTCAGATGATGCCCAAATTTGC
CTCTGGGACATCACTGCTGCGACCAAAGCCCCCAAGGTGGTGGAAGCCAACCAAATATT
TCGGTACCACGATGGACCAGTGGAGGATGTAGCATGGCACGCAATTCATGATCATCTTT
TTGGGTCTGTAGGGGATGACCATCATTTGCTTCTATGGGATATCAGGAACGACTCTGAG
AAGCCACTTCATATTGTTGAAGCTCACCAAGCTGAGGTTAACTGTTTGGCTTTCAATCCTT
TCAATGAATGGATTGTGGCTACAGGCTCTGCAGACAGAACTGTTGCACTCCATGACATTC
GCAAGCTGGACAAAGTTCTTCATACTTGTGCACACCACATGGAAGAAGTGTTCCAAATTG
GTTGGAGTCCCCAAAATGGAGCTATATTAGCATCATGTGGATCGGATAGACGACTCATG
GTTTGGGACCTCAGTAGAATTGGCGATGAGCAAAATCCAGAGGATGCTGAGGAGGCAC
CTCCTGAACTGCTATTCATACATGGAGGACATACCAGCAAGATTTCAGACTTTTCATGGA
ATCCAGCTGAGGAATGGGTAATTGCCAGTGTTGCTGAAGACAACATTCTTCAAGTTTGGC
AGATGTCAGAGCACATTTACAATGATGACAATGATTCACCAACAGCTTAATAGGTTTGTT
GCTGTTTGGACCAGCTAATTGATTGTTTACAGGACCTCGGAAAAATGTGGAGCCTCTTTA
TGCCATATTCCATGGCACAGGGTTGAAGAAGCCTGAATATCAGCAGAAAGCGAGCTTGT
ATATTGCAGCGATGTTAATGATTCTCTTTTATAAGAGGGAAAAAGTTTATAAGCACTTGTG
AAAATTGTCTTGGACGTTTAATTTCTCTTGTGAAAGTATGGAGCTTCTCATCATTTATAGA
GTTGTGCAAAATCACCCATAATGCTATGAATTGACAGGTGACTGTAATCTTACAATTTTCT
GATGCAATTTGAAGATGGATTATGCAAGTCTTTTATCAAAAAAAAAA
SEQ ID NO:231
TTAGTAGAAGTCAGGCTTAGCTTAATAGTAGCGGAAATAGGAGCTAAACTGTGAGACCC
GCCGGTTAGTATGCAGATACCACTTAGGGATTCCACTTACGAGTCCATTGGGGTTATTTT
GGCGGGAGACAGTCGAAGAGCACAAAATCGAAAACAAGTGGTATATTGTGGCTCTGCAA
TGGTGTGTGCAATAGAATAGAGAAGAAAAATAGCAGAAGTCAGAGGAAAGCATGGCGAT
GGCAATGGGAGACGAAAATGCGGCCGATCCAGTTGAAGAATTCAACATATGGAAGAAGA
ACACACCCTTCTTATACGATCTAGTGATAACTCATGCGCTGGAATGGCCTTCCCTCACTG
TTCAGTGGCTCCCCGACCGCCACCAATCATCCACGGCGGACTACTCCCTGCAGAAAATG
ATAGTGGGTACTCATACTTCAGAGGACGAGCCCAATTATTTGATGATAGCAGAGGTCCA
GATTCCCCTGCAAAACTCTGAAGATAATATCATTGGAGGTTTTGAATCCACCGAAGCTAA
GGTACAAATAATCCAGAAGATTAATCATGAAGGAGAAGTTAACAAGGCTCGCTACATGCC
ACAGAATTCATTTGTCATTGCAACGAAGACAGTGAGCTCAGATGTTTATGTGTTTGATTAC
AGCAAGCATCCATCAAAGGCTCCTCAAGAAAGGGTTTGCAATCCTGAATTGATATTGAAA
GGTCATTCTAATGAAGGATATGGTTTGTCATGGAGCCCTCTCAAAGAAGGTTACCTACTA
AGTGGTTCAAACGATGCACAAATATGTTTGTGGGATATAAACGCTGCATTTGGAAAGAAA
GTGCTTGAGGCCAATCAAATATTTAAGGTCCATGAGGGGGCAGTTGGGGATGTTTCATG
GCATTTGAAGCACGAATACCTTTTTGGCTCTGTTGGAGATGATTGCCATTGCTCATATG
GGACATGCGTACAGCTGCACCTAACAAGCCTCAACAATCAGTTATAGCTCATCAGAGTG
AGGTTAACTCTTTGGCCTTTAATCCGTTCAATGAGTGGCTGTTGGCAACAGGGTCAATGG
ACAAAACTGTCAAATTATTTGATCTTCGGAAACTCAGTTGTTCGTTACATACATTTTCCAAT
CATACGGACCAAGTCTTTCAAATAGAGTGGAGTCCGATGAATGAGACAATATTAGCTTCT
TCTGGTGCTGACAGAAGACTCATGGTTTGGGACCTTGCAAGAATTGGCGAAACACCAGA
AGATGAGGAAGATGGGCCACCTGAGTTGCTTTTTGTTCATGGAGGGCACACTAGCAAGA
TATCGGATTTCTCATGGAATTTGAATGATGATCGGGTAATTGCTAGTGTAGCAGAAGATA
ACATTTTGCAAATTTGGCAGATGGCAGAAAATATTTACCATGATGATGAGGATATGCTTTG
AAGAAATTTCATAATTCCTACCTACTATCGTTGGTTTTCTGTGGAGAAAATTTCCTATCCC
TTTGTGGGTGTGTGAAAAACGAAATATAGAGGAACAATGTGGATAATGCATTGTAATTCA
TTTTTTGTATTGAAGAACAATTAATCTCAAATCTTTATCTCCAATGATTCTCTATTTTCTGTC
CGGATTAAGTATAAGGCAGATTTCAGTATATAAGGTCAATGTATTTATAAAAGTATGAAAC
AGACTGGACATTTG
SEQ ID NO:232
CAGGAAACCTTTACCACGACCAACTCCAGCCAGCTAACAATTTCCGACTGTGTACACGG
ACACAAAGTTTATTGCAAATATCTTCCCACGATCTGTAGAGATGGGATTGTTTGAGCCCT
TCAGAGCACTGGGATACATCACTGATGGCGTTCCATTCGCTGTTCAAAGGCGTGGGATT
GAGACTTTTGTTACCCTGAGCGTTGGAAAAGCTTGGCAAATTTATAATTGTGCCAAGCTG
ATTCCAGTCCTTGTAGGACCTCAAATGGATAAAAAGATAAGAGCACTTGCATGTTGGCGA
GATTTTACATTTGCTGCCACTGGGCATGACATTGCAGTGTTTAGGCGAGCTCATCAGGTG
GCTACATGGAGTGGACACAAGGCAAAAGTTACATTGCTTCTTTCTTTTGGGCAACATGTT
TTAAGTGTTGACTTAGAAGGATGCCTTTTTATATGGGCTGTTGCAGAAGTGAATCAAAAC
AAACCACCAATTGGACAAATACAACTTGGGGAGAAGTTTTCTCCAAGTTGTATCATGCAT
CCAGACACTTACCTAAACAAGGTCCTTATTGGGAGTGAAGAAGGTACCTTGCAACTTTGG
AATGTGAATACACGTAAAAAGCTTTATGAGTTTAAGGGCTGGGGGTCTTCCATTCGATGC
TGCGTCTCATCTCCAGCTCTAGATGTGGTTGGAATTGGTTGCTCAGATGGCAAAATCCAT
GTCCATAATCTACGATATGACGAAGAGATAGTGACATTTATGCATTCAACAAGAGGGGCT
GTTACTGCTTTGTCTTTCAGGACAGATGGGCAACCTCTTTTAGCAGCTGGAGGTTCCTCT
GGGGTGATAAGTATATGGAACCTTGAAAAGAAAAAGTTGCAGTCTGTAATAAAAGATGCT
CATGATTCTTCAGTATGTTCTCTTCATTTTTTTGCAAATGAACCTGTTCTGATGAGTTCTG
CAACAGATAATTCAATCAAGATGTGGATCTTTGACACAACTGATGGAGAAGCGCGACTCT
TAAAGTACAGAAGTGGCCATAGTGCACCTCCTATGTGTATAAGGTATTATGGTAAAGGGC
GGCATATTTTATCTGCTGGACAAGATCGAGCTTTCCGGATTTTCTCTGTTATACAGGATC
AACAAAGTAGAGAACTTTCACAGGGTCATGTTGGAAAACGAGCAAAGAAATTAAAAGTGA
AGGATGAAGAAATCAAACTGCCTCCAGTCATTGCTTTTGATGCAGCTGAGATTCGTGAAA
GAGACTGGTGCAATGTTGTCACCTGCCATTTGGATGATCCTTGTGCATACACATGGCGT
CTTCAGAACTTTGTAATTGGTGAACATATTTTGAAGCCATGTTTGGAGGATCCAACACCA
GTGAAGTCTTGCAGTATCAGTGCATGCGGCAATTTTGCAGTGCTTGGAACAGAAGGAGG
GTGGCTTGAGCGGTTTAATCTTCAATCGGGGATTAGTCGAGGGACATATATTGATATAGG
AGAAAAACGACAATGTGCACACAATGGAGCAGTGGTTGGTCTGGCATGTGATGCTACAA
ATACCCTTTTAATAAGTGGAGGCTATAACGGAGACATTAAGGTTTGGGATTTCAAGGGGC
GTGAGCTAAAGTTCCGATGGGAAATTGAGGTCCCATTAATTAAGATCGTATATCACCCAG
GAAATGGTATTCTAGCAACTGCAGCAGATGATATGATTCTTCGTTTATTCGATGTCACTG
CCATGCGACTTGTGCGCATATTTGTTGGGCACATGGACCGTGTCACTGATTTGTGTTTTA
GTGGAGATGGGAAGTGGTTGTTGTCATCTAGTATGGATGGGACCATAAGAGTTTGGGAT
ATAATTTCATCCAGGCAACTCAATGCAATGCACATGGATTCAGCTGTAACGGCTTTGTCA
TTGTCACCGGGTATGGATATGCTGGCAACTACTCATGTTGGTCACAATGGCATCTATCTC
TGGGCAAATCGGATGATATATTCAAAGGCTACTGACATTGAGCCCTTTATAAGTGGGAAG
CAAGTTGTGAAGGTTTCTATGCCAACTGTCTCATCCAAAAGAGAATCGGAGGAAGGGGA
TGAGAAAAGAACAATTGTTGCAGAATCAAATGTCAATAAGTCTGATGTTTCTGGTAGCTT
GATTGGGGATTCATATTCAGCTCAACTTACGCCTGAACTTGTGACGCTTGCACTGCTGCC
TAAAGCTCAGTGGCAAAGTCTTGTCAATTTAGATATTATAAAGATGCGAAATAAGCCCATT
GAACCACCCAAGAAGCCAGAGAAAGCTCCTTTCTTCTTGCCATCATTGCCTACACTCTCT
GGAGAGAGGATTTTTATCCCTAGCTCTATGAATGGGGATGGTGATCAAGATGAAACAAG
AAATGATAAAACTGTTTTTGAAGCAAGGGGTAAAAAACTGGGAGGAGAATCTTTATCATTT
ATGCAACTTCTACAATCCTGCGCAAAGATCAAAGATTTTACAACTTTCACAAATTATTTGA
AGGGTTTATCCCCCTCAGCCGTGGATATGGAATTGCGGTTGCTTCAAATTGTCGATAATG
AAAATATATCAGAAACAGAACACAGTGTAGAACTTCAAGGGATAGGAATGCTGCTGGATT
ACTTTGTAAATGAGGTGTCATGCAATAATAATTTTGAGTTTGTCCAGGCTTTGATTCGACT
ATTTTTAAAGATACACGGTGAAACCATTCGATGCCAAGTATCTTTGCAGGAGAAGGCAAG
GAAGCTACTTGAGATCCAAAGTTCTACATGGGAGAGGCTAGATACAAGCTTCCAAAATGC
GAGATGTATGATTACATTTCTTAGCAGTTCTCAGTTTTGACCAATCATTTATTTGCAGTGT
AGTTGATATGAAGGGAGAAATATGACAGTTGGTTTCAACCTGCTTGCCTATCCCTCCAAA
GAATATATAGTGTAAATTGTCTGTTGCAATAGGGATGGAGCACTAAGTGCTTACTATGGC
AAATGCTATCAATTTTTGAGGGCTTGTAATGGCATTATTTCCTTGAGTTGTTCTTGTAGGT
TGTTGAGTTACCAGTTTTGATCCTTTGATCTTTTGTTGGTTAGCTGAGGCTAATGGCGAG
GCCATGTTTTTCAGTTAATGTAAAATATGATATCTTGTGTATCATGTGACACATTTCAATTC
AGTTATTGGCTTGCAACTTCTAATTACACTCAAAAAAAAAA
SEQ ID NO:233
CGAGGACGTATCGATCAAAAAATCTCTATCATAAGGTGCAGAATGATTGCAGCAGTGTGT
TGGGTTCCTAAGGGTGTTGCAAAGGTCCTACCAGATTCTGCTGAACCGCCTACTCAAGA
GGAGATTCAAGAGTTATTGAAGTGTAACGTTGTAGCAGAGAGTGATGATAATGAAGATAG
TGATGAAGAAAGTGAAGAGATGGATACTGAAACTGATAAAAATACCGACGCAGTGGCAA
AGGCATTAGCAGCAGCAAATGCTCTTGGGAGTCAGTCTTCAGATTTTCAAAGGCAGCAC
AAGGTTGATGATATTGCTAATGGCCTCAAAGAACTTGACATGGATCATTATGACGACGAG
GATGAAGGCATTGACATTTTTGGAAGTGGTTCTCTTGGTAATTGTTACTACCCAGCCAAT
GACATGGACCCTTATCTTGTAGAACAAGATGATGATGATGAAGATGAAATCGAAGACATG
ACAATTAAACCTTCTGACTTGATTATCTTATCAGCTCGTAATGAAGACGATGTTAGCCATC
TTGAGGTCTGGATATATGAAGAAGAAACTGAAGAGGGTGGTTCCAATATGTATGTTCATC
ATGATATCATCTTGCCAGCCTTTCCACTTTCTCTTGCTTGGCTTGATTGTAATCTGAAAGG
TGGAGAAAAAGGAAATTTTGTTGCTGTTGGAACAATGCAACCTGAAATTGAGCTCTGGGA
CCTTGATGTACTTGATGAGGTTGAACCTGCTGTAGTCCTGGGTGGTGCTGTTAAAGATGA
AGCATCTGGTAAGACGACCAAACTGAAGAAAAAGAAGAAAAATAAACAGGCTGTCAATTT
TAAAGAGGGAAGCCATACGGATGCGGTTCTTGGTTTGGCATGGAATATGGAGTACAGGA
ATGTCTTGGCTAGTGCAAGTGCTGACAAATCTGTAAAGATCTGGGATATTGTTGCTGAGA
AATGTGAGCACACAATGCAACCTCACACAGATAAGGTTCAAGCAGTAGCCTGGAATCCT
AATCAGGCAACAGTTCTTCTCAGTGGATCTTTTGATCGTTCTGTGATCATGATGGACATG
AGGGCTCCCACACACTCAGGGATACGGTGGCCAGTTCCAGCAGATGTAGAAAGCCTTG
CATGGGATCCTCATACTGATCACTCATTTATGGTTAGCGCTGAAGATGGCACTGTTCGAG
GTTTTGATATTCGTGCAGCAGCATCTACTGCAGATTTTGATGGCAAGCCAATGTTCATAC
TTCATGCACATGACAAGGCTGTCTGCGCAATTTCATACAACCCTGCTGCCCCAAGTTTAC
TCACAACAGGATCAACAGATAAGATGGTAAAGTTGTGGGATATAACAAATAATCAGCCTT
CATGCATTGCCTCAACAAATCCAAATGTTGGAGCTGTGTTTTCTGCAGCTTTTTCAAAAAA
CAGTCCGTTCTTACTTGCTACTGGAGGCTCAAAGGGAATTTTGCATGTTTGGGATACCCT
AGATAATTCTGAAGTCGCGAGAAGATTTGGAAAGTTCAGACCACAGAATTGAACTTTCAT
ATTTCTATTCAGTTTTGTTCGAAGAATAATATCGATAAAATTTATTTTCATATTCTCCAGAG
GTGGGATTGTGTATTATAATGTGTTAATACTTGGTCATTTATTTGCGGCTTTTGTTGGCAC
ATTGTTGTGCACATTGCTAAATTTCTACGAGCCTTCGGGGACAAGTGTCTAACTGTCCCA
TCAGTAACATTTCTCGAAGTTTTGCTAAAAGTTAATGTTCTCATAGGTTATTCAAAAAAAAA
A
SEQ ID NO:234
CCAAATTCCCGCAATCTCACATTTGAAACCTGTGCTGCCCTCCATTTTATGATCTCAATTC
AACTCGAGTCTAACCTGAGCCAAACTCGATCTCTGCGCATAAAAAACTAGATTCTCCGGG
TTTCATTACCCTTTCCTCGGGCATAAATTCATTTGGGTATTTGAATATTCACCTAGGTATT
CAACATTTCTATTGGGTAGTAAAATGATTATGGATGAGAACGAGTTCTGTGATATTTTCTC
CCTGAGAAAAAGGCTATGCCTGCTGTCTTCCCAGGAGGGGGAGGAAGAAGAAGAATTA
GAGGCAATGTCACAGCTCGATGCCGGAGAATTCACCGTGACAGGCAATGAAGAGGTAG
TTGCCATTGCCGAAGATGATGTAAACACTGGAATCCTCAGTCAGGATCTTTTCAGTTCAC
AAGACTATTGTACACCATCCCAACCACAGGATTCGACCGATTTGGATAGCAAGGATAAG
GCTCCATGCCCACTATCTCCTGTGAAAAGCACGATTCAGAGAAAGAGGTGTAGACCAGA
GCTCCTAAGTAATCCACCTGATAGCATCCAGTTCTCCTTTCAACGATTGGAAAGAGTTAG
AAGTGAAGAATCTATCCAGTCCTCCTCTCAACAATTAGCAAGAGTTAGAAGTGAAGTATC
TAGTTCAGATGATTTCAAAACTCCAAAAATAACCGCATCTGGTCAAAAGAATTATGTCTCA
CAGTCGGCATTGGCTCTACGGGCTCGTGTCATGTCACCACCTTGCATTAAGAATCCATAT
TTAGATGAAAATGAAGAGTTAAATGAAAAGATTCAAAGAAGCACTCGTCGATCTCCCGCT
TGTGTAACTCCTATCCAAAGTGGAGCCTGTTTGTCCCGGTATCGTGCAGATTTTCATGAG
CTAGAGGAAATTGGTCGTGGAAATTTCAGTCGTGTTTACAAAGCTCTGAATAGACTTGAT
GGCTGCTGTTATGCAGTGAAGTGTTCACAAAGCGAGTTACGACTTGACACTGAAAGAAA
GGTGGCATTGATGGAAGTTCAATCGCTGGCAGCCCTAGGACCGCACAAAAATATTGTGG
GATATCACACTGCCTGGTTTGAGAATGATCATTTGTACATTCAAATGGAGCTTTGTGACC
ATAATCTAACAACTGCAAATGACAGAGGAATTCTAAGAACAGACACGGACTTTCTTGAAG
CAGTGTATCAAATTGCTCAGGCATTGGAGTTTATCCACGGGCGTGGAGTTGCACATCTA
GATGTCAAACCCGAGAATATTTATGTTCGTGATGGTACTTACAAGCTTGGAGACTTTGGT
CGTGCTACACTTATTAATGGAACACTCCATGTCGAAGAAGGAGATGCACGGTATATGTCC
CGTGAAATCCTGAATGATAATTATGAACATCTTGACAAGGTGGACATGTTCTCATTGGGT
GCGACCTTTTTTGAACTTTTGATGAGAAAGCAATACCCTGGTTCTGGCAAACGGATAGAC
AGGGACACAGAAATTAAGATCCCAATTCTCCCTGGTTTTTCTATATATTTTCAAAAGCTTC
TACAGGATTTGGTCAGCAATGATCCGGGAAAAAGACCATCTGCTAAAGATGTACTCAAAA
ATCCTATATTTAATAAGGTTCGGGGAGCAAAGGAAGTCTAAGGATATGGACGAAGGATG
AATTAATTTTGAAAATTTTGACAATTCTGCAGGATGGATACTTTTGTCAAGATACATGCAA
AATTCACTTTCCTCGTGGAACGAGGATCGACTTTGGTGCTATACGGAGGACTTGCGATAA
TGCAAAACTTATAGCTGTTACTTAAGGATTTCATTGCAACTAAAAATCTTTGGCTGTAAAC
AATCAAGTAATGATAGATGTGATATGCTGGAGATTTATATTTTGGACGGAAGAGTGGAGT
TATTAGTTTGCAATATTTTTGTTAAATATGCATGGTGAACAGGCAAGATCACTGTGCTTCC
TCATAAATTGAGGCTTGCCTACGTTAATTGTTATATATGGAGAGCCATGCTAATTGTTATA
TATGGGAAGCTCTATCCCAATTGTAATTTTGTAATTTTCAATCACCTTCTTTGGTGGTATTT
CAGTTCTTGCTGTAATCACGACCTTCTGTGCTGATACAGTTTATTAATCAAATGTAATCAT
CACCTTCTGTAGTGATGTTCATGTTAAAAAAAAAA
SEQ ID NO:235
GCTTTGCTTCCAGCACAGTGCAGAGGGGCAACGCAGCACAAAGCTGAACCGCCAAAAC
CCTAATAAGCATTTCCAAGCCACCGTCCACCGGAGGATCGGAGCGGCAAAGCAGGTCG
ATGCTGGCGCCGGCGCTGGAGATGGAGCCGGTGGAGCCGCAGTCCCTGAAGAAGCTC
AGCTTCAAGTCCCTCAAGCGCGCCCTCGACCTCTTCTCCCCCGTCCACGGCCAGATCGC
TCCTCCCGACCCCGAGAGCAAGAAGATGCGCATCAGCTACAAGCTTAATTTCGAGTACG
GTGGAGGTAGCGGGAGTGAGGACCAGGTTCCAAAGCGGAAGGAGAGCGGCGCCGCGC
AGAATCAAGGCCAACAAGCTGCCGGGGCTTCCAATGCGCTTGCTCTTCCGGGTCCAGAA
GGTTCTAAAATTCCACCAATGGAAAAGTCTCAAAATGCTTTAACTGTTGGTCCATCATTAC
GACCTCAAGGATTAAATGATGTTGGTCTACATGGAAAAGGCACTGCTATTATCTCTGCTT
CTGGATCATCAGACAGGAATTTGTCTACCTCAGCCATTATGGAAAGACTTCCAAGCAGGT
GGCCACGTCCTGTGTGGCATCCTCCATGGAAGAACTACAGGGTCATAAGTGGGCATTTG
GGATGGGTGAGATCTATTGCATTTGATCCTAGCAATCAATGGTTCTGCACTGGCTCTGCA
GATCGGACAATTAAGATATGGGATCTAGCAAGCGGAAGGCTGAAGCTAACACTGACAGG
ACATATTGAACAGATTAGGGGCCTTGCAGTGAGTAGTAAGCATACTTATATGTTCTCTGC
TGGTGATGACAAACAAGTTAAATGCTGGGATCTTGAGCAGAATAAGGTTATCCGGTCTTA
TCATGGTCATCTTAGTGGTGTTTACTGCTTGGCTCTTCATCCTACTATTGACATTTTGCTC
ACTGGAGGACGTGATTCTGTCTGTCGGGTATGGGACATCCGGAGTAAAATGCAAATTTTT
GCACTCTCTGGACATGATAATACAGTTTGTTCGGTCTTCGCTCGACCTACGGATCCACAA
GTTGTAACTGGCTCCCATGATACAACTATCAAGTTCTGGGACCTTAGACATGGAAAAACA
ATGACAACTCTCACTAACCATAAGAAATCTGTGCGAGCAATGGCCCAGCATCCTAAGGA
GAACTGTTTCGCATCTGCATCAGCTGATAACATCAAGAAATTCCAGCTTCCTCGAGGGGA
ATTTCTCCACAACATGCTCTCACAGCAGAAAACCATAATCAACACAATGGCAGTCAATGA
AGAAGGTGTGATGGCTACTGGAGGTGACAATGGGAGCTTATGGTTCTGGGACTGGAAG
AGTGGCCACAACTTCCAGCAAGCTCATACGATTGTACAGCCTGGATCCTTGGAGAGCGA
AGCTGGAATATATGCTCTCTCCTATGACTTAACTGGTTCACGGCTGGTGTCATGTGAAGC
GGACAAGACCATAAAAATGTGGAAAGAGGATGAACTTGCTACCCCAGAAACTCACCCTC
TCAATTTCAAGCCTCCCAAAGATATCAGGCGGTTCTAGTTCTTTGTAGGCATCCTATTGC
AATCTTATGTCACCGCTTGGCAACACCGTCCATCCTTTCACGCAGCCTGTTTGCTCCAAC
AGATCTTCTTGCATCTACTCTTCTTCGGAAGA
SEQ ID NO:236
GCAGCAGAAGAGAGAGAGAGAGAGAAACGCGATCGTTCCTCAGTTTCGTCGGAGTGAA
GAAGGAGATCGGATCGGAGGCTCGCCGGCGAGGAGAGGGGAAGACATCGGACATGGA
AGAGGCCGCCAAGGAACAATCCGCCGGTTCCGGGAAGCCGAAGCTGCTCCGCTACGG
GCTGCGATCGGCCGCGAAGCCGAAGGAGGACAAGAAGGAGGAGCAGCTCCACCAGCC
GCCGCCGCCGCCGCCGCCGCAGCAGCAGGCGGCTCCGGCGCCGGCACCGGCGGCCA
CCAGGTCGTCGACGTCGGGGTCGGCGGGGGGCCGGGACCGGAGGCCGCAGCAGCAG
CACGCGGTCGACGAGAAGTACGCGCGGTGGAAGTCCCTCGTCCCAGTCCTCTACGACT
GGCTCGCCAACCACAACCTCCTCTGGCCTTCTCTCTCTTGCCGGTGGGGCCCGCAACTC
GAGCAAGCGACTTATAAGAATCGGCAGCGGCTCTACATTTCTGAGCAGACTGATGGCAG
TGTTCCAAATACTTTGGTGATAGCAAACTGTGAAGTTGTGAAACCTAGAGTTGCAGCTGC
AGAGCACGTGTCCCAGTTTAATGAGGAAGCTCGCTCTCCCTTCATAAGGAAGTACAAGA
CAATTATACATCCTGGAGAGGTTAACAGAATCAGGGAACTTCCTCAGAATCCCAATATTG
TGGCAACTCACACTGACAGCCCAGATGTTCTCATTTGGGATGTGGAATCTCAGCCTAAC
CGGCATGCTGTCTATGGAGCTACAGCTTCTCGTCCAAATCTGATTTTAACTGGACATCAA
GAGAATGCTGAATTTGCCCTTGCAATGTGTCCAGCTGAACCCTTTGTTCTCTCTGGAGGG
AAGGATAAGACGGTGGTTTTGTGGAGTATCCAAGACCATATAACAGCATCTGCAACAGAT
CAAACAACTAATAAATCTCCAGGATCTGGAGGATCCATCATTAAGAAGACTGGGGAAGGT
AATGAGGAAACTGGAAATGGCCCTTCTGTTGGACCACGAGGAATCTACTGTGGACATGA
GGATACTGTTGAAGATGTGGCTTTTTGTCCATCCACTGCACAGGAATTTTGTAGCGTTGG
TGATGATTCATGCCTTATATTGTGGGATGCACGAATTGGGACTAATCCCGTTGCTAAGGT
CGAGAAGGCACATAATGGTGACCTCCATTGTGTGGATTGGAATCCCCATGACAACAACC
TAATCTTAACCGGGTCGGCAGATAACTCTGTTAACATGTTTGATCGGCGAAATCTCACTT
CTAATGGAGTTGGTTCACCAGTCTACAAGTTTGAGGGGCATAAGGCAGCTGTTCTTTGTG
TGCAGTGGTCTCCAGACAAGCCTTCCGTCTTTGGGAGTTCTGCTGAAGACGGTCTCTTG
AACATTTGGGATTATGAGAGGGTTGATAAAAAGGTTGACAGGGCTCCAAATGCTCCTGC
GGGATTGTTTTTCCAGCATGCTGGTCACAGGGACAAAATTGTCGACTTCCACTGGAACA
CAGCTGATCCATGGACTATGGTTAGCGTATCTGATGACTGTGATACTGCTGGAGGAGGT
GGTACATTGCAGATATGGCGAATGAGCGATCTGATCTACAGGCCGGAAGAAGAGGTTTT
GGCTGAGCTGGAGAATTTCAAGGCGCATGTACTGGAATGCTCGAAGGCATGAGAGTGC
CTCGAGAACAGGCCTTCCGGGTCTCAAACACTAACTAGACAAAGCGGGTTTTTGCTGGC
TGTTACTGCTGTAAAATCTGTAGGTACTTAGCCATGGTTTAGACTCATCTGTGAGCGCCA
GGACTCCCCTCTTTACGCAGATGGTGACTGAAGCTGGTTCCGAGATCGGCATATGTAGC
TGGTAGAGGTGTGGATATTGCATAGACCGAACCTCCGCAGGTCCGCATTCTCGAGTGAG
AAACAGAGATAAATTTTAAGGGGGTTACCAAAAAAAAAA
SEQ ID NO:237
TCTTTCACTGCATGCTCCCTTTCCCGCAAAACACAAACCAAACACGTTAAACTCTAACCC
AAACCTTCTCGCAATTCTTACCAAATCCAGTTACTCTGCCAAAACCCTTGGCGGGAAA
ACCCCAGTTAGGAGCTTCCGGCCATGGCGAAGGACGAAGAAGAATTCCGCGGCGAGAT
GGAGGAGCGCCTGGTGAACGAAGAGTACAAAATCTGGAAGAAAAATACGCCGTTTCTTT
ACGATCTGGTGATAACGCACGCCCTCGAATGGCCTTCACTCACTGTACAGTGGCTCCCG
GACCGCGAAGAGCCCCCTGGAAAAGATTATTCCGTTCAGAAAATGATACTGGGGACTCA
TACTTCCGACAACGAGCCCAACTATCTGATGCTGGCCCAAGTTCAACTCCCGCTAGAGG
ATGCAGAGAATGACGCCAGGCAGTATGACGACGAGCGCGGGGAGATTGGAGGGTTCG
GCTGCGCCAATGGCAAGGTACAAGTAATACAGCAAATAAATCACGATGGAGAGGTCAAT
AGAGCCCGATACATGCCACAAAATCCTTTCATTATTGCCACGAAAACAGTTAGTGCAGAA
GTCTATGTGTTTGACTACAGCAAGCATCCTTCAAAGCCTCCTCAAGATGGTGGATGTCAT
CCTGATCTCAGATTGAGGGGTCATAATACAGAAGGTTATGGTTTATCATGGAGCCCTTTT
AAGCATGGCCATCTTTTAAGTGGTTCAGACGATGCACAGATCTGTTTGTGGGACATTAAT
GTACCTGCCAAAAACAAAGTGCTTGAGGCCCAACAAATATTTAAGGTGCACGAGGGTGT
TGTAGAAGATGTTGCATGGCATTTAAGGCATGAGTACCTTTTTGGGTCTGTTGGAGATGA
CCGCCATTTGTTGATATGGGATTTGCGTACATCTGCAACTAATAAACCACTGCACTCAGT
AGTAGCTCATCAAGGTGAGGTTAACTGTCTTGCATTCAATCCTTTCAATGAGTGGGTACT
GGCTACAGGATCCGCAGACAGAACGGTGAAACTTTTTGATCTGCGCAAGATATCCAGTG
CTTTGCATACCTTTTCCTGTCACAAGGAAGAGGTTTTCCAAATAGGCTGGAGCCCCAAAA
ATGAAACAATATTGGCTTCTTGTAGTGCAGACAGAAGGCTTATGGTGTGGGACCTCAGC
AGGATTGACGAATTCCAAACACCAGAGGATGCTTTAGATGGACCACCTGAGTTGCTGTTT
ATTCATGGTGGACATACTAGTAAGATATCAGATTTCTCATGGAATCCATGTGAGGATTGG
GTTATAGCTAGTGTAGCTGAAGATAACATTCTCCAAATCTGGCAAATGGCTGAAAATATAT
ACCATGACGAGGAGGACGATATGCCTCCTGAAGAAGTAGTGTAACTTTTATCTAGCTAGA
AGTTGTGAAATTAAGAGGGATGTGAGGATTGGGTTATAACTAGTGTAGCTGAAGATAACA
TTCTCCAAATCTGGCAAATGGCTGAAAATGTATATCATGGTGAGGGGGATATGGCTTCCA
AAGAAACATTGTAAGAGCTAGCTAGAATTTGTGAAATTAAGAGGTGTATTCACTTTCAGA
GTTTCTCAACAAATGACATGGTTCTCATTCCATTTTCTTTTATATAATGAGAAGCAAAACTT
GGCTTATCATCTTTTTTATTGAGAAAAGACACATTGAAAGAGCTTAAACCTTTAGCTTTTG
CTGTTTTCTGCTGTATTTATGTAGTTTGCAATCTGGTGAAGATTATGCCAGGTCTGCCTTT
CCTTGTAATAGTTTTTATAGACAAATTTATAGCTGAAAGTGATGCCCACGTTCTAGGTAAG
ATGGGAAACATCAGAATGTAAGGGTCTCAAATGTTGTCACTATTCAATTTTCATCTATTTC
TGGTCTGGTGGATTTTGCTGGTAAGACATTTGAAGATGTCCAACACTTGTTCTACAGTGT
ATTGTGAAATATAAGCACATAACTTGATACAAAAAAAAAAAAAAAAAA
SEQ ID NO:238
GCCTAACTCCCCACCACACAAAAACACAAACAGGCAGAGTCCCCCGCAACTCTCCTGTC
CTGTTTCCATTAATCTCTTACGCCGTTGCGTTTTCCCTTCTGATCAGCCCTCCCATTCTGC
TCCCCTCCGTTCCTGCCCTTGGGTTGAGAAAGGCCGTTGTGGCCATTTGTCAGTGTCTG
TTATTTTTGTTTGAGCCGGCACAGCAATCTGCCAGTCTCTATCTGTAACTCACTCGGTTTT
TCTTGAAACAGCAGGAATCACCGTCACCAAGAAGACGCCATTGCAGTGTGTACTCGACT
TCTGCAAGATAGCTGGTCAGTCAAATCAAATTTACACCGACGAGCACTCGGAAATAGTTT
CTTTTCCTCCAAATTTTAATGCAAATTTTTCGACTGCGAAAAAAAAAAATTTGGCTTGAATT
CTGGTTTTTCAGCAAGGAGGTTTTTTTCCTCTGGTTTAACAGCAGCAGCAGGATAGGGAA
ATATTTTGTGAGTACAGTTTCTGAAAAAACATTCGGACAAATTTTTGTCTGCGAGGGTTCG
AGGAATCTCGAAATTTTCACAAGCTGTTTTGGGTTTCTGGGCAAATTTTAGGGCAAATGT
TTTAAGAATGAGAAATCCAAACCGTGTGCTCGTGCATAGCAAGGAAAAAAAAATTATTCT
GTAGAGTATCAGGAAAAATCAGCAGCTCTCGCGAATTCGTCTTTTTCTCTTCCTTTTTTTT
TTGAACTTGGCATGCAGTGTGGGCTTAGAACAGTGCGTTTCAACCGTGGCTCCTTACCG
TGGTTCAAAAATGGGGAAATACATGAGAAAAGGGAAAGGCGTAGGCGAGGTAGCTGTGA
TGGAGGTCTCGCAGGGCTCGCTCGGTGTGAGAACGAGGGCTCGGACGCTTGCAGCAG
CTTCTTCACAGAAAGACCACCGGCGCCTCGGTGCCTCCAAATCTGTGACTACGAAACAC
CAGAGCTCGGCGCCTCCTGCTTCACCATGTGTGGAGTCCTCCATGCACACATGCTACCT
TGAGCTACGAAGTAGAAAGCTGGAGAAATTTTCCAGGTGTTACCACAGCGCCCATGGAG
CAACAAGCCACGGGGAGTCCAAGAGGAGCTTGAGCTTGAGCGAACCTTCGAGGCTGGC
AGTTTCAGAGGAGGCGAGAGTGGCCAGCGACAAAAGTAGTCATAGAGTCCTCCAACAAC
AGTCTTCCGTGGCTCATTCCAGGAACAATTCTGCAACGTTTTCTCACAATGCCAAGCCGG
CCAAAGCAGCCCAAAGGAAGGAGCGCCGTGACGACGACCATACGTCGGCGCGGCCAT
CCGAAGCTCCCCATGAGGACGAGGATGGCATGGAGGTAGAGGCATCATTTGGGGAGAA
CGTAATGGATCTCGACTCGAGGGAAAGAAGAACTAGAGAAACGACGCCATCGAGCTACA
CAAGGGACGTGGAGACGATGGAAACCCCGGGGTCCACGACTAGGCCTCCCTCTAACGC
CGGCAGAAGAAGATTCCAGACCGAAGGGGGGCATGGCACTCGCAACCAATTTCACGTT
CCCACCACCAACGAAATTGAAGAATTCTTCGCCGGTGCAGAGCAGCAAGAACAAAGACG
ATTTACAGATAGGTACAATTATGACCCAGTAAGTGACTCGCCTTTGCCTGGTAGGTTTGA
GTGGGTTAGATTGAGGCCATAAATATCTTTCACAAATAATTGTGGGGCTGCAGGCAGTTG
GTGTGGTTGGGAGTGCTTTACCAGCCCTTGCCATTCTTCTGTACTGCTCTGTTACCAATT
TAACTTGGGGTTTCAACGGAACTCAAAGAAAAAGGAAAACTGTAACTGCCACAAGGCCAT
GTTTGATGCCAAACTGCATCTCTCATTATTTAGGGTGCAGGCTGTATAAAATGTTGTAAAT
TGTAGTATCAATGTGTACAATATGAAGCGGCTGGCATGGCTTCTTCCCTACAATGTATTTT
GCAGGGGCATTCTTCTTTTTTATTCCCAATGAAAAAAATTCAAACAAAAAAAAAAAAAAAA
AAAAA
SEQ ID NO:239
ACCGAATGAACCCATCCTGCGCGCGGATATCTGCATTTGATGCAAAGGAGAAGAGGGC
GAGGGTGTTAAGGAGGGCGGAGTTGCTGTAGAGGAGGGTCGGTTCGGTCATAACTCCT
GAATTTCTTGTTAGTGTTTGATGTATAAGGCTGCTCAGATGTTAAATATATTTGATTCGGT
TTGTTTTGAAGGCAAATGAAACTCTTCGGGCTTATGAAGGTTGGTTTGAATTGAGAAATTT
GGGATATATAAACAGTGGCCATGTTCTCCGGACCATGGAATTGGGGATTTCTCCATAGG
GTTTTAGTGTTTCAGGGTTTTGAGACTTGAATACCTTGAAAAGATTTGATGGGAAGGTTTT
GAGAGATAAAATCTCGAAAAGATTTTATGGGAACAGGTCTTCCGGAGATTTGAGGAATGC
AAAACATGGAGGAAAATGTGCAGAGCAGCTGGAGTCTCCACGGCAACAAAGAAATATGT
GCCCGTTACGAGATTCTAAAGAGGGTCAGCAGCGGAACTTATTTGGATGTGTACAGAGG
GCGGAGAAAGGAGGATGGTCTAATAGTAGCCCTTAAGGGAGGTGCATGACTATCAAAGCT
CTTGGAGAGAGATTGAGGCGCTGCAAAGGCTTTGTGGGTGCCCCAATGTTGTGAGGCT
CTATGAGTGATTTTGGAGTTTCTGACTTCCGATCTCTATTCTGTCATTAAGTCTGCTAAGA
ATAAGGGAGAAAATGGTATTCCTGAGGCCGAGGTTAAGGCCTGGATGATTCAGATCTTG
CAAGGGTTGGCTAACTGCCATGCCAATTGGGTTATCCATCGTGACCTAAAGCCCTCTAAT
ATGCTCATTTCGGCTTATGGAATTCTCAAGCTCGCTGATTTTGGATAGGTAAGTATAGAT
GATTTAGTGGCTTGCCATCTGTTGCATCTTCTGTGTTTTTTTTCTTGTCACCTTCAATCCT
CTAGCTCTGTTTGACCCTATTTCTTGAAGTGTTCGTACATTCTGTTCTATTTCCCATTTCAT
GTTTTTAATATTTTAAAATATGCTGAAACCCTGTTTATGGTTAACTACATCTTCGTAGTAGT
TCAAAATGTTAACAAATCTCATACTTGAAAAGTTCAATCTAAAATCTTGAAAATTTTGACTG
ATTCAAAGCGCTTTGGAAACTAGATAAATGAATTGATTTTGCCTTTTACTATAACTATTCG
AGCAAATGTTGCATAGAAGCGCAAGGTTAATAGTTTCCCAAATTGAAACTGAATAGTGCT
TAAAGTTTAGTGATTCTCCGTCAGTAGGATGCCTGAACATTCAATAACTTAGTGATCTCTG
AATTCGAAAATCAATATTTGCAGGCAAGGATACTTGAAGAGCCTGAAGCGATCTATGAAG
TAGAGTATGAGCTTCCTCAAGAGGGCGATCTATGAAGTAGAGTATGAGCTTCCTCAAGA
GGATATCCTTGCTGATGCCCCAGGAGAAAGGTTGATGGATGAAGATGATAGTGTTAAGG
GAGTATGGAATGAAGGGGAGGAGGATTCATCCACTGCAGTTGAAACTAATTTTGATGATA
TGGCAGAAACTGCGAATTTGGATTTGAGCTGGAAAAATGAAGGTGATATGGTGATGCAG
GGATTCACATCTGGTGTTGGAACTCGATGGTACAGAGCTCCGGATTTTCTCTATGGAGC
AACGATCTATGGAAAAGAAATTGATTTGTGGTCGTTAGGTTGCATTCTGGGGGAGCTCTT
GATTTTAGAACCTCTCTTTTCTGGGACTTCGAACATTGATCAACTTAGCAGGCTGGTTAAA
GTTCTTAGGACTCCAACAGAAAAAAAATTGGCCTGGATGCTCCAATCTTCCTGATTATAG
GAAACTTTGTTTTCCTGGTGATGGAAGCCCTGTTGGTCTGAAGAACCATGTCCCCAATTG
CTCAGACAACATGTTTTCTATTTTGGAAAGACTTGTTTGCTATGACCCTGCAGCTAGGCT
AAATGCTAAAGAGATAGTTGAGAATAAGTATTTTGTTGAGGATCCTTATCCTGTCCTCACC
CATGAATTGAGAGTTCCCTCACCTCTAAGGGAAGAAAACAATTTTTCAGAGGATTGGGCG
AAATGGAAGGATATGGAAGTAGATTCCGACTTGGAAAACATTGATGAGTTCAATGTCGTT
CACTCAAGTGATGGTTTCTGCATTAAATTTTCATAAACTCTAATTGGTTGATCATGTTTCAT
CTTCCTCAAATTGCCTTTTATATGGATTAGTATTCTAAATTAGCTTTGGGGGCTGCATGTC
TTCAAGCATTCACCACGACAGTAAAGTAATCATTATGATTACTAATGTATTGCTTTCATGG
GGTGAGAAACATTATAGCCATGAACCTACTGTTAAATTTGTAGAATTGTCTTATACTTGCT
CAAAATAGTGACTGCATGTTTGAGAAAAAAAAAA
SEQ ID NO:240
ACTCCAACAGAGGAAACAAAAACACATTTCCCGCCTTCGCATTTCCATTTATATCTCAATA
CCAGTTCTCCGACATTTTATCGGCCTTCAAATCTCTGATAAGAAATAATGGCTCCCGTCA
AGAGGATCGAACCAGAGAAAACGAAGGCTAACGAAGGCAAGCCGAAGCGCCGGAAAGT
AGCCTTCGCCATAGACACAGGAATTGAAGCAAATGACTGCATATCACTTCATCTTGTGAG
CACCCCAGAGGAAATGCGAGATGCAGAGGGAGTGGAGGATCAAAGCTTATCATTCAACC
CAGAATATATGCAACATTTTGTTGGGGAACATGGGAAGATCTATGGCTACAAAGGCCTGA
AAATTGATGTGTGGCTGAATGCCCTCTCATTTCATGCCTATGTGGATATCCAGTATGAGA
GCAAAGTTGAGGAAGGAAAGAGCGAGAAAGAAGCAACAGACTTGACAGATATTATGAAG
AGAATATTTGGACGTGGCTTGGTGGAAGATCGTAATGCCTTTATCCAGAGTTTTTCAAGT
AACAGTCAATCTATCGAAAGTATGATCCACAATGAAGGTGAGCGTATAGCAACACGTGAG
ATTTTGACTGATAAGGGTCTATCTGCGCAAGGAGATTCAGAGAGATTGGGAGTCTCAAAC
GAGATATTCCGCCTTGAATTGAGCGATCCACAGATACGAGAATGGCATGCACGCTTGGA
ACCGCTTGTGCTTCTTTTCGTTGAAGGAAGCCAACCTATAGAGCAAGATGACCCAAAATG
GGAAATGTATATCAGGGTCCAGAGGGAGTCATTGAGTGGAGGCAGTGCTGTGTGTAGAT
TATTGGGATTCTGTACTGTTTACCGTTTCTATCATTATCCAGACACCACCCGTCTAAGAAT
TAGCCAGATACTAGTTTTCCCTCCTTATCAAGGTAAAGGGCACGGCTTACTTCTTTTAGA
GGCTGTAAATAAAACTGCAGTTTCGAGGGATTCTTATGATGTAACAGTAGAAGAACCATC
CGAGTCCCTCCAGGAATTACGCGATTGCATGGATACAATACGCCTCCTTTCTTTTGAACC
TGTAATGCCTGCAGTTAAGTCAGCGGTGCAGAAGTTGAAGGAAGCAAACCCCTCAGACA
AAGGTGCAGCTGATCACTGTTTAGAAGGAAATGTAAACAATGAGACTGTGACTACCTCCA
GCACAAAACCCAAAAATAAAAGTGGGTGGTTTCCACCTCCAGGTCTAGTTGAAGAAGTG
AGGAAACATTTGAAGATTAGTAAGAAGCAGTTCAAACGATGCTGGGAGATTCTTCTATAT
TTGAATTTGGACCGCTCTGATTCACAATGTGAGGATAAATATCACATCTCTCTCATGGAA
CAGATAATGTCAGAACTCTTTGATAAATCGAGCGAGAAGAGTGCCAAGGGTAAACGTGT
GATTGATATAGATAACGAGTATGACAATTCAAAGACGTTTATCATGGTCCGCACAAGGAA
TCCAGGCAATGGTGAAGGTTTTTTACCCGAGGCACTGGAGGGTGGGATGGAGGTATCC
CAAGAGGATCAGTTGAAGTCATTATTTGAAGAGCGACTGGAAGAGATTGCACAAATTGCA
GAGAAAGTTCCTAGCCTCTGTAAAGCATTGCAAATGCCATAGTTCTCATTTCTATGGAGC
GTGACCTGCCTGTAGTACGAGGATTTGACCATTTGACTCTGACAAATTTTAGTAGGCTGC
AATGTTCTTTGCAGCAAAAGGTGCACGCCCTCATTCAAGTAAAAGGGTATATTTTGTCTC
ATGTTGGGGTGATAATTCTCCCTGAAAGTCTCCAAAATATATGTGCTGGTTGTAGAATAA
AAATTAGAACAAACGGCCTACTGTCATATTTCATGCTCCATGCTGCTAGCAAGATTGTTTT
TCAAGGTTCTTTTTCGGATATATAACCAGCTGAAGGGTGATAGTTTTCTTGGTACGTTTCA
CCTATTGGTCGAATTTGCTAAAATGGTCACTTCTGGTCATTGCGGATCGGTATAAAGCCG
CTATGATAGATCATATTAATCTGGGGTATATGAAGTGTGAATTATGAACTTAGCTGTACAT
TTCTCTGTTTATTTGGGATGAAGATAGATGTATGCTTAGGAGAATTTTAGACATTAAATTT
GGAAGTGTTTTTTAAAATTAATTTTGATTAGCAT
SEQ ID NO:241
CGCACTTCTGTGCCGGCACTCCTGTCCCACTTTTGTTCGGAATTATGCCGAGGCTTCATA
CATGCGGGTGTTGCTCAGTCGTTTTTCTTTCATAAATAAAATATATGTAGCTTGCAAGGGT
TCTTTTTTATTCTAAATTTCTTGTTCAAGAAGATTTACCTCTAGCAGTTTCTTAAAATTTCC
GGTTGCCATAGTCTAGTGGGGTGAGGGTTCATTCTAGGGGATTTATTGTGTTTCACTTCC
GCATCGAAGCTTCATAGGCTGACGCATTCTGTTCATTTGATAATATCTTTTTGACCTAGCT
GAAAGCTTCTGTGCCCAGGATCTTACGTTACTGCAACCATGCCTGAGGACCGAAAAAAA
ATCCTGGAAGCATTGGCGGCCAAGAGAAAGGCCGAAGCCGAGAGCGGGGAAAAAAAGA
AAAGGCAGAAAAGTTCTTTGAATCCTGCCAAACCTGTTAGCAAACCTGTTAGTAAACCTG
TTGGTGGAATTGGTAGCAAGGGAAAGTCGACATCTGCGCCAATTTCTTCGACTAAGGCT
AAGTCTAAGCACAAGGAAGAGGTAAAGGCAAAGCGTGTAACCAAGATGGATAGGTATGA
GACTGATGAAGACGATGAGTCTGAGGAGGAAGAGGATTTAGATTCTGAATCAGATGATG
ATGAACTGTCTGATGAGGATTCTGAAGATGATATAAAATCCAAGTCTGTAAAGAAATTGC
CTCCTCAGTCCAAGGGAAAAGCTCCTGTTAAAGGAATTAGTAGTTCAAATGGGAAGGGC
AGGGATGAGAAGGGAAAGGGAATTATGAAAGACAAGGGAAAGGCGAAAGCAAAAGTGG
AGGAAAGTAGTAGTGATGCTGAAGGTGACAGTGATGATGATGGAGGTGACTTGTCTGAT
GATCCTTTGCAGGAGGTCGATCCTTCAAATATTCTTCCCTCAAAAACACGTAGACGAGCT
TCACAGCCTACGAATTATCAGTTTGCAAATATGAGTGGGGATGACGACGACGACGACGA
CAGCGACTAGATGATGAACCTGATAAAACTGTTTTGGGGCTCTCGATGGTAAGTTTTCCA
CAGTGATGAGCAATATAATATATGCTCTGAATTGTGAAATGTGGAATGCAAGATCCTCTTT
CTGGTATTAGGTGAACTAGGTAGTAATATTTGCTTTTGCAGTTAGACAGAACAGAATTCC
CTTTGGCCTTCAGTGGTCGGTATTGGTCCTATCGATATGGTATATTTTATTGCTTCACTTT
ATTTATGCTGCGCAAACTGTTGACACTCGGGATCTTTATTTTCGTTTTGACACGTCCTGTG
TTTTTACAAGTAGTTTGTTATTTGATTACATAATGCATGTTACTCTTTGGAAGGAACAGAG
CTACAGGTTCTTTTCTATAAGATATAGTTGTTGTGTAAAAAAAAAA
SEQ ID NO:242
CTTAAACTCCATTTCTCACTCACGAGTTTACCCAAGCCTCTCTGTGCAAGCACGAATTCG
CATTTATTTCTCCATTACGTGACACAGAGGGTTAGGTGATTAATTAACTATCGGAGGATTA
ATTGAAATTCTAATTATCAGTTCGCTCCCTCCGTTTTTCTCTTCCCCCTTTCAGTCTCGTA
TGGATTAGCCACTTCGCTTGTAATAACAATGGCAGAATCATCATCATAATCATGCGGGCG
GCGAGGTGGGTTTAGGGTATAAGGTTAAACAGAGCATCCAACGACCTACCTAGACTGGT
TCACGCGCTAAGATTTTTATTGTAGTATTTTTGTTGCACATAGTGGGCATTGGGGTATTTA
TCCAAAGCCATATTGAAAAATTAAGACTTGTATCGTATGAAAAAATAAATATTCATGGCAG
ATGTACCAGAGTCTCTGCAACAAGAAAAGGATGAACAGGGAACTGATAAAAATTGCTGTG
ATGGCAAATTTCAGAAAGAGATAGACATTGATGACATGGAAGAGGAATATAATGAAAGCA
GCATAGATGATGAGGAAGAAAATTTATCTGATAATGTAGCTACCAATAATATGGGTACTAT
TCCTCAAGGGCAAGCATGCATGGCTGTTACAGTTGAAGGCATTGAGCATGCTAATTCAG
TTGGTTGTGGAAGGAATGGCAGGGAAGGCAGTGAAGAGGTCACTGCTGCTGAAGATAT
GGGGCATGTATCCATAGAAAATATAAGAGAACAAGGTCGTAATAGGAAAAGTTCGGAGC
AATTGCTTGCTTTGTATGAACAAGAAGGGCTGTTGGAAGATGATGAAGATGATGATGATG
TGGACTGGGAGCCTTTTGAAGGTGTCACTGTGCAGATGAAATGGTATTGCACCAATTGC
ACAATGGCCAATTCTGATGATTCAGTCCACTGTGATTCATGTGGAGAACATCGCAATTCA
GACATTCTCAGGCAGGGTTTCTTAGCATCACCCTATCTACCTGCAGAAAGCCCAAGCAG
TTCAGATGTTCCAGATGAGAGGCTAGAAGAAAGCAAATGTGTTATGACCACTCTCACACC
TTCCATATCTCCCATGATTGGGGTGTGTTGTTCATCATTACAGTCAGAGAGACGAACAGT
GGTTGGGTTTGATGAGCGTATGCTTCTGCATTCAGAGATTCAGATGGAGACATATCCTCA
TCCAGAGAGGCCTGATCGCCTTCGTGCAATTGCTGCAAGCCTTAGAGCTGCTGGTTTAT
TTCCAGGAAAATGTTTCTCCATACCTGCAAGAGAGGCTACTTGTGAAGAGCTGCAAACGA
TCCATTCTTTGGAGCACGTGAACGCTGTTGAATCAACAAGCTGTGGAATGTTAAGCCATT
TGTCACCAGATACGTATGCAAATGAGCATTCGTCTCTTGCGGCCAGACTTGCAGCTGGT
TTATGTGCTGACCTTGCAAAAGCAATCATGACAGGCCAAGCTCAGAATGGGTTTGCTTTG
GTAAGGCCTCCTGGTCATCATGCAGGAGTAAAAGATTCCATGGGTTTTTGTCTTCACAAC
AATGCAGCAATTGCTGTCTCAGCATCCCGGGTTGTAGGAGCAAAAAAAGTTTTAATTGTT
GATTGGGATGTACACCATGGGAATGGTACACAGGAGATCTTTGAGGCAGATCAATCGGT
GCTTTATATATCCTTACATAGACACGGAGAGGGTTTCTATCCTGGTAGTGGTGCTGTTAC
CGAGGTTGGTAGCTCCAAGGGAGAGGGATACTCGGTGAACATTCCGTGGAAATGTGGA
GGAGTTGGGGATAATGACTATATTTTTGCATTTCAACATGCCGTTCTTCCAATAGCGGAG
CAGTTTGAACCTGATTTAACAATAATATCGGCAGGCTTTGATGCTGCAAAGGGTGATCCT
CTTGGTCGCTGTGAAGTCACTCCTGATGGTTTTGCACATATGGCTCAGATGTTGAGTTGT
CTTTCAAAGGGAAAGATGCTTGTTATTTTAGAAGGAGGTTATAATCTACGTTCAATATCTG
CCTCAGCTACAGCAGTGATAAAGGTACTTCTTGGTGATAATCCTAAAGCCCTACCCATTG
ATATCCAGCCTTCCAAAGGTGGATTGCAGACACTTCTGGAAGTGTTTGAAATTCAATCAA
AGTATTGGTCTAGCTTAAAGGGGCATGACCAGAAGTTGCGCTCTCAATGGGAAGCACAG
TATGGCAGTAAAAAAAGAAAGGTGATAAGGAAAAGGCACATGCACATTGTTGGAGGTCC
AGTCTGGTGGAAATGGGGGCGTAAAAGAGTGGTGTATTATCATTGGTTTGCAAGGGTAT
CATCTAGAAAGCATTTGTGAAGTGATAGTTTCCATTTCTTCCATTTATTGTTGACAGCGCT
AAAAGATGGGATGGACAGGTTTGAAGGATGGCCTCAGCTTGCTTTTATCTGCAGATAGG
ATGTTACTTCATTTTTGTACAGATTAATTTGCGGCAAATTGGCATCATATATGACTGTCTG
GGAAATCCTGCACTAGTGGTGTAACTTGTGGCGGGAAGGAACTGCACAGACATGAGTTG
TGATTTTGATGTAATCCTAATTTTATCTATTGTCACCTTGGTTGAATTGTCAATATTTTCAT
GTCCCTTGAGGCCAATCTTTGAAATGGAGAATGTTTAATTGTAGAATACATGTGACAATAT
TAATTCTTGAAGGTTGCACATTCTTAGTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAA
SEQ ID NO:243
TAGGAGCGAGCGAGAGTATTACCTCATTCAAGACAGGATTCAAGAGCGGAGTGTAGCAT
CGACATAAAATGGCAAGTGGAGCAGGGGCAGCAGGGGTGGTCGAATGGCACCAGAAGC
CTCCAAATCCAAAGAATCCTGTCGTCTTCTTTGACGTAACGATAGGCACCATTCCAGCCG
GACGTATTAAGATGGAGCTCTTTGCCGATATTGTTCCCAGGACAGCTGAAAACTTTCGGC
AGTTCTGCACAGGGGAATACAGGAAAGCAGGCATTCCTATTGGCTATAAAGGTTGCCAT
TTTCACAGGGTTATTAAGGACTTCATGATTCAAGCTGGTGATTTTGTGAAGGGAGATGGC
AGTGGATGCATCTCTATCTATGGTAGCAAGTTTGAAGATGAAAACTTCATTGCTAAACATA
CAGGTCCTGGCCTCCTCTCTATGGCAAATAGTGGGCCTAATACTAATGGATGCCAGTTCT
TTTTAACTTGTGCAAAATGTGATTGGCTGGACAATAAGCATGTGGTTTTTGGGAGGGTAC
TAGGAGAAGGACTCTTGGTTTTGAGGAAAATTGAAAATGTCCAAACTGGGCAACATAATC
GACCCAAGCTACCTTGTGTTATTGCTGAGTGTGGTGAAATGTAACAACCAAATGCCTGTC
AATATTGTGTCATGGGATGATTTATCTGCTCATGGATGTTAGTTCGACTAGCCTCAGTTTT
ATCTTTTGTTTCGTTCCCACAGGGAATTTTTAATGAAATGAGGAGACCATTGCTTAGATAT
TTATGGGTCTATCATTAGCAGATAAAATTCTTTATGGTCACGAACAACAGTAATGCAAGAG
CTTGAATATGTTCCAAGCTCTCCAGTTTGTTTTAATGGCATTGCGGACTTATATAAATCAG
TATACCAGTCCTTTGTTGCCTGGTTTACCAAATTTTGTT
SEQ ID NO:244
AATATACGTGTGCCATTAAGCCTTACGGCCTCTAAAAATCGATTTAATGAACGAAGATAT
GTTGTAAGCGAGGGCCGAAGTGTTCCAACCTTATCCGTGAAGATGATGTGGGAGACCTG
CTAAGAGAAAGCCCGAGCAGACAAGGTGAGTGGACTGCTGTAAGGACCGACAATGGCA
AAATTGGTATCATCAGTATGTGCATTTTCATGTCAGCAGCGGCACCCTCACTCTCGACCT
CGTTTCCTATCCAACCGGGACCATTATAACCATTACCATAATCATTCTCATTATCATAATG
TTTGCTATTTTCCACCAATGATGATGATGCAGCAACAGTTGCAGAAGCAGAAGAGGATGA
CGACGAAGACCATTACATCCTTGTTCAAATGCAACTCTTCAACCACACATTGCTCAAGG
GACTGAAGGAATTCATGGGCTTCAAATTCAGATTACAAGCGGCAATGCTTTCTTGCGAAA
TGAGCATCCTTGGAAGAGTGTTTGCTATATTTTTTATTGTTCATCAAGCAGCTGCCCCATT
TCCTTTTAATCATTTTGACAATTGGTTAGTCCCTCCTGCAAGTGCAGTGCTTTATTCACCT
AACACAAAGGTTCCTCGAACTGGAGAAGTTGCACTAAGAAAGTCTATTCCTGCCAATCCA
GCCATGAAAAGTATTCAGGACTTCTTGGAGGACATTTACTACTTGTTAAGATTTCCTCAAA
GAAAGCCATATGGTACAATGGAGGGAGATGTAAAAAGTGCTCTTCAGATTGCAATAAATG
AGAAGGATTCAATATTAGGTTCAGTTCCATTGGATATGAAAGAAAGGGGATTACAATTGT
ACAACTTCCTTATTGATGGTCAGGGAGGGCTGCAAGTCCTTATTGAATATATAAAAGAAA
AAGATCCAGACAAGGTCTCTGTGAACCTTTCCTCTTCATTGGATACGATTGCTCAATTGG
AACTCTTACAGGCTCCAGGTTTACCATATCTTCTACCTGAAGAATATCAGCAGTATCCAA
GGTTGAATGGAAGAGCCACAATAGAGTTTACAATGGAGAAAGGAGATAATTCAATGTTTT
CTGTGTCAAGTGGAGGTGGGCTCCAGAAAACAGCTACCATCCAGGTTGTTCTTGATGGA
TATTCGGCTCCTCTTACGGCAGGGAACTTTACAAAACTTGTGATAGATGGAGCATATAAT
GGACTTAAGCTGAAAACTACTGAGCAAGCAGTCATTTCTGACAATGAGCGTGCTGAAGC
TGGGTTTAACCTACCTATAGAGATATTGCCGGCTGGTGGGTTTGAACCATTGTATAGGAC
AACTTTGAGTGTTCAGGATGGCGAGCTACCTGTGCTCCCCTTATCTGTATATGGAGCAAT
TGCAATGGCACACAACACAATATCAGAGGACTACTCATCACCATCACAATTTTTCTTTTAT
CTATATGACAAGCGCAATGCTGGTCTAGGTGGACTTTCATTTGATGAGGGGCAATTTTCT
GTTTTTGGATACACAACAGTGGGCAAAGAGATACTTCCACAACTCAAAACAGGAGACATT
ATCAAATCGGCCAAGTTAGTGGATGGCTTTGATCACCTTGTGTTGCCATCTTCAAGTACC
TAGAATCTTTCATTCATTTCATTCAAATTCAATGCTAGAGTTGTAAACAGTGTAATAAGGA
GCAGAAGTTGTGATAGCTTTTAGGAACGATAGACTTTGTAAATTGACAAATCCATGTAATA
TTTATCCAATAGAAAAAGGAAATACATAAAAAAAAAA
SEQ ID NO:245
GTTTTAGTTTTTGACGAGTGTCTTTCTAGGTGGGGTGTTAGGGGAATTGCTCACCGGATT
TTGATTTTCTGGTCTTTTTATGTGCGTGGCTCGTGTAGTTTGATACTGAACATTCAGGAAT
TGAAGCTTCTTCACAAGGATTCGGATTTTGATTTGTTTCAGAAGGTGGGGTACAAGATTC
AGTAGCGAAGTTCAGTGTTTTGCCGAAGGGCTCTTTCAAATTAAGTAAGATGGATCACTA
CTACCAAGATGACTTTGATTATTTGGTGGATGACGAGATGGTTGATTTTGCTGATGACGT
CGAAGATGATGTACGCACCCGGCGCAGGAGTGATATAGACTCAGATTCTGAAAATGATT
TCGATTCAAACAATAAGTCACCTGACACAACAGCTCTTCAAGCCAAGAGAGGAAAAGATA
TTCAGGGTATTCCATGGAATCGATTGAACTTTACTAGGGAAAAGTATAGGGAAACTCGTT
TACAGCAGTATAAGAACTACGAGAACCTTCCACGGCCGCGACGCAGTCGCAATCTGGAC
AAGGAATGTACTAATTTTGAAAGGGGATCCAGTTTTTATGATTTTCGCCACAATACGAGAT
CAGTCAAGGCCACTATTGTCCACTTTCAGTTGAGGAATTTAGTGTGGGCAACATCAAAGC
ATAATGTTTACCTCATGCAGAATTACTCTATCATGCACTGGTCTTCACTAAAACAGAAAGG
GGAAGAAGTTCTTAATGTTGCTGGTCCGATCATTCCATCTGTGAAACACCCCGGTTCATC
ACCACAAGGTCTGACAAGGGTCCAGGTCAGTGCAATGTCTGTGAAGGACAATTTAGTTG
TTGCTGGTGGTTTTCAGGGAGAACTTATTTGCAAGTATTTGGACAAACCGGGAGTGAGCT
TCTGTACAAAAATATCTCATGATGAGAATGGCATCACTAATGCAGTAGAGATATACAATGA
TGCAAGTGGTGCAACACGACTAATGACTGCAAATAACGACTTGGCAGTACGAGTATTTGA
CACTGAAAAATTTACAGTGCTTGAACGCTTCTCTTTTCCTTGGTCTGTAAATGTGAGTGTT
AATTCTTCGTCGAGATCACTGTGTCTTATCGGAGGGAAGTTTCTGCTGGTTTTAATTTGA
CATATCAGATGACATGAACTTATTGCAAGAAATATTACTGTCTTGTTCATCCTGTATATAG
ACGTGTATGTTAAATGTCAAATGTACATTATCCGGTTTCTGAGAGGACATTGATAACTTTG
CAGCATACGTCTGTCAGTCCAGACGGTAAACTTGTTGCAGTTCTTGGAGATAATGCAGAT
TGTTTGCTTGCAGACTGCAAGACTGGAAAGACCGTGGGAACTCTAAGAGGTCATTTGGA
TTACTCTTTTGCAGCTGCCTGGCATCCAGATGGTTATATTTTGGCTACGGGGAATCAGGA
CACCACCTGCAGACTTTGGGACGTTAGGAAGCTTTCATCTTCCCTAGCTGTCTTAAAGGG
TCGAATGGGTGCTATAAGATCGATACGGTTTTCATCAGATGGCCGCTTTATGGCCATGG
CAGAGCCAGCCGATTTTGTGCATCTATATGATACTAGGCAGAATTATACTAAGAGCCAGG
AAATTGATCTTTTTGGGGAGATTGCTGGAATTTCATTCAGCCCCGACACTGAAGCATTTTT
TGTTGGGGTGGCTGATCGAACGTATGGAAGTCTTCTTGAGTTCAACCGTCGACGAATGA
ATTACTATCTTGATTCCATCCTCTGAGTTTTGAAAGTTAATGGGAGTGGTGTTTTCTTGAA
GTGAAATGGCATCATTCTGTTGAACCAATTCTTGTATATTAGATATGTAACATGTATGAAT
GTCCATAGAGCAGAGCTTTGCTCAATCAGAGGCTTCAAAACCCAAATTCCAAGGTCTATC
GAAGCCTCTTTGTTTATAAATGGTGTTGTGGTGGATACAGCTTTCTGGTTCACGCTGTAG
ATTTTTTAATGTATGTAACAATTTCAGCGGAAAAAAAAAA
SEQ ID NO:246
ATCTTCGGGTTTTTCCCCTGAAGAGGTCTCGGTCAAACTCGGGCTTTGAAAGAAGCGTTT
TATGGGAACTTTGTGCCTCGTTTCGTTGACGAATCTGTGCCCCTTTTCTTTGACGAATTT
GGTGAAGAGCTCTGAAGAAGAAGATGTCTTTGCTCTGCTGAGAGGTTTTCGGAACCAGG
GTTCTGCCAAAATCGGTCCGCGTAACTGTTTTGTTGGGTTGTTACCGTGTAGAGTCTGAT
TTGGTATCGTCTATGGATTCTTTCGGAATTGAAAACAAGGAAAGTTAGGGGTTTCTAAGC
CTATTTTGTCCTTGATGCCTTTAGGAAAAAGTATCGATGATTTAGCTGAATTTCTGAGTAA
TCTCAGAAAATAGTAATGGATTGCTCGGGCGACGAGGAGGAGGAGCAGTTTTTTGAGAG
TCTGGAGGAAATGTTATCACCTTCAGATTCAGGATCAGAGGCTGCAGATAATGAGACGG
GGTGCAGAAATGCCGATGCGAGGTCTAAGTACGAGATTTGGAAAAGAGCGCCCAGTAGT
ATTCAAGAAAGGCGGCAGCGGTTTCTTGTGCGGATGGGTTTGGCTAATCCCAGCGAGCT
TGGGAATCAGGTGAATTCCACATCTGCTGAATCAACTTGTTCTACGGAAACTGCAAACAT
TCCAAATGGAATTGAAAGGCTCAGAGAGAATAGTGGGGCCGTATTAAGAACTGCAGGAT
CGAGTGGCAGGAAGACGCACTGTAAAAATGTTATTAATATCGGTTTGAGAGAGGGTTCG
GTTCGGTCTAGTAGCTCAAGCAATGGTACCCCAGATGTGGGCGAGGATAATGGAGAATT
CGGGGGCACCATTTTCTCCCGTTCAGGTGGCACTTGGGAATGCATGTGCAAAATTAAGA
ACTTAGACAGTGGCAAGGAGTTTGTTGTAGACGAGTTGGGGCAAGATGGTCTATGGAAC
AAGCTTAGAGAAGTTGGAACAGATAGACAGCTAACTATGGATGAGTTTGAAAGGTCGCT
GGGGCTCTCTCCTCTTGTGCAAGAACTCATGCGCAGGGAAAGTGGGGTGGCCCAAGCA
GATTGTAACGGAGTACATCATCATGATGCAGAGATTTCAAGCAGCAAAAGGAGAAGCTG
GCTGAAAGCCCTAAAATCTGCTGCCTATTCAATGAGGCGGCCTAAAGAGGATCAAAGTA
ACTATGATTCTGAGAGAAGTGGGCGAAGATCTGGTTCCTTTGATGTTCCTTGGGGGAAG
CCTCAGTGGACTAAAGTGCGCCACTATAGGAAACGTTATAAGGAGTTTACAGCTCTTTAC
ATGGGTCAAGAGATTGAAGCCCACGAGGGCTCTATTTGGACGATGAAGTTCAGTTTGGA
TGGGCGTTATCTAGCCAGTGCTGGTCAGGATTGTGTCATTCATGTACGAGAAGTGATTG
AATCAATGAGAACGTTTGGGGCTGATACACCAGATTTATATGCTTCCAGTGCTTACTTTTC
CATGAATGGTTTGCAAGAGCTAGTCCCCTTAAGTATTGAAGACCATGCAAACAAAATGAA
AAGAGGGAAGATCATAGGCAGCAAAAAGAGTTCAAACTCAGACTGTATAGTTTTACCTAA
TAAGGTGTTCCAGCTTTCAGAAGAGCCAGTGTGTTCTTTTCATGGTCACCTTCTTGATGT
ATTTGATCTTTCATGGTCCCCATCACAGTATTTGCTTTCGTCTTCAATGGATAAAACAGTT
CGACTATGGAAACTGGGACATGAAAGTTGTCTAAAGGTGTTCTCTCATAATGATATTGTG
ACATGTATACAGTTCAATCCTGTTGATGAGAGATACTTCATAAGTGGCTCATTGGATGGA
AAAGCCCGAATTTGGAGTATTCCAGACCGTCAGGTTGTTGACTGGAGTGATCTGCGCGA
AATGGTCACTGCAGTATGTTACACTCCCGATGGACAGGGAGGACTAGTTGGATCTATCA
AGGGAAGTTGCCGTTTCTATAATACATCAGGTAACAAGCTACAGTTGGAAAATCAACTGA
ATGTGCGTAGTAAGAAGAAAAAATCTTCTGGAAAGAAGATTACTGGTTTTCAGTTTGCAC
CAGGAGGAGATTCTCAGAAAGTATTGATAACGTCTGCAGATTCACGAGTTCGAGTTTATA
ATGGTTCTGAGCTTGTTTGCAAATATAAAGGTTTTCGGAACACCTGCAGTCAAATTTCTG
CCTCATTTGCACCGAATGGGCAGCATTTTGTTTGTGCTAGTGAAGATTCACGTGTTTATA
TTTGGAATCATGAGAGCCCTCGTGGTTCTGGTGCAAGGCATGAAAAGTCCTCTTGGTCA
CATGAACACTTTCTATCACAGGGTGTTTCAGTTGCAATTCCTTGGTCTGGTATGAAGCTT
CAGCCTCCAGTTTGGAATTCACCTGAATTTATGCTTGGTCAGAGGCACAATTTGCTTTCA
TTGCAGGGAGGTAAGGATGTTGGATGTCAGAATGGTCTTCTTTCAAGAGAAGCTGGTGA
GGGGCAGGAATCAGAGACTCCTTTGCATTATATTTCTCAAGTATCACATTCATGTGGTTC
ACAGAATATGGTAGATAGGGATGGTCAAGATGATCTTTCTAGATATTCAGCTTGTATCTC
GGATTCACGTCTGTCATCTTTTATGGCTTTCCCTGAATCTCCTGGAAACCCAGATGATCT
AAATTCGAAAGTTTTTTTCTCTGATAGCTCGTCTAAAGGCTCAGCAACATGGCCTGAGGA
GAAGCTTCCTCCAACAAGAAAACAGAGCCGATCAAATAGTACTTCTTCTCACTATGACAC
TCTGAAGACTCATCTTGGTAATACCATCCAAGGCCAATCTGGAGCTTCTGCAGCTGTGG
CATGGGGATTAGTAATAGTCACTGCTGGTCACGGGGGAGAGATCAGATCATTTCAGAAT
TATGGCTTGCCTGTTCGTCTTTGAGAAGTCAAGTATTTATAGTTAAGTGAGAGAGCAAGG
AAACTTTACCCTTTTAAATTGTCATTTCAGCATAAATACACATTTGTATTGAATGCTTTCCT
CATCTCAAATTGTATTTAATCTGCGGGCAGTATGGGAAGATTGCCATTCTTTCAGAAAAT
GAAGGTGCATCTTGATGCTTCCAAGCAGTCCTTGCATTTTTTTAGAGTATTGTGGGCTTC
CAACAATGTGGAGTTGGATAAGTTTGACACTCGATCACAATTGTCCAATCTGTTCGTTAT
CATTTTCCTTTGCTACTCAAAAAAGTGTATGGCATCTGATTGGGCCCTCTTATGCCCTGC
TGATCTGCTCTCCATTGAAGTAGAAAAAGTGGGGGACGGATCAATCAATAGCTCTACTGT
GCAACGAACCCTCTGCTCCATTCATCCAACCAGCTGAGAATGTGTGGTTATCAGTACCAC
TGACTATACACTTAAGTTATTCAGTTGATGCAGAGGTGCTCCTAAGATTGTGTCGCCCCT
ATTGCAGCATGTTCAGCAAGGGACCGTTAGCCCTTGCCTTCCCCAAGTATATAATGTGTA
TGGCAAGATGACTCGTCCAACAGCCTTGAAGATGTAGTCAAGAGTGCAAGCGATTTCCA
CTTGTTTAGTTCCCTGAAGCTGGATTTCTTCAGCCAGGCACGCTTAACTAAATTTCGTTTA
GTTCACCATGACTATTCGTTGAACTTAATGAGATTTAGTTTAACGTAGCCAGCCCGCTTCT
GCATTATCTTAGAAACGGATGTATATTGGAACAGACCAGGTGCTTTAGGTGATTAAGTGC
TGGACAAAACGAGGATCCCTTGAAATTCTTGAGAGAACTTTTTGTTGTTGTTGAGATGAA
CAACATTTGTGAGTTATCACAAATGACCATGCTCTTCGTGCCTGAGACTTTTCCCTGTAG
TATAGACTTGAGTTGTCCACGTGGTAGTGATGGTTTTGCTTTAACCCAATGCTATGTCCA
TGTAAACAGATATTTTGGACATTTTTTGATCTTACTAGTACTGCTTAAGAATTTGTAAATGT
AACTGCTGCCATGAAGATCGTTAGGGCTTTGACAGCAGCATTGTTGGTTAGCCAATTTAG
GATGTTATGATTAAATGCTAGTGTTTTCCAAGTGGTTGTAGCATATCCACTGTTATAACAT
ATTCGGGAGTTAAAAGATGAAATATAAAGGGTCCTACAGACAACCTAAGTTTCAATTCTTT
TTTTGGATTCGACATAGGACGGTGGATACAATTGTTATCCTGCAAGCAATGAAATAAATG
CGCACGAGTGCCTTTTTGTCATGCAAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:247
ACGAATCAATAGTATTAAAGGGCTTTCAAATACTTGCTTTGCAAAGCCCGGGAGGCTTG
GGGACATTGTAGATTGACCTAATAGCCGATTCAAATTTATCACACGATGCCAAGCATACC
AGCAATTGGAGAGTTTACAGTCTGCGAGATCAATCGTGAGCTGCTTACGACTAAAGATGA
ATCCGATACACAAGCCAAGGATGCCTATGCAAAAATTCTGGGGCTTGTGTTCCCTCCTAT
TTCTTTCCAGATAGAAGAGGGGTTTGGCAGTGCATCTAGACAGCAGTTTGATCAAGATCT
GGATAGAGAAGACACAATTGTCACCCCAAGCACCAGTGAGGGCACAAATGCCCTGCAAG
AAGGGGGTCTTCTTTTGAAGGGTGTCTCTGTTCTAAAAAATATCCTGGCCAGTTCATTTG
GACCGATTTTTTCTCCTAATGATACAAAAGTACTAAAGAAGGTTGAGCTTCTTCAAGGAAT
CAGTTGGCATAGGCATAAGCATATTCTAGCATTTATATCGGGATCAAATCAAGTTACTGTA
CATGATTTTCAAGATCCAGAGTGGCGTGAGTCGTCTTTATTAGTTAGTGAATCTCAAAGG
GGCATTGAGGCTCTTGAATGGCGCCCAAATGGTGGCACAACACTGTCTGTTGCTTGCAG
GGGTGGGATCTGCATTTGGTCTGCATCATACCCAGGAAGTGTAGCACCTGTGAGGTCAG
GGGTTGCTTCTTTCCTGGGCACTTCTACTCGGGGAAGTAGTGTGCGATGGACATTAGTT
GACTTTCTCCAAATTCCTGGTGGCAAAGCAGTGACGGCACTATCTTGGAGTCCTACGGG
GAGATTGTTGGCCTCAGCATCACGTGAAGATTCATCATTTACAATCTGGGATGTTGCTCA
AGGAGTCGGAACTCCTCTCCGCCGAGGACTAGGAGGAATATCATTGCTGAAGTGGTCTC
CGACAGGGGATTATTTGTTCTCTGCAAAACCAAATGGGACGTTCTATCTATGGGAAACAA
ATACATGGACTTTAGAGCAATGGTCATCTTCTGGAGGCTGTGTTATTAGTGCAACTTGGG
GACCTGATGGACGTATGCTTTTTATGGCCTTCTCGGAGTCCACAACATTGGGCTCACTTC
ACTTTGCCGGAAGACCTCCATCACTCGATGCACATTTACTACCGATGGAGTTGCCAGAAA
TTGGATCCATAACAGGAGGGTTTGGCAACATTGAAAAGATGGCTTGGGATGGATGTGGA
GAGAGATTGGCTGTGTCATACACTGGTGGAGATTTGATGTACGTTGGTCTTATTGCAATC
TATGATACAAGGAGGACACCATTTATATCAGCATCATTAGTAGGATTTATTCGAGGACCA
GGAGAACAAGTCAAGCCTCTTGCATTTGCATTCCATGATAAGTTCAAGCAAGGACCCCTA
TTATCTGTGTGTTGGAGTAGTGGTCTGTGCTGTACATACCCTTTAATATTTCGAGCACACT
GATTGCTGGTCAAAACCCCTTGTAGGGTGGACTTCTGTTGTATCCAATTTTTATGGCATA
ATTAGCTAGTTTTGGTATTTGCATAGATTCCAGAATTCCATTTAGTTTTTTAGCTCGAGAA
GAAAGAGTTCTTTTCAAGAATTGTGTTGTTTTTAAAAATCGATAAGCCTTAATATGATATTT
TGAAGGAAATCACAATGCCATTGTAAAATGTTTGGCTTCCTATTCATATCGTGTCTATGTT
GGTTGCCCTTTCTTAAAAAAAAAAAAAAAAAAAAAAAA
SEQ ID NO:248
CAGCGAAGCTGCTAATTCTACCTGTTTTAGAGCGGGGGTTTTGAAAGGGTTTTAGATTCC
ATCTGTTTTGAAGGGTTTTAGATTTGTCAGGTTTCCGGAAACGTGGATACCTGCATAATG
GAGGAAGAAAACGCAAAACACACGGAGGAGACAAGGCAGGTGCAAGTTCGATTCACAA
CGAAGTTGCAGCCTGCCCTGAGAGTTCCCACCACGTCTATTGCGATTCCTGCTCACCTC
ACCAGATATGGTCTCTCCGATATCGTCAATACTCTCCTCGGCAATGACAAACCTCAACCT
TTCGACTTTCTTGTTGAAAGTGAACTGGTTCGAACATCTCTTGAAAAGCTGCTCCTCATCA
AGGGCATATCTGCGGAAAAAATACTTAACATTGAATATATATTGGCAGTGGTCCCACCCA
AACAAGAAGAGCCATCGTTGCATGATGATTGGGTTAGTGTAGTTGATGGTTCTTATCCCA
ACTTCATATTTAGTGGGTCGTTTGATAGCATTGGAAGGATATGGAAGGGAGAAGGCTTAT
GCACACATGTTTTGGAGGGACACAGGGATGCTATCACTTCTGCTGCTTTCATCATGCCAT
CAGATTCTAGTGATAGTTTTATAAATCTAGCTACTGCCTCGAAGGATCGAACTCTACGAC
TGTGGCAGTTCAAGCCCAATGAGCACATGACAAATGGGAAGATGGTTAGACCATACAAA
CTTTTGAAAGGCCATACCTCATCTGTTCAAACTGTATCGGCATGTCCACGTAGGAATTTG
ATTTGTTCAGGTTCTTGGGATTGCTCTATAAAAATATGGCAAACTGCGGGAGAGATGGAT
ATTGAGAGCAATGCTGGTTCAGTGAAGAAGAGAAAATTAGAAGACAGCACAGAACAGAT
TATTTCTCAGATTGAAGCATCAAGGACGTTGGAAGGACATAGCCAATGTGTTTCTTCCGT
TGTCTGGCTTGAAAAAGATACGATTTATTCTGCCTCTTGGGACCATTCAGTGCGGAGTTG
GGATGTTGAAACTGGCGTTAATTCATTGACAGTGGGTTGTCGGAAAGCACTGCATTGTCT
GAGTATCGGTGGTGAAGGCTCTGCACTAATTGCTGCAGGCGGGGCTGATTCTGTGCTTC
GAATATGGGATCCGCGTATGCCAGGGACTTTTACTCCCATTCTTCAACTGTCATCGCACA
AATCCTGGATCACAGCTTGCAAGTGGCATCCAAAATCTAGGCATCATTTGATCTCAGCTT
CTCATGATGGGACATTGAAGTTGTGGGATGTCAGAAGCAAGGTCCCGCTTACCACATTG
GAGGCACATAAAGATAAGGTGTTGTGTGCAGACTGGTGGAAAGAAGATTGTGTGATATC
TGGCGGGGCTGATTCAACGTTACAGATATTTTCAAACTTAAACCTCACATGAATCGGTGG
ACATTCTCCGGTGCATGTATGGGCATATTTTTGATAGAGTTCGCGATGTTTTAAATTTATC
CAGAGTCAGTATTAGAGCAAGTCCCAATAAAAGTCTTTGG
SEQ ID NO:249
GGCACGACTTTACCCTTGATGTGTGTTAGGGGGCACTGTTTAAAACCCTCGAGGGAGGG
CTGAGGGCTGCTACAGCATCCGCATTTGCATCTCTGCTACGACTAAGGAATCATCCTCTT
CTACAATGCCATGCCGTGATCGGTCGATTGCATTAAGTGCTGCAAGGATCAAATAGTGG
CACTGTCATGAACAGGTTGAGGTCAAAGCGCAATCATATTTTGGAATTAAGGCTAGGACA
ATCAGAACCAGAAAAAGAAGCAACATTGGCGTCTAATCGGAGCAGGGGCACGAATGCCC
CAATAGTAGTTGAGGATGATGATGATGTAGTGGTGTCATCTCCAAGGTCATTTGCTCTAG
CAAGGAGTTCAGTGTCTCAACGAAGTAGTCGCATACCAATAGTAAATGAAGAAGACTTGG
AGCTTCGACTTGGACTGGCAGTTACAGGAAGGACATCAGCAGAGCACAATCCCCGTCGC
AGGCATGGTAGAGTTCCTCCAAATAAGCCAATTGTTCTTTGTGATGATGCTGGAGAAGCT
GATCAAAGTTCTTCCAAGAAGAGGAGGACTGGACAGCAATTAAGCAGTGATGTTCAATCT
GATGAATCAAAGGAAGTTAAGCTCACATGTGCAATTTGTATATCCACGATGGAGGAGGA
GACATCAACTATATGTGGGCACATCTTCTGTAAAAAATGCATTACAAATGCTATACATCGA
TGGAAGAGATGCCCAACTTGCAGGAAAAAACTTGCAATCAACAACATCCACCGTATCTAT
ATTTCTAGCAGCACTGGTTAGCTATTCATCGTTGTACTTGTTGCCAAATTCTGCTAGTTCG
TAGGAGAAGCTATGTGTTTGTACTTCCTTATGTGTGACATAAGAGAATGTCAAAAGAGAC
ATTTCTCGCACTTGTTCTTTATCTGCACGTTAAAATGTTTTTAGAAACATCTGCCAAAGCA
CTGATGATGATATTTACAACTTGAATGATGACGAAGGCTATTGGAACAGCCATTGGGATC
TATTTAAAAAAAAAAA
SEQ ID NO:250
AAATGCGGAGTGACAATTGAACTCTTCCCATTCCTTCAGTGCGCGGAATATCTTGGGGAA
AAAAAACCGCTGTGTTCATTTTTGTTGATCAACAATTGCTTTTTATCATCGCCACGAAGTA
CAGGGTAGGTTTAATTCGCTTTTGTTCAAGGAAGCCTGCGTGATTTTGTCGATGGTAATC
TATTCTCCTTGGGAGGTGCCCCAAATTAATGTTAATATCAACGGGAGTGGAATTTTCGTT
CCCAGCAGGCGCGAAGAGGAAATAACTAAGTTGCTCTGCAAGCACTGCATGAAATCGCA
TTAATTAAACCTTATTTTGGGTTCTTGAGTAGTGCTATCTATAAAATAATACAAAACTCGCA
ACAATAGAAGGCTCCCAGGTCACTATGGAAGAGCCACCCCCACCTGCGGTGTTACCGTC
ATCTGAGGATACCTCCATTGTTAGCTCGCATAGCTTTGTCAATGCACCCCCAACTGTCCC
TGTTGGTTTAGATGCTTCCATTCCCCAAATTTCTACGCCGGGAATTAACCAACCTGGCTT
AACCATCCCTGTACCACCCGAAGCAGCTCCTCTAACTGCGTCATTAGTTGCTGCTTCTGC
TGGCATGCCCCCTGCAGTTGTACCTTCCTTTGTCCGCCCAGCTATCGTAGCCCATCCAT
CGGTTATGCCCCCGCCCAGCATGCCTCTTGCAGCCTTGCCCATGCCAGTTGCTTCTGCT
GTACCTGTTGCTGCACCACATTTCCCCCCTTCAACCCCAAACGATAATTCTATTACTCCAT
CTATGCCCGTTCCTACTCCCATTGTAGCCTCATCTAGTGTGCCGCCTTCGGTCACAATTC
CTGGAATTGCTCCCCTGCCATTTATTGCTCCTATTCCTGTGCCTTCGTCGCGCCCTGTAG
CTCCATCGCCCTTTATGCCTCCTGCTCGTCCCTTAGGAGCTAGTGTTTCCGTTGCAATGG
ATGTAGACAATACTGATGAACAAGATCAAGATGCAGACAACAAGGGAGAAAGTCCTTCGT
CAAGTCCCGACCATCCAGAAGATCCTTCAGCAGCTGAGTATGAAATTACAGAGGAGAGC
AGAAAGGTTCGAGAGAGGCAAGAGCAGGCTATTCAGGAGCTCTGCTGCGGAGGCGTG
CCTATGCATTGGCAGTCCCTACCAATGACTCTTCGGTGCGTGCTCGTCTTCGTCGCTTG
AACGAACCTATTACCCTGTTCGGTGAGAGGGAGATGGAGAGACGTGACCGGCTCAGAG
CGCTCATGGCGAAGCTTGATGCGGAGGGACAGCTTGAGAAGCTCATGAAGGTTCAGGA
AGAAGAAGAAGCAGCTGCCAATGTTGATGCTGAAGAGGTCCAGGAAATGGAAGGGCCT
CAGGTATATCCTTTTTATACTGAAGGCTCCCAGGAGCTGCTAAAAGCTCGGACTGAGATC
ACGAAATTCTCACTCCCTCGGGCAGTCTCGAGGTTACAGAGAGCCAGGAGAAAGAGAGA
GGATCCTGATGAGGACGAGGATGAAGAGTTGAAATGTGTGTTGCAGCAGTCAGCCCAGA
TCAATATGGATTGTAGTGAAATTGGAGATGATAGACCTCTTTCTGGATGTGCGTTTTCTTC
AGATGGAACTCTGCTAGCAACGAGTGCTTGGAGTGGTGTCACAAAATTATGGAGTGTAC
CAAACATAAATAAGGTTGCTACTTTAAAGGGACATACGGAACGAGTTACTGATGTGGCAT
TTTCTCCTACAAACTGCCATTTAGCGACTGCCTGTGCTGATCGTACTGCAATGCTATGGA
ATTCTGAGGGAGTTCTGATGAAAACATATGAAGGCCATTTGGATCGTCTTGCTCGTCTCG
CATTTCACCCATCTGGGCTATATCTGGGCACTGCTAGCTTTGACAAGACGTGGAGATTGT
GGGATGTTAACACTGGTATTGAATTGCTCTTGCAAGAAGGCCACAGCCGAAGTGTGTAT
GGGATTGCTTTTCAGTGTGATGGTTCATTAGCAGCAACCTGTGGATTAGATGGGTTGGC
ACGCATTTGGGATCTTCGTACGGGAAGAAGCATTCTTGCTCTAGAAGGCCATGTGAAAC
CAGTCCTTGGCATAGATTTCTCGCCAAATGGTTACCACTTAGCAACTGGTAGTGAGGATC
ATACCTGTCGCATTTGGGACCTAAGGAAGAGGCAATCAGAGTATATATCATACCTGCTCATT
CTCACCTTGTTTCACAGGTCAAATTTGAACCACAGGAGGGATATTTCTTGGTCACTGCTT
CATATGACAGTACTGCTAAGGTGTGGTCAGCACGGGACTTCAAATCCATCAAGGTATTG
GCAGGACATGAAGCAAAGGTTACGAGTGTGGATATTACAGCAGATGGGCAATATATTGC
TACTGTGTCACATGATCGTACCATAAAACTTTGGTCCAGCAAAAACAGCACTAATGATAT
GAATATTGGTTGACACCATTACAGATGGGCGATATGTTTCCAGTGTCACATGATCGTACT
AGAAAATTTTAGTCCAGCAAAACCATTAGCAACACTAATGATATGAATATTGGTTGACATC
ATGATGGGTAGTGTTGTTATGTCGTCTCTGTAGGCGATGTGAATGCAATTTGTTGCTGCC
TTCGCAGTATTCAAAACAAAACATTGAAGCAAATGCTGCTGATCCAACGTTTACAGTACTT
AATCTTCCTATAATGCAATCTGCGTTGAGAACATTCTTTTTTCTGAGATATCCATGACATG
TATATGATAAGTGGGCAGTTAGGCTTCACTTTCAGTCTTGCTTCAGCTACTCGGTTGGCA
AAAATTTTGTGAAGGGAAAGGGCAACACATCACTGGACATAATTTTAAAATATGATGCAG
AACTTTTGTTCACGAATGCCTCGTGTACTTTATACAAATTACATTTGGAATGCTAAGATGT
AAATCTACTGCTACTTGAAAAAATGCTTCTTGAACTGATATGCTCCAGG
SEQ ID NO:251
CTTTTACTCTTGCATTTATGCCATCTGAAAACCCGAGGTGTAGTTAATGCACCTCGGCAG
AAGCCAAGATCTCCTTCGGGCTTTGCCGAGAAGTACGAGTGGATTGAAAGGAATTTGGA
GAAATAGGAGCAGGAAGTGAACGGATTTTAAAACTTACAGGATCCTGTGACATGGGCAG
GTTTTTTTGAATGGTAATGGGCGGTTTTTGATGTCAGATTTGGTGCTGTGAAAACATAAG
AGCCGGCATTGTGCATTTCATAGTTTTTGGTGGTGGGGATTGGACTGAGAAGAGTTATG
CGAAACCAGCGATATAGAAGTCCAACACTGATTTCAGAGCTTTTCTCCGGTCACCAAAAC
CATGAAGCGGGCTTACAAATTGCAGGAGTTTGTTGCGCATGCTTCCAATGTCAACTGTCT
CAAGATTGGGAAGAAGTCTTCCAGAGTTCTGGTGACGGGCGGGGAAGACCACAAAGTG
AATATGTGGGCTATTGGAAAACCGAATGCCATTCTGAGTTTATCTGGTCATTCAAGTGCT
GTGGAGTCTGTGACTTTTGATTCTGCAGAAGCTTTAGTTGTCGCTGGAGCTGCTAGTGGT
ACAATAAAGCTATGGGATTTGGAAGAAGCAAAAATTGTTCGGACACTCACTGGTCATAGG
TCCAATTGTATATCAGTGGATTTCCATCCATTTGGGGAATTTTTCGCATCTGGCTCCTTGG
ATACAAACCTAAAAATCTGGGATATTAGACGTAAGGGTTGCATTCACACTTACAAGGGGC
ACACTCGTGGTGTTAATTCAATCAGATTTAGTCCAGATGGTCGTTGGGTGGTGTCAGGTG
GGGAGGACAATATTGTAAAGTTATGGGATCTAACTGCTGGAAAGCTCATGCACGACTTCA
AATGCCATGAGGGTCAGATACAGTGCATGGATTTCCATCCTCAAGAGTTTCTTCTTGCTA
CAGGCTCAGCAGACAGGACTGTGAAATTCTGGGACCTTGAGACTTTTGAGCTTATTGGA
TCAGCTGGTCCTGAGACAACTGGAGTTCGTGCCATGATTTTCAATCCGGATGGAAGGAC
TCTGTTAACTGGATTGCATGAAAGTTTGAAGGTGTTTTCCTGGGAACCTTTGAGATGCTA
TGATGCAGTCGATGTTGGTTGGTCTAAATTGGCTGACCTCAACATACACGAAGGAAAGCT
TCTTGGTTGTTCATACAATCAAAGTTGTGTTGGTGTATGGGTTGTGGACATTTCGCGGGT
GGGGCCATATGCTGCTGGAAATGTATCAAGAACAAATGGCCATAATGAAGCAAAATTGG
CTTCCAGTGGTCATCCATCTGTCCAGCAATTAGATAATAACTTAAAGACCAATATGGCGA
GGCTTTCCTTGTCACACAGTACAGAGTCAGGAATCAAGGAACCAAAGACTACCACATCG
TTAACTACCACTGAAGGTCTTTCTAGCACACCTCAACGAGCTGGAATAGCCTTTTCTTCA
AAGAATCTTCCTGCAAGTTCAGGTCCACCGTCATATGTCTCGACTCCAAAGAAAAATAGT
ACATCAAGGGTGCAGCCTACAACAAATTTTCAAACCTTAAGTAGACCTGATATAGTGCCT
GTCATTGTCCCTAGAAGCAATTCATTAAGACCGGAAACAACATCAGATGCTAAGAAAGAA
ATGAACAATTTTGGAAGAGTGGTTCCATCTACAGTATCAACCAAATCAACTGATGTGATTA
AATCTGGCAGCAACAGAGATGAATCTGACAAGATAGACTCCATAAATCAGAAGCGCATG
ACAGGCAATGACAAAACAGACCTAAACATTGCTAGGGCTGAGCAACACGTTTCCTCTAG
ACTTGACAATACAAACACTAGTTCTGTTGTTTGTGATGGAAATCAACCAGCGGCAAGATG
GATTGGTGCAGCCAAATTCAGAAGAAATTCACCAGTAGATCCAGTTGTAAGCCCACATGA
TAGAAGTCCTACTTTTCCATGGTCTGCAACTGATGATGGAGTTACATGTCAGCCAGATCG
ACAAGTTACTGCACCTGAATTATCAAAAAGAGTGGTAGAGCCTGGTCGTGCTCGTGCTCT
GGTTGCAAGTTGGGAAACACGAGAGAAGGCTCTCACCGCAGACACACCTGTGCTGGTC
AGTGGTCGCCCCCCCACAAGTCCTGGAGTGGACATGAACTCATTCATCCCGAGAGGAA
GCCATGGGACTTCGGAAAGTGACTTGACAGTTAGTGATGACAACAGTGCTATAGAAGAG
CTCATGCAACAGCACAATGCATTTACAAGCATTCTTCAAGCTCGCTTGACTAAGTTGCAG
GTAATAAGGAGATTTTGGCAAAGGAATGACTTGAAAGGTGCTATTGATGCTACGGGAAA
GATGGGAGATCATTCGGTATCTGCTGACGTTATTAGCGTACTGATTGAGAGAAGTGAAAT
CTTCACGCTGGATATTTGCACAGTCATACTTCCATTGCTTACTCGGTTGCTTCAGAGCGA
GACTGACAGGCACCTTACTGTCGCTATGGAAACCCTGCTTGTGCTAGTGAAGACTTTTG
GTGATGTTATCCGGGCAACTATATCAGCAACCCCAACAATTGGAGTTGATCTCCAAGCAG
AGCAAAGGCTTGAACGCTGTAATCTGTGTTATGTTGAACTGGAAAACATCAAACAGATTC
TTGTTCCTTTAATCAGGAGAGGTGGAGCTGTTGCCAAGTCTGCACAAGAGTTGAGTTTAG
CTCTTCAAGAAGTGTGACCCACTCTATTTAGGTTGACATTTTTTTTTGTAATAAATCTTCTT
TGACGGGAGGCTGTTTGTATATTGGGAAACACTGCATTGTTTGCAATGATAGTGGCAGT
GTGGTTGAATATGCACGGCATCAATTGCAGTGGAATTCTTATCTCAGGTCATTGGAGTGT
TTAACCATTCCTATGTTGTCAGCCACATCTTGTCAGCCACACCTTGATTCTTAGATTAAGA
AGTTACTAATTTGTAGATAAATTCTAACGAAGGTGATGATAGCATACACGTAATGAAATCA
CAGAGAATGTCTACGCAGTTGACACTGCAACTAGTCTTGCGTGAACCTGTAGTTGCGGC
ATGCTTCAACCTTTGTTGTAATATTTAAAATATTTGAGTGGCTTAAGCACTCAATTTCCTGT
TTGATCAAGTCAGTTGCTAGATTCTTCAGTTGCTTTTCTCCGTTCATACAATCAGCAATTG
GATATTGATGAATTTCATTACAGAGTTTGATAATCTGTGAGCCATCATATATTAATTGTACA
TTCATAAGCATCAAGATACGTAATTTAAATGATTTCCTTGTGTCTGAAAAAAAAAAAAAAAA
AAACTCGAGACTAGTTCTCTCGTGGTGCGATCCTCTACTCCAAGGGCGGAAAGGCCGTT
GCAAGTAAGTTCCATTGGGAGAGTCTGGGTGATTGAAACCCGCCAGGAAGACGTACTGC
AGTATAATCTCTGCTCCGAACAGGACCTTGATCTGCGATATCAACCTCAAAACTCATACA
GCACTGAAGAAAAAAACTGTTGACTAGGAATGGAGTTGAACTCAGGTCCTGAAGTTGTTT
TTGATTGTTCTATGTCTACAAAAAAATCATGCTCTGGGGCATCTGCAAGAGAATGGCGAA
ATTTCATGTCATCCAACCCATCTGATATGTTCAAACGCACACAATCCAGGAGATTAACTG
AGCAGGCGTCTTCTCATGAAGAGAAGGAGATATTCGACACAATATTTAAAGAAGTTGAAG
AGTTAGGAGTTTCACAACTGCCCTGGAAAAAACGGAAGGCCATGGAGAACCAAAAGGTG
TTAACTTTAGGTGGAAAGCCTTCAAAGAATATCAAAACATCCATTTCCGTAGGTAAACATA
TTAGGAAGAAGCAAAAACGTACGAAGGAGGCGGCAATTAAGAATGAGGAGATTCTTCTC
GGTCGACCTGCAAAACGTGCAAAGAAAACTGAAAATATACGCAAGAGAGAGGATCGAGG
GCTGATGGCATCCGAAGGATATTTCAAATACGGGATTTTGCATGTTAAGCCTTTGGAGAA
AAATGTTGAAGAACGTGAGGAAAATGTATACAGAATGGGAAAAACAAAGAAGAAAGGCA
AATCAAAGAAACACAAGGGAAAAAAGAAGCGTTGATAGTTCTGCATTTCTGAGATGAAGC
ATCTTTCTTTTTATTGTTGGGCTTAGCTTTGGCACTATTGGAATATTTTAGCATCCAGTTTG
CCTTCAGCTATTACAAAAATATTTGAAAGGTTTTGTTTTAAATTTTTTTTTTTTCACAATTTA
TGGATGAGGTACTCCTTATGAATATCTTCAAACTAAGAAATAACTATATATGCAAATAAAT
AAAAACGATTTGTAATTTTTCAATTTCCAAAAAAAAAA
SEQ ID NO:252
GTGTGCAGTTTCACTACTCCCCCACACACTCTCTCTCTCTCTCCTTTTCCCCCAAATCAG
AAGAAGAGACGACGACTGTGTAGTAGTGAGAGAAGCGGCCGCAGACCGGGCGGTACTG
AAATTTTGATGGAGGGTTTCGGGGGAATCTCTCTCTGCTGCTGATCTCCGGGAGGGCTG
GTTCGTCTTCGGCTTCGGGAGGGGGAGGAGCGGGAGATCGTGATTTCGAAGGAGGCAG
AGATGGCCGGATCGGACGAGAACAACCCGGGCGTTGTCGGAGGTGCGCACGTTCAAGA
GGGCTTGCGGGTCGGAGCGGGGAAGATGGGGGCGGGGAATGTTCAGCAGAGACGAGC
TCTGAGCAACATCAACAGCAACATCATCGGGGCTCCTCCTTATCCATGCGCGGTCAACA
AAAGGGTCTTGTCCGAAAAAAATGTCAACTCTGAAAACGATCTCCTCAACGCTGCTCATC
GGCCAATTACTAGGCAGTTTGCTGCTCAGATGGCTTACAAGCAGCAACTTAGACCTGAG
GAGAACAAGAGGACGACCCAATCAGTCTCAAATCCCAGCAAATCTGAAGATTGTGCCAT
CTTAGATGTGGATGACGACAAGATGGCTGATGACTTTCCGGTGCCAATGTTTGTGCAAC
ACACCGAAGCAATGTTAGAAGAAATTGATCGGATGGAGGAGGTTGAGATGGAAGATGTA
GCTGAAGAACCTGTCACGGACATTGACAGCGGTGATAAAGAGAACCAGTTGGCTGTTGT
TGAGTACATTGATGACCTATACATGTTCTATCAGAAAGCCGAGGCTTCTAGTTGCGTTCC
CCCAAACTACATGGATCGGCAGCAGGATATTAATGAGCGGATGAGAGGTATACTAATTG
ACTGGCTGATTGAGGTTCATTACAAGTTTGAATTGATGGATGAGACCTTGTATCTTACGG
TCAATCTCATCGATAGATTCTTAGCTGTTCAACCTGTAGTGAAGAAAAAACTCCAGCTAGT
AGGGGTAACAGCCATGCTTCTGGCATGCAAATATGAAGAGGTCTCAGTTCCAGTAGTGG
AGGATCTCATTCTGATTTCAGACAGGGCTTATAGCAGGAAAGAAGTTCTAGAAATGGAGC
GGTTGATGGTGAATACTTTGCACTTTAACATGTCAGTGCCTACTCCTTATGTTTTCATGAG
GAGATTTCTTAAAGCCGCTCAATCTGACAAGAAGCTCGAGCTCTTGTCATTCTTCATCAT
CGAGCTTTCCCTGGTTGAATATGACATGCTGAAGTTCCCACCCTCTTTATTAGCTGCTTC
TGCAATCTACACTGCTCTGAGTACAATTACCAGAACTAAACAGTGGAGTACAACATGTGA
ATGGCACACCAGCTACTCAGAAGAACAGCTTCTGTAAGTTACTTAGTTTTTCCTTCTTCCA
CACCTTGAGATGGCTTGGTTTTTGTTGAGCTTTCTATTTCAAGCAATTTGTGATTGGGGG
GTTCTGCATTCTTTGTAGGGAATGCGCCAGATTGATGGTGACTTTCCATCACAGGGCTG
GATCAGGGAAGCTCACTGGTGTGCACCGGAAGTATAGCACATCCAAGTTTGGTCATGCT
GCGAGAACTGAGCCTGCTAACTTTCTCTTGGACTTCCGTTTGTAGTGGGTTGCGTGTGT
CGTTGTGTACCTTG1CCTAAATCAAACATAAAAAACTGACTTTTGGGCCAGGGTGGGGGG
CTTAAAAATATAGCAACATAAGTTGCCCTGCGAGTG
SEQ ID NO:253
CCCTAAAACTCATTTCTTCCTTCCCTTCCAAGCTCTCTCTCCTCTGTTTTCTCTGCGCATG
CAAGCTCCACGAGAGGGAAAATCGGCGGCGGCCATCGTCGGAATGGGCAAGTACATGA
AGAAATCCAAGGCAATCCCCCGCGACGTCTCGCTCCTCGAGGCCTCCCCGCGCTCCCC
CTCCGCCACCGGCGTCCGGACCAGAGCCAAAACCCTAGCCTCGCGGCGCCTCCGGAG
GGCCTCCCAGCGGCGGCCGCCGCCCCCCGCTGCCGCCGCCGCCGCCGCCGCGCCGA
GCTTGGACGCCTCTCCCTGTCCGTTCTCTTACCTCCAGCTCCGGAGCCGGAGGCTCCG
GAGACCCCGTCTCGCGCCCTCCCCGGAGGCGAGGATCGACGAAGGACCCGCGGGGAG
CGGTTCTAGGGGAAGCCGCGACGCTTCGTGCTCGGCGAGGACGGCATCGTCGAGCGG
TGGAGTGGAGGGAGAAGGAGCGTGCGTCGGTCGGGGTGACAGGGGAAACGGGGGGG
AGTGTGTCCGCGATGCGGCTGTCGACGCGTCTTATGGAGAGAACGATTTGGAGATCGAA
GACAGAGACAGGAGCACAAGGGAAAGCACGCCGTGTAGCTTGATAAGGGACTCGAACG
CAAACACGCCACCAGGGTCAACCACAAGACAGCAGAGCTCATGCACTGCTCACAGAACT
CAAATGAGCATACTTAGAAGTATCCCGACCTCAGATGAGATGGAGGAGTTCTTTGCATAT
GCCGAGCAACGACAACAAAGATCATTCATTGAAAAGTACAACTTTGATATTGTCAAGGAT
CGTCCCCTCCCAGGTCGTTTTGAATGGGTGCAAGTAATTCCATGAGATTTGCATTGATGC
TGGACGAGGAGGTGTATCGCATACGGAGTCAATGTACTGGAGTAGTAAAGAGCCTGAAC
TGTGACATAACAGACGACGAGCACACTGATGAGATATTAAATTAAAGTCCCTTGACAATG
TCTAAAGAGCCGTGATCTATGAGTAGATTAGAAACCGCCTTTTTAGTTGCAAACGCCATT
GGAGTTTGTTGAGAAAGGAGAATATACCCCTTGTACATAACTTGTTTCTTCAGTATATGCT
CTTTTCTCCTAAAAAAAAAA
SEQ ID NO:254
GCCAACTCGACAGAGAGAGAGAGAGAGAGACAACACGGTTGAAATGGACGGTCACTCC
TCGCACCTCGCCGCGCAGAACCGTTCGCGCGGCTCCCAGACCCCCTCCCCCTCCCACT
CCGCCGCCTCCGCCTCCGCCACCTCCTCCATCCACCTCAAGCGCAAGCTCTCCGCCGC
CAACGCCAGCGCCGCCTCCGCCGCCGCCGCCGCCGCCGCCGCGGCCGCAGCCGCCG
ACGACCACGCCCCACCCTTCCCTCCCTCCTCCATCTCGGCCGACACCCGCGACGGCGC
CCTCACCTCCAACGACGACCTCGAGAGCATCTCTGCCCGCGGCGGAGGGGCCGGCGA
CGACTCCGACGACGATTCCGACGATGAGGAGGAGGACGATGGCGATAACGACGGCGG
ATCCTCTCTCCGCACGTTCACCGCGGCTCGGCTCGAGAACGTGGGCCCGGCCGCGGCT
CGGAACAGGAAGATCAAGGCCGAGAGCAATGCGACGGTCAAGGTCGAGAAGGAGGACT
CCGCCAAGGACGGCGGTAATGGTGCCGGCGTGGGAGCTCTGGGGCCAGCTGCGACCT
CCGGCGCCGGTTCGGGCTCCGGAACTGTGCCGAAGGAGGATGCTGTGAAGATATTCAC
CGAGAATTTACAGGCGAGTGGGGCTTACAGTGCCAGAGAAGAGAACTTAAAAAGAGAGG
AGGAAGCAGGAAGACTCAAGTTTGAATGCCTTTCAAATGATGGTGTTGATGACCATATGG
TTTGGTTAATAGGATTAAAGAATATCTTTGCCAGGCAACTTCCTAATATGCCGAAGGAATA
CATCGTGCGTCTTGTTATGGACAGAAACCACAAGTCAGTAATGGTTATCAGACGGAATTT
AGTTGTTGGTGGTATCACTTATCGCCCATATGCCAGTCAAAAGTTTGGTGAGATAGCTTT
CTGTGCAATTAAGGCTGATGAACAAGTAAAAGGTTATGGCACAAGGCTGATGAATCACTT
AAAACAGCACGCTCGTGACGTTGATGGGCTAACTCATTTTCTGACCTATGCTGACAACAA
TGCTGTTGGTTACTTCATCAAGCAGGGTTTCACGAAAGAGATATACCTGGATAAAGATCG
ATGGCATGGGTATATTAAAGATTATGATGGGGGAATTCTTATGGAATGCAAAATTGATCC
CAAACTTCCTTATACAGATCTATCAACCATGGTTCGCCGTCAAAGGCAGGCGATTGATGA
AAAGATAAGGGAACTCTCTAATTGTCATATTGTCTACCAGGGGATTGATTTCCAGAAGAG
AGATGCAGGGGTTCCCCAAAATACCATCAAAATGGAGGATATCCCTGGCTTGAGGGAGG
CTGGGTGGACACCTGATCAATGGGGCTATTCTAGGTTTAGAGGATTGAGTGATCAAAAG
CGGTTGACTTTCTTTATTCGCCAACTTCTGAAGGTATTGAATGACCATAGTGATGCTTGG
CCATTCAAGGAACCAGTTGATGCTCGTGAGGTCCCTGATTACTATGACATAATTAAAGAC
CCTATGGATTTGAAGACAATGACCAAGAGGGTCGAATCAGAGCAATATTATGTTACGCTC
GAGATGTTCATTGCAGATGTCAAGAGGATGTTTGCTAATGCACGCACCTACAATTCCCCC
GACACTATATACTTCAAAATTGCAACAAGGTATACTTAGTCAGTCCTTGTTATGTATGTCT
TTTGTCAACCCTTCAGGGCAGATAAAAATGATTTACTGGATGTGCAGGCTGGAAGCTCAT
TTCCAGAGCAAGGTACAATCGAATCTCCAGTCTGGTGCCGGAAAAATTCAACAGTAGAG
CATTCGGTAGACTGGAGGCCCTGACCTTACTTCTCTCTATATGAATATGTGGAGCCTTGG
ATACTTACTCTGATCCATGATTGCGCTGGGGAATTAACTAGCTTCGATTGACCATGTAAC
TGAAGACTGATAGTCATATTCCCCGAAAACTGAAATTTCAGTTCCCTTAGTACAATGTAAT
TAAGATGTCTTCACTTTCTAACTCTGGTAGATGGAAATTTAACCAGATTGTGATATTGATA
TATCCGTTCACCTCAAAAAAAAAA
SEQ ID NO:255
CCGTACCTCCGTCTTCCTTTCTATCTCCATTCGAAGCCGATCGAATAAAAACCCTAGCCA
TCCGAACTCGACTCGCCGGTGAGCTTTCTTGATTCGGTCGGGTCCGGGATGTTCAACGG
AATGATGGATCCCGAGCTCTTCAAGCTCGCGCAGGAGCAGATGAACCGCATGTCCCCC
GCCGAGCTGGCCAAGATCCAGCAACAGATGATGTCTAATCCTGAATTGATGAGAATGGC
CTCTGAGAGCATGAAGAATATGAGGCCTGAAGATTTACGACAAGCGGCAGAACAATTAA
AGCATGTTCGTCCTGAGGAGATGGCTGAGATTGGTGAGAAGATGGCTAATGCTAGCCCT
GAGGAGATTGCAGCTGTACGCGCTCGTGCGGATGCCCAGATGACTTATGAGATCAATGC
AGCTAAAATTCTCAAGAAAGAGGGGAATGAACTTCATAGCCAGGGGAGGTTTAAGGATG
CCTCGCAGAAGTATTTGCGTGCCAAGAATAACTTAAAAGGAATTCCGTCGTCTGAAGGCA
AGAATCTTTTATTGGCATGCTCCCTTAACTTGATGTCCTGCTACTTGAAAACAAGGCAGTA
CGAGGAATGCATAAAGGAAGGCTCTGAGGCTTTAGCATGTGAGGAGAAAAATCTCAAAG
CTTTCTACAGGAGGGGCCAAGCATATAGAGAATTAGGTCAATTGAAGGATGCGGTCTCT
GACTTGAGAAAGGCACATGAAATTTCTCCTGATGATGAAACAATTGCGCAGGTTCTAAGG
GATACTGAGGAAAGTTTGACCAAAGAAGGTGGTTCTGCACCAAGAGGGGTGGTCATTGA
GGAAATAACTGAAGAAGATGAGACTTTGGCCTCTGTGAACCATGAAAGCCCATCAGAATA
TTCAGAGAAGCGGCATCAGGAATCAGAGGATGCCCACAAGGGTCCAATTAATGGTGATA
TTATGGGTCAAATGACCAATTCTGAAAGCTTGAAAGCTTTGAAAGGTGACCCAGATGCAA
TCAGGTCATTCCAGAATTTTATTTCTAATGCTGATCCCACAACTTTGGCTGCAATGGGTG
CGGGAAATGCTGGAGAGGTATCCCCTGACTTGATTAAGACCGCTTCTAGTATGATCGGC
AAGATGTCAGCAGAAGAACTCCAGAAGATGATTCAGCTGGCTTCATCGTTTCCCGGGGA
AAACCCTTATGTCACAAGAAACTCAGATAGTAATTCCAATAGCTTTGGAAATGGATCAATT
CCTAATGTGTCGCCTGACATGTTAAAAACTGCGAGCGATATGATGAGTAAGATGTCACCT
GATGATCTTCAGAGGATGTTTGAAATGGCATCTTCTTCGAGAGGGAAGGACCCTTCTCTG
GATGCTAACCATGCAAGTTCAAGCTCTGGGGCGAATTTGGCCGCCAATTTGAATCATATT
TTGGGTGAAAGTGAACCGAGTTCGTCTTATCACATACCTTCAAGTTCAAGGAATATTTCAT
CTTCTCCTCTATCAAATTTCCCATCATCACCAGGTGATATGCAAGAGCAGATAAGAAATC
AAATGAAAGACCCAGCTATGAGGCAGATGTTTACATCTATGATGAAGAATATGAGTCCAG
AGATGATGGCAAACATGGGCAAACAGTTTGGACTCGAGCTTTCTCCAGAAGACGCTGCA
AAAGCCCAGGAAGCAATGTCTTCTTTATCACCAGAGATGTTGGACAAGATGATGCGCTG
GGCAGACAGGGCTCAAAGAGGAGTCGAGACAGCTAAGAAGACCAAGAACTGGTTGCTT
GGGCGGCCCGGTATGATTTTGGCCATATGCATGCTCCTCTTGGCAGTGATCCTTCACCG
GCTTGGCTTTATCGGGAGCTAGAGCGGCGGAATCCCTTTGCTCGTTCAAGCTTTATGCG
ACAAGTCGGCCAGAAAGCAGTAGCATCGGATTGGAACACGCAATGCATCTTCGTCAATC
CATCTTCAGATGAAATTGGCTCCCCTAGGCAGGGGACGTTCTTTAGTCGTCGGAAGGTC
TGTCGGATAGAATGGGTAGAACATAGCATATACCCATGAAGTGCTGGTGGTTTGTGATTT
ATGCTGACGGTTGGACTTCCTTCCATTTAACATAAATTGTACTGCTTGTAGAGATAAAAA
TGTCCGAGCTGGAATCTCGCAAGAGATTCTGATCAAGAAATTTCATTTTTTCTTTTTAGTC
TGGTCCTATTCTATATTTATTTTACAAAAAGAAATCCATTACTTGTTCTTTAAAAAAAAAA
SEQ ID NO:256
AAAGAGAAGCAAAAACCTGGCCTCATCGGCGCTTCTCCTCCTTCTCTGAACTCCACATCA
GGCTCGTCAGTGGCAGAATTCCCGTCTTCTTCTTGTTGACCGCCGTTTGGTTACTGTACT
CTCATCTACCGGTGGTGCCGAAGTATTGGAGGGGAGGGGATTTAGGAAGGTGATCTGG
TTATTGAATTGGGGTCGTTTGTCTGTTGAAGGGATGATAGCGGCGATATCCTGGGTCCC
TAGAGGGGCTTCGAAGGCGGTCCCCGAGGTGGCCGAACCGCCTTCCAAGGAGGAAATT
GAAGAGATTTTGAAGAGTGGAGTTGTGGAAAGAAGTGGAGATAGTGATGGTGAGGAGGA
TGATGAAAACATGGATGCAGTTGCTTCAGAAAAGGCTGATGAAGTTTCCACTGCATTATC
TGCCGCTGATGCACTCGGGAGAATTTCCAACGTGACAAAAGCTGGATCTGGTTTTGAGG
ATATAGCTGATGGTTTGAGGGAGCTTGATATGGATAATTACGATGAAGAAGATGAAGATG
TCAAGCTATTTAGCACTGGACTTGGTGACTTGTACTACCCAAGTAATGACATGGATCCCT
ACCTCAAGGATAAGGATGATGATGATGACACTGAAGAGATTGAGGACCTGTCTATTAAAC
CAATGGACTCTCTCATTGTTTGTGCACGTACAGATGATGAAGTCAATCTTCTTGAGGCGT
GTACATTCATTAGATAATGCATTTTTTTTGCTGCACTTTCCCAGGCTGTACTGGGGTTCAC
ATAAAAACCGCTACTGATTGGTCAATTCATTGCATAAAGCAGGTCTATTTATTGGAGCCAT
CATTATCTGATGAATCAAATATGTATGTTCACCATGAAGTAGTTATTTCAGAATTTCCCCT
CTGCACAGCATGGCTTGACTGTCCAATTAAAGGTGGCGACAAAGGAAATTTTATTGCCGT
TGGCTCAATGGAGCCTGCCATTGAGATTTGGGATCTTGATATTATTGATGCTGTAGAACC
ATGTCTTGTATTAGGCGGTCAAGAAGAGTTGAAGAAGAAAAAGAAGAAAGGAAAGAAGG
CATCGATTAAGTACAAGGAGGGTAGTCACACGGATTCAGTGCTTGGCCTTGCTTGGAAC
AAGGAGTTCAGGAATATACTTGCCAGTGCAAGTGCTGACAGGCAAGTCAAAATTTGGGA
TGTTGCAGCTGGAAAATGCAATATTACCATGGAACACCACACCGACAAGGTTCAAGCAG
TTGCATGGAATCATCATGCTCCACAAGTTCTTCTTAGTGGATCTTTTGATCATTCGGTAGT
CATGAAGGATGGAAGAATACCTTCACATTCTGGATATAGATGGTCTGTGACGGCAGATGT
CGAAAGCTTGGCGTGGGATCCACATTCTGAACACTTCTTTGTGGTGTCCCTTGAAGATG
GCACCGTTAGAGGATTTGATGTACGAGCCGCTATATCTAATTCTGCCTCCCAGTCACTGC
CAAGTTTTACTCTTCATGCGCATGAAAAAGCTGTCAGCACAATCTCATACAATCCTGCAG
CACCGAATCTTCTCGCAACAGGGTCGACAGATAAGATGGTTAAACTCTGGGATCTGTCC
AATAACCAGCCTTCATGTATTGCTTCAAGAAATCCAAAAGCTGGTGCTGTCTTTTCTGTTT
CCTTCTCGGAGGATAGTCCCTTATTACTTGCTATTGGAGGTTCAAAGGGGAGACTTGAAG
TATGGGATACATCATCTGATGCAGCTGTATCCCGACGATTTGGAAAACATGGCAAGCCG
AAAACAGCAGAACCTGGTTCTTGATGCTGCCAAGTGAGAACATGTTAAAAATATCTTAGA
TCTTCTCAATGTCGGCCTTGCTTAGTTCTTAGAGCGCATTGAAATGCCTGAAGTCAGGGG
TAGTTCTGCAAGTCGAGTCATGTTCTTTTAGCAGCTTGTTATTTAGCATTTTAAGTGTCTT
CGTATTATTTTGGGTGTTGAGAAGGAAAAAAGAAAGTGTTTTTTCTTTTTCTTGGGGAGG
TGGGGGCTTTGGCGTGAGGTTACTGCAAGTATGCAAGAGCATCACAAGGAAAAGGTCCC
AGGGAAAAGTGTTGAACATGGTAAATGAAGCTAATGCAATGGTGTCTTTTTGACTCCGAA
AAAAAAAA
SEQ ID NO:257
AAGTGCAGGTCAGCGTCGACACCAACAAAAGCTGGCCCCTTCCCTCCTTCGTATATATC
TCGTCTCGTCTCGTCTCGTCTCTTCCGCAGAAACCCTCCCCGCAGCAGAACTCCGCCTC
GTCATACGAAAATCTCGCGTCGTTTGCAGAAAACAAGGATCGATTTGACTACCCAAAACG
CAGATAGATAGAGAGAGAGAGAGAGAGGATGAAGTTCTGCAAGAAATACCAGGAGTACA
TGCAAGGGCAAGAAGGGAAGAAGCTTCCCGGTCTTGGGTTCAAGAAGCTCAAGAAGATC
TTGAAGAGATGCAGGCGAAGAGACTCGCTTCATTCCCAGAAGGCCCTTCAAGCCGTCCA
AAATCCCCGCACTTGCCCTGCTCACTGTTCGGTGTGTGACGGAAGTTTCTTCCCCTCTCT
TCTCGAGGAGATGTCTGCTGTTTTAGGCTGTTTTAACAAGCAGGCGCAGAAGTTGCTTGA
GCTTCACCTGGCTTCGGGATTTCAGAAGTATTTGATGTGGTTCAAAGGCAAGCTTCGAG
GAAACCATGTTGCTTTAATTCAAGAAGGAAAAGATCTGGTTACTTATGCATTGATAAATGC
AATAGCCATTAGGAAGATATTGAAGAAGTATGACAAGATTCATCTCTCTACTCAAGGACA
AGCTTTCAAGTCACAAGTGCAAAGGATGCACATGGAGATCCTCCAGTCTCCATGGTTGT
GCGAGCTTATCGCCTTCCACATTAATGTACGGGAAACGAAGGCAAACTCTGGGAAGGGC
CATGCCCTTTTGAGGGTTGCTCTCTAGTTGTCGATGATGGCAAACCATCGCTCTCTTGT
GAACTCTTCGACTCTATCAAGCTAGATATCGACTTGACCTGCTCTATATGCTTGGACACG
GTGTTTGATTCGGTTTCTCTCACTTGTGGCCACATATACTGCTATATGTGTGCTTGCTCA
GCTGCATCTGTGACGATTGTTGATGGACTGAAAGCAGCAGAACCCAAGGAAAAATGTCC
TTTATGTCGAGAGGCCAGAGTTTTTGAAGGTGCCGTACATTTGGATGAACTCAATATATT
GCTCAGTAGAAGCTGTCCGGAGTATTGGGCGGAGAGGCTTCAAACAGAGAGAGTTGAA
AGGGTTCGGCAGGCTAAGGAGCACTGGGAATCCCAGTGCCGAGCGTTCATGGGCGTGG
AATAAGGCGGTTGCCCTTTAAGGAGATTGATTTCTCTCTTGCGTGTCAGCTATTTATACA
GTCCTGTTTATAGTCAAAAGGATCCGGCAGGCGAAGAAGCACTGGGAATCCCAGTGCCG
AGCGTTTATGGGCGCGGAATAAGGCGGTTTCCCCTTTAACGAGATTGATTTCTGTCTTGC
ATGTCAGCTATTATACAGTCCTGTTTATAGTCCTGTGATGTAATAAAAAGCTGCATTGCTG
AAAACTTTCTCGCCTTGAACGCCCCTCATTGTTAAGTCTTTGCGTTCTTGGGCCTGTTGC
TCTATTCCTTTTTGAACATATAAACATGTACGTTTTCAATCAAAAAAAAA
SEQ ID NO:258
AAGCAATGGTATCAACGCAGAGTACGCGGGAAAACCCCTCCATTTTCTTCCCTCCCCCT
CTTAAACCCTGGTTGCTTCCCGTGGTGCTCTCTCTCTCTCTCTCTAGGCAATTGGGCATG
GCGGCGGCGGCGGCGGCGTCCCTGCCGTTCAAGAAGAACTACAGGAGCTCCCAGGCG
CTGCAGCAGTTCTACGCCGGCGGTCCTTTCGCCGTCTCGTCCGACGGCTCCTTCATCGC
GTGCAACTGCGGCGACTCCATCAAGATCGTCGATTCCTCCAACGCCTCCCTCAGGCCCT
CCATCGACTGCGGCTCCGACACCATCACCGCCCTCTCCCTCAGCCCCGACGGCAAGTT
GCTCTTCTCCGCCGGCCACAGCCGCCAGATTAGGGTTTGGGACTTGTCCACCTCCACCT
GCCTGCGCTCCTGGAAGGGGCATGATGGTCCGGTGATGAGCATGGCCTGCCCCGTCTC
TGGGGGTTTGCTGGCGACGGGAGGAGCTGATAGGAAAGTCATGGTGTGGGACGTCGAC
GGTGGTTTCTGCACCCATTTCTTCAAAGGTCACGACGGCGTCGTTAGCACTGTCCTCTTC
CATCCCGACTCGAATCGCTCTCTTCTGTTCTCTGGAAGTGATGATGGAACTATACGGGTT
TGGGATCTCTTGGCAAAGAAGTGTGCTTCGACACTGAGAGGACATGATTCAACAGTCAC
TTCTCTGGCTTTTTCCGAGGATGGCTTGACATTGCTTGCCGCTGGAAGAGATAAGGTTGT
ATCTTTGTGGGACCTTCATAATTATGCCTGCAAGAAGACCATACCTATGTATGAGGTGCT
TGAATCTGTATGCGTGATACATAGTGGAACTGTTTTGGCTTCACAACTGGGGTTAGACGA
TCAGCTGAAAGTGACAAAAGAAAGTGCACAGAATATTCACTTTATTACTGTTGGTGAACG
CGGCATTTTACGGATATGGAAATCTGAAGGTTCAGTTTGCCTGTTTAAGCAAGAACATTC
TGATGTAACTGTCATCTCAGATGAGGACGACTCAAGGAGTGGCTTCACTGCTGCTGTCA
TGCTACCCTTGGATCAAGGATTGCTATGCGTGACAGCTGATCAACAGTTTTTATTCTATTA
TCCAGAGAAACATCCTGAAGGGATATTTTCGCTGACCCTATGTAGAAGACTCGTAGGCTA
CAATGAGGAAATAGTTGATATGAAGTTCCTAGGAGAGGAGGAAAATTTCCTTGCTGTTGC
TACTAATCTCGAACAGGTACGGGTTTATGAACTTGCTTCCATGTCATGTTCGTACGTCTT
GGCTGGTCATACTGAAACTGTCCTGTGCCTTGACACTTGTATTTCAAGTTCTGGAAGGAC
GCTTATTGTCACGGGAAGTAAGGACAACTCTGTTAGGTTGTGGGATTCAGAAAGCAGAC
ATTGCATTGGAGTTGGAGTAGGTCACATGGGCGCTGTAGGTGCAGTTGCTTTCTCAAGG
AAGAGACAAGATTTTTTTGTTAGTGGCAGCAGCGATCGTACACTTAAGGTTTGGAGCTTG
GATGGCATCTCAGAGGACGGTGTAGACTCAACAAATTTGAAAGCAAAAGCTGTTGTGGC
AGCTCATGATAAAGATATTAATTCTGTTGCTGTTGCACCAAATGACAGTTTAGTTTGTTCT
GGTTCTCAGGATCGCACAGCTTGTGTTTGGAGGCTTCCAGACCTGGTATCTGTAGTTGT
ACTTAAAGGGCACAAGAGGGGGATTTGGTCTGTAGAGTTTTCACCAGTTGATCAATGTGT
GCTCACTGCATCTGGCGATAAAACGGTGAAGATATGGGCCATATCTGATGGTTCTTGCTT
AAAGACCTTTGAAGGGCATGTCTCTAGTGTACTAAGAGCATCATTTCTCACCCGTGGGAC
ACAGTTTGTTTCTTGTGGTGCTGATGGCTTGGTAAAACTGTGGACTGTTAGGACAAATGA
ATGCATTGCTACATATGATCAACATAGCGATAAGGTATGGGCTTTGGCTGTTGGAAAGAA
GACCGAAATGCTTGCTACAGGTGGCAGTGATGCTGTTGTAAATCTCTGGTACGATTCAAC
TGCTTCTGACAAAGAGGATGCTTTTCGTAAAGAGGAAGAAGGTGTTTTGAAAGGTCAAGA
GTTAGAAAATGCAGTATCTGATGCTGACTACACCAAGGCAATCGAACTTGCTCTTGAACT
TCGGAGGCCTCATAAGTTATTTGAATTGTTCTCCGAACTTTGCAGGACGCGAGAAGTAG
GAGATCGTGTAGAGAGAATACTTTCTGCTCTCAGCGGCGAAGAGGTTTGTCTGCTTCTT
GAATATATCCGGGAGTGGAATGCGAAGCCAAAATTGTGCCACGTTGCCCAATCTGTGCT
ATCACAAGTCTTCAGAATTCTTTCTCCAACGGAGATAGTTGAGATTAAAGGCATCGGGGA
GCTCCTTGAAGGTCTCATTCCATATTCTCAGAGGCATTTCAGCAGGATAGACAGGCTTGT
TAGAAGCACGTATTTACTGGATTACACATTGACAGGAATGTCTGTTATTGAACCTGAAGC
AGACAGATCAGCAGTCAATGATGGGTCTCCAGATAAATCAGGCCTCGAGAAACTAGAGG
ATGGTCTTCTGGGAGAAAATGTTGGTGAGGAGAAAATCCAAAACAAGGAGGAGCTGGAA
AGTAGTGCATACAAGAAAAGAAAATTACCGAGATCTAAAGATCGTTCTAAGAAGAAATCA
AAGAATGTCGTGTACGCAGATGCAGCAGCAATCTCTTTCAGAGCTTGAATACCCGGGTC
CAGAGTTAAAAAGTGTTTATTATGACGAGCATTGCAGCGAGGACCATGTTGATGTCATTC
CCATCCTGCATTCTCAATGCCAG
SEQ ID NO:259
AAGCAGTGGTATCAACGCAGAGTACGCGGGGACCAGGAATTCAATCATCCCGCTCCTGT
AGACCGTTGCAAATCTTCCTCTTCACGCCGCGACTGCTGGAAAAGCCCTCTCTCCGCGA
GCCAAGAAGAGGGCTTTTCTGTCCAGTCTTGCACGAGGTCGAAGATCAAGGATCACCCA
GATACGGAGAGACAGAAATAAGCAAAACCCGGAAAATGGATTCGGCTCCGAGGAGGAA
GAGCGGCGGCATCAATCTCCCGTCCGGGATGTCCGAGACCTCGCTTCGGCTTGACGGG
TTCTCGGGTTCATCATCGTCTTTCCGGGCCATCTCCAACCTGACGTCCCCGTCCAAGTC
CTCTTCCATCAGCGACAGATTCATCCCTTGCAGATCCTCTTCGAGGCTCCACACTTTCGG
GCTCGTCGAAAGGGGGTCGCCGGTTAAGGAAGGAGGGAACGAGGCGTACTCGAGGTT
GTTGAAGGCTGAGCTCTTTGGGTCTGATTTCGGTTCTCTCTCTCCTGCAGGTCAAGGGT
CGCCCATGAGCCCCAGCAAGAACATGTTGCGGTTCAAGACCGAGAGTTCGGGCCCGAA
CTCGCCGTTTTCGCCGTCGATCCTGAGGCAGGACAGCGGTTTCTCCAGTGAAGCTTCGA
CGCCTCCTAAGCCACCAAGGAAGGTGCCCAAGACGCCGCATAAGGTTCTGGATGCCCC
ATCGCTGCAAGATGACTTCTATTTGAATCTGGTTGACTGGTCATCTCAGAATACGCTTGC
GGTCGGGTTGGGCACTTGTGTTTACCTATGGAGTGCATCAAACAGCAAAGTGACAAAGC
TCTGTGATCTGGGACCTAATGATGGTGTCTGTGCAGTACAGTGGACTCGAGAGGGTTCA
TACATTTCAATTGGTACCAGCCTCGGTCAAGTTCAGATATGGGATGGAACTCAGTGTAAG
CGAGTTCGGACAATGGGTGGTCATCAGACTAGAACAGGGGTCTTGGCATGGAATTCGC
GCATATTAGCTTCTGGTAGCAGGGACCGGGTCATACTTCAGCATGATCTTCGCGTTCCA
AATGAATTCATAGGCAAGCTTGTTGGCCACAAATCTGAGGTATGTGGATTGAAATGGTCC
CATGATGACAGAGAACTCGCATCAGGTGGGAACGACAATCAGGTAATTCTCGTTTTCTG
GTAAGTCGTTTTATCTTCCATATTAAACTGAGTATGGAATATTAAACAAGCTTATTATTGGT
TGATCAGCTACTGGTGTGGAATCAGCACTCTCAGCAACCAGTTCTGAAGCTTACTGAGC
ATACAGCAGCAGTGAAAGCCATAGCATGGTCTCCCCATCAGAATGGGCTACTTGCATCA
GGAGGAGGAACTGCTGATCGATGCATACGCTTCTGGAACACCACCAATGGCCATCAGAC
GAGCAGTGTCGACACTGGGAGTCAGGTATGTAATCTGGCTTGGAGTAAGAATGTCAATG
AGTTGGTGAGCACTCATGGGTATTCACAGAATCAAATAATGGTGTGGAAATATCCATCCA
TGGCAAAGGTTGCAACTCTAACTGGCCATAGTCTTCGAGTACTTTATCTTGCCATGTCTC
CAGATGGCCAGACAATAGTGACAGGAGCTGGTGACGAGACCTTGCGGTTTTGGAATGTA
TTCCCATCGGCTAAGGCACCGGCACCGGTCAAGGATACGGGTCTCTGGTCTTTGGGGC
GAACCCACATCAGATAAAAGCTGCATGCTTGTTCAAGCTAGCTCGGTCTCTCTTGAAAAG
AAACATGTTTGATTTCATTACATGTAAAAAAATCTTGCACGGTCAACTGACATTACACTTTT
TGTAAATTGTGAGTAGAATAGGAGAAACTTTTGTACAAGATTAATACGTGTGGCATAATAA
GATTTGTAAATGATCTTTGATCAAAAAAAAAAA
SEQ ID NO:260
CTGCCATGGCGGTACCTCGTTGAAACGCCGCAGGACGGAGAAGAAGAAGCTGCAAGAC
AACTCTCTCTCTCTCTCTCTCTACCTTGAGTGAGCTCCGGCTATGGAGGACGAGGCGGA
GATATACGACGGCGTGAGAGCCCAGTTCCCCCTCACCTTCGGCAAGCAGTCCAAGCCC
CAGACTTCCCTCGAATCCGTCCACAGCGCCACCCGCCGCGGCGGCCCCGCCCCCGCC
CCCGCCCCCGCCTCCTCCTCCTCTCTTCCCTCTACTACCTCCCCCTCCGCTGCCGGCGG
CGCCGGCAAGAGCAGCGGCCTCCCTTCTCTCTCCTCCTCCTCCACAGCCTGGCTCGAG
GGGCTCCGGGCCGGAAACCCTAGGGCCGGCCGCGAGGCCGGAATCGGCTCCCGCGG
TGGCGACGGTGAGGACGGCGGTCGTGCGATGATAGGTCCACCTCGCCCGCCGCCGGG
ATTTAGCGCCAATGATGATGGCGGGGGCGAGGACGACGACGACGACGGCGACGGGGT
TATGGTTGGCCCGCCGCCGCCTCCTCCTGGGAACCTTGGAGATGGTGATGACGATGAG
GAGGAGGAAGAGGCGATGATTGGGCCGCCGAGGCCGCCGGTTGTGGATTCCGACGAG
GAGGAAGAAGAGGAAGAAGAGGAGAATCGGTATCGGTTACCTCTCAGCAATGAGATCGT
GCTCAAAGGCCATAATAAGATCGTATCTGCTCTTGCGGTTGATCCCACTGGTTCTCGGGT
TCTATCCGGCAGTTATGATTACACTGTACGGATGTTCGACTTTCAAGGAATGAATTCTCG
CCTATCATCATTTAGAGATTTTGAGCCAGTTGAAGGTCATCAAGTCCGTAACTTAAGCTG
GAGTCCGACGGCAGACCGATTTCTGTGCGTAACTGGCTCTGCTCAAGCTAAGATCTATG
ATCGTGATGGGCTTACATTAGGTGAATTTGTGAAGGGAGACATGTATATCCGTGATTTGA
AAAATACTAAGGGCCATATAACTGGATTGACTTGGGGAGAGTGGCACCCTAAAACAAAG
GAGACGATTTTAACATCATCTGAGGATGGATCTCTTCGCATATGGGATGTGAATGACTTC
AAAAGTCAAAAGCAGGTTATTAAACCAAAGCTTGCAAGGCCTGGAAGAGTACCTGTGAC
AACATGCACTTGGGATCGTGAGGGCAAATGCATTGCAGGTGGTATTGGAGATGGTTCAA
TACAGATCTGGAACCTTAAACCCGGATGGGGAAGCAGGCCGGACATTCATGTCGAGCAA
GCTCATGCAGATGATATAACTGGGCTTAAGTTTTCCAGTGATGGGAAAATTTTACTGACA
AGAAGCTTTGATGATTCACTTAAGGTGTGGGATTTGCGCCTTATGAAAAATCCCCTCAAA
GTTTTTGAAGATCTTCCAAATCACTATGCTCAGACAAATATTGCATGTAGTCCTGACGAAC
AGCTATTCTTGACTGGAACTTCTGTTGAAAGGGAGTCTACAATTGGAGGCTTGTTGTGTT
TCTTTGATCGATCTAAACTTGAGCTTGTATCAAGAATTGGGATTTCTCCTACTTGTAGTGT
CGTGCAGTGTGCCTGGCACCCCAGGCTGAACCAGATCTTTGCAACTTCTGGGGATAAAA
GCCAAGGAGGAACTCACGTTCTTTATGATCCTACTCTCAGCGAGAGAGGAGCTCTTGTTT
GTGTTGCTCGTGCACCAAGGAAAAAATCTGTGGATGACTTTGAGTTAAAGCCTGTCATTC
ACAACCCCCATGCATTGCCTTTGTTCAGGGATCAGCCAAGCCGTAAGCGTCAACGTGAA
AAGATCCTCAAGGATCCATTAAAATCCCACAAACCAGAGCTTCCCATGAATGGACCTGGA
CATGGTGGAAGAGTTGGTGCAAGCAAGGGAAGTTTATTGACCCAGTATCTTCTTAAGCAA
GGAGGTATGATTAAGGAGACGTGGATGGATGAAGATCCCAGAGAAGCTATACTGAAGCA
TGCTGACGCTGCAGAAAAAAATCCAAAGTTCACCCGTGCATATGCTGAAACCCAGCCAG
ACCCTGTTTTTGCAAAATCTGATTCTGAAGATGAAGATAAATGATGTCCTATCCTGCAGTG
CTTCTTTTGGATCCTTGCTGCTGTCATTGGACATGTGAGGAAGAAAACTGGGATCGGACT
ACTGATACAGCAAGCAAATCATCTGTTGACCAGCTGGAGTGTCAACAAAGAATGGGTTTT
CTTTACTCAATTAATTTGACTGGTCTGCTGGGTTCAATTAATCGAGACATCAACTCCGAGT
TCAAAATCAGGAATTCTCTAGGAAAGTATCTCTTGCATTTTGCTGGCCTTCTTCATCATAT
GATGTGCAGTTTACATTATTATGGTTCGAGTATTATTTAGCTGCCCTATCTTAAGTCATGG
AGGCTTTACTTTGCTGAGAAATGTACCTTTAGCGATGTGATGCAATAGCAGTCTTTCATAT
TTATGTGGTGTAAGAGATGGAATCAATTGTTTCGCTTCCTCATCAGATCAATTAAGTTCAT
GCAACTAAGCTTTTCACTGTAAAAAAAAAA
SEQ ID NO:261
MGDGSLGSGGRGNSGGGGGGGSRPEWLQQYDLIGKIGEGTYGLVFLARIKHPSTNRGKYI
AIKKFKQSKDGDGVSPTAIREIMLLREISHENVVKLVNVHINPVDMSLYLAFDYADHDLYEIIRH
HRDKVNQAINPYTVKSLLWQLLNGLNYLHSNWIIHRDLKPSNILVMGEGEEQGVVKIADFGLA
RVYQAPLKPLSDNGVVVTIWYRAPELLLGAKHYTSAVDMWAVGCIFAELLTLKPLFQGQEVK
ANPNPFQLDQLDKIFKVLGHPTQEKWPMLVNLPHWQSDVQHIQRHKYDDNALGNVVRLSSK
NATFDLLSKMLEYDPQKRITAAQALEHEYFRMEPLPGRNALVPSSPGDKVNYPTRPVDTTTD
IEGTTSLQPSQSASSGNAVPGNMPGPHVVTNRPMPRPMHMVGMQRVPASGMAGYNLNPS
GMGGGMNPSGIPMQRGVANQAQQSRRKDPGMGMGGYPPQQKQRRF
SEQ ID NO:262
MEKYQQLAKIGEGTYGIVYKAKDKKSGELLALKKIRLEAEDEGIPSTAIREISLLKQLQHPNIVR
LYDVVHTEKKLTLVFEFLDQDLKKYLDACGDNGLEPYTVKSFLYQLLQGIAFCHEHRVLHRDL
KPQNLLINMEGELKLADFGLARAFGIPVRNYTHEVVTLWYRAPDVLMGSRKYSTQVDIWSVG
CIFAEMVNGRPLFPGSSEQDQLLRIFKTLGTPSLKTWPGMAELPDFKDNFPKYVVQSFKKIC
PKKLDKTGLDLLSRMLQYDPAKRISAEQAMGHPYFKDLKLRKPKAAGPGP
SEQ ID NO:263
MDQYEKIEKIGEGTYGVVYKAIDRSTNKTIALKKIRLEQEDEGVPSTAIREISLLKEMQHGNIVK
LQDVVHSERRLYLVFEYLDLDLKKHMDSCPEFSKDTHTIKMFLYQILRGISYCHSHRVLHRDL
KPQNLLLDRRTNSLKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSV
GCIFAEMVNRRPLFPGDSEIDELFKIFRIMGTPNEDSWPGVTSLPDFKSTFPKWASQDLKTVT
PTVDPAGIDLLSKMLCMDPRRRITAKVALEHEYFKDVGVIP
SEQ ID NO:264
MVMKSKLDKYEKLEKLGEGTYGVVYKAQDKTTKEIYALKKIRLESEDEGIPSTAIREIALLKEL
QHPNVVRIHDVIHTNKKLILVFEFVDYDLKKFLHNFDKGIDPKIVKSLLYQLVRGVAHCHQQKV
LHRDLKPQNLLVSQEGILKLGDFGLARAFGIPVKNYTNEVVTLWYRAPDILLGSKNYSTSVDI
WSIGCIFVEMLNQKPLFPGSSEQDQLKKIFKIMGTPDATKWPGIAELPDWKPENFEKYPGEP
LNKVCPKMDPDGLDLLDKMLKCNPSERIAAKNAMSHPYFKDIPDNLKKLYN
SEQ ID NO:265
MDQYEKVEKIGEGTYGVVYKAIDRLTNETIALKKIRLEQEDEGVPSTAIREISLLKEMQHGNIV
RLQDVVHSENRLYLVFEYLDLDLKKHMDSSPDFAKDPRLVKIFLYQILRGIAYCHSHRVLHRD
LKPQNLLIDRRTNALKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGSRHYSTPVDVWSV
GCIFAEMVNQRPLFPGDSEIDELFKIFRILGTPNEDTWPGVTALPDFKSAFPKWPAKNLQDM
VPGLNSAGIDLLSKMLCLDPSKRITARSALEHEYFKDIGFVP
SEQ ID NO:266
MEKYEKLEKVGEGTYGKVYKAKDKATGQLVALKKTRLEMDEEGVPPTALREVSLLQLLSQSL
YVVRLLSVEHVDGGSKRKPMLYLVFEYLDTDLKKFIDSHRKGPNPRPVPAATVQNFLYQLLK
GVAHCHSHGVLHRDLKPQNLLVDKEKGILKIADLGLGRAFTVPLKSYTHEVVTLWYRAPEVLL
GSAHYSIGVDMWSVGCIFAEMVRRQALFPGDSEFQQLLHIFRLLGTPTEEQWPGVTTLRDW
HVYPQWEPQNLARAVPSLGPDGVDLLSKMLKYDPAERISAKAALDHPFFDSLDKSQF
SEQ ID NO:267
MERPATAAVSAMEAFEKLEKVGEGTYGKVYRAREKATGKIVALKKTRLHEDEEGVPPTTLRE
ISILRMLSRDPHIVRLMDVKQGQNKEGKTVLYLVFEYMETDLKKYIRGFRSSGESIPVNIVKSL
MYQLCKGVAFCHGHGVLHRDLKPHNLLMDKKTLTLKIADLGLARAFTVPIKKYTHEILTLWYR
APEVLLGATHYSTAVDMWSVGCIFAELVTKQALFPGDSELQQLLHIFRLLGTPNEKMWPGVS
SLMNWHEYPQWKPQSLSTAVPNLDKDGLDLLSQMLHYEPSRRISAKAAMEHPYFDDVNKT
CL
SEQ ID NO:268
MGCVLGREVSSGIVTESKGRDSSEVETSKRDDSVAAKVEGEGKAEEVRTEETQKKEKVEDD
QQSREQRRRSKPSTKLGNLPKHIRGEQVAAGWPSWLSDICGEALNGVVIPRRANTFEKIDKIG
QGTYSNVYKAKDLLTGKIVALKKVRFDNLEPESVRFMAREILILRHLDHPNVVKLEGLVTSRM
SCSLYLVFEYMEHDLAGLAASPAIKFTEPQVKCYMHQLLSGLEHCHNRRVLHRDIKGSNLLID
NGGVLKIGDFGLASFYDPDHKHRMTSRVVTLWYRPPELLLGANDYGVGIDLWSAGCILAELL
AGKPIMPGRTEVEQLHKIYKLCGSPSEEYWKKYKLPNATLFKPREPYRRCIRETFKDFPPSSL
PLIETLLAIDPAERGTATDALQSEFFRTEPYACEPSSLPQYPPSKEMDAKKRDDEARRLRAAS
KGQADGSKKERTRDRRVRAVPAPEANAELQHNIDRRRLISHANAKSKSEKFPPPHQDGALG
FPLGASHRFDPAVVPPDVPFTSTSFTSSKEHDQTWSGPLVDPPGAPRRKKHSAGGQRESS
KLSMGTNKGRRADSHLKAYESKSIA
SEQ ID NO:269
MYSKSSAVDDSRESPKDRVSSSRRLSEVKTSRLDSSRRENGFRARDKVGDVSVMLIDKKVN
GSARFCDDQIEKKSDRLQKQRRERAEAAAAADHPGAGRVPKAVEGEQVAAGWPVWLSAV
AGEAIKGWLPRRADTFEKLDKIGQGTYSSVYKARDVTNNKIVALKRVRFDNLDTESVKFMAR
EIHILRMLDHPNVIKLEGUTSRMSCSLYLVFEYMEHDLTGLASRPDVKFSEPQLKCYMKQLLS
GLDHCHKHGVLHRDIKGSNLLIDNNGILKIADFGLASVFDPHQTAPLTSRVVTLWYRPPELLL
GASRYGVEVDLWSTGCILGELYTGKPILPGKTEVEQLHKIFKLCGSPSDDYWRRLHLPHAAV
FKPPQPYRRCVAEIFKELPPVALGLLETLISVDPSQRGTAAFALRSEFFTASPLPCDPSSLPKY
PPSKEIDMKLREEEARRRGAAGGKNELEKRGTKDSRTNSAYYPNAGQLQVKQCHSNANGR
SEIFGPYQEKTVSGFLVAPPKQARVSKETRKDYAEQPDRASFSGPLVPGPGFSKAGKELGH
SITVSRNTNLSTLSSLVTSRTGDNKQKSGPLVSESANQASRYSGPIREMEPARKQDRRSHVR
TNIDYRSREDGNSSTKEPALYGRGSAGNKIYVSGPLLVSSNNVDQMLKEHDRRIQEHARRA
RFDKARVGNNHPQAAVDSKLVSVHDAG
SEQ ID NO:270
MGCIPTIISDGRRRSAAPDKRRPRPRRSSSEGEAPPHATAAGSEGGESARGAPGKERPEPA
PRFVVRSPQGWPPWLVAAVGHAIGEFVPRCADSFRKLAKIGEGTYSNVYKARDLVTGKTVA
LKKVRFDNLEAESIKFMAREILVLTRLNHPNVIKLEGPVTSRMSSGLYLAFEYMEHDLSGIAAR
QNGKFTEPQVKCFMRQLLSGLEHCHNHDVLHRDIKCSNLLIDNEGNLKIADFGLATFYDPER
KQVMTNRVVTLWYRAPELLLGATSYGIGIDLWSAGCILAELLYGKPIMPGRTEVEQLHKIFKL
CGSPSEAYWNKFKLPNANIFKPPQPYARCIAETFKDFPPSALPLLETLLSIDPDERGTATTALN
SEFFAAEPHACEPSSLPKYPPSKEMDLKLIKEKTRRDSSKRPSAIHGSRRDGIHDRAGRVIPA
PEATAENQATLHRPRAMKKANPMSRSEKFPPAHMDGWGSSANAWLSGPASNAAPDSRR
HRSLNQNPSSSVGKASTGSSTTQETLKVAPELLQVGSSSLHPCHRMLVYGSNLTIRSK
SEQ ID NO:271
MGCICAKQADRGPASPGSGILTGAGTGTGTRSSKIPSGLFEFEKSGVKEHGGRSGELRKLE
EKGSLSKRLRLELGFSHRYVEAEQAAAGWPSWLTAVAGDAIQGLVPLKADSFEKLEKIGQGT
YSSVFRARELANGRMVALKKVRFDNFQPESIQFMAREISILRRLDHPNIMKLEGIITSRMSNSI
YLVFEYMEHDLYGLISSPQVKFSDAQVKCYMKQLLSGIEHCHQHGVIHRDVKSSNILVNNEGI
LRIGDFGLANILNPKDRQQLTSHVVTLWYRPPELLMGSTSYGVTVDLWSVGCVFAELMFRKP
ILRGRTEVEQLHKIFKLCGSPPDGYWKMCKVPQATMFRPRHAYECTLRERCKGIATSAMKL
METFLSIEPHKRGTASSALISEYFRTVPYACDPSSLPKYPPNKEIDAKHREEARRKKARSRVR
EAEVGKRPTRIHRASQEQGFSSNIAPKEKRSYA
SEQ ID NO:272
MAVAAPGHLNVNESPSWGSRSVDCFEKLEQIGEGTYGQVYMAKEKKTGEIVALKKIRMDNE
REGFPITAIREIKILKKLHHENVIKLKEIVTSPGPEKDEQGRPEGNKYKGGIYMVFEYMDHDLT
GLADRPGMRFSVPQIKCYMRQLLTGLHYCHINQVLHRDIKGSNLLIDNEGNLKLADFGLAFSF
SNDHNANLTNRVITLWYRPPELLLGATKYGPAVDMWSVGCIFAELLHGKPIFPGKDEPEQLN
KIFELCGAPDEINWPGVSKIPWYNNFKPTRPMKRRLREVFRHFDRHALELLERMLTLDPSQRI
SAKDALDAEYFWADPLPCDPKSLPKYESSHEFQTKKKRQQQRQHEETAKRQKLQHPPQHP
RLPPVQQSGQAHAQMRPGPNQLMHGSQPPVATGPPGHHYGKPRGPSGGAGRYPSSGNP
GGGYNHPSRGGQGGSGGYNSGPYPPQGRAPPYGSSGMPGAGPRGGGGNNYGVGPSNY
PQGGGGPYGGSGAGRGSNMMGGNRNQQYGWQQ
SEQ ID NO:273
MGCICTKGILPAHYRIKDGGLKLSKSSKRSVGSLRRDELAVSANGGGNDAADRLISSPHEVE
NEVEDRKNVDFNEKLSKSLQRRATMDVASGGHTQAQLKVGKVGGFPLGERGAQVVAGWP
SWLTAVAGEAINGWVPRRADSFEKLEKIGQGTYSSVYRARDLETNTIVALKKVRFANMDPES
VRFMAREIIIMRKLDHPNVMKLEGLITSRVSGSLYLVFEYMDHDLAGLAATPSIKLTESQIKCY
MQQLLRGLEYCHSHGVLHRDIKGSNLLVDNNGNLKIGDFGLATFFRTNQKQPLTSRVVTLWY
RPPELLLGSSDYGASVDLWSSGCILAELFAGKPIMPGRTEVEQLHIIFKLCGSPSEEYWKKS
KLPHATIFKPQQPYKRCLLETFKDFPSSALGLLDVLLAVEPECRGTASSALQNEFFTSNPLPS
DPSSLPKYPSSKEFDARLRDEEARKHKATAGKARGLESIRKGSKESKVVPTSNANADLKASI
QKRQEQSNPRSTGEKPGGTTQNNFILSGQSAKPSLNGSTQIGNANEVEALIVPDRELDSPR
GGAELRRQRSFMQRRASQLSRFSNSVAVGGDSHLDCSREKGANTQWRDEGFVARCSHPD
GGELAGKHDWSHHLLHRPISLFKKGGEHSRRDSIASYSPKKGRIHYSGPLLPSGDNLDEMLK
EHERQIQNAVRKARLDKVKTKREYADHGQTESLLCWANGR
SEQ ID NO:274
MDPDPSPDPDPPKSWSIHTRREIIARYEILERVGSGAYSDVYRGRRLSDGLAVALKEVHDYQ
SAFREIEALQILRGSPHVVLLHEYFWREDEDAVLVLEFLRSDLAAVIADASRRPRDGGGGGA
AALRAGEVKRWMLQVLEGVDACHRNSIVHRDLKPGNLLISEEGVLKIADFGQARILLDDGNV
APDYEPESFEERSSEQADILQQPETMEADTTCPEGQEQGAITREAYLREVDEFKAKNPRHEI
DKETSIFDGDTSCLATCTTSDIGEDPFKGSYVYGAEEAGEDAQGCLTSCVGTRWFRAPELLY
GSTDYGLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIFNVLGNLSEEVWPGCTKLPDYRTI
SFCKIENPIGLESCLPNCSSDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVP
QSKNSHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP
SEQ ID NO:275
MDPDPSPSPDPPKSWSIHTRREIIARYEILERVGSGAYSDVYRGRRLSDGLAVALKEVHDYQ
SAFREIEALQILRGSPHVVLLHEYFWREDEDAVLVLEFLRSDLAAVIADASRRPRGGGVAPLR
AGEGKRWMLQVLEGVDACHRNSIVHRDLKPGNLLISEEGVLKIADFGQARILLDDGNVAPDY
EPESFEERSSEQADILQQPETMEADTTCPEGQEQGAITREAYLREVDEFKAKNPRHEIDKET
SIYDGDTSCLATCTTSDIGEDPFKGSYVYGAEEAGEDAQGSLTSCVGTRWFRAPELLYGSTD
YGLEVDLWSLGCIFAELLTLEPLFPGISDIDQLSRIFNVLGNLSEEVWPGCTKLPDYRTISFCKI
ENPIGLESCLPNCSSDEVSLVRRLLCYDPAARATPMELLQDKYFTEEPLPVPISALQVPQSKN
SHDEDSAGGWYDYNDMDSDSDFEDFGPLKFTPTSTGFSIQFP
SEQ ID NO:276
MSNQHRRSSFSSSTTSSLAKRHASSSSSSLENAGKAFAAAAVPSHLAKKRAPLGNLTNLKA
GDGNSRSSSAPSTLVANATKLAKTRKGSSTSSSIMGLSGSALPRYASTKPSGVLPSVNPSIP
RIEIAVDPMSCSMVVSPSRSDMQSVSLDESMSTCESFKSPDVEYIDNEDVSAVDSIDRKTFS
NLYISDAAAKTAVNICERDVLMEMETDEKIVNVDDNYSDPQLCATIACDIYQHLRASEAKKRP
STDFMDRVQKDITASMRAILIDWLVEVAEEYRLVPDTLYLTVNYIDRYLSGNVMNRQRLQLLG
VACMMIAAKYEEICAPQVEEFCYITDNTYFKEEVLQMESSVLNYLKFEMTAPTVKCFLRRFVR
AAQGVNEVPSLQLECMANYIAELSLLEYDMLCYAPSLVAASAIFLAKFVITPSKRPWDPTLQH
YTLYQPSDLGNCVKDLHRLCFNNHGSTLPAIREKYSQHKYKYVAKKYCPPSIPPEFFHNLVY
SEQ ID NO:277
MNKENAVGTKSEAPTIRITRSRSKALGTSTGMLPSSRPSFKQEQKRTVRANAKRSASDENK
GTMVGNASKQHKKRTVLNDVTNIFCENSYSNCLNAAKAQTSRQGRKWSMKKDRDVHQSG
AVQIMQEDVQAQFVEESSKIKVAESMEITIPDKWAKRENSEHSISMKDTVAESSRKPQEFICG
EKSAALVQPSIVDIDSKLEDPQACTPYALDIYNYKRSTELERRPSTIYMETLQKDVTPNMRGIL
VDWLVEVSEEYKLVPDTLYLTVNLIDRSLSQKFIEKQRLQLLGVTCMLIASKYEEICPPRVEEF
CFITDNTYTSLEVLKMESRVLNLLHFQLSVPTVKTFLRRFVQAAQVSSEVPSVELEYLANYLA
ELTLVEYSFLKFLPSLMAASAVLLARWTLNQSDNPWNLTLEHYTKYKASELKAAVLALEDLQL
NTSGSTLNAIREKYRQQKVNYSLLHSKANHEIL
SEQ ID NO:278
MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAPPYPCAVNKRV
LSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDD
DKMADDFPVPMFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYIDDLY
MFYQKAEASSCVPPNYMDRQQDINERMRGILIDWLIEVHYKFELMDETLYLTVNLIDRFLAVQ
PVVKKKLQLVGVTAMLLACKYEEVSVPVVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSVP
TPYVFMRRFLKAAQSDKKLELLSFFIIELSLVEYDMLKFPPSLLAASAIYTALSTITRTKQWSTT
CEWHTSYSEEQLLECARLMVTFHQRAGSGKLTGVHRKYSTSKFGHAARTEPANFLLDFRL
SEQ ID NO:279
MASRPIVPVQARGEAAIGGGAGKAAIGGGAGKQQKKNGAAEGRNRKALGDIGNLVTVRGIE
GKVQPHRPITRSFCAQLLANAQAAAAAENNKKQAVVNVNGAPSILDVPGAGKRAEPAAAAA
AAVAKAAQKKVVKPKQKAEVIDLTSDSERAIEAKKKQQHHEPTKKEGEKSSRRNMPTLTSVL
TARSKAACGMTKKPKEKVVDIDAGDAHNELAAFEYIEDIYTYYKEAENESLPRNYMSSQPEIN
EKMRAILVDWLIEIHNKFDLMPETLYLTINIIDRFLSVKAVPRRELQLLGMGALFTASKYEEIVVA
PEVNDLVCIADRAYSHEQVLAMEKTILGKLEWTLTVPTHYVFLVRFIKASLGDRKLENMVYFL
AELGVMNYATLTYCPSMVAASAVYAARCTLGLTPLWNDTLKLHTGFSESQLMDCARLLVGY
HAKAKENKLQVVYKKYSSSQREGVALIPPAKALLCEGGGLSSSSSLASSS
SEQ ID NO:280
MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKR
GHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDV
EDCQPSSENQPVPMFLEIPESRLDDDMEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIE
DIYANYRRTENCSCVSANYMAQQADINEKMRSILIDWLIEVHDKFDLMHETLFLTVNLIDRFLA
RQSVVRKKLQLVGLVAMLLACKYEEVSVPVVGDLILISDKAYTRKEVLEMESLMLNSLQFNM
SVPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYEMVKFPPSLLAAAAIFTAQCTLYGFKQ
WTKTCEWHSNYTEDQLLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEPANFLL
GEMKNP
SEQ ID NO:281
MGLPDENNAALSKPTNLQVGGLEIGGRKFGQEIRQTRRALSVINQNLVGDRAYPCHVVNKR
GHSKRDAVCGKDQVDPVHRPLTRKFAAQTASTQQHCIEEAKKPRTAVQERNEFGDCIFVDV
EDCQPSSENQPVPMFLEIPESRLDDDMEEVEMEDIVEEEEEEPIMDIDGRDKKNPLAVVDYIE
DIYANYRRTENCSCVSANYMAQQADINEKMRSILIDWLIEVHDKFDLMHETLFLTVNLIDRFLA
RQSVVRKKLQLVGLVAMLLACKYEEVSVPVVGDLILISDKAYTRKEVLEMEKLMLNSLQFNM
SVPTPYVFMRRFLKAAESDKKLEVLSFFLIELSLVEYEMVKFPPSLLAAAAIFTAQCTLYGFKQ
WTKTCEWHSNYTEDQLLECARMMVGFHQKAATGKLTGVHRKYGTSKFGYTSKCEAANFLL
GEMKNP
SEQ ID NO:282
MAMVQRQGHDPSSPQEQEDGPSSFLSDDALYCEEGRFEEDDGGGGGQVDGIPLFPSQPA
DRQQDSPWADEDGEEKEEEEAELQSLFSKERGARPELAKDDGGAVAARREAVEWMLMVR
GVYGFSALTAVLAVDYLDRFLAGFRLQRDNRPWMTQLVAVACLALAAKVEETDVPLLVELQE
VGDARYVFEAKTVQRMELLVLSTLGWEMHPVTPLSFVHHVARRLGASPHHGEFTHWAFLR
RCERLLVAAVSDARSLKHLPSVLAAAAMLRVIEEVEPFRSSEYKAQLLSALHMSQEMVEDCC
RFILGIAETAGDAVTSSLDSFLKRKRRCGHLSPRSPSGVIDASFSCDDESNDSWATDPPSDP
DDNDDLNPLPKKSRSSSPSSSPSSVPDKVLDLPFMNRIFEGIVNGSPI
SEQ ID NO:283
MEASYQPHHHGHLRQHDPSSSQQEEQVPFDALYCSEEHWGEEDEEEGLASDGLLSEERD
HRLLSPRALLDQDLLWEDEELASLFSKEEPGGMRLNLENDPSLADARREAVEWIMRVHAHY
AFSALTALLAVNYWDRFTCSFALQEDKPWMTQLSAVACLSLAAKVEETQVPLLIDFQVEDSS
PVFEAKNIQRMELLVLSSLEWKMNPVTPLSFLDYMTRRLGLTGHLCWEFLRRCENVLLSVIS
DCRFTCYLPSVIAASTMLHVINGLKPRLDVEDQTQLLGILAMGMDKIDACYKLIDDDHALRSQ
RYSHNKRKFGSVPGSPRGVMELCFSSDGSNDSWSVAASVSSSPEPHSKKSRAGEEAEDRL
LRGLEGEEDDPASADIFSFPH
SEQ ID NO:284
MALQEEDTRRHYPTAPPFSPDGLYCEDETFGEDLADNACEYAGGGARDGLCEIKDPTLPPS
LLGQDLFWEDGELASLVSRETGTHPCWDELISDGSVALARKDAVGWILRVHGHYGFRPLTA
MLAVNYLDRFFLSRSYQRDRPWISQLVAVACLSVAAKVEETQVPILLDLQVANAKFVFESRTI
QRMELLLMSTLDWRMNSVTPISFFDHILRRFGLTTNLHRQFFWMCERLLLSVVADVRLASFL
PSVVATAAMLYVNKEIEPCICSEFLDQLLSLLKINEDRVNECYELILELSIDHPEILNYKHKRKR
GSVPSSPSGVIDTSFSCDSSNDSWGVASSVSSSLEPRFKRSRFQDQQMGLPSVNVSSMGV
LNSSY
SEQ ID NO:285
MGQIQYSEKYFDDTYEYRHVVLPPDVAKLLPKNRLLSENEWRAIGVQQSRGWVHYAIHRPE
PHIMLFRRPLNYQQQQENQAQQNMLAK
SEQ ID NO:286
MGSIDPPKAEQNGTAAAADPGQKPGAGDAMPPPPPVKHSNGTAAEPDVATKRRRMSVL
PLEVGTRVMCRWRDGKYHPVKVIERRKLNPGDPNDYEYYVHYTEFNRRLDEWVKLEQLDL
NSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGR
YEIETWYFSPFPPEYNDCSKLYFCEFCLNFMKRKEQLQRHMKKCDLKHPPGDEIYRSGTLS
MFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLLSYKGYWTRVLLDI
LKKHKANISIKELSDMTAIKADDILNTLQSLDLIQYRKGQHVICADPKVLDRHLKAAGRGGLEV
DVSKLIWTPYREQG
SEQ ID NO:287
MAQKHSTAPDPAAEPKKRRRVGFSGIDAGVDPNGCFKVYLVSREEEVGAPDSFCLDPVDLS
HFFEEEDGKIYGYEGLKISVWVSCVSFHSYAEIAFESKSDGGKGITDLNTALKNMFGETLVDN
KDDFLQTFSKETQFIRSTVSAGEILKHKHSDDHVNDSVSNLKVGSDVEAVRMLMGDMTAGH
LYSRLVPLVLLLVDGSSPIDVTDSSWELYLLIQKTSDQQGNFHDRLLGFAAVYRFYHYPDSSR
LRLGQILVLPLYQRKGYGRYLLEVLNNVAIADDVYDFTIEEPVDNLQHLRTCIDVQRLLSFDKV
QQAVNSTVSQLKQGKLSKKTYIPRLLPPPSVVEDARKRFKINKKQFLQCWEILVYLGLDPADK
SIQDYFSVISNRVRADILGKDSETAGKKVIEVPSDFDPEMSFVMHRAKAGGEANGIQVEDNQ
NKQEEQLQQLIDERLKDIKLIAEKVTQK
SEQ ID NO:288
MAQKHSTAPDPAAEPKKRRRVGFSGIDAGVDPNGCFKVYLVSREEEVGAPDSFCLDPVDLS
HFFEEEDGKIYGYEGLKISVWVSCVSFHSYAEIAFESKSDGGKGITDLNTALKNMFGETLVDN
KDDFLQTFSKETQFIRSTVSAGEILKHKHSDGHVNDSVSNLKVGSDVEAVRMLMGDMTAGH
LYSRLVPLVLLLVDGSNPIDVTDSSWELYLLIQKTSDQQGNFHDRLLGFAAVYRFYHYPDSLR
LRLGQILVLPLYQRKGYGHYLLEVLNNVAIADDVYDFTIEEPVDNLQHLRTCIDVQRLLSFDKV
QQAVNSTVSQLKQGKLSKKTYIPRLLPPPSVVEDARKRFKINKKQFLQCWEILVYLGLDPADK
SIQDYFSVISNRVRADILGKDSETAGKKVIEVPSDFDPEMSFVLHRAKAGGETNGIQVEDNQN
KQEEQLQQLIDERLKDIKLIAQKVSRK
SEQ ID NO:289
MALPMEFWGVEVKAGQPLKVNPGNAKILHLSQASLGECKSSKGNESVPLHVKFGDQKLVLG
TLSTENFPQLAFDLVFEKEFELSHNWKSGSVYFCGYKSVVDDDDEFSDLESDSEEEDLPMI
GVENGKVAAQASAKTATASANASKVESSGKQKARIPQPMKVDEDDSDEDDDDEDEDESDE
EGVDGEADSDEEEDESDEEETPKKAEIGKKRAADSATKTPVPAKKSKLPTPQKTDGKKGGH
TATPHPAKQAGKNPANSANKSQSPKSAGQVSCKSCSKTFNSDGALQSHSKAKHGGK
SEQ ID NO:290
MEFWGVEVKAGQPLKVNPGNAKILHLSQASLGECKSSKGNESVPLHVKFGDQKLVLGTLST
ENFPQLAFDLVFEKEFELSHNWKSGSVYFCGYKSVVHDDDDEFSDLESDSEEEDLPMIGVE
NGKVAAQASAKTATASANASKVESSGKQKASIPQPMKVDEDDSDEDDDEDDDDEDESDEG
VDGEADSDEEEDESDEEETPKKAEIGKKRAADSATKTPVPAKKSKLPTPQKTDGKKGGHTA
TPHPAKQAGKNPANSANKSQSPKSAGQVSCKSCSKTFNSDGALQSHSKAKHGGK
SEQ ID NO:291
MEFWGVEVKSGEPLNVEPGAETVVHLSQACLGETKEKTKESVLLYVHIGVQKLVLGTLSADK
FPQIPFDLVFEKSFKLSHNWKNGSVFFSGYKTLLPCGSDADSPYSDSDTDEGLPINVTAQAD
VPAKKAPVTANANAAKPNLASAKQKVKIVESNEDGKNEGDDDEDADVSSDDDAEDDSGDE
DMVDGGDESSDEDDDDSEEGESSEEEEPKAQPSKKRPADSVLKTPASDKKSKLETPQKTD
GKKASEHVATPYPSKQAGKAIASKGQAKQQTPNSNEFSCKPCNRSFKSDQALQSHNKAKH
GGS
SEQ ID NO:292
MDTGGNSLPSGPDGVKRKVCYFYDPEVGNYYLLQHMQVLKPVPARDRDLCRFHADDYVAF
LRSITPETQQDQLRQLKRFNVGEDCPVFDGLHSFCQTYAGGSVGGAVKLNHGLCDIAINWA
GGLHHAKKCEASGFCYVNDIVLGILELLKQHERVLYVDIDIHHGDGVEEAFYTTDRVMTVSFH
KFGDYFPGTGDIRDIGYGKGKYYSLNVPLDDGIDDESYHSLFKPIIGKVMEVFKPGAVVLQCG
ADSLSGDRLGCFNLSIKGHAECVRYMRSFNVPVLLLGGGGYTIRNVARCWCYETGVALGLE
VDDKMPQHEYYEYFGPDYTLHVAPSNMENKNSRQLLEEIRSKLLENLSKLQHAPSVPFQER
PPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLPSRVKRELIVEPEVKDQDSQKASIDH
GRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNVNKPSEQIFPK
SEQ ID NO:293
MDTGGNSLPSGPDGVKRKVCYFYDPEVGNYYYGQGHPMKPHRIRMTHALLAHYGLLQHM
QVLKPVPARDRDLCRFHADDYVAFLRSITPETQQDQLRQLKRFNVGEDCPVFDGLHSFCQT
YAGGSVGGAVKLNHGLCDIAINWAGGLHHAKKCEASGFCYVNDIVLGILELLKQHERVLYVDI
DIHHGDGVEEAFYTTDRVMTVSFHKFGDYFPGTGDIRDIGYGKGKYYSLNVPLDDGIDDESY
HSLFKPIIGKVMEVFKPGAVVLQCGADSLSGDRLGCFNLSIKGHAECVRYMRSFNVPVLLLG
GGGYTIRNVARCWCYETGVALGLEVDDKMPQHEYYEYFGPDYTLHVAPSNMENKNSRQLL
EDIRSKLLENLSKLQHAPSVPFQERPPDTELPEADEDQEDPDERWDPDSDMDVDEDRKPLP
SRVKRELIVEPEVKDQDSQKASIDHGRGLDTTQEDNASIKVSDMNSMITDEQSVKMEQDNV
NKPSEQIFPK
SEQ ID NO:294
MRPKDRISYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVLSYELHTKMEIYRPHKAYPAELA
QFHSPDYVEFLHRITPDTQHLFPNDLAKYNLGEDCPVFENLFEFCQIYAGGTIDAARRLNNQL
CDIAINWAGGLHHAKKCEASGFCYINDLVLGILELLKYHARVLYIDIDVHHGDGVEEAFYFTDR
VMTVSFHKFGDMFFPGTGDVKEIGGKEGKFYAINVPLKDGIDDTSFTRLFKAIISKVVETYQP
GAIVLQCGADSLAGDRLGCFNLSIDGHSECVRFVKKFNLPLLVTGGGGYTKENVARCWVVE
TGVLLDTELPNEIPENEYFKYFAPDYSLKIPRGNIVLENLNSKSYLSAIKVQVLENLRNIQHAPS
VQMQEVPPDFYIPDFDEDEQNPDERMDQHTQDKQIQRDDEYYDGDNDNDHNMDDS
SEQ ID NO:295
MTVAEDFHVNNRSKMVSQATPESRLTGGEDDNSLHNQVDELLCQELPERQVILEFEGTRPK
PYFSDHNGGENSALGVRATEDDLNSDVEAEEKQKEMTLEDMYKNDGTLYDDDEDDSDWEP
VKRQVELMRWFCTNCTMVNVEDVFLCDICGEHRDSGILRHGFYASPFMQDVGAPSVEAEV
QESREDHARSSPPSSSTVVGFDEKMLLHSEVEMKSHPHPERADRLQAIAASLATAGIFPGRC
RSLPVREITKEELQMVHSSEHVDAVEEMTSHMFSSYFTPDTYANEHSARAARIAAGLCADLAS
TIISGRSKNGFALVRPPGHHAGIKHAMGFCLHNNAAVAALAAQGAGAKKVLIVDWDVHHGN
GTQEIFDGNKSVLYISLHRHEGGNFYPGTGAAHEVGTMGAEGYCVNIPWSRRGVGDNDYVF
AFHHIVLPIASAFAPDFTIISAGFDAARGDPLGCCDVTPAGYAQMTHMLSALSGGKLLVILEGG
YNLRSISSSAVAVIKVLLGDSPISEIADAVPSKAGLRTVLEVLKIQRSYWPSLESIFWELQSQW
GMFLVDNRRKQIRKRRRVLVPIWWKWGRKSVLYHLLNGHLHVKTKR
SEQ ID NO:296
MAAAPSSPPTNRVDVFWHDGMLSHDTGRGVFDTGSDPGFLDVLEKHPENPDRVRNMVSIL
KRGPISPFISWHTATPAMSQLLSFHSPEYINELVEADKNGGKVLCAGTFLNPGSWDAALLAA
GNTLSAMKYVLDGKGKIAYALVRPPGHHAQPSQADGYCFLNNAGLAVRLALDSGCKRVVVV
DIDVHYGNGTAEGFYQSSDVLTISLHMNHGSWGPSHPQSGSVDELGEDEGYGYNMNIPLPN
GTGDRGYEYAVTELVVPAVESFKPEMVVLVVGQDSSAFDPNGRQCLTMDGYRAIGRTIRGL
ADRHSGGRILIVQEGGYHVTYSAYCLHATVEGILDLPDPLLADPIAYYPEDEAFPVKVVDSIKR
YLVDKVPFLKEH
SEQ ID NO:297
MVESSGGASLPSVGQDARKRRVSYFYEPTIGDYYYGQGHPMKPHRIRMAHNLIVHYYLHRR
MEISRPFPAATTDIRRFHSEDYVTFISSVTPETVSDPAFSRQLKRFNVGEDCPVFDGIFGFCQ
ASAGGSMGAAVKLNRGDSDIALNWAGGLHHAKKSEASGFCYVNDIVLGILELLKVHKRVLYV
DIDVHHGDGVEEAFYTTDRVMTVSFHKFGDFFPGSGHIKDTGAGPGKNYALNVPLNDGIDD
ESFRGMFRPIIQKVMEVYQPDAVVLQCGADSLSGDRLGCFNLSVKGHADCLRFLRSFNVPL
MVLGGGGYTMRNVARCWCYETAVAVGVEPENDLPYNEYYEYFGPDYTLHVEPCSMENLNA
PKDLERIRNMLLEQLSRIPHAPSVPFQMTPPITQEPEEAEEDMDERPKPRIWNGEDYESDAE
EDKSQHRSSNADALHDENVEMRDSVGENSGDKTREDRSPS
SEQ ID NO:298
MVVPSSNPHNREMAIRRRMASTFNKREDDFPSLREYNDYLEEVEEMTFNLIEGVDVPTIEAKI
AKYQEENAEQIMINRAKKAEEFAAALAASKGLPPQTDPDGALNSQAGLSVGTQGQYAPAIAG
GQPRPTGMAPQPVPLGTGLDIHGYDDEEMIKLRAERGGRAGGWSIELSKKRALEEAFGSLW
L
SEQ ID NO:299
MAAIISCHHYHSCCSSLIASKWVGARIPTSCFGRSSTQSNNAASVRQFVTRCSSSPSSRGQ
WQPHQNGEKGRSFSLRECAISIALAVGLVTGVPSLDMSTGNAYAASPALPDLSVLISGPPIKD
PRALLRYALPINNKAIREVQKPLEDITDSLKVAGLRALDSVERNVRQASRVLKQGKNLIVSGLA
ESKKDHGVELLDKLEAGMDELQQIVEDGNRDAVAGKQRELLNYVGGVEEDMVDGFPYEVP
EEYKNMPLLKGRAAVDMKVKVKDNPNLEECVFRIVLDGYNAPVTAGNFVDLVERHFYDGME
IQRADGFVVQTGDPEGPAESFIDPSTEKPRTIPLEIMVDGEKAPVYGATLEELGLYKAQTKLP
FNAFGTMAMARDEFEDNSASSQIFWLLKESELTPSNANILDGRYAVFGYVTENQDFLADLKV
GDVIESVQVVSGLDNLANPSYKIAG
SEQ ID NO:300
MAGEDFDIPPADEMNEDFDLPDDDDDAPVMKAGDEKEIGKQGLKKKLVKEGDAWETPDNG
DEVEVHYTGTLLDGTQFDSSRDRGTPFKFTLGQGQVIKGWDQGIKTMKKGENAIFTIPPELA
YGEAGSPPTIPPNATLQFDVELLSWTSVKDICKDGGIFKKILVEGEKWENPKDLDEVLVKYEF
QLEDGTTIARSDGVEFTVKEGHFCPAVAKAVKTMKKGEKVLLTVKPQYGFGEKGKPASGDE
GAVPPNATLQITLELVSWKTVSEVTDDKKVIKKILKEGEGYERPNEGAVVEVKLIGKLQDGTV
FVKKGHDDCEELFKFKIDEEQVVDGLDKAVMNMKKGEVALLTVAPEYAFGSSESKQDLAVV
PPSSTVYYEVELVSFVKDKESWDMNTEEKIEAAGKKKEEGNVIFKAGKYAKASKRYEKAVKY
IEYDTSFSEDEKKQAKALKVACNLNDAACKLKLKDYNQAEKLCTKVLELDSRNVKALYRRAQ
AYIELSDLDLAEFDIKKALEIDPHNRDVKLEYKVLKEKVKEFNKKDAKFYGNMFAKMSKLEPV
EKTAAKEPEPMSIDSKA
SEQ ID NO:301
MSTVYVLEPPTKGKVVLNTTHGPLDVELWPKEAPKAVRNFVQLCLEGYYDNTIFHRIIKDFLV
QGGDPTGSGTGGESIYGDAFSDEFHSRLRFKHRGLVACANAGSPHSNGSQFFITLDRCDWL
DRKNTIFGKITGDSIYNLSGLAEVETDKSDRPLDPPPKIISVEVLWNPFEDIVPRAPVRSLVPTV
PDVQNKEPKKKAVKKLNLLSFGEEAEEEEKALVVVKQKIKSSHDVLDDPRLLKEHIPSKQVDS
YDSKTARDVQSVREALSSKKQELQKESGAEFSNSFREIADDEDDDDDDASFDARMRRQILQ
KRKELGDLPPKPKPKSRDGISARKERETSISRDKDDDDDDDQPRVEKLSLKKKGIGSEARGE
RMANADADLQLLNDAERGRQLQKQKKHRLRGREDEVLTKLETFKASVFGKPLASSAKVGDG
DGDLSDWRSVKLKFAPEPGKDRMTRNEDPNDYVVVDPLLEKGKEKFNRMQAKEKRRGRE
WAGKSLT
SEQ ID NO:302
MASAISMHSSGLLLLQGTNGKDVTEMGKAPASSRVANMQQRKYGATCCVARGLTSRSHYA
SSLAFKQFSKTPSIKYDRMVEIKAMATDLGLQAKVTNKCFFDVEIGGEPAGRIVIGLFGDDVP
KTVENFRALCTGEKGFGYKGCSFHRIIKDFMIQGGDFTRGNGTGGKSIYGSTFEDENFALKH
VGPGVLSMANAGPSTNGSQFFICTVKTPWLDNRHVVFGQVVDGMDVVQKLESQETSRSDV
PRQPCRIVNCGELPLDG
SEQ ID NO:303
MAASFTALSNVGSLSSPRNGSEIRRFRPSCNVAASVRPPPLKAGLSASSSSSFSGSLRLIPLS
SSPQRKSRPCSVRASAEAAAAQSKVTNKVYLDISIGNPVGKLVGRIVIGLYGDDVPQTAENFR
ALCTGEKGFGYKGSTVHRVIKDFMIQGGDFDKGNGTGGKSIYGRTFKDENFKLSHVGPGVV
SMANAGPNTNGSQFFICTVKTPWLDQRHVVFGQVLEGMDIVRLIESQETDRGDRPRKRVVV
SDCGELPVV
SEQ ID NO:304
MAEAIDLTGDGGVMKTIVRRAKPDAVSPSETLPLVDVRYEGVLAETGEVFDSTHEDNTLFSF
EIGKGSVISAWDTALRTMKVGEVAKITCKPEYAYGSTGSPPDIPPDATLIFEVELVACKPCKGF
SVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGKGKA
K
SEQ ID NO:305
MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFH
RVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFV
CTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS
SEQ ID NO:306
MPNPKVFFDMTIGGAAAGRVVMELYADTTPRTAENFRALCTGEKGVGRSKKPLHYKGSKFH
RVIPSFMCQGGDFTAGNGTGGESIYGVKFADENFIKKHTGPGILSMANAGPGTNGSQFFICT
TKTEWLDGKHVVFGKVVEGMEVVKAIEKVGSSSGRTSKPVVVADCGQLP
SEQ ID NO:307
MAEAIDLTGDGGVMKTIVRRAKPDAVSPSETLPLVDVRYEGVLAETGEVFDSTHEDNTLFSF
EIGKGSVISAWDTALRTMKVGEVAKITCKPEYAYGSTGSPPDIPPDATLIFEVELVACKPCKGF
SVTSVTEDKARLEELKKQREIAAATKEEEKKRREEAKAAAAARVQAKLDAKKGHGKGKGKA
K
SEQ ID NO:308
MATARSFFLCALLLLATLYLAQAKKSEDLKEVTHKVYFDVEIAGKPAGRIVMGLYGKAVPKTA
ENFRALCTGEKGTGKSGKPLHYKGSSFHRIIPSFMLQGGDFTLGDGRGGESIYGEKFADENF
KLKHTGPGLLSMANAGPDTNGSQFFITTVTTSWLDGRHVVFGKVLSGMDVVYKVEAEGRQS
GTPKSKVVIADSGELPL
SEQ ID NO:309
MMRREISVLLQPRFVLAFLALAVLLLVFAFPFSRQRGDQVEEEPEITHRVYLDVDIDGQHLGRI
VIGLYGEVVPRTVENFRALCTGEKGKSANGKKLHYKGTPFHRIISGFMIQGGDVIYGDGKGY
ESIYGGTFADENFRIKHSHAGIISMVNSGPDSNGSQFFITTVKASWLDGEHVVFGRVIQGMDT
VYAIEGGAGTYNGKPRKKVIIADSGEIPKSKWDEER
SEQ ID NO:310
MWATAEGGPPEVTLETSMGSFTVELYFKHAPRTSRNFIELSRRGYYDNVKFHRIIKDFIVQG
GDPTGTGRGGESIYGKKFEDEIKPELKHTGAGILSMANAGPNTNGSQFFITLAPCPSLDGKH
TIFGRVCRGMEIIKRLGSVQTDNNDRPIHDVKILRTSVKD
SEQ ID NO:311
MSNPKVFFDILIGKMKAGRVVMELFADVTPKTAENFRALCTGEKGIGRSGKPLHYKGSTFHRI
IPNFMCQGGDFTRGNGTGGESIYGMKFADENFKIKHTGLGVLSMANAGPDTNGSQFFICTE
KTPWLDGKHVVFGKVIDGYNVVKEMESVGSDSGSTRETVAIEDCGQLSEN
SEQ ID NO:312
MDDDFEFPASSNVENDDDDGMDMDDMGGDVPEEEDPVASPAVLKVGEEREIGKAGFKKKL
VKEGEGWETPSSGDEVEVHYTGTLLDGTKFDSSRDRGTPFKFKLGRGQVIKGWDEGIKTMK
KGENAIFTIPPELAYGESGSPPTIPPNATLQFDVELLSWSSVKDICKDGGILKKVLVEGEKWD
NPKDLDEVFVKYEASLEDGTLISKSDGVEFTVGDGYFCAALAKAVKTMKKGEKVLLTVMPQY
AFGETGRPASGDEAAVPPDASLQIMLELVSWKTVSDVTKDKKVLKKTLKEGEGYERPNDGA
AVQVRLCGKLQDGTVFVKKDDEEPFEFKIDEEQVIDGLDRAVKNMKKGEVALVTIQPEYAFG
PTESQQDLAVVPANSTVYYEVELLSFVKEKESWEMNNQEKIEAAARKKEEGNAAFKAGKYV
RASKRYEKAVRFIEYDSSFSDEEKQQAKTLKNTCNLNDAACKLKLKDFKEAEKLCTKVLEGD
GKNVKALYRRAQAYIQLVDLDLAEQDIKKALEIDPNNRDVLLEYKILKEKVREYNKRDAQFYG
NMFAKMNKLEHSRTAGMGAKHEAAPMTIDSKA
SEQ ID NO:313
MAKPRCFMDISIGGELEGRIVGELYTDVAPKTAENFRALCTGEKGIGPHTGAPLHYKGVRFH
RVIKGFMVQGGDISAGDGTGGESIYGLKFEDENFDLKHERKGMLSMANSGPNTNGSQFFITT
TRTSHLDGKHVVFGRVVKGMGVVRSVEHVTTAAGDCPTVDVVIADCGEIPAGADDGIRNFF
KDGDTYPDWPADLDESPAELSWWMDAVDSIKAFGNGSYKKQDYKMALRKYRKALRYLDIC
WEKEGIDEVESSSLRKTKSQIFTNSSACKLKLCDLKGALLDAEFAVRDGENNAKAYFRQGQA
HMELNDIDAAAESFSKALELEPNDVGIKKELNAAKKKIFERREQEKRAYRKMFL
SEQ ID NO:314
MTKRKNPLVFLDVSIDGDPVERIVIELFADTVPRTAENFRSLCTGEKGVGKTTGKPLHYKGSY
FHRIIKGFMAQGGDFSNGNGTGGESIYGGKFADENFKLAHDGPGLLSMANGGPNTNGSQFF
IIFKRQPHLDGKHVVFGKVMRGMEVVKKIEQVGSANGKPLQPVKIVDCGETSETGTQDAVVE
EKSKSATLKAKLRSARDSSSESRGKRRQRKSRKERTRKRRRYSSSDSYSSESSDSDSES
YSSDTESESKSHSESSVSDSSSSDGRRRKRKSTKREKLRRQRGKDSRGEQKSARYDKKSR
HKSADSSSDSESESSSRSRSRDDKKKSSRRESARSVSKLKDAEANSPENLESPRDREIKKV
EDNSSHEEGEFSPKNDVQHNGHGTDAKFGKYDDQRPRSDGSKKSSGSMRDSPKRLANSV
PQGSPSSSPAHKASEPSSSIRARNPSRSPAPDGNSKRIRKGRGFTERFSYARRYRTPSPED
VTYRPYHYGRRNFHDRRNDRYSNYRSYSERSPHRRYRSPPRGRSPPRYQRRRSRSRSVS
RSPGGNKGRYRGRDQSRSRSRSRSRSPRRGSSPANKQLPLSERLKSRLGTRVDEHSPRR
RRSSSRSHDSSRSRSPDEVPDKHEGKAAPVSPARSRSSSPSGRGLVSYGDASPDSGIN
SEQ ID NO:315
MSVLLVTSLGDIVVDLHADRCPLTCKNFLKLCRIKYYNGCVFHTVQKDFTAQTGDPTGTGTG
GDSVYKFLYGDQARFFMDEIHLDLKHSKTGTVAMASGGENLNASQFYFTLRDDLDYLDGKH
TVFGEVAEGLETLTRINEAYVDEKGRPYKNIRIRHTYILDDPFDDPPQLAELIPDASPEGKPKD
EVVDDVRLEDDWVPLDEQLGPAQLEEAIRAKEAHSRAVVLESIGDIPDAEIKPPDNVLFVCKL
NPVTEDEDLHTIFSRFGTVVSADVIRDFKTGDSLCYAFIEFENKDSCEQAYFKMDNALIDDRRI
KVDFSQSVAKLWSQFKRKDSQAAKGKGCFKCGAPDHMARECPGSSTRQPLSKYILKEDNA
QRGGDDSRYEMVFDEDAPESPSHGKKRRGRDDRDDRHKMSRQSVEETKFNDREGGHSV
DKHRQSERSKHREDEMSRDSKASEAGRRRIDRDFPEEERDGEKYTESHRDRDGKRGDYR
DYRKGRADVQTHGDRRGDENYRRKSAAYDDGHEGAGAARRKDSNDDHHAYRRGYGDSR
KGTRDEDDDGRGRRDDPSYRRSSGHKDSSNGGREEQKYRSGETDGKSHPERSHRGDRR
R
SEQ ID NO:316
MRPFNGGSSIACLVLVIAAGALAESQGPHLGSARVVFQTNYGDIEFGFFPGVAPRTVDHIFKL
VRLGCYNTNHFFRVDKGFVAQVADVANGRTAPMNDEQRTEAEKTIVGEFSNVKHVRGILSM
GRYDDPDSAQSSFSILLGDAPHLDGKYAIFGRVTKGDETLKKLEQLPTRREGMFVMPTERITI
LSSYYYDTGAESCEEENSTLRRRLAASAVEVERQRMKCFP
SEQ ID NO:317
MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSF
HRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFI
CTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA
SEQ ID NO:318
MRFTSITSAIALFAAAASALDKPLDIKVDKAVECSRKTKAGDKIQVHYRGTLEADGSEFDASYK
RGQPLSFHVGKGQVIKGWDQGLLDMCPGEKRTLTIQPDWGYGSRGMGPIPANSVLIFETEL
VEIAGVAREEL
SEQ ID NO:319
MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFH
RVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFV
CTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS
SEQ ID NO:320
MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTH
KVFFDVEIGGKPAGRIVMGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQFHRIIPKF
MIQGGDFTLGDGRGGESIYGNKFSDENFKLKHTDAGRLSMTNAGPDTNGSQFFITTVTTSW
LDGRHVVFGKVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL
SEQ ID NO:321
MAVTLHTNLGDIKCEIFCDEVPKAAEHNARGILSMANSGPNTNGSQFFIAYAKQPHLNGLYTI
FGRVIHGFEVLDIMEKTQTGPGDRPLAEIRLNRVTIHANPLAG
SEQ ID NO:322
MAVATRSRWVAMSVAWILVLFGTLALIQNRLSDTGASSDPKLVHRKVGEEKKKPDDLEEVTH
KVFFDVEIGGKPAGRIVMGLFGKTVPKTVENFRALCTGEKGIGKSGKPLNYKGSQFHRIIPKF
MIQGGDFTLGDGRGGESIYGNKFSDENFKLKHTDAGRLSMANAGPDTNGSQFFITTVTTSW
LDGRHVVFGKVLSGMDVVHKIEAEGGQSGQPKSIVVISDSGELDL
SEQ ID NO:323
MGNPKVFFDMSIGGQPAGRIVMELYADVVPRTAENFRALCTGEKGAGRSGKPLHYKGSSFH
RVIPGFMCQGGDFTAGNGTGGESIYGSKFADENFVKKHTGPGVLSMANAGPGTNGSQFFV
CTAKTEWLDGKHVVFGQIVDGMDVVKAIEKVGSSSGRTSKPVVVADCGQLS
SEQ ID NO:324
MSPVAANAMEEAAEPEVPAPVTPSKDDADTDAAVSRFLGFCKSKLGLAEGNCVQSSTLLRK
TAHVLRSSGTVIGTGTAEEAERYWFAFVLYTVRRVGERKAEDEQNGSDETEVPLSRILKASV
LNLIDFFKEIPQFVIKAGAIVSGIYGANWDSRLEAREMQTNYVHLCILCKFYKRICGEFFILNDA
KDDMKSADSSTSDPVIMYQPFGWLLFLALRIHALSRFKDLVSSTNALVSVLAILIIHLPTRFRKF
SISDSSQLVKRSEKGVDLVGSLAYRYDTSEDEIKRTLEKANNVIAEILGITPPPASECKAENLE
NVDTDGLIYFGNLMEETSLSSILSTLEKIYEDATRNDSEFDERVFINDDDSLLVSGSLSGAAINL
TGAKRKYDSFASPAKTITRPLSPSRSPASHINGIIGGTNLRITATPVATAMTTAKWLRTFVSPL
PSKPSTDLQGFLASCDRDVTSDVIRRANIILEAIFPNSPIGERTVTGGLQNANLMDNMWAEQR
RLEALKLYYRVLEAMCRAEAQILHSNNLTSLLTNERFHRCMLACSAELVLATHKTVTMLFPAV
LERTGITAFDLSKVIESFVRHEETLPRELRRHLNTLEERLLENMVWERGSSMYNSLVVARPAL
APEINRLGLLPEPMPSLDAIALLINFSSSGLPQSPVQKHEASPGQNGDIRSPKRISTEYRSVLV
ERNFTSPVKDRLLALSNIKSKLPPPPLQSAFASPTRPHPGGGGETCAETAIHIFFSKITKLAAV
RINAMLERLQLSQQIKEGVYCLFQQILSQRTNLFFNRHIDQVILCCFYGVAKINQINLTFREIIYN
YRKQPQCKPQVFRNVFVDWSTRRNGKAGNEHVDIISFYNEIFIPSVKPLLVELGPTGATTRTN
RTSEVGNKNDAQCPGSPKISSFPTLPDMSPKKVSASHNVYVSPLRSSKMDASISHSSKSYYA
CVGESTHAYQSPSKDLVAINSRLNGNRKVRGTLNFDDVDAGLVSDSMVANSLYLQNGSSMS
SSTAKSSEKPES
SEQ ID NO:325
MRPILMKGHERPLTFLKYNREGDLLFSCAKDHTPTVWFADNGERLGTYRGHNGAAVWCCDV
SRDSMRLITGSADTTAKLWSVQNGTQLFTFNFDSPARSVDFSIGDKLAVITTDPFMELPSAIH
VKRIARDPADQASESVLVLRGHQGRIARAVWGPLNKTIISAGEDAVIRIWDSETGKLLRESDK
ETGHKKAVTSLMKSVDGSHFVTGSQDKSAKLWDIRTLTLIKTYVTERPVNAVTMSPLLDHVV
LGGGQDASAVTMTDHRAGKFEAKFFDKILQEEIGGVKGHFGPINALAFNPDGKSFSSGGED
GYVRLHHFDPDYFNIKI
SEQ ID NO:326
MDKKRTVVPLVCHGHSRPVDLFYSPITPDGFFLISASKDSSPMLRNGETGDWIGTFEGHKG
AVWSCCLDTNALRAASGSADFSAKLWDALSGDELHSFEHKHIVRSCAFSEDTHLLLTGGVE
KILRIFDLNRPDAPPREVDNSPGSIRTVAWLHSDQTILSSCTDIGGVRLWDVRSGKIVQTLETK
SPVTSSEVSQDGRYITTADGSTVKFWDANHFGLVKSYNMPCNIESASLEPKLGNKFLAGGED
MWVHIFDFHTGEEIGCNKGHHGPVHCVRFSPGGESYASGSEDGTIRIWQTGPANNVEGDA
NPSNGPVTGKAKVGADEVTRKVEDLQIGKEGKDWREG
SEQ ID NO:327
MAEGLILKGTMRAHTDMVTAIAIPIDNSDMVVTSSRDKSIILWHLTKEEKVYGVPRRRLTGHS
HFVQDVVLSSDGQFALSGSWDGELRLWDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSAS
RDRTIKLWNTLGECKYTIQEGEAHTDWVSCVRFSPNTLQPTIVSASWDRTIKVWNLTNCKLR
NTLAGHNGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKRLYNLEAGAIIHSLCFSPNRYW
LCAATENSIKIWDLESKSIVEDLRVDLKNEADKTDGTTTAASNKKVIYCTSLNWSADGSTLFS
GYNDGVIRVWGTGRY
SEQ ID NO:328
MAEGLHLKGTMKAHTDMVTAIAVPIDNADMIVTSSRDKSIILWHLTKEDKVYGVPRRRLTGHS
HFVQDVVLSSDGQFALSGSWDGELRLWDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSAS
RDRTIKLWNTLGECKYTIQEGEAHNDWVSCVRFSPNTLQPTIVSASWDRTVKVWNLTNCKL
RNTLQGHSGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKKLYSLEAGAIIHSLCFSPNRY
WLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDGTTGAMSSNKKVIYCTSLNWSADGST
LFSGYNDGVIRVWGIGRY
SEQ ID NO:329
MAEGLHLKGTMKAHTDMVTAIAVPIDNADMIVTSSRDKSIILWHLTKEDKVYGVPRRRLTGHS
HFVQDVVLSSDGQFALSGSWDGELRLWDLATGVSARRFVGHTKDVLSVAFSIDNRQIVSAS
RDRTIKLWNTLGECKYTIQEGEAHNDWVSCVRFSPNTLQPTIVSASWDRTVKVWNLTNCKL
RNTLQGHSGYVNTVAVSPDGSLCASGGKDGVILLWDLAEGKKLYSLEAGAIIHSLCFSPNRY
WLCAATENSIKIWDLESKSIVEDLRVDLKNEADMSDSTTGAMSSNKKVIYCTSLNWSADGST
LFSGYNDGVIRVWGIGRY
SEQ ID NO:330
MSGVPAPPFATTTPENGTMSSNSPAFHRDSDDDDDQGEVFLDDSDIIHEVAVDDEDLPDAD
DEADEAEEADDSLHIFTGHNGEVYSLACSPTDATLVATGAGDDKGFLWRIGHGDWAVELQG
HKDSISSLAFSLDGQLLASGSLDGVIQIWDVPSGNLKGTLDGPGGGIEWIRWHPKGHIILAGS
EDSTVWMWNADKMAYLNMFSGHGNSVTCGDFTPDGKTICTGSDDATLRIWNPKSGENIHV
VKGHPYHAEGLTSMAISSDSGLAITGAKDGSVRIVNISSGRVVSSLDAHADSVEFVGLALSSP
WAATGSLDQKLLIIWDLQHSSPRATCDHEDGVTCLSAVGASRFLASGCVDGKVRVWDSLSG
DCVRTFHGHSDAIQSLSVSANEEFLVSVSIDGTARVFEIAEFH
SEQ ID NO:331
MGTSQHQLSSCLQLLPRRRGNKNLIFRRTMASGGAAAVAPPPGYKPYRHLKTLTGHVAAVS
CVKFSNDGTLLASASLDKTLIIWSSAALSLLHRLVGHSEGVSDLAWSSDSHYICSASDDRTLRI
WSSRSPFDCLKTLRGHTDFVFCVNFNPQSSLIVSGSFDETIRIWEVKTGRCLNVIRAHSMPVT
SVHFNRDGSLIVSGSHDGSCKIWDTKNGACLKTLIDDTVPAVSFAKFSPNGKFILVATLNDTL
KLWNYATGKFLKIYTGHKNSVYCLTSTFSVTNGKYIVSGSEDRCICIWDLQGKNLIQKLEGHS
DTVISVTCHPSENKIASAGLDSDRTVRIWLQDA
SEQ ID NO:332
MPSQKIETGHQDIVHDVAMDYYGKRVATASSDTTIKIIGVSNSSGSQHLASLSGHKGPVWQV
AWAHPKFGSILASCSYDGQVILWKEGNQNDWAQAHVFNDHKSSVNSIAWAPHELGLCLAC
GSSDGNISVFTARPDGGWDTTRIEQAHPVGVTSVSWAPSMAPGALVGSGLLDPVQKLASG
GCDNTVKVWKLYNGTWKMDCFPALQMHSDWVRDVAWAPNLGLPKSTIASASQDGTVVIWT
VAKEGEQWQGKVLKDFKTPVWRVSWSLTGNLLAVADGNNNVTLWNEAVDGEWQQVTTVE
P
SEQ ID NO:333
MKIAGLKSVENAHDESVWAAAWVPATESRPALLLTGSLDETVKLWRPDELALERTNAGHFL
GWSVAAHPSGVIAASASIDSFVRVFDVDTNATIATLEAPPSEVWQMQFDPKGTTLAVAGGG
SASIKLWDTATWELNATLSIPRPEQPKPSEKGNKKFVLSVAWSPDGRRLACGSMDGTISIFD
VARAKFLHHLEGHFMPVRSLVFSPVEPRLLFSASDDAHVHMYDSEGKSLVGSMSGHASWV
LSVVSPDGAALATGSSDRTVRLWDLSMRAAVQTMSNHSDQVWGVAFRPMAGAGVRAGG
RLASVSDDKSISLYDYS
SEQ ID NO:334
MEIDLGNLAFDVDFHPSEQLVASGLITGDLLLYRYGDGSSPEKLLEVRAHGESCRAVRFINDG
KAILTGSPDCSILATDVETGSVVARVENAHEAAVNRLVNLTESTIATGDDNGCIKVWDTRQRS
CCNTFSAHEDFISDMTFASDSMKLVVTSGDGTLSVCNLRSNKVQTRSEFSEDELLSVVIMKN
GRKVVCGTQSGTLLLYSWGFFKDCSDRFVDLSPSSVDALLKLDEDRIIAGTENGLISLIGILPN
RIIQPIAEHSDHPIERLAFSHDKKFLGSISHDQTLKLWDLNDILGSEDSPSSQAAIDDSDSDEM
DVDANPPDSSKGNKKKHSGKGNDVGNANNFFADLGD
SEQ ID NO:335
MSQQPSVILATASYDHTIRFWEAKSGRCYRTIQYPDSQVNRLEITPHKRYLAVAGNPSIRLFD
VNSNTPQPVMSFDSHTNNVMAVGFQYDGNWMYSGSEDGTVRIWDLRARGCQREYESRGA
VNTVVLHPNQTELISGDQNGNIRVWDLTANSCSCELVPEVDTAVRSLTVMWDGSLVVAANN
NGTCYVWRLLRGSQTMTNFEPLHKLQAHNGYILKCLLSPEFCEPHRYLATASSDHTVKIWNV
EGFTLEKTLIGHQRWVWDCVFSVDGAYLITASSDTTARLWSMSTGQDIRVYQGHHKATTCC
ALHDGAEGSPG
SEQ ID NO:336
MEDAMDMEVEVEVEAEEHSPSSSNPSGSSFRRFGLKNSIQTNFGSDYVFEITPKFDWSLMG
VSLSSNAVKLYSPTTGQYCGECRGHSDTVNGISFSGPSSPHVLHSCSSDGTIRAWDTRSFK
EVSCISAGPSQEIFSFSFGGSSDSLLSAGCKSQILFWDWRNKKQVACLEDSHVDDVTQVCFV
PHHQNKLISASVDGLICIFDTAGDINDDEHMESVINVGTSIGKVGIFGQTFEKLWCLTHIETLSV
WDWKEGTNEANFEDARKLASDSWSLDHIDYFVDCHSAEEGEGLWVIGGTNAGTLGYFPVK
YKGGAAIGSPEAVLGGGHSDVVRSVLPMSGMAGTTSKTRGIFGWTGGEDGRLCCWLSDDS
SATSRSWMSSNLVLKSSRSHHKKNRHQPY
SEQ ID NO:337
MSQHQEYPMEYAADDYDVGEVEDDMYFHERVMGDSDTDEDEYDHLDNKITDTSAADARR
GKDIQGIPWERLSVTREKYRRTRIEQYKNYENVPQSGESSEKDCKPTRKGGNYYEFWRNTR
SVKSTILHFQLRNLVWSTTKHDVYLMSHFSIIHWSSLTCKKTEVLDVYGHVAPREKHPGSLLE
GFTQTQVSTLAVRDKLLIAGGFQGELICKNLDRPGVSYCCRTTYDDNAITNAVEIYDYPSGAV
HFMASNNDCGVRDFDMEKFELSRHFTFPWPVNHTSLSPDGKLLVIVGDNPEGIVVDSQRGK
TIRPLQGHLDFSFASAWHPDGHIFATGNQDKTCRIWDIRNLSKSVAVLKGNLGAIRSIRFTSD
GRFMAMAEPADFVHVYDVKSGYEKEQEIDFFGEISGVSFSPDTESLFVGVWDRTYGSLLQY
NRCRNYSYLDSM
SEQ ID NO:338
MGASSDPNPDVSDEHQKRSEIYTYEAPWHIYAMNWSVRRDKKYRLAIASLLDHPAAAAAVP
NRVEIVQLDDSTGEIRADPNLSFDHPYPATKAAFVPDKDCQRADLLATSSDFLRIWRIADDSS
RVDLRSFLNGNKNSFFCRPLTSFDWNEAEPKRIGTSSIDTTCTIWDIERETVDTQLIAHDKEV
YDIAWGGVSVFASVSADGSVRVFDLRDKEHSTIIYESSEPDTPLVRLGWNKQDPRYMATIIM
DSAKVVVLDIRYPTMPVVELQRHQASVNAIAWAPHSSCHICTAGDDSQALIWDLSSMAQPVE
GGLDPILAYTAGAEIEQLQWSSSQPDWVAIAFSLKLQ
SEQ ID NO:339
MRGGGGGGDATGWDEDAYRESVLKEREVQTRTVFRAAFAPSPSPSPSPDAVVVASSDGS
VASYSISACLSDHRLQSLRFADAKSQNVLEAEPACFLQGHDGPAYDVKFYGEGEDSLLLSCG
DDGRIRGWMWRDITSSEAHDHSQGNSAKPVLDLVNPQSRGPWGALSPIPENNALAVDVKR
GSIYAAAGDSCAYCWDVECGKIKTVFKGHSDYLHCIAARNSSSQIITGSEDGTARIWDCRSG
KCVQVIDPDKDHKKGFFASVSCLALDASESWLVCGRGRDLSVWSISASDCIAKISTNAPAQD
VLFDDNQILLVGAEPLISRLDMNGAVLSQIHCAPQSVFSVSLHQSGVTAVGGYGGLVDVISQF
GSHLCTFRCKCI
SEQ ID NO:340
MEAPIIDPLQGDFPEVIEEYLEHGIMKCIAFNRRGTLLAAGCTDGSCIIWDFETRGVAKELRDK
ECTAAITSVCWSKYGHRILVSASDKSLILWDVLSGEKIAHTTLQHTVLQACLHPGSSTPSICLA
CPFSSAPMIVDLNTGSTTALPVLTADVSNGATPLSRNKTSDTSVTYSPCNACFNKHGDLVYA
GTSKGEILIIDHKNVRVCAIVLVSGGAVIKNVVFSRNGQYMLTNSNDRLIRIYKNLLPPKDGLK
MLDELNESFNESDDVEKLKAIGSKCLELLHEFQDSTTRVQWKAPCFSGDGEWVIGGAASRG
EHKIYIWDRAGHLVKILEGPKEALMDLAWHPVHPIIISVSLTGLVYIWAKDYTENWSAFAPDFK
ELEENEEYVEREDEFDLVPETEKVKGLDVHEDDEVDVLTVERDSVFSDSDMSQEELCFLPA
VPCLDIPEQQDKCVGSCSKLPDGNHSGSPLSVEAGQNGNASNHNSSPLEPMENSTADDTD
GVRLKRKRKPSEKGLELQAEKVKKPVKPLKSSGRLSKTNKPVIDPDSSNGVYGDDGSD
SEQ ID NO:341
MRGVSWPEDGNNPSTSSSSQRNQQQAHAPRAVSGHAASHPSASNIFKLLVQREVSPRSKH
SSKKLWREASKCQPYPFQQSCEAVRDVRQGLISVWESASLRHLSAKYCPLVPPPRSTIAAAF
SPDGKILASTHGDHTVKLIDSQTGSCLKVLRGHRRTPWVRFHPLYPEILASGSLDHEVRLW
DANTAECIGSRNFYRPIASIAFHARGELLAVASGHKLYIWHYNRRGETSSPTIVLRTQRSLRA
VHFHPHAAPFLLTAEVNDLDSADSAMTLATSPGYLHYPPPTVYFADAHSHERSRLADELPLM
PLPLLMWPSFTRDDGRVPLQRIDGDVGLNGQQRVDSSSSVRLWTYSTPSGQYELLLSPVES
GNSPSMPEETGNNAFSSAVEAEVSQSAMDTVEDMEVQPEERNTQFFSFSDPRFWELPLLH
GWLVGQTQAGPRSVRQSSPGDIETQSAFGEVASVSPITSGVMPVSMDPSRFGGRSGSRYR
SPGSRGVHVTGPNNDGPRDENDPQSVVSKLRSELAASLAAAASTELPCTVKLRIWPHDVKD
PCAQLDLESCRLTIPHAVLCSEMGAHFSPCGRFLAACVACVLPHLESDPGLHGQVNQDVTG
VATSPTRHPISAHQIMYELRIYSLEEATFGIVLASRPVRAAHCLTSIQFSPTSEHLLLAYGRRHS
SLLKSIVIDGENTVPIYTILEVYRVSDMELVRVLPSAEDEVNVACFHPSVGGGLIYGTKEGKLRI
LHYDSSHGLNLKSSGFLDENVPEVQTYALEC
SEQ ID NO:342
MDSAVAIAALSLVVGAAIALLFFGNYFRKRRSEVVAMAEADLQPHPKNPSRPPPQPAAKKVH
AKSHAHGADKDKNKRHHPLDLNTLKGHGDSVTGLCFASDGRSLATACADGVVRVFKLDDAS
NKSFKFLRINLPAGGHPTAVAFGDGVSSVIVASQHLSGCSLYMYGEEKPTNLDSNKQQTKLP
MPEIKWEHHKVHEQKAILTLSGAAANYDSGDGSTIIASCSEGTDIIIWHAKTGKILGNVDTNQL
KNTMSAISPNGRFIAAAAFTADVKVWEIVYSKDGSVKGVTKVMQLKGHKSAVTWLCFTPNSE
QIVTASKDGSIRIWNINVRYHLDEDTKTLKVFPIPLQDSSGTTLHYERLSLSPDGKILAATHGS
MLQWLCIETGKVLDTAEKAHDGDITCMSWAPQSIPTGDKKVNVLATASGDKKVKLWAAPPL
PS
SEQ ID NO:343
MEVEPKKASKTFPVKPKLKPKPRTPSGKTPESKYWSSFKTTHPLDNLSFSVPSLAFSPSPPH
LLAAAHSATVSLFSPHRTTISSFSDVVSSLSFRSDGQLLAASDLSGLIQVFDVRSRTPLRRLRS
HARPVRFVRYPVLDKLHLVSGGDDALVKYWDVAGESVVSELRGHKDYVRCGDCSPADANC
FVTGSYDHVVKLWDVRVRDGNRAATEVNHGSPVQDVIFLPSGSLVATAGGNSVKIWDLIGG
GRMVYSMESHNKTVTSICVGTMGAQQSGEEGVQLRILSVGLDGYMKVFDYSRMKVTHSMR
FPAPLLSIGFSPDSNVRAIGTSNGILYVGKRKAKENAEGGANGILGLGSVEEPRRRVLKPSFY
RYFHRGQSEKPSEGDYLVMRPKKVKLAEHDKLLKKFQHKNALISVLGGNDPEKVVAVMEEL
VARRALLKCVLNLDADELGLILTFLHKNSTVPRYSSLLLGLAKKVIDLRLEDIRASDALKGHIRN
LKRSVDEEIRIQEGLQEIQGMVSPLLRIAGRR
SEQ ID NO:344
MQGGSSGVGYGLKYQARCISDVKADTDHTSFLTGTLSLKEENEVHLLRLSSGGTELICEGLF
SHPSEIWDLSSCPFDQRIFSTVFSTGESYGAAVWQIPELYGQLNSPQLEKIASLDAHSRKISC
VLWWPSGRHDKLVSIDEENIFLWGLDCSKKSAQVQSQESAGMLHNLSGGAWDPHDVNTVA
ATCESSIQFWDLRTMKKANSLESVHARDLDYDMRKKHLLVTSEDESGVRVWDLRMPKAPIQ
EFPGHTHWTWAVRCNPDYEGLILSAGTDSAVNLWWSSTASSDELISERLIDSPTRKLDPLLH
SYNDYEDSVYGLAWSSREPWIFASLSYDGRVVVESVKPFLSRK
SEQ ID NO:345
MAEEEGSAELEQQLEEEFAVWKKNTPILYDLLISHALEWPSLTVHWAPLLPQPSSSAAAAAG
DPSLAAHRLVLGTHTSDGAPNFLILADALLPSSESDHCGDDAVLPKVEISQKIRVDGEVNRAR
FMPQNHNIVGAKTNGCEVYVFDCSKQAAKQHDGGFDPDLRLTGHDGEGYGLSWSPLKENY
LLSASHDKKICLWDISAAAQDKVLGAMHVFEAHEGAVGDASWHSKNDNLFGSAGDDCQLMI
WDLRTNKAQQCVKAHEKEVNSVSFNSYNDWILATASSDTTVGLFDMRKLTTPLHVFSSHEG
EVLQVEWDPNHEAVLASSSEDRRVMVWDLNRIGDEQQEGDASDGPAELLFSHGGHKAKIS
DFSWNKNEPWVISSVAEDNSVQVWQMAESICGDDDDMQAMEGYI
SEQ ID NO:346
MGNYGEEDEDQYFDALEETASVSDRGSNSSDCCSSGSGLDENVLDSLGFEFWTKFPESVR
ARRNRFLMLTGLGIEANSVDKEDAFPPSCNEIEVYTCKVTRDDGAVQRSLDSYNCISLLQSST
SIRSNQEVESLRGDSLLSSFRGRSKESDDLTELCGMGCPESKRNAVSEFGSVSQGSIEELR
RIVASSPLVHPLLHRKLEYERELIETKQKMGAGWLRKFGSATCISGRQGDTWSDPDDLEITA
GMKMRRVRAHSSKKKYKELSSLYAAQEFLAHEGSISTMKFSMDGQYLASAGEDTVRVWK
VTEEDRSERVNVTVDPSCLYFALNESTQLASLNTNKEHIGKAKTFQRSSDSSCVILPLKVFQIT
EKPWHEFKGHNGEVLDLSWSSKGYLLSSSTDKTVRLWRVGCDRCQRVYSHNDYVTCISFN
PVNENFFISGSIDGKVRIWNVFGGQWAYIDCREIVSAVCYRSDGKGAIVGTMTGNCLFYSIK
DNHLQMDAQVYLHGKKKSPGKRITGFQFPPNDPGKLMITSADSVIRVLSGLDVVCKLKGPRN
SGGPMIATFTSDGKHVISASEDSNVYIWNYAGQDKTSSRVKKIWSCESFWSSNASVALPWC
GIRTVPEALAPPSRSEERRASCAENGENHHMLEEYFQKMPPYSPDCFSLSRGFFLELLPKG
SATWPEEKLSDTSPPTVSSQAISKLEYKFLKSACHSVLSSAHMWGLVIVTAGWDGRIRTYHN
YGLPVRS
SEQ ID NO:347
MDIDFKEYRLRCELRGHEDDVRGVCVCGDGSIGTSSRDRTVRLWAPSAGERRKYEVARVLL
GHKSFVGPLAWVPPSEELPEGGIVSGGMDTLVMAWDLRNGEAQTLKGHQLQVTGIVLDGG
DIVSASVDCTLIRWKNGQLTEHWEAHKAPIQAVIRLPSGELVTGSSDTTLKLWRGKTCTQTFV
GHTDTVRGLAVMPDLGILSASHDGSIRLWAVSGECLMEMVDHTSIVYSVDSHASGLIVSGSE
DRFAKIWKDGVCFQSIEHPGCVWDVKFLEDGDIVTACSDGTIRIWTNQEDRMANSTELELFD
LELSSYKRSRKRVGGLKLEELPGLEALQVPGTSDGQTKVIREGDNGVAYAWNSTELKWDKI
GEVVDGPEDSMNRPALDGVQYDYVFDVDIGDGEPTRKLPYNRSDNPYDTADKWLLKENLPL
SYRQQIVEFILANSGQRDFNLDPSFRDPYTGSSAYVPGAPSQLAAKQARPTFKHIPKKGMLV
FDAAQFDGILKKINEFNNTLLSNQEKKNLSLTDIEISRLGAVVKILKDTSHYHSSKFADADFDLM
LKLLESWPYEMMFPVIDIFRMVILHPDGADGLLRHQEDKKDVLMESIKPATGNPSVPANFLTS
IRAVTNLFKNSAYYSWLQKHRSEMLDAFSSCSSSSNKNLQLSYATLLLNYAVLLIEKKDEEGQ
SQVLSAALELAENESLEVDARYRALVAIGSLMLDGLVKRIALDFDVEHIAKAARTSKEAKIAEV
GADIELLIKQS
SEQ ID NO:348
MEFTEAYKQSGPCCFSPNARFIAVAVDYRLVIRDTLSLKVVQLFSCLDKISYIEWALDSEYILC
GLYKRPMIQAWSLIQPEWTCKIDEGPAGIAYARWSPDSRHILTTSDFQLRLTVWSLVNTACV
HVQWPKHASKGVSFTRDGKFAAICTRHDCKDYINLLSCHNWEIMGVFAVDTLDLADIQWSP
DDSAIVIWDSPLEYKVLVYSPDGRCLFKYQAYESGLGVKSVSWSPCGQFLAVGSYDQMLRV
LSHLTWKTFAEFTHLSNVRAPCCAAIFKEVDEPLQIDMSELSLSDDYMQGNSGDAPEGHYRV
RYDVTEVPITLPCQKPPADRPNPKQGIGLMSWSNDSQYICTRNDSMPTILWIWDMRHLELAA
ILVQKDPIRAAVWDPTGTRLVLCTGSSHLYMWTPSGAYCVSVPLSQFNITDLKWNSDGSCLL
LKDKESFCCAAAPLPPDESSDYSSDD
SEQ ID NO:349
MATIAALDDDMVRSMSIGAVFSDFVGKLNSLDFHRKDDILVTAGEDDSVRLYDIANARLLKTT
FHKKHGTDRVCFTHHPNSLICSSTKNLDTGESLRYISMYDNRSLRYFKGHKQRVVSLCMSPI
NDSFMSGSLDHSVRMWDLRVNACQGILRLRGRPTVAYDQQGLVFAVAMEGGAIKLFDSRS
YDKGPFDAFLVGGDTSEVCDIKFSNDGKSVLLSTTNNNIYVLDAYAGDKQCGFNLEPSPSTPI
EASFSPDGQYVVSGSGDGTLHAWNISRRNEVACWNSHIGVASCLKWAPRRAMFVAASTVL
TFWIPNSEPELASAKGEAGVPPEQV
SEQ ID NO:350
MSVAELKERHRAATETVNSLRERLKQKRVQLLDTDVAGYARTQGKTPVTFGATDLVCCRTL
QGHTGKVYSLDWTPERNRIVSVSQDGRFIVWNALTSQKTHAIRLPCAWVMTCAFAPNGQSV
ACGGLDSVCSIFNLNSPVDRDGNLPVSRMLSGHKGYVSSCQYVPDGDAHLITGSGDQTCVL
WDITTGLRTSVFGGEFQSGHTADVLSVSINGSSPRIFVSGSCDSTARMWDTRVASRAVHTY
HGHEGDVNAVKFFPDGNRFGTGSDDGTCRLFDIRTGHELQVYYQQRGIDEIPHVTSIAFSIS
GRLLIAGYSNGDCFVWDTLLAQVVLNLGSLQNSHEGRISCLGVSADGSALCTGSWDTNLKI
WAFGGIRRVT
SEQ ID NO:351
MKKRPRGASLDQAVVDIRRREVGGLSGLSFARRLAASEGLVLRLDIYNKLKGHRGCVNTVG
FNLDGDIVISGSDDRHVKLWDWQTGKVKLSFDSGHLSNVFQAKIMPYTDDRSIVTCAADGQA
RHAQILEGGQVQTMLLAKHRGRAHKLAIDPGSPHIVYTCGEDGLVQRLDLRSNTARELFTCR
EVYGTHVEVVHLNAIAIDPRNPNLFVIGGSDEYARVYDIRNYKWNGSHNFGRSANYFCPSHLI
GEAHVGITGLAFSGQSELLVSYNDESIYLFTQEMGLGPDPLSASTKSVDSNSSEVTSPTAVN
VDDNVTPQVYKGHRNCETVKGVGFFGPKCEYVVSGSDCGRIFIWKKKGGQLIRVMAADKHV
VNCIEPHPHIPALASSGIENDIKIWTPKAIERATLPMNVEQLKPKARGWMNRISSPRQLLLQLY
SLERWPEHGGETSSGLAAGQEELTELFFALSANGNGSPDGGGDPSGPLL
SEQ ID NO:352
MSKRGYKLQEFVAHSSNVNCLSIGKKACRLFLTGGDDCKVNLWAIGKPNSLMSLCGHTNAV
ESVAFDSAEVLVLAGASSGVIKLWDVEEAKMVRGLTGHRSNCTAMEFHPFGEFFASGSTDT
NLKIWDIRKKGCIHTYKGHTRGISTIRFSPDGRWVVSGGNDNVVKVWDLTAGKLLHDFKFHE
NHIRSIDFHPLEFLLATGSADRTVKFWDLETFELIGSSRPEAAGVRAIAFHPDGRTLFCGLEDS
LKVYSWEPVICHDGVDMGWSTLADLCIHDGKLLGCSYYQSSVGWVADASLIEPYGTNVKP
QQKDSGDDEIEHQESRPSAKVGTTIRSTSIMRCASPDYETKDIKNIYVDTASGNPVSSQRVG
TTNFAKVTQPLDFNDTPNLTLRRQGLVTETPDGLSGHVPSKSITQPKVVSRDSPDGKDSSRR
ESITFSRTKPGMLLRPAHSRRPSSTKYDVDRLSACAEIGVLSSAKSGSESLVDSFLNIKVAPE
DGARNGCEDNHSSVKNVSVESEKVLPLQTPKTEKCDQTVGFKEEINSVKFVNGVAVVPGRT
RTLVEKFEKREKLNSTEDQTINTPENPTLDKTPPPSLAENEEKSDRLNIVERKATRMSSHMVT
AEDRTPVTLVGSPEDQSTVMAPQRELPADESSKTPPLPVEDLEIHHGSNVSEDKATILSSQT
VSEEDSKRSTLIRNFRRRDRFKSTEGRSPVMATQRKLPTDESGKTSSLPMEDLEIKGGLNVS
EDKATSFSSRAPPREDRAHSALVRNVRKRDKFKSTNDTITVMVHQRGLSTDEASTVSVERV
ERRQLSNNVENPLNNLPPHSVPPTTTRGEPQYVGSESDSVNHEDVTELLLGNHEVFLSTLR
SRLTKLQVV
SEQ ID NO:353
MSTFLTGTALSNPNPNKSYEVVQPPNDSVSSLSFNPKANFLVATSWDNQVRCWEIVRSGTS
LGTTPKASISHDQPVLCSTWKDDGTTVFSGGCDKQVKWPLSGGQPMTVAMHDAPIKEIS
WIPEMNLLVTGSWDKTLRYWDTRQANPVHIQQLPERCYALTVRHPLMVVGTADRNLIIYNLQ
SPQTEFKRISSPLKYQTRCLAAFPDQQGFLVGSIEGRVGVHHLDDSQQSKNFTFKCHREGS
EIYSVNSLNFHPVHHTFATAGSDGAFNFWDKDSKQRLKAMSRCSQPIPCSTFNNDGSIFAYS
ACYDWSKGAENHNPATAKTYIFLHLPQESEVKGKPRLGTTGRK
SEQ ID NO:354
MEVEAQQRDVNNVMCQLVDPEGTTLGPPMYLPQDVGPQQLQQMVNKLLSNEDKLPYTFYI
SDQELVVPLESYLQKNKVSVEKVLSIVYQPQAIFRIRPVNRCSATIAGHSEAVLSVAFSPDGK
QLASGSGDTTVRLWDLSTQTPMFTCKGHKNWVLSIAWSPDGKHLVSGSKAGEIQCWDPLT
GQPSGNPLVGHKKWITGISWEPVHLSSPCRRFVSSSKDGDARIWDVTLRRCVICLSGHTLAV
TCVKWGGDGVIYTGSQDCTIKVWETSQGKLIRELKGHGHWVNSLALSTEYVLRTGAFDHTG
KQYSSAEEMKQVALERYKKMKGNAPERLVSGSDDFTMFLWEPSVSKHPKTRMTGHQQLVN
HVYFSPDGQWVASASFDKSVKLWNGITGKFVAAFRGHVGPVYQISWSADSRLLLSGSKDST
LKIWDIRTKKLKRDLPGHADEVFAVDWSPDGEKVVSGGKDKVLKLWMG
SEQ ID NO:355
MDAGSAHSSSNMKTQSRSPLQEQFLQRRNSRENLDRFIPNRSAMDFDYAHYMLTEGRKGK
ENPAVSSPSREAYRKQLAETLNMNRTRILAFKNKPPTPVELIPHELTSAQPAKPTKTRRYIPQ
TSERTLDAPDLLDDYYLNLLDWGSSNVLSIALGNTVYLWNASDGSTSELVTIDDETGPVTSVS
WAPDGRHIAVGLNNSDVQLWDSADNRLLRTLRGGHRSRVGSLAWNNHILTTGGMDGLIVN
NDVRVRSHIVDTYRGHTQEVCGLKWSASGQQLASGGNDNILHIWDRSTASSNSPTQWLHR
LEEHTAAVKALAWCPFQGNLLASGGGGGDRTIKFWNTHTGACLNSVDTGSQVCALLWNKN
ERELLSSHGFTQNQLTLWKYPSMVKIAELTGHTSRVLFMAQSPDGCTVASAAGDETLRFWN
VFGVPEVAKPAPKANPEPFAHLNRIR
SEQ ID NO:356
MEEAIPFKNLPSREYQGHKKKVHSVAWNCTGTKLASGSVDQTARVWHIEPHGHGKVKDIEL
KGHTDSVDQLCWDPKHADLIATASGDKTVRLWDARSGKCSQQAELSGENINITYKPDGTHV
AVGNRDDELTILDVRKFKPIHKRKFNYEVNEIAWNMSGEMFFLTTGNGTVEVLAYPSLRPVD
TLMAHTAGCYCIAIDPVGRYFAVGSADSLVSLWDISEMLCVRTFTKLEWPVRTISFNHTGDYV
ASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKYNLLAYAGDDKNKYQADEGVFRIFG
FESA
SEQ ID NO:357
MGKDEEEMRGEIEERLINEEYKVWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYS
VQKLVLGTHTSENEPNYLMLAQVQLPLEDAENDARHYDDDRADVGGFGCANGKVQIIQQIN
HDGEVNRARYMPQNSFIIATKTVSAEVYVFDYSKHPSKPPLDGACSPDLRLRGHSTEGYGLS
WSKFKQGHLLSGSDDAQICLWDINATPKNKSLDAMQIFKVHEGVVEDVAWHLRHEYLFGSV
GDDQYLLIWDLRTPSVTKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKIST
ALHTFDAHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIH
GGHTSKISDFSWNTCEDWVASVAEDNILQIWQMAENIYHDEDDVPGEESNKGS
SEQ ID NO:358
MMRGFSCTEDGDAPSTSSTSPPPPPPPPHRQQMQAPRASSSSSGQPTSRRSTGNVFKLLA
RREVSPRSKHSLKKFWGEASECQLCPFQQSYEAVRDVRRSLISWVEAFSLQHLSAKYCPLM
PPPRSTIAAAFSPDGKILASTHGDHTVKLIDSQTGSCLKVLRGHRRTPWVVRFHPLYPEILAS
GSLDHEVHLWDANTAECIGSRNFYRPIASIAFHAQGDLLAVASGHKLYIWHYNRSGETSSPTI
VLRTPRSLRAVHFHPHAAPFLLTAEVNDLDLTDSAMTLATSPGYLHYPPPTIYLADAHSNERS
RLEDELPLMPSPLLMWPSFTRDDGRATLPHIGGDVGLSGQQRVDSLSSGQYEFHPSPIEPS
SGTSMHEEMGTDPFSSVRESEVTQSAMNIVDNTEVQPEERSTYSFSFSDPRFWELPSVYG
WLVGQTQAAPRTAPSPGALETASALGEVASVSPVRSEFMPGGMDQPRLGGRSGSGCRSS
GSRMMRTAGLNDHPHDENYPQSVVSKLRSELEASLAAAASTELPCTVKLRVWPYDMKDPC
ALFRSESCRLTIPHAVLCSEMGAHFSPCGRFFAACVACVLPQLEADPVLHGQVDPDVTGVAT
SPTRHPVSAYQIMYELRIYSLEEATFGMVLASRSIRAAHCLTSIQFSPTSEHLLLAYGRRHNSL
LKSIVIDGENTVPIYSILEVYRVSDMELVRVLPSAEDEVNVACFHPSVGGGLVYGTKEGKLRIL
QIDSSGGLNPKSTGFLDENMAEVPTYALEC
SEQ ID NO:359
MGEGDLPRTEAGVLRGHEGAVLAARFNGDGNYCLSCGKDRTIRLWNPHRGIHIKTYKSHGR
EVRDVHCTSDNSKLISCGGDRQIFYWDVSTGRVIRRFRGHDSEVNAVKFNDYASVVVSAGY
DRSVRAWDCRSHSTEPIQIINTFQDSVMSVCLTKTEIIGGSVDGTVRTFDIRIGREISDDLGQP
VNCISMSNDGNCILASCLDSTLRLVDRSAGELLQEYKGHTCKSYKLDCCLTNTDAHVAGGSE
DGYVFFWDLVDASVISKFRAHSSVTSVSYHPKEDCMITASVDGTIKVWKT
SEQ ID NO:360
MACIKGVGRSASVAMAPDGGYLATGTMAGTVDLSFSSSASLEIFGLDFQSDDRDLPLIAESP
SSERFNRLSWGKNGSGSDEFSLGLIAGGLVDGTIGLWNPLSLIRSEAGDKAIVGHLSRHKGP
VRGLEFNVIAPNLLASGADDGEICIWDLAAPREPSHFPPLRGSGSAAQGEISFLSWNSKVQHI
LASTSYNGTTVVWDLKKQKPVISFSDSVRRRCSVLQWNPDLATQLVVASDEDSSPTLRLWD
MRNIMSPVKEFAGHTRGVIAMSWCPNDSSYLVTCAKDNRTICWDTVTGEIVCELPAGSNWN
FDVHWYPKIPGVISASSFDGKIGIYNVEGCSRYGVRENEFGAATLRAPKWFKRPVGASFGFG
GKVVSFHTRSTGGPSVNSSEVFVHDIITEQTLVSRSSEFEAAIQSGDRPSLRALCEKKSQHC
ESTDDQETWGFLKVLLEDDGTARSKLLAHLGFDIPTETNDGSQEDLSQQVNALGLEDVTADK
VVQEDNNESMVFPTDNGEDFFNNLPSPRADTPVSTSADGFPTVNAAVEPSQDEVDGLEESS
DPSFDDSVQRALVVGDYKAAVALCMSANKLADALVIAHVGGASLWESTRDKYLKMSRLPYL
KVVFAMVNNDLQSLVDTRPLKFWKETLAILCSFAQGEEWAMLCNSLASKLMAAGNMLAATL
CFICAGNIDKTVEIWSRSLATEHDGMSYMDLLQDLMEKTIVLALASGQKQFSASVCKLVEKYA
EILASQGLLTTAMDYLKLLGTDDLSPELAVLRDRIAFSVEAEKGANISAFNGSQDPRGAVYGV
DQSNYGMVDTSQHYYPEAAQPQVPHTVPGSPYGENYQQPFGSSFGKGYNTPMQYQAPSQ
ASMFVPSEPPQNAQPSFVPTPVTSQPTTRSQFIPAPPLALRNPEQYQQPTLGSHLYPGSVN
PTFQPLPHAPGPVAPVPPQVSSVPGQNMPQAVAPTQMRGFMPVTNPGVVQNPGPISMQPA
TPIESAAAQPVVSPAAPPPTVQTADTSNVPAPQKPVIATLTRLYNETSEALGGSRANPAKKRE
IEDNSRKIGALFAKLNSGDISKNAADKLVQLCQALDNGDYSTALQIQVLLTTSWDECNFWLA
TLKRMIKTRQNVRLS
SEQ ID NO:361
MKERGKGAGRSVDERYTQWKSLVPVLYDWLANHNLVWPSLSCRWGPQLEQATYKNRQRL
YLSEQTDGSVPNTLVIANVEVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRIRELPQ
NSKIVATHTDSPDVLIWDVETQPNRHAVLGASTSRPDELTGHKDNAEFALAMSPTEPFVLSG
GKDRYVVLWSIQDHISTLAADPGSAKSPGSAGTNNKQSSKAAGGNDKTGDSPSIEPRGVYL
GHGDTVEDVTFCPSSAQEFCSVGDDSCLILWDARTGSSPAIKVEKAHHADLHCVDWNPHDV
NLILTGSADNTVRMFDRRNLTSGGVGSPVHTFEGHNAAVLCVQWSPDKSSVFGSSAEDGIL
NIWDHEKIGRKIETVGSKVPNSPPGLFFRHAGHRDKVVDFHWNSSDPWTIVSVSDDGESTG
GGGTLQIWRMIDLIYRPEEEVLAELDKFKSHILSCTS
SEQ ID NO:362
MAKIAPGCEPVAGTLTPSKKREYRVTNRLQEGKRPLYAVVFNFIDSRYFNVFATVGGNRVTV
YQCLEGGVIAVLQSYIDEDKDESFYTVSWACNIDRTPFVVAGGINGIIRVIDAGNEKIHRSFVG
HGDSINEIRTQPLNPSLIVSASKDESVRLWNVHTGICILIFAGAGGHRNEVLSVDFHPSDKYRI
ASCGMDNTVKIWSMKEFWTYVEKSFTWTDLPSKFPTKYVQFPVFIAPVHSNYVDCNRWLGD
FVLSKSVDNEIVLWEPKMKEQSPGEGSVDILQKYPVPECDIWFIKFSCDFHYHSIAIGNREGKI
YVWELQSSPPVLIAKLSHPQSKSPIRQTAMSFDGSTILSCCEDGTIWRWDAITASTS
SEQ ID NO:363
MNTAMHFGAGWRSIAEMGYTMSRLEIEPESCEDEKSLDGVGNSQGPNELPRCLDHELAHLT
NLKSRPHEHLIRDFPGRRALPVSTVKMLAGRECNYSRRGRFSSADCCHMLSRYVPVNGPSP
LDQMNSRAYVSQFSADGSLFVAGFQGSHIRIYNVDKGWKCQKNILTKSLRWTITDTSLSPDQ
RYLVYASMSPIVHIVDIGSAAMDSLANITEIHEGLDFSADSGPYSFGIFSVKFSTDGREVVAGS
SDDSIYVYDLVANKLSLRIPAHESDVNTVCFADESGHIIYSGSDDTYCKVWDRRCLSARNKPA
GVLMGHLEGITFIDSRGDGRYFISNGKDQTIKLWDIRKMGSDICRRGFRNFEWDYRWMDYP
PRARDSKHPFDLSVATYKGHSVLRTLIRCYFSPVHSTGQKYIYTGSHDSCVYIYDVVTGAQV
AALKHHKSPVRDCSWHPEYPMIVSSSWDGDIVKWEFFGNGETEIPAMKKRIRRRHLY
SEQ ID NO:364
MEPQPQAPKKRGRKPKPKEDKKEEQLHQPPPPPPPQQQAAPAPAPAATRSSTSGSAGGR
DRRPQQQHAVDEKYARWKSLVPVLYDWLANHNLLWPSLSCRWGPQLEQATYKNRQRLYIS
EQTDGSVPNTLVIANCEVVKPRVAAAEHVSQFNEEARSPRRKYKTIIHPGEVNRVRELPQNP
NIVATHTDSPDVLIWDVESQPNRHAVYGATASRPNLLILTGHQENAEFALAMCPAEPFVLSGG
KDKTVVLWSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPSVGPRGIYCGHEDT
VEDVAFCPSTAQEFCSVGDDSCLILWDARVGTNPVAKVEKAHNGDLHCVDWNPHDNNELT
GSADNSVNMFDRRNLTSNGVGSPVYKFEGHKAAVLCVQWSPDKPSVFGSSAEDGLLNIWD
YERVDKKVDRAPNAPAGLFFQHAGHRDKIVDFHWNAADPWTMVSVSDDCDTAGGGGTLQI
WRMSDLIYRPEEEVLAELENFKAHVLECSKA
SEQ ID NO:365
MGIFEPYRAVGYITTGVPFSVQRLGTETFVTVSVGKAFQVYNCAKLSLVLVGPQLPKKIRALA
SYREYTFAAYGSDIGIFKRAHQLATWSGHTAKVCLLLLFGEHILSVDVDGNAYIWAFKGMNY
NLSPVGHILLDSNFTPSCIMHPDTYLNKVILGSQEGPLQLWNISTKTKLYEFKGWNSSVSSCV
SSPALDVVAVGCADGKIHVHNIRYDEELVTFSHSMRGSVTALSFSTDGQPLLASGSSSGVVS
IWNLDKRRLQSVIRDAHDGSIISLHFFANEPVLMSSSADNSIKMWIFDTSDGDPRLLRFRSGH
SAPPLCIRFYANGRHILSAGQDRAFRLFSVVQDQQSRELSQRHVSKRAKKLKLKEEEIKLKPV
IAFDVAEIRERDWCNVVTSHMDTPQAYVWRLQNFVIGEHILRPCPNKPTPVKACMISACGNF
AILGTAGGWIERFNLQSGISRGSYIDQLEGTNSAHDGEVVGVACDATNTLMISAGYAGDIKV
WDFKGRELKSRWEIGSSLVKISYHRLNGLLATVADDFIIRLFDAVALRMVRKFEGHTDRITDL
CFSEDGKWLLSSSMDGSLRIWDIILARQVDAVFVDVSITALSLSPNMDILATTHVDQNGVFLW
VNQSMFS1GDSDINLYASGKEVVTVKLPSVSSVEGSQVEESNEPTIRHSESKDVPSFRPSLEQ
IPDLVTLSLLPKSQWQSLINLDIIKVRNKPVEPPKKPEKAPFFLPSIPSLSGEILFKPSEMSDKG
DMKADEDKSKITPEVPSSRFLQLLHSCSEAKNFSPFTTYIKGLSPSTLDLELRMLQIIDDDAVD
ADADDPQDVDKRQELLSIELLMDYFIHEISCRSNFEFVQALVRLFLKIHGETIRRQSVLQNKAK
VLLETQCSVWQRVDKLFQGARCMVAFLSNSQF
SEQ ID NO:366
MEETKVTCGSWIRRPENVNLAVLGRSPRRRGSAALEIFAFDPKSTSLSSSPLVAHVIEEIEGD
FLAIAVHPNGEDIVCFASSGSCLSFELSGQESNLKLLTKELPPLRGIGPQKCMAFSVDGSRFA
TGGVDGRLRILEWPSLRIILDEPKAHKSIRDLDFSLDSEFLATTSTDGSARIWKAEDGLPCTTL
TRRSDEKIELCRFSKDGTKPFLFCTVQRGDKAVTGVWDISTWNKIGHKRLLRKPAVVMSISL
DGKYLAQGSKDGDMCVVEVKKMEVSHWSKRLHLGTSLTSLEFCPIERVVITTSDEWGVLVT
KLNVPADWKAWQVYLLLLGLFLASLVAFYIFYENSDSFWGFPLGKDQPARPKIGSVLGDPKS
ADDQNMWGEFGPLDM
SEQ ID NO:367
MADPVEHQHQQHQQHQLQQQRRRGWRIQGGQYLGEISALCFLHLPPPPLSLSSSPVLSLS
SGLDSESRDRPACSFRFPSAGSGSQVSLFDLASGAMVRTFYVFRGIRVHGIVLGCADFPGG
SSSSSSTLDYVIAVYGERRVKLFRLSVRLGRGAGEGSGTVLSADLELVSAAPRLSHWVMDV
RFLKENGTSEDELQRCLTVAIGCSDNSIRLWDVDKCSFVLAVSSPERCLLYSMRLWGDNLED
LQVASGTIYNEILIWKVVPNHDAPSSNELTEEGLTNSCAGNSVHECLRYEAYHICRLVGHEGS
IFRIAWSSDGSKLVSVSDDRSARIWEVHCKVQYSEDAGEVGLLFGHSARVWDCYISDNLIVT
AGEDCSCRVWGLDGQQHDVIKEHIGRGIWRCLYDPWSSLLVTGGFDSAIKVHKLDASLAEA
SAKQSNIKDLSDGTELFTTHLPNSSGHSGHMDSKSEYVRCLSFSCEDVMYIATNHGYLYHAK
LCNDGDLRWTELAQVSNEVQIICMELLPSNPYDPRIDADDWVAVGDGKGWTTVVRVVVKNSD
SPKVSTSFSWAAEMDRQLLGIHWCKSLGHRFIFTADPRGALKLWRFFEVSQSSSLYPENSP
RISLIAEFKSDLGARIMCLDVAFESELLICGDLRGNLVLFPLLKDLLLDTFVVSAAKISPVNHFK
GAHGISAVSSISVAHMSFNHIELRSTGADGCICYMEYDKGLQSLNFVGMKQVKELSMIESVST
ENESTGYRTSGSYASGFASTDFIIWNLVTEAKVLQVSCGGWRRPHSYYLGDVPEMKNCFAY
VKDDIIYIRRHWIKDSKDKILPQNLRLQFHGREVHSLCFVTGDFQLRKNKQSSWIVTGCEDGT
VRLTRYTQCTDNWSSSKLLGEHVGGSAVRSICCVSNIHTTSSGTSVSDVKGIENLPKDIKGTL
MEDECNPSLLISVGAKRVLTSWLLRRRKQDGKEDDVTDLQEAENSSLPSSAGSSTFSFQWL
STDMPVKYSVPSKKSGSIKKLIGVSDTNVRCKSLLPDSEALQSKVSAVDKNEDDWRYLAVTA
FLVRHSGSRLIVCFIIVACSDATLAIRALVLPYRLWFDVALMVPLSSPVLSLQHVIIGRCQLPDE
NVQIGNVYVVISGATDGSIAFWDLTESVEAFMRRLSNIHLEKFMDCQKRPRTGRGSQGGRW
WRSLSKIACKEQPINDPVTAKAIKELNRKLTGGVACGSSSSMLDASPELDSNAANSSFEIIEV
NPFHVLNGVHQSGVNCLHVCETKHGQSSDGRFLYQLVSGGDDQALHLLKFEVLVQPPVQV
PDVPNSDIRNSILVEEFLLDEQNQKTKCTIEFISQEKIASAHNSAVKGVWTDGTWVFSTGLDQ
RVRCWISKDRGTPTELAHFIISVPEPEALDARSICWDQYQIAVAGRGMQMIEFHVPSSEIR
SEQ ID NO:368
MPYKLSATLSNHSSDVRAVASPSDDLILSASRDSTAISWFRQSPSSFTPASVIRAGSRFVNAI
AYLPPTPRAPQGYAVVGGQDTVVNVFALGPGDKEEPEYTLVGHTDNVCALSVNSDDTIISGS
WDKTAKVWKDFALVYDLKGHQQSVWAVLAMNEKEFLTASADRTIKYWVQHKTMQTYEGHR
DAVRGLALIPDIGFASCSNDSEIRVWTMGGDVVYTLSGHTSFVYSLSVLPNGDLVSAGEDRS
VRVWRDGECSQVIVHPAISVWAVSTMPNGDIISGSSDGVVRVFSESEKRWATASELKALED
QIASQSLPSQQVGDVKKTDLPGPEALSVPGKKAGEVKMIRSGDVVEAHQWDSLASSWQKIG
EVVDAIGSGRKQLHDGKEYDYVFDVDIQEGAPPLKLPYNVSENPYTAAQRFLEQNDLPTGYL
DQVVKFIEQNTAGVKLGNDGYVDPFTGASRYQPATQSTSNTASSSYMDPFTGGSRHIAESA
PSNVPQGSHATGIIPFSKPIFFKLANVSAMQAKMFQFDEVLRNEISTATLAMRPDEVIMVNET
FTYLSKVVTSTSSARTSLGWIHIETIMQILDRWPVPQRFPVIDLGRLVTAYCMNAFSGPGDLE
KFFCLFRTSEWTSITSGSKALTKAQETNVLLLFRTIANSLDGAPLNDMEWIKQIFRELAQTPQ
LVLNKSHRLALASVLFNFSCIGLKGPVPADVRTLHLTIILQVLRSPNDDPEVAYRTCVALGNML
YSDKTRGTPRDAQSPSPTELKSAVAAIKGGFSDPRINDVHREIMSLI
SEQ ID NO:369
MPPQKIESGHKDTVHDLAMDYYGKRLATASSDHTINVVGVSSSGSQHLATLIGHQGPVWQIS
WAHPKFGSLLASCSYDGRVIIWREGNPNEWTQAQVFEEHKSSVNSVAWAPHELGLCLACG
SSDGNISVFTARQDGGWDTSRIDQAHPVGVTSVSWAPSTAPGALVGSGMMEPVQKLCSGG
CDNTVKVWKLYNRVWKLDCFPVLQMHTDWVRDVAWAPNLGLPKSTIASASQDGRVIIWTLA
KEGDQWQGKVLYDFRTPVWRVSWSLTGNILAVADGNNNVSLWNEAVDGEWIQVSTVEP
SEQ ID NO:370
MSAPMLEIEARDVVKIVLQFCKENSLHQTFQTLQSECQVSLNTVDSIETFVADINSGRWDAIL
PQVAQLKLPRNTLEDLYEQIVLEMIELRELDTARAILRQTQAMGVMKQEQPERYLRLEHLLVR
TYFDPNEAYQDSTKEKRRAQIAQALAAEVTVVPPSRLMALVGQALKWQQHQGLLPPGTQFD
LFRGTAAMKQDVDDMYPTTLSHTIKFGTKSHAECARFSPDGQFLVSCSVDGFIEVWDYMSG
KLKKDLQYQADETFMMHDDPVLCVDFSRDSEMLASGSQDGKIKVWRIRTGQCLRRLERAHS
QGVTSVLFSRDGSQLLSTSFDGSARIHGLKSGKQLKEFRGHSSYVNDAIFSNDGSRVITASS
DCTVKVWDVKTSDCLQTFKPPPPLRGGDASVNSVHLFPKNADHIVVCNKTSSIYIMTLQGQV
VKSLSSGKREGGDFVAACVSPKGEWIYCVGEDRNLYCFSCQSGLEHLMKVHEKDVIGVTH
HPHRNLVATYSEDSTMKLWKP
SEQ ID NO:371
MDLLQSYAEDNDGDLGRHSSPEPSPPRLLPSKSAAPKVDDTTLALTVAQTNQTLARPIDPSQ
HAVAFNPTYDQLWAPICGPAHPYAKDGIAQGMRNHKLGFVEDAAIGSFLFDEQYNTFQRYG
YAADPCASTGNEYVGDLDALKQNDGISVYNIRQQEQKKYAEEYAKKKGEERGEGGREKAEV
VSDKSTFHGKEERDYQGRSWIAPPKDAKATNDHCYIPKRLVHTWSGHTKGVSAIRFFPKHG
HLILSAGMDTKVKIWDVFNSGKCMRTYMGHSKAVRDISFCNDGTKFLTAGYDKNIKYWDTET
GKVISTFSTGKIPYVVKLHPDDEKQNILLAGMSDKKIVQWDMNTGQITQEYDQHLGAVNTITF
VDDNRRFVTSSDDKSLRVWEFGIPVVIKYISEPHMHSMPSISLHPNTNWLAAQSLDNQILIYS
TRERFQLNKKKRFAGHIVAGYACQVNFSPDGRFVMSGDGEGRCWFWDWKSCKVFRTLKC
HEGVCIGCEWHPLEQSKVATCGWDGLIKYWD
SEQ ID NO:372
MESNGNLEQTLQDGRIYRQLNSLIVAHLRDHNFPQAASAVALATMTPLNVEAPRNRLLELVA
KGLAVEKGELLRGVSHAGTNDLGGSIPASYGLVPAPWTAIDFSSLRDTKGMSKSFTKHETRH
LSDHKNVARCARFSTDGRFFATGSADTSIKLFEVSKIKQMMLPDSTDGAIRAVIRTFYDHTHP
VNDLDFHPQNTVLISAAKDHTVKFFDYSKATAKRAFRVIQDTHNVRSVAFHPSGDFLLAGTD
HPIPHLYDVNTFQCYLSANVPEFAVNAAINQVRYSSSGGMYVTASKDGTIRFWDGASANCV
RSIAGAHGAAEVTSANFTKDQRYVLSCGLDSTVKLWEVGTGRLVKQYLGATHMQLRCQAVF
NNTEEFVLSIDEPSNEIVVWDAMTAEKVARWPSNHNGPPRWIEHSPTEAAFVSCGTDRSIRF
WKETH
SEQ ID NO:373
MSNFQGEDGEYVADDFEAEDGDEELHGRESADPESDVDEIDTPSNRFTDTTADQARRGRDI
QGIPWERLSITREKYRRTRLEQYKNYENVPQSGEKSGKDCTVTEKGNSFYEFRRNSRSVKS
TILHFQLRNLVWATSKHDVYLMSNYSVVHWSSLTGKKSEVLNLAGHVAPNEKHPGSLLEGFT
QTQVSTLAVKDRFLVAGGFQGELICKFLDRPGISFCSRTTYDDNAITNAVEIYVSPSGGIHFIA
SNNDCGVRDFDMENFELSKHFRFPWPVNHTSLSPDGKLLVIVGDDPEGILVDAKTGKTIMPL
RGHLDFSFASEWHPDGVTFATGNQDKTCRIWDIRNLSKSIAVLKGNLGAIRSIRYTSDGRYM
AIAEPADFVHVYDTKTGYKKEQEIDFFGEISGMSFSPDTESLFIGVWDRTYGSLLEYGRRRNF
SYLDCLV
SEQ ID NO:374
MGVEEDLEDLNALAESTDAAVDGQAALASAVDSVTLQPAPPILPPVIPPPAVPVVAPVPTIPP
VLRPLAPLPIRPPVLRPPAPKRDEAGSSDSDSDHDGTAAGSTAEYEITEESRLVRERHEKAM
QDLMMKRRGAALAVPTNDKAVRARLRRLGEPMTLFGEREMERRDRLRMLMAKLDAEGQLE
KLMKAHEDEEAAASAAPEDVEEEMLQYPFYTEGSKALFNARIDIAKFSITRAALRLERARRRR
DDPDEDVDAEIDWALKKAESLSLHCSEIGDDRPLSGCSFSHDGKLLATCSMSGVAKLWDTC
RMPQVNRVLTLKGHTERATDVAFSPVQNHIATASADRTAKLWNTEGTILKTFEGHLDRLGRI
AFHPSGKYLGTTSFDKTWRLWDIESGEELLLQEGHSRSIYGIDFHRDGSLVASCGLDALARV
WDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYTIPAHANLISE
VKFEPQEGYFLVTASYDTTAKVWSARDFKPVKTLSVHEAKITSVDITADASHIVTVSHDRTIKL
WTSNDDVKEQAMDVD
SEQ ID NO:375
MVKAYLRYEPAAAFGVIASVESNIAYDASGKHLLAPALEKVGVWHVRQGVCTKALAPSASSA
AGPSLAVTAIASSPSSLIASGYADGSIRIWDFEKGSCETTLNGHKGAVSVLRYGKLGSLLASG
SKDNDIILWDVVGETGLYRLRGHRDQVTDLVFLDSDKKLVSSSKDKYLRVWDLETQHCMQIV
GGHHSEIWSLDTDPEERYLVTGSADPELRFYTVKNDSSDERSEADASGGVGNGDLASHNK
WDVLKQFGEIQRQSKDRVATVRFNKNGNLLACQAAGKLVEVFRVLDEAEAKRKAKRRLHRK
REKKGADVNENGDSSRGIGEGHDTMVTVADVFKLLQTIRASKKICSISFCPVAPKSSLATLAL
SLNNNLLEFHSIEADKTSKMLTIELQGHRSDVRSVTLSSDNTLLMSTSHNSVKIWNPSTGSCL
RTIDSGYGLCGLIVPQNKHALIGTKDGAIEIFDVGSGTCIEVVEAHGGSIRSIVAIPNQNGFVTG
SADHDIKFWEYGMKQKPGDNSKHLTVSNVRTLKMNDDVLWAVSPDAQKIAVALLDCTVKV
FFMDSLKLMHSLYGHRLPVLCLDISSDGDLIVTGSADKNLMIWGLDFGDRHKSIFAHGDSIMA
VQFVGNTHYMFSVGKDRLVKYWDADKFELLLTLEGHHADIWCLAISNRGDFLVTGSHDRSIR
RWDRTEEPFFIEEEKEKRLEEMFESDLDNAFGNKYVPKEEIPEEGAVALAGKKTQETLSATD
SIIEALDIAEVELKRIAEHEEEKNNGKTAEFHPNYVMLGLSPSDFILRALSNVQTNDLEQTLLAL
PFSDALKLLSYLKDWTTYPDKVELVSRIATVLLQTHYNQLVSTPAARPLLTTLKDILHKKVKEC
KDTIGFNLAAMDHLKQLMALRSDALFQDAKVKLLEIRSQLSKRLEERTDPREAKRRKKKQKK
STNMHAWP
SEQ ID NO:376
MGGVQAEREDKDKVSLELTEEILQSMEVGMTFRDYSGRISSMDFHRASSYLVTASDDESIRL
YDVASATCLKTINSKKYGVDLVSFTSHPMTVIYSSKNGWDESLRLLSLHDNKYLRYFKGHHD
RVVSLSLCPRNECFISGSLDRTVLLWDQRAEKCQGLLRVQGRPATAYDDPGLVFAIAFGGCV
RMFDARKYEKGPFEIFSVGGDVSDANVVKFSNDGRLMLLTTTDGHIHVLDSFRGTLLYTFNV
KPTSSKSTLEASFSPEGMFVISGSGDGSVYAWSVRGGKEVASWLSTDTEPPVIKWAPGNLM
FATGSSELSFWIPDLSKLGAYVGRK
SEQ ID NO:377
MAAFGAAPAGNHNPNKSSEVIQPPSDSVSSLCFSPRANHLVATSWDNQVRCWELTKNGAS
VTSVPKASMSHDQPVLCSAWKDDGTTVFSGGCDKQAKMWSLMSGGQPVTVAMHDAPIKEI
AWIPEMNVLVTGSWDKTLKKYWDTRQSNPVHTQQLPERCYAMTVRYPLMVVGTADRNLIVF
NLQNPQAEFKRFSSPLKYQTRCVAAFPDQQGFLVGSIEGRVGVHHLDDSQISKNFTFKCHR
DNNDIYSVNSLNFHPVHHTFATAGSDGTFNFWDKDSKQRLKAMSRCSQPIPCSTFNNDGTIY
AYSVCYDWSKGAENHNPATAKTYIFLHLPQESEVKAKPRVGTTNRK
SEQ ID NO:378
MNCSISGEVPEEPVVSTKSGHVFERRLIERYVSDYGKCPVSGEPLTMDDVLPVKMGKIVKPR
PLQAASIPGLLSIFQNEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKKERDEA
RSLLALAERQIPMTASSDIAVNAPAMSNGRKASLDEEPGYAGKKMRPGISASIIAEITDCNLAL
SQQRKKRQIPSTLAPVEDLERYTQLSSYPLHKTGKPGITSLDICHSKDIIATGGIDTSAVLFDRS
SGQIMSTLSGHSKKVTSVNFDAQGDMVLTGSADKTVRIWQGSEDGSYNCRHILKDHTAEVQ
AITVHATNNYFATASLDNTWCFYEFSTGLCLTQVEGASGSEGYTSAAFHPDGLILGTGTSNA
DVKIWDVKTQANVTTFSGHTGAITAISFSENGYFLATAAQDGVKLWDLRKLKNFRTFSAYDK
DTGTNSVEFDHSGCYLGLAGSDIRVYQVASVKSEWNCVKTFPDLSGTGKVTCVKFGPDSKY
IAVGSMDHNLRIFGLPSEDGAMES
SEQ ID NO:379
MAAPGVETLKKEIKELKEKIAQHRLDTDGEQPLPAAAKSKSVPEVSAALKQRRILKGHFGKIY
ALHWSADSRHLVSASQDGKLIIWNGFTTNKVHAIPLRSSWVMTCAYSPSGNLVACGGLDNL
CSVYKVPHGGNKESSSAQKTYGELAQHEGYLSCCRFIKDNEIVTSSGDSTCILWDVETKTPK
AIFNDHTGDVMSLAVFDDKGVFVSGSCDATAKLWDHRVHKQCVMTFQGHESDINSVQFFPD
GDAFGTGSDDSSCRLFDIPAYQQINKYSSDKILCGITSVAFSKTGKSLFAGYDDYNTYYWDTL
SGNQVEVLTGHENRVSCLGVSEDGKALATGSWDTLLKIWA
SEQ ID NO:380
MGGVEDESEPASKRMKLSSRVLRGLANGSSRTEPAAGSSLDLMARPLPIEGDEEVIGSKGVI
KRVEFVRLIAKALYSLGYEKSGARLEEESGIPLQSSVVNLFMQQISDGLWDESVVTLHKIGLS
DENLVKSASFLILEQKFLELLDQEKAMDNKTLRTEITPLCIKNSRVRELSSCIISPSSCGLLNQ
NKRNSTRARSRSELLEELQKLLPPAVIIPERRLEHLVEQALVLQTDACMLHNSIDMEMSLYTD
HQCGKEHIPCRTLQILQSHNDEVWLVQFSHNGKYLASASNDRSAIIWEVDENGSVSLKHKLT
GHQKPISSVCWSPDDRQLLTCGVGETVRRWDVSSGECLRVYEKAGHGUSCAWFPDGKWI
CYGVSDRSICMCDLEGKEIECWKGQRTLSISDLEITSDGKQIISICRETAILLLDREAKYERMIE
ENQTITSFSLSKDNRYLLVNLLNQEIHLWDIKGDFRLVAKYKGLKRSRFVIRSCFGGLKQAFV
ASGSEDSQVYIWHKGSGELIEPLPGHSGAVNCVSWNPANHHMLASASDDRTIRIWGLNELN
TRHKGARPNGVHYCNGNGTS
SEQ ID NO:381
MTQLAETYACMPSTERGRGIMAGNPKPGSNSVLYTNGRSVVILNLDNPLDISVYAEHAYPAT
VARFSPNGEWVASADSSGAVRIWGAYNDHVLKKEFKVLSGRIDDLQWSPDGLRIVASGDGK
GKSLVRAFMWDSGTNVGEFDGHSRRVLSCAFKPTRPFRIVTCGEDFLVNFYEGPPFKFKLS
RRDHSNFVNCLRFSPDGNRFISVSSDKKGIIYDGKTGEKIGELSSDGGHTGSIYAVSWSPDS
KQVITVSADKSAKIWDISEDGSGNLRKTLTSSGSGGVDDMLVGCLWQNNHLVTVSLGGTISI
YTAGDLDKAPVSFSGHMKNVSSLSVLKGDPKVILSSSYDGLIIKWIQGIGFSGRVQRKESTQI
KCLAAVDEEIVTSGYDNKVCRVSGSGDAEFIDIGCQPKDLSLALQCPEFALVSTDTGVVLLRG
AKIVSTINLGFAVTASTVAPDGTEAIIGAQDGKLRIYSISGDTLTEEAVLEKHRGAISVIHYSPDL
SMFASGDLNREAVVWDRASREVRLKNILYHTARINCLAWSPDSSTVATGSLDTCVIIYEVDKP
ASNRLTIKGAHLGGVYGLAFTDDFSVVSSGEDACIRVWKINRQ
SEQ ID NO:382
MKVKVISRSTDEFTRERSQDLQRVFRNFDPNLRTQEKAVEYVRALNAAKLDKVFARPFVGA
MDGHVDSVSCMAKNPNYLKGIFSGSMDGDIRLWDIASRRTVCQFPGHQGPVRGLAASTDG
QILVSCGIDSTVRLWNVPVATLGESDGTHENLAKPLAVYVWKNAFWAVDHQWDGELFATAG
AQVDIWNQNRSQPISSFEWGTDTVISVRFNPGEPNVLATSGSDRSITLYDLRMSSPTRKVIM
RTKTNAISWNPMEPMNFTAANEDCNCYSYDARKLEEAKCVHKDHVSAVMDIDYSPTGREFV
TGSYDRTVRIFQYNGGHSREVYHTKRMQRVFCVKFSCDASYVISGSDDTNLRLWKAKASEQ
LGVVLPRERRKHEYHEAVKSRYKHLPEVKRIVRHRHLPKPIYKAGILRRTVNFADRRKEERR
KAHSAPGSSSAEPLRKRRIIKEIE
SEQ ID NO:383
MVPSIKNPKKAKRKNKGSKNGDGSSSSSSIPSMPTKVWQPGVDKLEEGEELQCDPSAYNSL
HAFHIGWPCLSFDIVRDTLGLVRTEFPHQVYFVAGTQAEKPTWNSIGIFKVSNITGKRRELVP
SKPTDDADEESDSSDSDEDSDDEVGGSGTPILQLRKVGHEGCVNRIRAMNQNPHICASWG
DSGHVQIWDFSSHLNALAESEADVSQGASSVFNQAPLVKFGGHKDEGYALDWSPLVPGRL
VSGDCKNSIHLWEPTSGSTWNVDSTPFIGHAASVEDLQWSPTEENVFASCSVDGTIAIWDTR
LGKTPAASFKAHDADVNVISWNRLATCMLASGCDDGTFSIHDLRLLKEGDSVVAHFEYHKHP
VTSIEWSPHEASTLAVSSADCQLTIWDLSLEKDEEEEAEFKAKTKEQVNAPEDLPPQLLFVH
QGQKDLKELHWHAQIPGMIVSTAADGFNILMPSNIQSTLPSDGA
SEQ ID NO:384
MERYKVIKELGDGTYGSVWKALNQQTHEIVAIKKMKRKYYIWEECINLREVKSLRKLNHPNIIK
LKEVIRENNELFFIFEYMECNLYQIMKERSTPFSETAIIKFCYQILQGLSYMHRNGYFHRDLKP
ENLLVTSDLIKIADFGLAREVLTSPPYTDYVSTRWYRAPEVLLQSPTYTTAIDMWAVGAILAEL
FTLHPLFPGESELDEIYKICGVLGTPDYETWPDGMQLAAFRNFIFPQFLPVNLSVLIPHASPEA
IDLITRLCSWDPQKRPTAEQALHHPFFRIGMSIPLSLGGHFQDNTCAAEVDTNFHSKKACKG
RGMGEKESSLECFLGLSLGLKPSLGHLGAMGSQGVGAVKQEVGSSPGCQSNPKQSLFQVL
NSRAILPLFSSSPNLNVVPVKSSLPSAYTVNSQVMWPTIAGPPAAAVTVSTLQPSILGDFKIFG
KSMGLASQYAGKEASPFS
SEQ ID NO:385
MGEMGRGINNSSNNNNSNRPAWLQHYDLVGKIGEGTYGLVFLARSKLPNNRGLRIAIKKFK
QSKDGDGVSPTAIREIMLLREFSHENVVKLVNVHINHVDMSLYLAFDYAEHDLYEIIRHHREKL
NHHNINQYTVKSLLWQLLNGLNYLHSNWIVHRDLKPSNILVMGEGEEHGVVKIADFGLARIYQ
APLKPLSDNGVVVTIWYRAPELLLGAKHYTSAVDMWAVGCIFAELITLKPLFQGVEVKASPNP
FQLDQLDKIFKVLGHPTIEKWPTLMNLPHWSKNLQQIQQHKYDNAGLHIGPIPAKSPAYDLLS
KMLEYDPRKRITAAQALEHEYFRIDPQPGRNALVPSQPGEKAINYPPRLVDANTDFDGTIAPQ
PSQVSSGNAPSGSIASAAVPAVRPLPQQMQLMGMQRMQNPGMAAFNLGAQASMSGLNHN
NIALQRGSSQQQAHQQVRRKEPNSGFPNTGYPPPPKSRRL
SEQ ID NO:386
MDKYEKLEKVGEGTYGKVYKARDKMTGQLVALKKTRLEMDEEGVPPSSLREISLLQMLSQSI
YVVRLLCVEHVTKKGKPLLYLVFEYLDTDLKKFIDYRRSVNAGPLPQNVIQSFMYQLLKGVAH
CHSHGVLHRDLKPQNLLVDKSKGLLKVGDLGLGRAFTVPLKCYTHEVVTLWYRAPEVLLGST
HYSTPVDIWSVGCIFAEMVRRQPLFPGDCEIQQLLHIFTLLGTPTEEMWPGVKRLRDWHEYP
QWKPENLARAVPNLSPTGLDLISKMLQCDPAKRISAKAAMNHPYFDDLDKSQF
SEQ ID NO:387
MDGYEKMDKVGEGTYGKVYMARDKKTGQLVALKKTRLENDGEGIPPTALREISLLQMLSQDI
YIVRLLDVKHTENKLGKPLLYLVFEYMESDLKKYIDSYRRSHTKMPPSMIKSFMYQLCRGVAY
CHSRGVMHRDLKPHNLLVDKEKGVLKIADLGLSRAFTVPVKKYTHEIVTLWYRAPEVLLGAT
HYSLPVDIWSVGCIFAEMSRMQALFTGDSEVQQLMNIFRFLGTPNEEVWPGVTKLKDWHIYP
EWKPQDISHAVPDLEPSGLDLLSQMLVYEPSKRISAKKALEHPYFDDLDKSQF
SEQ ID NO:388
MDAYEKLEKVGEGTYGKVYKAKDKNTGQLVALKKTRLESDDEGIPPTALREISLLQMLSQDIH
IVRLLDVEHTENKNGKPLLYLVFEYMDSDLKKYIDGYRRSHTKVPPNIIKSFMYQLCQGVAYC
HSRGVMHRDLKPHNLLVDKQRGVVKIADLGLGRAFTIPIKKYTHEIVTLWYRAPEVLLGATHY
STPVDIWSVGCIFAEMVRLQALFIGDSEVQQLFKIFSFLGTPNEEIWPGVTKFRDWHIYPQWK
PQDISSAVPDLEPSGVDLLSKMLVYEPSKRISAKKALEHPYFDDLDKSQF
SEQ ID NO:389
MDSYEKLEKVGEGTYGKVYKAKDKKTGKLVALKKTRLENDGEGIPPTALREISLLQMLSQDM
NIVRLLDVEHTENKNGKPLLYLVFEYMDSDLKKYVDGYRRSHTKMPPKIIKSFMYQLCQGVA
YCHSRGVMHRDLKPHNLLVDKQRGVLKIADLGLGRAFTVPIKKYTHEIVTLWYRAPEVLLGAT
HYSTPVDIVVSVGCIFAEMSRMHALFCGDSEVQQLMSIFKFLGTPNEGVWPGVTKLKDWHIY
PEWRPQDLSRAVPDLEPSGVDLLTKMLVYEPSKRISAKKKALLQHPYFDDLDKSQF
SEQ ID NO:390
MEKYEKLEKVGEGTYGKVYKGRDKRTGRLVALKKTPFHQEEGIPPTAIREISLLKSLSQCIYIV
KLLDVKASFNGKGKHVLFMVFEYADSDLKKHIDAHRQCNTKLSPRSIQSYMFQLCKGIAYCH
SHGVLHRDLKPQNILVDQKIGLLKIADLGLGRACTVPIKSYTFEVVTLWYRAPEVLLGAKRYS
MALDIWSLGCIFAELCNLQALFAGDSQIQQLINIFRLLGTPNEQLWPGVTQLSDWHEFPQWR
PQDLSKVVFNLDPNGVDLLSKMLQYDPAKRISAKEALDHPYFDSLDKSQF
SEQ ID NO:391
MGCVCGKPSARAADYVESPAEKGASSNSRSSSMASRRLVAPAVMDQGIDAENGHEGDYRT
KLRGKQSNGADPVSLLSDDAEKQRHSRHHQHQQHHPIRPHHLRPQGEFVPNANSNPRFGN
PPRHIEGEQVAAGWPAWLTAVAGEAIKGWIPRRADSFEKLDKIGQGTYSNVYKARDLDTGKI
VALKKVRFDNLEPESVRFMAREIQVLRRLDHPNVVKLEGLVTSRMSCSLYLVFEYMDHDLAG
LAACPGIKFTEPQVKCYMQQLLRGLDHCHSRGVLHRDIKGSNLLIDNGGILKIADFGLATFFH
PDQRQPLTSRVVTLWYRPPELLLGATEYGVAVDLWSSTGCILAELLAGKPIMPGRTEVEQLHKI
FKLCGSPSEDYWKKSKLPHATIFKPQQPYKRCVAETFKDFPPSALALMEVLLAIEPADRGTAT
SALKSDFFTTKPLACDPSSLPKYPPSKEFDAKIRDEEARRQRAAGGRGRDAARRPSRESRAI
PAPEANAELAISIQKRRLSSQGPSKSKSEKFNPQQEDGAVGFPIEPPRPMHIGIDAGATSRMY
SQQFGPSHSGPLSNQISSSIWGKNQKEDEIQMAPGRPSRSSKATISDFRKPGACAPQPGAD
LSHLSSLVATARSNAGIDTHKDRSGMWQHNRIDAIDGVHNNGKHEFLEVPEHPNRQDWTRF
QQPESFKGLDNYHLQDLPATHHRKDERVASKEATMNWQGYGGQGGDKIHYSGPLLPPSGN
IDEILKEHERHIQHAVRRARQDKGRPQRSNLSQNERKAFEHRSFVSGVNGNAGYSDLVNEL
PISVGSNRLKVSKTRGTEEIVELRELEREPLSSVMEKYEREHEM
SEQ ID NO:392
MGCVCAKQSDILGEPESPKVKGSNLASSRWSVSSETKQLPQHSDSGILHHQHYYHPRDESD
EAKLKESNYGGSKRRTRQGRDPADLDMGIFVRTPSSQSEAELVAAGWPAWMAAFAGEAIH
GWIPRRAESFEKLYKIGQGTYSNVYKARDLDNGKIVALKKVRFDSLDAESVRFMAREILVLRK
LDHPNIVKLEGLVTSEVSSSLYLVFEYMEHDLAGLAACPGIKFTEPQVKCYMQQLLQGLDHC
HRHGVLHRDIKGSNLLIDNGGILKIADFGLATFFYPDQKQLLTSRVVTLWYRPPELLLGATDY
GVAVDIWSAGCILAELLAGKPILPGRTEVEQLHKIFKLCGSPSEDYWKESKLPHATIFKPQHPY
KSCIAFAFKDFSPSALALLETLLAIEPGHRGEASGALKSEFFTTEPLSCDPSSLPKYPPSKEFD
AKLRAQETRRQRDVGVRGHGSEAARRTSRLSRAGPTPNEGAELTALTQKQHSTSHATSNIG
SEKPSTKKEDYTAGLHIDPPRPVNHSYETTGVSRAYDAIRGVAYSGPLSQTHVSGSTSGKKP
KRDHVKGLSGQSSLQPSKPFIVSDSRSERIYEKSHVTDLSNHSRLAVGRNRDTTDPHKSLST
LMQQIQDGTLDGIDIGTHEYARAPVSSTKQKSAQLQRPSALKYVDNVQLQNTRVGSRQSDE
RPANKESDMVSHRQGQRIHCSGPLLHPSANIEDLLQKHEQQIQQAVRRAHHGKREALSNKS
SLPGKKPVDHRAWVSSGKGNKESPYFKGKGNKELSDLKGGPTAKVTNFRQKVM
SEQ ID NO:393
MAVANPGQLNLQEAPSWGSRSVNCFEKLEQIGEGTYGQVYMAKEIETGEIVALKKIRMDNE
REGFPITAIREIKLLKKLQHENVIKLKEIVTSPGPEKDEQGKSDGNKYNGSIYMVFEYMDHDLT
GLAERPGMRFSVPQIKCYMKQLLIGLHYCHINQVLHRDIKGSNLMDNNGILKLADFGLARSFC
SDQNGNLTNRVITLWYRPPELLLGSTKYGPAVDMWSVGCIFAELLYGKPILPGKNEPEQLTKI
FELCGSPDESNWPGVSKLPWYSNFKPQRQMKRRVRESFKNFDRHALDLVEKMLTLDPSQR
ISAKDALDAEYFWTDPVPCAPSSLPRYEPSHDFQTKRKRQQQRQHDEMTKRQKISQHPPQ
QHVRLPPIQNAGQGHLPLRPGPNPTMHNPPPQFPVGPSHYTGGPRGAGGQNRHPQNIRPL
HAAQGGGYNANRGYGGPPQQQGGGYPPHGMGNQGPRGGQFGGRGAGYSQGGPYGGP
VGGRGPNVGGGNRGPQFWSEQ
SEQ ID NO:394
MQNMEDNVQSSWSLHGNKEICARYEILERVGSGTYSDVYRGRRKADGLIVALKEVHDYQSS
WREIEALQRLCGCPNVVRLYEWFWRENEDAVLVLEFLPSDLYSVIKSGKNKGENGIPEAEVK
AWMIQILQGLADCHANWVIHRDLKPSNLLISADGILKLADFGQARILEEPEAIYEVEYELPQEDI
VADAPGERLMEEDDSVKGVRNEGEEDSSTAVETNFGDMAETANLDLSWKNEGDMVMQGF
TSGVGTRWYRAPELLYGATIYGKEIDLWSLGCILGELLILEPLFSGTSDIDQLSRLKVLGTPT
EENWPGCSNLPDYRKLCFPGDGSPVGLKNHVPSCSDSVFSILERLVCYDPAARLNAKEVLE
NKYFVEDPYPVLTHELRVPSPLREENNFSEDWAKWKDMEADSDLENIDEFNVVHSSDGFCI
KFS
SEQ ID NO:395
MDLNQYPEDLNPELPEGTDNVDNPDNNKGSPVPSPHPPLKPLDPSERYRKGITLGQGTYGI
VYKAFDTVTNKTVAVKKIHLGKAKEGVNVTALREIKLLKELSHPNIIQLIDAYPHKQNLHIVFEF
METDLEAVIKDRNLVFSPADIKSYLQMTLKGLAVCHKKWVLHRDMKPNNLLIAADGQLKLGD
FGLARLFGSPDRKFTHQVFAVWYRAPELLFGAKQYGPAVDIWATGCIFAELLLRKPFLQGVS
DLDQIGKIFAAFGTPRQSQWPDVASLPDFVEFQFVPAPSLRSLFPMASEDALDLLSKMFTLD
PKNRITAQQALEHRYFSSVPAPTRPDLLPKPSKVDSSRPPKHASPDGPVVLSPSKARRVMLF
PNNLAGILPKQVSQSTTGGTPIEFDMPTQKLREVCPRSRITESGKKHLKRKTMDMSAALDEC
AREQEGQEGKTILDPDHQRSAKKEKHM
SEQ ID NO:396
MAGGQENCVRITRARAACVSKASAPVIQSQVDEKKSRKRAPKRAAVDDLAANASGSQPKRR
AVLGDVTNLHAAATDCLSTAEDQVDAPNPSIKGRARNKKKEARTSTKVVKDEIHPESNPLAD
HSSNLSECQKPPAAKLAEQRSLRGVPSKAKQGGSSNSQSCSKHTDIDKDHTDPQMCTTYV
EDIYEYLRNAELKNRPSANFMETAQNDITPNMRAILVDWLVEVSEEYKLVPDTLYLTVSYIDR
YLSANPTSRHKLQLLGVSCMLIASKYEEVCPPHVEEFCYITDNTYTRDEMLSMERKILIFLNFE
MTKPTTKSFLRRFVRASQAGNKAPSLHMEFLANYLAELTLMECSFLQYLPSLIAASTVFLSRL
TLDFLTNPWNPTLAHYTGYKASQLKDCVMAIYNVQMNRKGSTLVAIREKYQQHKFKCVASLP
PPPFIAERFFEDTPN
SEQ ID NO:397
MTGTQASNVRITRARAAKSTLNNALPPLPPAQGKPRGKRAATESNISGFSVAAEPLKRRAVL
SDVSNICKEAAAVDCLKKPKAVKVVSQNANAKGRGRGIPRNNKKITQEAEIKKETSPAICNVD
DASAGNAIGDDKQNNNVNPLKEVQDNPKELNPIAEQISVHPHCKQSVEKPNEKEIVVSDNKA
AIASLKQQSTLQSLRIPKQPKYSLKQGNPVPLANLHEDVGRSSCSDFIDIDSEYKDPQMCTAY
VTDIYANMRVVELKRRPLPNFMETTQRDINANMRSVLIDWLVEVSEEYKLVPDTLYLTVSYID
RFLSANVVNRQRLQLLGVSCMLVASKYEEICAPPVEEFCYITDNTYKKEEVLEMEISVLNRLQ
YDLTTPTTKTFLRRFIRAAQASCKVSSLHLEFMGNYLAELTLVEYDFLKYLPSLIAAAAVFVAR
MTLDPMVHPWNSTLQHYTGYKVSDMRDCICAIHDLQLNRKGCTLAAIREKYNQPKFKCVAN
LFPPPIISPQFLIDNEV
SEQ ID NO:398
MAAPNQNALLINNNNRRPLVDIGNLVGALNAQCNISKNGARKRAFGDIGNLVEDLDAKCTISK
YWVRKRPRTNFGVNANKGASSSTQGQGIVVRGEQKAWDRIVWGNKQSCAIKMNAQHVTAT
QRGTAISISDIIDSSVQDGGIKAPSQLKARKQTVRTVTATLTARSEDSLRDVLEVPPGIDDGDR
DNPLAVVEYVEDIYHFYRKIEVRSCVPPDYMTRQLEIKDSMRGVIIDWLIEVHRTFLLMPETLY
LTVNIIDRYLSIQSVTRNELQLMGITAMFIASKYEEISPPKINDLVYITKDAYTSKQIVNMEHTILN
RLKFKLTVPTPYVFLVRFLKAAGPDKVMKNLAFFLVDLCLLHYKMIKYSPSMLAAAAVYTAQC
TLKKHPYWNKTLILHIGYSEAHLRECAHLMADLHLKAEGSNLKSVYKKYSYPIFGSVAFLSPA
KIPAGTVAAPAIDKCAHQIYLRNLR
SEQ ID NO:399
MFPNKQTQGLVQNKKMASKAAQPKAMVPPQRVPPAANNRRALGDIGNIVADVGGKCNVTK
DGVNGKPLAQVSRPITRSFGAQLLAQAAANKGISAANNQTQVPVVIPKADVRGNKQRRTSKS
KDIPPTTVVTNESDDCVIIEQAQRIKPTCNHNVGAVGNKEKPQLLTAKPKSLTASLTSRSAVAL
RGFRFDDEMTEAEEDPLPNIDVGDRDNQLAVVEYVEDIYKFYRRTEQMSCVPDYMPRQQEI
NPKMRAVLINWLIEVHYRFGLMPETLYLTTNLIDRYLATQLVSRSNYQLVGATAMLLASKYEEI
WAPEMNDFLDILENKFERKHVLVMEKAMLNKLKFHLTVPTPYVFLVRFLKAAASDEEMENLV
FFLMELSLMQYVMIKFPPSMLAAAAVYTAQITLKKTTVWNDVLKRHTGYSEIDLKECTRLMVA
FHQSSEESKLNVVFKKYSMPEYDSVALIKPAKLPA
SEQ ID NO:400
MAPSFDCVANAYIESCEDQEKLRQNAQILAQSGENDVDEPVSMLVQRETHYMLPEDYLQRL
RNRTLDVNVRREAVGWILKVHSFYNFGAPTAYLAVNYLDRFLSRHRMPQGVKAWMIQLMAV
ACLSLAAKMEETQVPLPSDLQREDARFIFDARTIQRMELLILSTLQWGMRSITPFSFIDYFAYR
AVQGHGHGHDATPKAVMSRAIELILSTTEEIDFMEYRPSAIAAAALLCAAEEVVPLQAVHYKR
ALSSSITDVDKDKMFGCYNLIQETIIEGGCYWTPMSLQSTEKTPVGVLDAAACLSNTPTSSYS
VKPYASVTAAKRRKLNEICSALLVSQAHPC
SEQ ID NO:401
MAANFWTSSHCKELLDAEKVGIVHPLDKDQGLTQEDVKIIKINMSNCIRTLAQYVKLRQRWA
TAITYCRRVYTRKSFTEYDPQLVAPTCLYLASKAEESTVQAKLVIFYMKKYSKHRYEIKDMLE
MEMKLLEALDYYLVIYHPYRPLIQFLQDAGLNDLKVTAWALVNDTYRTDMLTYPPYMIALACIY
FACIMEEKDAQAWFEELRVDMNEIKNISMEIVDYYDNYRVIPDEKMNSALNKLPHRF
SEQ ID NO:402
MAPALSSSYECLSHLLCAEDASNVVGCWDEDESKIFCEEEEGFGIQHFPDFPVPDDDEIRVL
VRKESQYMPGKSYVQSYQNLGLDFTARQNAIGWILKVHGSYNFGPLTAYLSINYLDRFLSRN
PLPKAKVWMLQLLSVACLSLAAKMEETQVPLLLDLQAEEPDFLFEPRTIQRMELLVLSTLEWR
MLSVTPFSFVDYFLQGGGGRKPPPRAMVARANELIFNTHTVLDFLEHRPSAIAAAAVICAAEE
VLPLEAAQYKETILSCSLVDKEWVFGSYNLIQEVLIEKFSTPKKAKSASSSIPQSPVGVLDAFC
LSNNSNNTSLEASLSVNLYASVAAKRRKLNDYCNTWRMFQHSTC
SEQ ID NO:403
MAPNCIDCAPSDLFCAEDAFGVVEWGDAETGSLYGDEDQLHYNLDICDQHDEHLWDDGEL
VAFAEKETLYVPNPVEKNSAEAKARQDAVDWILKVHAHYGFGPVTAVLSINYLDRFLSANQL
QQDKPWMTQLAAVACLSLAAKMDETEVPLLLDFQVEEAKYIFESRTIQRMELLVLSTLEWRM
SPVTPLSYIDHASRMIGLENHHCWIFTMRCKEILLNTLRDAKFLGLLPSVVAAAIMLHVIKETEL
VNPCEYENRLLSAMKVNKDMCERCIGLLIAPESSSLGSFSLGLKRKSSTINIPVPGSPDGVLD
ATFSCSSSSCGSGQSTPGSYDSNNSSILCISPAVIKKRKLNYEFCSDLHCLED
SEQ ID NO:404
MPQIQYSEKYTDDTYEYRHVAVLPPETAKLLPKNRLLNENEWRAIGVQQSRGWVHYAIHRPE
PHIMLFRRPLNYQQNQQQQAGAQSQPMGLKAQ
SEQ ID NO:405
MDQIEYSEKYYDDTYEYRHVELPPDVARLLPKNRLLTENEWRGIGVQQSRGWVHYAIHCSE
PHIMLFRRPLNYEQNHQHPEPHIMLFRRPLNCQPNHQPQAHHPT
SEQ ID NO:406
MDQIEYSEKYYDDTYEYRHVELPPDVARLLPKNRLLTENEWRGIGVQQSRGWVHYAIHCSE
PHIMLFRRPLNYEQNHQHPEPHIMLFRRPLNCQPNHQPQAHHPT
SEQ ID NO:407
MPQIQYSEKYYDDTYEYRHVVLPPDVARLLPKNRLLNENEWRGIGVQQSRGWVHYAIHRPE
PHIMLFRRHLNYQQNQQQQAQQQPAQAMGLQA
SEQ ID NO:408
MALVETEPVTLIHPEEPKKFKKKPTPGRGGVISHGLTEEEARVKAIAEIVGAMVEGCRKGEDV
DLNALKAAACRRYGLSRAPKLVEMIAALPDGERAAVLPKLKAKPVRTASGIAVVAVMSKPHR
CPHIATTGNICVYCPGGPDSDFEYSTQSYTGYEPTSMRAIRARYNPYVQTRSRIDQLKRLGH
TVDKVEFILMGGTFMSLPADYRDYFIRNLHDALSGHTSSNVEEAVCYSEHSATKCIGLTIETR
PDYCLGPHLRQMLSYGCTRLEIGVQSTYEDVARDTNRGHTVAAVADCFCLAKDAGFKVVVAH
MMPDLPNVGVERDMESFREFFENPAFRADGLKIYPTLVIRGTGLYELWKTGRYRNYPPEQL
VDIIARVLALVPPWTRVYRVQRDIPMPLVTSGVEKGNLRELALARMDDLGLKCRDVRTREAGI
QDIHHKIRPEVVELVRRDYCANEGWETFLSYEDTRQDILVGLLRLRKCGHNTTCPELKGRCSI
VRELHVYGTAVPVHGRDADKLQHQGYGTLLMEQAERIAWKEHRSIKLAVISGVGTRHYYRKL
GYELEGPYMMKYLN
SEQ ID NO:409
MLGFRDLYTSICEHLQRASGRLPIIAAATSLISTPEIAAVEKENKAPNSVDKMGMGSADESGR
FSTSNGQFMNMNNGVVKEEWKGGVPVVPSAPTTVPVITNVKLETPSSPDHDMARKRKLGFL
PLEVGTRVLCKWRDGKFHPVKIIERRKLPNGATNDYEYYVHYTEFNRRLDEWVKLEQLELDS
VETDADEKVDDKAGSLKMTRHQKRKIDETHVEGNEELDAASLREHEEFTKVKNITKIELGRYE
IETWYFSPFPSEYNNCEKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMF
EVDGKKNKVYAQNLCYLAKLFLDHKTLYYDVDLFLFYILCECDERGCHMVGYFSKEKHSEES
YNLACILTLPPYQRKGYGKFLISFSYELSKKEGKVGTPERPLSDLGLLSYRGYWTRVLLDILKK
HKSNISIKELSDMTAIKADDVLSTLQGLDLIQYRKGQHAICADPKVLDRHLKAVGRGGLEVDV
CKLIWTPYKEQ
SEQ ID NO:410
MGSLDESTCSEEIRDEGKDSIRTKFKVESTVNNAQNGGNDNSKKKRAAGLPLEVGIRLLCKW
RDSKLHPVKIIERRKLPNGFPQDYEYYVHYTEFNRRLDEWVKLEQFELDSVETDADEKIEDK
GGSLKMTRHQKRKIDEIHVEEGQGHEDFDPASLREHEEFTKVKNIAKVELGRYEIETWYFSP
FPPEYSHCEKLFFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRNGTLSMFEVDGKKNK
IYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHVVGYFSKEKHSDEAYNLACILTL
PPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLLSYRGYWTRILLDILKKQRGNISIKE
LSDMTAIKVEDVISTLQVLDLIQYRKGQHVICADPKVLDRHLKAAGIAGLEVDVSKLIWTPYKE
QCG
SEQ ID NO:411
MASAPMVGCDDSRDKHRWVESKVYMRKGHGKGSKGNAGFNAQNSTAQVRRENDNM
NSIADNGKSEAASEGLSSLSRKQITVNQDHPPNETSSMPAVGGLQNIDTHVTFKLEGCSKQE
IWELRKKLTNELEQVRGTFKKLEARELQLRGYSVSAGVNTSYSASQFSGNDMRNNGGKEVT
SEVASGGAITPKQAQRESNPPRQLSISLMENNQAASDMGEKGKRTPKANQYYRNSEFVLGK
DKFPPAESKKSKSTGNKKISCSKVFSKETMQVGKEFMPQKSVNEVFKQCSLLLTKLMKHKY
GWVFNLPVDAQALGLHDYHTIIKRPMDLGTVKSKLEKNLYNSPASFAEDVKLTFSNAMTYNP
KGHEVHTMAEQLLQLFEERVVKTIYEEHLDGKMRFGSGQGLGASSSTKKLPFQDSKKNIKKS
EPAGGPSPPKPKSTNHHASRTPSAKKPKAKDPHKRDMTYEEKQKLSTNLQNLPQERLELIV
QIIKKRNPSLCQHDEEIEVDIDSFDTETLWELDRFVTNYKKSLSKNKKKALLADQAKRASEHG
SARNKHPMIGRELPMNNKKGEQGEKVVEIDHMPPVNPPVVVEVEKDGVYAKRSSSSSSSSSD
SGSSSSDSDSGSSSGSESDAYAATSPPAGSNTSARG
SEQ ID NO:412
MEGHSGALGFGQGFSRSSQSPNLSPSPSHSASASVTSSGQKRKRNEVEHAGVASNSTGM
FAVPPSHIYSHLHPMSMSMPMPMHNSHPSSLSESRDGALTSNDDDDNLTGGNQSQLDSMS
AGNTDGREDFDDEDDDDDDEEDDDEVEGDEEDQDHDPDADDDSDDGHDSMRTFTAARLD
NGAPNSRNLKPKADAAGVAIAPTVKTEPILDTVKEEKVSGNNNNNSVSANNAQVAPSGSAVL
LSAVKEEANKPTSTDHIQTSGAYCAREESLKREEDADRLKFVCFGNDGIDQHMIWLIGLKNIF
ARQLPNMPKEYIVRLVMDRSHKSVMIIKQNQVVGGITYRPYLSQKFGEIAFCAITADEQVKGY
GTRLMNHLKQHARDVDGLTHFLTYADNNAVGYFIKQDFTKEIKLEKERWHGYIKDYDGGILM
ECKIDPKLPYTDLPAMIRVVQRQTIDEKIRELSNCHIVYSGIDIQKKEAGIPRKPIKVEDIPGLKEA
GWTTDQWGHSRFRLLNSPSEGLPNRQVLHAFMRSLHKAMVEHADAWPFKEPVDPRDVPD
YYDIIKDPMDVKRMFTNARTYNTHETIYYKCANR
SEQ ID NO:413
MEESGNSLTSGPDGSKRRVSYFYDSDIGNYYYSQGHPMKPHRIRMAHSLIVHYALDEKMEV
CRPNLLQSRELRVFHADDYISFLQSVTPETQHEQLRQLKRFNVGEDCPVFDGLYNFCQTYA
GGSVGAAIKLNNKEADIAINWSGGLHHAKKCEASGFCYVNDIVLAILELLKVHQRVLYIDIDIHH
GDGVEEAFYSTDRVMSVSFHKFGDYFPGTGHLKDVGYGKGKYYSLNVPLNDGIDDESYKNL
FRPIIQKVMEIYQPEAVVLQCGADSLSGDRLGCFNLSVKGHADCVRFLRSFNVPLVLVGGGG
YTIRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLHVAPSNMENQNSAKELAKIR
NTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQ
NRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKWPLGEAG
SEQ ID NO:414
MEESGNSLTSGPDGSKRRVSYFYDSDIGNYYYSQGHPMKPHRIRMAHSLIVHYALDEKMEV
CRPNLLQSRELRVFHADDYISFLQSVTPETQHEQLRQLKRFNVGEDCPVFDGLYNFCQTYA
GGSVGAAIKLNNKEADIAINWSGGLHHAKKCEASGFCYVNDIVLAILELLKVHQRVLYIDIDIHH
GDGVEEAFYSTDRVMSVSFHKFGDYFPGTGHLKDVGYGKGKYYSLNVPLNDGIDDESYKNL
FRPIIQKVMEIYQPEAWLQCGADSLSGDRLGCFNLSVKGHADCVRFLRSFNVPLVLVGGGG
YTIRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLHVAPSNMENQNSAKELAKIR
NTLLEQLKRIQHVPSVPFQERPPDTKFPEEDEEDYEKRPKGHKWGGEYFGSESDEEQKPQ
NRDIDISDKPGIRRQSPPNVEAAKKIKVEEEDGDIGIVNENDGAKWPLGEAG
SEQ ID NO:415
MEFWGVEVKPGEALTCDPGDERYLHMSQAAIGDKEGAKENERVSLYVHVDGKKFVLGTLS
RGKCDQIGLDLVFEKEFKLSHTSQTGSVFVSGYTTVDHEALDGFPDDEDLESSEDEEEELAQ
ITTLTAKENGGKTGAKPVKPESKSSVTDKAAAKGKPEVKPPVKKQEDDSDSDEDEDEDEDE
DEDDDDEDDEDMKDASASDDGDEEDDSDEESDDDEEEDEETPKPAAGKKRPMPASDNKS
PATDKKAKITTPAGGQKPGADKGKKTEHIATPYPKHGAKGPASGVKGKETPLGSKQTPGSK
VKNSSTPESGKKSGQFKCQSCSRDFATEGALSSHNAAKHGGK
SEQ ID NO:416
MMETGGNSLPSGPDGVKRKVAYFYDPEVGNYYYGQGHPMKPHRIRMTHALLVQYGLHKE
MQILKPYPARDRDLCRFHADDYVAFLRGITPETIQDQVKALKRFNVGDDCPVFDGLYQYCQT
YAGGSVGGAVKLNHKLCDIAINWAGGLHHAKKCEASGFCYVNDIVLAILELLKYHKRVLYVDI
DIHHGDGVEEAFYTTDRVMTVSFHKFGDYFPGTGDIRDIGCGKGKYYAVNVPLDDGIDDESF
QSLFKPIIQQVMLVYNPEAIVLQCGADSLSGDRLGCFNLSVKGHAECVRYMRSFNVPLLMVG
GGGYTVRNVARCWCYETGVAVGVEIDDKMPQHEYYEYFGPDYTVHVAPSNMENKNTKQYL
DKIRSKILENINSLPCAPSAQFQVQPPDTDFPELEEEDYDERTRSHKWDGASCDSDSENGDL
KHRNHDVEESAFPRHNLANISYNTKIKLEGVGTGGLDMAAGTDTKKNDESFEAMDYESGEE
LRQDHFASTINASQPCDPALLTGVQNQLQSTDTVKPIEQSGNAPGIPPPSVATVSTGTRPSSI
SRTSSLNSMSSVKQGSILGPNPPQGLNASGLQFPVPTSNSPIRQGGSYSITVQAPDKQGLQ
NHMKGPQNMPGNS
SEQ ID NO:417
MPPKDRVAYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVLSYELHKKMEIYRPHKAYPVELA
QFHSADYVEFLHRITPDTQHLFTKELVKYNMGEDCPVFENLFEFCQIYAGGTIDAAHRLNNQI
CDIAINWSGGLHHAKKCEASGFCYINDLVLGILELLKHHARVLYVDIDVHHGDGVEEAFYFTD
RVMTVSFHKYGDMFFPGTGDVKEVGEREGKYYAINVPLKDGIDDASFTRLFKTIITKVVDIYQ
PGAIVLQCGADSLAGDRLGCFNLSIDGHAQCVRIVKKFNLPLLVTGGGGVTKENVARCWSVE
TGVLLDTELPNEIPDNDYIKYFAPDYSLKINTAGNMENLNSKTYLSAIKVQVMENLRAIQHAPS
VQMHEVPPDFYIPDIDEDELNPDERMDQHTQDRQIQRDDEYYDGDNDIDHDMEEAS
SEQ ID NO:418
MDSSKSEEANILHVFWHEGMLNHDLGTGVFDTLEDPGFLEVLEKHPENADRVRNMLSILRK
GPIAPYTEWHTGRAAYLSELYSFHRPDYVDMLAKTSTAGGKTLCHGTRLNPGSWEAALLAA
GTTLEAMRYILDGHGKLSYALVRPPGHHAQPTQADGYCFLNNAGLAVELAVASGCKRVAVV
DIDVHYGNGTAEGFYERDDVLTISLHMNHGSVVGPSHPQTGFHDEVGRGKGLGFNLNVPLP
NGTGDKGYEHAMHELVVPAISKFMPEMIVLVIGQDSSAFDPNGRECLTMEGYRKIGQIMRQQ
ADQFSGGRLVVVQEGGYHITYAAYCLHATLEGVLCLPHPLLSDPIAYYPEHDIYSERVTFIKNY
WQGIISTTDKRN
SEQ ID NO:419
MEESGNALVSGPDGSKRRVTYFYDADIGNYYYGQGHPMKPHRMRMAHNLIVHYGLHQRME
VCRPHLAQSKDIRAFHTDDYIHFLSSVAPDTQQEQLRQLKRFNVGEDCPVFDGLFNFCQSSA
GGSIGAALKLNRKDADIAINWAGGLHHAKKCEASGFCYVNDIVLGILELLKVHQRVLYIDIDIHH
GDGVEEAFYTTDRVMTVSFHKFGDYFPGTGHIKDVGYGKGKYYALNVPLNDGIDDESYKHL
FRPIIQKVMEVYQPEAVVLQCGADSLSGDRLGCFNLSVKGHADCVRFVRSFNIPLMLVGGG
GYTIRNVARCWCYETAVAVGVEPQDKLPYNEYYEYFGPDYTLYVAPSNMENLNTEKDLEKM
RNVLLEQLSKIQHTPSVPFQERPPDTEFNDEEEEDMEKRSKCRIWDGEYVGSEPEEDGKLP
RFDADTYERSVLKHENKRLVPVSNVEPLKRIKQEEDGAAV
SEQ ID NO:420
MPPKDRVAYFYDGDVGSVYFGPNHPMKPHRLCMTHHLVLSYELHKKMEIYRPHKAYPVELA
QFHSADYVEFLHRITPDTQHLFTKELVKYNMGEICPVFENLFEFCQIYAGGTIDAAHRLNNQI
CDIAINWSGGLHHAKKCEASGFCYINDLVLGILELLKHHARVLYVDIDVHHGDGVEEAFYFTD
RVMTVSFHKYGDMFFPGTGDVKEVGEREGKYYAINVPLKDGIDDASFTRLFKTIITKVVDIYQ
PGAIVLQCGADSLAGDRLGCFNLSIDGHAQCVRIVKKFNLPLLVTGGGGYTKENVARCWSVE
TGVLLDTELPNEIPDNDYIKYFAPDYSLKINTAGNMENLNSKTYLSAIKVQVMENLRAIQHAPS
VQMHEVPPDFYIPDIDEDELNPDERMDQHTQDRQIQRDDEYYDGDNDIDHDMEEAS
SEQ ID NO:421
MDLNLVSHGEEEEGVRRRKVGIVYDERMCKHATPEDQPHPEQPDRIRVIWDKLNSAGVLHK
CVMVEAKEASEEQLAGVHSRKHIEVMKSIGTARYNKKKRDKLAASYSSIYFSQGSSEAALLA
AGSVVEISEKVASGELDAGVNVRPPGHHAEADKAMGFCLFNNIAIAAKHLVHERPELGVQEV
LIVDWDVHHGNGTQHMFWTDPHVLYFSVHRFDAGTFYPGGDDGFYDKIGEGKGAGYNINV
PWEQGKCGDADYLAVWDHVLVPVAKSYDPDMVLISGGFDAALGDPLGGCRLTPYGYSLMT
KKLMEFAGGKIVLALEGGYNLKSLADSFLACVEALLKDGPSRSSVTHPFGSTWRVIQAVRK
ELSSFWPALNEELQLPRLLKDASESFDKLSSSSSDESSASEDEKKIAEVTSIMEVSPDPSSILA
LTAEDIAQPLAGLKIEEAGTDSQRSSDHTLLDLTNDDTQKLKQFEGEIFVMIGDEESVPSASS
SKDQNESTVVLSKSNIKAHSWRLTFSSIYVWYASYGSNMWNPRFLCYIEGGQVEGMAKRCC
GSEDKTPPQRIQWKVVPHRMFFGRSYTNTVVGSGGVSFLDPNCSDTSEAHVCLYKITLAQFN
DLLLQENNLNCGTEHPLVDLSSIDAIRNGNSILEUKDSWYGTLIYLGMEGGLPIVTFTCSVCD
VEKFKHGQLPLCPPSSRYENILIERGLVQGKKLSEDDATAYIRAASTSPLL
SEQ ID NO:422
MADEDLDLSDVGEVEDEPGEEIESTPPLAVGQEKEINSLALKKKLLKVGTRWETPENGDEVT
VHYTGTLPDGTKFDSSRDRGEPFTFKLGQGQVIKGWDQGIVFMKKGERALFTIPPELAYGSS
GVRPTIPPNATLQFDVELLSWTNIVDVCNDGGILKRIISEGEKYERPKDPDEVTVKYEAKLED
GTLVAKSPEEGVEFYVNDGHFCPAIAKAVKTMKRGESVILTIKPTYAFGERGKDAEEGFAAIP
PNATLTTSLELVSFKAVIAVTEDKKVIKKILKEADGYDKPSDGTVVQIRYTAKLQDGTIFEKKGY
EGEEPFQFVVDEEQVIAGLDKAVETMKTGEIALITIGAEYGFGNFETQRDLAVIPPNSTLIYEV
EMISFTKEKESWDMDTTEKIEASKQKKEQGNSLFKVGKYQRAAKKYEKAAKYIEHDSSFSAE
EKKQSKVLKVSCNLNHAACRLKLKDFKEAVKLCSKVLELESQNVKALYRRAQAYIETADLDLA
EFDIKKALEIEPQNREVQLEYKILKQKQIEYNKKDAKLYGNMFAKLNKLEAFEGKVLS
SEQ ID NO:423
MADEGLELSDVAEVEDEPGEEFESAPPLVVGQEKELNSSGLKKKLLKAGTRCETPENGDEV
TVHYTGTLLDGTKFDSSRDRGEPFTFNIGQGQVIKGWDQGIVTMKKREHALFTIPPELAYGA
SGMPPTIPPNATLQFDVELLSWTNIVDVCKDGGILKRIISDGEKYERPKDPDEVTVKYEAKLE
DGMLVAKSPEEGVEFYVNDGNFCPAIVKAVKTMKKGENVTLTIKPAYAFGEQGKDAEEGFA
AIPPNATITINLQLVSFKAVKEVTEDKVIKKILKEADGYDKPSDGTVVQIRYTAKLQDGTIFEKK
GYAGEEPFQFVVDEEQVIAGLDKAVETMKTGEVALITIGPEYGFGNIETQRDLAVIPPYSTLIY
EVEMVSFTKEKESWDMNTTENIEASKQKKEQGNSLFKVGKYLRAAKKYDKAAKYIEHDNSF
SAEEKKQSKVLKVSCNLNHAACCLKLKDFKKAVKLCSKVLELESQNVKALYRRAQAYIETADL
DLAEFDIKKALEIEPQNREVRLEYLILKQKQIEYNKKDAKLYGNMFARQNKLEAIEGKD
SEQ ID NO:424
MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSF
HRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFI
CTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA
SEQ ID NO:425
MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGNGRSGKPLHFKGSSF
HRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFI
CTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA
SEQ ID NO:426
MPNPKVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSF
HRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFI
CTAQTSWLDGKHVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA
SEQ ID NO:427
MPNPIVFFDMQVGGAPAGRIVMELYADVVPKTAENFRALCTGEKGTGRSGKPLHFKGSSF
HRVIPGFMCQGGDFTRGNGTGGESIYGEKFADENFVKKHTGPGILSMANAGPNTNGSQFFI
CTAQTSWLDGKIVVFGQVVEGLEVVRDIEKVGSGSGRTSKPVVIADSGQLA
SEQ ID NO:428
MADDFELPESAGMMENEDFGDTVFKVGEEKEIGKQGLKKLLVKEGGSWETPETGDEVEVH
YTGTLLDGTKFDSSRDRGTPFKFKLGQGQVIKGWDQGIATMKKGENAVFTIPPDLAYGESGS
QPTIPPNATLKFDVELLSWASVKDICKDGGIFKKIIKEGEKWEHPKEADEVLVKYEARLEDGTV
VSKSEEGVEFYVKDGYFCPAFAIAVKTMKKGEKVLLTVKPQYGFGHQGREAIGNDVARSTN
ATLLVDLELVSWKVVDEVTDDKKVLKKILKQGEGYERPNDGAVVKVKYTGKLEDGTIFEEKG
SDEEPFEFMAGEEQVVDGLDRAVMTMKKGEVALVSVAAEYGYQTEIKTDLAVVPPKSTLIYE
VELVSFVKEKESWDMNTAEKIEAAGKKKEEGNALFKVGKYFRASKKYEKATKYIEYDTSFSE
EEKKQSKPLKVTCNLNNAACKLKLKDYTQAEKLCTKVLEVESQNVKALYRRAQAYIQTADLE
LAELDIKKALEIDPNNRDVKLEYRALKEKQKEYNKKEAKFYGNMFARMSKLEELESRKSGSQ
KVETANKEEGSDAMAVDGESA
SEQ ID NO:429
MAASLTPLGAGLAYATIYDQAKVRKLEPTKRSLIALCQHSDSQHRRFITRKYHVNVQILNRRD
AIRLIGLAAGLCIDLSLMYDARGAGLPPQENAKLCDTTCEKELENAPMITTESGLQYKDIKIGN
GPSPPIGFQVAANYVAMVPSGQVFDSSLDKGQPYIFRVGSGQVIKGLDEGLLSMKVGGKRR
LYIPGPLAFPKGLNSAPGRPRVAPSSPVIFDVSLEFIPGLESEEE
SEQ ID NO:430
MSAASLSADMAIRGTILGKTALHVLGPQVVSQCRQPVMFKCPPHTLRKMRFSAQDLQSKNF
YSGFTPFKSVFISTSKRSWQAGSARAMSQDAAFQSKVTTKCFLDIEIGGDPAGRIVLGLFGE
DVPKTAENFRALCTGEKGFGYKGSSFHRIIKDFMLQGGDFDRGDGTGGKSIYGRTFEDENF
KLAHVGPGVLSMANAGPNTNGSQFFICTVKTPWLDKRHVVFGQVIEGMEIVKKLESEETNRT
DRPKRPCRIVDCGELP
SEQ ID NO:431
MGRIKPQTLLQQSKKKKVPGRISVSTIIVCNLIIIFLMFSLVGIYRQRAKRNRATSRSDGDEEME
NFGRSKINSVPHQAIVNTTKGLITLELFGKSSAHTVEKFVEWSERGYFNGLPFYRVIKHFVIQV
GDPKFAGNREDWTVGGQENVQLEFSPKHEAFMLGTSKLEDQGDGFELFITTAPIPDLNDKLN
VFGRVIKGQDVVQEIEEVDTDEHFQPKSPIIINDVRLKDEL
SEQ ID NO:432
MARQSTLLLFWSLVFLGAIVFTQAKHEELEEVTHKVYFDVDIAGKPAGRVVIGLFGKAVPKTV
ENFRALCTGEKGVGKSGKPLHYKGSFFHRIIPSFMIQGGDFTLGDGRGGESIYGTKFADENF
KLKHTGPVFITTVTTDWLDGRHWFGKIISGMDVVYKVEAEGRQSGQPKRKVKIADSGELSM
D
SEQ ID NO:433
MARQSTLLLFWSLVFLGAIVFTQAKHEELEEVTHKVYFDVDIAGKPAGRVVIGLFGKAVPKTV
ENFRALCTGEKGVGKSGKPLHYKGSFFHRIIPSFMIQGGDFTLGDGRGGESIYGTKFADENF
KLKHTGPGFLSMANAGPDTNGSQFFITTVTTDWLDGRHVVFGKIISGMDVVYKVEAEGRQS
GQPKRKVKIADSGELSMD
SEQ ID NO:434
MEMDEIQEQSQPQSSEKQDISQESDTGNDKTINAEKITSENAEVEEDDMLPPKVNTEVEVLH
DKVTKQIIKEGSGNKPSRNSTCFLHYRAWAESTMHKFQDTWQEQQPLELVLGREKKELSGF
AIGVAGMKAGERALLHVDWQLGYGEEGNFSFPNVPPRANLIYEAELIGFEEAKEGKARSDMT
VEERIEAADRRRQQGNELFKEDKLAEAMQQYEMALAYMGDDFMFQLFGKYKDMANAVKNP
CHLNMAQCLLKLNRYEEAIGQCNMVLAEDEKNIKALFRRGKARATLGQTDDAREDFQKVRK
FSPEDKAVIRELRLLAEHDKQVYQKQKEMFKGLFGQKPEQKPKKLHWFVVFWQVLLSMIRTI
FRMRSKTD
SEQ ID NO:435
MAGAGEGTPEVTLETSMGPITVELYHKHAPKTCRNFLELSRRGYYNNVKFHRVIKDFMVQG
GDPTGTGRGGESIYGPRFEDEITRDLKHTGAGILSMANAGPNTNGSQFFISLAPTPWLDEKH
TIFGRVCKGMDVVKRLGNVQTDKNDRPIHDVKILRTTVKD
SEQ ID NO:436
MMDPELMRLAQEQMSKISPDELMKMQRQIMANPDLMRMASENMKNLKPEDIRFAAEQMKN
VRKEEMAEISERISRASPEEIEAMKARANLQSAYQLQVAQNLKDQGNQLHARMKYSEAAEK
YLQARNNLTGIPFSEAKSLLLASSSNLMSCYLKTGQYEECVQTGSEVLAYDAMNVKALYRRG
QAYKQIGKLELAVADLRKAVEVSPEDETIAQALREASTELMEKGGTQDQNGPRIEEIIEEEAV
QPTAEKYPQSAPMVTSVTEDVSDDEQGSEDQNGFSRDSFQATNAPDGQMYAESLRNLTEN
PDMLRTMQSLMKNVDPDSLVALSGGKLSPDMVKTVSGMFGRMSPEEIQNMMKMSSTLSRQ
NPSTSSRFDDITRGHSNMDSSPQSVSVDNDLFEENQNRVGESSTNLSSSAAFSGMPNFSAE
MQEQVRNQMNDPATRQMFTSMIQNMSPEMMASMSEQFGVKLSPEDAVKAQNAMASLSPN
DLDRLMNWATRLQTAIDYARKIKNWILGRPGLIFAISMLLLAIILHRFGYIGD
SEQ ID NO:437
MGVEKEILRPGNGPKPRPGQSVTVHCTGYGKNEDLSQKFWSTKDPGQKPFTFTIGQGRVIK
GWDEGVLDMQLGEIFKLRCSPDYGYGSNGFPAWGIRPNSVLVFEIEVLSVN
SEQ ID NO:438
MPNPRCYLDITIGEELEGRILVELYSDVVPKTAENFRALCTGEKGIGPHTGVPLHYKGLPFHR
VIKGFMIQGGDISAQNGTGGESIYGLKFDDENFQLKHERRGMLSMANSGPNTNGSQFFITTT
RTSHLDGKHVVFGKVIKGMGVVRGIEHTPTESNDRPSLDVVISDCGEIPEGSDDGIANFFKD
GDLYPDWPADLDEKSAEISWWMNAVDSAKCFGNENYKKGDYKMALRKYRKALRYLDICWE
KEEIDEEKSNHLRKTKSQIFTNSSACKLKLGDLKGALLDTEFAMRDGEDNVKALFRQGQAYM
ALKDVDSAVASFKKALQLEPNDAGIRKELAVATKMINDRRDQERRAYARMFQ
SEQ ID NO:439
MGDVIDLNGDGGVLKTIIRSAKPGAMQPTEDLPNVDVHYEGTLADTGEVFDTTREDNTLFSF
ELGKGTVIKAWDIAVKTMKVGEVARITCKPEYAYGSAGSPPDIPENATLIFEVELVACKPRKG
STFGSVSDEKARLEELKKQREIAAASKEEEKKRREEAKATAAARVQAKLEAKKGQGRGKGK
SKGK
SEQ ID NO:440
MGLGLKIASASFLPIFNIMATRSLCILLVCFIPVLAHVLSLQDPELGTVRVYFQTTYGDIEFGFFP
HVAPKTVEHIYKLVRLGCYNSNHFFRVDKGFVAQVADVVGGREVPLNSEQRKEGEKTIVGEF
SEVKHVRGILSMGRYSDPDSASSSFSILLGNAPHLDGQYAVFGKVTKGDDTLKRLEEVPTRQ
EGIFVMPLERIRILSTYYYDTNERESNLTCDHEVSILKRRLVESAYEIEYQRRKCLP
SEQ ID NO:441
MASKRSLRTMNVWPTLPPLVLLLLLCFSSMSSSVVAKKSDVSELQIGVKHKPKSCDIQAHKG
DRIKVHYRGSLTDGTVFDSSFERGDPIEFELGSGQVIKGWDQGLLGMCVGEKRKLRIPSKLG
YGAQGSPPKIPGGATLIFDTELVAVNGKGISNDGDSDL
SEQ ID NO:442
MSGAPAERPISYFDITIGGKPIGRIVFSLYADLVPKTAENFRALCTGEKGIGKSGKPLCYAGSG
FHRVIKGFMCQGGDFTAGNGTGGESIYGEKFEDEAFPVKHTKPFLLSMAMAGKDTNGSQFFI
TVSQTPHLDDKHVVFGEVIKGKSNRNENYPTASGDVPTSPIIISACGVLSPDDPSLAASEETI
GDSYEDYPEDDDSDVQNPEVALDIARKIRELGNKLFKEGQIELALKKYLKSIRYLDVHPVLPD
DSPPELKDSYDALLAPLLLNSALAALRTQPADAQTAVKNATRALERLELSDADKAKALYRRAS
AHVILKQEDEAEEDLVAASQLSPEDMAISSKLKEVKDEKKKKREKEKKAFKKMFSS
SEQ ID NO:443
MASSLRSSLFSSWALDSKSVCSLFNLNPGKMGLPSISTPLNWRTCCCSHSSELLELNEGLQS
SRRKTVMGLSTVIALSLVYCDEVGAVSTSKRALRSQKVPEDEYTTLPNGLKYYDLKVGSGTE
AVKGSRVAVHYVAKWKGITFMTSRQGMGITGGTPYGFDVGASERGAVLKGLDLGVQGMRV
GGQRILIVPPELAYGNTGIQEIPPNATLEFDVELISIKQSPFGSSVKIVEG
SEQ ID NO:444
MGAIEDEEPPLKRLKVSSPGLRRGLEEEAPSLSVGSVSILMAKSLSLEEGETVGSKGLIRRVE
FVRIITQALYSLGYQKAGALLEEESGILLQSSNVALFRKQILDGKWDESVVTLRGIDQVEVEGN
TLKAASFLILQQKFFELLDKGNIPEAMKTLRLEISPMQLNTKRVHELASCIVFPSRCEELGYSK
QGNPKSSQRMKVLQEIQQLLPPSIMIPEKRLERLVEQALNVQREACIFHNSLDPALSLYTDHQ
CGRDQIPTTTLQVLESHKNEVWFLQFSNNGKYLASASKDCSAIIWEITEGDSFSMKHRLSAH
QKPVSFVAWSPDDKLLLTCGIEEVVKLWNVETGECKLTYDKANSGFTSCGWFPDGERFISG
GVDKCIYIWDLEGKELDSWKGQGMPKISDLAVTSDGKEIISICGDNAIVMYNLDTKTERLIEEE
SGITSLCVSKDSRFLLLNLANQEIHLWDIGARSKLLLKYKGHRQGRYVIRSCFGGSDLAFVVS
GSEDSQVYIWHRGNGELLAVLPGHSGTVNCVSWNPVNPHVFASASDDYTIRIWGVNRNTFR
SKNASSSNGVVHLANGGP
SEQ ID NO:445
MPGTTAGAGIEPIEPQSLKKLSLKSLKRSFDLFASLHGEPQPPDQRSQRIRIACKVRAEYEVV
KNLPTLPQREVGSSVSNSNVGETHSSLTTNQAQGFPTDTSGDLSKDEGKEITSIAVHLQPQT
GLIDGKAGAIAGTSTAISSVGSSDRYQPSAAIMKRLPSKWPRPIWHPPWKNYRVISGHLGWV
RSVAFDPGNEWFCTGSADRTIKIWEVATGKLKLTLTGHIEQIRGLAVSSRHPYLFSAGDDKQ
VKCWDLEYNKAIRSYHGHLSGVYCLALHPTLDILCTGGRDSVCRVWDIRTKAQIFALSGHEN
TVCSVFTQAIDPQVVTGSHDTTIKLWDLAAGKTMSTLTYHKKSVRAIAKHPFEHTFASASADN
IKKFKLPKGEFLHNMLSQQKTIVNAMAINEDNVLVSAGDNGSLWFWDWKSGHNFQQAQTIV
QPGSLDSEAGIYALQYDITGSRLVSCEADKTIKMWKEDETATPESHPINFKAPKDIRRF
SEQ ID NO:446
MRPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGERLGTYRGHNGAVWCCDV
SRDSTRLITSSADQTAKLWNVETGAQLFSFNFESPARAVDLAIGDKLVVITTDPFMELPSAIHI
KRIEKDLSKQTADSVLTITGIKGRINRAVWGPLNSTIISGGEDSVVRIWDSETGKLLRESDKET
GHQKPITSLCKSADGSHFLTGSLDKSARLWDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGG
QEASHVTTTDRRAGKFEAKFFHKILEEEIGGVKGHFGPINSLAFNPDGRSFASGGEDGYVRL
HHFDPDYFHIKM
SEQ ID NO:447
MRPILMKGHERPLTFLKYNRDGDLLFSCAKDHTPTVWYGHNGERLGTYRGHNGAVWCCDV
SRDSTRLITSSADQTAKLWNVETGNQLFSFNFESPARAVDLAIGDKLVVITTDPFMELPSAIHI
KRIEKDLSKQTADSVLTITGIKGRINRAVWGPLNSTIISGGEDSVVRIWDSETGKLLRESDKET
GHQKAITSLCKSADGSHFLTGSLDKSARLWDIRTLTLIKTYVTERPVNAVAISPLLDHVVIGGG
QEASHVTTTDRRAGKFEAKFFHKILEEEIGGVKGHFGPINSLAFNPDGRSFASGGEDGYVRL
HHFDPDYFHIKM
SEQ ID NO:448
MAENNVGDFIPLDRQEYPSKPAPGAVDSSFWKSFKKKEVSRQIAGVTCINFCPEPPHDFAVT
SSTRVHIYDGKSCELKKTITKFKDVAYSGVFRSDGQIIAAGGETGVIQVFNAKSQMVLRQLKG
HGRPVRVVRYSPQDKLHLLSGGDDSMVKWWDITTQEELLNLEGHKDYVRCGAASPSSVNL
WATGSYDHTVRLWDLRNSKTVLQLKHGKPLEDVLFFPSGGLLATAGGNVVKVWDILGGGRP
IHTMETHQKTVMAMCISKVPRSGQALGDAPSRLVTASLDGYMKVFDLDHFKVTHSARYPAPI
LSMGISSLCRTMAVGTSSGLLFIRQRKGQIEDKIHSDSSGLQVNPVNDEKDSAVLKPNQYRY
YLRGRSEKPSEGDYVVKRMAKVYFQEYDKDLRHFNHSKALVSALKAADSKGTVAVIEELVAR
KRLIQTLSILNLDELELLINFLSRFILVPKYSRFLISLTDRVLDARAVDLGKSENLKKQIADLKGIV
VQELRVQQSMQELQGIIEPLIRASAR
SEQ ID NO:449
MDVETSGKPTGNKRTYTRLPRQVCVFWQEGRCTRESCNFLHVDEPGSVKRGGATNGFAP
KRSYNGSDERDTLAAGPPGGSRRNISARWGRGRGGIFISDERQKIRNKVCNYWLAGNCQR
GEECKYLHSFVMGSDVKFLTQLSGHVKAIRGIAFPSDSGKLYSGGQDKKVIVWDCQTGQGT
DIPLNDEVGCLMSEGPWIFVGLPNAVKAWNILTSTELSLVGPRGQVHALAVGNGMLFAGTHD
GSILAWKFSPASNTFEPAASLVGHTQAVVSLVSGADRLYSGSMDKTIRVWDLGTFQCLQTLR
DHTSVVMSLLCWDQFLLSCSLDNTVKVWVATSSGALEVTYTHNEEHGVLALCGMNDEQAK
PVLLCSCNDNTVRLYDLPSFSERGRIFSRNEVRTFQIAPGGLFFTGDATGELKVWNWATQKS
SEQ ID NO:450
MSVQELRERHAAATAKVNALRERIKAKRLQLLDTDVATYASSNGRTPISFSFTDLVCCRTLQ
GHTGKVYSLDWTSEKNRIVSASQDGRLIVWNALTSQKTHAIKLPCAWVMTCAFSPSGQAVA
CGGLDSVCSIFQLNNQLDRDGHLPVSRILSGHRSYVSSCQYVPDGDTHVITGSGDRTCIQW
DVTTGQRIAIFGGEFPLGHTADVMSVSISAANPKEFVSGSCDTTTRLWDTRIASRAIRTFHGH
EADVNTVKFFPDGLRFGSGSDDGTCRLFDIRTGHQLQVYRQPPRENQSPTVTAIAFSFSGRL
LFAGYSNGDCFVWDTILEKVVLNLGELQNTHNGRISCLGLSADGSALCTGSWDKNLKIWAFG
GHRKIV
SEQ ID NO:451
MKVKIISRSTDEFTRERSNDLQRVFRNFDPNLHTQARAQEYVRALNAAKLDKIFAKPFLAAMS
GHIDGISAMAKSPRHLKSIFSGSVDGDIRLWDIAARRTVQQFPGHRGAVRGLTVSTEGGRLIS
CGDDCVRLWDIPVAGIGESSYGSENVQKPLATYVGKNSFRAVDYQWDSNVFATGGAQVDI
WDHDRSEPTNSFAWGSDTVISVRFNPAEKDIFATTASDRSIVLYDLRMASPLNKLIMQTRNN
AIAWNPREPMNFTAANEDCNCYSYDMRRMNISTCVHQDHVSAVMDIDYSPSGREFVTGSY
DRTVRIFPYNAGHSREIYHTKRMQRVFCVKFSGDATYVVSGSDDANIRLWKAKASEQLGVLL
PRERKRHEYLDAVKERFKHLPEIKRIERHRHLPKPIYKAALLRHTVNAAAKRKEERKRAHSAP
GSVVTNPLRKKRIVAQLE
SEQ ID NO:452
MDHYYQDDFDYLVDDEMVDFADDVEDDVRTRRRSDIDSDSENDFDLNNKSPDTTALQAKR
GKDIQGIPWNRLNFTREKYRETRLQQYKNYENLPRPRRSRNLDKECTNFERGSSFYDFRHN
TRSVKATIVHFQLRNLVWATSKHNVYLMQNYSIMHWSSLKQKGEEVLNVAGPIVPSVKHPGS
SPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDKPGVSFCTKISHDENGITNAVEIYNDA
SGATRLMTANNDLAVRVFDTEKFTVLERFSFPWSVNHTSVSPDGKLVAVLGDNADCLLADC
KTGKTVGTLRGHLDYSFAAAWHPDGYILATGNQDTTCRLWDVRKLSSSLAVLKGRMGAIRSI
RFSSDGRFMAMAEPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDTEAFFVGVADRTYGS
LLEFNRRRMNYYLDSIL
SEQ ID NO:453
MAEALVLRGTMEGHTDAVTAIATPIDNSDMIVSSSRDKSILLWNLTKEPEKYGVPRRRLTGHS
HFVQDVVISSDGQFALSGSWDSELRLWDLNTGLTTRRFVGHTKDVLSVAFSIDNRQIVSASR
DRTIKLWNTLGECKYTIQPDAEGHSNWISCVRFSPSATNPTIVSCSWDRTVKVWNLTNCKLR
NTLVGHGGYVNTAAVSPDGSLCASGGKDGVTMLWDLAEGKRLYSLDAGDIIYALCFSPNRY
WLCAATQQCVKIWDLESKSIVADLRPDFIPNKKAQIPYCTSLSWSADGSTLFSGYTDGKIRV
WGIGHV
SEQ ID NO:454
MAAIKSTSRSASVAFAPDAPLLAAGTMAGAIDLSFSSLANLEIFKLDFQSDDPELPVVGECPS
NERLNRLSWGSAGGSFGIIAGGLVDGTINIWNPATLINSEDNGDALIARLEQHTGPVRGLEFN
TISTNLLASGAEDGELCIWDLANPTAPTHFPPLKGVGSGAQGEISFLAWNRKVQHILASTSYS
GTTVVWDLRRQKPIISFPDATRRRCSVLQWNPDASTQMVASDDDNSPTLRAWDLRNTISPY
KEFVGHSRGVIAMSWCPSDSLFLLTCAKDNRTLCWDTGSGEIVCELPAGANWNFDVQWSP
KIPGILSTSSFDGKIGIHNIEACSRNVSGEVEFGGAIVRGGPSALLKAPKWLERPAGVSFGFG
GKLASFRPSTVAQAADHRHSEVFIHNLVTEDNLVIRSTEFEAAIADGEKVSLRALCDRKAEES
QSDEEKETWNFLRVMFEDEGTARTKLLEHLGFKVQSEENGDLQETHSSKIDDIGSEIGKTLTL
DDKTEEDVLPQLKGGQDAAIPQDNGEDFFDNLHSPKEEVSLSHVGNDFVGEKDKDMVVNG
AEIEHETEDLTEYSDWNEAIQHSLVVGDYKGAVLQCLSANRMADALIIAHLGGNSLWEKTRD
EYLKKAKSSYLKVVSAMVNNDLTGLVNSRPLKSWKETLAMLCTYSQREEWTVLCDMLASRLI
AAGNVMAATLCYICAGNIEKTVEIWSRSLKYDYDGRSFVDHLQDVMEKTVVLALATGQKRVS
PSLSKLVENYAELLASQGLLTTAMEYLKLLGTEESSHELSILRDRLYLSGTDNKVEASSFPFET
RQDLTESQYNMHQTGFGAPETQKNYQENVHQVLPSGSYTDNYQPTANTHYIAGYQPAPQQ
QPSFQNYFTPASYQPAPSPNVFYPSQVSQAEQSNFAPPVNQPPMKTFVPSTPPILRNVDQY
QTPSLNPQLYQGVSSATVETHPYQTGAPASVSVGTTPGQVVPNFMVPGPVTAPTVTPRG
FMPVTTPTQHPLGSANPPVQPQSPQSSQVQSVTAATTPPPTIQNVDTSNVAAEIRPVIGTLR
RLYDETSEALGGARANPAKRREIEDNSRKIGSLFAKLNSGDISSNAASKLVHLCQALESRDYA
TAFQIQVGLTTSDWDECSFWLAALKRMIKVKQNMR
SEQ ID NO:455
MAGAADSQLQTLSERDSTPNFKNLHTREYAAHKKKVHSVAWNCTGTKLASGSVDQTARVW
NIEPHGHSKTKDLELKGHADSVDQLCWDPKHSELLATASGDRTVRLWDARSGKCSQQVEL
SGENINITFKPDGTHIAVGNRDDELTIIDVRKFKPLHKRKFSYEVNEIAWNTTGELFFLTTGNG
TVEVLSYPSLQVLHTLVAHTAGCYCIAIDPIGRYFAVGSADALVSLWDLSEMLCVRTFTKLEW
PVRTISFNHDGQYIASASEDLFIDIADVQTGRTVHQISCRAAMNSVEWNPKYNLLAFAGDDKN
KYMQDEGVFRVFGFETP
SEQ ID NO:456
MAATSPVGAGSGRELANPPTDGISNLRFSNHSDHLLVSSWDRKVRLYDASANSLKGQFVHG
GPVLDCCFHDDASGFSGSADNTVRRYDFSTRKEDILGRHEAPVRCVEYSYAAGQVITGSWD
KTLKCWDPRGASGQEKTLVGTYSQLERVYSMSLVGHRLVVATAGRHINVYDLRNMSQPEQ
RRESSLKYQTRCVRCYPNGTGFALSSVEGRVAMEFFDLSEAGQAKKYAFKCHRKSEAGRD
TVYPVNAIAFHPIYGTFATGGCDGYVNVWDGNNKKRLYQYSKYPTSIAALSFSRDGRLLAVA
SSYTFEEGEKPHEPDAVFVRSVNEAEVKPKPKVYAAPP
SEQ ID NO:457
MASDDEEGFKNEEAPGVVDEAEVQEGLRACFPLSFGKQEKKQAPLESIHSATKRPEDPRPR
RQLGPPRPPPSILAEQEDSDRFVGPPRPPQFVRDDNDDGEAEIMIGPPRPPAQYSDDHDNE
ETIGPPKPSYLEKGEETDQMVGPSKRGSDDETSGDSDDGDDAVDFRVPLSNEIVLRGHTKV
VSALAIDQTGSRVLTGSYDYSVRMYDFQGMTSQLKSFRQLEPAEGHQVRSLSWSPTSDRFL
CVTGSAQAKIFDRDGLTLGEFVKGDMYLRDLKNTKGHISGLTCGEWHPKEKQTILTCSEDGS
LRIWDVNDFNTQKQVIKPKLAKPGRVPVTACAWGRDGKCIAGGVGDGSIQVWNLKPGWGS
RPDLYVAKGHDDDITGLQFSADGNILLTRSTDETLKVWDLRKAITPLQVFRDLPNNYAQTNVA
FSPDERLIFTGTSVERDGNSGGLLCFYDRQTLELVLRIGVSPVHSVVRCTWHPRHNQVFATV
GDKKEGGAHILYDPALSERGALVCVARAPRKKSLDDFEAKPVIHNPHALPLFRDEPSRKRQR
EKARMDPMKSQRPDLPVTGPGFGGRVGSTKGLLTQYLLKEGGLIKETWMEEDPREAILKY
ADVAAKDPKFIAPAYAQTQPETVFAETDSEEEQK
SEQ ID NO:458
MKERGQSHAGQPSVDERYTQWKSLVPVLYDWLANHNLVWPSLSCRWGPQMHQATYKNS
QRLYLSEQTDGTVPNTLVIATCEVVKPRVAAAEHISQFNEEARSPFVKKFKTIIHPGEVNRIRE
LPQNSKIVATHTDGPDVLIWDVDTQPNRQATLGAADSRPDLVLTGHKDNAEFALAMSPSAPF
VLSGGKDKCVLLWSIQDHISAATEPSSAKASKTPSSAHGEKVPKIPSIGPRGVYKGHKDTVED
VQFCPSNAQEFCSVGDDSALILWDARNGNEPVIKVEKAHNADLHCVDWNPHDENLILTGSA
DNSVRMFDRRNLTSSGVGSPVHKFEGHSAPVLCVQWCPDKASVFGSAAEDSYLNVWDYE
KVGKNVGKKTPPGLFFQHAGHRDKVVDFHWNSFDPWTIVSVSDDGESTGGGGTLQIWRMS
DLIYRPEDEVLAELERFRAHILSCQNK
SEQ ID NO:459
MSSLSRELVFLILQFLDEEKFKESVHKLEQESGFFFNMKYFDEKAQAGEWDEVERYLSGFTK
VDDNRYSMKIFFEIRKQKYLEALDRQDRAKAVDILVKDLKVFSTFNEELYKEITQLLTLDNFRE
NEQLSKYGDTKSARTIMMSELKKLIEANPLFREKLIYPNLKASRLRTLINQSLNWQHQLCKNP
RPNPDIKTLFTDHACGPPNGARTPTQPTASLGVLPKATTFTPIGPHGPFPSSSTATSGLASW
MSNPNMVTSPQAPVAVGPSVPVPPNQATLLKRPRTPPGSSSVVDYQTADSEQLIKRLRPVS
QSIDEATYPGPTLRVPWSTDDLPKTLARALNEPYPVTSIDFHPSQQTFLLVGTKNGEITLWEV
GSREKLATRSFKIWDNANCSNHLEAAFVKDSSVSINRVLWSPDGTLIGIAFTKHLVHTYTFQG
LDLRQHLEIDAHVGGVNDLAFSHPNKQLCVVTCGDDKMIKVWDAVTGRKLYNFEGHDAPVY
SVCPHHKENIQFIFSTAVDGKIKAWLYDHLGSRVDYDAPGHSCTTMMYSADGTRLFSCGTSK
EGESFLVEWNESEGAIKRTYSGLRKKGSGVVQFDTTQNHFLAVGDEHLIKFFWDMDSTNMLT
SCDAEGGLLNLPRLRFNKEGSLLAVTTVNGIKILANADGQKLLKTMENRTFDLPSRAHIDAAS
ATSSPATGRMERIERTSSANTVSGINGVDPAQSSEKLRLSDDLSEKTKIWKLTEITDSIQCRCI
TLPENAAEPASKVSRLLYTNSGVGLLALGSNAVHKLWKWNRSEQNPSGKATASVHPQRWQ
PTSGLLMTNDITDINPEEAVPCIALSKNDSYVMSASGGKVSLFNMMTFKVMTTFMPPPPAST
FLAFHPQDNNIIAIGMEDSTIHIYNVRVDEVKTKLKGHQKRITGLAFSSTQNILVSSGADAQLCV
WNTETWEKRKSKTIQMPVGKTVSGDTRVQFHSDQLHILVVHETQLAIYDAYKLERQYQWVP
QDALSAPILYATYSCNRQLIYATFSDGNIGVYDAEILRPRCRIAPTTYLSSGTSSSTSLPLVVAA
HPHEPNQFAIGLSDGAVQVLEPSESEGKWGVSPPPENGVVPAVVAGPSTSNQGSEQAPR
SEQ ID NO:460
MAKDEEEFRGEMEERLVNEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDREEPPGKDY
SVQKMILGTHTSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQI
NHDGEVNRARYMPQNPFIIATKTVSAEVYVFDYSKHPSKPPQDGGCHPDLRLRGHNTEGYG
LSWSPFKHGHLLSGSDDAQICLWDINVPAKNKVLEAQQIFKVHEGVVEDVAWHLRHEYLFG
SVGDDRHLLIWDLRTSATNKPLHSVVAHQGEVNCLAFNPFNEWVLATGSADRTVKLFDLRKI
SSALHTFSCHKEEVFQIGWSPKNETILASCSADRRLMVWDLSRIDEFQTPEDALDGPPELLFI
HGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMAENIYHDEEDDMPPEEVV
SEQ ID NO:461
MSPGVKQTGSQKFESGHQDVVHDVTMDYYGKRIATCSADRTIKLFGLNASDTPSLLASLTG
HEGPVWQVAWAHPKFGSMLASCSYDGRVIIWREGQQENEWSQVQVFKEHEASVNSISWA
PNELGLCLACGSSDGSITVFTCREDGSWDKTKIDQAHQVGVTAVSWAPASAPGSLVGQPSD
PIQKLVSGGCDNTAKVWKFYNGSWKLDCFPPLQMHTDWVRDVAWAPNLGLPKSTIASCSQ
DGKVVIWTQGKEGDKWEGRILNDFKIPVWRVNWSLTGNILAVADGNNSVTLWKEAVDGDW
NQVTTVQ
SEQ ID NO:462
MSSGVKQTGSQKFESGHQDVVHDVTMDYYGKRIATCSADRTIKLFGMNTSDTPTLLASLTG
HEGPVWQVAWAHPKFGSMLASCSYDRRVIIWREGQQENEWSQVQVFKEHEASVNSISWA
PHELGLCLACGSSDGSITVFTGREDGSWDKTKIDQAHQVGVTAVSWAPASAPGSLVGQPSD
PVQKLVSGGCDNTAKVWKFYNGSWKLDCFPPLQMHTDWVRDVAWAPNLGLPKSTIASCSQ
DGRVVIWTQGKEGDKWEGKILNDFKTPVWRISWSLTGNILAVADGNNNVTLWKEAVDGEW
NQVTTVQ
SEQ ID NO:463
MKKRSRPSNGHLSTAAKNKSRKTAPITKDPFFDSAHNRNKSKGKGKSRGKGEEIFSSDEDD
DAIGRDAPAEEEEEIAEEERETADEKRLRVAKAYLDKIRAITKANEEDNEEEAGEDEETEAER
RGKRDSLVAEILQQEQLEESGRVQRQLASRVVTPSKLVECRVVKRHKQSVTAVALTEDDLR
GFSASKDGTIIHWDVETGASEKYEWPSQAVSVSSSNEVSKTQKGKGSKKQGSKHVLSMAV
SSDGRYLATGGLDRYIHLWDTRTQKHIQAFRGHRGAVSCLAFRQGTQQLISGSFDRTIKLWS
AEDRAYMDTLYGHQSEILAVDCLRKERVLSVGRDHTLRLWKVPEETQLVFRGHAASLECCC
FINNEDFLSGSDDGSIELWSMLRKKPVFMAKNAHGHAIVENLSEDTSTREEPDEEVTTRQLP
NGNSIGNGMTNQMGITPSVESWVGAVTVCRGTDLAASGAGNGVVRLWAIENSSKSLRALH
DIPLTGFVNSLTFARSGRFLIAGVGQEPRLGRWGRIQAARNGVTLCPIELS
SEQ ID NO:464
MAATFGTINTATSPHNPNKSFEIVQPPNDSISSLSFSPKANYLVATSWDNQVRCWEVLQTGA
SMPKAAMSHDQPVLCSTWKDDGTAVFSAGCDKQAKMWPLLTGGQPVTVAMHDAPIKDIAW
IPEMNLLATGSWDKTLKYWDTRQSNPVHTQQLPERCFALSVRHPLMVVGTADRNMIFNLQN
PQTEFKRISSPLKYQTRCVAAFPDKQGFLVGSIEGRVGVHHVEEAQQSKNFTFKCHRDSNDI
YAVNSLNFHPVHQTFATAGSDGAFNFWDKDSKQRLKAMARSNQPIPCSTFNSDGSLYAYAV
SYDWSKGAENHNPATAKHHILLHVPQESEIKGKPRVTTSGRK
SEQ ID NO:465
MVVMDKGTHQTNEDESESEFIDEDDVIDEISIDEEDLPDADVEGEDVQEDNKRSEPDENSSS
LDDAIHTFEGHEDTLFAVACSPVDATWVASGGGDDKAFMWRIGHATPFFELKGHTDSVVAL
SFSNDGLLLASGGLDGVVRIWDASTGNLIHVLDGPGGGIEWVRWHPKGHLVLAGSEDYSTW
MWNADLGKCLSVYTGHCESVTCGDFTPDGKAICTGSADGSLRVWNPQTQESKLTVKGYPY
HTEGLTCLSISSDSTLVVSGSTDGSVHVVNIKNGKVVASLVGHSGSIECVRFSPSLTWVATG
GMDKKLMIWELQSSSLRCTCQHEEGVMRLSWSLSSQHIITSSLDGIVRLWDSRSGVCERVF
EGHNDSIQDMVVTVDQRFILTGSDDTTAKVFEIGAF
SEQ ID NO:466
MPVFRTAFNGYAVKFSPFVETRLAVATAQNFGIIGNGRQHVLELTPNGIVEVCAFDSSDGLY
DCTWSEANENLVVSASGDGSVKIWDIALPPVANPIRSLEEHAREVYSVIDWNLVRKDCFLSAS
WDDTIRLWTIDRPQSMRLFKEHTYCIYAAVWNPRHADVFASASGDCTVRIWDVREPNATIIIP
AHEHEILSCDWNKYNDCMLVTGSVDKLIKVWDIRTYRTPMTVLEGHTYAIRRVKFSPHQESLI
ASCSYDMTTCMWDYRAPEDALLARYDHHTEFAVGIDISVLVEGLLASTGWDETVYVWQHG
MDPRAC
SEQ ID NO:467
MDSRNRRSRLNLPPGMSPSSLHLETTAGSPGLSRVNSSPSTPSPSRTTTYSDRFIPSRTGS
RLNGFALIDKQPQPLPSPTRSAAEGRDDASSSSASAYSTLLRNELFGEDVVGPATPATPEKS
TGLYGGSRDSIKSPMSPSRNLFRFKNDHGGNSPGSPYSASTVGSEGLFSSNVGTPPKPARK
ITRSPYKVLDAPALQDDFYLNLVDWSSNNVLAVGLGTCVYLWSACTSKVTKLCDLGVNDSVC
SVGWTPQGTHLAVGTNIGEVQIWDTSRCKKVRTMGGHCTRAGALAWSSYILSSGSRDRNIL
HRDIRVQDDFIRKLVGHKSEVCGLKWSYDDRELASGGNDNQLLVWNQQSAQPLLRFNEHT
AAVKAIAWSPHQHGILASGGGTADRCLRFWNTATDTRLNCVDTGSQVCNLVWCKNVNELV
STHGYSQNQIMVWRYPSMSKLATLTGHTLRVLYLAISPDGQTIVTGAGDETLRFWSIFPSPKS
QSAVHDSGLWSLGRTHIR
SEQ ID NO:468
MEKKKVVVPIVCHGHSRPIVDLFYSPVTPDGLFLISASKDSSTMLRNGETGDWIGTFEGHKG
AVWSCCLDNRALRAASGSADFSAKIWDALTGDELHCFVHKHIVRACAFSESTSLLLTGGHEK
ILRIFDLNRPDAPPKEVDNSPGSIRTVAWLHSDQTILSSNSDAGGVRLWDLRTEKIVRVLETK
SPVTSAEVSQDGRYITTADGNSVKFWDANHFGMVKSYTMPCMVESASLEPTMGNMFVAGG
EDMWVRLFDFHTGEEIACNKGHHGPVHCVRFAPGGESYSSGSEDGTIRIWQTLNMNSEEN
ESYGVNGLSGKVRVGVDDVVQKVEGFQITADGHLNDKPEKPNP
SEQ ID NO:469
MERYSQGTQKKSEIYTYEAPWQIYGMNWSVRKDKKFRLGIGSFLEEYNNRVEIIELDEESGE
FKSDPRLAFDHPYPTTKIMFVPDKECQRPDLLATTGDYLRIWQVCEDRVEPKSLLNNNKNSE
FCAPLTSFDWNDADPKRIGTSSIDTTCTIWDIEKEVVDTQLIAHDKEVYDIAWGEVGVFASVS
ADGSVRVFDLRDKEHSTIIYESSQPETPLLRLGWNKQDPRFIATILMDSCKVVILDIRFPTLPVA
ELQRHQASVNTIAWAPHSPCHICTAGDDSQALIWELSSVSQPLVEGGGLDPILAYTAAAEINQ
LQWSSMQPDWVAIAFSNEVQILRV
SEQ ID NO:470
MQSENNLDESLHLREVQELQGHTDTVWAVAWNPVTGIDGAPSMLASCSGDKTVRIWENTH
TLNSTSPSWACKAVLEETHTRTVRSCAWSPNGKLLATASFDATTAIWENVGGEFECIASLEG
HENEVKSVSWSASGMLLATCGRDKSVWIWDVQPGNEFECVSVLQGHTQDVKMVQWHPNR
DILVSASYDNSIKVWAEDGDGDDWACMQTLGNSVSGHTSTVWAVSFNSSGDRMVSCSDDL
TLMVWDTSINPAERSGNAGPWKHLCTISGYHDRTIFSVHWSRSGLIASGASDDCIRLFSEST
DDSVTPVDGTSYKLILKKEKAHSMDVNSVQWHPSEPQLLASASDDGRIKIWEVTRINGLANS
H
SEQ ID NO:471
MKRAYKLQEFVAHASNVNCLKIGKKSSRVLVTGGEDHKVNMWAIGKPNAILSLSGHSSAVES
VTFDSAEALVVAGAASGTIKLWDLEEAKIVRTLTGHRSNCISVDFHPFGEFFASGSLDTNLKI
WDIRRKGCIHTYKGHTRGVNSIRFSPDGRWVSGGEDNIVKLWDLTAGKLMHDFKCHEGQI
QCMDFHPQEFLLATGSADRTVKFWDLETFELIGSAGPETTGVRAMIFNPDGRTLLTGLHESL
KVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVS
RTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQ
RAGIAFSSKNLPASSGPPSYVSTPKKNSTSRVQPTTNFQTLSRPDIVPVIVPRSNSLRPETTS
DVKKEMNNFGRVVPSTVSTKSTDVIKSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHV
SSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTFPWSATDDGVTCQ
PDRQVTAPELSKRVVEPGRARALVASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRG
SHGTSESDLTVSDDNSAIEELMQQHNAFTSILQARLTKLQVIRRFWQRNDLKGAIDATGKMG
DHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDVIRATISA
TPTIGVDLQAEQRLERCNLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV
SEQ ID NO:472
MSTLEIEARDVIKIVLQFCKENSLHQTFQTLQNECQVSLNTVDSLETFVADINSGRWDVILPQV
AQLKLPRKKLEDLYEQIVLEMIELRELDTARAILRQTQAMGFMKQEQPERYLRLEHLLVRTYF
DPREAYHESSKEKRRSQIAQALASEVTVVPPSRLMALIGQSLKWQQHQGLLPPGTQFDLFR
GTAAVKADEEEMYPTTLAHTIKFGKQSHPECARFSPDGQYLVSCSVDGFIEVWDYISGKLKK
DLQYQADDSFMMHDDAVLCVDFSRDSEMLASGSQDGKIKVWRIRTGQCLRRLERAHSQGV
TSLSFSRDGSQLLSTSFDSTARIHGLKSGKALKEFRGHTSYVNDAIFTSDGGRVITASSDCTV
KVWDVKTTDCIQTFKPPPPLKGGDVSVNSVHLFPKNSEHIVVCNKASSIYIMTLQGQWKSFS
SGKREGGDFVAACISPKGEWIYCVGEDRNIYCFSQQSGKLEHLMKAHDKDIIGVTPHPHRNL
LVTYSEDSTMKIWKP
SEQ ID NO:473
MDIELEDQPFDLDFHPSAPIVAVALITGRLQLFRYVDISSEPERLWTVTAHTESCPAARFINAG
SSVLTASPDCSILATNVETGQPVARLDNAHGAAINCLTNLTESTIASGDENGIIKVWDTRQNS
CCNKFKAHEDYISDMEFVPDTMQLLGTSGDGTLSVCNLRKNKVHARSEFSEDELLSVALMK
NGKKVVCGSQEGVLLLYSWGYFKDCSDRFVGHPHSVDALLKLDEDTVLTGSSDGIIRVVSIL
PNKMIGVIGEHSSYPIERLAFSHDRNVLGSASHDQILKLWDIHYLHEDDEPETNKQEAVNDEN
VDMDLDVDTEKRPRGSKRKKRAEKGQTSSQKQSSDFFADI
SEQ ID NO:474
MDRIQQIPHTCVARKINLPLGMSKESLALNLPANLAPTMSPPSITYSDRFIPSRKASNFEEFAL
PDKTSPSPNSAGGQSSSTNGEGRDDACAAYSALLRTELFPATPDKTEGCRRPVIGSPSGNV
FRFKSQQCKSQSPFSLCPVGEDGDLSETGAVARKTTRKIPRSPFKVLDAPALQDDFYLNLVD
WSSHNILAVGLSACVYLWSASSSKVTKLCDLGLDDNVCSVAWTQRGTYLAVGTNNGGVQI
WDAAHCKQVRTMEGHCTRVGTLAWNSHILSSGGRDRNILQRDIRAQDDFVSKFSGHKSEV
CGLKWSYDNRELASGGNDNQLFVWNQQSQQPVLKYNEHTAAVKAIAWSPHQHGLLASGG
GTADRCIRFWNTATNTSLNCVDTGSQVCNLVWSKNVNELVSTHGYSQNQIIVWRYPTMSKL
ATLTGHTLRVLYLAISPDGQTIVTGAGDETLRFWNVFPSSKTQQNTIRDMGVWSSGRTHIR
SEQ ID NO:475
MAGGQGEGEEKVDKLSMELTEDVMKSMEIGAVFKDYNGKINSLDFHRTNNYLVTASDDEAI
RLFDTASATWQKTSYSKKYGVDLICFTNHQTSVLYSSKNGWDESLRHLSLMDNKYLRYFKG
HHDRVVSLCMSPKGECFMSGSLDRTVLLWDLRIDKCQGLIRVRGRPAVAYDEQGLVFAISN
EGGLIKMFDARLYDKGPFDTFVVEGDKSEASGIKFSNDGKLILLSTMDSNIHVLDAYQGTTVH
SFSVEAVPNGGEAVPNGGTLEASFSPDGKFVISGSGNGNIHAWSVNSGKEVACWTTEGVIP
AVVKWAPRRLMFASGSSVLSLWVPDLSKLASLTGSNSNSAY
SEQ ID NO:476
MHRVGSTGNTSNSSRPRREKRLTYVLNDANDSRHCSGINCLVISKLSLLGGNDYLFSGSRD
GTLKRWELADDSAVCSATFESHVDWVNDAVLTGETLVSCSSDTTLKTWRPFSDGVCTRTLR
QHSDYVTCLAAASKNSNIVASGGLGREVFIWDIEAAMAPVSRTSEAMDDDTSNGVLSSGNS
VLSTTVRSTNATNSASLHTSQLQGYTPIAAKGHKESVYALAMNDVGTLLVSGGTEKVVRVW
DPRSGAKQMKLRGHTDNVRALILDSTGRFCLSGSSDSIIRLWDLGQQRCVHSYAVHTDSVW
ALASTPNFSHVYSGGRDLSLYLTDLTTRESLLLCMEKHPLLRLTLQDDSIWVATTDSSLHRW
PAEGQNPPKMFQRGGSFLAGNLSFTRARACLEGSAPVPVNTQPSFVIPGSPGIVQHEILNNR
RHVLTKDAEGTVKLWEITRGAVLDDYGKVSFEEKKEELFEMVSIPAWFTMDTRLGSMSVHLD
TPQCFTAEMYAVDLNVPDAPEEQKINLAQETLRGLLAHWLSRRRQRLATQASANGDFPAGQ
ENALRNHISSRIDVHDDAETHIAGILPAFDFSTTSPPSIITEGSQGGPWRKKITDLDGTEDEKD
FPWWCLECVLHGRLSPRESLKCSFYLHPYEGTTVQVLTQGKLSAPRILRIQKVINYVLEKMVL
DRPLDSSNSETTFTPGLSGNQSHAAVVGDGSLRSGARVWQQKAKPLVEILCNNQVLSPDM
SLATVRTYIWKKPDDLYLYYRLVQNR
SEQ ID NO:477
MMKGKTIQMQAAHQNHDGETSVACVLWDWHAKHLITAGADNTILIHSYPSSSSSKPITLRHH
KNAVTALAINSNVRSLASGSVDHSVKLYSYPGGEFQSNVTRFTLPIRSLAFNKSGELLAAAGD
DEGIKLISTIDNSIARVLKGHNGPVTSISFDPKNEFLASSDSDGTVIYWELSTGKPVHTLKKIAP
NTTSNPTSLNQISWRPDGEMLAVPGRKSEVSMYDRDTAEKLFSLKGGHSDTICSLAWSPNG
KYIATAGTDRQVMVVDADRRQDIDKQRFDNPICSVAWKPSDNALAVIDVLGRFGVWESPIAS
HMKSPADGAERYDNMEDEEPLMARYEEELEDSVSGSLNEIINDDDDDDEMGKIPRKIPRKILQKKP
SVKVEKGKEESNAKAFKSGQDSFKLKSAMQEAFQPGATQRQSGKRNFLAYNMLGSVITFDN
DGFSHIEVDFHDIGKGCRVPSMTDYFGFTMASLSESGSVFGSPQKGEKNPSTLMYRPFSSW
ANNSEWSMRFPMGEEVKAVALGSGWVAAVTSLNFLRVFSEGGLQKFVLSMDGPVVTAAGY
ENLLVVVSHASNPLLSGDQVLSFTVYDISQKTCPLSGRLPLSPGSHLTWLGFSEEGLLSSYD
SEGNLRVFTNDYNGCWVPIFSAARERKSETESIWMVGLNSTQVFCVVCKLPDTYPQVAPKP
VLSVLNLSLPLACSDLGADDLENEYLRGSLLLSQMQKKAEDAVACGRESNMEEDSIFKMEAA
LDRCLLRLIANCCKGDKLVRATELARLLSLEKSLQGAIKLVSAMKLPMLAERFNTILEEKILQEN
METISCRRLTSEAQDMDTPISISVKQVSYGANLGDSPFLPNRQVEPKHSTPVFSKPDTKIEVD
TSEAIAKGCDAQNGNIKSGDAEVQPASHNDSIQKPSNPFAKASNTSANQAVQRNASLLSSIK
QMKTATENEGKRKERARSGSLPQKPAKQSKIS
SEQ ID NO:478
MKQKRKGHQVDDPKYSVQTPQEDDTPNESGPASEEVESSDEEGGNSSNIEDDIIYSSSEED
PVVSSDYEEDEDAESDAEGVTAEQELEGDIDNALQNYMGTLTVLSNFHGENLKNAEGEDTS
GDDDDEEEMPKRAEESDSPEDENDERPKRAEESDFSEDEDEERPKRAEESDSSEDEVPSR
NTVGDVPLRWYKDEQHIGYDIKGKKIKKQPKKDQLDSFLASTDDSSDWRKVYDEYNDEEVE
LTKDEIKFISRLRKGTIPHADVNPYEPYVDWFDWKDKGHPLSNAPEPKRRFIPSKWEAKKVV
KLVRAIRKGWITFQKAEEKPRFYLMWGDDLKPSEKMANGLSYIPAPKPKLPGHEESYNPPPE
YIPTQEEINSYQLMYEEDRPKFIPKRFDSLRNVPAYDRFLSEIFERCLDLYLCPRTRKKRINIDP
ESLIPKLPKPKDLQPFPSICFLEYKGHTGAVSCISPESSGQWLASGSKDGTVRIWEVETARCL
KVWDIGRPIQHIAWNPVSQLSILAVAVDEEVLVLNTGLGSEDSQEKVAELLHVKSKPVSADDL
GDNTSLTKWIKHEKFDGIKLTHLKPVHLISWHHKGDYFATVAPDGNTRAVLVHQLSKQQTQN
PFKKMQGRVVHVLFHPSRAIFFVATKTHVRVYDLVKQQLVKRLVTGLHEVSSMAVHHKGDN
LLVGSKEGKVCWFDMDLSTQPYKTLKNHSKDIHSVAFHDSYPLFASCSDDCKAYVFYGLVY
SDLLQNPLIVPLKVLQGHQSVNGMGVLDCQFHPKQPWLFTAGADSVVKLYCN
SEQ ID NO:479
MMSLKRGFEESLVPAKRQKTELSTVTYGDGPRRTSSLESPIMLLTGHHAAIYTMKFNPTGTVI
ASGSHEREIFLWNVHGDCKNFMVLKGHKNAVLDLHWTTDGCQIISASPDKTLRAWDVETGK
QIKKMAEHSSFVNSCCPSRRGPPLVVSGSDDGTAKLWDLRHRGAIQTFPDKYQITAVGFSD
AADKIYSGGIDNEIKVWDLRRGEVTMRLQGHTDTITGMQLSSDGSYLLTNSMDCSLRIWDMR
PYAPQNRCVKILTGHQHNFEKNLLKCSWSSDGSKVTAGSADRMVYIWDTTTRRILYKLPGHT
GSVNETGFHPTQPIIGSCSSDKQIYLGEIEPNVGYQAVI
SEQ ID NO:480
MEFSDTYKHTGPCCFSPDARYLAIAVDYRLVIRDVVTLKVVQLYSCMDKISNIEWALDSEYILC
GLYKRAMVQAWSLSQPEWTCKIDEGPAGIAHARWSPDSRHIITTSDFQLRLTVWSLVNTACI
HIQWPKHASKGVSFTQDGKFAAIATRRDCKDYVNLLSCHTWEVMGTFTVDTIDLADLEWSP
NDSAIVVWDSPLEYKVLIYSPDGRCLFKYQAYDSWLGVKTVAWSPCSQFLAVGSYDQTLRT
LNHLTWKPFAEFVHVSTVRGPASAVVFKEVEEPWNLDVSGLHLNDDNAHDIQDGKPAEGHS
RVRYKVVEFPVNVSSQKHPVDKPNPKQGIGLLAWSRDSQYLFTRNDNMPTALWIWDICRLE
LAALLIQKEPIRAAAWDPVYPRVALCTGSSHLYMWTPSGACCVNIPLPQFVVSDLKWNPDGT
SMLLKDRESFCCTFVPMLPEFNDDETNEE
SEQ ID NO:481
MAKLIETHSCVPSTERGRGILIAGDAKTNSIIYCNGRSVIMRNLDNPLEASVYGEHSYPATVAR
FSPNGEWVASGDTSGTVRIWGRGSDHTLKYEYKALAGRIDDLEWSADGQRIVVCGDSKGK
SMVRAFMWDSGTNVGEFDGHSRRVLSCSFKPTRPFRVATCGEDFLVNFYEGPPFRFKTSH
RDHSNYVNCVRFAPDGSKFITVGSDRKGVIFDGKMGEKIGELSKEGGHTGSIYAASWSPDS
KQVLTVSADKSAKIWEISETGNGTVKKTLTFGSQGGADDMLVGCLWLNDYLITVSLGGIVSLL
SAVDPDKPPKTISGHMKSINAIALSLQSGQSEVCSSSYDGVIVRWILGVGYAGRVERKDSTQI
KCLATIEGELVTCGFDNKVRRVPLLSEQHKESEPIDIGAQPKDLDVAVGCPELTFVSTDAGIIII
RASKIVSTTNVGYAVTAAAISPDGTEAVVGGQDGKLRVYSIKGDTLLEESVLERHRGPINAIRF
SPDGSMFASGDLNREAVVWDRITREVKLKNMVYHTARINCIAWSPDSSKVATGSLDTCILIYE
VGKPASSRITIKGAHLGGVYGLAFSDQSTVISAGEDACVRVWSLP
SEQ ID NO:482
MPQPSVILATAGYDHTVRFWEATSGRCYRTLQYPDSQVNHLEITPDKQYLAAAGNPHIRLFE
VNSNNPQPVISYDSHTNNVTAVGFQCDGKWMYSGSEDGTVKIWDLRAPGFQREYESRAAV
NTVVLHPNQTELISGDQNGNIRVWDLNANSCSCELVPEDTAVRSLTVMWDGSLVVAANNHG
TCYVWRLMRGTQTMTNFEPLHKLQAHNSYILKCLLSPEFCEHHRYLATTSSDQTVKIWNVD
GFTLERTLTGHQRWVWDCVFSVDGAFLVTASSDSTARLWDLSTGEAIRTYQGHHKATVCCA
LHDGTDGASC
SEQ ID NO:483
MLTKFETKSNRVKGLSFHPKRPWILASLHSGVIQLWDYRMGTLIDKFDEHDGPVRGVHFHKT
QPLFVSGGDDYKIKVWNYKMRQCLFTFVGHLDYIRTVHFHNEYPWIVSASDDQTIRLWNWQ
SRVCISVLTGHNHYVMSASFHPKEDLVVSASLDQTVRVWDISGLRKKTVSPADDLSRLAQM
NTDLFGGGDVVVKYVLEGHDRGVNWAAFHTSLPUVSGADDRQVKLWRMNDTKAWEVDTL
RGHTNNVSCVIFHARQDIIVSNSEDKSIRVWDMSKRTSVQTFRREHDRFWILAAHPEMNLLA
AGHDSGMIVFKLERERPAYVVYGGSLLYVKDRYLRTYEFATQKDNPUPIRKPGSIGPNQGPR
SLSYSPTENAILICSDADGGAYELYAVPKDSHGRSDTVQEAKKGLGGSAVFVARNRFAVLDK
NHNQVTIKNLKNEVTKKFDLPVTADALFYAGTGNLLCRSEDSVFLFDMQQRTVLGEIQTPNV
RYVVWSNDMENVALLSKHTIIIASKKLSSTCSLHETIRVKSGAWDDNGIFMYSTLNHIKYCLPN
GDSGIIKTLDVPVYITKVSGKSLYCLDRDGKNRVIQIDITECLFKLALSKKKYDYVINMIRNSQL
CGQAIIAYLQQKGFPEVALHFVRDERTRFNLAVESGNIEIAVASAKEIDEKDHWYRLGVEALR
QGNAGIVEYAYQRTKNFERLSFLYLITGNLDKLSKMLRIAEMKNDVMGQFHNALYLGDIQERI
KILEESGHLHLAYATASLHGLADIADRLAADLGGNIPVLPPGKKSSLLMPPAPILHGGDWPLLR
VTKGIFEGGLENSTSAAYEEEDEEAAADWGEDIDIENIEGENGEATVLDDQEVKGGEDDEGG
WDMEDLELPPDVAAANVGTNQKTLFVAPTLGMPVSQIWMQKSSLAGEHAAAGNFETALRLL
TRQLGIKNFSPLKPLFLELYMGSHTFLPSFASVPAFSLALQRGWSESASPNIRGPPALVYRLS
VLEEKLTVAYRATTEGRFSEALRLFLNILHTIPVIVVDSRKEIDEVKEUGIAKEYVLGLRMEVKR
KEIRDDAVRQQELAAYFTHCNLQKAHLKLALLNAMGISYRCKNYNTAANFARRLLETDPSSN
HATKARQVLQVCERNLQDATQLNYDFRNPFVVCGATFTPIYRGQKEVSCPYCMARFVPDIA
GKLCSICDLAIVGSDASGLFCFATQTR
SEQ ID NO:484
MDLLQNYQDDSEDSNPELRNHPPLEDATATSAPAGVENETSSSPDSSPLRLALPAKSCAPD
VDETLMALGVPGSEKKNNHNKPIDPTQHSVTFNPSYDQLWAPLYGPAHPYAKDGIAQGMRN
HKLGFVEDSAIEPFMFDEQYNTFHRYGYAADPSASLGSHIVGDLESLKKNDGASVYNLPKRE
HKRQKLEKKMIQKDENEEEEKEVGEEVDNPSTEEWLKKNRKSPWAGKKEGLQTELTEEQK
KYAQEHAEKKGDREKGEKVEIVDKTTFHGKEERDYQGRSWIDPPKDAKATNDHCYIPKRWV
HTWSGHTKGVSAIRFFPKYGHLLLSAGMDTKVKIWDVFNSGKCMRTYMGHSKAVRDISFSN
DGSRFLSAGYDRNIKLWDTETGKVISTFSTGKIPYVVKLHPDEDKQNVLLAGMSDKKIVQWD
MNSGEITQEYDQHLGAVNTITFVDNNRRFVTSSDDKSLRVWEFGIPVVIKYISEPHMHSMPSI
SLHPNTNWLAAQSLDNQIMYSTRERFQLNKKKRFAGHIAAGYACQVNFSPDGRFVMSGDG
EGRCWFWDWKTCKVFRTLKCHDNVCIGCEWHPLEQSKVATCGWDGMIKYWD
SEQ ID NO:485
MARKGLGTDPAIGSLMSSKKRKEYKVTNRFQEGKRPLYAIAFNFIDARYHNIFATAGGTRVTI
YQCLEGGAISVLQAYVDDDKDESFYTLSWACDVNGSPLLVAGGHNGIIRVLDVANEKVHKSF
VGHGDSVNEIRTQALKPSLILSASKDESVRLWNVQTGICILIFAGAGGHRNEVLSVDFHPSDV
YRIASCGMDNTVKIWSMKEFWTYVEKSFTWTDLPSKFPTKYVQFPVFIAAVHSNYVDCTRW
LGNFILSKSVDNEVVLWEPYSKEQSTSDGVVDILQIYPVPECDIWFIKFSCDFHYNSMAVGN
REGKVYVWELQSSPPNLIARLSHAHCKNPIRQTAISHDGSTILCCCDDGSMWRWDVVQ
SEQ ID NO:486
MESGAGGSVGARVPSAKPEMLQQPPYSNGDDDNDMERGTAPVPSSNPNTVSKWELDKDF
LCPICMQTMKDAFLTACGHSFCYMCIMTHLNNKSNCPCCSLYLTNNQLFPNFLLNKLLKKTS
ACQMASTASPVENLCLSLQQGAEVSVKELDFLLTLLAEKKRKMEQEEAETNMEILLDFLQRL
RQQKQAELNEVQADLHYIKDDILALEKRRLELSRARERYSRKLHMLLDDPMDTTLGHAAIDD
GNNVRTAFVRGGQGDAISGKFQQKKAEIKAQASSQGMQKRANFCHSDSQVLPTLSGLTIAR
KRRVLAQFDDLQECYLQKRRRWATQLRKQCDGGLRKERDGNSISREGYHAGLEEFQSILTT
FTRYSRLRVISELRHGDLFHSANIVSSIEFDRDDELFATAGVSRRIKVFDFATVVNEPADVHCP
VVEMSTRSKLSCLSWNKCIKSQIASSDYEGIVTVWDVNTRQSVMMYEEHEKRAWSVDFSRT
EPTRLISGSDDGKVKVWCTRQETSVLNIDMKANICCVKYNPGSSYYVAVGSADHHIHYYDLR
NPSVPLYEFNGHRKTVSYVKFISTNELASASTDSTLRLWDVRDNCLVRTFKGHTNEKNFVGL
TVNSEYIACGSETNGVFVYHKAISKPAAWHQFGSPDLDDSDDDTSHFISAVCWKSESPTMLA
ANSQGTIKVLVLAP
SEQ ID NO:487
MANYVDSKKNFKCVPALQQFYTGGPFRLSSDGSFLVCACNDEVKVVDLATGSVKNTLEGDS
ELIVALALTPDNKYLFSASRSTQIKFWDLSSATCKRTWKAHNGPVADMACDASGGLLATAGA
DRSILVWDVDGGYCTHSFRGHQGVVTTVIFHPDPHCLLLFSGSDDATVRIWDLVAKKCISVL
EKHFSTVTSLAISENGWNLLSAGRDKVVNIWDLRDYHCRATIPTYEPLEAVCVLPTGSRLVSV
MNQSRALPENRKKSGAAPVYFLTVGERGIVRIWYSEGALCLYEQKSSDAIISSDKDELKGGF
VSAVLLPLTQGVMCVTADQRFLFYNLDESDEGKCDLKVSKRLIGYNEEIVDLKFLGDEEKFLA
VATNLEQVRMYDLSSMTCVYELSGHTDIVLCLDTVVFSGHSLLASGSKDHTVRIWDTESKSCI
CVAAGHMGAVGAVAFSKKAKNFFVSGSSDRTIKVWSFASVLDFGGISKSIKLSSQAAVAAHD
KDINSVAVAPNDSLICTGSQDRTARIWRLPDLVPVLVLRGHKRGVWCVEFSPVDQCVMTAS
GDKTIKIWALSDGSCLKTFEGHTASVLRASFLTRGTQFVSSGADGLLKLWTIKSNECIATFDQ
HEDKIWAMAVGKKTEMLATGGSDSLVNLWHDCTTTDEEEALLKEEEAALKDQELLNALADT
DYVKAIQLAFELRRPYKLLNVFELYSKGHAQDQIQKVIRELGNEELRLLLEYVREWNTKPKF
AHVAQFVLFQLFNVLPPKEIIEVQGISELLEGLIPYAQRHYSRIDRLMRSTFLLDYTLSSMSVLS
PTETDLSSSNLLARTADPLHAQIDQFHPTHFPEPNLTPIQSLLDSGNTDSVEVTARRAKKKRV
SGNDSEKTTVAEVKIGDMENAFDEPDVADQGSSRKHKPASSKKRKSIAVGNASIKRIASGNA
VTIALQV
SEQ ID NO:488
MESSCSSMNSNRHSTEKRCLRPLQKQGASMNKHSSDRFIPARGSIDLDVARFMVTQKQKD
NNDIHALSPSPSPSKKAYQKEMADTLLKNAGAADNNCRILSFNGKSSTVSQGSQENVLANLS
ISRRARRYIPQSADRTLDAPDLLDDYYLNLLDWSSTNVLSTALGNTVYLWDASNSSISELLIAD
EEEGPVTSVSWAPDGSQIAVGLNNSVVQLWDSQSNKKLRALKGHHDRVGALSWNGPILTT
GGLDGIIINHDVRTRDHIVQTYKGHTQEVCGLKWSPSGQQLASGGNDNLLYIWDKSMASHN
PSSQYFHQLDEHCAAVKALAWCPFQTNLLASGGGTSDGSIKFWNTQTGACLNTVDTHSQV
CSLLWNRHERELLSSHGLNQNQLTLWKYPSMVKKITELTGHTARVLHMAQSPDGYTVASAAA
DETLKFWQVFGAPDASKKTKTKDTKGAFNMFHMHIR
SEQ ID NO:489
MLDEIVADEEEEFNIWKKNTPLLYDVVITHALEWPSLTVQWLPDRHQSPTKDYSLQKMIVGT
HTSGDEPNYLMIAEVQMPLQYSEDGNVGGFESTEAKVHIIQQINHEGEVNRAQYMPQNSFII
ATKTVSSDVYVFDYTKHSSNAPQERVCNPELILKGHTNEGYSLSWSPLKEGQLLSGSNDAQI
CFWDINAASGRKVVEAKQIFKVHEGAVEDVSWHLKHEYLFGSVGDDCHLLIWDTRTAAPNK
PQHSVVAHESEVNSLAFNPFNEWLLATGSADKTVKLFDLRKLSCSLHTFSNHTEEVFQIEWS
PMNETILASSGGDRRLMVWDLRRIGDEQTSEDAEDGPPELIFIHGGHTSKISDFSWNLHDDW
LIASVAEDNILQIWQMAENIYHDDADIL
SEQ ID NO:490
MTKEDHGESRDEMGERMVNEEYKLWKKNTPFLYDLVITHALEWPSLTVQWLPPSCKQQQD
IIKDDDIDHPNTQMVILGTHTSDNEPNYLILAEVQLHDGTEDEDGDGDVKRPQDKMKPGTSG
GAMGKVRILQQINHQKEVNRARYMPQKPTIIATKTVNADVYVFDYSKHPSKPPQEGRCNPEL
RLQGHESEGYGLSWSPLKEGHLLSASDDAQICLWDITAATKAPKVVEANQIFRYHDGPVEDV
AWHAIHDHLFGSVGDDHHLLLWDIRNDSEKPLHIVEAHQAEVNCLAFNPFNEWIVATGSADR
TVALHDIRKLDKVLHTCAHHMEEVFQIGWSPQNGAILASCGSDRRLMVWDLSRIGDEQNPE
DAEEAPPELLFIHGGHTSKISDFSWNPAEEWVIASVAEDNILQVWQMSEHIYNDDNDSPTA
SEQ ID NO:491
MAMAMGDENAADPVEEFNIWKKNTPFLYDLVITHALEWPSLTVQWLPDRHQSSTADYSLQK
MIVGTHTSEDEPNYLMIAEVQIPLQNSEDNIIGGFESTEAKVQIIQKINHEGEVNKARYMPQNS
FVIATKTVSSDVYVFDYSKHPSKAPQERVCNPELILKGHSNEGYGLSWSPLKEGYLLSGSND
AQICLWDINAAFGKKVLEANQIFKVHEGAVGDVSWHLKHEYLFGSVGDDCHLLIWDMRTAAP
NKPQQSVIAHQSEVNSLAFNPFNEWLLATGSMDKTVKLFDLRKLSCSLHTFSNHTDQVFQIE
WSPMNETILASSGADRRLMVWDLARIIGETPEDEEDGPPELLFVHGGHTSKISDFSWNLNDD
RVIASVAEDNILQIWQMAENIYHDDEDML
SEQ ID NO:492
MGLFEPFRALGYITDGVPFAVQRRGIETFVTLSVGKAWQIYNCAKLIPVLVGPQMDKKIRALA
CWRDFTFAATGHDIAVFRRAHQVATWSGHKAKVTLLLSFGQHVLSVDLEGCLFIWAVAEVN
QNKPPIGQIQLGEKFSPSCIMHPDTYLNKVLIGSEEGTLQLWNVNTRKKLYEFKGWGSSIRC
CVSSPALDVVGIGCSDGKIHVHNLRYDEEIVTFMHSTRGAVTALSFRTDGQPLLAAGGSSGVI
SIWNLEKKKLQSVIKDAHDSSVCSLHFFANEPVLMSSATDNSIKMWIFDTTDGEARLLKYRSG
HSAPPMCIRYYGKGRHILSAGQDRAFRIFSVIQDQQSRELSQGHVGKRAKKLKVKDEEIKLPP
VIAFDAAEIRERDWCNVVTCHLDDPCAYTWRLQNFVIGEHILKPCLEDPTPVKSCSISACGNF
AVLGTEGGWLERFNLQSGISRGTYIDIGEKRQCAHNGAVVGLACDATNTLUSGGYNGDIKV
WDFKGRELKFRWEIEVPUKIVYHPGNGILATAADDMILRLFDVTAMRLVRIFVGHMDRVTDL
CFSGDGKWLLSSSMDGTIRVWDIISSRQLNAMHMDSAVTALSLSPGMDMLATTHVGHNGIY
LWANRMIYSKATDIEPFISGKQVVKVSMPTVSSKRESEEGDEKRTIVAESNVNKSDVSGSLIG
DSYSAQLTPELVTLALLPKAQWQSLVNLDIIKMRNKPIEPPKKPEKAPFFLPSLPTLSGERIFIP
SSMNGDGDQDETRNDKTVFEARGKKLGGESLSFMQLLQSCAKIKDFTTFTNYLKGLSPSAV
DMELRLLQIVDNENISETEHSVELQGIGMLLDYFVNEVSCNNNFEFVQALIRLFLKIHGETIRC
QVSLQEKARKLLEIQSSTWERLDTSFQNARCMITFLSSSQF
SEQ ID NO:493
MIAAVCWVPKGVAKVLPDSAEPPTQEEIQELLKCNVVAESDDNEDSDEESEEMDTETDKNT
DAVAKALAAANALGSQSSDFQRQHKVDDIANGKKELDMDHYDDEDEGIDIFGSGSLGNCYY
PANDMDPYLVEQDDDDEDEIEDMTIKPSDLIILSARNEDDVSHLEVWIYEEETEEGGSNMYV
HHDIILPAFPLSLAWLDCNLKGGEKGNFVAVGTMQPEIELWDLDVLDEVEPAVVLGGAVKDE
ASGKTTKLKKKKKNKQAVNFKEGSHTDAVLGLAWNMEYRNVLASASADKSVKIWDIVAEKC
EHTMQPHTDKVQAVAWNPNQATVLLSGSFDRSVIMMDMRAPTHSGIRWPVPADVESLAWD
PHTDHSFMVSAEDGTVRGFDIRAAASTADFDGKPMFILHAHDKAVCAISYNPAAPSLLTTGST
DKMVKLWDITNNQPSCIASTNPNVGAVFSAAFSKNSPFLLATGGSKGILHVWDTLDNSEVAR
RFGKFRPQN
SEQ ID NO:494
MIMDENEFCDIFSLRKRLCLLSSQEGEEEEELEAMSQLDAGEFTVTGNEEVVAIAEDDVNTGI
LSQDLFSSQDYCTPSQPQDSTDLDSKDKAPCPLSPVKSTIQRKRCRPELLSNPPDSIQFSFQ
RLERVRSEESIQSSSQQLARVRSEVSSSDDFKTPKITASGQKNYVSQSALALRARVMSPPCI
KNPYLDENEELNEKIQRSTRRSPACVTPIQSGACLSRYRADFHELEEIGRGNFSRVYKALNRL
DGCCYAVKCSQSELRLDTERKVALMEVQSLAALGPHKNIVGYHTAWFENDHLYIQMELCDH
NLTTANDRGILRTDTDFLEAVYQIAQALEFIHGRGVAHLDVKPENIYVRDGTYKLGDFGRATLI
NGTLHVEEGDARYMSREILNDNYEHLDKVDMFSLGATFFELLMRKQYPGSGKRIDRDTEIKIP
ILPGFSIYFQKLLQDLVSNDPGKRPSAKDVLKNPIFNKVRGAKEV
SEQ ID NO:495
MLAPALEMEPVEPQSLKKLSFKSLKRALDLFSPVHGQIAPPDPESKKMRISYKLNFEYGGGS
GSEDQVPKRKESGAAQNQGQQAAGASNALALPGPEGSKIPPMEKSQNALTVGPSLRPQGL
NDVGLHGKGTAIISASGSSDRNLSTSAIMERLPSRWPRPVWHPPWKNYRVISGHLGWVRSI
AFDPSNQWFCTGSADRTIKIWDLASGRLKLTLTGHIEQIRGLAVSSKHTYMFSAGDDKQVKC
WDLEQNKVIRSYHGHLSGVYCLALHPTIDILLTGGRDSVCRVWDIRSKMQIFALSGHDNTVC
SVFARPTDPQVVTGSHDTTIKFWDLRHGKTMTTLTNHKKSVRAMAQHPKENCFASASADNI
KKFQLPRGEFLHNMLSQQKTIINTMAVNEEGVMATGGDNGSLWFWDWKSGHNFQQAHTIV
QPGSLESEAGIYALSYDLTGSRLVSCEADKTIKMWKEDELATPETHPLNFKPPKDIRRF
SEQ ID NO:496
MEEAAKEQSAGSGKPKLLRYGLRSAAKPKEDKKEEQLHQPPPPPPPQQQAAPAPAPAATR
SSTSGSAGGRDRRPQQQHAVDEKYAPWKSLVPVLYDWLANHNLLWPSLSCRWGPQLEQA
TYKNRQRLYISEQTDGSVPNTLVIANCEVVKPRVAAAEHVSQFNEEARSPFIRKYKTIIHPGEV
NRIRELPQNPNIVATHTDSPDVLIWDVESQPNRHAVYGATASRPNLILTGHQENAEFALAMC
PAEPFVLSGGKDKTVVLWSIQDHITASATDQTTNKSPGSGGSIIKKTGEGNEETGNGPSVGP
RGIYCGHEDTVEDVAFCPSTAQEFCSVGDDSCLILWDARIGTNPVAKVEKAHNGDLHCVDW
NPHDNNLILTGSADNSVNMFDRRNLTSNGVGSPVYKFEGHKAAVLCVQWSPDKPSVFGSS
AEDGLLNIWDYERVDKKVDRAPNAPAGLFFQHAGHRDKIVDFHWNTADPWTMVSVSDDCD
TAGGGGTLQIWRMSDLIYRPEEEVLAELENFKAHVLECSKA
SEQ ID NO:497
MAKDEEEFRGEMEERLVNEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDREEPPGKDY
SVQKMILGTHTSDNEPNYLMLAQVQLPLEDAENDARQYDDERGEIGGFGCANGKVQVIQQI
NHDGEVNRARYMPQNPFIIATKTVSAEEVYVFDYSKHPSKPPQDGGCHPDLRLRGHNTEGYG
LSWSPFKHGHLLSGSDDAQICLWDINVPAKNKVLEAQQIFKVHEGWEDVAWHLRHEYLFG
SVGDDRHLLIWDLRTSATNKPLHSVVAHQGEVNCLAFNPFNEWVLATGSADRTVKLFDLRKI
SSALHTFSCHKEEVFQIGWSPKNETILASCSADRRLMVWDLSRIDEFQTPEDALDGPPELLFI
HGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMAENIYHDEEDDMPPEEVV
SEQ ID NO:498
MGKYMRKGKGVGEVAVMEVSQGSLGVRTRARTLAAASSQKDHRRLGASKSVTTKHQSSA
PPASPCVESSMHTCYLELRSRKLEKFSRCYHSAHGATSHGESKRSLSLSEPSRLAVSEEAR
VASDKSSHRVLQQQSSVAHSRNNSATFSHNAKPAKAAQRKERRDDDHTSARPSEAPHEDE
DGMEVEASFGENVMDLDSRERRTRETTPSSYTRDVETMETPGSTTRPPSNAGRRRFQTEG
GHGTRNQFHVPTTNEIEEFFAGAEQQEQRRFTDRYNYDPVSDSPLPGRFEWVRLRP
SEQ ID NO:499
MQNMEENVQSSWSLHGNKEICARYEILKRVSSGTYLDVYRGRRKEDGLIVALKEVHDYQSS
WREIEALQRLCGCPNVVRLYEVILEFLTSDLYSVIKSAKNKGENGIPEAEVKAWMIQILQGLAN
CHANWVIHRDLKPSNMLISAYGILKLADFGSMSFLKRAIYEVEYELPQEDILADAPGERLMDE
DDSVKGVWNEGEEDSSTAVETNFDDMAETANLDLSWKNEGDMVMQGFTSGVGTRWYPA
PDFLYGATIYGKEIDLWSLGCILGELLILEPLFSGTSNIDQLSRLVKVLGLQQKKNWPGCSNLP
DYRKLCFPGDGSPVGLKNHVPNCSDNMFSILERLVCYDPAARLNAKEIVENKYFVEDPYPVL
THELRVPSPLREENNFSEDWAKWKDMEVDSDLENIDEFNVVHSSDGFCIKFS
SEQ ID NO:500
MAPVKRIEPEKTKANEGKPKRRKVAFAIDTGIEANDCISLHLVSTPEEMRDAEGVEDQSLSFN
PEYMQHFVGEHGKIYGYKGLKIDVWLNALSFHAYVDIQYESKVEEGKSEKEATDLTDIMKRIF
GRGLVEDRNAFIQSFSSNSQSIESMIHNEGERIATREILTDKGLSAQGDSERLGVSNEIFRLEL
SDPQIREWHARLEPLVLLFVEGSQPIEQDDPKWEMYIRVQRESLSGGSAVCRLLGFCTVYRF
YHYPDTTRLRISQILVFPPYQGKGHGLLLLEAVNKTAVSRDSYDVTVEEPSESLQELRDCMD
TIRLLSFEPVMPAVKSAVQKLKEANPSDKGAADHCLEGNVNNETVTTSSTKPKNKSGWFPP
PGLVEEVRKHLKISKKQFKRCWEILLYLNLDRSDSQCEDKYHISLMEQIMSELFDKSSEKSAK
GKRVIDIDNEYDNSKTFIMVRTRNPGNGEGFLPEALEGGMEVSQEDQLKSLFEERLEEIAQIA
EKVPSLCKALQMP
SEQ ID NO:501
MPEDRKKILEALAAKRKAEAESGEKKKRQKSSLNPAKPVSKPVSKPVGGIGSKGKSTSAPIS
STKAKSKHKEEVKAKRVTKMDRYETDEDDESEEEEDLDSESDDDELSDEDSEDDIKSKSVK
KLPPQSKGKAPVKGISSSNGKGRDEKGKGIMKDKGKAKAKVEESSSDAEGDSDDDGGDLS
DDPLQEVDPSNILPSKTRRRASQPTNYQFANMSGDDDDDDDSD
SEQ ID NO:502
MADVPESLQQEKDEQGTDKNCCDGKFQKEIDIDDMEEEYNESSIDDEEENLSDNVATNNMG
TIPQGQACMAVTVEGIEHANSVGCGRNGREGSEEVTAAEDMGHVSIENIREQGRNRKSSEQ
LLALYEQEGLLEDDEDDDDVDWEPFEGVTVQMKWYCTNCTMANSDDSVHCDSCGEHRNS
DILRQGFLASPYLPAESPSSSDVPDERLEESKCVMTTLTPSISPMIGVCCSSLQSERRTVVGF
DERMLLHSEIQMETYPHPERPDRLRAIAASLRAAGLFPGKCFSIPAREATCEELQTIHSLEHV
NAVESTSCGMLSHLSPDTYANEHSSLAARLAAGLCADLAKAIMTGQAQNGFALVRPPGHHA
GVKDSMGFCLHNNAAIAVSASRVVGAKKVLIVDWDVHHGNGTQEIFEADQSVLYISLHRHGE
GFYPGSGAVTEVGSSKGEGYSVNIPWKCGGVGDNDYIFAFQHAVLPIAEQFEPDLTIISAGFD
AAKGDPLGRCEVTPDGFAHMAQMLSCLSKGKMLVILEGGYNLRSISASATAVIKVLLGDNPK
ALPIDIQPSKGGLQTLLEVFEIQSKYWSSLKGHDQKLRSQWEAQYGSKKRKVIRKRHMHIVG
GPVWWKWGRKRVVYYHWFARVSSRKHL
SEQ ID NO:503
MASGAGAAGVVEWHQKPPNPKNPVVFFDVTIGTIPAGRIKMELFADIVPRTAENFRQFCTGE
YRKAGIPIGYKGCHFHRVIKDFMIQAGDFVKGDGSGCISIYGSKFEDENFIAKHTGPGLLSMA
NSGPNTNGCQFFLTCAKCDWLDNKHVVFGRVLGEGLLVLRKIENVQTGQHNRPKLPCVIAE
CGEM
SEQ ID NO:504
MAKLVSSVCAFSCQQRHPHSRPRFLSNRDHYNHYHNHSHYHNVCYFPPMMMMQQQLQK
QKRMTTKTITSLFKCNSSNHTLLKGLKEFMGFKFRLQAAMLSCEMSILGRVFAIFFIVHQAAAP
FPFNHFDNWLVPPASAVLYSPNTKVPRTGEVALRKSIPANPAMKSIQDFLEDIYYLLRFPQRK
PYGTMEGDVKSALQIAINEKDSILGSVPLDMKERGLQLYNFLIDGQGGLQVLIEYIKEKDPDKV
SVNLSSSLDTIAQLELLQAPGLPYLLPEEYQQYPRLNGRATIEFTMEKGDNSMFSVSSGGGL
QKTATIQVVLDGYSAPLTAGNFTKLVIDGAYNGLKLKTTEQAVISDNERAEAGFNLPIEILPAG
GFEPLYRTTLSVQDGELPVLPLSVYGAIAMAHNTISEDYSSPSQFFFYLYDKRNAGLGGLSFD
EGQFSVFGYTTVGKEILPQLKTGDIIKSAKLVDGFDHLVLPSSST
SEQ ID NO:505
MDHYYQDDFDYLVDDEMVDFADDVEDDVRTRRRSDIDSDSENDFDSNNKSPDTTALQAKR
GKDIQGIPWNRLNFTREKYRETRLQQYKNYENLPRPRRSRNLDKECTNFERGSSFYDFRHN
TRSVKATIVHFQLRNLVWATSKHNVYLMQNYSIMHWSSLKQKGEEVLNVAGPIIPSVKHPGS
SPQGLTRVQVSAMSVKDNLVVAGGFQGELICKYLDKPGVSFCTKISHDENGITNAVEIYNDA
SGATRLMTANNDLAVRVFDTEKFTVLERFSFPWSVNHTSVSPDGKLVAVLGDNADCLLADC
KTGKTVGTLRGHLDYSFAAAWHPDGYILATGNQDTTCRLWDVRKLSSSLAVLKGRMGAIRSI
RFSSDGRFMAMAEPADFVHLYDTRQNYTKSQEIDLFGEIAGISFSPDTEAFFVGVADRTYGS
LLEFNRRRMNYYLDSIL
SEQ ID NO:506
MDCSGDEEEEQFFESLEEMLSPSDSGSEAADNETGCRNADARSKYEIWKRAPSSIQERRQ
RFLVRMGLANPSELGNQVNSTSAESTCSTETANIPNGIERLRENSGAVLRTAGSSGRKTHCK
NVINIGLREGSVRSSSSSNGTPDVGEDNGEFGGTIFSRSGGTWECMCKIKNLDSGKEFVVD
ELGQDGLWNKLREVGTDRQLTMDEFERSLGLSPLVQELMRRESGVAQADCNGVHHHDAEI
SSSKRRSWLKALKSAAYSMRRPKEDQSNYDSERSGRRSGSFDVPWGKPQWTKVRHYRKR
YLEFTALYMGQEIEAHEGSIWTMKFSLDGRYLASAGQDCVIHVREVIESMRTFGADTPDLYA
SSAYFSMNGLQELVPLSIEDHANKMKRGKIIGSKKSSNSDCIVLPNKVFQLSEEPVCSFHGHL
LDVFDLSWSPSQYLLSSSMDKTVRLWKLGHESCLKVFSHNDIVTCIQFNPVDERYFISGSLD
GKARIWSIPDRQWDWSDLREMVTAVCYTPDGQGGLVGSIKGSCRFYNTSGNKLQLENQLN
VRSKKKKSSGKKITGFQFAPGGDSQKVLITSADSRVRVYNGSELVCKYKGFRNTCSQISASF
APNGQHFVCASEDSRVYIWNHESPRGSGARHEKSSWSHEHFLSQGVSVAIPWSGMKLQPP
VWNSPEFMLGQRHNLLSLQGGKDVGCQNGLLSREAGEGQESETPLHYISQVSHSCGSQN
MVDRDGQDDLSRYSACISDSRLSSFMAFPESPGNPDDLNSKVFFSDSSSKGSATWPEEKLP
PTRKQSRSNSTSSHYDTLKTHLGNTIQGQSGASAAVAWGLVIVTAGHGGEIRSFQNYGLPV
RL
SEQ ID NO:507
MPSIPAIGEFTVCEINRELLTTKDESDTQAKDAYAKILGLVFPPISFQIEEGFGSASRQQFDQD
LDREDTIVTPSTSEGTNALQEGGLLLKGVSVLKNILASSFGPIFSPNDTKKVLKKVELLQGISWH
RHKHILAFISGSNQVTVHDFQDPEWRESSLLVSESQRGIEALEWRPNGGTTLSVACRGGICI
WSASYPGSVAPVRSGVASFLGTSTRGSSVRWTLVDFLQIPGGKAVTALSWSPTGRLLASAS
REDSSFTIWDVAQGVGTPLRRGLGGISLLKWSPTGDYLFSAKPNGTFYLWETNTWTLEQWS
SSGGCVISATWGPDGRMLFMAFSESTTLGSLHFAGRPPSLDAHLLPMELPEIGSITGGFGNI
EKMAWDGCGERLAVSYTGGDLMYVGLIAIYDTRRTPFISASLVGFIRGPGEQVKPLAFAFHD
KFKQGPLLSVCWSSGLCCTYPLIFRAH
SEQ ID NO:508
MEEENAKHTEETRQVQVRFTTKLQPALRVPTTSIAIPAHLTRYGLSDIVNTLLGNDKPQPFDF
LVESELVRTSLEKLLLIKGISAEKILNIEYILAVVPPKQEEPSLHDDWVSVVDGSYPNFIFSGSF
DSIGRIWKGEGLCTHVLEGHRDAITSAAFIMPSDSSDSFINLATASKDRTLRLWQFKPNEHMT
NGKMVRPYKLLKGHTSSVQTVSACPRRNLICSGSWDCSIKIWQTAGEMDIESNAGSVKKRK
LEDSTEQIISQIEASRTLEGHSQCVSSVVWLEKDTIYSASWDHSVRSWDVETGVNSLTVGCR
KALHCLSIGGEGSALIAAGGADSVLRIWDPRMPGTFTPILQLSSHKSWITACKWHPKSRHHLI
SASHDGTLKLWDVRSKVPLTTLEAHKDKVLCADWWKEDCVISGGADSTLQIFSNLNLT
SEQ ID NO:509
MNRLRSKRNHILELRLGQSEPEKEATLASNRSRGTNAPIVVEDDDDVVVSSPRSFALARSSV
SQRSSRIPIVNEEDLELRLGLAVTGRTSAEHNPRRRHGRVPPNKPIVLCDDAGEADQSSSKK
RRTGQQLSSDVQSDESKEVKLTCAICISTMEEETSTICGHIFCKKCITNNHRWKRCPTCRKKL
AINNIHRIYISSSTG
SEQ ID NO:510
MEEPPPPAVLPSSEDTSIVSSHSFVNAPPTVPVGLDASIPQISTPGINQPGLTIPVPPFEAPLT
ASLVAASAGMPPAVVPSFVRPAIVAHPSVMPPPSMPLAALPMPVASAVPVAAPHFPPSTPND
NSITPSMPVPTPIVASSSVPPSVTIPGIAPLPFIAPIPVPSSRPVAPSPFMPPARPLGASVSVAM
DVDNTDEQDQDADNKGESPSSSPDHPEDPSAAEYEITEESRKVRERQEQAIQELLLRRRAY
ALAVPTNDSSVRARLRRLNEPITLFGEREMERRDRLRALMAKLDAEGQLEKLMKVQEEEEAA
ANVDAEEVQEMEGPQVYPFYTEGSQELLKARTEITKFSLPRAVSRLQRARRKREDPDEDED
EELKCVLQQSAQINMDCSEIGDDRPLSGCAFSSDGTLLATSAWSGVTKLWSVPNINKVATLK
GHTERVTDVAFSPTNCHLATACADRTAMLWNSEGVLMKTYEGHLDRLARLAFHPSGLYLGT
ASFDKTWRLWDVNTGIELLLQEGHSRSVYGIAFQCDGSLAATCGLDGLARIWDLRTGRSILA
LEGHVKPVLGIDFSPNGYHLATGSEDHTCRIWDLRKRQSVYIIPAHSHLVSQVKFEPQEGYFL
VTASYDSTAKVWSARDFKSIKVLAGHEAKVTSVDITADGQYIATVSHDRTIKLWSSKNSTNDM
NIG
SEQ ID NO:511
MKRAYKLQEFVAHASNVNCLKIGKKSSRVLVTGGEDHKVNMWAIGKPNAILSLSGHSSAVES
VTFDSAEALVVAGAASGTIKLWDLEEAKIVRTLTGHRSNCISVDFHPFGEFFASGSLDTNLKI
WDIRRKGCIHTYKGHTRGVNSIRFSPDGRWVVSGGEDNIVKLWDLTAGKLMHDFKCHEGQI
QCMDFHPQEFLLATGSADRTVKFWDLETFELIGSAGPETTGVRAMIFNPDGRTLLTGLHESL
KVFSWEPLRCYDAVDVGWSKLADLNIHEGKLLGCSYNQSCVGVWVVDISRVGPYAAGNVS
RTNGHNEAKLASSGHPSVQQLDNNLKTNMARLSLSHSTESGIKEPKTTTSLTTTEGLSSTPQ
RAGIAFSSKNLPASSGPPSYVSTPKKNSTSRVQPTTNFQTLSRPDIVPVIVPRSNSLRPETTS
DAKKEMNNFGRVVPSTVSTKSTDVIKSGSNRDESDKIDSINQKRMTGNDKTDLNIARAEQHV
SSRLDNTNTSSVVCDGNQPAARWIGAAKFRRNSPVDPVVSPHDRSPTFPWSATDDGVTCQ
PDRQVTAPELSKRVVEPGRARALVASWETREKALTADTPVLVSGRPPTSPGVDMNSFIPRG
SHGTSESDLTVSDDNSAIEELMQQHNAFTSILQARLTKLQVIRRFWQRNDLKGAIDATGKMG
DHSVSADVISVLIERSEIFTLDICTVILPLLTRLLQSETDRHLTVAMETLLVLVKTFGDVIRATISA
TPTIGVDLQAEQRLERCNLCYVELENIKQILVPLIRRGGAVAKSAQELSLALQEV
SEQ ID NO:512
MAGSDENNPGVVGGAHVQEGLRVGAGKMGAGNVQQRRALSNINSNIIGAPPYPCAVNKRV
LSEKNVNSENDLLNAAHRPITRQFAAQMAYKQQLRPEENKRTTQSVSNPSKSEDCAILDVDD
DKMADDFPVPMFVQHTEAMLEEIDRMEEVEMEDVAEEPVTDIDSGDKENQLAVVEYIDDLY
MFYQKAEASSCVPPNYMDRQQDINERMRGILIDWUEVHYKFELMDETLYLTVNLIDRFLAVQ
PVVKKKLQLVGVTAMLLACKYEEVSVPVVEDLILISDRAYSRKEVLEMERLMVNTLHFNMSVP
TPYVFMRRFLKAAQSDKKLELLSFRIELSLVEYDMLKFPPSLLAASAIYTALSTITRTKQWSTT
CEWHTSYSEEQLLECARLMVTFHQRAGSGKLTGVHRKYSTSKFGHAARTEPANFLLDFRL
SEQ ID NO:513
MQAPREGKSAAAIVGMGKYMKKSKAIPRDVSLLEASPRSPSATGVRTRAKTLASRRLRRAS
QRRPPPPAAAAAAAAPSLDASPCPFSYLQLRSRRLRRPRLAPSPEARIDEGPAGSGSRGSR
DASCSARTASSSGGVEGEGACVGRGDRGNGGECVRDAAVDASYGENDLEIEDRDRSTRE
STPCSERDSNANTPPGSTTRQQSSCTAHRTQMSILRSIPTSDEMEEFFAYAEQRQQRSFIEK
YNFDIVKDRPLPGRFEWVQVIP
SEQ ID NO:514
MDGHSSHLAAQNRSRGSQTPSPSHSAASASATSSIHLKRKLSAANASAASAAAAAAAAAAA
ADDHAPPFPPSSISADTRDGALTSNDDLESISARGGGAGDDSDDDSDDEEEDDGDNDGGS
SLRTFTAARLENVGPAAARNRKIKAESNATVKVEKEDSAKDGGNGAGVGALGPAATSGAGS
GSGTVPKEDAVKIFTENLQASGAYSAREENLKREEEAGRLKFECLSNDGVDDHMVWLIGLK
NIFARQLPNMPKEYIVRLVMDRNHKSVMVIRRNLVVGGITYRPYASQKFGEIAFCAIKADEQV
KGYGTRLMNHLKQHARDVDGLTHFLTYADNNAVGYFIKQGFTKEIYLDKDRWHGYIKDYDG
GILMECKIDPKLPYTDLSTMVRRQRQAIDEKIRELSNCHIVYQGIDFQKRDAGVPQNTIKMEDI
PGLREAGWTPDQWGYSRFRGLSDQKRLTFFIRQLLKVLNDHSDAWPFKEPVDAREVPDYY
DIIKDPMDLKTMTKRVESEQYYVTLEMFIADVKRMFANARTYNSPDTIYFKIATRLEAHFQSKV
QSNLQSGAGKIQQ
SEQ ID NO:515
MFNGMMDPELFKLAQEQMNRMSPAELAKIQQQMMSNPELMRMASESMKNMRPEDLRQAA
EQLKHVRPEEMAEIGEKMANASPEEIAAVRARADAQMTYEINAAKILKKEGNELHSQGRFKD
ASQKYLRAKNNLKGIPSSEGKNLLLACSLNLMSCYLKTRQYEECIKEGSEALACEEKNLKAFY
RRGQAYRELGQLKDAVSDLRKAHEISPDDETIAQVLRDTEESLTKEGGSAPRGVVIEEITEED
ETLASVNHESPSEYSEKRHQESEDAHKGPINGDIMGQMTNSESLKALKGDPDAIRSFQNFIS
NADPTTLAAMGAGNAGEVSPDLIKTASSMIGKMSAEELQKMIQLASSFPGENPYVTRNSDSN
SNSFGNGSIPNVSPDMLKTASDMMSKMSPDDLQRMFEMASSSRGKDPSLDANHASSSSGA
NLAANLNHILGESEPSSSYHIPSSSRNISSSPLSNFPSSPGDMQEQIRNQMKDPAMRQMFTS
MMKNMSPEMMANMGKQFGLELSPEDAAKAQEAMSSLSPEMLDKMMRWADRAQRGVETA
KKTKNWLLGRPGMILALCMLLLAVILHRLGFIGS
SEQ ID NO:516
MIAAISWVPRGASKAVPEVAEPPSKEEIEEILKSGVVERSGDSDGEEDDENMDAVASEKADE
VSTALSAADALGRISKVTKAGSGFEDIADGLRELDMDNYDEEDEDVKLFSTGLGDLYYPSND
MDPYLKDKDDDDDTEEIEDLSIKPMDSLIVCARTDDEVNLLEVYLLEPSLSDESNMYVHHEVV
ISEFPLCTAWLDCPIKGGDKGNFIAVGSMEPAIEIWDLDIIDAVEPCLVLGGQEELKKKKKKGK
KASIKYKEGSHTDSVLGLAWNKEFRNILASASADRQVKIWDVAAGKCNITMEHHTDKVQAVA
WNHHAPQVLLSGSFDHSVVMKDGRIPSHSGYRWSVTADVESLAWDPHSEHFFVVSLEDGT
VRGFDVRAALSNSASQSLPSFTLHAHEKAVSTISYNPAAPNLLATGSTDKMVKLWDLSNNQP
SCIASRNPKAGAVFSVSFSEDSPLLLAIGGSKGRLEVWDTSSDAAVSRRFGKHGKPKTAEPG
S
SEQ ID NO:517
MKFCKKYQEYMQGQEGKKLPGLGFKKLKKILKRCRRRDSLHSQKALQAVQNPRTCPAHCS
VCDGSFFPSLLEEMSAVLGCFNKQAQKLLELHLASGFQKYLMWFKGKLRGNHVALIQEGKD
LVTYALINAIAIRKILKKYDKIHLSTQGQAFKSQVQRMHMEILQSPWLCELIAFHINVRETKANS
GKGHALFEGCSLVVDDGKPSLSCELFDSIKLDIDLTCSICLDTVFDSVSLTCGHIYCYMCACS
AASVTIVDGLKAAEPKEKCPLCREARVFEGAVHLDELNILLSRSCPEYWAERLQTERVERVR
QAKEHWESQCRAFMGVE
SEQ ID NO:518
MVSTQSTRENPSIFFPPPLKPWLLPVVLSLSLSRQLGMAAAAAASLPFKKNYRSSQALQQFY
AGGPFAVSSDGSFIACNCGDSIKIVDSSNASLRPSIDCGSDTITALSLSPDGKLLFSAGHSRQI
RVWDLSTSTCLRSWKGHDGPVMSMACPVSGGLLATGGADRKVMVWDVDGGFCTHFFKG
HDGVVSTVLFHPDSNRSLLFSGSDDGTIRVWDLLAKKCASTLRGHDSTVTSLAFSEDGLTLL
AAGRDKVVSLWDLHNYACKKTIPMYEVLESVCVIHSGTVLASQLGLDDQLKVTKESAQNIHFI
TVGERGILRIWKSEGSVCLFKQEHSDVTVISDEDDSRSGFTAAVMLPLDQGLLCVTADQQFL
FYYPEKHPEGIFSLTLCRRLVGYNEEIVDMKFLGEEENFLAVATNLEQVRVYELASMSCSYVL
AGHTETVLCLDTCISSSGRTLIVTGSKDNSVRLWDSESRHCIGVGVGHMGAVGAVAFSRKR
QDFFVSGSSDRTLKVWSLDGISEDGVDSTNLKALAVVAAHDKDINSVAVAPNDSLVCSGSQ
DRTACVWRLPDLVSVVVLKGHKRGIWSVEFSPVDQCVLTASGDKTVKIWAISDGSCLKTFEG
HVSSVLRASFLTRGTQFVSCGADGLVKLWTVRTNECIATYDQHSDKVWALAVGKKTEMLAT
GGSDAVVNLWYDSTASDKEDAFRKEEEGVLKGQELENAVSDADYTKAIELALELRRPHKLFE
LFSELCRTREVGDRVERILSALSGEEVCLLLEYIREWNAKPKLCHVAQSVLSQVFRILSPTEIV
EIKGIGELLEGLIPYSQRHFSRIDRLVRSTYLLDYTLTGMSVIEPEADRSAVNDGSPDKSGLEK
LEDGLLGENVGEEKIQNKEELESSAYKKRKLPRSKDRSKKKSKNVVYADAAAISFRA
SEQ ID NO:519
MDSAPRRKSGGINLPSGMSETSLRLDGFSGSSSSFRAISNLTSPSKSSSISDRFIPCRSSSRL
HTFGLVERGSPVKEGGNEAYSRLLKAELFGSDFGSLSPAGQGSPMSPSKNMLRFKTESSGP
NSPFSPSILRQDSGFSSEASTPPKPPRKVPKTPHKVLDAPSLQDDFYLNLVDWSSQNTLAVG
LGTCVYLWSASNSKVTKLCDLGPNDGVCAVQWTREGSYISIGTSLGQVQIWDGTQCKRVRT
MGGHQTRTGVLAWNSRILASGSRDRVILQHDLRVPNEFIGKLVGHKSEVCGLKWSHDDREL
ASGGNDNQLLVWNQHSQQPVLKLTEHTAAVKAIAWSPHQNGLLASGGGTADRCIRFWNTT
NGHQTSSVDTGSQVCNLAWSKNVNELVSTHGYSQNQIMVWKYPSMAKVATLTGHSLRVLY
LAMSPDGQTIVTGAGDETLRFWNVFPSAKAPAPVKDTGLWSLGRTHIR
SEQ ID NO:520
MEDEAEIYDGVRAQFPLTFGKQSKPQTSLESVHSATRRGGPAPAPAPASSSSLPSTTSPSAA
GGAGKSSGLPSLSSSSTAWLEGLRAGNPRAGREAGIGSRGGDGEDGGRAMIGPPRPPPGF
SANDDGGGEDDDDDGDGVMVGPPPPPPGNLGDGDDDEEEEEAMIGPPRPPVVDSDEEEE
EEEEENRYRLPLSNEIVLKGHNKIVSALAVDPTGSRVLSGSYDYTVRMFDFQGMNSRLSSFR
DFEPVEGHQVRNLSWSPTADRFLCVTGSAQAKIYDRDGLTLGEFVKGDMYIRDLKNTKGHIT
GLTWGEWHPKTKETILTSSEDGSLRIWDVNDFKSQKQVIKPKLARPGRVPVTTCTWDREGK
CIAGGIGDGSIQIWNLKPGWGSRPDIHVEQAHADDITGLKFSSDGKILLTRSFDDSLKVWDLR
LMKNPLKVFEDLPNHYAQTNIACSPDEQLFLTGTSVERESTIGGLLCFFDRSKLELVSRIGISP
TCSVVQCAWHPRLNQIFATSGDKSQGGTHVLYDPTLSERGALVCVARAPRKKSVDDFELKP
VIHNPHALPLFRDQPSRKRQREKILKDPLKSHKPELPMNGPGHGGRVGASKGSLLTQYLLKQ
GGMIKETWMDEDPREAILKHADAAEKNPKFTRAYAETQPDPVFAKSDSEDEDK
SEQ ID NO:521
GATTTTAAGTAACTCAATTAGCAGTTCCAACATTAAACCATTATTATTACCCCTTTTATC
SEQ ID NO:522
CTCAAAAAGTACTTGGATGCGTGCGGTGACAACGGACTCGAACCGTACACTGTCAAATC
T
SEQ ID NO:523
TTGTCAAGTTGCAGGACGTAGTGCACAGTGAGAGGCGTCTATATCTAGTTTTTGAGTACT
SEQ ID NO:524
GAAGAAATTATATAACTAGATACAAGGTTAGCTAGGTATATAATAGCGGTACAAGTCTTT
SEQ ID NO:525
CGACAAATCAAGTAGAACTTCTCTCGGCAGCATCAGTTTTTCTAATCCATGCCTTGTTGC
SEQ ID NO:526
CTCAGTTCTGATAATGCCTCGGATATATGGCCGAGTGTTCGCTGGACGGCCTCTTATGTT
SEQ ID NO:527
GGAGATTCTGAACTGCAACAGCTCCTACACATTTTCAGACTGTTGGGTACTCCAAATGAA
SEQ ID NO:528
GACTGGTAAAATCGTTGCACTAAAAAAAGGTCCGGTTTGACAACTTGGAACCTGAAAGCGT
SEQ ID NO:529
AAACACCAATCTATCAACACTGTCGAGTTTAGTCACTAGTAGAACCGGAGATAACAAACA
SEQ ID NO:530
CTATGATCCTGAGCGCAAGCAAGTTATGACCAATAGAGTCGTTACACTATGGTACCGAG
C
SEQ ID NO:531
TGTTGTGAAGGTAGTTATAGCCATCGATTAGACAGTGATTAAAGTAGTACCCGTGCCAAT
SEQ ID NO:532
CCACATACAAGAGTTGTTACGCTACACATCCTATACCATCAAAGGAACGTTGGAATGCCA
SEQ ID NO:533
TATGATCGACACAAGCATTTTGTGTTGGAGCCTCAGCTAATTGTATGTCATCGAGTACTT
SEQ ID NO:534
AAAATTTTTGCTACGGATAATGTTGTGAGGCGAGGCAGTCGAAATTACGGAGGTTGACTT
SEQ ID NO:535
ATGCAGGGATCAAATTTGTGAGTACTACGTAAAATTTTGCTACGGAGGCGAGGCAGTCG
A
SEQ ID NO:536
GAAGAATACAGGCTCGTACCTGATACACTGTACCTGACTGTTAACTACATAGATCGGTAT
SEQ ID NO:537
TCCACCCTAAATGCGATACGTGAAAAGTATAGACAACAGAAGGTAAACTATTCATTACTG
SEQ ID NO:538
AGGCTTCTAGTTGCGTTCCCCCAAACTACATGGATCGGCAGCAGGATATTAATGAGCGG
A
SEQ ID NO:539
GAGAAAAATGACAGATTGATATCGATGATGATGACTGTCGTGTCATCAGTAGTGTGCTTT
SEQ ID NO:540
TTTCCAATTGTAGTTCGTCTTTTATTGTAACAATAAATTGATAGATACTGATTCGAAATA
SEQ ID NO:541
ACATTTATGCTAACTATAGGAGAACGGAGAATTGTAGCTGCGTCTCTGCTAACTACATGG
SEQ ID NO:542
TTCTGGCTTAAAGGCTATTCTTTGTGCACAATGACCTGAGGGAGGTCTCGACAGACCACT
SEQ ID NO:543
TTCATCCGGGTCCTGGTTATCATACTCTTATATATGTTGGGGAATAACGGTTCATATGTT
SEQ ID NO:544
GGGTGTGCTTAATAGTTCTTATTAGTCTTAGCTTATTATCTTTGATTGGACATGCTATAA
SEQ ID NO:545
CTTGCTAAGTAGACATGTTATATTTCTAATGCTTTGAGAACAATATTACAGTATAATTAG
SEQ ID NO:546
AATCATCGACTAGACCGATGGTCAAAGTGGTAATCATGTAATTAAACGCGTTTGTCATTG
SEQ ID NO:547
ATGGAAAAATCTATGGATATGAAGGATTGAAGATATCCGTCTGGGTAAGCTGTGTATCAT
SEQ ID NO:548
TTATGATTTGAGAAAACCCTTGCAGGCTGCGATTTGCGGATCATGACAGCATAGTTTTGC
SEQ ID NO:549
GTTTTGTTGTGAGGGCTTGGTAGGTTTTCATTATATTGTAATGTCGACGACAGAGATTTT
SEQ ID NO:550
CCAATTAATGTTACTGCTCAAGCTGACGTACCTGCGAAAAAAGCACCAGTGACTGCTAAT
SEQ ID NO:551
TGATGTCAAAACGTAGCTCTTTTTTGTGTGAGCTATCCTGCTAAATTAAACCTCAGCAAA
SEQ ID NO:552
ACATGAGTATTATGAATACTTCGGTCCTGACTATACACTTCATGTTGCTCCGAGTAACAT
SEQ ID NO:553
GAATTGGCGATCACAATCTACTGTAGTCAATACTCAAGTGGGAGGTGTAAATAGATTCCA
SEQ ID NO:554
GATCATGTGTAATCAGTATATCAGGTTAGAAACAGTACTCTTGAGCTTAGCGGGCACTGT
SEQ ID NO:555
TCCTGTGAAGGTGGTCGACTCAATCAAAAGGTACCTTGTAGATAAGGTACCTTTTCTCAA
SEQ ID NO:556
GCATTTTATACGACGGATAGAGTCATGACCGTATCTTTCCATAAGTTTGGGGACTTCTTC
SEQ ID NO:557
CCTCGTTTCTTTGCGGTTCGGACGCATCATGGATGTATCTCCAAAGAGTAATCTGTCGAT
SEQ ID NO:558
AATTCAGATCTATTAGTGAAAGTTGGCATGAGTCTCAATCTTAGGGGAATACAGTACGGA
SEQ ID NO:559
TGATATGAGTATCATAACTCGGATGGTGACAACTTTGTACTACGGTCGGCACCGGTAGAT
SEQ ID NO:560
CATATACAATCTTAGTGGATTAGCTGAGGTCGAAACTGACAAGAGTGATCGCCCGTTGG
A
SEQ ID NO:561
CATGGCTAACGCTGGCCCTAGCACTAATGGGAGCCAATTTTTCATATGCACTGTAAAGAC
SEQ ID NO:562
AACAAAGTCTACCTTGACATTAGCATCGGTAACCCTGTCGGGAAACTAGTCGGAAGAATT
SEQ ID NO:563
TGTGCTTGGATATACTGTATAAGCATTCTATATTATGCTTGTTGGCTTCGTTTTGAGGGA
SEQ ID NO:564
TTAACGTCGACCGCTTCTCTGCCCCTTGAATTTTCCCGAGAAAACCAGGAACCTGCCAAA
SEQ ID NO:565
TGTTGAATACGATGTATTATAATGTTGGTGTCTTGGTGAAATACAGAATTATGCTTGCGT
SEQ ID NO:566
ATCGCTGTGGCTGATCTCGTCGCTCCGGCTTTTCATAAAAATCATGGCTGAGGCAATCG
A
SEQ ID NO:567
CTCGCAACCCTATATCTCGCTCAGGCGAAGAAGTCTGAGGATTTGAAAGAGGTGACTCA
C
SEQ ID NO:568
TGTTTTTGGGTACACGCAGTTAGGATAACTAGCATGAAAGCCCGATCCCGCATATACAG
G
SEQ ID NO:569
GAGGACTAGCCGGAACTTCATCGAACTCTCTCGGAGGGGTTACTACGATAACGTCAAGT
T
SEQ ID NO:570
GATGGCTAGCACTGTGTAGAAAGGTGAATTTAAAGTACTTGTCTACACTGCTTATTAAAT
SEQ ID NO:571
TGAGACTGTCTTGGCGTGTATTTTGGAATAAACTATTATCACGTTTTGTTAAATATAATA
SEQ ID NO:572
TTACAAAATGGCTCTCAGAAAGTATCGAAAGGCCCTGCGCTATCTGGATATCTGCTGGG
A
SEQ ID NO:573
AATTTTATGTTTGCTACTGCTTAGTGCTTAATGGACTTGCGTAGGTATTCAAATTACAGA
SEQ ID NO:574
TGGAACCGTGGTATCGGCTGACGTTATCCGTGATTTTAAGACTGGAGATAGTTTATGCTA
SEQ ID NO:575
CTTTGATGTATCCTCAGTGTACTGCTTTTAGCTATGTATAGATCGAGTCAACTCATTGAA
SEQ ID NO:576
TTTTTATTATTTACCTTCGCCTTTACGCTGCATACGTTAATAGGTTATTATTTCCTTCAA
SEQ ID NO:577
ATTTGTCCATGACAATCGTAGTCGAAGACACGATACGCTCTTAGATGGTACGGAAATCTG
SEQ ID NO:578
TGAATAGAGATAACTTTTCTGAGTGTGAAGTTGGATATTACGTTGCAAATAGCCGAATGAA
SEQ ID NO:579
GCTTTAGGTTAGGGATCCCTGTAAGCTGATGATAGATATTGGAGATGGTACTTGTAAGAT
SEQ ID NO:580
TGTTGTGTTTGGAAAGGTGCTGTCTGGGATGGATGTTGTCCACAAGATTGAGGCTGAAG
G
SEQ ID NO:581
GGAAAGCGGGGAATGAGCATGTGGATATTATCTCTTTCTACAATGAAATATTCATTCCTT
SEQ ID NO:582
CATCAGGACGTTGACTCTAATTAAGACATATGTGACAGAGCGCCCTGTTAATGCGGTTAC
SEQ ID NO:583
CTTTAGGTTTGATCTGTCTGTTTTGTCTATCCTGCGAGTTTCGAGCATGTGCGTGTGTGA
SEQ ID NO:584
CAGCCCCAATAGATACTGGCTCTGTGCCGCTACTGAGAACAGTATTAAAATCTGGGACC
T
SEQ ID NO:585
AAGAATGAAGCTGATATGAGTGATGGAACTACGGGGGCCATGAGCTCAAATAAGAAGGT
C
SEQ ID NO:586
TGACTACAATTAGCACCTCACCATTATCGAACTGTATAATTGTGCTTGCCTGCTATTATT
SEQ ID NO:587
TTGAAGCGGAAATATATATTTATGCTACTACATAAGTAATGTACTACTTGACAAGATGAG
SEQ ID NO:588
TACTCGATGTGGTATAGAATTTATCCAATGTACTCCTAAATGTAGATACATCGTGTATTG
SEQ ID NO:589
GCTTCGTCTGATACCACTATCAAGATAATAGGCGTGAGCAATAGCTCTGGATCACAGCAC
SEQ ID NO:590
GGTCGGCTTGCTAGTGTATCTGATGACAAGAGCATATCACTCTATGATTACTCATGAAGG
SEQ ID NO:591
GAAAGGAGAAAAGCATGGAGATCGATCTCGGAAACCTCGCATTCGACGTCGATTTTCAT
C
SEQ ID NO:592
GATTCAGTACCCGGATTCGCAAGTCAACCGGTTGGAGATAACTCCACATAAGCGGTACC
T
SEQ ID NO:593
TTCCATGTATCAAGCCGCATCAATGTTTGTCGCTGCAATTAACATGTGTGCAGTCGATCC
SEQ ID NO:594
TTCAGCGCATTGTGTAAATGTAGATAGGTGATATATTTCTCGTTGCAATGTAGGGTAAGA
SEQ ID NO:595
TCCAATAATCACATTTACCATCAACAGGCATCAGCAACATACTGTTGTAGTGTAATTAAT
SEQ ID NO:596
GGGCATTCTGACTACCTGCACTGTATAGCTGCACGGAACTCTTCTAGTCAGATTATAACA
SEQ ID NO:597
AATCGTCTGGTAGATTGTCAAAAACTAATAAACCTGTGATTGATCCGGATTCTAGTAATG
SEQ ID NO:598
AGTTGAGGATTCTCCACTATGACAGCTCTCATGGCTTGAATCTAAAGTCATCTGGTTTTC
SEQ ID NO:599
GAACAATCATTCTGTAGAACACTAGAGTCTATATGCTTGACTGTATCGGTTAATTAATTC
SEQ ID NO:600
AGATAGCGATAGAGTTATACTGCATGTACTGAGGTAAATGTTTTGATTACTCCACCCAAT
SEQ ID NO:601
AAGAATTGTTAGGAGGTGTATACTTTCTGTAACTGTATTCAATGAGCATACACCTGACGG
SEQ ID NO:602
CAACTCATATAATGACTGGATTCTGGCAACCGCGTCTTCAGACACAACAGTTGGACTATT
SEQ ID NO:603
AGTGTAAAAGGATGCCCCTAATAGATTATATGCCAAGTGTAGTATATATAATAGTGCTTT
SEQ ID NO:604
AAGAATCTACAGTTGTCTTATGCTACTCTATTACTCAATTATGCTGTGCTATTGATTGAG
SEQ ID NO:605
TCTGAATACATACTTTGTGGTCTCTATAAAAGACCAATGATACAGGCATGGTCATTAATT
SEQ ID NO:606
TAAATCTTCTCATGTGCCTGGCGTAAATTTTGCAGTTATTACTAGACCAAGATAGTTTCA
SEQ ID NO:607
ACATGGATTCGATCAATCGCCACATGACAACTAAAACAAGCGGTTCACGTGATTGTAATT
SEQ ID NO:608
AGATGAGTATGCTCGGGTGTATGATATTCGCAATTACAAGTGGAATGGATCGCATAATTT
SEQ ID NO:609
TCTTTGATTCTGTTGTATGGTGTATCTTATTGTATCTTCTATCTGCCCCCCATGTAATTC
SEQ ID NO:610
TTCGTTGTGTAGTACTGGGAGTTACTACTTGTATGTATGTAAATCATGTGGCGTCTGTCC
SEQ ID NO:611
GGAGATGTGTAATATGTCTGAGCGGTCACACTCTAGCTGTTACATGCGTAAAGTGGGGA
G
SEQ ID NO:612
CCACCGTTGCGTAACTCGAATAGCCGGATTTTCGTTTTCGTTTTTATTTCCCCGTTAATT
SEQ ID NO:613
TGAGATGCTCTGTGTGAGGACTTTTACGAAACTTGAATGGCCCGTAAGGACAATAAGCTT
SEQ ID NO:614
TGGGTTGTTGCGACGGGTTCTACAGATAAGACTGTTAAGTTATTTGATCTACGCAAGATC
SEQ ID NO:615
GCAGAGGTGCCTACATATGCTTTAGAATGCTAGTAGCTTGGAAGTGCAACACGCTCGTG
A
SEQ ID NO:616
AGTAAAGTTTAACGACTATGCATCTGTCGTAGTATCAGCCGGCTATGATCGTTCAGTGCG
SEQ ID NO:617
CGTTAGGATAGTCTTTAAAGGAGTTGGTGATTATTGATTTCCACCCAATATATGTAGCGT
SEQ ID NO:618
GAGCAAGCTACTTACAAAAATCGACAGCGTCTTTACCTATCTGAACAGACAGATGGCAGT
SEQ ID NO:619
TCCTTCCGACAAGTACCGTATTGCAAGTTGTGGTATGGACAATACGGTTAAAATCTGGTC
SEQ ID NO:620
TTTCACTCGATGACGGTTGGCCGGATAAATAATCGCTTATATAGTCCTAATAAGAACCAT
SEQ ID NO:621
ATATGTAGGTGGTAGAGGTGTGGATATTGCATAGACCGAACCTCCGCAGGTCCGCATTC
T
SEQ ID NO:622
CCATTGAACTACTTATGGATTACTTTATACATGAAATATCATGCCGGAGTAATTTTGAGT
SEQ ID NO:623
AGCATTAGAGACCTGGATTTTAGTCTAGATTCAGAGTTTTTGGCTACGACATCTACTGAT
SEQ ID NO:624
AAAGGTTTATCCCTCATTGGATTTGATATATAAACTGAGAGTGTTTTGCCCCCCATTAAA
SEQ ID NO:625
GTACAGCGTGTATTTCTTGTTACGATACTTGAGGGGTTAGAGGCACCTACGAATTAGGAA
SEQ ID NO:626
ATATCCTTATGAATGAAGTTTGGATGATAAGTGGCGCCAGACTTTCTACTCACCCTTTTT
SEQ ID NO:627
TGATCACATCGTTGTTTGCAATAAGACGTCATCAATTTATATCATGACTCTACAGGGACA
SEQ ID NO:628
TTTTCCCAGTGTACTGCGAGAGTGATGCTACATAAGTTTACTCTTGTGTCTAACTTTTCC
SEQ ID NO:629
AGATTCTACAGATGGCGCTATACGAGCTGTTATACGGACATTTTATGACCATACACATCC
SEQ ID NO:630
TGCTACGGGAAACCAGGACAAAACTTGTAGGATTTGGGACATACGAAACTTATCTAAGTC
SEQ ID NO:631
CAAGTCATATAGTTACAGTGTCGCATGACAGAACAATTAAGCTCTGGACTAGTAACGACG
SEQ ID NO:632
TGCCACATCGTAACCATCATAGCACTTATCATCTAATTATGGTGAAAGGGAGTTATATAT
SEQ ID NO:633
GTTTATACTTATAAACAACAGAGAGACAACTGTACAGGTGTTGTAAACACTCCCAGTGTG
SEQ ID NO:634
CTGTGTTTTAGCCCGAGGGCCAATCACTTAGTTGCTACTTCGTGGGATAATCAGGTACG
G
SEQ ID NO:635
GCAAAGTAGAGTTTAAGTTTCGTTGTGCTTGGACCGGAAAACTCACATGCTTAGAGTTTA
SEQ ID NO:636
AAGATTTGGGCATAACTTGTATGAACTTTTTCTGTTGTCGACACTGTAATTACACGAGCT
SEQ ID NO:637
AAACAGATGCATGTATGCTTCATAACTCTATAGATATGGAAATGTCACTGTACACTGATC
SEQ ID NO:638
TTATTGGTGCACAGGACGGAAAATTGCGCATATATTCTATTTCAGGTGATACATTAACAG
SEQ ID NO:639
AGGCACAGACACTTGCCTAAACCAATATACAAGGCAGGTATTCTAAGGCGCACCGTGAA
T
SEQ ID NO:640
CATGCGAAGGTTTCTGGGAATTTTCAGTAGAAAATTCGGTCGTGGCGGCCATCCTCGAT
A
SEQ ID NO:641
TTAAGCTGATAGCTTTAGTTCCTACGTGGAATGTATAAATGCACCATTGTCCATAAGGCA
SEQ ID NO:642
GGATGCTCTGGTTACATGACTACTCCTTAGGGAATCAGTCAGACATTTTAAATAACTTCC
SEQ ID NO:643
TCATTAAGCGGTACTGGCAGAGGACATGTCTATTTATACAAGCAAATGGTCCTATTGGCT
SEQ ID NO:644
ATGTTGGTCAGACCTCAAATATTGTACTCCCCACACTAGGGAGCATTTACGGTGAATATA
SEQ ID NO:645
TCCTCTCGACCCTTAGAGTCCTCTGCGAATCTTGTTGTTAGTTACTGTGTACGCTGTAAC
SEQ ID NO:646
AAGCATGTTTTGAATTTATGGTGGTGGCATGTGGATATTTGAACTTGGTTGAGAAAAATT
SEQ ID NO:647
CATTCCTATTGAAGGGTCAACCTTTAATTTTGGCTAGCAGGACTGTATAGGATTATATGC
SEQ ID NO:648
TTATTGTATTTTAGATTCTTGATGGCCATCTAAACTTCTGGCTGCTTGGTGCAACATTGA
SEQ ID NO:649
ATAGCTAATGATTCCATGCTATCCATGGTATCTACTTCACGATAATAAAGGTCTTAGTCC
SEQ ID NO:650
CACCTAATAGGCCTGAGTATTGCTCACCACTATGCTGATATGGGGAGCAATAACGTTAGT
SEQ ID NO:651
TTTCTTTTCACTTTGTACTAATGATCATTGTGACCACAAAATCTTTATACACAATACAGA
SEQ ID NO:652
CTTGTCACTATCCTCATATTGATATCACCTCGTGTATGTTGTGGGGTGGCAAAATTACTT
SEQ ID NO:653
TATTTTAACTCAGCGACTTACCAGCCTAGTAAGCAATGGGGAGCTTGCATGTATTAGTTT
SEQ ID NO:654
ATTCGTCCTGGTCCTTTAGGACATGTACTTATGTCCATGCAAGTGCTTCTTGCCTAAGCT
SEQ ID NO:655
TTCTAGGCGATATATATCGCCGTAACTTTGGATGTGTTAAGAATATAGGGGATCATTAGC
SEQ ID NO:656
AGTTGCAGAGTGTGTAGCAACTGATGAGCATAGTTGTTATGTTTCTCAACTCAGTTGCAC
SEQ ID NO:657
AAGAAACTCATACACTGGACAGGCCAACCTTCCAAATATGTGTTTAGAAAACCTTTGTCT
SEQ ID NO:658
AAGGGGTGCTATCCATATCTAGAATCTACCATGCTCAATGAGGTATCTTCATTAGTATAC
SEQ ID NO:659
ATCTAATGCTAGTTTATTGATTTCTATGATCCAAGACCTCGTCATAGATCAAGTGCCTAG
SEQ ID NO:660
TTGTTATTAAATACCATTCAATATGCTTATGATTCATGAATGCTTAAGAGATTCTGCTGC
SEQ ID NO:661
GCTTCTAAACTGTAGAAGCCTGTTATCTTTAGACTCGTGGTTATGTGAACTACTTTTACA
SEQ ID NO:662
GGCTGTGGGGATTCGAGCCTGATGGTTATGCACTGTGGCCAGCAAGATGTTGAAGTTTT
A
SEQ ID NO:663
GCCTGATGGTTATGCACTGTAAGTGATCTGATTTGATTAACTATTTTATCAATTAATTTT
SEQ ID NO:664
ATGGTCATTATCCGAGATAGTGCGCTTTGTCATGGGAAAATGACTATTGAATGTGAGTTT
SEQ ID NO:665
TTTTCTGGTGCATCCTTAACACAGCTTGGTTACATGGTGAATTACAGTATTTGAAGGAGT
SEQ ID NO:666
AGATTTAATGCCACTTAGGTGATCGGTGACCCACTTGTACATATAGATGTTGGCGATGTT
SEQ ID NO:667
AAGAAATTCATCAATTCTTTGAAATTATTGTTCCCTTTTGATGCGGCCCCTTTCTGGAGG
SEQ ID NO:668
TAAAGTATATTTTAGCCGCTGTTGTTGTAAATTTATGTTTTTCATTGCTATCAACATTTA
SEQ ID NO:669
GGTTTTCCTATAAGATGTATGAATTCGCACTGTGGTGCAATTTTATGAATTAAACTCAAA
SEQ ID NO:670
TTTACTATTCCGTCTGGGCTTAGAGATGTACGTTAATTGGTCATTTAAGACGACTCAGTT
SEQ ID NO:671
TCAAATCTAGTCAATATCCGTGTTGAGCTAAACAAGCGCTGAAAGTTTGCTCGAATCAGC
SEQ ID NO:672
AGAAAGTTGTGTACTAATTTGTATTGTAACGTCCATTTATCCAACGAGTCCTCCATTCAT
SEQ ID NO:673
CAGTACTGTATTCGAAGATCCTGAAAATTTACTAAAACAAATGGAATATCAACAACCTAG
SEQ ID NO:674
TTGCTCTATATAATTTGTGCTCGTGTGTGTACTTGAAGATCCATCCTCACATAGTCCAAT
SEQ ID NO:675
GTGTGTATAGTTTTATAACACTCTATGGTATCACTACCACTATGGGCCTGTTTAGTCCAA
SEQ ID NO:676
GAAGCAGAATCAGCTTTGACCAGTATTTAGTGTCTTGTATACAATTCTTGTTTCAGTGAA
SEQ ID NO:677
AAATCAAGATTAAAATCCGAAACCAAGGCTAACCAGCAAACTGTGAGGTGTACATTGTTG
SEQ ID NO:678
TTCCAAGCAGAAGGGCACATGTTGTGACATCAAGTAGTAGATTGTTCTGCAGATTCTGGT
SEQ ID NO:679
GTTAATGTAATACATTTAGTTTTTAGATAACTGTTAATGTGTAGTAAAGCACTAGGAAGA
SEQ ID NO:680
GAGGCTTCAAAGGTTTTTGTGTCTTTTCTAGTTATTATAAACGCTTCATAGGTTCCTAGG
SEQ ID NO:681
GAAGATTGTAAGTTGGGTGAACTTTTTTACCACGCTAGGTTGATCTATTTTGACTCTT
SEQ ID NO:682
AAAATAGCTGCGCGTACCACAAAGGTGACAAACGCCGGATTTCTCTTATCAGACTTGTCA
SEQ ID NO:683
TTTAATTATCATAGTTTTATTCCGGCTATCTTGATCATTCACGGAAGTCCCGAGAGTCAA
SEQ ID NO:684
GTGGAGTGAACGTGGTTACTTCAATGGATTACCCTTCTATCGTGTCATTAAACACTTTGT
SEQ ID NO:685
GCTAACTCTTCTAGTTGAGATCTCCATCAATTAATGGATACAAACATTGAGTTTCACTTT
SEQ ID NO:686
GGATCACTACTGGATTCCGTTACATTAGTTATTGCAAGTTGGTTATTATGTACGTTTATA
SEQ ID NO:687
ATGAACAAATGCAATTACCCTGTTTTATTCTATCCCGCTTTAATTAATATTGGTCATGTT
SEQ ID NO:688
TTTGCTTGTGGATTGTACTGTGGTACATGGTATAAATCTATAGGCTATGTCGATTATTTT
SEQ ID NO:689
ATATAAGATATAAGATATTGCCAGCAAACTATTTGACAGGTTATTTAATAAAGTGTGCTA
SEQ ID NO:690
TTTTAAATGTGGACAGAGGCACTATAAGAATGCGAAATATCGTCGGAGCACGACTAATTG
SEQ ID NO:691
ATAGACTAGTTCTACAAAGCCCTAGGATGATGGACTTCATTTCTTTTGCATTAAGATGAA
SEQ ID NO:692
GATTTCTTATGGGGTTGGAACATTCCTCGCTGCCTTCTGGTAATATTAGGTTATGCGTTT
SEQ ID NO:693
AATTGAGGTTGACTGTGTACTTCTCCAGTGGACAGGAGAAAGCGATAAAATTCAAACGTT
SEQ ID NO:694
AAGGAAGGGCAAATAGAGCTCGCGCTCAAGAAATACCTTAAATCGATACGGTATTTGGAT
SEQ ID NO:695
TAATTTAAGAGCTATGAAACAACTACCTTTTGGAATGGTTTTGTTTTTAGCATCCCAATT
SEQ ID NO:696
TTGTAAATTATGCTGGTTCCATATGGGGGTTAATCAGTATCCTGGTTATTTGTGACACCA
SEQ ID NO:697
GTTGTGAACTATCAATAGACGGGGATGGTCCTTTTTAGCTGCTCCTTAAGCAGCTCAAAT
SEQ ID NO:698
TCAATTCCGGTCATATGTAGACGACTATAATGTTGTTTGTGTCCTATAACTATAGTGTTG
SEQ ID NO:699
CATTTTACACCCTATAACAAAATATAGTGTCATAAGTTTACACCAGGTAACAACTCTATA
SEQ ID NO:700
ATGGAGAGTTTTATTCATTACATGAAAGAGTATGTCACCTTTCGTGCTCCATCTATTGAT
SEQ ID NO:701
TTTCACGTCCTGTATACTCACTCAAGCAACTTTAGGATGAAGAGCTAAAGTATATCAAAG
SEQ ID NO:702
AATGCACTCTTTATAAAGTGGGATGAGGTATGTGTTTCCTTCCTATTGGCTAACCTGAAT
SEQ ID NO:703
ATTGGGCAATCGTTATTGATTTTACCTATCGCTATCTCACTGTCCGCCAATTTAGTGTAA
SEQ ID NO:704
TTTCAGCGGATATAAAGTCTTCCAACTTGTAAACCGGTGCTGTGAAGATTAAAAGTCCTT
SEQ ID NO:705
GCTTTAGAGGCAATGGTAGATTATGAAGTCAACACCAGGGAGTTTGACCGTTTGGGACA
T
SEQ ID NO:706
CATTCAATTTGACATTGGAGTTTCAAGGCATTCCAAGGATAGCATGTACACAAGTTGAAT
SEQ ID NO:707
CATAAAATTACTATGGAAGTTGGATCATTATCTATGCCATAGTGGAGTAGAACTAGATTT
SEQ ID NO:708
CTCTTGATTCTAGAATCTAAACTACTACCTTGCGGACATGACTGAGCATCTCTCTAACAG
SEQ ID NO:709
CAGGGTTGTGCTAGTTTAACATTTTAACTTAATGTAATCATGTAAGCTTTAGAGAGGTGG
SEQ ID NO:710
GTAAATGTTTACATTGAGGTCATGCATGAGTGTTAATTACGCTTTCACTACTGTTCACTT
SEQ ID NO:711
AATTAAAGCTTGGTTGTATGATCATTTGGGATCGAGAGTAGATTATGATGCTCCTGGGCA
SEQ ID NO:712
TTATCTAGCTAGAAGTTGTGAAATTAAGAGGGATGTGAGGATTGGGTTATAACTAGTGTA
SEQ ID NO:713
AATGAATCAGGCATTAAAGCGGGAATCATTTATGACTTGGCAACCTGAAAATTCTATTAA
SEQ ID NO:714
TTCTTGACGTTTTAATATGGTATGGTATTAAATTTGGAAGGCCTATTCGATTGTTTGCAA
SEQ ID NO:715
TTCTTATAACCTGTACGATTGCCGATATATCACCAATTTTGCTGATTTTAATCTGAGTTT
SEQ ID NO:716
CAATTTCATATTCGGGTTCAATGTAGTGCCTCTCATTTTAGGGTGATAGCATGAGTTTTT
SEQ ID NO:717
TCCACAAGTTAACATAGGTAACTATCGACTGAAGTGAACTGGGGGGCAGAAGCTAACTA
T
SEQ ID NO:718
TTTAGATAGCCATTTACATTTTACTTATTATTGGACTTGTAAAGATTTTTGTACCCTTGT
SEQ ID NO:719
TTGCTGAAATATTTCAAGCTGAAAGTTATGATTCTGGCCAAGAAGTCTACTGAAAATTTG
SEQ ID NO:720
AAACATAAGTTTGGCCCAGATTCGGTTTATCATAAAATCTGGCTGCATATAAGGTGTCAG
SEQ ID NO:721
ATGTTCTAGAATTTGTCTAAGCTAGCTACTGGTGTTTAACTGATATGGAAAACTTTTGCC
SEQ ID NO:722
TTTGGGGAGTACTTTAGTCAATAAAAGTGAAGTGAATCATGATATAAAGGGTTTAAGTAA
SEQ ID NO:723
AGAAGTTACTAATTTGTAGATAAATTCTAACGAAGGTGATGATAGCATACACGTAATGAA
SEQ ID NO:724
GMTTTTGATGGTAGCGTATGGTTGAAGGAAAACTTGGATATATCATGTAAACATTTTTC
SEQ ID NO:725
TTAATGAACCGCTTTTTCCTTGAGAGGCTATGAATGCCTGTAGAACTAATCCTTTAAGTA
SEQ ID NO:726
TTTCTCTAACACTATATTTTCTGGTATGACCGCTCTACATTGTATATTAACCCTTGCAAA
SEQ ID NO:727
TATATTCACTGTGCTGGGATTATCCTCTCCCCTTTTTGACCCACTGTTGTGTGTATTTGA
SEQ ID NO:728
GAGCATACAGCGTTATCTTTGAGACGAGTCATCAATGATAATATCCTCGTAAAAGGTTAC
SEQ ID NO:729
TTTATTCAATTACGACGGATTCAGTTGGCCTTTTGTAACATTCAAGTATCCATCTATCAC
SEQ ID NO:730
ATGTTCAGGGGTATTAAAAATTCAGAGGATAAATTTCCTCACTCTCAAGTGTTAGATGGT
SEQ ID NO:731
CAAAGTCTAGACGTTAATGTTTTGGAACTCTTTTTTCGAATTTGTGCCTATTGAATCACT
SEQ ID NO:732
TATAAATATATTGTACTGGGGATCCAAGACATGGCAATATATGTCGAGATTTTCATTTTC
SEQ ID NO:733
CTTTTGCATGAGTTCAAATGTCTTTGTGACATATTGTCTTGAACCACCGAGGATATATCA
SEQ ID NO:734
GTTTGTATGTCCAATAGATTATAACCTATTTACTGTGACACTATTCTTCACACCCATGTC
SEQ ID NO:735
AGATCTAGTTGTTTCAGCATCGTTGGACCAAACTGTTCGTGTATGGGATATAAGTGGCCT
SEQ ID NO:736
TGCCGTATCAAAAGATTGGTACTTCCTTATGGACACACAAGATCGTAAGCATGGCTGAAT
SEQ ID NO:737
TTGATGGCCACATGAGTTGTTTATACAAGTCGTTGTTTTATGAGAGAACCTTCTTCAGAT
SEQ ID NO:738
ATTTCTATAGTGCCATATGCTTGTCGGTTGTCATTGACCTCTAATAGAATAGCCAGAGTA
SEQ ID NO:739
TTCACGGCAGTTGAACTAGTCATAGTGGAATATTATTTAAATGGTGTATTCTAGTCACAT
SEQ ID NO:740
TGCAGGCGCTCTATAGTTCTGTTCTCTAGCATGAAGTGTGTATTTTATCTATTGTGGACC
SEQ ID NO:741
TGTCTTTAATCTTCAGGGTTCGTTACTAACAATTGAGCTCAAATCTCTATTCTGACCAGC
SEQ ID NO:742
CATTTATAGAGTTGTGCAAAATCACCCATAATGCTATGAATTGACAGGTGACTGTAATCT
SEQ ID NO:743
GGAGAAAATTTCCTATCCCTTTGTGGGTGTGTGAAAAACGAAATATAGAGGAACAATGTG
SEQ ID NO:744
ACCAATCATTTATTTGCAGTGTAGTTGATATGAAGGGAGAAATATGACAGTTGGTTTCAA
SEQ ID NO:745
AAGTTAATGTTCTCATAGGTTATTCATTGGAGTTGTCTCGTATGTACGCTGTGCCGTAGT
SEQ ID NO:746
CTCATAAATTGAGGCTTGCCTACGTTAATTGTTATATATGGAGAGCCATGCTAATTGTTA
SEQ ID NO:747
GCAGATCATGTAATTGTATCTCAAATTATAGTATCCGTATTCTGTACAAATGCTCCGGAA
SEQ ID NO:748
TCTTTACGCAGATGGTGACTGAAGCTGGTTCCGAGATCGGCATATGTAGCTGGTAGAGG
T
SEQ ID NO:749
TTCACATTGAGGGTTGCCGTCGGTATTCGCCGATGATATCCTGTTTTACGCGCAACAGTT
SEQ ID NO:750
TCATTATTTAGGGTGCAGGCTGTATAAAATGTTGTAAATTGTAGTATCAATGTGTACAAT
SEQ ID NO:751
GCATTCACCACGACAGTAAAGTAATCATTATGATTACTAATGTATTGCTTTCATGGGGTG
SEQ ID NO:752
AAAGGGTATATTTTGTCTCATGTTGGGGTGATAATTCTCCCTGAAAGTCTCCAAAATATA
SEQ ID NO:753
AAATTTCCGGTTGCCATAGTCTAGTGGGGTGAGGGTTCATTCTAGGGGATTTATTGTGTT
SEQ ID NO:754
GCAGTGATAAAGGTACTTCTTGGTGATAATCCTAAAGCCTTACCCATGGATATCCAGCCT
SEQ ID NO:755
TTCTTTAACAAGGTAAAAATCCCCCCCTTGGCATGTAGCTCAATTAGTTGTAATGGAACT
SEQ ID NO:756
AGTTGTAAACAGTGTAATAAGGAGCAGAAGTTGTGATAGCTTTTAGGAACGATAGACTTT
SEQ ID NO:757
TGAACCAATTCTTGTATATTAGATATGTAACATGTATGAATGTCCATAGAGCAGAGCTTT
SEQ ID NO:758
AGCCAGGCACGCTTAACTAAATTTCGTTTAGTTCACCATGACTATTCGTTGAACTTAATG
SEQ ID NO:759
CAAAACCCCTTGTAGGGTGGACTTCTGTTGTATCCAATTTTTATGGCATAATTAGCTAGT
SEQ ID NO:760
AATTTGGTGATTATTCCTTACCATATCGTACTGTACAGATACGGTAAGGTCGAAATATAT
SEQ ID NO:761
CATGCCGTGATCGGTCGATTGCATTAAGTGCTGCAAGGATCAAATAGTGGCACTGTCAT
G
SEQ ID NO:762
CAAACATAAATAAGGTTGCTACTTTAAAGGGACATACGGAACGAGTTACTGATGTGGCAT
SEQ ID NO:763
ATTTATGGATGAGGTACTCCTTATGAATATCTTCAAACTAAGAAATAACTATATATGCAA
SEQ ID NO:764
CTTGGTTTTTGTTGAGCTTTCTATTTCAAGCAATTTGTGATTGGGGGGTTCTGCATTCTT
SEQ ID NO:765
ATGTCTAAAGAGCCGTGATCTATGAGTAGATTAGAAACCGCCTTTTTAGTTGCAAACGCC
SEQ ID NO:766
TTGCAACAAGGTATACTTAGTCAGTCCTTGTTATGTATGTCTTTTGTCAACCCTTCAGGG
SEQ ID NO:767
GGCGGAATCCCTTTGTTCTTTCGAGCTTTACGTGACAAGTCGGCCAGAAAGCAGTAGCA
T
SEQ ID NO:768
TTGATGTACGAGCCGCTATATCTAATTCTGCCTCCCAGTCACTGCCAAGTTTTACTCTTC
SEQ ID NO:769
GTCTTGCATGTCAGCTATTATACAGTCCTGTTTATAGTCCTGTGATGTAATAAAAAGCTG
SEQ ID NO:770
AAGTAGGAGATCGTGTAGAGAGAATACTTTCTGCTCTCAGCGGCGAAGAGGTTTGTCTG
C
SEQ ID NO:771
AATTGTGAGTAGAATAGGAGAAACTTTTGTACAAGATTAATACGTGTGGCATAATAAGAT
SEQ ID NO:772
TGATGTGCAGTTTACATTATTATGGTTCGAGTATTATTTAGCTGCCCTATCTTAAGTCAT
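
The short nucleotide entries in the listing above are plain DNA strings of roughly 60 bases, some of them wrapped onto a second line. The following is a minimal sketch of how such an entry could be sanity-checked after transcription. It assumes an unambiguous A/C/G/T alphabet, and every function name here is hypothetical and not part of the patent.

```python
# Hypothetical helpers for checking a transcribed oligonucleotide entry.
# Assumption: valid entries use only the four unambiguous DNA bases.
VALID_BASES = set("ACGT")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def join_wrapped(lines):
    """Join a sequence that the listing wrapped across several lines."""
    return "".join(line.strip() for line in lines)


def invalid_positions(seq):
    """Return (index, character) pairs for characters other than A, C, G, T."""
    return [(i, ch) for i, ch in enumerate(seq.upper()) if ch not in VALID_BASES]


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_content(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence must not be empty")
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)
```

A character flagged by `invalid_positions` (for example an `I` or `M` in a nucleotide line) most likely marks a transcription error and is worth checking against the official sequence listing.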

Claims (104)

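1. An isolated polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof.
2. The isolated polynucleotide of claim 1, comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-12, 14-58, 60-62, 64-70, 72-75, 77-83, 85-86, 88-91, 93-119, 121-130, 132-148, 150-156, 158-191, 193-207, 209-218, 220-221, 223-231, 233-237 and conservative variants thereof.
3. The isolated polynucleotide of claim 1, comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-76, 78-103, 106, 108-113, 116-121, 124-125, 128-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 208-213 and 215-234 and conservative variants thereof.
4. The isolated polynucleotide of claim 1, comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-12, 14, 16-26, 30-37, 40-41, 43-58, 60-62, 64-70, 72-75, 78-83, 85-86, 88-91, 93-103, 106, 108-113, 116-119, 121, 124-125, 128-130, 132-147, 150-152, 154-155, 161-162, 164-172, 174, 177-183, 185-191, 193-197, 200-204, 209-213, 215-218, 220-221, 223-231 and 233-234 and conservative variants thereof.
5. The isolated polynucleotide of claim 1, wherein the polynucleotide has a sequence comprised in a gene expressed in a wild-type plant of a species of Eucalyptus or Pinus.
6. The isolated polynucleotide of claim 1, wherein the variant has a sequence identity to any one of SEQ ID NOs: 1-237 that is greater than or equal to 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61% or 60%.
7. The isolated polynucleotide of claim 1, wherein the polynucleotide encodes a protein selected from the group consisting of a cyclin, a cyclin-dependent kinase, a cyclin-dependent kinase inhibitor, a histone acetyltransferase, a histone deacetylase, a peptidyl-prolyl cis-trans isomerase, a retinoblastoma-related protein, a WEE1-like protein, and a WD40 repeat protein.
8. The isolated polynucleotide of claim 7, wherein the variant encodes a protein having an amino acid sequence with greater than 60%, 65%, 70%, 75%, 80%, 85% or 90% sequence identity to any one of SEQ ID NOs: 261-497, and wherein the protein encoded by the polynucleotide has the activity of the protein encoded by any one of SEQ ID NOs: 1-237.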
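9. A DNA construct comprising at least one polynucleotide having the sequence of any one of SEQ ID NOs: 1-237 and conservative variants thereof.
10. The DNA construct of claim 9, further comprising a promoter, wherein the promoter and the polynucleotide are operably linked.
11. The DNA construct of claim 10, wherein the promoter is selected from the group consisting of a constitutive promoter, a strong promoter, an inducible promoter, a regulatable promoter, a temporally regulated promoter, and a tissue-preferred promoter.
12. The DNA construct of claim 9, wherein the polynucleotide encodes an RNA transcript.
13. The DNA construct of claim 12, wherein the polynucleotide is in a sense or antisense orientation relative to the promoter.
14. The DNA construct of claim 12, wherein the RNA transcript induces RNA interference of a polynucleotide having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
15. A plant cell transformed with the DNA construct of claim 9.
16. A transgenic plant comprising the plant cell of claim 15.
17. The transgenic plant of claim 16, wherein the phenotype of the plant differs from the phenotype of a plant of the same species that has not been transformed with the DNA construct.
18. The transgenic plant of claim 17, wherein the phenotype that differs in the transgenic plant is selected from the group consisting of lignin quality, lignin structure, wood composition, wood appearance, wood density, wood strength, wood stiffness, cellulose polymerization, fiber dimensions, lumen size, other plant components, plant cell division, plant cell development, number of cells per unit area, cell size, cell shape, cell wall composition, rate of wood formation, aesthetic appearance of wood, formation of stem defects, average microfibril angle, width of the S2 cell wall layer, rate of growth, rate of root formation, ratio of root to branch vegetative development, leaf area index, and leaf shape.
19. The transgenic plant of claim 16, wherein the plant is a woody plant.
20. The transgenic plant of claim 19, wherein the plant is a tree.
21. The transgenic plant of claim 20, wherein the plant is of a species of Eucalyptus or Pinus.
22. The transgenic plant of claim 16, wherein, as compared to a plant of the same species that has not been transformed with the DNA construct, the plant exhibits one or more traits selected from the group consisting of increased drought tolerance, herbicide resistance, reduced or increased height, reduced or increased branching, enhanced cold and frost tolerance, improved vigor, enhanced color, enhanced health and nutritional characteristics, improved storage, increased yield, enhanced salt tolerance, enhanced resistance of the wood to decay, enhanced resistance to fungal diseases, altered attractiveness to insect pests, enhanced heavy metal tolerance, increased disease tolerance, increased insect tolerance, increased water-stress tolerance, enhanced sweetness, improved texture, decreased phosphate content, increased germination, increased micronutrient uptake, improved starch composition, improved flower longevity, production of novel resins, and production of novel proteins or peptides.
23. The transgenic plant of claim 16, wherein, as compared to a plant of the same species that has not been transformed with the DNA construct, the plant exhibits one or more traits selected from the group consisting of a reduced period of juvenility, an increased period of juvenility, a propensity to form reaction wood, self-abscising branches, accelerated reproductive development, or delayed reproductive development.
24. An isolated polynucleotide comprising a sequence encoding the catalytic domain or substrate-binding domain of a polypeptide selected from any one of SEQ ID NOs: 261-497, wherein the polynucleotide encodes a polypeptide having the activity of said polypeptide selected from any one of SEQ ID NOs: 261-497.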
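25. A method of making a transformed plant, comprising: transforming a plant cell with a DNA construct comprising at least one polynucleotide having the sequence of any one of SEQ ID NOs: 1-237; and culturing the transformed plant cell under conditions that promote growth of a plant.
26. The method of claim 25, wherein the DNA construct further comprises a promoter, wherein the polynucleotide and the promoter are operably linked.
27. The method of claim 25, wherein the at least one polynucleotide encodes a protein selected from the group consisting of a cyclin, a cyclin-dependent kinase, a cyclin-dependent kinase inhibitor, a histone acetyltransferase, a histone deacetylase, a peptidyl-prolyl cis-trans isomerase, a retinoblastoma-related protein, a WEE1-like protein, and a WD40 repeat protein.
28. The method of claim 25, wherein the plant cell is located within plant explant tissue.
29. The method of claim 25, wherein the transgenic plant exhibits a phenotype that differs from that of a plant of the same species that has not been transformed with the DNA construct.
30. The method of claim 25, wherein the phenotype that differs in the transgenic plant is selected from the group consisting of lignin quality, lignin structure, wood composition, wood appearance, wood density, wood strength, wood stiffness, cellulose polymerization, fiber dimensions, lumen size, other plant components, plant cell division, plant cell development, number of cells per unit area, cell size, cell shape, cell wall composition, rate of wood formation, aesthetic appearance of wood, formation of stem defects, average microfibril angle, width of the S2 cell wall layer, rate of growth, rate of root formation, ratio of root to branch vegetative development, leaf area index, and leaf shape.
31. The method of claim 25, wherein, as compared to a plant of the same species that has not been transformed with the DNA construct, the transgenic plant exhibits one or more traits selected from the group consisting of increased drought tolerance, herbicide resistance, reduced or increased height, reduced or increased branching, enhanced cold and frost tolerance, improved vigor, enhanced color, enhanced health and nutritional characteristics, improved storage, increased yield, enhanced salt tolerance, enhanced resistance of the wood to decay, enhanced resistance to fungal diseases, altered attractiveness to insect pests, enhanced heavy metal tolerance, increased disease tolerance, increased insect tolerance, increased water-stress tolerance, enhanced sweetness, improved texture, decreased phosphate content, increased germination, increased micronutrient uptake, improved starch composition, improved flower longevity, production of novel resins, and production of novel proteins or peptides.
32. Wood obtained from a transgenic tree that has been transformed with the DNA construct of claim 9.
33. Wood pulp obtained from a transgenic tree that has been transformed with the DNA construct of claim 9.
34. The wood pulp of claim 33, wherein the DNA construct comprises a nucleotide sequence encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 261-497.
35. A method of making wood, comprising: transforming a plant with a DNA construct comprising a polynucleotide having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; culturing the transformed plant under conditions that promote growth of a plant; and obtaining wood from the plant.
36. A method of making wood pulp, comprising: transforming a plant with a DNA construct comprising a polynucleotide having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; culturing the transformed plant under conditions that promote growth of a plant; and obtaining wood pulp from the plant.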
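37. An isolated polypeptide comprising an amino acid sequence encoded by the isolated polynucleotide of claim 1.
38. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 261-497.
39. A method of altering a plant phenotype of a plant, comprising altering expression in the plant of a polypeptide encoded by any one of SEQ ID NOs: 1-237.
40. The method of claim 39, wherein the expression is up-regulated, down-regulated, silenced, or developmentally regulated.
41. The method of claim 39, wherein the plant phenotype is selected from the group consisting of lignin quality, lignin structure, wood composition, wood appearance, wood density, wood strength, wood stiffness, cellulose polymerization, fiber dimensions, lumen size, other plant components, plant cell division, plant cell development, number of cells per unit area, cell size, cell shape, cell wall composition, rate of wood formation, aesthetic appearance of wood, formation of stem defects, average microfibril angle, width of the S2 cell wall layer, rate of growth, rate of root formation, ratio of root to branch vegetative development, leaf area index, and leaf shape.
42. The method of claim 39, wherein, as compared to a plant of the same species that has not been transformed with the DNA construct, the plant exhibits one or more traits selected from the group consisting of increased drought tolerance, herbicide resistance, reduced or increased height, reduced or increased branching, enhanced cold and frost tolerance, improved vigor, enhanced color, enhanced health and nutritional characteristics, improved storage, increased yield, enhanced salt tolerance, enhanced resistance of the wood to decay, enhanced resistance to fungal diseases, altered attractiveness to insect pests, enhanced heavy metal tolerance, increased disease tolerance, increased insect tolerance, increased water-stress tolerance, enhanced sweetness, improved texture, decreased phosphate content, increased germination, increased micronutrient uptake, improved starch composition, improved flower longevity, production of novel resins, and production of novel proteins or peptides.
43. A polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 471-697.
44. The polynucleotide of claim 43, wherein the polynucleotide comprises fewer than about 100 nucleotide bases.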
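45. A method of correlating gene expression in two different samples, comprising: detecting a level of expression in a first sample of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting a level of expression of the one or more genes in a second sample; comparing the level of expression of the one or more genes in the first sample to the level of expression of the one or more genes in the second sample; and correlating a difference in expression level of the one or more genes between the first and second samples.
46. The method of claim 45, wherein the first sample and the second sample are each from a different type of plant tissue.
47. The method of claim 45, wherein the first sample and the second sample are from the same tissue, and wherein the first sample and the second sample are each harvested during a different season of the year.
48. The method of claim 45, wherein the first sample and the second sample are obtained from plants in different stages of development.
49. A method of correlating the possession of a plant phenotype to the level of gene expression in the plant of one or more genes, comprising: detecting a level of expression, in a first plant possessing a phenotype, of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting a level of expression of the one or more genes in a second plant lacking the phenotype; comparing the level of expression of the one or more genes in the first plant to the level of expression of the one or more genes in the second plant; and correlating a difference in expression level of the one or more genes between the first and second plants to possession of the phenotype.
50. A method of correlating gene expression to a stage of the cell cycle, comprising: detecting a level of expression, in a first plant cell in a first stage of the cell cycle, of one or more genes encoding a product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237 and conservative variants thereof; detecting a level of expression of the one or more genes in a second plant cell in a second, different stage of the cell cycle; comparing the level of expression of the one or more genes in the first plant cell to the level of expression of the one or more genes in the second plant cell; and correlating a difference in expression level of the one or more genes between the first and second samples to the first or second stage of the cell cycle.
51. The method of claim 45, wherein the first and second samples are both obtained from a plant tissue selected from the group consisting of vascular tissue, apical meristem, vascular cambium, xylem, phloem, root, flower, cone, fruit, and seed.
52. The method of claim 51, wherein the plant tissue of the first sample and the plant tissue of the second sample are each obtained from a different type of tissue.
53. The method of claim 51, wherein the first and second samples are each obtained from a plant tissue in a different stage of development.
54. The method of claim 45, wherein the first and second samples are each obtained from plant cells in a different stage of the cell cycle.
55. The method of any one of claims 49 or 50, wherein the first and second plants or plant cells are both of the same species selected from Eucalyptus and Pinus.
56. The method of any one of claims 49 or 50, wherein the first and second plants or plant cells are of a species selected from Eucalyptus grandis or Pinus radiata.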
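57. The method of any one of claims 45, 49 or 50, wherein the detecting step is effected using one or more polynucleotides capable of hybridizing under standard hybridization conditions to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
58. The method of any one of claims 45, 49 or 50, wherein the detecting step is effected using one or more polynucleotides capable of hybridizing under standard hybridization conditions to a gene product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
59. The method of any one of claims 45, 49 or 50, wherein the detecting step is effected by hybridization to a labeled nucleic acid.
60. The method of claim 56, wherein the one or more polynucleotides are labeled with a detectable label.
61. The method of claim 57, wherein at least one of the one or more polynucleotides hybridizes to a 3′ untranslated region of one of the one or more genes.
62. The method of claim 58, wherein at least one of the one or more polynucleotides hybridizes to the 3′ untranslated region of one of the one or more genes.
63. The method of claim 57, wherein the one or more polynucleotides comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 471-697.
64. The method of claim 58, wherein the one or more polynucleotides comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 471-697.
65. The method of claim 57, wherein the one or more polynucleotides are selected from the group consisting of DNA and RNA.
66. The method of claim 58, wherein the one or more polynucleotides are selected from the group consisting of DNA and RNA.
67. The method of any one of claims 45, 49 or 50, further comprising, prior to the detecting steps, the step of amplifying the one or more genes in the first and second plants or plant cells.
68. The method of any one of claims 45, 49 or 50, further comprising, prior to the detecting steps, the step of labeling the one or more genes in the first and second plants or plant cells with a detectable label.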
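69. A combination for detecting expression of one or more genes, comprising two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
70. A combination for detecting expression of one or more genes, comprising two or more oligonucleotides, wherein each oligonucleotide is capable of hybridizing to a gene product encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
71. The combination of claim 69, wherein each of the two or more oligonucleotides hybridizes to a different one of the nucleic acid sequences selected from the group consisting of SEQ ID NOs: 1-237.
72. The combination of claim 70, wherein each of the two or more oligonucleotides hybridizes to a nucleotide sequence encoded by a different one of the nucleic acid sequences selected from the group consisting of SEQ ID NOs: 1-237.
73. The combination of claim 69, wherein at least one of the two or more oligonucleotides hybridizes to a 3′ untranslated region of a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
74. The combination of claim 70, wherein at least one of the two or more oligonucleotides hybridizes to a nucleic acid sequence that is complementary to a 3′ untranslated region of a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-237.
75. The combination of any one of claims 69 or 70, wherein each of the two or more oligonucleotides comprises fewer than about 100 nucleotide bases.
76. The combination of claim 69, wherein at least one of the two or more oligonucleotides comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 471-697.
77. The combination of claim 70, wherein at least one of the two or more oligonucleotides comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 471-697.
78. The combination of claim 69, wherein each of the two or more oligonucleotides hybridizes to a gene encoding a protein selected from the group consisting of a cyclin, a cyclin-dependent kinase, a cyclin-dependent kinase inhibitor, a histone acetyltransferase, a histone deacetylase, a peptidyl-prolyl cis-trans isomerase, a retinoblastoma-related protein, a WEE1-like protein, and a WD40 repeat protein.
79. The combination of claim 70, wherein each of the two or more oligonucleotides hybridizes to a nucleic acid sequence encoded by a gene encoding a protein selected from the group consisting of a cyclin, a cyclin-dependent kinase, a cyclin-dependent kinase inhibitor, a histone acetyltransferase, a histone deacetylase, a peptidyl-prolyl cis-trans isomerase, a retinoblastoma-related protein, a WEE1-like protein, and a WD40 repeat protein.
80. The combination of claim 78, wherein each of the two or more oligonucleotides hybridizes to a gene encoding a different one of the proteins.
81. The combination of claim 79, wherein each of the two or more oligonucleotides hybridizes to a nucleic acid sequence encoded by a gene encoding a different one of the proteins.
82. The combination of claim 78, wherein each of the two or more oligonucleotides hybridizes to a different gene.
83. The combination of claim 79, wherein each of the two or more oligonucleotides hybridizes to a nucleic acid sequence encoded by a different gene.
84. The combination of any one of claims 69 or 70, comprising from about 2 to about 5000 of the two or more oligonucleotides.
85. The combination of any one of claims 69 or 70, wherein each of the two or more oligonucleotides is labeled with a detectable label.
86. A microarray comprising the combination of any one of claims 69-85 provided on a solid support, wherein each of the two or more oligonucleotides occupies a unique location on the solid support.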
87.一种用于检测样品中一个或一个以上基因的方法,所述方法包括:使所述样品与两个或两个以上寡核苷酸接触,其中每个寡核苷酸均能够在标准杂交条件下与包含选自由SEQ ID NO:1-237组成的群组的核酸序列的基因杂交;和检测与所述一个或一个以上寡核苷酸杂交的所述感兴趣的一个或一个以上基因。
88.一种用于检测样品中由一个或一个以上基因所编码的一个或一个以上核酸序列的方法,所述方法包括:使所述样品与两个或两个以上寡核苷酸接触,其中每个寡核苷酸均能够在标准杂交条件下与由包含选自由SEQ ID NO:1-237组成的群组的核酸序列的基因所编码的核酸序列杂交;和检测与所述一个或一个以上寡核苷酸杂交的所述一个或一个以上核酸序列。
89.根据权利要求87所述的方法,其中所述两个或两个以上寡核苷酸的每一者与包含选自由SEQ ID NO:1-237组成的群组的核酸序列的不同序列的基因杂交。
90.根据权利要求88所述的方法,其中所述两个或两个以上寡核苷酸的每一者与由包含选自由SEQ ID NO:1-237组成的群组的核酸序列的不同序列的基因所编码的核酸序列杂交。
91.根据权利要求87所述的方法,其中所述两个或两个以上寡核苷酸的至少一者与一基因的3′非翻译区杂交,所述基因包含选自由SEQ ID NO 1-237组成的群组的核酸序列。
92.根据权利要求88所述的方法,其中所述两个或两个以上寡核苷酸的至少一者与与一基因的3′非翻译区互补的核酸序列杂交,所述基因包含选自由SEQ ID NO 1-237组成的群组的核酸序列。
93.根据权利要求87或88中任一权利要求所述的方法,其中所述两个或两个以上寡核苷酸的每一者包含少于约100个核苷酸碱基。
94.根据权利要求87所述的方法,其中所述两个或两个以上寡核苷酸的至少一者包含选自由SEQ ID NO 471-697组成的群组的核酸序列。
95.根据权利要求88所述的方法,其中所述两个或两个以上寡核苷酸的至少一者包含选自由SEQ ID NO 471-697组成的群组的核酸序列。
96.根据权利要求87所述的方法,其中所述两个或两个以上寡核苷酸的每一者与编码蛋白质的基因杂交,所述蛋白质是选自由周期素、周期素依赖性激酶、周期素依赖性激酶抑制剂、组蛋白乙酰基转移酶、组蛋白去乙酰化酶、肽基-脯氨酰基顺-反异构酶、视网膜母细胞瘤相关蛋白、WEE1样蛋白和WD40重复蛋白组成的群组。
97.根据权利要求88所述的方法,其中所述两个或两个以上寡核苷酸的每一者与由编码蛋白质的基因所编码的核酸序列杂交,所述蛋白质是选自由周期素、周期素依赖性激酶、周期素依赖性激酶抑制剂、组蛋白乙酰基转移酶、组蛋白去乙酰化酶、肽基-脯氨酰基顺-反异构酶、视网膜母细胞瘤相关蛋白、WEE1样蛋白和WD40重复蛋白组成的群组。
98.根据权利要求96所述的方法,其中所述两个或两个以上寡核苷酸的每一者与编码所述蛋白质中不同蛋白质的基因杂交。
99.根据权利要求97所述的方法,其中所述两个或两个以上寡核苷酸的每一者与由编码所述蛋白质中不同蛋白质的基因所编码的核酸序列杂交。
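100. The method according to either of claims 87 or 88, wherein the two or more oligonucleotides are disposed on a solid support, and wherein each of the two or more oligonucleotides occupies a unique location on the solid support.
101. The method according to claim 100, wherein the solid support comprises from about 2 to about 5000 of the two or more oligonucleotides.
102. The method according to either of claims 87 or 88, further comprising the step of amplifying the one or more genes or nucleic acid sequences in the sample prior to the contacting step.
103. The method according to either of claims 87 or 88, further comprising the step of labeling the one or more genes or nucleic acid sequences in the sample with a detectable label prior to the contacting step.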
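104. A kit for detecting gene expression, the kit comprising the microarray according to claim 86 and one or more buffers or reagents for a nucleotide hybridization reaction.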
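The probe constraints in claims 91 and 93 (an oligonucleotide that targets the 3′ untranslated region of a gene, and that is shorter than about 100 bases) can be illustrated with a minimal Python sketch. This is not the applicant's method: "hybridizes" is approximated here by perfect Watson-Crick complementarity, the 100-base limit is treated as a hard cut-off, and the transcript, the `cds_end` annotation, and all function names are hypothetical.

```python
# Toy check of two probe constraints from claims 91 and 93.
# Assumption: hybridization is approximated by perfect complementarity;
# real hybridization depends on temperature, salt and tolerated mismatches.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def targets_three_prime_utr(oligo: str, transcript: str, cds_end: int) -> bool:
    """True if the oligo is perfectly complementary to a stretch of the
    transcript that lies entirely downstream of the coding sequence.

    cds_end is the 0-based index of the first base after the stop codon
    (a hypothetical annotation supplied by the caller).
    """
    utr = transcript.upper()[cds_end:]
    return reverse_complement(oligo) in utr


def is_short_probe(oligo: str, max_len: int = 100) -> bool:
    """Claim 93 says 'fewer than about 100' bases; 100 is a hard limit here."""
    return len(oligo) < max_len


# Hypothetical transcript: a 12-base coding region followed by a 3' UTR.
transcript = "ATGGCCAAGTAA" + "GCTTAGGCATCCGTA"
cds_end = 12

# Probe designed against part of the UTR.
probe = reverse_complement("GGCATCCGT")

print(targets_three_prime_utr(probe, transcript, cds_end))
print(is_short_probe(probe))
```

A probe complementary to the coding region (for example `reverse_complement("ATGGCCAAG")`) fails the first check, which is the distinction claim 91 draws.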
CNA200480042006XA 2003-12-30 2004-12-30 Cell cycle genes and related methods of use Pending CN1954071A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US53303603P 2003-12-30 2003-12-30
US60/533,036 2003-12-30

Publications (1)

Publication Number Publication Date
CN1954071A (zh) 2007-04-25

Family

ID=34748844

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA200480042006XA Pending CN1954071A (zh) 2003-12-30 2004-12-30 Cell cycle genes and related methods of use

Country Status (10)

Country Link
US (2) US7598084B2 (zh)
EP (1) EP1711592A4 (zh)
JP (2) JP2007523636A (zh)
CN (1) CN1954071A (zh)
AR (1) AR047574A1 (zh)
AU (1) AU2004311384B2 (zh)
BR (1) BRPI0418229A (zh)
NZ (1) NZ548845A (zh)
WO (1) WO2005065339A2 (zh)
ZA (1) ZA200606198B (zh)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
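CN102888410A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef17 gene, plant expression vector and host cell thereof, and use thereof
CN102888409A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef10 gene, plant expression vector and host cell thereof, and use thereof
CN102888411A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef13 gene, plant expression vector and host cell thereof, and use thereof
CN108368518A (zh) * 2015-10-02 2018-08-03 主基因有限公司 Method for producing haploid and subsequent doubled haploid plants
CN110317817A (zh) * 2019-07-16 2019-10-11 北京林业大学 Ylb9 gene sequence, use thereof, and method for regulating plant lignin synthesis
CN110885813A (zh) * 2019-12-17 2020-03-17 中国农业大学 Use of rice histone deacetylase gene hda710 in delaying leaf senescence
CN112458189A (zh) * 2020-10-24 2021-03-09 宁波国际旅行卫生保健中心(宁波海关口岸门诊部) Primer and probe sequences for fluorescent RAA detection of Listeria monocytogenes and use thereof
CN113881699A (zh) * 2021-11-05 2022-01-04 河南大学 Use of Mac3a and mac3b in regulating plant organ size
CN116218899A (zh) * 2023-02-14 2023-06-06 中国科学院遗传与发育生物学研究所 Rice gene slg2 specifically regulating grain width and use thereof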

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110047644A1 (en) * 1999-03-11 2011-02-24 Marion Wood Compositions and methods for the modification of gene transcription
AR047574A1 (es) 2003-12-30 2006-01-25 Arborgen Llc 2 Genesis Res 1 Cell cycle genes and related methods of use
US8088975B2 (en) * 2006-10-27 2012-01-03 Ceres, Inc. Phenylpropanoid related regulatory protein-regulatory region associations
WO2008069878A2 (en) 2006-10-27 2008-06-12 Ceres, Inc. Modulating lignin in plants
US9758790B2 (en) 2004-12-08 2017-09-12 Ceres, Inc. Modulating the level of components within plants
WO2006138005A2 (en) * 2005-05-10 2006-12-28 Monsanto Technology, Llc Genes and uses for plant improvement
AU2008231785A1 (en) * 2007-03-23 2008-10-02 Basf Plant Science Gmbh Transgenic plant with increased stress tolerance and yield
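WO2008120659A1 (ja) * 2007-03-30 2008-10-09 Oji Paper Co., Ltd. Method for determining or predicting plant traits using gene expression information
JP5701610B2 (ja) 2007-12-28 2015-04-15 スヴェトリー・テクノロジーズ・アーベー Woody plants having improved growth characteristics and methods for producing them using transcription factors
CN112048010B (zh) * 2020-08-20 2022-06-17 华南农业大学 Use of rice rip2 protein in regulating plant leaf angle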

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3587718T2 (de) 1984-03-06 1994-08-04 Mgi Pharma Inc Herbicide resistance in plants.
GB8626879D0 (en) 1986-11-11 1986-12-10 Ici Plc Dna
AU700315B2 (en) 1993-10-28 1998-12-24 Houston Advanced Research Center Microfabricated, flowthrough porous apparatus for discrete detection of binding reactions
US5547861A (en) * 1994-04-18 1996-08-20 Becton, Dickinson And Company Detection of nucleic acid amplification
US6252139B1 (en) * 1996-07-18 2001-06-26 The Salk Institute For Biological Studies Method of increasing growth and yield in plants
DE69822206T2 (de) 1997-12-19 2005-02-17 Affymetrix, Inc., Santa Clara Insights from genome research for the search for novel active agents
CA2266295A1 (en) * 1999-02-26 2000-09-19 Heberle-Bors, Erwin Method of modifying plant metabolism and development
EP1163341A2 (en) * 1999-03-19 2001-12-19 CropDesign N.V. Method for enhancing and/or improving plant growth and/or yield or modifying plant architecture
WO2000065040A2 (en) * 1999-04-22 2000-11-02 Pioneer Hi-Bred International, Inc. Cell cycle genes from plants and methods of use
US7151202B1 (en) 1999-07-19 2006-12-19 Japan Science And Technology Agency Environmental stress resistance gene
KR20080023768A (ko) 2000-03-30 2008-03-14 화이트헤드 인스티튜트 포 바이오메디칼 리서치 RNA sequence-specific mediators of RNA interference
US6606568B2 (en) * 2000-06-28 2003-08-12 Midwest Research Institute Method for predicting dry mechanical properties from wet wood and standing trees
US6525319B2 (en) * 2000-12-15 2003-02-25 Midwest Research Institute Use of a region of the visible and near infrared spectrum to predict mechanical properties of wet wood and standing trees
CA2705037A1 (en) * 2000-10-10 2002-04-18 Arborgen, Llc Method for pine cell tissue culture on support membrane
AR047574A1 (es) 2003-12-30 2006-01-25 Arborgen Llc 2 Genesis Res 1 Cell cycle genes and related methods of use

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
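CN102888409A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef10 gene, plant expression vector and host cell thereof, and use thereof
CN102888411A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef13 gene, plant expression vector and host cell thereof, and use thereof
CN102888411B (zh) * 2012-07-31 2014-07-02 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef13 gene, plant expression vector and host cell thereof, and use thereof
CN102888409B (zh) * 2012-07-31 2015-04-22 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef10 gene, plant expression vector and host cell thereof, and use thereof
CN102888410B (zh) * 2012-07-31 2015-04-22 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef17 gene, plant expression vector and host cell thereof, and use thereof
CN102888410A (zh) * 2012-07-31 2013-01-23 普罗米绿色能源(深圳)有限公司 Eucalyptus pgef17 gene, plant expression vector and host cell thereof, and use thereof
CN108368518A (zh) * 2015-10-02 2018-08-03 主基因有限公司 Method for producing haploid and subsequent doubled haploid plants
CN108368518B (zh) * 2015-10-02 2023-12-05 主基因有限公司 Method for producing haploid and subsequent doubled haploid plants
CN110317817B (zh) * 2019-07-16 2021-03-19 北京林业大学 Ylb9 gene sequence, use thereof, and method for regulating plant lignin synthesis
CN110317817A (zh) * 2019-07-16 2019-10-11 北京林业大学 Ylb9 gene sequence, use thereof, and method for regulating plant lignin synthesis
CN110885813A (zh) * 2019-12-17 2020-03-17 中国农业大学 Use of rice histone deacetylase gene hda710 in delaying leaf senescence
CN112458189A (zh) * 2020-10-24 2021-03-09 宁波国际旅行卫生保健中心(宁波海关口岸门诊部) Primer and probe sequences for fluorescent RAA detection of Listeria monocytogenes and use thereof
CN113881699A (zh) * 2021-11-05 2022-01-04 河南大学 Use of Mac3a and mac3b in regulating plant organ size
CN113881699B (zh) * 2021-11-05 2024-01-09 河南大学 Use of Mac3a and mac3b in regulating plant organ size
CN116218899A (zh) * 2023-02-14 2023-06-06 中国科学院遗传与发育生物学研究所 Rice gene slg2 specifically regulating grain width and use thereof
CN116218899B (zh) * 2023-02-14 2024-05-31 中国科学院遗传与发育生物学研究所 Rice gene slg2 specifically regulating grain width and use thereof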

Also Published As

Publication number Publication date
WO2005065339A3 (en) 2006-09-21
EP1711592A4 (en) 2007-08-22
JP2011097941A (ja) 2011-05-19
AR047574A1 (es) 2006-01-25
ZA200606198B (en) 2008-07-30
AU2004311384A1 (en) 2005-07-21
AU2004311384B2 (en) 2012-01-12
WO2005065339A2 (en) 2005-07-21
NZ548845A (en) 2010-03-26
US7598084B2 (en) 2009-10-06
EP1711592A2 (en) 2006-10-18
US20100122382A1 (en) 2010-05-13
BRPI0418229A (pt) 2007-04-27
US20060010516A1 (en) 2006-01-12
JP2007523636A (ja) 2007-08-23

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication